Selective constraint, background selection, and mutation accumulation variability within and between human populations

Alan Hodgkinson, Ferran Casals, Youssef Idaghdhour, Jean Christophe Grenier, Ryan D. Hernandez, Philip Awadalla

Research output: Contribution to journal, Article

Abstract

Background: Regions of the genome that are under evolutionary constraint across multiple species have previously been used to identify functional sequences in the human genome. Furthermore, it is known that there is an inverse relationship between evolutionary constraint and the allele frequency of a mutation segregating in human populations, implying a direct relationship between interspecies divergence and fitness in humans. Here we utilise this relationship to test differences in the accumulation of putatively deleterious mutations both between populations and at the individual level.

Results: Using whole-genome and exome sequencing data from Phase 1 of the 1000 Genomes Project for 1,092 individuals from 14 worldwide populations, we show that minor allele frequency (MAF) varies as a function of constraint around both coding regions and non-coding sites genome-wide, implying that negative, rather than positive, selection primarily drives the distribution of alleles among individuals via background selection. We find a strong relationship between effective population size and the depth of depression in MAF around the most conserved genes, suggesting that populations with smaller effective size are carrying more deleterious mutations, which also translates into higher genetic load when considering the number of putatively deleterious alleles segregating within each population. Finally, given the extreme richness of the data, we are now able to classify individual genomes by the accumulation of mutations at functional sites using high-coverage 1000 Genomes data. Using this approach, we detect differences between 'healthy' individuals within populations in the distributions of putatively deleterious rare alleles they are carrying.

Conclusions: These findings demonstrate the extent of background selection in the human genome and highlight the role of population history in shaping patterns of diversity between human individuals. Furthermore, we provide a framework for the utility of personal genomic data for the study of genetic fitness and diseases.
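As a rough illustration of the quantity the abstract refers to, the following minimal sketch computes minor allele frequency at biallelic sites and averages it within bins of a per-site constraint score. It is not the authors' pipeline: the function names, the bin width, the form of the constraint score, and the toy allele counts are all assumptions made for illustration.

```python
from collections import defaultdict


def minor_allele_frequency(alt_count, total_alleles):
    """Return the frequency of the rarer allele at a biallelic site."""
    if total_alleles <= 0:
        raise ValueError("total_alleles must be positive")
    freq = alt_count / total_alleles
    return min(freq, 1.0 - freq)


def mean_maf_by_constraint(sites, bin_width=1.0):
    """Average MAF within bins of a (hypothetical) per-site constraint score.

    `sites` is an iterable of (constraint_score, alt_count, total_alleles).
    Returns a dict mapping each bin's lower edge to the mean MAF in that bin.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for score, alt_count, total_alleles in sites:
        bin_start = (score // bin_width) * bin_width
        sums[bin_start] += minor_allele_frequency(alt_count, total_alleles)
        counts[bin_start] += 1
    return {b: sums[b] / counts[b] for b in sorted(sums)}


# Made-up toy data: 2184 alleles corresponds to 1,092 diploid individuals.
toy_sites = [
    (0.2, 500, 2184),   # weakly constrained site
    (0.7, 1800, 2184),  # alternate allele is the major allele here
    (2.5, 20, 2184),    # more constrained site, rarer variant
    (2.9, 4, 2184),
]

for bin_start, mean_maf in mean_maf_by_constraint(toy_sites).items():
    print(f"constraint bin starting at {bin_start}: mean MAF = {mean_maf:.4f}")
```

With real data, a lower mean MAF in the more constrained bins would be the pattern consistent with the inverse relationship described in the Background.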

Original language: English (US)
Article number: 495
Journal: BMC Genomics
Volume: 14
Issue number: 1
DOIs: 10.1186/1471-2164-14-495
State: Published - Jul 23 2013

Fingerprint

Genome
Gene Frequency
Population
Alleles
Human Genome
Mutation
Genetic Load
Genetic Fitness
Exome
Inborn Genetic Diseases
Population Density
Mutation Accumulation
Demography
Genes

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

Selective constraint, background selection, and mutation accumulation variability within and between human populations. / Hodgkinson, Alan; Casals, Ferran; Idaghdhour, Youssef; Grenier, Jean Christophe; Hernandez, Ryan D.; Awadalla, Philip.

In: BMC Genomics, Vol. 14, No. 1, 495, 23.07.2013.

@article{4e2c6f768cb448789a7b10a12b061944,
title = "Selective constraint, background selection, and mutation accumulation variability within and between human populations",
author = "Alan Hodgkinson and Ferran Casals and Youssef Idaghdhour and Grenier, {Jean Christophe} and Hernandez, {Ryan D.} and Philip Awadalla",
year = "2013",
month = "7",
day = "23",
doi = "10.1186/1471-2164-14-495",
language = "English (US)",
volume = "14",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - Selective constraint, background selection, and mutation accumulation variability within and between human populations

AU - Hodgkinson, Alan

AU - Casals, Ferran

AU - Idaghdhour, Youssef

AU - Grenier, Jean Christophe

AU - Hernandez, Ryan D.

AU - Awadalla, Philip

PY - 2013/7/23

Y1 - 2013/7/23

UR - http://www.scopus.com/inward/record.url?scp=84880334976&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84880334976&partnerID=8YFLogxK

U2 - 10.1186/1471-2164-14-495

DO - 10.1186/1471-2164-14-495

M3 - Article

C2 - 23875710

AN - SCOPUS:84880334976

VL - 14

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

IS - 1

M1 - 495

ER -