251
|
Druet T, Ahariz N, Cambisano N, Tamma N, Michaux C, Coppieters W, Charlier C, Georges M. Selection in action: dissecting the molecular underpinnings of the increasing muscle mass of Belgian Blue Cattle. BMC Genomics 2014; 15:796. [PMID: 25228463 PMCID: PMC4190573 DOI: 10.1186/1471-2164-15-796] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 09/08/2014] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Belgian Blue cattle are famous for their exceptional muscular development or "double-muscling". This defining feature emerged following the fixation of a loss-of-function variant in the myostatin gene in the eighties. Since then, sustained selection has further increased muscle mass of Belgian Blue animals to a comparable extent. In the present paper, we study the genetic determinants of this second wave of muscle growth. RESULTS A scan for selective sweeps did not reveal the recent fixation of another allele with major effect on muscularity. However, a genome-wide association study identified two genome-wide significant and three suggestive quantitative trait loci (QTL) affecting specific muscle groups and jointly explaining 8-21% of the heritability. The top two QTL are caused by presumably recent mutations on unique haplotypes that have rapidly risen in frequency in the population. While one appears on its way to fixation, the ascent of the other is compromised as the likely underlying MRC2 mutation causes crooked tail syndrome in homozygotes. Genomic prediction models indicate that the residual additive variance is largely polygenic. CONCLUSIONS Contrary to complex traits in humans which have a near-exclusive polygenic architecture, muscle mass in beef cattle (as other production traits under directional selection), appears to be controlled by (i) a handful of recent mutations with large effect that rapidly sweep through the population, and (ii) a large number of presumably older variants with very small effects that rise slowly in the population (polygenic adaptation).
Collapse
|
252
|
Long AD, Macdonald SJ, King EG. Dissecting complex traits using the Drosophila Synthetic Population Resource. Trends Genet 2014; 30:488-95. [PMID: 25175100 DOI: 10.1016/j.tig.2014.07.009] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Revised: 07/28/2014] [Accepted: 07/28/2014] [Indexed: 11/25/2022]
Abstract
For most complex traits we have a poor understanding of the positions, phenotypic effects, and population frequencies of the underlying genetic variants contributing to their variation. Recently, several groups have developed multi-parent advanced intercross mapping panels in different model organisms in an attempt to improve our ability to characterize causative genetic variants. These panels are powerful and are particularly well suited to the dissection of phenotypic variation generated by rare alleles and loci segregating multiple functional alleles. We describe studies using one such panel, the Drosophila Synthetic Population Resource (DSPR), and the implications for our understanding of the genetic basis of complex traits. In particular, we note that many loci of large effect appear to be multiallelic. If multiallelism is a general rule, analytical approaches designed to identify multiallelic variants should be a priority for both genome-wide association studies (GWASs) and multi-parental panels.
Collapse
|
253
|
The genetic architecture of coordinately evolving male wing pigmentation and courtship behavior in Drosophila elegans and Drosophila gunungcola. G3-GENES GENOMES GENETICS 2014; 4:2079-93. [PMID: 25168010 PMCID: PMC4232533 DOI: 10.1534/g3.114.013037] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Many adaptive phenotypes consist of combinations of simpler traits that act synergistically, such as morphological traits and the behaviors that use those traits. Genetic correlations between components of such combinatorial traits, in the form of pleiotropic or tightly linked genes, can in principle promote the evolution and maintenance of these traits. In the Oriental Drosophila melanogaster species group, male wing pigmentation shows phylogenetic correlations with male courtship behavior; species with male-specific apical wing melanin spots also exhibit male visual wing displays, whereas species lacking these spots generally lack the displays. In this study, we investigated the quantitative genetic basis of divergence in male wing spots and displays between D. elegans, which possesses both traits, and its sibling species D. gunungcola, which lacks them. We found that divergence in wing spot size is determined by at least three quantitative trait loci (QTL) and divergence in courtship score is determined by at least four QTL. On the autosomes, QTL locations for pigmentation and behavior were generally separate, but on the X chromosome two clusters of QTL were found affecting both wing pigmentation and courtship behavior. We also examined the genetic basis of divergence in three components of male courtship, wing display, circling, and body shaking. Each of these showed a distinct genetic architecture, with some QTL mapping to similar positions as QTL for overall courtship score. Pairwise tests for interactions between marker loci revealed evidence of epistasis between putative QTL for wing pigmentation but not those for courtship behavior. The clustering of X-linked QTL for male pigmentation and behavior is consistent with the concerted evolution of these traits and motivates fine-scale mapping studies to elucidate the nature of the contributing genetic factors in these intervals.
Collapse
|
254
|
Mooney MA, Nigg JT, McWeeney SK, Wilmot B. Functional and genomic context in pathway analysis of GWAS data. Trends Genet 2014; 30:390-400. [PMID: 25154796 DOI: 10.1016/j.tig.2014.07.004] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 07/18/2014] [Accepted: 07/18/2014] [Indexed: 02/07/2023]
Abstract
Gene set analysis (GSA) is a promising tool for uncovering the polygenic effects associated with complex diseases. However, the available techniques reflect a wide variety of hypotheses about how genetic effects interact to contribute to disease susceptibility. The lack of consensus about the best way to perform GSA has led to confusion in the field and has made it difficult to compare results across methods. A clear understanding of the various choices made during GSA - such as how gene sets are defined, how single-nucleotide polymorphisms (SNPs) are assigned to genes, and how individual SNP-level effects are aggregated to produce gene- or pathway-level effects - will improve the interpretability and comparability of results across methods and studies. In this review we provide an overview of the various data sources used to construct gene sets and the statistical methods used to test for gene set association, as well as provide guidelines for ensuring the comparability of results.
Collapse
|
255
|
Abstract
Allergic asthma is a complex disease characterized in part by granulocytic inflammation of the airways. In addition to eosinophils, neutrophils (PMN) are also present, particularly in cases of severe asthma. We sought to identify the genetic determinants of neutrophilic inflammation in a mouse model of house dust mite (HDM)-induced asthma. We applied an HDM model of allergic asthma to the eight founder strains of the Collaborative Cross (CC) and 151 incipient lines of the CC (preCC). Lung lavage fluid was analyzed for PMN count and the concentration of CXCL1, a hallmark PMN chemokine. PMN and CXCL1 were strongly correlated in preCC mice. We used quantitative trait locus (QTL) mapping to identify three variants affecting PMN, one of which colocalized with a QTL for CXCL1 on chromosome (Chr) 7. We used lung eQTL data to implicate a variant in the gene Zfp30 in the CXCL1/PMN response. This genetic variant regulates both CXCL1 and PMN by altering Zfp30 expression, and we model the relationships between the QTL and these three endophenotypes. We show that Zfp30 is expressed in airway epithelia in the normal mouse lung and that altering Zfp30 expression in vitro affects CXCL1 responses to an immune stimulus. Our results provide strong evidence that Zfp30 is a novel regulator of neutrophilic airway inflammation.
Collapse
|
256
|
Genome-wide association study of opioid dependence: multiple associations mapped to calcium and potassium pathways. Biol Psychiatry 2014; 76:66-74. [PMID: 24143882 PMCID: PMC3992201 DOI: 10.1016/j.biopsych.2013.08.034] [Citation(s) in RCA: 152] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 07/29/2013] [Accepted: 08/27/2013] [Indexed: 01/15/2023]
Abstract
BACKGROUND We report a genome-wide association study (GWAS) of two populations, African-American and European-American (AA, EA) for opioid dependence (OD) in three sets of subjects, to identify pathways, genes, and alleles important in OD risk. METHODS The design employed three phases (on the basis of separate sample collections). Phase 1 included our discovery GWAS dataset consisting of 5697 subjects (58% AA) diagnosed with opioid and/or other substance dependence and control subjects. Subjects were genotyped with the Illumina OmniQuad microarray, yielding 890,000 single nucleotide polymorphisms (SNPs) suitable for analysis. Additional genotypes were imputed with the 1000 Genomes reference panel. Top-ranked findings were further evaluated in Phase 2 by incorporating information from the publicly available Study of Addiction: Genetics and Environment dataset, with GWAS data from 4063 subjects (32% AA). In Phase 3, the most significant SNPs from Phase 2 were genotyped in 2549 independent subjects (32% AA). Analyses were performed with case-control and ordinal trait designs. RESULTS Most significant results emerged from the AA subgroup. Genome-wide-significant associations (p < 5.0 × 10(-8)) were observed with SNPs from multiple loci-KCNG2*rs62103177 was most significant after combining results from datasets in every phase of the study. The most compelling results were obtained with genes involved in potassium signaling pathways (e.g., KCNC1 and KCNG2). Pathway analysis also implicated genes involved in calcium signaling and long-term potentiation. CONCLUSIONS This is the first study to identify risk variants for OD with GWAS. Our results strongly implicate risk pathways and provide insights into novel therapeutic and prevention strategies and might biologically bridge OD and other non-substance dependence psychiatric traits where similar pathways have been implicated.
Collapse
|
257
|
Identification of a QTL in Mus musculus for alcohol preference, withdrawal, and Ap3m2 expression using integrative functional genomics and precision genetics. Genetics 2014; 197:1377-93. [PMID: 24923803 DOI: 10.1534/genetics.114.166165] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Extensive genetic and genomic studies of the relationship between alcohol drinking preference and withdrawal severity have been performed using animal models. Data from multiple such publications and public data resources have been incorporated in the GeneWeaver database with >60,000 gene sets including 285 alcohol withdrawal and preference-related gene sets. Among these are evidence for positional candidates regulating these behaviors in overlapping quantitative trait loci (QTL) mapped in distinct mouse populations. Combinatorial integration of functional genomics experimental results revealed a single QTL positional candidate gene in one of the loci common to both preference and withdrawal. Functional validation studies in Ap3m2 knockout mice confirmed these relationships. Genetic validation involves confirming the existence of segregating polymorphisms that could account for the phenotypic effect. By exploiting recent advances in mouse genotyping, sequence, epigenetics, and phylogeny resources, we confirmed that Ap3m2 resides in an appropriately segregating genomic region. We have demonstrated genetic and alcohol-induced regulation of Ap3m2 expression. Although sequence analysis revealed no polymorphisms in the Ap3m2-coding region that could account for all phenotypic differences, there are several upstream SNPs that could. We have identified one of these to be an H3K4me3 site that exhibits strain differences in methylation. Thus, by making cross-species functional genomics readily computable we identified a common QTL candidate for two related bio-behavioral processes via functional evidence and demonstrate sufficiency of the genetic locus as a source of variation underlying two traits.
Collapse
|
258
|
Fan R, Wang Y, Mills JL, Wilson AF, Bailey-Wilson JE, Xiong M. Functional linear models for association analysis of quantitative traits. Genet Epidemiol 2014; 37:726-42. [PMID: 24130119 DOI: 10.1002/gepi.21757] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2013] [Revised: 07/15/2013] [Accepted: 08/14/2013] [Indexed: 12/19/2022]
Abstract
Functional linear models are developed in this paper for testing associations between quantitative traits and genetic variants, which can be rare variants or common variants or the combination of the two. By treating multiple genetic variants of an individual in a human population as a realization of a stochastic process, the genome of an individual in a chromosome region is a continuum of sequence data rather than discrete observations. The genome of an individual is viewed as a stochastic function that contains both linkage and linkage disequilibrium (LD) information of the genetic markers. By using techniques of functional data analysis, both fixed and mixed effect functional linear models are built to test the association between quantitative traits and genetic variants adjusting for covariates. After extensive simulation analysis, it is shown that the F-distributed tests of the proposed fixed effect functional linear models have higher power than that of sequence kernel association test (SKAT) and its optimal unified test (SKAT-O) for three scenarios in most cases: (1) the causal variants are all rare, (2) the causal variants are both rare and common, and (3) the causal variants are common. The superior performance of the fixed effect functional linear models is most likely due to its optimal utilization of both genetic linkage and LD information of multiple genetic variants in a genome and similarity among different individuals, while SKAT and SKAT-O only model the similarities and pairwise LD but do not model linkage and higher order LD information sufficiently. In addition, the proposed fixed effect models generate accurate type I error rates in simulation studies. We also show that the functional kernel score tests of the proposed mixed effect functional linear models are preferable in candidate gene analysis and small sample problems. The methods are applied to analyze three biochemical traits in data from the Trinity Students Study.
Collapse
|
259
|
Abstract
Understanding the functional mechanisms underlying genetic signals associated with complex traits and common diseases, such as cancer, diabetes and Alzheimer's disease, is a formidable challenge. Many genetic signals discovered through genome-wide association studies map to non-protein coding sequences, where their molecular consequences are difficult to evaluate. This article summarizes concepts for the systematic interpretation of non-coding genetic signals using genome annotation data sets in different cellular systems. We outline strategies for the global analysis of multiple association intervals and the in-depth molecular investigation of individual intervals. We highlight experimental techniques to validate candidate (potential causal) regulatory variants, with a focus on novel genome-editing techniques including CRISPR/Cas9. These approaches are also applicable to low-frequency and rare variants, which have become increasingly important in genomic studies of complex traits and diseases. There is a pressing need to translate genetic signals into biological mechanisms, leading to prognostic, diagnostic and therapeutic advances.
Collapse
|
260
|
Gelernter J, Kranzler HR, Sherva R, Almasy L, Koesterer R, Smith AH, Anton R, Preuss UW, Ridinger M, Rujescu D, Wodarz N, Zill P, Zhao H, Farrer LA. Genome-wide association study of alcohol dependence:significant findings in African- and European-Americans including novel risk loci. Mol Psychiatry 2014; 19:41-9. [PMID: 24166409 PMCID: PMC4165335 DOI: 10.1038/mp.2013.145] [Citation(s) in RCA: 290] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Revised: 09/13/2013] [Accepted: 09/24/2013] [Indexed: 01/26/2023]
Abstract
We report a GWAS of alcohol dependence (AD) in European-American (EA) and African-American (AA) populations, with replication in independent samples of EAs, AAs and Germans. Our sample for discovery and replication was 16 087 subjects, the largest sample for AD GWAS to date. Numerous genome-wide significant (GWS) associations were identified, many novel. Most associations were population specific, but in several cases were GWS in EAs and AAs for different SNPs at the same locus,showing biological convergence across populations. We confirmed well-known risk loci mapped to alcohol-metabolizing enzyme genes, notably ADH1B (EAs: Arg48His, P=1.17 × 10(-31); AAs: Arg369Cys, P=6.33 × 10(-17)) and ADH1C in AAs (Thr151Thr, P=4.94 × 10(-10)), and identified novel risk loci mapping to the ADH gene cluster on chromosome 4 and extending centromerically beyond it to include GWS associations at LOC100507053 in AAs (P=2.63 × 10(-11)), PDLIM5 in EAs (P=2.01 × 10(-8)), and METAP in AAs (P=3.35 × 10(-8)). We also identified a novel GWS association (1.17 × 10(-10)) mapped to chromosome 2 at rs1437396, between MTIF2 and CCDC88A, across all of the EA and AA cohorts, with supportive gene expression evidence, and population-specific GWS for markers on chromosomes 5, 9 and 19. Several of the novel associations implicate direct involvement of, or interaction with, genes previously identified as schizophrenia risk loci. Confirmation of known AD risk loci supports the overall validity of the study; the novel loci are worthy of genetic and biological follow-up. The findings support a convergence of risk genes (but not necessarily risk alleles) between populations, and, to a lesser extent, between psychiatric traits.
Collapse
|
261
|
Bartlett CW, Hou L, Flax JF, Hare A, Cheong SY, Fermano Z, Zimmerman-Bier B, Cartwright C, Azaro MA, Buyske S, Brzustowicz LM. A genome scan for loci shared by autism spectrum disorder and language impairment. Am J Psychiatry 2014; 171:72-81. [PMID: 24170272 PMCID: PMC4431698 DOI: 10.1176/appi.ajp.2013.12081103] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
OBJECTIVE The authors conducted a genetic linkage study of families that have both autism spectrum disorder (ASD) and language-impaired probands to find common communication impairment loci. The hypothesis was that these families have a high genetic loading for impairments in language ability, thus influencing the language and communication deficits of the family members with ASD. Comprehensive behavioral phenotyping of the families also enabled linkage analysis of quantitative measures, including normal, subclinical, and disordered variation in all family members for the three general autism symptom domains: social, communication, and compulsive behaviors. METHOD The primary linkage analysis coded persons with either ASD or specific language impairment as "affected." The secondary linkage analysis consisted of quantitative metrics of autism-associated behaviors capturing normal to clinically severe variation, measured in all family members. RESULTS Linkage to language phenotypes was established at two novel chromosomal loci, 15q23-26 and 16p12. The secondary analysis of normal and disordered quantitative variation in social and compulsive behaviors established linkage to two loci for social behaviors (at 14q and 15q) and one locus for repetitive behaviors (at 13q). CONCLUSION These data indicate shared etiology of ASD and specific language impairment at two novel loci. Additionally, nonlanguage phenotypes based on social aloofness and rigid personality traits showed compelling evidence for linkage in this study group. Further genetic mapping is warranted at these loci.
Collapse
|
262
|
Wang Z, Wang Y, Wang N, Wang J, Wang Z, Vallejos CE, Wu R. Towards a comprehensive picture of the genetic landscape of complex traits. Brief Bioinform 2014; 15:30-42. [PMID: 22930650 PMCID: PMC3896925 DOI: 10.1093/bib/bbs049] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2012] [Accepted: 07/09/2012] [Indexed: 12/11/2022] Open
Abstract
The formation of phenotypic traits, such as biomass production, tumor volume and viral abundance, undergoes a complex process in which interactions between genes and developmental stimuli take place at each level of biological organization from cells to organisms. Traditional studies emphasize the impact of genes by directly linking DNA-based markers with static phenotypic values. Functional mapping, derived to detect genes that control developmental processes using growth equations, has proven powerful for addressing questions about the roles of genes in development. By treating phenotypic formation as a cohesive system using differential equations, a different approach-systems mapping-dissects the system into interconnected elements and then map genes that determine a web of interactions among these elements, facilitating our understanding of the genetic machineries for phenotypic development. Here, we argue that genetic mapping can play a more important role in studying the genotype-phenotype relationship by filling the gaps in the biochemical and regulatory process from DNA to end-point phenotype. We describe a new framework, named network mapping, to study the genetic architecture of complex traits by integrating the regulatory networks that cause a high-order phenotype. Network mapping makes use of a system of differential equations to quantify the rule by which transcriptional, proteomic and metabolomic components interact with each other to organize into a functional whole. The synthesis of functional mapping, systems mapping and network mapping provides a novel avenue to decipher a comprehensive picture of the genetic landscape of complex phenotypes that underlie economically and biomedically important traits.
Collapse
|
263
|
Yvert G. 'Particle genetics': treating every cell as unique. Trends Genet 2013; 30:49-56. [PMID: 24315431 DOI: 10.1016/j.tig.2013.11.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Revised: 11/06/2013] [Accepted: 11/13/2013] [Indexed: 12/18/2022]
Abstract
Genotype-phenotype relations are usually inferred from a deterministic point of view. For example, quantitative trait loci (QTL), which describe regions of the genome associated with a particular phenotype, are based on a mean trait difference between genotype categories. However, living systems comprise huge numbers of cells (the 'particles' of biology). Each cell can exhibit substantial phenotypic individuality, which can have dramatic consequences at the organismal level. Now, with technology capable of interrogating individual cells, it is time to consider how genotypes shape the probability laws of single cell traits. The possibility of mapping single cell probabilistic trait loci (PTL), which link genomic regions to probabilities of cellular traits, is a promising step in this direction. This approach requires thinking about phenotypes in probabilistic terms, a concept that statistical physicists have been applying to particles for a century. Here, I describe PTL and discuss their potential to enlarge our understanding of genotype-phenotype relations.
Collapse
|
264
|
The evolutionary genetics of the genes underlying phenotypic associations for loblolly pine (Pinus taeda, Pinaceae). Genetics 2013; 195:1353-72. [PMID: 24121773 DOI: 10.1534/genetics.113.157198] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
A primary goal of evolutionary genetics is to discover and explain the genetic basis of fitness-related traits and how this genetic basis evolves within natural populations. Unprecedented technological advances have fueled the discovery of genetic variants associated with ecologically relevant phenotypes in many different life forms, as well as the ability to scan genomes for deviations from selectively neutral models of evolution. Theoretically, the degree of overlap between lists of genomic regions identified using each approach is related to the genetic architecture of fitness-related traits and the strength and type of natural selection molding variation at these traits within natural populations. Here we address for the first time in a plant the degree of overlap between these lists, using patterns of nucleotide diversity and divergence for >7000 unique amplicons described from the extensive expressed sequence tag libraries generated for loblolly pine (Pinus taeda L.) in combination with the >1000 published genetic associations. We show that loci associated with phenotypic traits are distinct with regard to neutral expectations. Phenotypes measured at the whole plant level (e.g., disease resistance) exhibit an approximately twofold increase in the proportion of adaptive nonsynonymous substitutions over the genome-wide average. As expected for polygenic traits, these signals were apparent only when loci were considered at the level of functional sets. The ramifications of this result are discussed in light of the continued efforts to dissect the genetic basis of quantitative traits.
Collapse
|
265
|
Green JWM, Snoek LB, Kammenga JE, Harvey SC. Genetic mapping of variation in dauer larvae development in growing populations of Caenorhabditis elegans. Heredity (Edinb) 2013; 111:306-13. [PMID: 23715016 PMCID: PMC3807260 DOI: 10.1038/hdy.2013.50] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Revised: 04/15/2013] [Accepted: 04/22/2013] [Indexed: 11/09/2022] Open
Abstract
In the nematode Caenorhabditis elegans, the appropriate induction of dauer larvae development within growing populations is likely to be a primary determinant of genotypic fitness. The underlying genetic architecture of natural genetic variation in dauer formation has, however, not been thoroughly investigated. Here, we report extensive natural genetic variation in dauer larvae development within growing populations across multiple wild isolates. Moreover, bin mapping of introgression lines (ILs) derived from the genetically divergent isolates N2 and CB4856 reveals 10 quantitative trait loci (QTLs) affecting dauer formation. Comparison of individual ILs to N2 identifies an additional eight QTLs, and sequential IL analysis reveals six more QTLs. Our results also show that a behavioural, laboratory-derived, mutation controlled by the neuropeptide Y receptor homolog npr-1 can affect dauer larvae development in growing populations. These findings illustrate the complex genetic architecture of variation in dauer larvae formation in C. elegans and may help to understand how the control of variation in dauer larvae development has evolved.
Collapse
|
266
|
Wang Z, Liu X, Yang BZ, Gelernter J. The role and challenges of exome sequencing in studies of human diseases. Front Genet 2013; 4:160. [PMID: 24032039 PMCID: PMC3752524 DOI: 10.3389/fgene.2013.00160] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2013] [Accepted: 08/04/2013] [Indexed: 01/19/2023] Open
Abstract
Recent advances in next-generation sequencing technologies have transformed the genetics study of human diseases; this is an era of unprecedented productivity. Exome sequencing, the targeted sequencing of the protein-coding portion of the human genome, has been shown to be a powerful and cost-effective method for detection of disease variants underlying Mendelian disorders. Increasing effort has been made in the interest of the identification of rare variants associated with complex traits in sequencing studies. Here we provided an overview of the application fields for exome sequencing in human diseases. We describe a general framework of computation and bioinformatics for handling sequencing data. We then demonstrate data quality and agreement between exome sequencing and exome microarray (chip) genotypes using data collected on the same set of subjects in a genetic study of panic disorder. Our results show that, in sequencing data, the data quality was generally higher for variants within the exonic target regions, compared to that outside the target regions, due to the target enrichment. We also compared genotype concordance for variant calls obtained by exome sequencing vs. exome genotyping microarrays. The overall consistency rate was >99.83% and the heterozygous consistency rate was >97.55%. The two platforms share a large amount of agreement over low frequency variants in the exonic regions, while exome sequencing provides much more information on variants not included on exome genotyping microarrays. The results demonstrate that exome sequencing data are of high quality and can be used to investigate the role of rare coding variants in human diseases.
Collapse
|
267
|
Genome-wide prediction of traits with different genetic architecture through efficient variable selection. Genetics 2013; 195:573-87. [PMID: 23934883 DOI: 10.1534/genetics.113.150078] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In genome-based prediction there is considerable uncertainty about the statistical model and method required to maximize prediction accuracy. For traits influenced by a small number of quantitative trait loci (QTL), predictions are expected to benefit from methods performing variable selection [e.g., BayesB or the least absolute shrinkage and selection operator (LASSO)] compared to methods distributing effects across the genome [ridge regression best linear unbiased prediction (RR-BLUP)]. We investigate the assumptions underlying successful variable selection by combining computer simulations with large-scale experimental data sets from rice (Oryza sativa L.), wheat (Triticum aestivum L.), and Arabidopsis thaliana (L.). We demonstrate that variable selection can be successful when the number of phenotyped individuals is much larger than the number of causal mutations contributing to the trait. We show that the sample size required for efficient variable selection increases dramatically with decreasing trait heritabilities and increasing extent of linkage disequilibrium (LD). We contrast and discuss contradictory results from simulation and experimental studies with respect to superiority of variable selection methods over RR-BLUP. Our results demonstrate that due to long-range LD, medium heritabilities, and small sample sizes, superiority of variable selection methods cannot be expected in plant breeding populations even for traits like FRIGIDA gene expression in Arabidopsis and flowering time in rice, assumed to be influenced by a few major QTL. We extend our conclusions to the analysis of whole-genome sequence data and infer upper bounds for the number of causal mutations which can be identified by LASSO. Our results have major impact on the choice of statistical method needed to make credible inferences about genetic architecture and prediction accuracy of complex traits.
Collapse
|
268
|
Whitcomb DC, Lowry LW. Genetic risk factors for pancreatic disorders. Gastroenterology 2013; 144:1292-302. [PMID: 23622139 PMCID: PMC3684061 DOI: 10.1053/j.gastro.2013.01.069] [Citation(s) in RCA: 182] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Revised: 01/15/2013] [Accepted: 01/17/2013] [Indexed: 02/06/2023]
Abstract
A combination of genetic, environmental, and metabolic factors contribute to the development and recurrence of acute and chronic pancreatitis; information on all of these is required to manage patients effectively. For example, variants that affect regulation of the protease, serine (PRSS)1-PRSS2, and claudin (CLDN)2 loci, rather than their coding sequences, interact with other genetic and environmental factors to affect disease development. New strategies are needed to use these data and determine their contribution to pathogenesis, because these variants differ from previously studied, rare variants in exons (coding regions) of genes such as PRSS1, SPINK1, cystic fibrosis transmembrane conductance regulator (CFTR), chymotrypsin (CTR)C, and calcium-sensing receptor (CASR). Learning how various genetic factors affect pancreatic cells and systems could lead to etiology-based therapies rather than treatment of symptoms.
Collapse
|
269
|
Abstract
Determining the genetic architecture of liability for complex neuropsychiatric disorders like autism spectrum disorders and schizophrenia poses a tremendous challenge for contemporary biomedical research. Here we discuss how genetic studies first tested, and rejected, the hypothesis that common variants with large effects account for the prevalence of these disorders. We then explore how the discovery of structural variation has contributed to our understanding of the etiology of these disorders. The rise of fast and inexpensive oligonucleotide sequencing and methods of targeted enrichment and their influence on the search for rare genetic variation contributing to complex neuropsychiatric disorders is the next focus of our article. Finally, we consider the technical challenges and future prospects for the use of next-generation sequencing to reveal the complex genetic architecture of complex neuropsychiatric disorders in both research and the clinical settings.
Collapse
|
270
|
Dudycha JL, Snoke-Smith M, Alía R. Correlated responses to clonal selection in populations of Daphnia pulicaria: mechanisms of genetic correlation and the creative power of sex. Ecol Evol 2013; 3:204-16. [PMID: 23467851 PMCID: PMC3586631 DOI: 10.1002/ece3.444] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2012] [Revised: 10/26/2012] [Accepted: 11/01/2012] [Indexed: 11/10/2022] Open
Abstract
Genetic correlations among traits alter evolutionary trajectories due to indirect selection. Pleiotropy, chance linkage, and selection can all lead to genetic correlations, but have different consequences for phenotypic evolution. We sought to assess the mechanisms contributing to correlations with size at maturity in the cyclic parthenogen Daphnia pulicaria. We selected on size in each of four populations that differ in the frequency of sex, and evaluated correlated responses in a life table. Size at advanced adulthood, reproductive output, and adult growth rate clearly showed greater responses in high-sex populations, with a similar pattern in neonate size and r. This pattern is expected only when trait correlations are favored by selection and the frequency of sex favors the creation and demographic expansion of highly fit clones. Juvenile growth and age at maturity did not diverge consistently. The inter-clutch interval appeared to respond more strongly in low-sex populations, but this was not statistically significant. Our data support the hypothesis that correlated selection is the strongest driver of genetic correlations, and suggest that in organisms with both sexual and asexual reproduction, adaptation can be enhanced by recombination.
Collapse
|
271
|
Bo W, Fu G, Wang Z, Xu F, Shen Y, Xu J, Huang Z, Gai J, Vallejos CE, Wu R. Systems mapping: how to map genes for biomass allocation toward an ideotype. Brief Bioinform 2013; 15:660-9. [PMID: 23428353 DOI: 10.1093/bib/bbs089] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The recent availability of high-throughput genetic and genomic data allows the genetic architecture of complex traits to be systematically mapped. The application of these genetic results to design and breed new crop types can be made possible through systems mapping. Systems mapping is a computational model that dissects a complex phenotype into its underlying components, coordinates different components in terms of biological laws through mathematical equations and maps specific genes that mediate each component and its connection with other components. Here, we present a new direction of systems mapping by integrating this tool with carbon economy. With an optimal spatial distribution of carbon fluxes between sources and sinks, plants tend to maximize whole-plant growth and competitive ability under limited availability of resources. We argue that such an economical strategy for plant growth and development, once integrated with systems mapping, will not only provide mechanistic insights into plant biology, but also help to spark a renaissance of interest in ideotype breeding in crops and trees.
Collapse
|
272
|
Christin PA, Boxall SF, Gregory R, Edwards EJ, Hartwell J, Osborne CP. Parallel recruitment of multiple genes into c4 photosynthesis. Genome Biol Evol 2013; 5:2174-87. [PMID: 24179135 PMCID: PMC3845648 DOI: 10.1093/gbe/evt168] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/24/2013] [Indexed: 11/12/2022] Open
Abstract
During the diversification of living organisms, novel adaptive traits usually evolve through the co-option of preexisting genes. However, most enzymes are encoded by gene families, whose members vary in their expression and catalytic properties. Each may therefore differ in its suitability for recruitment into a novel function. In this work, we test for the presence of such a gene recruitment bias using the example of C4 photosynthesis, a complex trait that evolved recurrently in flowering plants as a response to atmospheric CO2 depletion. We combined the analysis of complete nuclear genomes and high-throughput transcriptome data for three grass species that evolved the C4 trait independently. For five of the seven enzymes analyzed, the same gene lineage was recruited across the independent C4 origins, despite the existence of multiple copies. The analysis of a closely related C3 grass confirmed that C4 expression patterns were not present in the C3 ancestors but were acquired during the evolutionary transition to C4 photosynthesis. The significant bias in gene recruitment indicates that some genes are more suitable for a novel function, probably because the mutations they accumulated brought them closer to the characteristics required for the new function.
Collapse
|
273
|
Abstract
Three recent breakthroughs have resulted in the current widespread use of DNA information: the genomic selection (GS) methodology, which is a form of marker-assisted selection on a genome-wide scale, and the discovery of large numbers of single-nucleotide markers and cost effective methods to genotype them. GS estimates the effect of thousands of DNA markers simultaneously. Nonlinear estimation methods yield higher accuracy, especially for traits with major genes. The marker effects are estimated in a genotyped and phenotyped training population and are used for the estimation of breeding values of selection candidates by combining their genotypes with the estimated marker effects. The benefits of GS are greatest when selection is for traits that are not themselves recorded on the selection candidates before they can be selected. In the future, genome sequence data may replace SNP genotypes as markers. This could increase GS accuracy because the causative mutations should be included in the data.
Collapse
|
274
|
Forsberg LA, Absher D, Dumanski JP. Non-heritable genetics of human disease: spotlight on post-zygotic genetic variation acquired during lifetime. J Med Genet 2013; 50:1-10. [PMID: 23172682 PMCID: PMC3534255 DOI: 10.1136/jmedgenet-2012-101322] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Revised: 10/18/2012] [Accepted: 10/19/2012] [Indexed: 01/06/2023]
Abstract
The heritability of most common, multifactorial diseases is rather modest and known genetic effects account for a small part of it. The remaining portion of disease aetiology has been conventionally ascribed to environmental effects, with an unknown part being stochastic. This review focuses on recent studies highlighting stochastic events of potentially great importance in human disease-the accumulation of post-zygotic structural aberrations with age in phenotypically normal humans. These findings are in agreement with a substantial mutational load predicted to occur during lifetime within the human soma. A major consequence of these results is that the genetic profile of a single tissue collected at one time point should be used with caution as a faithful portrait of other tissues from the same subject or the same tissue throughout life. Thus, the design of studies in human genetics interrogating a single sample per subject or applying lymphoblastoid cell lines may come into question. Sporadic disorders are common in medicine. We wish to stress the non-heritable genetic variation as a potentially important factor behind the development of sporadic diseases. Moreover, associations between post-zygotic mutations, clonal cell expansions and their relation to cancer predisposition are central in this context. Post-zygotic mutations are amenable to robust examination and are likely to explain a sizable part of non-heritable disease causality, which has routinely been thought of as synonymous with environmental factors. In view of the widespread accumulation of genetic aberrations with age and strong predictions of disease risk from such analyses, studies of post-zygotic mutations may be a fruitful approach for delineation of variants that are causative for common human disorders.
Collapse
|
275
|
Pérez-de-Castro AM, Vilanova S, Cañizares J, Pascual L, Blanca JM, Díez MJ, Prohens J, Picó B. Application of genomic tools in plant breeding. Curr Genomics 2012; 13:179-95. [PMID: 23115520 PMCID: PMC3382273 DOI: 10.2174/138920212800543084] [Citation(s) in RCA: 90] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2011] [Revised: 09/16/2011] [Accepted: 10/11/2011] [Indexed: 02/08/2023] Open
Abstract
Plant breeding has been very successful in developing improved varieties using conventional tools and methodologies. Nowadays, the availability of genomic tools and resources is leading to a new revolution of plant breeding, as they facilitate the study of the genotype and its relationship with the phenotype, in particular for complex traits. Next Generation Sequencing (NGS) technologies are allowing the mass sequencing of genomes and transcriptomes, which is producing a vast array of genomic information. The analysis of NGS data by means of bioinformatics developments allows discovering new genes and regulatory sequences and their positions, and makes available large collections of molecular markers. Genome-wide expression studies provide breeders with an understanding of the molecular basis of complex traits. Genomic approaches include TILLING and EcoTILLING, which make possible to screen mutant and germplasm collections for allelic variants in target genes. Re-sequencing of genomes is very useful for the genome-wide discovery of markers amenable for high-throughput genotyping platforms, like SSRs and SNPs, or the construction of high density genetic maps. All these tools and resources facilitate studying the genetic diversity, which is important for germplasm management, enhancement and use. Also, they allow the identification of markers linked to genes and QTLs, using a diversity of techniques like bulked segregant analysis (BSA), fine genetic mapping, or association mapping. These new markers are used for marker assisted selection, including marker assisted backcross selection, ‘breeding by design’, or new strategies, like genomic selection. In conclusion, advances in genomics are providing breeders with new tools and methodologies that allow a great leap forward in plant breeding, including the ‘superdomestication’ of crops and the genetic dissection and breeding for complex traits.
Collapse
|