1
|
Xia H, Hao Z, Shen Y, Tu Z, Yang L, Zong Y, Li H. Genome-wide association study of multiyear dynamic growth traits in hybrid Liriodendron identifies robust genetic loci associated with growth trajectories. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023; 115:1544-1563. [PMID: 37272730 DOI: 10.1111/tpj.16337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 04/30/2023] [Accepted: 05/29/2023] [Indexed: 06/06/2023]
Abstract
The genetic factors underlying growth traits differ over time points or stages. However, most current studies of phenotypes at single time points do not capture all loci or explain the genetic differences underlying growth trajectories. Hybrid Liriodendron exhibits obvious heterosis and is widely cultivated, although its complex genetic mechanism underlying growth traits remains unknown. A genome-wide association study (GWAS) is an effective method for elucidating the genetic architecture by identifying genetic loci underlying complex quantitative traits. In the present study, using a GWAS, we identified robust loci associated with growth trajectories in hybrid Liriodendron populations. We selected 233 hybrid progenies derived from 25 crosses for resequencing, and measured their tree height (H) and diameter at breast height (DBH) for 11 consecutive years; 192 972 high-quality single nucleotide polymorphisms (SNPs) were obtained. The dynamics of the multiyear single-trait GWAS showed that year-specific SNPs predominated, and only five robust SNPs for DBH were identified in at least three different years. Multitrait GWAS analysis with model parameters as latent variables also revealed 62 SNPs for H and 52 for DBH associated with the growth trajectory, displaying different biomass accumulation patterns, among which four SNPs exerted pleiotropic effects. All identified SNPs also exhibited temporal variations in effect sizes and inheritance patterns potentially related to different growth and developmental stages. The haplotypes resulting from these significant SNPs might pyramid favorable loci, benefitting the selection of superior genotypes. The present study provides insights into the genetic architecture of dynamic growth traits and lays a basis for future molecular-assisted breeding.
Collapse
Affiliation(s)
- Hui Xia
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Ziyuan Hao
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Yufang Shen
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Zhonghua Tu
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Lichun Yang
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Yaxian Zong
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Huogen Li
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| |
Collapse
|
2
|
Fu Y, Zhao H, Huang J, Zhu H, Luan X, Bu S, Liu Z, Wang X, Peng Z, Meng L, Liu G, Zhang G, Wang S. Dynamic analysis of QTLs on plant height with single segment substitution lines in rice. Sci Rep 2022; 12:5465. [PMID: 35361859 PMCID: PMC8971505 DOI: 10.1038/s41598-022-09536-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 03/24/2022] [Indexed: 01/22/2023] Open
Abstract
Dynamic regulation of QTLs remains mysterious. Single segment substitution lines (SSSLs) and conditional QTL mapping and functional QTL mappings are ideal materials and methods to explore dynamics of QTLs for complex traits. This paper analyzed the dynamics of QTLs on plant height with SSSLs in rice. Five SSSLs were verified with plant height QTLs first. All five QTLs had significant positive effects at one or more developmental stages except QTL1. They interacted each other, with negative effects before 49 d after transplanting and positive effects since then. The five QTLs selectively expressed in specific periods, mainly in the periods from 35 to 42 d and from 49 to 56 d after transplanting. Expressions of epistasis were dispersedly in various periods, negative effects appearing mainly before 35 d. The five QTLs brought the inflexion point ahead of schedule, accelerated growth and degradation, and changed the peak plant height, while their interactions had the opposite effects. The information will be helpful to understand the genetic mechanism for developmental traits.
Collapse
Affiliation(s)
- Yu Fu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Hongyuan Zhao
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Jiongkai Huang
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Haitao Zhu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Xin Luan
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Suhong Bu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Zupei Liu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Xiaoling Wang
- National Engineering Research Center of Rice (Nanchang), Rice Research Institute, Jiangxi Academy of Agricultural Sciences, Nanchang, 330299, People's Republic of China
| | - Zhiqin Peng
- National Engineering Research Center of Rice (Nanchang), Rice Research Institute, Jiangxi Academy of Agricultural Sciences, Nanchang, 330299, People's Republic of China
| | - Lijun Meng
- Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 440307, People's Republic of China.
| | - Guifu Liu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Guiquan Zhang
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Shaokui Wang
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| |
Collapse
|
3
|
Luan X, Xiong L, Xu H, Zhu H, Bu S, Meng L, Liu G, Wang S. Functional mapping of tillering QTLs using the Wang-Lan-Ding model and a SSSL population. Mol Genet Genomics 2021; 296:1279-1286. [PMID: 34536132 DOI: 10.1007/s00438-021-01819-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 08/29/2021] [Indexed: 11/24/2022]
Abstract
Understanding dynamic changes in the genetic architecture of quantitative traits is crucial in developmental genetics. Functional mapping is an appropriate method that passes a mathematical equation to describe a biological developmental process with the genetic mapping framework. Appropriate genetic model and applicable mapping population are indispensable condition for functional mapping of important agronomic traits in plants. Based on the Wang-Lan-Ding model, we ever applied a DH population to carry out functional mapping QTLs underlying growth trajectory on tiller number. However, inconsistent genetic background among the DH lines might disturb the mapping results. With the advent of innovative research materials, single segment substitution lines, allows us to do more precise genetic analyses. Thus functional mapping was again conducted on tiller number using the Wang-Lan-Ding model and a single segment substitution line population with the genetic background of Huajingxian 74 so as to explore QTL dynamic mechanism to regulate developmental traits. We detected that all five single segment substitution lines harbored tillering QTLs with additives and/or dominances to influence the four functional parameters, the optimum tillering time (t0), the maximum tiller number (K), the tillering increased rate (r) and the tillering decreased rate (c), which were estimated from the Wang-Lan-Ding model and with some biological meaning. They mainly brought the inflexion point (t0) delay, the peak increase (K) and the degradation (c) acceleration, while the growth (r) slow down. Moreover, epistatic interactions among these QTLs were confirmed to be prevalent. A total of 39 significant epistatic effects were detected to associate with the four parameters, occupying 34.8% of 112 pairs of epistatic interactions investigated. Contrary to the QTL effects, these epistatic effects largely decreased t0, K and c, while increased r. Our results indicated that the five QTL effects and their epistatic effects significantly changed the shape and trajectory of tiller number via influence of the four functional parameters. Rational use of these QTLs is expected to improve tillering number of rice by molecular design breeding.
Collapse
Affiliation(s)
- Xin Luan
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Liang Xiong
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Haiming Xu
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, 310058, China
| | - Haitao Zhu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Suhong Bu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Lijun Meng
- Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 440307, People's Republic of China.
| | - Guifu Liu
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Shaokui Wang
- Guangdong Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| |
Collapse
|
4
|
Yan W, Letizia P, Zhou W. Three cobblers worth the mastermind? The potential of ensemble in crowdsourced classification problems. DECISION SCIENCES 2021. [DOI: 10.1111/deci.12516] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Wangcheng Yan
- Advanced Institute of Business Tongji University Shanghai China
| | - Paolo Letizia
- Stokely Management Center University of Tennessee Knoxville Tennessee
| | - Wenjun Zhou
- Stokely Management Center University of Tennessee Knoxville Tennessee
| |
Collapse
|
5
|
Moreira FF, Oliveira HR, Volenec JJ, Rainey KM, Brito LF. Integrating High-Throughput Phenotyping and Statistical Genomic Methods to Genetically Improve Longitudinal Traits in Crops. FRONTIERS IN PLANT SCIENCE 2020; 11:681. [PMID: 32528513 PMCID: PMC7264266 DOI: 10.3389/fpls.2020.00681] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 04/30/2020] [Indexed: 05/28/2023]
Abstract
The rapid development of remote sensing in agronomic research allows the dynamic nature of longitudinal traits to be adequately described, which may enhance the genetic improvement of crop efficiency. For traits such as light interception, biomass accumulation, and responses to stressors, the data generated by the various high-throughput phenotyping (HTP) methods requires adequate statistical techniques to evaluate phenotypic records throughout time. As a consequence, information about plant functioning and activation of genes, as well as the interaction of gene networks at different stages of plant development and in response to environmental stimulus can be exploited. In this review, we outline the current analytical approaches in quantitative genetics that are applied to longitudinal traits in crops throughout development, describe the advantages and pitfalls of each approach, and indicate future research directions and opportunities.
Collapse
Affiliation(s)
- Fabiana F. Moreira
- Department of Agronomy, Purdue University, West Lafayette, IN, United States
| | - Hinayah R. Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| | - Jeffrey J. Volenec
- Department of Agronomy, Purdue University, West Lafayette, IN, United States
| | - Katy M. Rainey
- Department of Agronomy, Purdue University, West Lafayette, IN, United States
| | - Luiz F. Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
6
|
Ye M, Jiang L, Chen C, Zhu X, Wang M, Wu R. np 2 QTL: networking phenotypic plasticity quantitative trait loci across heterogeneous environments. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 99:796-806. [PMID: 31009134 DOI: 10.1111/tpj.14355] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Revised: 03/29/2019] [Accepted: 04/04/2019] [Indexed: 05/18/2023]
Abstract
Despite its critical importance to our understanding of plant growth and adaptation, the question of how environment-induced plastic response is affected genetically remains elusive. Previous studies have shown that the reaction norm of an organism across environmental index obeys the allometrical scaling law of part-whole relationships. The implementation of this phenomenon into functional mapping can characterize how quantitative trait loci (QTLs) modulate the phenotypic plasticity of complex traits to heterogeneous environments. Here, we assemble functional mapping and allometry theory through Lokta-Volterra ordinary differential equations (LVODE) into an R-based computing platform, np2 QTL, aimed to map and visualize phenotypic plasticity QTLs. Based on LVODE parameters, np2 QTL constructs a bidirectional, signed and weighted network of QTL-QTL epistasis, whose emergent properties reflect the ecological mechanisms for genotype-environment interactions over any range of environmental change. The utility of np2 QTL was validated by comprehending the genetic architecture of phenotypic plasticity via the reanalysis of published plant height data involving 3502 recombinant inbred lines of maize planted in multiple discrete environments. np2 QTL also provides a tool for constructing a predictive model of phenotypic responses in extreme environments relative to the median environment.
Collapse
Affiliation(s)
- Meixia Ye
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, 100083, China
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Libo Jiang
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, 100083, China
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Chixiang Chen
- Departments of Public Health Sciences and Statistics, Center for Statistical Genetics, Pennsylvania State University, Hershey, PA, 17033, USA
| | - Xuli Zhu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, 100083, China
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Ming Wang
- Departments of Public Health Sciences and Statistics, Center for Statistical Genetics, Pennsylvania State University, Hershey, PA, 17033, USA
| | - Rongling Wu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, 100083, China
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
- Departments of Public Health Sciences and Statistics, Center for Statistical Genetics, Pennsylvania State University, Hershey, PA, 17033, USA
| |
Collapse
|
7
|
Jiang L, Zhang M, Sang M, Ye M, Wu R. Evo-Devo-EpiR: a genome-wide search platform for epistatic control on the evolution of development. Brief Bioinform 2017; 18:754-760. [PMID: 27473062 DOI: 10.1093/bib/bbw062] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Indexed: 11/14/2022] Open
Abstract
Evo-devo is a theory proposed to study how phenotypes evolve by comparing the developmental processes of different organisms or the same organism experiencing changing environments. It has been recognized that nonallelic interactions at different genes or quantitative trait loci, known as epistasis, may play a pivotal role in the evolution of development, but it has proven difficult to quantify and elucidate this role into a coherent picture. We implement a high-dimensional genome-wide association study model into the evo-devo paradigm and pack it into the R-based Evo-Devo-EpiR, aimed at facilitating the genome-wide landscaping of epistasis for the diversification of phenotypic development. By analyzing a high-throughput assay of DNA markers and their pairs simultaneously, Evo-Devo-EpiR is equipped with a capacity to systematically characterize various epistatic interactions that impact on the pattern and timing of development and its evolution. Enabling a global search for all possible genetic interactions for developmental processes throughout the whole genome, Evo-Devo-EpiR provides a computational tool to illustrate a precise genotype-phenotype map at interface between epistasis, development and evolution.
Collapse
|
8
|
Qi J, Sun J, Wang J. E-Index for Differentiating Complex Dynamic Traits. BIOMED RESEARCH INTERNATIONAL 2016; 2016:5761983. [PMID: 27064292 PMCID: PMC4811058 DOI: 10.1155/2016/5761983] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2015] [Revised: 10/28/2015] [Accepted: 02/11/2016] [Indexed: 11/21/2022]
Abstract
While it is a daunting challenge in current biology to understand how the underlying network of genes regulates complex dynamic traits, functional mapping, a tool for mapping quantitative trait loci (QTLs) and single nucleotide polymorphisms (SNPs), has been applied in a variety of cases to tackle this challenge. Though useful and powerful, functional mapping performs well only when one or more model parameters are clearly responsible for the developmental trajectory, typically being a logistic curve. Moreover, it does not work when the curves are more complex than that, especially when they are not monotonic. To overcome this inadaptability, we therefore propose a mathematical-biological concept and measurement, E-index (earliness-index), which cumulatively measures the earliness degree to which a variable (or a dynamic trait) increases or decreases its value. Theoretical proofs and simulation studies show that E-index is more general than functional mapping and can be applied to any complex dynamic traits, including those with logistic curves and those with nonmonotonic curves. Meanwhile, E-index vector is proposed as well to capture more subtle differences of developmental patterns.
Collapse
Affiliation(s)
- Jiandong Qi
- School of Information, Beijing Forestry University, Beijing 100083, China
| | - Jianfeng Sun
- School of Information, Beijing Forestry University, Beijing 100083, China
| | - Jianxin Wang
- School of Information, Beijing Forestry University, Beijing 100083, China
- Center for Computational Biology, Beijing Forestry University, Beijing 100083, China
| |
Collapse
|
9
|
Yan Q, Zhu X, Jiang L, Ye M, Sun L, Terblanche JS, Wu R. A computing platform to map ecological metabolism by integrating functional mapping and the metabolic theory of ecology. Brief Bioinform 2016; 18:137-144. [DOI: 10.1093/bib/bbv116] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Revised: 12/17/2015] [Indexed: 11/12/2022] Open
|
10
|
Li J, Wang Z, Li R, Wu R. BAYESIAN GROUP LASSO FOR NONPARAMETRIC VARYING-COEFFICIENT MODELS WITH APPLICATION TO FUNCTIONAL GENOME-WIDE ASSOCIATION STUDIES. Ann Appl Stat 2015; 9:640-664. [PMID: 26478762 DOI: 10.1214/15-aoas808] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Although genome-wide association studies (GWAS) have proven powerful for comprehending the genetic architecture of complex traits, they are challenged by a high dimension of single-nucleotide polymorphisms (SNPs) as predictors, the presence of complex environmental factors, and longitudinal or functional natures of many complex traits or diseases. To address these challenges, we propose a high-dimensional varying-coefficient model for incorporating functional aspects of phenotypic traits into GWAS to formulate a so-called functional GWAS or fGWAS. Bayesian group lasso and the associated MCMC algorithms are developed to identify significant SNPs and estimate how they affect longitudinal traits through time-varying genetic actions. The model is generalized to analyze the genetic control of complex traits using subject-specific sparse longitudinal data. The statistical properties of the new model are investigated through simulation studies. We use the new model to analyze a real GWAS data set from the Framingham Heart Study, leading to the identification of several significant SNPs associated with age-specific changes of body mass index. The fGWAS model, equipped with Bayesian group lassso, will provide a useful tool for genetic and developmental analysis of complex traits or diseases.
Collapse
Affiliation(s)
- Jiahan Li
- Department of Applied and Computational, Mathematics and Statistics, The University of Notre Dame, Notre Dame, IN 46556.
| | - Zhong Wang
- Center for Computational Biology, Beijing Forestry University, Beijing, China 100083.
| | - Runze Li
- Department of Statistics, The Methodology Center, The Pennsylvania State University, University Park, PA 16802.
| | - Rongling Wu
- Center for Statistical Genetics, The Pennsylvania State University, Hershey, PA 17033.
| |
Collapse
|
11
|
Huang Z, Tong C, Bo W, Pang X, Wang Z, Xu J, Gai J, Wu R. An allometric model for mapping seed development in plants. Brief Bioinform 2015; 15:562-70. [PMID: 23543351 DOI: 10.1093/bib/bbt019] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Despite a tremendous effort to map quantitative trait loci (QTLs) responsible for agriculturally and biologically important traits in plants, our understanding of how a QTL governs the developmental process of plant seeds remains elusive. In this article, we address this issue by describing a model for functional mapping of seed development through the incorporation of the relationship between vegetative and reproductive growth. The time difference of reproductive from vegetative growth is described by Reeve and Huxley’s allometric equation. Thus, the implementation of this equation into the framework of functional mapping allows dynamic QTLs for seed development to be identified more precisely. By estimating and testing mathematical parameters that define Reeve and Huxley’s allometric equations of seed growth, the dynamic pattern of the genetic effects of the QTLs identified can be analyzed. We used the model to analyze a soybean data, leading to the detection of QTLs that control the growth of seed dry weight. Three dynamic QTLs, located in two different linkage groups, were detected to affect growth curves of seed dry weight. The QTLs detected may be used to improve seed yield with marker-assisted selection by altering the pattern of seed development in a hope to achieve a maximum size of seeds at a harvest time.
Collapse
|
12
|
Liu L, Lai Y, Cheng J, Wang L, Du W, Wang Z, Zhang H. Dynamic quantitative trait locus analysis of seed vigor at three maturity stages in rice. PLoS One 2014; 9:e115732. [PMID: 25536503 PMCID: PMC4275237 DOI: 10.1371/journal.pone.0115732] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 11/28/2014] [Indexed: 12/17/2022] Open
Abstract
Seed vigor is an important characteristic of seed quality. In this study, one rice population of recombinant inbred lines (RILs) was used to determine the genetic characteristics of seed vigor, including the germination potential, germination rate, germination index and time for 50% of germination, at 4 (early), 5 (middle) and 6 weeks (late) after heading in two years. A total of 24 additive and 9 epistatic quantitative trait loci (QTL) for seed vigor were identified using QTL Cartographer and QTLNetwork program respectively in 2012; while 32 simple sequence repeat (SSR) markers associated with seed vigor were detected using bulked segregant analysis (BSA) in 2013. The additive, epistatic and QTL × development interaction effects regulated the dry maturity developmental process to improve seed vigor in rice. The phenotypic variation explained by each additive, epistatic QTL and QTL × development interaction ranged from 5.86 to 40.67%, 4.64 to 11.28% and 0.01 to 1.17%, respectively. The QTLs were rarely co-localized among the different maturity stages; more QTLs were expressed at the early maturity stage followed by the late and middle stages. Twenty additive QTLs were stably expressed in two years which might play important roles in establishment of seed vigor in different environments. By comparing chromosomal positions of these stably expressed additive QTLs with those previously identified, the regions of QTL for seed vigor are likely to coincide with QTL for grain size, low temperature germinability and seed dormancy; while 5 additive QTL might represent novel genes. Using four selected RILs, three cross combinations of seed vigor for the development of RIL populations were predicted; 19 elite alleles could be pyramided by each combination.
Collapse
Affiliation(s)
- Liangfeng Liu
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| | - Yanyan Lai
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| | - Jinping Cheng
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| | - Ling Wang
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| | - Wenli Du
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| | - Zhoufei Wang
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| | - Hongsheng Zhang
- The Laboratory of Seed Science and Technology, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing Jiangsu, PR China
| |
Collapse
|
13
|
Ye M, Jiang L, Mao K, Wang Y, Wang Z, Wu R. Functional mapping of seasonal transition in perennial plants. Brief Bioinform 2014; 16:526-35. [DOI: 10.1093/bib/bbu025] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2014] [Accepted: 06/25/2014] [Indexed: 11/13/2022] Open
|
14
|
Wang Z, Li H, Wang J, Li J, Wu R. Statistical resolution of missing longitudinal data in clinical pharmacogenomics. Adv Drug Deliv Rev 2013; 65:980-6. [PMID: 23523630 DOI: 10.1016/j.addr.2013.03.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2013] [Revised: 02/13/2013] [Accepted: 03/13/2013] [Indexed: 11/16/2022]
Abstract
Clinical pharmacogenomics, integrating genomic information with clinical practices to facilitate the prediction of drug response, has recently emerged as a vital area of public health. In clinical trials, phenotypic data on drug response are often longitudinal, with some patients dropping out early due to physiological or other unpredictable reasons. The genetic analysis of such missing longitudinal data presents a significant challenge in clinical pharmacogenomics. We develop a statistical algorithm for detecting haplotypes that control longitudinal responses subject to non-ignorable dropout. The model was derived by incorporating the selection model into a dynamic model - functional mapping, aimed to discover genetic variants that contribute to phenotypic variation in longitudinal traits. The selection models is a statistical approach for analyzing missing longitudinal data by assuming that dropout depends on the outcome of drug response. The model derived can jointly characterize the genetic control of longitudinal responses and dropout events. Simulation studies were performed to investigate the statistical properties of the model and validate its practical usefulness. The model will find its implications for clinical pharmacogenomics toward personalized medicine.
Collapse
Affiliation(s)
- Zhong Wang
- Center for Computational Biology, Beijing Forestry University, Beijing, China
| | | | | | | | | |
Collapse
|
15
|
Londono D, Chen KM, Musolf A, Wang R, Shen T, Brandon J, Herring JA, Wise CA, Zou H, Jin M, Yu L, Finch SJ, Matise TC, Gordon D. A novel method for analyzing genetic association with longitudinal phenotypes. Stat Appl Genet Mol Biol 2013; 12:241-61. [PMID: 23502345 DOI: 10.1515/sagmb-2012-0070] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Knowledge of genes influencing longitudinal patterns may offer information about predicting disease progression. We developed a systematic procedure for testing association between SNP genotypes and longitudinal phenotypes. We evaluated false positive rates and statistical power to localize genes for disease progression. We used genome-wide SNP data from the Framingham Heart Study. With longitudinal data from two real studies unrelated to Framingham, we estimated three trajectory curves from each study. We performed simulations by randomly selecting 500 individuals. In each simulation replicate, we assigned each individual to one of the three trajectory groups based on the underlying hypothesis (null or alternative), and generated corresponding longitudinal data. Individual Bayesian posterior probabilities (BPPs) for belonging to a specific trajectory curve were estimated. These BPPs were treated as a quantitative trait and tested (using the Wald test) for genome-wide association. Empirical false positive rates and power were calculated. Our method maintained the expected false positive rate for all simulation models. Also, our method achieved high empirical power for most simulations. Our work presents a method for disease progression gene mapping. This method is potentially clinically significant as it may allow doctors to predict disease progression based on genotype and determine treatment accordingly.
Collapse
Affiliation(s)
- Douglas Londono
- Department of Genetics, Rutgers, The State University of New Jersey, 145 Bevier Road, Piscataway, NJ 08854, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Abstract
Owing to their ability and flexibility to describe individual gene expression at different time points, random regression (RR) analyses have become a popular procedure for the genetic analysis of dynamic traits whose phenotypes are collected over time. Specifically, when modelling the dynamic patterns of gene expressions in the RR framework, B-splines have been proved successful as an alternative to orthogonal polynomials. In the so-called Bayesian B-spline quantitative trait locus (QTL) mapping, B-splines are used to characterize the patterns of QTL effects and individual-specific time-dependent environmental errors over time, and the Bayesian shrinkage estimation method is employed to estimate model parameters. Extensive simulations demonstrate that (1) in terms of statistical power, Bayesian B-spline mapping outperforms the interval mapping based on the maximum likelihood; (2) for the simulated dataset with complicated growth curve simulated by B-splines, Legendre polynomial-based Bayesian mapping is not capable of identifying the designed QTLs accurately, even when higher-order Legendre polynomials are considered and (3) for the simulated dataset using Legendre polynomials, the Bayesian B-spline mapping can find the same QTLs as those identified by Legendre polynomial analysis. All simulation results support the necessity and flexibility of B-spline in Bayesian mapping of dynamic traits. The proposed method is also applied to a real dataset, where QTLs controlling the growth trajectory of stem diameters in Populus are located.
Collapse
|
17
|
Das K, Li J, Fu G, Wang Z, Li R, Wu R. Dynamic semiparametric Bayesian models for genetic mapping of complex trait with irregular longitudinal data. Stat Med 2012; 32:509-23. [PMID: 22903809 DOI: 10.1002/sim.5535] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Accepted: 04/13/2012] [Indexed: 01/27/2023]
Abstract
Many phenomena of fundamental importance to biology and biomedicine arise as a dynamic curve, such as organ growth and HIV dynamics. The genetic mapping of these traits is challenged by longitudinal variables measured at irregular and possibly subject-specific time points, in which case nonnegative definiteness of the estimated covariance matrix needs to be guaranteed. We present a semiparametric approach for genetic mapping within the mixture-model setting by jointly modeling mean and covariance structures for irregular longitudinal data. Penalized spline is used to model the mean functions of individual quantitative trait locus (QTL) genotypes as latent variables, whereas an extended generalized linear model is used to approximate the covariance matrix. The parameters for modeling the mean-covariances are estimated by MCMC, using the Gibbs sampler and the Metropolis-Hastings algorithm. We derive the full conditional distributions for the mean and covariance parameters and compute Bayes factors to test the hypothesis about the existence of significant QTLs. We used the model to screen the existence of specific QTLs for age-specific change of body mass index with a sparse longitudinal data set. The new model provides powerful means for broadening the application of genetic mapping to reveal the genetic control of dynamic traits.
Collapse
Affiliation(s)
- Kiranmoy Das
- Center for Statistical Genetics, The Pennsylvania State University, Hershey, PA 17033, U.S.A
| | | | | | | | | | | |
Collapse
|
18
|
Das K, Li R, Huang Z, Gai J, Wu R. A Bayesian Framework for Functional Mapping through Joint Modeling of Longitudinal and Time-to-Event Data. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2012; 2012:680634. [PMID: 22685454 PMCID: PMC3364578 DOI: 10.1155/2012/680634] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Revised: 03/13/2012] [Accepted: 03/19/2012] [Indexed: 06/01/2023]
Abstract
The most powerful and comprehensive approach of study in modern biology is to understand the whole process of development and all events of importance to development which occur in the process. As a consequence, joint modeling of developmental processes and events has become one of the most demanding tasks in statistical research. Here, we propose a joint modeling framework for functional mapping of specific quantitative trait loci (QTLs) which controls developmental processes and the timing of development and their causal correlation over time. The joint model contains two submodels, one for a developmental process, known as a longitudinal trait, and the other for a developmental event, known as the time to event, which are connected through a QTL mapping framework. A nonparametric approach is used to model the mean and covariance function of the longitudinal trait while the traditional Cox proportional hazard (PH) model is used to model the event time. The joint model is applied to map QTLs that control whole-plant vegetative biomass growth and time to first flower in soybeans. Results show that this model should be broadly useful for detecting genes controlling physiological and pathological processes and other events of interest in biomedicine.
Collapse
Affiliation(s)
- Kiranmoy Das
- Department of Statistics, Temple University, Philadelphia, PA 19122, USA
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Center for Statistical Genetics, The Pennsylvania State University, Hershey, PA 17033, USA
| | - Runze Li
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
| | - Zhongwen Huang
- Department of Agronomy, Henan Institute of Science and Technology, Xinxiang 453003, China
- National Center for Soybean Improvement, National Key Laboratory of Crop Genetics and Germplasm Enhancement, Soybean Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Junyi Gai
- Department of Agronomy, Henan Institute of Science and Technology, Xinxiang 453003, China
- National Center for Soybean Improvement, National Key Laboratory of Crop Genetics and Germplasm Enhancement, Soybean Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Rongling Wu
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Center for Statistical Genetics, The Pennsylvania State University, Hershey, PA 17033, USA
| |
Collapse
|
19
|
Wang S, Xiong W, Ma W, Chanock S, Jedrychowski W, Wu R, Perera FP. Gene-environment interactions on growth trajectories. Genet Epidemiol 2012; 36:206-13. [PMID: 22311237 PMCID: PMC3380164 DOI: 10.1002/gepi.21613] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2011] [Accepted: 12/09/2011] [Indexed: 11/09/2022]
Abstract
It has been suggested that children with larger brains tend to perform better on IQ tests or cognitive function tests. Prenatal head growth and head growth in infancy are two crucial periods for subsequent intelligence. Studies have shown that environmental exposure to air pollutants during pregnancy is associated with fetal growth reduction, developmental delay, and reduced IQ. Meanwhile, genetic polymorphisms may modify the effect of environment on head growth. However, studies on gene-environment or gene-gene interactions on growth trajectories have been quite limited partly due to the difficulty to quantitatively measure interactions on growth trajectories. Moreover, it is known that assessing the significance of gene-environment or gene-gene interactions on cross-sectional outcomes empirically using the permutation procedures may bring substantial errors in the tests. We proposed a score that quantitatively measures interactions on growth trajectories and developed an algorithm with a parametric bootstrap procedure to empirically assess the significance of the interactions on growth trajectories under the likelihood framework. We also derived a Wald statistic to test for interactions on growth trajectories and compared it to the proposed parametric bootstrap procedure. Through extensive simulation studies, we demonstrated the feasibility and power of the proposed testing procedures. We applied our method to a real dataset with head circumference measures from birth to age 7 on a cohort currently being conducted by the Columbia Center for Children's Environmental Health (CCCEH) in Krakow, Poland, and identified several significant gene-environment interactions on head circumference growth trajectories.
Collapse
Affiliation(s)
- Shuang Wang
- Department of Biostatistics, Mailman School of Public Health, Columbia University, 722 West 168th Street, New York, NY 10032, USA.
| | | | | | | | | | | | | |
Collapse
|
20
|
Fu G, Wang Z, Li J, Wu R. A mathematical framework for functional mapping of complex phenotypes using delay differential equations. J Theor Biol 2011; 289:206-16. [PMID: 21871898 DOI: 10.1016/j.jtbi.2011.08.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Revised: 06/17/2011] [Accepted: 08/02/2011] [Indexed: 11/27/2022]
Abstract
All biological phenomena occurring at different levels of organization from cells to organisms can be modeled as a dynamic system, in which the underlying components interact dynamically to comprehend its biological function. Such a systems modeling approach facilitates the use of biochemically and biophysically detailed mathematical models to describe and quantify "living cells," leading to an in-depth and precise understanding of the behavior, development and function of a biological system. Here, we illustrate how this approach can be used to map genes or quantitative trait loci (QTLs) that control a complex trait using the example of the circadian rhythm system which has been at the forefront of analytical mathematical modeling for many years. We integrate a system of biologically meaningful delay differential equations (DDEs) into functional mapping, a statistical model designed to map dynamic QTLs involved in biological processes. The DDEs model the ability of circadian rhythm to generate autonomously sustained oscillations with a period close to 24h, in terms of time-varying mRNA and protein abundances. By incorporating the Runge-Kutta fourth order algorithm within the likelihood-based context of functional mapping, we estimated the genetic parameters that define the periodic pattern of QTL effects on time-varying mRNA and protein abundances and their dynamic association as well as the linkage disequilibrium of the QTL and a marker. We prove theorems about how to choose appropriate parameters to guarantee periodic oscillations. We further used simulation studies to investigate how a QTL influences the period and the amplitude of circadian oscillations through changing model parameters. The model provides a quantitative framework for assessing the interplay between genetic effects of QTLs and rhythmic responses.
Collapse
Affiliation(s)
- Guifang Fu
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | |
Collapse
|
21
|
Arroyo J, Hutzler J, Bermejo C, Ragni E, García-Cantalejo J, Botías P, Piberger H, Schott A, Sanz AB, Strahl S. Functional and genomic analyses of blocked protein O-mannosylation in baker's yeast. Mol Microbiol 2011; 79:1529-46. [DOI: 10.1111/j.1365-2958.2011.07537.x] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
22
|
Lin M, Berg A, Wu R. Modeling the genetic etiology of pharmacokinetic-pharmacodynamic links with the ARMA process. J Biopharm Stat 2010; 20:351-72. [PMID: 20309763 DOI: 10.1080/10543400903572795] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Substantial variability exists among different patients in response to drugs. The identification of genetic factors that contribute to the interpersonal differentiation has been an important task for pharmacogenetic research and drug discovery. In this article, we have derived a high-dimensional statistical model for unveiling the genetic machinery for drug response by integrating two different but biologically related processes--pharmacokinetics (PK) and pharmacodynamics (PD)--into a genetic mapping framework. Using an integrated model of PK and PD, we can identify specific DNA sequence variants and test how they relate to the differential effect of the body to the drug (PK) and the effect of the drug on the body (PD). To effectively model a two-stage hierarchic structure of the covariance matrix at the PD and PK level, we have for the first time introduced an autoregressive moving-average (ARMA) process to the mixture-based likelihood function for sequence mapping. Closed-form estimates of the determinant and inverse of the ARMA-based covariance matrix are incorporated into the estimation step, which significantly increases the computational efficiency. Simulation studies have been performed to test the statistical behavior of our model. Potential applications of this model to pharmacogenetic research are discussed.
Collapse
Affiliation(s)
- Min Lin
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, North Carolina, USA.
| | | | | |
Collapse
|
23
|
Li Q, Huang Z, Xu M, Wang C, Gai J, Huang Y, Pang X, Wu R. Functional mapping of genotype-environment interactions for soybean growth by a semiparametric approach. PLANT METHODS 2010; 6:13. [PMID: 20525184 PMCID: PMC2903578 DOI: 10.1186/1746-4811-6-13] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2010] [Accepted: 06/02/2010] [Indexed: 05/14/2023]
Abstract
BACKGROUND Functional mapping is a powerful approach for mapping quantitative trait loci (QTLs) that control biological processes. Functional mapping incorporates mathematical aspects of growth and development into a general QTL mapping framework and has been recently integrated with composite interval mapping to build up a so-called composite functional mapping model, aimed to separate multiple linked QTLs on the same chromosomal region. RESULTS This article reports the principle of using composite functional mapping to estimate the effects of QTL-environment interactions on growth trajectories by parametrically modeling the tested QTL in a marker interval and nonparametrically modeling the markers outside the interval as co-factors. With this new model, we can characterize the dynamic patterns of the genetic effects of QTLs governing growth trajectories, estimate the global effects of the underlying QTLs during the course of growth and development, and test the differentiation in the shapes of QTL genotype-specific growth curves between different environments. By analyzing a real example from a soybean genome project, our model detects several QTLs that cause significant genotype-environment interactions for plant height growth processes. CONCLUSIONS The model provides a basis for deciphering the genetic architecture of trait expression adjusted to different biotic and abiotic environments for any organism.
Collapse
Affiliation(s)
- Qin Li
- Department of Statistics, University of Florida, Gainesville, FL 32611 USA
| | - Zhongwen Huang
- Department of Agronomy, Henan Institute of Science and Technology, Xinxiang, Henan 453003, China
- National Center for Soybean Improvement, Nanjing Agricultural University, Nanjing, Jiangsu 210095, China
| | - Meng Xu
- Key Laboratory of Forest Genetics and Tree Engineering, Nanjing Forestry University, Nanjing, Jiangsu 210037, China
| | - Chenguang Wang
- Department of Statistics, University of Florida, Gainesville, FL 32611 USA
| | - Junyi Gai
- National Center for Soybean Improvement, Nanjing Agricultural University, Nanjing, Jiangsu 210095, China
| | - Youjun Huang
- School of Forestry and Biotechnology, Zhejiang Forestry University, Lin'an, Zhejiang 311300, China
| | - Xiaoming Pang
- Center for Computational Biology, Beijing Forestry University, Beijing 100083, China
- National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Beijing Forestry University, Beijing 100083, China
| | - Rongling Wu
- Center for Computational Biology, Beijing Forestry University, Beijing 100083, China
- National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Beijing Forestry University, Beijing 100083, China
| |
Collapse
|
24
|
Ahn K, Luo J, Berg A, Keefe D, Wu R. Functional mapping of drug response with pharmacodynamic-pharmacokinetic principles. Trends Pharmacol Sci 2010; 31:306-11. [PMID: 20488563 DOI: 10.1016/j.tips.2010.04.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2010] [Revised: 04/21/2010] [Accepted: 04/21/2010] [Indexed: 12/15/2022]
Abstract
Recent research in pharmacogenomics has inspired our hope to predict drug response by linking it with DNA information extracted from the human genome. However, many genetic models of drug response do not incorporate biochemical principles of host-drug interactions, limiting the effectiveness of the predictive models. We argue that functional mapping, a computational tool aimed at identifying genes and genetic networks that control dynamic traits, can help explain the detailed genetic architecture of drug response by incorporating pharmacokinetic and pharmacodynamic processes. Functional mapping is particularly powerful in determining the genetic commonality and differences of drug efficacy vs. drug toxicity and drug sensitivity vs. drug resistance. We pinpoint several future directions in which functional mapping can be coupled with systems biology to unravel the genetic and metabolic machinery of drug response.
Collapse
Affiliation(s)
- Kwangmi Ahn
- Center for Statistical Genetics, Pennsylvania State University, Hershey, PA 17033, USA
| | | | | | | | | |
Collapse
|
25
|
|
26
|
Min L, Yang R, Wang X, Wang B. Bayesian analysis for genetic architecture of dynamic traits. Heredity (Edinb) 2010; 106:124-33. [PMID: 20332806 DOI: 10.1038/hdy.2010.20] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The dissection of the genetic architecture of quantitative traits, including the number and locations of quantitative trait loci (QTL) and their main and epistatic effects, has been an important topic in current QTL mapping. We extend the Bayesian model selection framework for mapping multiple epistatic QTL affecting continuous traits to dynamic traits in experimental crosses. The extension inherits the efficiency of Bayesian model selection and the flexibility of the Legendre polynomial model fitting to the change in genetic and environmental effects with time. We illustrate the proposed method by simultaneously detecting the main and epistatic QTLs for the growth of leaf age in a doubled-haploid population of rice. The behavior and performance of the method are also shown by computer simulation experiments. The results show that our method can more quickly identify interacting QTLs for dynamic traits in the models with many numbers of genetic effects, enhancing our understanding of genetic architecture for dynamic traits. Our proposed method can be treated as a general form of mapping QTL for continuous quantitative traits, being easier to extend to multiple traits and to a single trait with repeat records.
Collapse
Affiliation(s)
- L Min
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiaotong University, Shanghai, PR China
| | | | | | | |
Collapse
|
27
|
Abstract
SummaryAs an emerging powerful approach for mapping quantitative trait loci (QTLs) responsible for dynamic traits, functional mapping models the time-dependent mean vector with biologically meaningful equations and are likely to generate biologically relevant and interpretable results. Given the autocorrelation nature of a dynamic trait, functional mapping needs the implementation of the models for the structure of the covariance matrix. In this article, we have provided a comprehensive set of approaches for modelling the covariance structure and incorporated each of these approaches into the framework of functional mapping. The Bayesian information criterion (BIC) values are used as a model selection criterion to choose the optimal combination of the submodels for the mean vector and covariance structure. In an example for leaf age growth from a rice molecular genetic project, the best submodel combination was found between the Gaussian model for the correlation structure, power equation of order 1 for the variance and the power curve for the mean vector. Under this combination, several significant QTLs for leaf age growth trajectories were detected on different chromosomes. Our model can be well used to study the genetic architecture of dynamic traits of agricultural values.
Collapse
|
28
|
Yap JS, Fan J, Wu R. Nonparametric modeling of longitudinal covariance structure in functional mapping of quantitative trait loci. Biometrics 2009; 65:1068-77. [PMID: 19302406 PMCID: PMC2987658 DOI: 10.1111/j.1541-0420.2009.01222.x] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Estimation of the covariance structure of longitudinal processes is a fundamental prerequisite for the practical deployment of functional mapping designed to study the genetic regulation and network of quantitative variation in dynamic complex traits. We present a nonparametric approach for estimating the covariance structure of a quantitative trait measured repeatedly at a series of time points. Specifically, we adopt Huang et al.'s (2006, Biometrika 93, 85-98) approach of invoking the modified Cholesky decomposition and converting the problem into modeling a sequence of regressions of responses. A regularized covariance estimator is obtained using a normal penalized likelihood with an L(2) penalty. This approach, embedded within a mixture likelihood framework, leads to enhanced accuracy, precision, and flexibility of functional mapping while preserving its biological relevance. Simulation studies are performed to reveal the statistical properties and advantages of the proposed method. A real example from a mouse genome project is analyzed to illustrate the utilization of the methodology. The new method will provide a useful tool for genome-wide scanning for the existence and distribution of quantitative trait loci underlying a dynamic trait important to agriculture, biology, and health sciences.
Collapse
Affiliation(s)
- John Stephen Yap
- Department of Statistics, University of Florida, Gainesville, Florida 32611, USA
| | | | | |
Collapse
|
29
|
A statistical model for genetic mapping of viral infection by integrating epidemiological behavior. Stat Appl Genet Mol Biol 2009; 8:Article 38. [PMID: 19799557 DOI: 10.2202/1544-6115.1475] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Large-scale studies of genetic variation may be helpful for understanding the genetic control mechanisms of viral infection and, ultimately, predicting and eliminating infectious disease outbreaks. We propose a new statistical model for detecting specific DNA sequence variants that are responsible for viral infection. This model considers additive, dominance and epistatic effects of haplotypes from three different genomes, recipient, transmitter and virus, through an epidemiological process. The model is constructed within the maximum likelihood framework and implemented with the EM algorithm. A number of hypothesis tests about population genetic structure and diversity and the pattern of genetic control are formulated. A series of closed forms for the EM algorithm to estimate haplotype frequencies and haplotype effects in a network of genetic interactions among three genomes are derived. Simulation studies were performed to test the statistical properties of the model, recommending necessary sample sizes for obtaining reasonably good accuracy and precision of parameter estimation. By integrating, for the first time, the epidemiological principle of viral infection into genetic mapping, the new model shall find an immediate application to studying the genetic architecture of viral infection.
Collapse
|
30
|
A differential equation model for functional mapping of a virus-cell dynamic system. J Math Biol 2009; 61:1-15. [DOI: 10.1007/s00285-009-0288-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2009] [Revised: 07/13/2009] [Indexed: 11/26/2022]
|
31
|
Li N, Das K, Wu R. Functional mapping of human growth trajectories. J Theor Biol 2009; 261:33-42. [PMID: 19632241 DOI: 10.1016/j.jtbi.2009.07.020] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2009] [Revised: 07/05/2009] [Accepted: 07/08/2009] [Indexed: 12/18/2022]
Abstract
Human height is an important trait from biological and social perspectives. Genes have been widely recognized to be involved in human body growth, but their detailed controlling mechanisms are poorly understood. Here, we present a computational model for functional mapping of quantitative trait loci (QTLs) that control trajectories of human height growth through an interactive network. The model integrates mathematical equations of human growth curves into the mixture model-based functional mapping framework, allowing the identification and mapping of individual QTLs responsible for the developmental pattern of human growth. The model was derived on a random sample of subjects from a natural population, for each of which molecular markers within candidate genes or throughout the entire genome are typed and height data from childhood to adulthood are collected. A series of testable hypotheses are formulated about the genetic control of developmental timing and duration at different stages. The model was used to characterize epistatic QTLs for height growth hidden in 548 Japanese girls which is a semi-real data set with simulated the marker genotypes. With an increasing availability of genetic polymorphic data, the model will have great implications for probing the genetic and developmental mechanisms of human body growth and its associated diseases.
Collapse
Affiliation(s)
- Ning Li
- Department of Epidemiology and Biostatistics, University of Florida, Gainesville, FL 32611, USA
| | | | | |
Collapse
|
32
|
A Bayesian Algorithm for Functional Mapping of Dynamic Complex Traits. ALGORITHMS 2009. [DOI: 10.3390/a2020667] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
33
|
A computational approach to the functional clustering of periodic gene-expression profiles. Genetics 2008; 180:821-34. [PMID: 18780724 DOI: 10.1534/genetics.108.093690] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
DNA microarray analysis has emerged as a leading technology to enhance our understanding of gene regulation and function in cellular mechanism controls on a genomic scale. This technology has advanced to unravel the genetic machinery of biological rhythms by collecting massive gene-expression data in a time course. Here, we present a statistical model for clustering periodic patterns of gene expression in terms of different transcriptional profiles. The model incorporates biologically meaningful Fourier series approximations of gene periodic expression into a mixture-model-based likelihood function, thus producing results that are likely to be closer to biological relevance, as compared to those from existing models. Also because the structures of the time-dependent means and covariance matrix are modeled, the new approach displays increased statistical power and precision of parameter estimation. The approach was used to reanalyze a real example with 800 periodically expressed transcriptional genes in yeast, leading to the identification of 13 distinct patterns of gene-expression cycles. The model proposed can be useful for characterizing the complex biological effects of gene expression and generate testable hypotheses about the workings of developmental systems in a more precise quantitative way.
Collapse
|
34
|
Hou W, Li H, Zhang B, Huang M, Wu R. A nonlinear mixed-effect mixture model for functional mapping of dynamic traits. Heredity (Edinb) 2008; 101:321-8. [PMID: 18612322 DOI: 10.1038/hdy.2008.53] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Functional mapping has emerged as a next-generation statistical tool for mapping quantitative trait loci (QTL) that affect complex dynamic traits. In this article, we incorporated the idea of nonlinear mixed-effect (NLME) models into the mixture-based framework of functional mapping, aimed to generalize the spectrum of applications for functional mapping. NLME-based functional mapping, implemented with the linearization algorithm based on the first-order Taylor expansion, can provide reasonable estimates of QTL genotypic-specific curve parameters (fixed effect) and the between-individual variation of these parameters (random effect). Results from simulation studies suggest that the NLME-based model is more general than traditional functional mapping. The new model can be useful for the identification of the ontogenetic patterns of QTL genetic effects during time course.
Collapse
Affiliation(s)
- W Hou
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
| | | | | | | | | |
Collapse
|
35
|
Abstract
Functional mapping is a useful tool for mapping quantitative trait loci (QTL) that control dynamic traits. It incorporates mathematical aspects of biological processes into the mixture model-based likelihood setting for QTL mapping, thus increasing the power of QTL detection and the precision of parameter estimation. However, in many situations there is no obvious functional form and, in such cases, this strategy will not be optimal. Here we propose to use nonparametric function estimation, typically implemented with B-splines, to estimate the underlying functional form of phenotypic trajectories, and then construct a nonparametric test to find evidence of existing QTL. Using the representation of a nonparametric regression as a mixed model, the final test statistic is a likelihood ratio test. We consider two types of genetic maps: dense maps and general maps, and the power of nonparametric functional mapping is investigated through simulation studies and demonstrated by examples.
Collapse
Affiliation(s)
- Jie Yang
- Genetics Institute, University of Florida, Gainesville, Florida 32611, USA. jyang81@.ufl.edu
| | | | | |
Collapse
|
36
|
Das K, Wu R. A statistical model for the identification of genes governing the incidence of cancer with age. Theor Biol Med Model 2008; 5:7. [PMID: 18416827 PMCID: PMC2365934 DOI: 10.1186/1742-4682-5-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2007] [Accepted: 04/16/2008] [Indexed: 01/29/2023] Open
Abstract
The cancer incidence increases with age. This epidemiological pattern of cancer incidence can be attributed to molecular and cellular processes of individual subjects. Also, the incidence of cancer with ages can be controlled by genes. Here we present a dynamic statistical model for explaining the epidemiological pattern of cancer incidence based on individual genes that regulate cancer formation and progression. We incorporate the mathematical equations of age-specific cancer incidence into a framework for functional mapping aimed at identifying quantitative trait loci (QTLs) for dynamic changes of a complex trait. The mathematical parameters that specify differences in the curve of cancer incidence among QTL genotypes are estimated within the context of maximum likelihood. The model provides testable quantitative hypotheses about the initiation and duration of genetic expression for QTLs involved in cancer progression. Computer simulation was used to examine the statistical behavior of the model. The model can be used as a tool for explaining the epidemiological pattern of cancer incidence.
Collapse
Affiliation(s)
- Kiranmoy Das
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA.
| | | |
Collapse
|
37
|
Cui Y, Li S, Li G. Functional mapping imprinted quantitative trait loci underlying developmental characteristics. Theor Biol Med Model 2008; 5:6. [PMID: 18346281 PMCID: PMC2324076 DOI: 10.1186/1742-4682-5-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2008] [Accepted: 03/17/2008] [Indexed: 12/29/2022] Open
Abstract
Background Genomic imprinting, a phenomenon referring to nonequivalent expression of alleles depending on their parental origins, has been widely observed in nature. It has been shown recently that the epigenetic modification of an imprinted gene can be detected through a genetic mapping approach. Such an approach is developed based on traditional quantitative trait loci (QTL) mapping focusing on single trait analysis. Recent studies have shown that most imprinted genes in mammals play an important role in controlling embryonic growth and post-natal development. For a developmental character such as growth, current approach is less efficient in dissecting the dynamic genetic effect of imprinted genes during individual ontology. Results Functional mapping has been emerging as a powerful framework for mapping quantitative trait loci underlying complex traits showing developmental characteristics. To understand the genetic architecture of dynamic imprinted traits, we propose a mapping strategy by integrating the functional mapping approach with genomic imprinting. We demonstrate the approach through mapping imprinted QTL controlling growth trajectories in an inbred F2 population. The statistical behavior of the approach is shown through simulation studies, in which the parameters can be estimated with reasonable precision under different simulation scenarios. The utility of the approach is illustrated through real data analysis in an F2 family derived from LG/J and SM/J mouse stains. Three maternally imprinted QTLs are identified as regulating the growth trajectory of mouse body weight. Conclusion The functional iQTL mapping approach developed here provides a quantitative and testable framework for assessing the interplay between imprinted genes and a developmental process, and will have important implications for elucidating the genetic architecture of imprinted traits.
Collapse
Affiliation(s)
- Yuehua Cui
- Department of Statistics & Probability, Michigan State University, East Lansing, MI 48824, USA.
| | | | | |
Collapse
|
38
|
A semiparametric approach for composite functional mapping of dynamic quantitative traits. Genetics 2007; 177:1859-70. [PMID: 17947431 DOI: 10.1534/genetics.107.077321] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Functional mapping has emerged as a powerful tool for mapping quantitative trait loci (QTL) that control developmental patterns of complex dynamic traits. Original functional mapping has been constructed within the context of simple interval mapping, without consideration of separate multiple linked QTL for a dynamic trait. In this article, we present a statistical framework for mapping QTL that affect dynamic traits by capitalizing on the strengths of functional mapping and composite interval mapping. Within this so-called composite functional-mapping framework, functional mapping models the time-dependent genetic effects of a QTL tested within a marker interval using a biologically meaningful parametric function, whereas composite interval mapping models the time-dependent genetic effects of the markers outside the test interval to control the genome background using a flexible nonparametric approach based on Legendre polynomials. Such a semiparametric framework was formulated by a maximum-likelihood model and implemented with the EM algorithm, allowing for the estimation and the test of the mathematical parameters that define the QTL effects and the regression coefficients of the Legendre polynomials that describe the marker effects. Simulation studies were performed to investigate the statistical behavior of composite functional mapping and compare its advantage in separating multiple linked QTL as compared to functional mapping. We used the new mapping approach to analyze a genetic mapping example in rice, leading to the identification of multiple QTL, some of which are linked on the same chromosome, that control the developmental trajectory of leaf age.
Collapse
|
39
|
Wu S, Yang J, Wu R. Semiparametric functional mapping of quantitative trait loci governing long-term HIV dynamics. ACTA ACUST UNITED AC 2007; 23:i569-76. [PMID: 17646344 DOI: 10.1093/bioinformatics/btm164] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
MOTIVATION Functional mapping has proven to be powerful for characterizing quantitative trait loci (QTL) that control complex dynamic traits. More recently, functional mapping has been extended to identify the host QTL responsible for HIV dynamics by incorporating a parametric bi-exponential function for earlier stages of viral load trajectories. However, existing functional mapping cannot be used to map long-term HIV dynamics because no mathematical functions are available for later stages of HIV dynamic changes. RESULTS We derived a statistical model for functional mapping of dynamic QTL through characterizing HIV load trajectories during a long-term period semiparametrically. The new model was constructed within the maximum likelihood framework and implemented with the EM-simplex algorithm. It allows for the test of differences in the genetic control of short- and long-term HIV dynamics and the characterization of the effects of viral-host genome interaction. Extensive simulation studies have been performed to test the statistical behavior of this model. The new model will provide an important tool for genetic and genomic studies of human complex diseases like HIV/AIDS and their pathological progression. AVAILABILITY Available on request from the corresponding author.
Collapse
Affiliation(s)
- Song Wu
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
| | | | | |
Collapse
|
40
|
Yap JS, Wang C, Wu R. A computational approach for functional mapping of quantitative trait loci that regulate thermal performance curves. PLoS One 2007; 2:e554. [PMID: 17579725 PMCID: PMC1892808 DOI: 10.1371/journal.pone.0000554] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2007] [Accepted: 05/24/2007] [Indexed: 11/18/2022] Open
Abstract
Whether and how thermal reaction norm is under genetic control is fundamental to understand the mechanistic basis of adaptation to novel thermal environments. However, the genetic study of thermal reaction norm is difficult because it is often expressed as a continuous function or curve. Here we derive a statistical model for dissecting thermal performance curves into individual quantitative trait loci (QTL) with the aid of a genetic linkage map. The model is constructed within the maximum likelihood context and implemented with the EM algorithm. It integrates the biological principle of responses to temperature into a framework for genetic mapping through rigorous mathematical functions established to describe the pattern and shape of thermal reaction norms. The biological advantages of the model lie in the decomposition of the genetic causes for thermal reaction norm into its biologically interpretable modes, such as hotter-colder, faster-slower and generalist-specialist, as well as the formulation of a series of hypotheses at the interface between genetic actions/interactions and temperature-dependent sensitivity. The model is also meritorious in statistics because the precision of parameter estimation and power of QTLdetection can be increased by modeling the mean-covariance structure with a small set of parameters. The results from simulation studies suggest that the model displays favorable statistical properties and can be robust in practical genetic applications. The model provides a conceptual platform for testing many ecologically relevant hypotheses regarding organismic adaptation within the Eco-Devo paradigm.
Collapse
Affiliation(s)
- John Stephen Yap
- Department of Statistics, University of Florida, Gainesville, Florida, United States of America
| | - Chenguang Wang
- Department of Statistics, University of Florida, Gainesville, Florida, United States of America
| | - Rongling Wu
- Department of Statistics, University of Florida, Gainesville, Florida, United States of America
| |
Collapse
|
41
|
Zhao W, Li H, Hou W, Wu R. Wavelet-based parametric functional mapping of developmental trajectories with high-dimensional data. Genetics 2007; 176:1879-92. [PMID: 17435222 PMCID: PMC1931562 DOI: 10.1534/genetics.107.070920] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The biological and statistical advantages of functional mapping result from joint modeling of the mean-covariance structures for developmental trajectories of a complex trait measured at a series of time points. While an increased number of time points can better describe the dynamic pattern of trait development, significant difficulties in performing functional mapping arise from prohibitive computational times required as well as from modeling the structure of a high-dimensional covariance matrix. In this article, we develop a statistical model for functional mapping of quantitative trait loci (QTL) that govern the developmental process of a quantitative trait on the basis of wavelet dimension reduction. By breaking an original signal down into a spectrum by taking its averages (smooth coefficients) and differences (detail coefficients), we used the discrete Haar wavelet shrinkage technique to transform an inherently high-dimensional biological problem into its tractable low-dimensional representation within the framework of functional mapping constructed by a Gaussian mixture model. Unlike conventional nonparametric modeling of wavelet shrinkage, we incorporate mathematical aspects of developmental trajectories into the smooth coefficients used for QTL mapping, thus preserving the biological relevance of functional mapping in formulating a number of hypothesis tests at the interplay between gene actions/interactions and developmental patterns for complex phenotypes. This wavelet-based parametric functional mapping has been statistically examined and compared with full-dimensional functional mapping through simulation studies. It holds great promise as a powerful statistical tool to unravel the genetic machinery of developmental trajectories with large-scale high-dimensional data.
Collapse
Affiliation(s)
- Wei Zhao
- Department of Statistics, University of Florida, Gainesville, Florida 32611, USA
| | | | | | | |
Collapse
|
42
|
Lin M, Li H, Hou W, Johnson JA, Wu R. Modeling sequence-sequence interactions for drug response. Bioinformatics 2007; 23:1251-7. [PMID: 17392331 DOI: 10.1093/bioinformatics/btm110] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Genetic interactions or epistasis may play an important role in the genetic etiology of drug response. With the availability of large-scale, high-density single nucleotide polymorphism markers, a great challenge is how to associate haplotype structures and complex drug response through its underlying pharmacodynamic mechanisms. RESULTS We have derived a general statistical model for detecting an interactive network of DNA sequence variants that encode pharmacodynamic processes based on the haplotype map constructed by single nucleotide polymorphisms. The model was validated by a pharmacogenetic study for two predominant beta-adrenergic receptor (betaAR) subtypes expressed in the heart, beta1AR and beta2AR. Haplotypes from these two receptors trigger significant interaction effects on the response of heart rate to different dose levels of dobutamine. This model will have implications for pharmacogenetic and pharmacogenomic research and drug discovery. AVAILABILITY A computer program written in Matlab can be downloaded from the webpage of statistical genetics group at the University of Florida. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Min Lin
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
| | | | | | | | | |
Collapse
|
43
|
Abstract
Existing methods for mapping quantitative trait loci (QTL) in time-to-failure experiments assume that the QTL effect is constant over the course of the study. This assumption may be violated when the gene(s) underlying the QTL are up- or downregulated on a biologically meaningful timescale. In such situations, models that assume a constant effect can fail to detect QTL in a whole-genome scan. To investigate this possibility, we utilize an extension of the Cox model (EC model) within an interval-mapping framework. In its simplest form, this model assumes that the QTL effect changes at some time point t0 and follows a linear function before and after this change point. The approximate time point at which this change occurs is estimated. Using simulated and real data, we compare the mapping performance of the EC model to the Cox proportional hazards (CPH) model, which explicitly assumes a constant effect. The results show that the EC model detects time-dependent QTL, which the CPH model fails to detect. At the same time, the EC model recovers all of the QTL the CPH model detects. We conclude that potentially important QTL may be missed if their time-dependent effects are not accounted for.
Collapse
Affiliation(s)
- Frank Johannes
- Center for Developmental and Health Genetics, Pennsylvania State University, University Park, Pennsylvania 16803, USA.
| |
Collapse
|
44
|
Li H, Kim BR, Wu R. Identification of quantitative trait nucleotides that regulate cancer growth: a simulation approach. J Theor Biol 2006; 242:426-39. [PMID: 16650875 DOI: 10.1016/j.jtbi.2006.03.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2005] [Revised: 02/01/2006] [Accepted: 03/17/2006] [Indexed: 11/22/2022]
Abstract
A general growth model derived from basic cellular properties can be used to describe the dynamic process of cancer growth with mathematical equations. It has been recognized that cancer growth is under genetic control, with a multitude of interacting genes each segregating in a Mendelian fashion and displaying environmental sensitivity. In this article, we integrate the mathematical aspects of the pervasive growth model into a statistical framework for the identification of quantitative trait nucleotides that underlie cancer growth. This integrative framework is constructed with a single nucleotide polymorphism-based haplotype blocking analysis. Simulation studies have been performed to demonstrate the usefulness of the model. The proposed model provides a generic platform model for testing and detecting specific DNA sequence variants that regulates the timing of cancer emergence, growth and differentiation.
Collapse
Affiliation(s)
- Hongying Li
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
| | | | | |
Collapse
|
45
|
Wu R, Lin M. Functional mapping - how to map and study the genetic architecture of dynamic complex traits. Nat Rev Genet 2006; 7:229-37. [PMID: 16485021 DOI: 10.1038/nrg1804] [Citation(s) in RCA: 220] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The development of any organism is a complex dynamic process that is controlled by a network of genes as well as by environmental factors. Traditional mapping approaches for analysing phenotypic data measured at a single time point are too simple to reveal the genetic control of developmental processes. A general statistical mapping framework, called functional mapping, has been proposed to characterize, in a single step, the quantitative trait loci (QTLs) or nucleotides (QTNs) that underlie a complex dynamic trait. Functional mapping estimates mathematical parameters that describe the developmental mechanisms of trait formation and expression for each QTL or QTN. The approach provides a useful quantitative and testable framework for assessing the interplay between gene actions or interactions and developmental changes.
Collapse
Affiliation(s)
- Rongling Wu
- School of Forestry and Biotechnology, Zhejiang Forestry University, Lin'an, Zhejiang 311300, People's Republic of China.
| | | |
Collapse
|
46
|
Lin M, Wu R. A joint model for nonparametric functional mapping of longitudinal trajectory and time-to-event. BMC Bioinformatics 2006; 7:138. [PMID: 16539724 PMCID: PMC1479376 DOI: 10.1186/1471-2105-7-138] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2005] [Accepted: 03/15/2006] [Indexed: 11/10/2022] Open
Abstract
Background The characterization of the relationship between a longitudinal response process and a time-to-event has been a pressing challenge in biostatistical research. This has emerged as an important issue in genetic studies when one attempts to detect the common genes or quantitative trait loci (QTL) that govern both a longitudinal trajectory and developmental event. Results We present a joint statistical model for functional mapping of dynamic traits in which the event times and longitudinal traits are taken to depend on a common set of genetic mechanisms. By fitting the Legendre polynomial of orthogonal properties for the time-dependent mean vector, our model does not rely on any curve, which is different from earlier parametric models of functional mapping. This newly developed nonparametric model is demonstrated and validated by an example for a forest tree in which stemwood growth and the time to first flower are jointly modelled. Conclusion Our model allows for the detection of specific QTL that govern both longitudinal traits and developmental processes through either pleiotropic effects or close linkage, or both. This model will have great implications for integrating longitudinal and event data to gain better insights into comprehensive biology and biomedicine.
Collapse
Affiliation(s)
- Min Lin
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina 27710, USA
| | - Rongling Wu
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
| |
Collapse
|
47
|
Abstract
"Naturally occurring" or "programmed" cell death (PCD) in which the cell uses specialized cellular machinery to kill itself is a ubiquitous phenomenon that occurs early in organ development. Such a cell suicide mechanism that enables metazoans to control cell number and eliminate cells threatening the organism's survival has been thought to be under genetic control. In this report, we develop a novel statistical model for mapping specific genes or quantitative trait loci (QTL) that are responsible for the PCD process based on polymorphic molecular markers. This model incorporates the biological mechanisms of PCD that undergoes two different developmental stages, exponential growth and polynomial death. We derived a parametric approach to model the exponential growth and a nonparametric approach based on the Legendre function to model the polynomial death. A series of stationary and nonstationary models has been used to approximate the structure of the covariance matrix among cell numbers at a multitude of different times. The statistical behavior of our model is investigated through simulation studies and validated by a real example in rice.
Collapse
Affiliation(s)
- Yuehua Cui
- Department of Statistics, University of Florida, Gainesville, Florida 32611, USA
| | | | | |
Collapse
|
48
|
Hou W, Garvan CW, Littell RC, Behnke M, Eyler FD, Wu R. A framework to monitor environment-induced major genes for developmental trajectories: implication for a prenatal cocaine exposure study. Stat Med 2006; 25:4020-35. [PMID: 16463362 DOI: 10.1002/sim.2513] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Whether there are specific genes involved in response to different environmental agents and how such genes regulate developmental trajectories during lifetime are of fundamental importance in health, clinical and pharmaceutical research. In this article, we present a novel statistical model for monitoring environment-induced genes of major effects on longitudinal outcomes of a trait. This model is derived within the maximum likelihood framework, incorporated by mathematical aspects of growth and developmental processes. A typical structural model is implemented to approximate time-dependent covariance matrices for the longitudinal trait. This model allows for a number of biologically meaningful hypothesis tests regarding the effects of major genes on overall growth trajectories or particular stages of development. It can be used to test whether and how major genetic effects are expressed differently under altered environmental agents. In a well-designed case-control study, our model has been employed to detect cocaine-dependent genes that affect growth trajectories for head circumference during childhood. The detected gene triggers significant effects on growth curves in both cocaine-exposed (case) and unexposed groups (control), but with different extents. Significant genotype-environment interactions due to this so-called environment-sensitive gene are promising for further studies toward its genomic mapping using polymorphic molecular markers.
Collapse
Affiliation(s)
- Wei Hou
- Department of Statistics, University of Florida, Gainesville, FL 32611, USA
| | | | | | | | | | | |
Collapse
|
49
|
Wu S, Yang J, Wu R. Multilocus linkage disequilibrium mapping of epistatic quantitative trait loci that regulate HIV dynamics: a simulation approach. Stat Med 2006; 25:3826-49. [PMID: 16435340 DOI: 10.1002/sim.2489] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The time-dependent change of HIV particle load, i.e. HIV dynamics, is likely to be controlled by a multitude of quantitative trait loci (QTL) that interact with each other as well as with various developmental and environmental factors in a coordinated manner. In this article, we have derived a new statistical model for mapping the epistatic QTL responsible for HIV dynamics in a natural human population. This model, constructed on the integrated theme of functional mapping and linkage disequilibrium (LD) mapping, can make use of information from multiple markers genotyped from the human genome. It allows for the test and estimation of genetic actions and interactions involved in the control of HIV progression and provides a general platform to identify the detailed genetic architecture of resistance or susceptibility of humans to HIV on a dynamic scale. We have generalized this model to accommodate various complicated clincal designs for AIDS studies. Simulation studies with different scenarios are performed to examine the statistical behaviour of the model. The genetic and statistical extensions of this mapping model to HIV/AIDS genomic research are discussed.
Collapse
Affiliation(s)
- Song Wu
- Department of Statistics, University of Florida, Gainesville, FL 32611, U.S.A.
| | | | | |
Collapse
|
50
|
Wu R, Hou W. A hyperspace model to decipher the genetic architecture of developmental processes: allometry meets ontogeny. Genetics 2005; 172:627-37. [PMID: 16157673 PMCID: PMC1456188 DOI: 10.1534/genetics.105.045310] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
To better utilize limited resources for their survival and reproduction, all organisms undergo developmental changes in both body size and shape during ontogeny. The genetic analysis of size change with increasing age, i.e., growth, has received considerable attention in quantitative developmental genetic studies, but the genetic architecture of ontogenetic changes in body shape and its associated allometry have been poorly understood partly due to the lack of analytical tools. In this article, we attempt to construct a multivariate statistical framework for studying the genetic regulation of ontogenetic growth and shape. We have integrated biologically meaningful mathematical functions of growth curves and developmental allometry into the estimation process of genetic mapping aimed at identifying individual quantitative trait loci (QTL) for phenotypic variation. This model defined with high dimensions can characterize the ontogenetic patterns of genetic effects of QTL over the lifetime of an organism and assess the interplay between genetic actions/interactions and phenotypic integration. The closed forms for the residual covariance matrix and its determinant and inverse were derived to overcome the computational complexity typical of our high-dimensional model. We used a worked example to validate the utility of this model. The implications of this model for genetic research of evo-devo are discussed.
Collapse
Affiliation(s)
- Rongling Wu
- Department of Statistics, University of Florida, Gainesville, Florida 32611, USA.
| | | |
Collapse
|