1
|
Fan K, Subedi S, Yang G, Lu X, Ren J, Wu C. Is Seeing Believing? A Practitioner's Perspective on High-Dimensional Statistical Inference in Cancer Genomics Studies. ENTROPY (BASEL, SWITZERLAND) 2024; 26:794. [PMID: 39330127 PMCID: PMC11430850 DOI: 10.3390/e26090794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 08/23/2024] [Accepted: 09/06/2024] [Indexed: 09/28/2024]
Abstract
Variable selection methods have been extensively developed for and applied to cancer genomics data to identify important omics features associated with complex disease traits, including cancer outcomes. However, the reliability and reproducibility of the findings are in question if valid inferential procedures are not available to quantify the uncertainty of the findings. In this article, we provide a gentle but systematic review of high-dimensional frequentist and Bayesian inferential tools under sparse models which can yield uncertainty quantification measures, including confidence (or Bayesian credible) intervals, p values and false discovery rates (FDR). Connections in high-dimensional inferences between the two realms have been fully exploited under the "unpenalized loss function + penalty term" formulation for regularization methods and the "likelihood function × shrinkage prior" framework for regularized Bayesian analysis. In particular, we advocate for robust Bayesian variable selection in cancer genomics studies due to its ability to accommodate disease heterogeneity in the form of heavy-tailed errors and structured sparsity while providing valid statistical inference. The numerical results show that robust Bayesian analysis incorporating exact sparsity has yielded not only superior estimation and identification results but also valid Bayesian credible intervals under nominal coverage probabilities compared with alternative methods, especially in the presence of heavy-tailed model errors and outliers.
Collapse
Affiliation(s)
- Kun Fan
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA
| | - Srijana Subedi
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA
| | - Gongshun Yang
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA
| | - Xi Lu
- Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX 77204, USA
| | - Jie Ren
- Department of Biostatistics and Health Data Sciences, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Cen Wu
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA
| |
Collapse
|
2
|
Gong H, Zhou Z, Bu C, Zhang D, Fang Q, Zhang XY, Song Y. Computational dissection of genetic variation modulating the response of multiple photosynthetic phenotypes to the light environment. BMC Genomics 2024; 25:81. [PMID: 38243219 PMCID: PMC10799405 DOI: 10.1186/s12864-024-09968-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 01/03/2024] [Indexed: 01/21/2024] Open
Abstract
BACKGROUND The expression of biological traits is modulated by genetics as well as the environment, and the level of influence exerted by the latter may vary across characteristics. Photosynthetic traits in plants are complex quantitative traits that are regulated by both endogenous genetic factors and external environmental factors such as light intensity and CO2 concentration. The specific processes impacted occur dynamically and continuously as the growth of plants changes. Although studies have been conducted to explore the genetic regulatory mechanisms of individual photosynthetic traits or to evaluate the effects of certain environmental variables on photosynthetic traits, the systematic impact of environmental variables on the dynamic process of integrated plant growth and development has not been fully elucidated. RESULTS In this paper, we proposed a research framework to investigate the genetic mechanism of high-dimensional complex photosynthetic traits in response to the light environment at the genome level. We established a set of high-dimensional equations incorporating environmental regulators to integrate functional mapping and dynamic screening of gene‒environment complex systems to elucidate the process and pattern of intrinsic genetic regulatory mechanisms of three types of photosynthetic phenotypes of Populus simonii that varied with light intensity. Furthermore, a network structure was established to elucidate the crosstalk among significant QTLs that regulate photosynthetic phenotypic systems. Additionally, the detection of key QTLs governing the response of multiple phenotypes to the light environment, coupled with the intrinsic differences in genotype expression, provides valuable insights into the regulatory mechanisms that drive the transition of photosynthetic activity and photoprotection in the face of varying light intensity gradients. CONCLUSIONS This paper offers a comprehensive approach to unraveling the genetic architecture of multidimensional variations in photosynthetic phenotypes, considering the combined impact of integrated environmental factors from multiple perspectives.
Collapse
Affiliation(s)
- Huiying Gong
- College of Science, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, P. R. China
| | - Ziyang Zhou
- College of Science, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, P. R. China
| | - Chenhao Bu
- College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, P. R. China
| | - Deqiang Zhang
- College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, P. R. China
| | - Qing Fang
- Faculty of Science, Yamagata University, Yamagata, 990, Japan
| | - Xiao-Yu Zhang
- College of Science, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, P. R. China.
| | - Yuepeng Song
- College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, P. R. China.
| |
Collapse
|
3
|
Chen W, Yu W, Dong A, Zeng Y, Yuan H, Zheng B, Wu R. The Genetic Architecture of Juvenile Growth Traits in the Conifer Torreya grandis as Revealed by Joint Linkage and Linkage Disequilibrium Mapping. FRONTIERS IN PLANT SCIENCE 2022; 13:858187. [PMID: 35832218 PMCID: PMC9271899 DOI: 10.3389/fpls.2022.858187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 05/27/2022] [Indexed: 06/15/2023]
Abstract
Despite its high economical and ornamental values, Torreya grandis, a dioecious non-timber coniferous species, has long been an underrepresented species. However, the advent and application of advanced genotyping technologies have stimulated its genetic research, making it possible to gain new insight into the genetic architecture of complex traits that may not be detected for model species. We apply an open-pollination (OP) mapping strategy to conduct a QTL mapping experiment of T. grandis, in which nearly 100 unrelated trees randomly chosen from the species' natural distribution and their half-sib progeny are simultaneously genotyped. This strategy allows us to simultaneously estimate the recombination fractions and linkage disequilibrium (LD) coefficients between each pair of markers. We reconstruct a high-density linkage map of 4,203 SNPs covering a total distance of 8,393.95 cM and plot pairwise normalized LD values against genetic distances to build up a linkage-LD map. We identify 13 QTLs for stem basal diameter growth and 4 QTLs for stem height growth in juvenile seedlings. From the linkage-LD map, we infer the evolutionary history of T. grandis and each of its QTLs. The slow decay of QTL-related LDs indicates that these QTLs and their harboring genomic regions are evolutionarily relatively young, suggesting that they can better utilized by clonal propagation rather than through seed propagation. Genetic results from the OP sampling strategy could provide useful guidance for genetic studies of other dioecious species.
Collapse
Affiliation(s)
- Wenchong Chen
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Weiwu Yu
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Ang Dong
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Yanru Zeng
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Huwei Yuan
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Bingsong Zheng
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Rongling Wu
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
- Center for Statistical Genetics, Department of Public Health Sciences, Department of Statistics, The Pennsylvania State University, Hershey, PA, United States
| |
Collapse
|
4
|
Jiang L, Sun L, Ye M, Wang J, Wang Y, Bogard M, Lacaze X, Fournier A, Beauchêne K, Gouache D, Wu R. Functional mapping of N deficiency‐induced response in wheat yield‐component traits by implementing high‐throughput phenotyping. THE PLANT JOURNAL 2019; 97:1105-1119. [PMID: 30536457 DOI: 10.1111/tpj.14186] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Revised: 11/09/2018] [Accepted: 11/23/2018] [Indexed: 05/25/2023]
Affiliation(s)
- Libo Jiang
- Center for Computational Biology College of Biological Sciences and Technology Beijing Forestry University Beijing 100083 China
| | - Lidan Sun
- Beijing Key Laboratory of Ornamental Plants Germplasm Innovation and Molecular Breeding National Engineering Research Center for Floriculture College of Landscape Architecture Beijing Forestry University Beijing 100083 China
| | - Meixia Ye
- Center for Computational Biology College of Biological Sciences and Technology Beijing Forestry University Beijing 100083 China
| | - Jing Wang
- Center for Computational Biology College of Biological Sciences and Technology Beijing Forestry University Beijing 100083 China
| | - Yaqun Wang
- Department of Biostatistics Rutgers University New Brunswick NJ 08901 USA
| | - Matthieu Bogard
- Arvalis Institut du Végétal 3‐5 Rue Joseph et Marie Hackin 75116 Paris France
| | - Xavier Lacaze
- Arvalis Institut du Végétal 3‐5 Rue Joseph et Marie Hackin 75116 Paris France
| | - Antoine Fournier
- Arvalis Institut du Végétal 3‐5 Rue Joseph et Marie Hackin 75116 Paris France
| | - Katia Beauchêne
- Arvalis Institut du Végétal 3‐5 Rue Joseph et Marie Hackin 75116 Paris France
| | - David Gouache
- Arvalis Institut du Végétal 3‐5 Rue Joseph et Marie Hackin 75116 Paris France
| | - Rongling Wu
- Center for Computational Biology College of Biological Sciences and Technology Beijing Forestry University Beijing 100083 China
- Center for Statistical Genetics Departments of Public Health Sciences and Statistics Pennsylvania State University Hershey PA 17033 USA
| |
Collapse
|
5
|
Camargo AV, Mackay I, Mott R, Han J, Doonan JH, Askew K, Corke F, Williams K, Bentley AR. Functional Mapping of Quantitative Trait Loci (QTLs) Associated With Plant Performance in a Wheat MAGIC Mapping Population. FRONTIERS IN PLANT SCIENCE 2018; 9:887. [PMID: 30038630 PMCID: PMC6047115 DOI: 10.3389/fpls.2018.00887] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 06/07/2018] [Indexed: 05/18/2023]
Abstract
In crop genetic studies, the mapping of longitudinal data describing the spatio-temporal nature of agronomic traits can elucidate the factors influencing their formation and development. Here, we combine the mapping power and precision of a MAGIC wheat population with robust computational methods to track the spatio- temporal dynamics of traits associated with wheat performance. NIAB MAGIC lines were phenotyped throughout their lifecycle under smart house conditions. Growth models were fitted to the data describing growth trajectories of plant area, height, water use and senescence and fitted parameters were mapped as quantitative traits. Trait data from single time points were also mapped to determine when and how markers became and ceased to be significant. Assessment of temporal dynamics allowed the identification of marker-trait associations and tracking of trait development against the genetic contribution of key markers. We establish a data-driven approach for understanding complex agronomic traits and accelerate research in plant breeding.
Collapse
Affiliation(s)
- Anyela V. Camargo
- The John Bingham Laboratory, National Institute of Agricultural Botany, Cambridge, United Kingdom
- *Correspondence: Anyela V. Camargo
| | - Ian Mackay
- The John Bingham Laboratory, National Institute of Agricultural Botany, Cambridge, United Kingdom
| | - Richard Mott
- Division of Bioscience, Genetics Institute, University College London, London, United Kingdom
| | - Jiwan Han
- National Plant Phenomics Centre, Institute of Biological Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - John H. Doonan
- National Plant Phenomics Centre, Institute of Biological Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - Karen Askew
- National Plant Phenomics Centre, Institute of Biological Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - Fiona Corke
- National Plant Phenomics Centre, Institute of Biological Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - Kevin Williams
- National Plant Phenomics Centre, Institute of Biological Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, United Kingdom
| | - Alison R. Bentley
- The John Bingham Laboratory, National Institute of Agricultural Botany, Cambridge, United Kingdom
| |
Collapse
|
6
|
Ye M, Jiang L, Mao K, Wang Y, Wang Z, Wu R. Functional mapping of seasonal transition in perennial plants. Brief Bioinform 2014; 16:526-35. [DOI: 10.1093/bib/bbu025] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2014] [Accepted: 06/25/2014] [Indexed: 11/13/2022] Open
|