1
|
Hereward JP, Smith TJ, Gloag R, Brookes DR, Walter GH. Reassessing Hybridisation in Australian Tetragonula Stingless Bees Using Multiple Genetic Markers. Ecol Evol 2025; 15:e70912. [PMID: 39896774 PMCID: PMC11775563 DOI: 10.1002/ece3.70912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Revised: 01/06/2025] [Accepted: 01/10/2025] [Indexed: 02/04/2025] Open
Abstract
We re-examined reports of hybridisation in three cryptic stingless bee species in the genus Tetragonula in South East Queensland, Australia (T. carbonaria, T. davenporti and T. hockingsi). Previous studies on this group using microsatellite markers proposed that hybridisation occasionally takes place. In contrast, we find that using 1745 SNPs we could reliably separate the three species, with no evidence of contemporary (or recent) hybridisation. We found identical amplicon sequences of the nuclear gene EF1alpha across most individuals of the three species, but low and moderate species-specific polymorphisms in the nuclear gene Opsin and the mitochondrial 16S rRNA gene, respectively, with no cases of mito-nuclear discordance at these genes. We confirm that nuclear divergence across these species is low, based on 10-26 kb of non-coding sequence flanking EF1alpha and Opsin (0.7%-1% pairwise difference between species). However, we find mitogenomes to be far more diverged than nuclear genomes (21.6%-23.6% pairwise difference between species). Based on these comprehensive analyses of multiple marker types, we conclude there is no ongoing gene flow among the Tetragonula species of South East Queensland, despite their morphological similarity to one another and the low nuclear divergence among them. The higher resolution provided by multiple SNP markers may lead to lower estimates of contemporary hybridisation more generally.
Collapse
Affiliation(s)
- James P. Hereward
- School of the EnvironmentThe University of QueenslandBrisbaneQueenslandAustralia
| | - Tobias J. Smith
- School of the EnvironmentThe University of QueenslandBrisbaneQueenslandAustralia
| | - Ros Gloag
- School of Life and Environmental SciencesThe University of SydneySydneyNew South WalesAustralia
| | - Dean R. Brookes
- School of the EnvironmentThe University of QueenslandBrisbaneQueenslandAustralia
- USDA ARS, Australian Biological Control Laboratory (ABCL), CSIRO, Ecosciences PrecinctDutton ParkAustralia
| | - Gimme H. Walter
- School of the EnvironmentThe University of QueenslandBrisbaneQueenslandAustralia
| |
Collapse
|
2
|
Snoek BL, Sterken MG, Nijveen H, Volkers RJM, Riksen J, Rosenstiel PC, Schulenburg H, Kammenga JE. The genetics of gene expression in a Caenorhabditis elegans multiparental recombinant inbred line population. G3 (BETHESDA, MD.) 2021; 11:jkab258. [PMID: 34568931 PMCID: PMC8496280 DOI: 10.1093/g3journal/jkab258] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 07/17/2021] [Indexed: 11/29/2022]
Abstract
Studying genetic variation of gene expression provides a powerful way to unravel the molecular components underlying complex traits. Expression quantitative trait locus (eQTL) studies have been performed in several different model species, yet most of these linkage studies have been based on the genetic segregation of two parental alleles. Recently, we developed a multiparental segregating population of 200 recombinant inbred lines (mpRILs) derived from four wild isolates (JU1511, JU1926, JU1931, and JU1941) in the nematode Caenorhabditis elegans. We used RNA-seq to investigate how multiple alleles affect gene expression in these mpRILs. We found 1789 genes differentially expressed between the parental lines. Transgression, expression beyond any of the parental lines in the mpRILs, was found for 7896 genes. For expression QTL mapping almost 9000 SNPs were available. By combining these SNPs and the RNA-seq profiles of the mpRILs, we detected almost 6800 eQTLs. Most trans-eQTLs (63%) co-locate in six newly identified trans-bands. The trans-eQTLs found in previous two-parental allele eQTL experiments and this study showed some overlap (17.5-46.8%), highlighting on the one hand that a large group of genes is affected by polymorphic regulators across populations and conditions, on the other hand, it shows that the mpRIL population allows identification of novel gene expression regulatory loci. Taken together, the analysis of our mpRIL population provides a more refined insight into C. elegans complex trait genetics and eQTLs in general, as well as a starting point to further test and develop advanced statistical models for detection of multiallelic eQTLs and systems genetics studying the genotype-phenotype relationship.
Collapse
Affiliation(s)
- Basten L Snoek
- Laboratory of Nematology, Wageningen University, NL-6708 PB Wageningen, The Netherlands
- Theoretical Biology and Bioinformatics, Utrecht University, 3584 CH Utrecht, The Netherlands
| | - Mark G Sterken
- Laboratory of Nematology, Wageningen University, NL-6708 PB Wageningen, The Netherlands
| | - Harm Nijveen
- Bioinformatics Group, Wageningen University, NL-6708 PB Wageningen, The Netherlands
| | - Rita J M Volkers
- Laboratory of Nematology, Wageningen University, NL-6708 PB Wageningen, The Netherlands
| | - Joost Riksen
- Laboratory of Nematology, Wageningen University, NL-6708 PB Wageningen, The Netherlands
| | - Philip C Rosenstiel
- Institute for Clinical Molecular Biology, University of Kiel, 24098 Kiel, Germany
- Competence Centre for Genomic Analysis (CCGA) Kiel, University of Kiel, 24098 Kiel, Germany
| | - Hinrich Schulenburg
- Zoological Institute, University of Kiel, 24098 Kiel, Germany
- Max Planck Institute for Evolutionary Biology, 24306 Ploen, Germany
| | - Jan E Kammenga
- Laboratory of Nematology, Wageningen University, NL-6708 PB Wageningen, The Netherlands
| |
Collapse
|
3
|
Impact of genotypic errors with equal and unequal family contribution on accuracy of genomic prediction in aquaculture using simulation. Sci Rep 2021; 11:18318. [PMID: 34526591 PMCID: PMC8443606 DOI: 10.1038/s41598-021-97873-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 08/31/2021] [Indexed: 11/08/2022] Open
Abstract
Genotypic errors, conflict between recorded genotype and the true genotype, can lead to false or biased population genetic parameters. Here, the effect of genotypic errors on accuracy of genomic predictions and genomic relationship matrix are investigated using a simulation study based on population and genomic structure comparable to black tiger prawn, Penaeus monodon. Fifty full-sib families across five generations with phenotypic and genotypic information on 53 K SNPs were simulated. Ten replicates of different scenarios with three heritability estimates, equal and unequal family contributions were generated. Within each scenario, four SNP densities and three genotypic error rates in each SNP density were implemented. Results showed that family contribution did not have a substantial impact on accuracy of predictions across different datasets. In the absence of genotypic errors, 3 K SNP density was found to be efficient in estimating the accuracy, whilst increasing the SNP density from 3 to 20 K resulted in a marginal increase in accuracy of genomic predictions using the current population and genomic parameters. In addition, results showed that the presence of even 10% errors in a 10 and 20 K SNP panel might not have a severe impact on accuracy of predictions. However, below 10 K marker density, even a 5% error can result in lower accuracy of predictions.
Collapse
|
4
|
iDEP Web Application for RNA-Seq Data Analysis. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2021; 2284:417-443. [PMID: 33835455 DOI: 10.1007/978-1-0716-1307-8_22] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
RNA sequencing (RNA-seq) has become a routine method for transcriptomic profiling. We developed a user-friendly web app called iDEP (integrated differential expression and pathway analysis) to help biologists interpret read counts or other types of expression matrices derived from read mapping. With iDEP, users can easily conduct exploratory data analysis, identify differentially expressed genes, and perform pathway analysis. Due to its intuitive user interface and massive annotation database, iDEP is being widely adopted for interactive analysis of RNA-seq data. Using a public dataset on the effect of heat shock on mouse with and without functional Hsf1, we demonstrate how users can prepare data files and conduct in-depth analysis. We also discuss the importance of critical interpretion of results (avoid p-hacking and rationalizing) and validation of significant pathways by using different methods and independent annotation databases.
Collapse
|
5
|
Tran PMH, Tran LKH, Nechtman J, Dos Santos B, Purohit S, Satter KB, Dun B, Kolhe R, Sharma S, Bollag R, She JX. Comparative analysis of transcriptomic profile, histology, and IDH mutation for classification of gliomas. Sci Rep 2020; 10:20651. [PMID: 33244057 PMCID: PMC7692499 DOI: 10.1038/s41598-020-77777-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Accepted: 11/10/2020] [Indexed: 12/29/2022] Open
Abstract
Gliomas are currently classified through integration of histology and mutation information, with new developments in DNA methylation classification. However, discrepancies exist amongst the major classification methods. This study sought to compare transcriptome-based classification to the established methods. RNAseq and microarray data were obtained for 1032 gliomas from the TCGA and 395 gliomas from REMBRANDT. Data were analyzed using unsupervised and supervised learning and other statistical methods. Global transcriptomic profiles defined four transcriptomic glioma subgroups with 91.4% concordance with the WHO-defined mutation subtypes. Using these subgroups, 168 genes were selected for the development of 1000 linear support vector classifiers (LSVC). Based on plurality voting of 1000 LSVC, the final ensemble classifier confidently classified all but 17 TCGA gliomas to one of the four transcriptomic profile (TP) groups. The classifier was validated using a gene expression microarray dataset. TP1 cases include IDHwt, glioblastoma high immune infiltration and cellular proliferation and poor survival prognosis. TP2a is characterized as IDHmut-codel, oligodendrogliomas with high tumor purity. TP2b tissue is mostly composed of neurons and few infiltrating malignant cells. TP3 exhibit increased NOTCH signaling, are astrocytoma and IDHmut-non-codel. TP groups are highly concordant with both WHO integrated histology and mutation classification as well as methylation-based classification of gliomas. Transcriptomic profiling provides a robust and objective method to classify gliomas with high agreement to the current WHO guidelines and may provide additional survival prediction to the current methods.
Collapse
Affiliation(s)
- Paul M H Tran
- Center for Biotechnology and Genomic Medicine, Augusta, USA
| | - Lynn K H Tran
- Center for Biotechnology and Genomic Medicine, Augusta, USA
| | - John Nechtman
- Center for Biotechnology and Genomic Medicine, Augusta, USA
| | | | - Sharad Purohit
- Center for Biotechnology and Genomic Medicine, Augusta, USA
- Department of Obstetrics and Gynecology, Medical College of Georgia, Augusta, USA
- Department of Undergraduate Health Professionals, College of Allied Health Sciences, Augusta, USA
| | | | - Boying Dun
- Center for Biotechnology and Genomic Medicine, Augusta, USA
- Jinfiniti Precision Medicine, Inc., Augusta, USA
- Department of Obstetrics and Gynecology, Medical College of Georgia, Augusta, USA
| | - Ravindra Kolhe
- Department of Pathology, Medical College of Georgia, Augusta University, 1120 15th Street, Augusta, GA, 30912, USA
| | - Suash Sharma
- Department of Pathology, Medical College of Georgia, Augusta University, 1120 15th Street, Augusta, GA, 30912, USA
| | - Roni Bollag
- Department of Pathology, Medical College of Georgia, Augusta University, 1120 15th Street, Augusta, GA, 30912, USA
| | - Jin-Xiong She
- Center for Biotechnology and Genomic Medicine, Augusta, USA.
- Jinfiniti Precision Medicine, Inc., Augusta, USA.
- Department of Obstetrics and Gynecology, Medical College of Georgia, Augusta, USA.
| |
Collapse
|
6
|
Sterken MG, Bevers RPJ, Volkers RJM, Riksen JAG, Kammenga JE, Snoek BL. Dissecting the eQTL Micro-Architecture in Caenorhabditis elegans. Front Genet 2020; 11:501376. [PMID: 33240309 PMCID: PMC7670075 DOI: 10.3389/fgene.2020.501376] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Accepted: 10/13/2020] [Indexed: 01/11/2023] Open
Abstract
The study of expression quantitative trait loci (eQTL) using natural variation in inbred populations has yielded detailed information about the transcriptional regulation of complex traits. Studies on eQTL using recombinant inbred lines (RILs) led to insights on cis and trans regulatory loci of transcript abundance. However, determining the underlying causal polymorphic genes or variants is difficult, but ultimately essential for the understanding of regulatory networks of complex traits. This requires insight into whether associated loci are single eQTL or a combination of closely linked eQTL, and how this QTL micro-architecture depends on the environment. We addressed these questions by testing for independent replication of previously mapped eQTL in Caenorhabditis elegans using new data from introgression lines (ILs). Both populations indicate that the overall heritability of gene expression, number, and position of eQTL differed among environments. Across environments we were able to replicate 70% of the cis- and 40% of the trans-eQTL using the ILs. Testing eight different simulation models, we suggest that additive effects explain up to 60-93% of RIL/IL heritability for all three environments. Closely linked eQTL explained up to 40% of RIL/IL heritability in the control environment whereas only 7% in the heat-stress and recovery environments. In conclusion, we show that reproducibility of eQTL was higher for cis vs. trans eQTL and that the environment affects the eQTL micro-architecture.
Collapse
Affiliation(s)
- Mark G. Sterken
- Laboratory of Nematology, Wageningen University & Research, Wageningen, Netherlands
| | - Roel P. J. Bevers
- Laboratory of Nematology, Wageningen University & Research, Wageningen, Netherlands
| | - Rita J. M. Volkers
- Laboratory of Nematology, Wageningen University & Research, Wageningen, Netherlands
| | - Joost A. G. Riksen
- Laboratory of Nematology, Wageningen University & Research, Wageningen, Netherlands
| | - Jan E. Kammenga
- Laboratory of Nematology, Wageningen University & Research, Wageningen, Netherlands
| | - Basten L. Snoek
- Laboratory of Nematology, Wageningen University & Research, Wageningen, Netherlands
- Theoretical Biology & Bioinformatics, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
7
|
Noble LM, Miah A, Kaur T, Rockman MV. The Ancestral Caenorhabditis elegans Cuticle Suppresses rol-1. G3 (BETHESDA, MD.) 2020; 10:2385-2395. [PMID: 32423919 PMCID: PMC7341120 DOI: 10.1534/g3.120.401336] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Accepted: 05/09/2020] [Indexed: 12/30/2022]
Abstract
Genetic background commonly modifies the effects of mutations. We discovered that worms mutant for the canonical rol-1 gene, identified by Brenner in 1974, do not roll in the genetic background of the wild strain CB4856. Using linkage mapping, association analysis and gene editing, we determined that N2 carries an insertion in the collagen gene col-182 that acts as a recessive enhancer of rol-1 rolling. From population and comparative genomics, we infer the insertion is derived in N2 and related laboratory lines, likely arising during the domestication of Caenorhabditis elegans, and breaking a conserved protein. The ancestral version of col-182 also modifies the phenotypes of four other classical cuticle mutant alleles, and the effects of natural genetic variation on worm shape and locomotion. These results underscore the importance of genetic background and the serendipity of Brenner's choice of strain.
Collapse
Affiliation(s)
- Luke M Noble
- Institut de Biologie, École Normale Supérieure, CNRS 8197, Inserm U1024, PSL Research University, F-75005 Paris, France
| | - Asif Miah
- Center for Genomics and Systems Biology, Department of Biology, New York University, NY, 10003
| | - Taniya Kaur
- Center for Genomics and Systems Biology, Department of Biology, New York University, NY, 10003
| | - Matthew V Rockman
- Center for Genomics and Systems Biology, Department of Biology, New York University, NY, 10003
| |
Collapse
|
8
|
Blay N, Casas E, Galván-Femenía I, Graffelman J, de Cid R, Vavouri T. Assessment of kinship detection using RNA-seq data. Nucleic Acids Res 2020; 47:e136. [PMID: 31501877 PMCID: PMC6868348 DOI: 10.1093/nar/gkz776] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2019] [Revised: 08/23/2019] [Accepted: 08/29/2019] [Indexed: 01/23/2023] Open
Abstract
Analysis of RNA sequencing (RNA-seq) data from related individuals is widely used in clinical and molecular genetics studies. Prediction of kinship from RNA-seq data would be useful for confirming the expected relationships in family based studies and for highlighting samples from related individuals in case-control or population based studies. Currently, reconstruction of pedigrees is largely based on SNPs or microsatellites, obtained from genotyping arrays, whole genome sequencing and whole exome sequencing. Potential problems with using RNA-seq data for kinship detection are the low proportion of the genome that it covers, the highly skewed coverage of exons of different genes depending on expression level and allele-specific expression. In this study we assess the use of RNA-seq data to detect kinship between individuals, through pairwise identity by descent (IBD) estimates. First, we obtained high quality SNPs after successive filters to minimize the effects due to allelic imbalance as well as errors in sequencing, mapping and genotyping. Then, we used these SNPs to calculate pairwise IBD estimates. By analysing both real and simulated RNA-seq data we show that it is possible to identify up to second degree relationships using RNA-seq data of even low to moderate sequencing depth.
Collapse
Affiliation(s)
- Natalia Blay
- Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Badalona 08916, Spain.,Josep Carreras Leukaemia Research Institute (IJC), Campus ICO-Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona 08916, Spain.,Masters Programme in Bioinformatics and Biostatistics, Universitat Oberta de Catalunya (UOC), Barcelona 08035, Spain
| | - Eduard Casas
- Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Badalona 08916, Spain.,Josep Carreras Leukaemia Research Institute (IJC), Campus ICO-Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona 08916, Spain.,Doctoral Programme in Biomedicine, Universitat de Barcelona, Barcelona 08007, Spain
| | - Iván Galván-Femenía
- Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Badalona 08916, Spain.,Genomes for Life - GCAT lab Group - Germans Trias i Pujol Research Institute, Can Ruti Campus, Ctra de Can Ruti, Camí de les Escoles s/n, Badalona, Barcelona 08916, Spain
| | - Jan Graffelman
- Department of Statistics and Operations Research Universitat Politècnica de Catalunya, Barcelona 08028, Spain.,Department of Biostatistics, University of Washington, Seattle, WA 98105-946, USA
| | - Rafael de Cid
- Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Badalona 08916, Spain.,Genomes for Life - GCAT lab Group - Germans Trias i Pujol Research Institute, Can Ruti Campus, Ctra de Can Ruti, Camí de les Escoles s/n, Badalona, Barcelona 08916, Spain
| | - Tanya Vavouri
- Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Badalona 08916, Spain.,Josep Carreras Leukaemia Research Institute (IJC), Campus ICO-Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona 08916, Spain
| |
Collapse
|
9
|
Snoek BL, Sterken MG, Hartanto M, van Zuilichem AJ, Kammenga JE, de Ridder D, Nijveen H. WormQTL2: an interactive platform for systems genetics in Caenorhabditis elegans. Database (Oxford) 2020; 2020:baz149. [PMID: 31960906 PMCID: PMC6971878 DOI: 10.1093/database/baz149] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 11/30/2019] [Accepted: 12/13/2019] [Indexed: 12/19/2022]
Abstract
Quantitative genetics provides the tools for linking polymorphic loci to trait variation. Linkage analysis of gene expression is an established and widely applied method, leading to the identification of expression quantitative trait loci (eQTLs). (e)QTL detection facilitates the identification and understanding of the underlying molecular components and pathways, yet (e)QTL data access and mining often is a bottleneck. Here, we present WormQTL2, a database and platform for comparative investigations and meta-analyses of published (e)QTL data sets in the model nematode worm C. elegans. WormQTL2 integrates six eQTL studies spanning 11 conditions as well as over 1000 traits from 32 studies and allows experimental results to be compared, reused and extended upon to guide further experiments and conduct systems-genetic analyses. For example, one can easily screen a locus for specific cis-eQTLs that could be linked to variation in other traits, detect gene-by-environment interactions by comparing eQTLs under different conditions, or find correlations between QTL profiles of classical traits and gene expression. WormQTL2 makes data on natural variation in C. elegans and the identified QTLs interactively accessible, allowing studies beyond the original publications. Database URL: www.bioinformatics.nl/WormQTL2/.
Collapse
Affiliation(s)
- Basten L Snoek
- Laboratory of Nematology, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
- Theoretical Biology and Bioinformatics, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | - Mark G Sterken
- Laboratory of Nematology, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
| | - Margi Hartanto
- Laboratory of Nematology, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
| | - Albert-Jan van Zuilichem
- Laboratory of Nematology, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
| | - Jan E Kammenga
- Laboratory of Nematology, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
| | - Harm Nijveen
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, NL-6708 PB Wageningen, The Netherlands
| |
Collapse
|
10
|
Boja E, Težak Ž, Zhang B, Wang P, Johanson E, Hinton D, Rodriguez H. Right data for right patient-a precisionFDA NCI-CPTAC Multi-omics Mislabeling Challenge. Nat Med 2019; 24:1301-1302. [PMID: 30194412 PMCID: PMC6892367 DOI: 10.1038/s41591-018-0180-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
To address a critical roadblock that can occur in translational and clinical research, the National Cancer Institute and the Food and Drug Administration, in coordination with the DREAM Challenges, are launching the first computational challenge using multi-omics datasets to detect and correct specimen mislabeling.
Collapse
Affiliation(s)
- Emily Boja
- Office of Cancer Clinical Proteomics Research, Center for Strategic Scientific Initiatives, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Živana Težak
- Office of In Vitro Diagnostics and Radiological Health, Center for Devices and Radiological Health, US Food and Drug Administration, Silver Spring, MD, USA.
| | - Bing Zhang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Pei Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Elaine Johanson
- Office of Health Informatics, Office of the Chief Scientist, Office of the Commissioner, US Food and Drug Administration, Silver Spring, MD, USA
| | - Denise Hinton
- Office of the Chief Scientist, Office of the Commissioner, US Food and Drug Administration, Silver Spring, MD, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, Center for Strategic Scientific Initiatives, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
11
|
Serin EAR, Snoek LB, Nijveen H, Willems LAJ, Jiménez-Gómez JM, Hilhorst HWM, Ligterink W. Construction of a High-Density Genetic Map from RNA-Seq Data for an Arabidopsis Bay-0 × Shahdara RIL Population. Front Genet 2017; 8:201. [PMID: 29259624 PMCID: PMC5723289 DOI: 10.3389/fgene.2017.00201] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 11/21/2017] [Indexed: 12/17/2022] Open
Abstract
High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches.
Collapse
Affiliation(s)
- Elise A R Serin
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| | - L B Snoek
- Laboratory of Nematology, Wageningen University, Wageningen, Netherlands.,Theoretical Biology and Bioinformatics, Utrecht University, Utrecht, Netherlands
| | - Harm Nijveen
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands.,Laboratory of Bioinformatics, Wageningen University, Wageningen, Netherlands
| | - Leo A J Willems
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| | - Jose M Jiménez-Gómez
- Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany.,Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique, AgroParisTech, Centre National de la Recherche Scientifique, Université Paris-Saclay, Versailles Cedex, France
| | - Henk W M Hilhorst
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| | - Wilco Ligterink
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| |
Collapse
|