1
|
Yu L, Dittrich ACN, Zhang X, Brock JR, Thirumalaikumar VP, Melandri G, Skirycz A, Edger PP, Thorp KR, Hinze L, Pauli D, Nelson ADL. Regulation of a single inositol 1-phosphate synthase homeologue by HSFA6B contributes to fibre yield maintenance under drought conditions in upland cotton. PLANT BIOTECHNOLOGY JOURNAL 2024. [PMID: 39031479 DOI: 10.1111/pbi.14402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 05/15/2024] [Accepted: 05/21/2024] [Indexed: 07/22/2024]
Abstract
Drought stress substantially impacts crop physiology resulting in alteration of growth and productivity. Understanding the genetic and molecular crosstalk between stress responses and agronomically important traits such as fibre yield is particularly complicated in the allopolyploid species, upland cotton (Gossypium hirsutum), due to reduced sequence variability between A and D subgenomes. To better understand how drought stress impacts yield, the transcriptomes of 22 genetically and phenotypically diverse upland cotton accessions grown under well-watered and water-limited conditions in the Arizona low desert were sequenced. Gene co-expression analyses were performed, uncovering a group of stress response genes, in particular transcription factors GhDREB2A-A and GhHSFA6B-D, associated with improved yield under water-limited conditions in an ABA-independent manner. DNA affinity purification sequencing (DAP-seq), as well as public cistrome data from Arabidopsis, were used to identify targets of these two TFs. Among these targets were two lint yield-associated genes previously identified through genome-wide association studies (GWAS)-based approaches, GhABP-D and GhIPS1-A. Biochemical and phylogenetic approaches were used to determine that GhIPS1-A is positively regulated by GhHSFA6B-D, and that this regulatory mechanism is specific to Gossypium spp. containing the A (old world) genome. Finally, an SNP was identified within the GhHSFA6B-D binding site in GhIPS1-A that is positively associated with yield under water-limiting conditions. These data lay out a regulatory connection between abiotic stress and fibre yield in cotton that appears conserved in other systems such as Arabidopsis.
Collapse
Affiliation(s)
- Li'ang Yu
- Boyce Thompson Institute, Cornell University, Ithaca, NY, USA
| | | | - Xiaodan Zhang
- Boyce Thompson Institute, Cornell University, Ithaca, NY, USA
| | - Jordan R Brock
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | | | | | | | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Kelly R Thorp
- United States Department of Agriculture-Agricultural Research Service, Arid Land Agricultural Research Center, Maricopa, AZ, USA
| | - Lori Hinze
- United States Department of Agriculture-Agricultural Research Service, Southern Plains Agricultural Research Center, College Station, TX, USA
| | - Duke Pauli
- School of Plant Sciences, University of Arizona, Tucson, AZ, USA
- Agroecosystem Research in the Desert (ARID), University of Arizona, Tucson, AZ, USA
| | | |
Collapse
|
2
|
Duman ET, Sitte M, Conrads K, Mackay A, Ludewig F, Ströbel P, Ellenrieder V, Hessmann E, Papantonis A, Salinas G. A single-cell strategy for the identification of intronic variants related to mis-splicing in pancreatic cancer. NAR Genom Bioinform 2024; 6:lqae057. [PMID: 38800828 PMCID: PMC11127633 DOI: 10.1093/nargab/lqae057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/24/2024] [Accepted: 05/23/2024] [Indexed: 05/29/2024] Open
Abstract
Most clinical diagnostic and genomic research setups focus almost exclusively on coding regions and essential splice sites, thereby overlooking other non-coding variants. As a result, intronic variants that can promote mis-splicing events across a range of diseases, including cancer, are yet to be systematically investigated. Such investigations would require both genomic and transcriptomic data, but there currently exist very few datasets that satisfy these requirements. We address this by developing a single-nucleus full-length RNA-sequencing approach that allows for the detection of potentially pathogenic intronic variants. We exemplify the potency of our approach by applying pancreatic cancer tumor and tumor-derived specimens and linking intronic variants to splicing dysregulation. We specifically find that prominent intron retention and pseudo-exon activation events are shared by the tumors and affect genes encoding key transcriptional regulators. Our work paves the way for the assessment and exploitation of intronic mutations as powerful prognostic markers and potential therapeutic targets in cancer.
Collapse
Affiliation(s)
- Emre Taylan Duman
- NGS-Core Unit for Integrative Genomics, Institute of Pathology, University Medical Center, Göttingen, Germany
| | - Maren Sitte
- NGS-Core Unit for Integrative Genomics, Institute of Pathology, University Medical Center, Göttingen, Germany
| | - Karly Conrads
- Clinic of Gastroenterology, Gastrointestinal Oncology and Endocrinology, University Medical Center, Göttingen, Germany
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
- Institute of Medical Bioinformatics, University Medical Center, Göttingen, Germany
| | - Adi Mackay
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
- Institute of Pathology, University Medical Center, Göttingen, Germany
| | - Fabian Ludewig
- NGS-Core Unit for Integrative Genomics, Institute of Pathology, University Medical Center, Göttingen, Germany
| | - Philipp Ströbel
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
- Institute of Pathology, University Medical Center, Göttingen, Germany
| | - Volker Ellenrieder
- Clinic of Gastroenterology, Gastrointestinal Oncology and Endocrinology, University Medical Center, Göttingen, Germany
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
- Comprehensive Cancer Center Lower Saxony (CCC-N), Göttingen, Germany
| | - Elisabeth Hessmann
- Clinic of Gastroenterology, Gastrointestinal Oncology and Endocrinology, University Medical Center, Göttingen, Germany
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
- Comprehensive Cancer Center Lower Saxony (CCC-N), Göttingen, Germany
| | - Argyris Papantonis
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
- Institute of Pathology, University Medical Center, Göttingen, Germany
- Comprehensive Cancer Center Lower Saxony (CCC-N), Göttingen, Germany
| | - Gabriela Salinas
- NGS-Core Unit for Integrative Genomics, Institute of Pathology, University Medical Center, Göttingen, Germany
- Clinical Research Unit 5002 (CRU5002), University Medical Center, Göttingen, Germany
| |
Collapse
|
3
|
Zhan W, Cui L, Yang S, Zhang K, Zhang Y, Yang J. Natural variations of heterosis-related allele-specific expression genes in promoter regions lead to allele-specific expression in maize. BMC Genomics 2024; 25:476. [PMID: 38745122 PMCID: PMC11092226 DOI: 10.1186/s12864-024-10395-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 05/08/2024] [Indexed: 05/16/2024] Open
Abstract
BACKGROUND Heterosis has successfully enhanced maize productivity and quality. Although significant progress has been made in delineating the genetic basis of heterosis, the molecular mechanisms underlying its genetic components remain less explored. Allele-specific expression (ASE), the imbalanced expression between two parental alleles in hybrids, is increasingly being recognized as a factor contributing to heterosis. ASE is a complex process regulated by both epigenetic and genetic variations in response to developmental and environmental conditions. RESULTS In this study, we explored the differential characteristics of ASE by analyzing the transcriptome data of two maize hybrids and their parents under four light conditions. On the basis of allele expression patterns in different hybrids under various conditions, ASE genes were divided into three categories: bias-consistent genes involved in basal metabolic processes in a functionally complementary manner, bias-reversal genes adapting to the light environment, and bias-specific genes maintaining cell homeostasis. We observed that 758 ASE genes (ASEGs) were significantly overlapped with heterosis quantitative trait loci (QTLs), and high-frequency variations in the promoter regions of heterosis-related ASEGs were identified between parents. In addition, 10 heterosis-related ASEGs participating in yield heterosis were selected during domestication. CONCLUSIONS The comprehensive analysis of ASEGs offers a distinctive perspective on how light quality influences gene expression patterns and gene-environment interactions, with implications for the identification of heterosis-related ASEGs to enhance maize yield.
Collapse
Affiliation(s)
- Weimin Zhan
- College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China
- Guangdong Provincial Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
| | - Lianhua Cui
- College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China
| | - Shuling Yang
- College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China
| | - Kangni Zhang
- College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China
| | - Yanpei Zhang
- College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China.
| | - Jianping Yang
- College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China.
| |
Collapse
|
4
|
Al-Yazeedi T, Adams S, Tandonnet S, Turner A, Kim J, Lee J, Pires-daSilva A. The contribution of an X chromosome QTL to non-Mendelian inheritance and unequal chromosomal segregation in Auanema freiburgense. Genetics 2024; 227:iyae032. [PMID: 38431281 PMCID: PMC11075566 DOI: 10.1093/genetics/iyae032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Revised: 02/14/2024] [Accepted: 02/15/2024] [Indexed: 03/05/2024] Open
Abstract
Auanema freiburgense is a nematode with males, females, and selfing hermaphrodites. When XO males mate with XX females, they typically produce a low proportion of XO offspring because they eliminate nullo-X spermatids. This process ensures that most sperm carry an X chromosome, increasing the likelihood of X chromosome transmission compared to random segregation. This occurs because of an unequal distribution of essential cellular organelles during sperm formation, likely dependent on the X chromosome. Some sperm components are selectively segregated into the X chromosome's daughter cell, while others are discarded with the nullo-X daughter cell. Intriguingly, the interbreeding of 2 A. freiburgense strains results in hybrid males capable of producing viable nullo-X sperm. Consequently, when these hybrid males mate with females, they yield a high percentage of male offspring. To uncover the genetic basis of nullo-spermatid elimination and X chromosome drive, we generated a genome assembly for A. freiburgense and genotyped the intercrossed lines. This analysis identified a quantitative trait locus spanning several X chromosome genes linked to the non-Mendelian inheritance patterns observed in A. freiburgense. This finding provides valuable clues to the underlying factors involved in asymmetric organelle partitioning during male meiotic division and thus non-Mendelian transmission of the X chromosome and sex ratios.
Collapse
Affiliation(s)
- Talal Al-Yazeedi
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
| | - Sally Adams
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
| | - Sophie Tandonnet
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
| | - Anisa Turner
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
| | - Jun Kim
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
| | - Junho Lee
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
| | | |
Collapse
|
5
|
Garcia IS, Silva-Vignato B, Cesar ASM, Petrini J, da Silva VH, Morosini NS, Goes CP, Afonso J, da Silva TR, Lima BD, Clemente LG, Regitano LCDA, Mourão GB, Coutinho LL. Novel putative causal mutations associated with fat traits in Nellore cattle uncovered by eQTLs located in open chromatin regions. Sci Rep 2024; 14:10094. [PMID: 38698200 PMCID: PMC11066111 DOI: 10.1038/s41598-024-60703-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 04/26/2024] [Indexed: 05/05/2024] Open
Abstract
Intramuscular fat (IMF) and backfat thickness (BFT) are critical economic traits impacting meat quality. However, the genetic variants controlling these traits need to be better understood. To advance knowledge in this area, we integrated RNA-seq and single nucleotide polymorphisms (SNPs) identified in genomic and transcriptomic data to generate a linkage disequilibrium filtered panel of 553,581 variants. Expression quantitative trait loci (eQTL) analysis revealed 36,916 cis-eQTLs and 14,408 trans-eQTLs. Association analysis resulted in three eQTLs associated with BFT and 24 with IMF. Functional enrichment analysis of genes regulated by these 27 eQTLs revealed noteworthy pathways that can play a fundamental role in lipid metabolism and fat deposition, such as immune response, cytoskeleton remodeling, iron transport, and phospholipid metabolism. We next used ATAC-Seq assay to identify and overlap eQTL and open chromatin regions. Six eQTLs were in regulatory regions, four in predicted insulators and possible CCCTC-binding factor DNA binding sites, one in an active enhancer region, and the last in a low signal region. Our results provided novel insights into the transcriptional regulation of IMF and BFT, unraveling putative regulatory variants.
Collapse
Affiliation(s)
- Ingrid Soares Garcia
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Bárbara Silva-Vignato
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Aline Silva Mello Cesar
- Department of Agroindustry, Food and Nutrition, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Juliana Petrini
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Vinicius Henrique da Silva
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Natália Silva Morosini
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Carolina Purcell Goes
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | | | - Thaís Ribeiro da Silva
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Beatriz Delcarme Lima
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Luan Gaspar Clemente
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | | | - Gerson Barreto Mourão
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil
| | - Luiz Lehmann Coutinho
- Department of Animal Science, College of Agriculture "Luiz de Queiroz", University of São Paulo, Piracicaba, SP, Brazil.
| |
Collapse
|
6
|
Ginete C, Delgadinho M, Santos B, Miranda A, Silva C, Guerreiro P, Chimusa ER, Brito M. Genetic Modifiers of Sickle Cell Anemia Phenotype in a Cohort of Angolan Children. Genes (Basel) 2024; 15:469. [PMID: 38674403 PMCID: PMC11049512 DOI: 10.3390/genes15040469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 04/04/2024] [Accepted: 04/05/2024] [Indexed: 04/28/2024] Open
Abstract
The aim of this study was to identify genetic markers in the HBB Cluster; HBS1L-MYB intergenic region; and BCL11A, KLF1, FOX3, and ZBTB7A genes associated with the heterogeneous phenotypes of Sickle Cell Anemia (SCA) using next-generation sequencing, as well as to assess their influence and prevalence in an Angolan population. Hematological, biochemical, and clinical data were considered to determine patients' severity phenotypes. Samples from 192 patients were sequenced, and 5,019,378 variants of high quality were registered. A catalog of candidate modifier genes that clustered in pathophysiological pathways important for SCA was generated, and candidate genes associated with increasing vaso-occlusive crises (VOC) and with lower fetal hemoglobin (HbF) were identified. These data support the polygenic view of the genetic architecture of SCA phenotypic variability. Two single nucleotide polymorphisms in the intronic region of 2q16.1, harboring the BCL11A gene, are genome-wide and significantly associated with decreasing HbF. A set of variants was identified to nominally be associated with increasing VOC and are potential genetic modifiers harboring phenotypic variation among patients. To the best of our knowledge, this is the first investigation of clinical variation in SCA in Angola using a well-customized and targeted sequencing approach.
Collapse
Affiliation(s)
- Catarina Ginete
- H&TRC-Health & Technology Research Center, ESTeSL-Escola Superior de Tecnologia da Saúde, Instituto Politécnico de Lisboa, 1990-096 Lisbon, Portugal; (C.G.); (M.D.); (C.S.); (P.G.)
| | - Mariana Delgadinho
- H&TRC-Health & Technology Research Center, ESTeSL-Escola Superior de Tecnologia da Saúde, Instituto Politécnico de Lisboa, 1990-096 Lisbon, Portugal; (C.G.); (M.D.); (C.S.); (P.G.)
| | - Brígida Santos
- Centro de Investigação em Saúde de Angola (CISA), Bengo 9999, Angola;
- Hospital Pediátrico David Bernardino (HPDB), Luanda 3067, Angola
| | - Armandina Miranda
- Instituto Nacional de Saúde Doutor Ricardo Jorge (INSA), 1649-016 Lisbon, Portugal;
| | - Carina Silva
- H&TRC-Health & Technology Research Center, ESTeSL-Escola Superior de Tecnologia da Saúde, Instituto Politécnico de Lisboa, 1990-096 Lisbon, Portugal; (C.G.); (M.D.); (C.S.); (P.G.)
- Centro de Estatística e Aplicações, Universidade de Lisboa, 1649-013 Lisbon, Portugal
| | - Paulo Guerreiro
- H&TRC-Health & Technology Research Center, ESTeSL-Escola Superior de Tecnologia da Saúde, Instituto Politécnico de Lisboa, 1990-096 Lisbon, Portugal; (C.G.); (M.D.); (C.S.); (P.G.)
| | - Emile R. Chimusa
- Department of Applied Sciences, Faculty of Health and Life Sciences, Northumbria University, Newcastle upon Tyne NE1 8ST, UK;
| | - Miguel Brito
- H&TRC-Health & Technology Research Center, ESTeSL-Escola Superior de Tecnologia da Saúde, Instituto Politécnico de Lisboa, 1990-096 Lisbon, Portugal; (C.G.); (M.D.); (C.S.); (P.G.)
- Centro de Investigação em Saúde de Angola (CISA), Bengo 9999, Angola;
| |
Collapse
|
7
|
Han B, Tian D, Li X, Liu S, Tian F, Liu D, Wang S, Zhao K. Multiomics Analyses Provide New Insight into Genetic Variation of Reproductive Adaptability in Tibetan Sheep. Mol Biol Evol 2024; 41:msae058. [PMID: 38552245 PMCID: PMC10980521 DOI: 10.1093/molbev/msae058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 01/13/2024] [Accepted: 03/12/2024] [Indexed: 04/02/2024] Open
Abstract
Domestication and artificial selection during production-oriented breeding have greatly shaped the level of genomic variability in sheep. However, the genetic variation associated with increased reproduction remains elusive. Here, two groups of samples from consecutively monotocous and polytocous sheep were collected for genome-wide association, transcriptomic, proteomic, and metabolomic analyses to explore the genetic variation in fecundity in Tibetan sheep. Genome-wide association study revealed strong associations between BMPR1B (p.Q249R) and litter size, as well as between PAPPA and lambing interval; these findings were validated in 1,130 individuals. Furthermore, we constructed the first single-cell atlas of Tibetan sheep ovary tissues and identified a specific mural granulosa cell subtype with PAPPA-specific expression and differential expression of BMPR1B between the two groups. Bulk RNA-seq indicated that BMPR1B and PAPPA expressions were similar between the two groups of sheep. 3D protein structure prediction and coimmunoprecipitation analysis indicated that mutation and mutually exclusive exons of BMPR1B are the main mechanisms for prolific Tibetan sheep. We propose that PAPPA is a key gene for stimulating ovarian follicular growth and development, and steroidogenesis. Our work reveals the genetic variation in reproductive performance in Tibetan sheep, providing insights and valuable genetic resources for the discovery of genes and regulatory mechanisms that improve reproductive success.
Collapse
Affiliation(s)
- Buying Han
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- University of Chinese Academy of Sciences, Beijing, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Dehong Tian
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Xue Li
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Sijia Liu
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Fei Tian
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Dehui Liu
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- University of Chinese Academy of Sciences, Beijing, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Song Wang
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- University of Chinese Academy of Sciences, Beijing, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| | - Kai Zhao
- Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, China
| |
Collapse
|
8
|
Zhou Y, Kathiresan N, Yu Z, Rivera LF, Yang Y, Thimma M, Manickam K, Chebotarov D, Mauleon R, Chougule K, Wei S, Gao T, Green CD, Zuccolo A, Xie W, Ware D, Zhang J, McNally KL, Wing RA. A high-performance computational workflow to accelerate GATK SNP detection across a 25-genome dataset. BMC Biol 2024; 22:13. [PMID: 38273258 PMCID: PMC10809545 DOI: 10.1186/s12915-024-01820-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/09/2024] [Indexed: 01/27/2024] Open
Abstract
BACKGROUND Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable. RESULTS Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy with comparable results with previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~ 3000 resequenced rice accessions. The entire process took ~ 16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~ 2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq). CONCLUSIONS This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.
Collapse
Affiliation(s)
- Yong Zhou
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Nagarajan Kathiresan
- KAUST Supercomputing Laboratory (KSL), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Zhichao Yu
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Luis F Rivera
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Yujian Yang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Manjula Thimma
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Keerthana Manickam
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Dmytro Chebotarov
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
| | - Ramil Mauleon
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Sharon Wei
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Tingting Gao
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Carl D Green
- Information Technology Department, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Andrea Zuccolo
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Crop Science Research Center (CSRC), Scuola Superiore Sant'Anna, Pisa, 56127, Italy
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA ARS NEA Plant, Soil & Nutrition Laboratory Research Unit, Ithaca, NY, 14853, USA
| | - Jianwei Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Kenneth L McNally
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
| | - Rod A Wing
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines.
| |
Collapse
|
9
|
Gao AW, Alam GE, Zhu Y, Li W, Katsyuba E, Sulc J, Li TY, Li X, Overmyer KA, Lalou A, Mouchiroud L, Sleiman MB, Cornaglia M, Morel JD, Houtkooper RH, Coon JJ, Auwerx J. High-content phenotypic analysis of a C. elegans recombinant inbred population identifies genetic and molecular regulators of lifespan. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.15.575638. [PMID: 38293129 PMCID: PMC10827074 DOI: 10.1101/2024.01.15.575638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Lifespan is influenced by complex interactions between genetic and environmental factors. Studying those factors in model organisms of a single genetic background limits their translational value for humans. Here, we mapped lifespan determinants in 85 genetically diverse C. elegans recombinant intercross advanced inbred lines (RIAILs). We assessed molecular profiles - transcriptome, proteome, and lipidome - and life-history traits, including lifespan, development, growth dynamics, and reproduction. RIAILs exhibited large variations in lifespan, which positively correlated with developmental time. Among the top candidates obtained from multi-omics data integration and QTL mapping, we validated known and novel longevity modulators, including rict-1, gfm-1 and mltn-1. We translated their relevance to humans using UK Biobank data and showed that variants in RICTOR and GFM1 are associated with an elevated risk of age-related heart disease, dementia, diabetes, kidney, and liver diseases. We organized our dataset as a resource (https://lisp-lms.shinyapps.io/RIAILs/) that allows interactive explorations for new longevity targets.
Collapse
Affiliation(s)
- Arwen W. Gao
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- Laboratory Genetic Metabolic Diseases, Amsterdam Gastroenterology, Endocrinology, and Metabolism, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Gaby El Alam
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Yunyun Zhu
- Department of Biomolecular Chemistry, University of Wisconsin, Madison, WI 53506, USA
| | - Weisha Li
- Laboratory Genetic Metabolic Diseases, Amsterdam Gastroenterology, Endocrinology, and Metabolism, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Elena Katsyuba
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- Nagi Bioscience SA, EPFL Innovation Park, CH-1025 Saint-Sulpice, Switzerland
| | - Jonathan Sulc
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Terytty Y. Li
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- Present address: State Key Laboratory of Genetic Engineering, Shanghai Key Laboratory of Metabolic Remodeling and Health, Laboratory of Longevity and Metabolic Adaptations, Institute of Metabolism and Integrative Biology, Fudan University, Shanghai, China
| | - Xiaoxu Li
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Katherine A. Overmyer
- Department of Biomolecular Chemistry, University of Wisconsin, Madison, WI 53506, USA
- National Center for Quantitative Biology of Complex Systems, Madison, WI 53706, USA
- Morgridge Institute for Research, Madison, WI 53515, USA
| | - Amelia Lalou
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Laurent Mouchiroud
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- Nagi Bioscience SA, EPFL Innovation Park, CH-1025 Saint-Sulpice, Switzerland
| | - Maroun Bou Sleiman
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Matteo Cornaglia
- Nagi Bioscience SA, EPFL Innovation Park, CH-1025 Saint-Sulpice, Switzerland
| | - Jean-David Morel
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Riekelt H. Houtkooper
- Laboratory Genetic Metabolic Diseases, Amsterdam Gastroenterology, Endocrinology, and Metabolism, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Joshua J. Coon
- Department of Biomolecular Chemistry, University of Wisconsin, Madison, WI 53506, USA
- National Center for Quantitative Biology of Complex Systems, Madison, WI 53706, USA
- Morgridge Institute for Research, Madison, WI 53515, USA
- Department of Chemistry, University of Wisconsin, Madison, WI 53506, USA
| | - Johan Auwerx
- Laboratory of Integrative Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| |
Collapse
|
10
|
Contreras Yametti GP, Robbins G, Chowdhury A, Narang S, Ostrow TH, Kilberg H, Greenberg J, Kramer L, Raetz E, Tsirigos A, Evensen NA, Carroll WL. SETD2 mutations do not contribute to clonal fitness in response to chemotherapy in childhood B cell acute lymphoblastic leukemia. Leuk Lymphoma 2024; 65:78-90. [PMID: 37874744 DOI: 10.1080/10428194.2023.2273752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 10/14/2023] [Indexed: 10/26/2023]
Abstract
Mutations in genes encoding epigenetic regulators are commonly observed at relapse in B cell acute lymphoblastic leukemia (B-ALL). Loss-of-function mutations in SETD2, an H3K36 methyltransferase, have been observed in B-ALL and other cancers. Previous studies on mutated SETD2 in solid tumors and acute myelogenous leukemia support a role in promoting resistance to DNA damaging agents. We did not observe chemoresistance, an impaired DNA damage response, nor increased mutation frequency in response to thiopurines using CRISPR-mediated knockout in wild-type B-ALL cell lines. Likewise, restoration of SETD2 in cell lines with hemizygous mutations did not increase sensitivity. SETD2 mutations affected the chromatin landscape and transcriptional output that was unique to each cell line. Collectively our data does not support a role for SETD2 mutations in driving clonal evolution and relapse in B-ALL, which is consistent with the lack of enrichment of SETD2 mutations at relapse in most studies.
Collapse
Affiliation(s)
- Gloria P Contreras Yametti
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Gabriel Robbins
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Ashfiyah Chowdhury
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Sonali Narang
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Talia H Ostrow
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Harrison Kilberg
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Joshua Greenberg
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Lindsay Kramer
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Elizabeth Raetz
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Aristotelis Tsirigos
- Departments of Pediatrics and Pathology, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Nikki A Evensen
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - William L Carroll
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
- Department of Pathology, NYU Langone Health, New York, NY, USA
| |
Collapse
|
11
|
Sarel-Gallily R, Keshet G, Kinreich S, Haim-Abadi G, Benvenisty N. EpiTyping: analysis of epigenetic aberrations in parental imprinting and X-chromosome inactivation using RNA-seq. Nat Protoc 2023; 18:3881-3917. [PMID: 37914783 DOI: 10.1038/s41596-023-00898-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 07/28/2023] [Indexed: 11/03/2023]
Abstract
Human pluripotent stem cells (hPSCs) hold a central role in studying human development, in disease modeling and in regenerative medicine. These cells not only acquire genetic modifications when kept in culture, but they may also harbor epigenetic aberrations, mainly involving parental imprinting and X-chromosome inactivation. Here we present a detailed bioinformatic protocol for detecting such aberrations using RNA sequencing data. We provide a pipeline designed to process and analyze RNA sequencing data for the identification of abnormal biallelic expression of imprinted genes, and thus detect loss of imprinting. Furthermore, we show how to differentiate among X-chromosome inactivation, full activation and aberrant erosion of X chromosome in female hPSCs. In addition to providing bioinformatic tools, we discuss the impact of such epigenetic variations in hPSCs on their utility for various purposes. This pipeline can be used by any user with basic understanding of the Linux command line. It is available on GitHub as a software container ( https://github.com/Gal-Keshet/EpiTyping ) and produces reliable results in 1-4 d.
Collapse
Affiliation(s)
- Roni Sarel-Gallily
- The Azrieli Center for Stem Cells and Genetic Research, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Gal Keshet
- The Azrieli Center for Stem Cells and Genetic Research, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel.
| | - Shay Kinreich
- The Azrieli Center for Stem Cells and Genetic Research, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Guy Haim-Abadi
- The Azrieli Center for Stem Cells and Genetic Research, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nissim Benvenisty
- The Azrieli Center for Stem Cells and Genetic Research, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
12
|
Zhang YJ, Luo Z, Sun Y, Liu J, Chen Z. From beasts to bytes: Revolutionizing zoological research with artificial intelligence. Zool Res 2023; 44:1115-1131. [PMID: 37933101 PMCID: PMC10802096 DOI: 10.24272/j.issn.2095-8137.2023.263] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 10/30/2023] [Indexed: 11/08/2023] Open
Abstract
Since the late 2010s, Artificial Intelligence (AI) including machine learning, boosted through deep learning, has boomed as a vital tool to leverage computer vision, natural language processing and speech recognition in revolutionizing zoological research. This review provides an overview of the primary tasks, core models, datasets, and applications of AI in zoological research, including animal classification, resource conservation, behavior, development, genetics and evolution, breeding and health, disease models, and paleontology. Additionally, we explore the challenges and future directions of integrating AI into this field. Based on numerous case studies, this review outlines various avenues for incorporating AI into zoological research and underscores its potential to enhance our understanding of the intricate relationships that exist within the animal kingdom. As we build a bridge between beast and byte realms, this review serves as a resource for envisioning novel AI applications in zoological research that have not yet been explored.
Collapse
Affiliation(s)
- Yu-Juan Zhang
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Zeyu Luo
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Yawen Sun
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Junhao Liu
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Zongqing Chen
- School of Mathematical Sciences
- National Center for Applied Mathematics in Chongqing, Chongqing Normal University, Chongqing 401331, China. E-mail:
| |
Collapse
|
13
|
Wang HQ, Cong PK, He T, Yu XF, Huo YN. A novel pathogenic splicing mutation of RPGR in a Chinese family with X-linked retinitis pigmentosa verified by minigene splicing assay. Int J Ophthalmol 2023; 16:1595-1600. [PMID: 37854381 PMCID: PMC10559041 DOI: 10.18240/ijo.2023.10.06] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 07/13/2023] [Indexed: 10/20/2023] Open
Abstract
AIM To report a novel splicing mutation in the RPGR gene (encoding retinitis pigmentosa GTPase regulator) in a three-generation Chinese family with X-linked retinitis pigmentosa (XLRP). METHODS Comprehensive ophthalmic examinations including best corrected visual acuity, fundus photography, vision field, and pattern-visual evoked potential were performed to identify the disease phenotype of a six-year-old boy from the family (proband). Genomic DNA was extracted from peripheral blood of five available members of the pedigree. Whole-exome sequencing (WES), Sanger sequencing, and pSPL3-based exon trapping were used to investigate the aberrant splicing of RPGR. Human Splice Finder v3.1 and NNSPLICE v0.9 were used for in silico prediction of splice site variants. RESULTS The proband was diagnosed as having retinitis pigmentosa (RP). He had severe symptoms with early onset. A novel splicing mutation, c.619+1G>C in RPGR was identified in the proband by WES and in four family members by Sanger sequencing. Minigene splicing assays verified that c.619+1G>C in RPGR would result in the formation of a damaging alternative transcript in which the last 91 bp of exon 6 were skipped, leading to the subsequent deletion of 623 correct amino acids (c.529_619del p.Val177Glnfs*16). CONCLUSION We identify a novel splice donor site mutation causing aberrant splicing of RPGR. Our findings add to the catalog of pathological mutations of RPGR and further emphasize the functional importance of RPGR in RP pathogenesis and its complex clinical phenotypes.
Collapse
Affiliation(s)
- Hui-Qin Wang
- Department of Ophthalmology, the Second People's Hospital of Quzhou, Quzhou 324022, Zhejiang Province, China
| | - Pei-Kuan Cong
- Key Laboratory of Growth Regulation and Translational Research of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou 310024, Zhejiang Province, China
| | - Tian He
- Department of Ophthalmology, Children's Hospital of Hangzhou, Hangzhou 310005, Zhejiang Province, China
| | - Xiao-Feng Yu
- Department of Ophthalmology, the Second People's Hospital of Quzhou, Quzhou 324022, Zhejiang Province, China
| | - Ya-Nan Huo
- Department of Ophthalmology, the Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou 310020, Zhejiang Province, China
| |
Collapse
|
14
|
Song C, Zhang Y, Zhang Y, Yi S, Pan H, Liao R, Wang Y, Han B. Genome sequencing-based transcriptomic analysis reveals novel genes in Peucedanum praeruptorum. BMC Genom Data 2023; 24:53. [PMID: 37723451 PMCID: PMC10506206 DOI: 10.1186/s12863-023-01157-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Accepted: 09/13/2023] [Indexed: 09/20/2023] Open
Abstract
BACKGROUND Peucedanum praeruptorum Dunn, a traditional Chinese herbal medicine, contains coumarin and volatile oil components that have clinical application value. However, early bolting often occurs in the medicinal materials of Apiaceae plants. The rhizomes of the medicinal parts are gradually lignified after bolting, resulting in a sharp decrease in the content of coumarins. At present, the link between coumarin biosynthesis and early bolting in P. praeruptorum has not been elucidated. RESULTS Combining the genome sequencing and the previous transcriptome sequencing results, we reanalyzed the differential transcripts of P. praeruptorum before and after bolting. A total of 62,088 new transcripts were identified, of which 31,500 were unknown transcripts. Functional classification and annotation showed that many genes were involved in the regulation of transcription, defense response, and carbohydrate metabolic processes. The main domains are the pentatricopeptide repeat, protein kinase, RNA recognition motif, leucine-rich repeat, and ankyrin repeat domains, indicating their pivotal roles in protein modification and signal transduction. Gene structure analysis showed that skipped exon (SE) was the most dominant alternative splicing, followed by the alternative 3' splice site (A3SS) and the alternative 5' splice site (A5SS). Functional enrichment of differentially expressed genes showed that these differentially expressed genes mainly include transmembrane transporters, channel proteins, DNA-binding proteins, polysaccharide-binding proteins, etc. In addition, genes involved in peroxisome, hexose phosphate pathway, phosphatidylinositol signaling system, and inositol phosphate metabolism pathway were greatly enriched. A protein-protein interaction network analysis discoverd 1,457 pairs of proteins that interact with each other. The expression levels of six UbiA genes, three UGT genes, and four OMT genes were higher during the bolting stage. This observation suggests their potential involvement in the catalytic processes of prenylation, glycosylation, and methylation of coumarins, respectively. A total of 100 peroxidase (PRX) genes were identified being involved in lignin polymerization, but only nine PRX genes were highly expressed at the bolting stage. It is worth noting that 73 autophagy-related genes (ATGs) were first identified from the KEGG pathway-enriched genes. Some ATGs, such as BHQH00009837, BHQH00013830, and novel8944, had higher expression levels after bolting. CONCLUSIONS Comparative transcriptome analysis and large-scale genome screening provide guidance and new opinions for the identification of bolting-related genes in P. praeruptorum.
Collapse
Affiliation(s)
- Cheng Song
- Anhui Dabieshan Academy of Traditional Chinese Medicine, Anhui Engineering Laboratory for Conservation and Sustainable Utilization of Traditional Chinese Medicine Resources, Anhui Engineering Research Center for Eco-agriculture of Traditional Chinese Medicine, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012, China.
| | - Yingyu Zhang
- Henan Key Laboratory of Rare Diseases, The First Affiliated Hospital, College of Clinical Medicine of Henan, University of Science and Technology, Luoyang, 471003, China
| | - Yunpeng Zhang
- Shanghai Key Laboratory of Regulatory Biology, School of Life Science, East China Normal University, Shanghai, 200241, China
| | - Shanyong Yi
- Anhui Dabieshan Academy of Traditional Chinese Medicine, Anhui Engineering Laboratory for Conservation and Sustainable Utilization of Traditional Chinese Medicine Resources, Anhui Engineering Research Center for Eco-agriculture of Traditional Chinese Medicine, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012, China
| | - Haoyu Pan
- Anhui Dabieshan Academy of Traditional Chinese Medicine, Anhui Engineering Laboratory for Conservation and Sustainable Utilization of Traditional Chinese Medicine Resources, Anhui Engineering Research Center for Eco-agriculture of Traditional Chinese Medicine, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012, China
- School of Life Science, Anhui Agricultural University, Hefei, 230036, China
| | - Ranran Liao
- Anhui Dabieshan Academy of Traditional Chinese Medicine, Anhui Engineering Laboratory for Conservation and Sustainable Utilization of Traditional Chinese Medicine Resources, Anhui Engineering Research Center for Eco-agriculture of Traditional Chinese Medicine, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012, China
- School of Pharmacy, Anhui University of Chinese Medicine, Hefei, 230012, China
| | - Yuanyuan Wang
- Anhui Dabieshan Academy of Traditional Chinese Medicine, Anhui Engineering Laboratory for Conservation and Sustainable Utilization of Traditional Chinese Medicine Resources, Anhui Engineering Research Center for Eco-agriculture of Traditional Chinese Medicine, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012, China
- School of Pharmacy, Anhui University of Chinese Medicine, Hefei, 230012, China
| | - Bangxing Han
- Anhui Dabieshan Academy of Traditional Chinese Medicine, Anhui Engineering Laboratory for Conservation and Sustainable Utilization of Traditional Chinese Medicine Resources, Anhui Engineering Research Center for Eco-agriculture of Traditional Chinese Medicine, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012, China.
| |
Collapse
|
15
|
Podobnik M, Singh AP, Fu Z, Dooley CM, Frohnhöfer HG, Firlej M, Stednitz SJ, Elhabashy H, Weyand S, Weir JR, Lu J, Nüsslein-Volhard C, Irion U. kcnj13 regulates pigment cell shapes in zebrafish and has diverged by cis-regulatory evolution between Danio species. Development 2023; 150:dev201627. [PMID: 37530080 PMCID: PMC10482006 DOI: 10.1242/dev.201627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 07/21/2023] [Indexed: 08/03/2023]
Abstract
Teleost fish of the genus Danio are excellent models to study the genetic and cellular bases of pigment pattern variation in vertebrates. The two sister species Danio rerio and Danio aesculapii show divergent patterns of horizontal stripes and vertical bars that are partly caused by the divergence of the potassium channel gene kcnj13. Here, we show that kcnj13 is required only in melanophores for interactions with xanthophores and iridophores, which cause location-specific pigment cell shapes and thereby influence colour pattern and contrast in D. rerio. Cis-regulatory rather than protein coding changes underlie kcnj13 divergence between the two Danio species. Our results suggest that homotypic and heterotypic interactions between the pigment cells and their shapes diverged between species by quantitative changes in kcnj13 expression during pigment pattern diversification.
Collapse
Affiliation(s)
- Marco Podobnik
- Max Planck Institute for Biology, 72076 Tübingen, Germany
| | - Ajeet P. Singh
- Chemical Biology and Therapeutics, Novartis Institutes for BioMedical Research, Cambridge, MA 02139, USA
| | - Zhenqiang Fu
- School of Marine Sciences, Sun Yat-sen University, Zhuhai 519082, China
| | - Christopher M. Dooley
- Department of Genetics, Max Planck Institute for Heart and Lung Research, 61231 Bad Nauheim, Germany
| | | | - Magdalena Firlej
- Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
| | - Sarah J. Stednitz
- Department of Anatomy & Physiology, University of Melbourne, Victoria, 3010, Melbourne, Australia
| | - Hadeer Elhabashy
- Department of Protein Evolution, Max Planck Institute for Biology, 72076 Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
- Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany
| | - Simone Weyand
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK
| | - John R. Weir
- Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
| | - Jianguo Lu
- School of Marine Sciences, Sun Yat-sen University, Zhuhai 519082, China
| | | | - Uwe Irion
- Max Planck Institute for Biology, 72076 Tübingen, Germany
| |
Collapse
|
16
|
Fachrul M, Karkey A, Shakya M, Judd LM, Harshegyi T, Sim KS, Tonks S, Dongol S, Shrestha R, Salim A, Baker S, Pollard AJ, Khor CC, Dolecek C, Basnyat B, Dunstan SJ, Holt KE, Inouye M. Direct inference and control of genetic population structure from RNA sequencing data. Commun Biol 2023; 6:804. [PMID: 37532769 PMCID: PMC10397182 DOI: 10.1038/s42003-023-05171-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 07/24/2023] [Indexed: 08/04/2023] Open
Abstract
RNAseq data can be used to infer genetic variants, yet its use for estimating genetic population structure remains underexplored. Here, we construct a freely available computational tool (RGStraP) to estimate RNAseq-based genetic principal components (RG-PCs) and assess whether RG-PCs can be used to control for population structure in gene expression analyses. Using whole blood samples from understudied Nepalese populations and the Geuvadis study, we show that RG-PCs had comparable results to paired array-based genotypes, with high genotype concordance and high correlations of genetic principal components, capturing subpopulations within the dataset. In differential gene expression analysis, we found that inclusion of RG-PCs as covariates reduced test statistic inflation. Our paper demonstrates that genetic population structure can be directly inferred and controlled for using RNAseq data, thus facilitating improved retrospective and future analyses of transcriptomic data.
Collapse
Affiliation(s)
- Muhamad Fachrul
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- Department of Clinical Pathology, University of Melbourne, Parkville, VIC, Australia.
- School of BioSciences, The University of Melbourne, Parkville, VIC, Australia.
| | - Abhilasha Karkey
- Oxford University Clinical Research Unit, Patan Academy of Health Sciences, Kathmandu, Nepal
- Patan Academy of Health Sciences, Patan Hospital, Lalitpur, Nepal
| | - Mila Shakya
- Oxford University Clinical Research Unit, Patan Academy of Health Sciences, Kathmandu, Nepal
- Patan Academy of Health Sciences, Patan Hospital, Lalitpur, Nepal
| | - Louise M Judd
- Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, VIC, Australia
| | - Taylor Harshegyi
- Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, VIC, Australia
| | - Kar Seng Sim
- Genome Institute of Singapore, Singapore, Singapore
| | - Susan Tonks
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Sabina Dongol
- Oxford University Clinical Research Unit, Patan Academy of Health Sciences, Kathmandu, Nepal
- Patan Academy of Health Sciences, Patan Hospital, Lalitpur, Nepal
| | | | - Agus Salim
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, Australia
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, VIC, Australia
- Department of Population Health, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
| | - Stephen Baker
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | | | - Christiane Dolecek
- Nuffield Department of Medicine, Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK
- Mahidol Oxford Tropical Medicine Research Unit, Mahidol University, Bangkok, Thailand
| | - Buddha Basnyat
- Oxford University Clinical Research Unit, Patan Academy of Health Sciences, Kathmandu, Nepal
- Nuffield Department of Medicine, Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK
| | - Sarah J Dunstan
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC, Australia
| | - Kathryn E Holt
- Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, VIC, Australia
- Department of Infection Biology, London School of Hygiene & Tropical Medicine, London, UK
| | - Michael Inouye
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- Department of Clinical Pathology, University of Melbourne, Parkville, VIC, Australia.
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
- Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, UK.
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
- British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, UK.
- Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, UK.
| |
Collapse
|
17
|
Vigorito E, Barton A, Pitzalis C, Lewis MJ, Wallace C. BBmix: a Bayesian beta-binomial mixture model for accurate genotyping from RNA-sequencing. Bioinformatics 2023; 39:btad393. [PMID: 37338536 PMCID: PMC10318392 DOI: 10.1093/bioinformatics/btad393] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 05/15/2023] [Accepted: 06/19/2023] [Indexed: 06/21/2023] Open
Abstract
MOTIVATION While many pipelines have been developed for calling genotypes using RNA-sequencing (RNA-Seq) data, they all have adapted DNA genotype callers that do not model biases specific to RNA-Seq such as allele-specific expression (ASE). RESULTS Here, we present Bayesian beta-binomial mixture model (BBmix), a Bayesian beta-binomial mixture model that first learns the expected distribution of read counts for each genotype, and then deploys those learned parameters to call genotypes probabilistically. We benchmarked our model on a wide variety of datasets and showed that our method generally performed better than competitors, mainly due to an increase of up to 1.4% in the accuracy of heterozygous calls, which may have a big impact in reducing false positive rate in applications sensitive to genotyping error such as ASE. Moreover, BBmix can be easily incorporated into standard pipelines for calling genotypes. We further show that parameters are generally transferable within datasets, such that a single learning run of less than 1 h is sufficient to call genotypes in a large number of samples. AVAILABILITY AND IMPLEMENTATION We implemented BBmix as an R package that is available for free under a GPL-2 licence at https://gitlab.com/evigorito/bbmix and https://cran.r-project.org/package=bbmix with accompanying pipeline at https://gitlab.com/evigorito/bbmix_pipeline.
Collapse
Affiliation(s)
- Elena Vigorito
- MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, United Kingdom
| | - Anne Barton
- Division of Musculoskeletal and Dermatological Sciences, University of Manchester, Manchester M13 9PL, United Kingdom
| | - Costantino Pitzalis
- Centre for Experimental Medicine and Rheumatology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, United Kingdom
| | - Myles J Lewis
- Centre for Experimental Medicine and Rheumatology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, United Kingdom
| | - Chris Wallace
- MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, United Kingdom
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge CB2 0AW, United Kingdom
| |
Collapse
|
18
|
Cook DE, Venkat A, Yelizarov D, Pouliot Y, Chang PC, Carroll A, De La Vega FM. A deep-learning-based RNA-seq germline variant caller. BIOINFORMATICS ADVANCES 2023; 3:vbad062. [PMID: 37416509 PMCID: PMC10320079 DOI: 10.1093/bioadv/vbad062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 03/31/2023] [Accepted: 05/30/2023] [Indexed: 07/08/2023]
Abstract
Summary RNA sequencing (RNA-seq) can be applied to diverse tasks including quantifying gene expression, discovering quantitative trait loci and identifying gene fusion events. Although RNA-seq can detect germline variants, the complexities of variable transcript abundance, target capture and amplification introduce challenging sources of error. Here, we extend DeepVariant, a deep-learning-based variant caller, to learn and account for the unique challenges presented by RNA-seq data. Our DeepVariant RNA-seq model produces highly accurate variant calls from RNA-sequencing data, and outperforms existing approaches such as Platypus and GATK. We examine factors that influence accuracy, how our model addresses RNA editing events and how additional thresholding can be used to facilitate our models' use in a production pipeline. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
|
19
|
Lai X, Zhang Z, Zhang Z, Liu S, Bai C, Chen Z, Qadri QR, Fang Y, Wang Z, Pan Y, Wang Q. Integrated microbiome-metabolome-genome axis data of Laiwu and Lulai pigs. Sci Data 2023; 10:280. [PMID: 37179393 PMCID: PMC10183000 DOI: 10.1038/s41597-023-02191-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 04/27/2023] [Indexed: 05/15/2023] Open
Abstract
Excessive fat deposition can trigger metabolic diseases, and it is crucial to identify factors that can break the link between fat deposition and metabolic diseases. Healthy obese Laiwu pigs (LW) are high in fat content but resistant to metabolic diseases. In this study, we compared the fecal microbiome, fecal and blood metabolome, and genome of LW and Lulai pigs (LU) to identify factors that can block the link between fat deposition and metabolic diseases. Our results show significant differences in Spirochetes and Treponema, which are involved in carbohydrate metabolism, between LW and LU. The fecal and blood metabolome composition was similar, and some anti-metabolic disease components of blood metabolites were different between the two breeds of pigs. The predicted differential RNA is mainly enriched in lipid metabolism and glucose metabolism, which is consistent with the functions of differential microbiota and metabolites. The down-regulated gene RGP1 is strongly negatively correlated with Treponema. Our omics data would provide valuable resources for further scientific research on healthy obesity in both human and porcine.
Collapse
Affiliation(s)
- Xueshuang Lai
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
| | - Zhenyang Zhang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
| | - Zhe Zhang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
| | - Shengqiang Liu
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
- Hainan institute, Zhejiang University, Sanya, 310014, PR China
| | - Chunyan Bai
- Department of Animal Science, College of Animal Sciences, Jilin University, Changchui, 130015, PR China
| | - Zitao Chen
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
| | - Qamar Raza Qadri
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Yifei Fang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
| | - Zhen Wang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China
| | - Yuchun Pan
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China.
- Hainan institute, Zhejiang University, Sanya, 310014, PR China.
| | - Qishan Wang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou, 310030, PR China.
- Hainan institute, Zhejiang University, Sanya, 310014, PR China.
| |
Collapse
|
20
|
Marrella MA, Biase FH. Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing. J Anim Sci Biotechnol 2023; 14:62. [PMID: 37143150 PMCID: PMC10161580 DOI: 10.1186/s40104-023-00861-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 03/05/2023] [Indexed: 05/06/2023] Open
Abstract
BACKGROUND A gap currently exists between genetic variants and the underlying cell and tissue biology of a trait, and expression quantitative trait loci (eQTL) studies provide important information to help close that gap. However, two concerns that arise with eQTL analyses using RNA-sequencing data are normalization of data across samples and the data not following a normal distribution. Multiple pipelines have been suggested to address this. For instance, the most recent analysis of the human and farm Genotype-Tissue Expression (GTEx) project proposes using trimmed means of M-values (TMM) to normalize the data followed by an inverse normal transformation. RESULTS In this study, we reasoned that eQTL analysis could be carried out using the same framework used for differential gene expression (DGE), which uses a negative binomial model, a statistical test feasible for count data. Using the GTEx framework, we identified 35 significant eQTLs (P < 5 × 10-8) following the ANOVA model and 39 significant eQTLs (P < 5 × 10-8) following the additive model. Using a differential gene expression framework, we identified 930 and six significant eQTLs (P < 5 × 10-8) following an analytical framework equivalent to the ANOVA and additive model, respectively. When we compared the two approaches, there was no overlap of significant eQTLs between the two frameworks. Because we defined specific contrasts, we identified trans eQTLs that more closely resembled what we expect from genetic variants showing complete dominance between alleles. Yet, these were not identified by the GTEx framework. CONCLUSIONS Our results show that transforming RNA-sequencing data to fit a normal distribution prior to eQTL analysis is not required when the DGE framework is employed. Our proposed approach detected biologically relevant variants that otherwise would not have been identified due to data transformation to fit a normal distribution.
Collapse
Affiliation(s)
- Mackenzie A Marrella
- School of Animal Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | - Fernando H Biase
- School of Animal Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA.
| |
Collapse
|
21
|
Nie C, Zhang Y, Zhang X, Xia W, Sun H, Zhang S, Li N, Ding Z, Lv Y, Wang N. Genome assembly, resequencing and genome-wide association analyses provide novel insights into the origin, evolution and flower colour variations of flowering cherry. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023; 114:519-533. [PMID: 36786729 DOI: 10.1111/tpj.16151] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 05/10/2023]
Abstract
Flowering cherry is a very popular species around the world. High-quality genome resources for different elite cultivars are needed, and the understanding of their origins and the regulation of key ornamental traits are limited for this tree. Here, a high-quality chromosome-scale genome of Prunus campanulata 'Plena' (PCP), which is a native and elite flowering cherry cultivar in China, was generated. The contig N50 of the genome was 18.31 Mb, and 99.98% of its contigs were anchored to eight chromosomes. Furthermore, a total of 306 accessions of flowering cherry germplasm and six lines of outgroups were collected. Resequencing of these 312 lines was performed, and 761 267 high-quality genomic variants were obtained. The origins of flowering cherry were predicted, and these 306 accessions could be classified into three clades, A, B and C. According to phylogenetic analysis, we predicted two origins of flowering cherry. Flowering cherry in clade A originated in southern China, such as in the Himalayan Mountains, while clades B and C originated in northeastern China. Finally, a genome-wide association study of flower colour was performed for all 312 accessions of flowering cherry germplasm. A total of seven quantitative trait loci (QTLs) were identified. One gene encoding glycosylate transferase was predicted as the candidate gene for one QTL. Taken together, our results provide a valuable genomic resource and novel insights into the origin, evolution and flower colour variations of flowering cherry.
Collapse
Affiliation(s)
- Chaoren Nie
- School of Landscape Architecture, Beijing Forestry of University, Beijing, 100083, China
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Yingjie Zhang
- Yantai Academy of Agricultural Sciences, Yantai, Shandong, 265500, China
| | - Xiaoqin Zhang
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Wensheng Xia
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Hongbing Sun
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Sisi Zhang
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Na Li
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Zhaoquan Ding
- Wuhan Institute of Landscape Architecture, Wuhan, 430081, China
| | - Yingmin Lv
- School of Landscape Architecture, Beijing Forestry of University, Beijing, 100083, China
| | - Nian Wang
- College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
22
|
Ruiz-De-La-Cruz G, Sifuentes-Rincón AM, Casas E, Paredes-Sánchez FA, Parra-Bracamonte GM, Riley DG, Perry GA, Welsh TH, Randel RD. Genetic Variants and Their Putative Effects on microRNA-Seed Sites: Characterization of the 3' Untranslated Region of Genes Associated with Temperament. Genes (Basel) 2023; 14:genes14051004. [PMID: 37239364 DOI: 10.3390/genes14051004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/21/2023] [Accepted: 04/25/2023] [Indexed: 05/28/2023] Open
Abstract
The 3' untranslated region has an important role in gene regulation through microRNAs, and it has been estimated that microRNAs regulate up to 50% of coding genes in mammals. With the aim of allelic variant identification of 3' untranslated region microRNA seed sites, the 3' untranslated region was searched for seed sites of four temperament-associated genes (CACNG4, EXOC4, NRXN3, and SLC9A4). The microRNA seed sites were predicted in the four genes, and the CACNG4 gene had the greatest number with 12 predictions. To search for variants affecting the predicted microRNA seed sites, the four 3' untranslated regions were re-sequenced in a Brahman cattle population. Eleven single nucleotide polymorphisms were identified in the CACNG4, and eleven in the SLC9A4. Rs522648682:T>G of the CACNG4 gene was located at the predicted seed site for bta-miR-191. Rs522648682:T>G evidenced an association with both exit velocity (p = 0.0054) and temperament score (p = 0.0097). The genotype TT had a lower mean exit velocity (2.93 ± 0.4 m/s) compared with the TG and GG genotypes (3.91 ± 0.46 m/s and 3.67 ± 0.46 m/s, respectively). The allele associated with the temperamental phenotype antagonizes the seed site, disrupting the bta-miR-191 recognition. The G allele of CACNG4-rs522648682 has the potential to influence bovine temperament through a mechanism associated with unspecific recognition of bta-miR-191.
Collapse
Affiliation(s)
- Gilberto Ruiz-De-La-Cruz
- Laboratorio de Biotecnología Animal, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico
| | - Ana María Sifuentes-Rincón
- Laboratorio de Biotecnología Animal, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico
| | - Eduardo Casas
- National Animal Disease Center, Agricultural Research Service, Unite States Department of Agriculture, Ames, IA 50010, USA
| | | | - Gaspar Manuel Parra-Bracamonte
- Laboratorio de Biotecnología Animal, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico
| | - David G Riley
- Department of Animal Science, Texas A&M University, College Station, TX 77843, USA
| | | | - Thomas H Welsh
- Department of Animal Science, Texas A&M University, College Station, TX 77843, USA
| | | |
Collapse
|
23
|
Chu Y, Meng Q, Yu J, Zhang J, Chen J, Kang Y. Strain-Level Dynamics Reveal Regulatory Roles in Atopic Eczema by Gut Bacterial Phages. Microbiol Spectr 2023; 11:e0455122. [PMID: 36951555 PMCID: PMC10101075 DOI: 10.1128/spectrum.04551-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 03/06/2023] [Indexed: 03/24/2023] Open
Abstract
The vast population of bacterial phages or viruses (virome) plays pivotal roles in the ecology of human microbial flora and health conditions. Obstacles, including poor viral sequence inference, strain-sensitive virus-host relationship, and the high diversity among individuals, hinder the in-depth understanding of the human virome. We conducted longitudinal studies of the virome based on constructing a high-quality personal reference metagenome (PRM). By applying long-read sequencing for representative samples, we could build a PRM of high continuity that allows accurate annotation and abundance estimation of viruses and bacterial species in all samples of the same individual by aligning short sequencing reads to the PRM. We applied this approach to a series of fecal samples collected for 6 months from a 2-year-old boy who had experienced a 2-month flare-up of atopic eczema (dermatitis) in this period. We identified 31 viral strains in the patient's gut microbiota and deciphered their strain-level relationship to their bacterial hosts. Among them, a lytic crAssphage developed into a dozen substrains and coordinated downregulation in the catabolism of aromatic amino acids (AAAs) in their host bacteria which govern the production of immune-active AAA derivates. The metabolic alterations confirmed based on metabolomic assays cooccurred with symptom remission. Our PRM-based analysis provides an easy approach for deciphering the dynamics of the strain-level human gut virome in the context of entire microbiota. Close temporal correlations among virome alteration, microbial metabolism, and disease remission suggest a potential mechanism for how bacterial phages in microbiota are intimately related to human health. IMPORTANCE The vast populations of viruses or bacteriophages in human gut flora remain mysterious. However, poor annotation and abundance estimation remain obstacles to strain-level analysis and clarification of their roles in microbiome ecology and metabolism associated with human health and diseases. We demonstrate that a personal reference metagenome (PRM)-based approach provides strain-level resolution for analyzing the gut microbiota-associated virome. When applying such an approach to longitudinal samples collected from a 2-year-old boy who has experienced a 2-month flare-up of atopic eczema, we observed thriving substrains of a lytic crAssphage, showing temporal correlation with downregulated catabolism of aromatic amino acids, lower production of immune-active metabolites, and remission of the disease. The PRM-based approach is practical and powerful for strain-centric analysis of the human gut virome, and the underlying mechanism of how strain-level virome dynamics affect disease deserves further investigation.
Collapse
Affiliation(s)
- Yanan Chu
- Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation, Beijing, China
| | - Qingren Meng
- School of Medicine, Southern University of Science and Technology, Shenzhen, China
| | - Jun Yu
- University of Chinese Academy of Sciences, Beijing, China
| | - Juan Zhang
- Department of Pediatric, Peking University Third Hospital, Beijing, China
| | - Jing Chen
- Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation, Beijing, China
| | - Yu Kang
- Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation, Beijing, China
| |
Collapse
|
24
|
Fang Y, Ji Z, Zhou W, Abante J, Koldobskiy MA, Ji H, Feinberg A. DNA methylation entropy is associated with DNA sequence features and developmental epigenetic divergence. Nucleic Acids Res 2023; 51:2046-2065. [PMID: 36762477 PMCID: PMC10018346 DOI: 10.1093/nar/gkad050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 12/02/2022] [Accepted: 02/04/2023] [Indexed: 02/11/2023] Open
Abstract
Epigenetic information defines tissue identity and is largely inherited in development through DNA methylation. While studied mostly for mean differences, methylation also encodes stochastic change, defined as entropy in information theory. Analyzing allele-specific methylation in 49 human tissue sample datasets, we find that methylation entropy is associated with specific DNA binding motifs, regulatory DNA, and CpG density. Then applying information theory to 42 mouse embryo methylation datasets, we find that the contribution of methylation entropy to time- and tissue-specific patterns of development is comparable to the contribution of methylation mean, and methylation entropy is associated with sequence and chromatin features conserved with human. Moreover, methylation entropy is directly related to gene expression variability in development, suggesting a role for epigenetic entropy in developmental plasticity.
Collapse
Affiliation(s)
- Yuqi Fang
- Center for Epigenetics, Johns Hopkins University, 855 N. Wolfe St., Baltimore, MD 21205, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Zhicheng Ji
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N. Wolfe St., Baltimore, MD 21205, USA
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC 27708, USA
| | - Weiqiang Zhou
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N. Wolfe St., Baltimore, MD 21205, USA
| | - Jordi Abante
- Department of Electrical & Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Michael A Koldobskiy
- Center for Epigenetics, Johns Hopkins University, 855 N. Wolfe St., Baltimore, MD 21205, USA
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, 855 N. Wolfe St., Baltimore, MD 21205, USA
| | - Hongkai Ji
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N. Wolfe St., Baltimore, MD 21205, USA
| | - Andrew P Feinberg
- Center for Epigenetics, Johns Hopkins University, 855 N. Wolfe St., Baltimore, MD 21205, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N. Wolfe St., Baltimore, MD 21205, USA
- Department of Medicine, Johns Hopkins University School of Medicine, 600 N Wolfe St, Baltimore, MD 21205, USA
| |
Collapse
|
25
|
Kakade P, Sircaik S, Maufrais C, Ene IV, Bennett RJ. Aneuploidy and gene dosage regulate filamentation and host colonization by Candida albicans. Proc Natl Acad Sci U S A 2023; 120:e2218163120. [PMID: 36893271 PMCID: PMC10089209 DOI: 10.1073/pnas.2218163120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 02/02/2023] [Indexed: 03/11/2023] Open
Abstract
Aneuploidy is a frequent occurrence in fungal species where it can alter gene expression and promote adaptation to a variety of environmental cues. Multiple forms of aneuploidy have been observed in the opportunistic fungal pathogen Candida albicans, which is a common component of the human gut mycobiome but can escape this niche and cause life-threatening systemic disease. Using a barcode sequencing (Bar-seq) approach, we evaluated a set of diploid C. albicans strains and found that a strain carrying a third copy of chromosome (Chr) 7 was associated with increased fitness during both gastrointestinal (GI) colonization and systemic infection. Our analysis revealed that the presence of a Chr 7 trisomy resulted in decreased filamentation, both in vitro and during GI colonization, relative to isogenic euploid controls. A target gene approach demonstrated that NRG1, encoding a negative regulator of filamentation located on Chr 7, contributes to increased fitness of the aneuploid strain due to inhibition of filamentation in a gene dosage-dependent fashion. Together, these experiments establish how aneuploidy enables the reversible adaptation of C. albicans to its host via gene dosage-dependent regulation of morphology.
Collapse
Affiliation(s)
- Pallavi Kakade
- Molecular Microbiology and Immunology Department, Brown University, Providence, RI02912
| | - Shabnam Sircaik
- Molecular Microbiology and Immunology Department, Brown University, Providence, RI02912
| | - Corinne Maufrais
- Institut Pasteur Bioinformatic Hub, Université Paris Cité, Paris75015, France
- Institut Pasteur, Université Paris Cité, Fungal Heterogeneity Lab, Paris75015, France
| | - Iuliana V. Ene
- Institut Pasteur, Université Paris Cité, Fungal Heterogeneity Lab, Paris75015, France
| | - Richard J. Bennett
- Molecular Microbiology and Immunology Department, Brown University, Providence, RI02912
| |
Collapse
|
26
|
Yan C, Song MH, Jiang D, Ren JL, Lv Y, Chang J, Huang S, Zaher H, Li JT. Genomic evidence reveals intraspecific divergence of the hot-spring snake (Thermophis baileyi), an endangered reptile endemic to the Qinghai-Tibet plateau. Mol Ecol 2023; 32:1335-1350. [PMID: 36073004 DOI: 10.1111/mec.16687] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 09/04/2022] [Accepted: 09/06/2022] [Indexed: 11/27/2022]
Abstract
Understanding how and why species evolve requires knowledge on intraspecific divergence. In this study, we examined intraspecific divergence in the endangered hot-spring snake (Thermophis baileyi), an endemic species on the Qinghai-Tibet Plateau (QTP). Whole-genome resequencing of 58 sampled individuals from 15 populations was performed to identify the drivers of intraspecific divergence and explore the potential roles of genes under selection. Our analyses resolved three groups, with major intergroup admixture occurring in regions of group contact. Divergence probably occurred during the Pleistocene as a result of glacial climatic oscillations, Yadong-Gulu rift, and geothermal fields differentiation, while complex gene flow between group pairs reflected a unique intraspecific divergence pattern on the QTP. Intergroup fixed loci involved selected genes functionally related to divergence and local adaptation, especially adaptation to hot spring microenvironments in different geothermal fields. Analysis of structural variants, genetic diversity, inbreeding, and genetic load indicated that the hot-spring snake population has declined to a low level with decreased diversity, which is important for the conservation management of this endangered species. Our study demonstrated that the integration of demographic history, gene flow, genomic divergence genes, and other information is necessary to distinguish the evolutionary processes involved in speciation.
Collapse
Affiliation(s)
- Chaochao Yan
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Meng-Huan Song
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Dechun Jiang
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Jin-Long Ren
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yunyun Lv
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Jiang Chang
- State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing, China
| | - Song Huang
- College of Life Sciences, Anhui Normal University, Wuhu, China
| | - Hussam Zaher
- Museu de Zoologia, Universidade de São Paulo, São Paulo, São Paulo, Brazil
| | - Jia-Tang Li
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China.,University of Chinese Academy of Sciences, Beijing, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.,Mangkang Biodiversity and Ecological Station, Tibet Ecological Safety Monitor Network, Changdu, China
| |
Collapse
|
27
|
Kui L, Majeed A, Wang X, Yang Z, Chen J, He L, Di Y, Li X, Qian Z, Jiao Y, Wang G, Liu L, Xu R, Gu S, Yang Q, Chen S, Lou H, Meng Y, Xie L, Xu F, Shen Q, Singh A, Gruber K, Pan Y, Hao T, Dong Y, Li F. A chromosome-level genome assembly for Erianthus fulvus provides insights into its biofuel potential and facilitates breeding for improvement of sugarcane. PLANT COMMUNICATIONS 2023:100562. [PMID: 36814384 PMCID: PMC10363513 DOI: 10.1016/j.xplc.2023.100562] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 12/21/2022] [Accepted: 02/16/2023] [Indexed: 06/18/2023]
Abstract
Erianthus produces substantial biomass, exhibits a good Brix value, and shows wide environmental adaptability, making it a potential biofuel plant. In contrast to closely related sorghum and sugarcane, Erianthus can grow in degraded soils, thus releasing pressure on agricultural lands used for biofuel production. However, the lack of genomic resources for Erianthus hinders its genetic improvement, thus limiting its potential for biofuel production. In the present study, we generated a chromosome-scale reference genome for Erianthus fulvus Nees. The genome size estimated by flow cytometry was 937 Mb, and the assembled genome size was 902 Mb, covering 96.26% of the estimated genome size. A total of 35 065 protein-coding genes were predicted, and 67.89% of the genome was found to be repetitive. A recent whole-genome duplication occurred approximately 74.10 million years ago in the E. fulvus genome. Phylogenetic analysis showed that E. fulvus is evolutionarily closer to S. spontaneum and diverged after S. bicolor. Three of the 10 chromosomes of E. fulvus formed through rearrangements of ancestral chromosomes. Phylogenetic reconstruction of the Saccharum complex revealed a polyphyletic origin of the complex and a sister relationship of E. fulvus with Saccharum sp., excluding S. arundinaceum. On the basis of the four amino acid residues that provide substrate specificity, the E. fulvus SWEET proteins were classified as mono- and disaccharide sugar transporters. Ortho-QTL genes identified for 10 biofuel-related traits may aid in the rapid screening of E. fulvus populations to enhance breeding programs for improved biofuel production. The results of this study provide valuable insights for breeding programs aimed at improving biofuel production in E. fulvus and enhancing sugarcane introgression programs.
Collapse
Affiliation(s)
- Ling Kui
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen 518067, China
| | - Aasim Majeed
- Plant Molecular Genetics Laboratory, School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, India
| | - Xianhong Wang
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China; The Key Laboratory of Crop Production and Smart Agriculture of Yunnan Province, Kunming, Yunnan 650201, China
| | - Zijiang Yang
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Jian Chen
- International Genome Center, Jiangsu University, Zhenjiang, Jiangsu 212013, China
| | - Lilian He
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Yining Di
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Xuzhen Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China
| | - Zhenfeng Qian
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Yinming Jiao
- Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen 518067, China
| | - Guoyun Wang
- Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen 518067, China
| | - Lufeng Liu
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; The Key Laboratory of Crop Production and Smart Agriculture of Yunnan Province, Kunming, Yunnan 650201, China
| | - Rong Xu
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Shujie Gu
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Qinghui Yang
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Shuying Chen
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Hongbo Lou
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Yu Meng
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Linyan Xie
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Fu Xu
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Qingqing Shen
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Amit Singh
- Institute of Molecular Biosciences, University of Graz, 8010 Graz, Austria
| | - Karl Gruber
- Institute of Molecular Biosciences, University of Graz, 8010 Graz, Austria
| | - Yunbing Pan
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China
| | - Tingting Hao
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China
| | - Yang Dong
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China.
| | - Fusheng Li
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China; The Key Laboratory of Crop Production and Smart Agriculture of Yunnan Province, Kunming, Yunnan 650201, China.
| |
Collapse
|
28
|
Pasitka L, Cohen M, Ehrlich A, Gildor B, Reuveni E, Ayyash M, Wissotsky G, Herscovici A, Kaminker R, Niv A, Bitcover R, Dadia O, Rudik A, Voloschin A, Shimoni M, Cinnamon Y, Nahmias Y. Spontaneous immortalization of chicken fibroblasts generates stable, high-yield cell lines for serum-free production of cultured meat. NATURE FOOD 2023; 4:35-50. [PMID: 37118574 DOI: 10.1038/s43016-022-00658-w] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 11/03/2022] [Indexed: 04/30/2023]
Abstract
Cellular agriculture could meet growing demand for animal products, but yields are typically low and regulatory bodies restrict genetic modification for cultured meat production. Here we demonstrate the spontaneous immortalization and genetic stability of fibroblasts derived from several chicken breeds. Cell lines were adapted to grow as single-cell suspensions using serum-free culture medium, reaching densities of 108 × 106 cells per ml in continuous culture, corresponding to yields of 36% w/v. We show that lecithin activates peroxisome proliferator-activated receptor gamma (PPARγ), inducing adipogenesis in immortalized fibroblasts. Blending cultured adipocyte-like cells with extruded soy protein, formed chicken strips in which texture was supported by animal and plant proteins while aroma and flavour were driven by cultured animal fat. Visual and sensory analysis graded the product 4.5/5.0, with 85% of participants extremely likely to replace their food choice with this cultured meat product. Immortalization without genetic modification and high-yield manufacturing are critical for the market realization of cultured meat.
Collapse
Affiliation(s)
- L Pasitka
- Grass Center for Bioengineering, Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - M Cohen
- Grass Center for Bioengineering, Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
- Department of Cell and Developmental Biology, Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - A Ehrlich
- Grass Center for Bioengineering, Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | | | | - M Ayyash
- Grass Center for Bioengineering, Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
- Believer Meats, Rehovot, Israel
| | | | | | | | - A Niv
- Believer Meats, Rehovot, Israel
| | | | - O Dadia
- Believer Meats, Rehovot, Israel
| | - A Rudik
- Believer Meats, Rehovot, Israel
| | | | | | - Y Cinnamon
- Institute of Animal Science, Agricultural Research Organization, The Volcani Center, Bet Dagan, Israel
| | - Y Nahmias
- Grass Center for Bioengineering, Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel.
- Department of Cell and Developmental Biology, Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel.
- Believer Meats, Rehovot, Israel.
| |
Collapse
|
29
|
Liang Z, Liu K, Jiang C, Yang A, Yan J, Han X, Zhang C, Cong P, Zhang L. Insertion of a TRIM-like sequence in MdFLS2-1 promoter is associated with its allele-specific expression in response to Alternaria alternata in apple. FRONTIERS IN PLANT SCIENCE 2022; 13:1090621. [PMID: 36643297 PMCID: PMC9834810 DOI: 10.3389/fpls.2022.1090621] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 12/09/2022] [Indexed: 06/17/2023]
Abstract
Alternaria blotch disease, caused by Alternaria alternata apple pathotype (AAAP), is one of the major fungal diseases in apple. Early field observations revealed, the anther-derived homozygote Hanfu line (HFTH1) was highly susceptible to AAAP, whereas Hanfu (HF) exhibited resistance to AAAP. To understand the molecular mechanisms underlying the difference in sensitivity of HF and HFTH1 to AAAP, we performed allele-specific expression (ASE) analysis and comparative transcriptomic analysis before and after AAAP inoculation. We reported an important immune gene, namely, MdFLS2, which displayed strong ASE in HF with much lower expression levels of HFTH1-derived alleles. Transient overexpression of the dominant allele of MdFLS2-1 from HF in GL-3 apple leaves could enhance resistance to AAAP and induce expression of genes related to salicylic acid pathway. In addition, MdFLS2-1 was identified with an insertion of an 85-bp terminal-repeat retrotransposon in miniature (TRIM) element-like sequence in the upstream region of the nonreference allele. In contrast, only one terminal direct repeat (TDR) from TRIM-like sequence was present in the upstream region of the HFTH1-derived allele MdFLS2-2. Furthermore, the results of luciferase and β-glucuronidase reporter assays demonstrated that the intact TRIM-like sequence has enhancer activity. This suggested that insertion of the TRIM-like sequence regulates the expression level of the allele of MdFLS2, in turn, affecting the sensitivity of HF and HFTH1 to AAAP.
Collapse
Affiliation(s)
- Zhaolin Liang
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Kai Liu
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Chunyang Jiang
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - An Yang
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Jiadi Yan
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Xiaolei Han
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Caixia Zhang
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Peihua Cong
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| | - Liyi Zhang
- Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Xingcheng, China
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (Germplasm Resources Utilization), Research Institute of Pomology, Chinese Academy of Agricultural Sciences, Ministry of Agriculture, Xingcheng, China
| |
Collapse
|
30
|
Intra- and Interspecies RNA-Seq Based Variants in the Lactation Process of Ruminants. Animals (Basel) 2022; 12:ani12243592. [PMID: 36552512 PMCID: PMC9774614 DOI: 10.3390/ani12243592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Revised: 11/07/2022] [Accepted: 12/16/2022] [Indexed: 12/23/2022] Open
Abstract
The RNA-Seq data provides new opportunities for the detection of transcriptome variants' single nucleotide polymorphisms (SNPs) in various species and tissues. Herein, milk samples from two sheep breeds and two cow breeds were utilized to characterize the genetic variation in the coding regions in three stages (before-peak (BP), peak (P), and after-peak (AP)) of the lactation process. In sheep breeds Assaf and Churra, 100,462 and 97,768, 65,996 and 62,161, and 78,656 and 39,245 variants were observed for BP, P, and AP lactation stages, respectively. The number of specific variants was 59,798 and 76,419, 11,483 and 49,210, and 104,033 and 320,817 in cow breeds Jersy and Kashmiri, respectively, for BP, P, and AP stages. Via the transcriptome analysis of variation in regions containing QTL for fat, protein percentages, and milk yield, we detected a number of pathways and genes harboring mutations that could influence milk production attributes. Many SNPs detected here can be regarded as appropriate markers for custom SNP arrays or genotyping platforms to conduct association analyses among commercial populations. The results of this study offer new insights into milk production genetic mechanisms in cow and sheep breeds, which can contribute to designing suitable breeding systems for optimal milk production.
Collapse
|
31
|
Li J, Mukiibi R, Jiminez J, Wang Z, Akanno EC, Timsit E, Plastow GS. Applying multi-omics data to study the genetic background of bovine respiratory disease infection in feedlot crossbred cattle. Front Genet 2022; 13:1046192. [PMID: 36579334 PMCID: PMC9790935 DOI: 10.3389/fgene.2022.1046192] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 11/28/2022] [Indexed: 12/15/2022] Open
Abstract
Bovine respiratory disease (BRD) is the most common and costly infectious disease affecting the wellbeing and productivity of beef cattle in North America. BRD is a complex disease whose development is dependent on environmental factors and host genetics. Due to the polymicrobial nature of BRD, our understanding of the genetic and molecular mechanisms underlying the disease is still limited. This knowledge would augment the development of better genetic/genomic selection strategies and more accurate diagnostic tools to reduce BRD prevalence. Therefore, this study aimed to utilize multi-omics data (genomics, transcriptomics, and metabolomics) analyses to study the genetic and molecular mechanisms of BRD infection. Blood samples of 143 cattle (80 BRD; 63 non-BRD animals) were collected for genotyping, RNA sequencing, and metabolite profiling. Firstly, a genome-wide association study (GWAS) was performed for BRD susceptibility using 207,038 SNPs. Two SNPs (Chr5:25858264 and BovineHD1800016801) were identified as associated (p-value <1 × 10-5) with BRD susceptibility. Secondly, differential gene expression between BRD and non-BRD animals was studied. At the significance threshold used (log2FC>2, logCPM>2, and FDR<0.01), 101 differentially expressed (DE) genes were identified. These DE genes significantly (p-value <0.05) enriched several immune responses related functions such as inflammatory response. Additionally, we performed expression quantitative trait loci (eQTL) analysis and identified 420 cis-eQTLs and 144 trans-eQTLs significantly (FDR <0.05) associated with the expression of DE genes. Interestingly, eQTL results indicated the most significant SNP (Chr5:25858264) identified via GWAS was a cis-eQTL for DE gene GPR84. This analysis also demonstrated that an important SNP (rs209419196) located in the promoter region of the DE gene BPI significantly influenced the expression of this gene. Finally, the abundance of 31 metabolites was significantly (FDR <0.05) different between BRD and non-BRD animals, and 17 of them showed correlations with multiple DE genes, which shed light on the interactions between immune response and metabolism. This study identified associations between genome, transcriptome, metabolome, and BRD phenotype of feedlot crossbred cattle. The findings may be useful for the development of genomic selection strategies for BRD susceptibility, and for the development of new diagnostic and therapeutic tools.
Collapse
Affiliation(s)
- Jiyuan Li
- Livestock Gentec, Department of Agriculture, Food & Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Robert Mukiibi
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Janelle Jiminez
- Livestock Gentec, Department of Agriculture, Food & Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Zhiquan Wang
- Livestock Gentec, Department of Agriculture, Food & Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Everestus C. Akanno
- Livestock Gentec, Department of Agriculture, Food & Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Edouard Timsit
- Faculty of Veterinary Medicine, University of Calgary, Calgary, AB, Canada
| | - Graham S. Plastow
- Livestock Gentec, Department of Agriculture, Food & Nutritional Science, University of Alberta, Edmonton, AB, Canada,*Correspondence: Graham S. Plastow,
| |
Collapse
|
32
|
Li B, Gschwend AR, Hovick SM, Gutek A, McHale L, Harrison SK, Regnier EE. Evolution of weedy giant ragweed ( Ambrosia trifida): Multiple origins and gene expression variability facilitates weediness. Ecol Evol 2022; 12:e9590. [PMID: 36514541 PMCID: PMC9731915 DOI: 10.1002/ece3.9590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 11/16/2022] [Indexed: 12/13/2022] Open
Abstract
Agricultural weeds may originate from wild populations, but the origination patterns and genetics underlying this transition remain largely unknown. Analysis of weedy-wild paired populations from independent locations may provide evidence to identify key genetic variation contributing to this adaptive shift. We performed genetic variation and expression analyses on transcriptome data from 67 giant ragweed samples collected from different locations in Ohio, Iowa, and Minnesota and found geographically separated weedy populations likely originated independently from their adjacent wild populations, but subsequent spreading of weedy populations also occurred locally. By using eight closely related weedy-wild paired populations, we identified thousands of unique transcripts in weedy populations that reflect shared or specific functions corresponding, respectively, to both convergently evolved and population-specific weediness processes. In addition, differential expression of specific groups of genes was detected between weedy and wild giant ragweed populations using gene expression diversity and gene co-expression network analyses. Our study suggests an integrated route of weedy giant ragweed origination, consisting of independent origination combined with the subsequent spreading of certain weedy populations, and provides several lines of evidence to support the hypothesis that gene expression variability plays a key role in the evolution of weedy species.
Collapse
Affiliation(s)
- Bo Li
- Department of Horticulture and Crop SciencesThe Ohio State UniversityColumbusOhioUSA
| | - Andrea R. Gschwend
- Department of Horticulture and Crop SciencesThe Ohio State UniversityColumbusOhioUSA
| | - Stephen M. Hovick
- Department of Evolution, Ecology and Organismal BiologyThe Ohio State UniversityColumbusOhioUSA
| | - Amanda Gutek
- Department of Horticulture and Crop SciencesThe Ohio State UniversityColumbusOhioUSA
| | - Leah McHale
- Department of Horticulture and Crop SciencesThe Ohio State UniversityColumbusOhioUSA
| | - S. Kent Harrison
- Department of Horticulture and Crop SciencesThe Ohio State UniversityColumbusOhioUSA
| | - Emilie E. Regnier
- Department of Horticulture and Crop SciencesThe Ohio State UniversityColumbusOhioUSA
| |
Collapse
|
33
|
Perry I, Hernadi SB, Cunha L, Short S, Marchbank A, Spurgeon DJ, Orozco-terWengel P, Kille P. Molecular insights into high-altitude adaption and acclimatisation of Aporrectodea caliginosa. Life Sci Alliance 2022; 5:5/11/e202201513. [PMID: 35977843 PMCID: PMC9386962 DOI: 10.26508/lsa.202201513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 07/28/2022] [Accepted: 07/29/2022] [Indexed: 12/02/2022] Open
Abstract
A megabase genome assembly for Aporrectodea caliginosa is presented with transcriptomic and SNP-based evidence for acclimatisation and adaption to extreme weather conditions found at high altitude. Here, we explore the high-altitude adaptions and acclimatisation of Aporrectodea caliginosa. Population diversity is assessed through mitochondrial barcoding, identifying closely related populations across the island of Pico (Azores). We present the first megabase N50 assembly size (1.2 Mbp) genome for A. caliginosa. High- and low-altitude populations were exposed experimentally to a range of oxygen and temperature conditions, simulating altitudinal conditions, and the transcriptomic responses explored. SNP densities are assessed to identify signatures of selective pressure and their link to differentially expressed genes. The high-altitude A. caliginosa population had lower differential expression and fewer co-expressed genes between conditions, indicating a more condition-refined epigenetic response. Genes identified as under adaptive pressure through Fst and nucleotide diversity in the high-altitude population clustered around the differentially expressed an upstream environmental response control gene, HMGB1. The high-altitude population of A. caliginosa indicated adaption and acclimatisation to high-altitude conditions and suggested resilience to extreme weather events. This mechanistic understanding could help offer a strategy in further identifying other species capable of maintaining soil fertility in extreme environments.
Collapse
Affiliation(s)
- Iain Perry
- Organisms and Environment, Cardiff University, Wales, UK .,Wales Gene Park, Cardiff University, Wales, UK
| | | | - Luis Cunha
- Department of Life Sciences, Centre for Functional Ecology, University of Coimbra, Coimbra, Portugal.,School of Applied Sciences, University of South Wales, Wales, UK
| | - Stephen Short
- Organisms and Environment, Cardiff University, Wales, UK.,UK Centre for Ecology and Hydrology, Maclean Building, Wallingford, UK
| | | | - David J Spurgeon
- UK Centre for Ecology and Hydrology, Maclean Building, Wallingford, UK
| | | | - Peter Kille
- Organisms and Environment, Cardiff University, Wales, UK
| |
Collapse
|
34
|
Qian Z, Li X, He L, Gu S, Shen Q, Rao X, Zhang R, Di Y, Xie L, Wang X, Chen S, Dong Y, Li F. EfGD: the Erianthus fulvus genome database. Database (Oxford) 2022; 2022:6679393. [PMID: 36043401 PMCID: PMC9428683 DOI: 10.1093/database/baac076] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Revised: 08/10/2022] [Accepted: 08/25/2022] [Indexed: 11/30/2022]
Abstract
Erianthus fulvus (TaxID: 154759) is a valuable germplasm resource in sugarcane breeding and research and has excellent agronomic traits, such as drought resistance, cold resistance, barren tolerance and high brix. With a stable chromosome number (2n = 20) and a small genome (0.9 Gb), it is an ideal candidate for research on sugarcane. Next-generation sequencing technology has enabled a growing number of studies to focus on genomics. Due to the large amount of omics data available, a centralized platform is necessary for ensuring the consistency, independence and maintainability of these large-scale datasets through storage, analysis and integration. Here, we present a comprehensive database for the E. fulvus genome, EfGD. By using the new high-quality reference genome and its annotations, the EfGD provides the largest whole-genome sequencing reference dataset for E. fulvus, which archives 27 165 protein-coding genes and 55 564 488 SNPs from 202 newly resequenced genomes. Furthermore, we created a user-friendly graphical interface for visualizing genomic diversity, population structure and evolution and provided other tools on an open platform. Database URL: https://efgenome.ynau.edu.cn
Collapse
Affiliation(s)
- Zhenfeng Qian
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Xuzhen Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- College of Biological Big Data, Yunnan Agriculture University , Kunming, No. 95 Jinhei Road, Yunnan 650201, China
| | - Lilian He
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Shujie Gu
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Qingqing Shen
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Xibing Rao
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Rongqiong Zhang
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Yining Di
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Linyan Xie
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Xianhong Wang
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Shuying Chen
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| | - Yang Dong
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- College of Biological Big Data, Yunnan Agriculture University , Kunming, No. 95 Jinhei Road, Yunnan 650201, China
| | - Fusheng Li
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- College of Agronomy and Biotechnology, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
- Sugarcane Research Institute, Yunnan Agricultural University , No. 95 Jinhei Road, Kunming 650201, China
| |
Collapse
|
35
|
Silva-Vignato B, Cesar ASM, Afonso J, Moreira GCM, Poleti MD, Petrini J, Garcia IS, Clemente LG, Mourão GB, Regitano LCDA, Coutinho LL. Integrative Analysis Between Genome-Wide Association Study and Expression Quantitative Trait Loci Reveals Bovine Muscle Gene Expression Regulatory Polymorphisms Associated With Intramuscular Fat and Backfat Thickness. Front Genet 2022; 13:935238. [PMID: 35991540 PMCID: PMC9386181 DOI: 10.3389/fgene.2022.935238] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 06/23/2022] [Indexed: 11/13/2022] Open
Abstract
Understanding the architecture of gene expression is fundamental to unravel the molecular mechanisms regulating complex traits in bovine, such as intramuscular fat content (IMF) and backfat thickness (BFT). These traits are economically important for the beef industry since they affect carcass and meat quality. Our main goal was to identify gene expression regulatory polymorphisms within genomic regions (QTL) associated with IMF and BFT in Nellore cattle. For that, we used RNA-Seq data from 193 Nellore steers to perform SNP calling analysis. Then, we combined the RNA-Seq SNP and a high-density SNP panel to obtain a new dataset for further genome-wide association analysis (GWAS), totaling 534,928 SNPs. GWAS was performed using the Bayes B model. Twenty-one relevant QTL were associated with our target traits. The expression quantitative trait loci (eQTL) analysis was performed using Matrix eQTL with the complete SNP dataset and 12,991 genes, revealing a total of 71,033 cis and 36,497 trans-eQTL (FDR < 0.05). Intersecting with QTL for IMF, we found 231 eQTL regulating the expression levels of 117 genes. Within those eQTL, three predicted deleterious SNPs were identified. We also identified 109 eQTL associated with BFT and affecting the expression of 54 genes. This study revealed genomic regions and regulatory SNPs associated with fat deposition in Nellore cattle. We highlight the transcription factors FOXP4, FOXO3, ZSCAN2, and EBF4, involved in lipid metabolism-related pathways. These results helped us to improve our knowledge about the genetic architecture behind important traits in cattle.
Collapse
Affiliation(s)
- Bárbara Silva-Vignato
- Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
| | - Aline Silva Mello Cesar
- Department of Agroindustry, Food, and Nutrition, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
| | | | | | - Mirele Daiana Poleti
- College of Animal Science and Food Engineering, University of São Paulo, Pirassununga, Brazil
| | - Juliana Petrini
- Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
| | - Ingrid Soares Garcia
- Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
| | - Luan Gaspar Clemente
- Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
| | - Gerson Barreto Mourão
- Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
| | | | - Luiz Lehmann Coutinho
- Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, Brazil
- *Correspondence: Luiz Lehmann Coutinho,
| |
Collapse
|
36
|
Zhang WY, Yuan Y, Zhang HY, He YM, Liu CL, Xu L, Yang BG, Ren HX, Wang GF, E GX. Genetic basis investigation of wattle phenotype in goat using genome-wide sequence data. Anim Genet 2022; 53:700-705. [PMID: 35748186 DOI: 10.1111/age.13235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 06/06/2022] [Accepted: 06/13/2022] [Indexed: 11/29/2022]
Abstract
In domestic goats, wattles often appear in even numbers, mostly on the neck and a few under the ear. Goat wattle is composed of ectopic cartilage tissue covered by skin and was reported as a dominant inheritance. Thirty-eight goats from two Southwest Chinese breeds were studied to elucidate the genetic basis of wattle phenotype in goat. Their genomes were sequenced for wide-genome selective sweep analysis (WGSA) and a genome-wide association study (GWAS). The WGSA results revealed 500 candidate genes identified by fixation index and π ratio and 261 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways enriched with 195 genes and 38 significantly enriched KEGG items. In particular, three chondrogenesis-related pathways (Wnt, Hippo and MAPK signaling pathways) were found. Among the 500 genes, 474 were enriched to 2855 Gene Ontology items, and four (BMP2, BMP4, RARA and MSX1) were annotated in the regulation and development of chondrogenesis. Four chondrogenesis-related genes (GREM1, NEDD4, ATG7 and ITGA1) were identified from 519 single-nucleotide polymorphisms (SNPs) with a GWAS above the threshold. Six and 11 SNPs on chromosome 10 are located on GREM1 and NEDD4 respectively, and the highest numbers of SNPs on chromosomes 20 and 22 are located on ITGA1 and ATG7 respectively. All of these genes are related to cartilage development. This study identified a series of genes related to chondroplasia by GWAS and WGSA and presented the possibility that wattle inheritance may be influenced by multiple genes. This work provides a new theoretical understanding of the hereditary basis of wattle phenotype.
Collapse
Affiliation(s)
- Wei-Yi Zhang
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Ying Yuan
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Hao-Yuan Zhang
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Yong-Meng He
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Cheng-Li Liu
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Lu Xu
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Bai-Gao Yang
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Hang-Xing Ren
- Chongqing Academy of Animal Sciences, Chongqing, China
| | - Gao-Fu Wang
- Chongqing Academy of Animal Sciences, Chongqing, China
| | - Guang-Xin E
- College of Animal Science and Technology, Southwest University, Chongqing, China
| |
Collapse
|
37
|
Characterization of Potential Molecular Markers in Lac Insect Kerria lacca (Kerr) Responsible for Lac Production. INSECTS 2022; 13:insects13060545. [PMID: 35735882 PMCID: PMC9225327 DOI: 10.3390/insects13060545] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 06/07/2022] [Accepted: 06/10/2022] [Indexed: 02/04/2023]
Abstract
Kerria lacca (Kerr) is an important lac insect extensively used in industrial products in the form of resin, wax and dye. The scarce knowledge on molecular markers for K. lacca is a barrier in elucidating genetic information. Our study identified a total of 16,921 single-nucleotide polymorphisms (SNPs), and 6231 insertions and deletions (InDels)-of which, intergenic variation accounted for 41.22% and 56.30%, and exonic variation accounted for 39.10% and 17.46%, of SNPs and InDels, respectively. Observation of SNPs suggested that nucleotide substitution frequency and transition to transversion (Ts/Tv) ratio were highest at the late adult stage, 3.97, compared to at the other stages, with a genome-wide Ts/Tv ratio of 2.95. The maximum number of SNPs, 2853 (16.86%), was identified in chromosome 8, while the lowest, 1126 (6.65%), was identified in chromosome 7. The maximum and minimum numbers of InDels were located on chromosome 1 and 7, with 834 (13.38%) and 519 (8.33%), respectively. Annotation showed that highest numbers of exonic and intergenic SNPs were present at the late adult stage, whereas the maximum number of InDels was found at the larval stage. On the basis of gene function, 47 gene variations were screened and 23 candidate genes were identified in associations with lac production. Concluding work will enhance knowledge on molecular markers to facilitate an increase in lac production in K. lacca as well as other lac insects.
Collapse
|
38
|
Novel homozygous nonsense mutation of MLIP and compensatory alternative splicing. NPJ Genom Med 2022; 7:36. [PMID: 35672413 PMCID: PMC9174206 DOI: 10.1038/s41525-022-00307-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 05/11/2022] [Indexed: 11/25/2022] Open
Abstract
Despite the growing accessibility of clinical sequencing, functional interpretation of variants remains a major hurdle to molecular diagnostics of Mendelian diseases. We aimed to describe a new adult-onset myopathy with muscle weakness and hyperCKemia caused by a nonsense variant in muscular LMNA-interacting protein (MLIP). Following RNA-sequencing, differential expression analysis uncovered a significant downregulation of this gene, which had a surprisingly mild effect on MLIP protein expression. RT-PCR and long-read sequencing (LRS) both support an important transcriptome shift in the patient, where decreased MLIP levels are seemingly due to nonsense-mediated decay of transcripts containing the exon 5 mutation. Moreover, a compensatory mechanism upregulates the functionally lacking isoforms and generates novel transcripts. These results support the recently discovered clinical implications of MLIP variants in myopathies, highlighting for the first time its relevance in adult-onset cases. These results also underline the power of LRS as a tool for the functional assessment of variants of unknown significance (VUS), as well as the definition of accurate isoform profile annotations in a tissue-specific manner.
Collapse
|
39
|
Targeted RNAseq Improves Clinical Diagnosis of Very Early-Onset Pediatric Immune Dysregulation. J Pers Med 2022; 12:jpm12060919. [PMID: 35743704 PMCID: PMC9224647 DOI: 10.3390/jpm12060919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/26/2022] [Accepted: 05/27/2022] [Indexed: 02/05/2023] Open
Abstract
Despite increased use of whole exome sequencing (WES) for the clinical analysis of rare disease, overall diagnostic yield for most disorders hovers around 30%. Previous studies of mRNA have succeeded in increasing diagnoses for clearly defined disorders of monogenic inheritance. We asked if targeted RNA sequencing could provide similar benefits for primary immunodeficiencies (PIDs) and very early-onset inflammatory bowel disease (VEOIBD), both of which are difficult to diagnose due to high heterogeneity and variable severity. We performed targeted RNA sequencing of a panel of 260 immune-related genes for a cohort of 13 patients (seven suspected PID cases and six VEOIBD) and analyzed variants, splicing, and exon usage. Exonic variants were identified in seven cases, some of which had been previously prioritized by exome sequencing. For four cases, allele specific expression or lack thereof provided additional insights into possible disease mechanisms. In addition, we identified five instances of aberrant splicing associated with four variants. Three of these variants had been previously classified as benign in ClinVar based on population frequency. Digenic or oligogenic inheritance is suggested for at least two patients. In addition to validating the use of targeted RNA sequencing, our results show that rare disease research will benefit from incorporating contributing genetic factors into the diagnostic approach.
Collapse
|
40
|
Ahmed Z, Renart EG, Zeeshan S. Investigating underlying human immunity genes, implicated diseases and their relationship to COVID-19. Per Med 2022; 19:229-250. [PMID: 35261286 PMCID: PMC8919975 DOI: 10.2217/pme-2021-0132] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Aim: A human immunogenetics variation study was conducted in samples collected from diverse COVID-19 populations. Materials & methods: Whole-genome and whole-exome sequencing (WGS/WES), data processing, analysis and visualization pipeline were applied to identify variants associated with genes of interest. Results: A total of 2886 mutations were found across the entire set of 13 genomes. Functional annotation of the gene variants revealed mutation type and protein change. Many variants were found to be biologically implicated in COVID-19. The involvement of these genes was also found in multiple other diseases. Conclusion: The analysis determined that ACE2, TMPRSS4, TMPRSS2, SLC6A20 and FYCOI had functional implications and TMPRSS4 was the gene most altered in virally infected patients. The quest to establish an understanding of the genetics underlying COVID-19 is a central focus of life sciences today. COVID-19 is triggered by SARS-CoV-2, a single-stranded RNA respiratory virus. Several clinical-genomics studies have emerged positing different human gene mutations occurring due to COVID-19. A global analysis of these genes was conducted targeting major components of the immune system to identify possible variations likely to be involved in COVID-19 predisposition. Gene-variant analysis was performed on whole-genome sequencing samples collected from diverse populations. ACE2, TMPRSS4, TMPRSS2, SLC6A20 and FYCOI were found to have functional implications and TMPRSS4 may have a role in the severity of clinical manifestations of COVID-19.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy & Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ 08901, USA.,Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical & Health Sciences, 125 Paterson Street, New Brunswick, NJ 08901, USA
| | - Eduard Gibert Renart
- Rutgers Institute for Health, Health Care Policy & Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ 08901, USA
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, Rutgers University, 195 Little Albany St, New Brunswick, NJ 08901, USA
| |
Collapse
|
41
|
Herrera-Rivero M, Gandhi S, Witten A, Ghalawinji A, Schotten U, Stoll M. Cardiac chamber-specific genetic alterations suggest candidate genes and pathways implicating the left ventricle in the pathogenesis of atrial fibrillation. Genomics 2022; 114:110320. [PMID: 35218871 DOI: 10.1016/j.ygeno.2022.110320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 01/12/2022] [Accepted: 02/19/2022] [Indexed: 11/15/2022]
Abstract
It is believed that the atria play a predominant role in the initiation and maintenance of atrial fibrillation (AF), while the role of left ventricular dysfunction in the pathophysiology remains enigmatic. We sought to dissect chamber specificity of AF-associated transcriptional changes using RNA-sequencing. We performed intra- and inter-chamber differential expression analyses comparing AF against sinus rhythm to identify genes specifically dysregulated in human left atria, right atria, and left ventricle (LV), and integrated known AF genetic associations with expression quantitative trait loci datasets to inform the potential for disease causal contributions within each chamber. Inter-chamber patterns changed drastically. Vast AF-associated transcriptional changes specific to LV, enriched for biological pathway terms implicating mitochondrial function, developmental processes and immunity, were supported at the genetic level, but no major enrichments for candidate genes specific to the atria were found. Our observations suggest an active role of the LV in the pathogenesis of AF.
Collapse
Affiliation(s)
- Marisol Herrera-Rivero
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
| | - Shrey Gandhi
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany; Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Anika Witten
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
| | - Amer Ghalawinji
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
| | - Ulrich Schotten
- Department of Physiology, CARIM School for Cardiovascular Diseases, Maastricht University, Maastricht, the Netherlands
| | - Monika Stoll
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany; Department of Biochemistry, Genetic Epidemiology and Statistical Genetics, CARIM School for Cardiovascular Diseases, Maastricht University, Maastricht, the Netherlands.
| |
Collapse
|
42
|
Liu J, Shen Q, Bao H. Comparison of seven SNP calling pipelines for the next-generation sequencing data of chickens. PLoS One 2022; 17:e0262574. [PMID: 35100292 PMCID: PMC8803190 DOI: 10.1371/journal.pone.0262574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 12/29/2021] [Indexed: 11/18/2022] Open
Abstract
Single nucleotide polymorphisms (SNPs) are widely used in genome-wide association studies and population genetics analyses. Next-generation sequencing (NGS) has become convenient, and many SNP-calling pipelines have been developed for human NGS data. We took advantage of a gap knowledge in selecting the appropriated SNP calling pipeline to handle with high-throughput NGS data. To fill this gap, we studied and compared seven SNP calling pipelines, which include 16GT, genome analysis toolkit (GATK), Bcftools-single (Bcftools single sample mode), Bcftools-multiple (Bcftools multiple sample mode), VarScan2-single (VarScan2 single sample mode), VarScan2-multiple (VarScan2 multiple sample mode) and Freebayes pipelines, using 96 NGS data with the different depth gradients of approximately 5X, 10X, 20X, 30X, 40X, and 50X coverage from 16 Rhode Island Red chickens. The sixteen chickens were also genotyped with a 50K SNP array, and the sensitivity and specificity of each pipeline were assessed by comparison to the results of SNP arrays. For each pipeline, except Freebayes, the number of detected SNPs increased as the input read depth increased. In comparison with other pipelines, 16GT, followed by Bcftools-multiple, obtained the most SNPs when the input coverage exceeded 10X, and Bcftools-multiple obtained the most when the input was 5X and 10X. The sensitivity and specificity of each pipeline increased with increasing input. Bcftools-multiple had the highest sensitivity numerically when the input ranged from 5X to 30X, and 16GT showed the highest sensitivity when the input was 40X and 50X. Bcftools-multiple also had the highest specificity, followed by GATK, at almost all input levels. For most calling pipelines, there were no obvious changes in SNP numbers, sensitivities or specificities beyond 20X. In conclusion, (1) if only SNPs were detected, the sequencing depth did not need to exceed 20X; (2) the Bcftools-multiple may be the best choice for detecting SNPs from chicken NGS data, but for a single sample or sequencing depth greater than 20X, 16GT was recommended. Our findings provide a reference for researchers to select suitable pipelines to obtain SNPs from the NGS data of chickens or nonhuman animals.
Collapse
Affiliation(s)
- Jing Liu
- National Engineering Laboratory for Animal Breeding, Beijing Key Laboratory for Animal Genetic Improvement, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Qingmiao Shen
- National Engineering Laboratory for Animal Breeding, Beijing Key Laboratory for Animal Genetic Improvement, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Haigang Bao
- National Engineering Laboratory for Animal Breeding, Beijing Key Laboratory for Animal Genetic Improvement, College of Animal Science and Technology, China Agricultural University, Beijing, China
- * E-mail:
| |
Collapse
|
43
|
Han J, Munro JE, Kocoski A, Barry AE, Bahlo M. Population-level genome-wide STR discovery and validation for population structure and genetic diversity assessment of Plasmodium species. PLoS Genet 2022; 18:e1009604. [PMID: 35007277 PMCID: PMC8782505 DOI: 10.1371/journal.pgen.1009604] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 01/21/2022] [Accepted: 12/14/2021] [Indexed: 11/18/2022] Open
Abstract
Short tandem repeats (STRs) are highly informative genetic markers that have been used extensively in population genetics analysis. They are an important source of genetic diversity and can also have functional impact. Despite the availability of bioinformatic methods that permit large-scale genome-wide genotyping of STRs from whole genome sequencing data, they have not previously been applied to sequencing data from large collections of malaria parasite field samples. Here, we have genotyped STRs using HipSTR in more than 3,000 Plasmodium falciparum and 174 Plasmodium vivax published whole-genome sequence data from samples collected across the globe. High levels of noise and variability in the resultant callset necessitated the development of a novel method for quality control of STR genotype calls. A set of high-quality STR loci (6,768 from P. falciparum and 3,496 from P. vivax) were used to study Plasmodium genetic diversity, population structures and genomic signatures of selection and these were compared to genome-wide single nucleotide polymorphism (SNP) genotyping data. In addition, the genome-wide information about genetic variation and other characteristics of STRs in P. falciparum and P. vivax have been available in an interactive web-based R Shiny application PlasmoSTR (https://github.com/bahlolab/PlasmoSTR).
Collapse
Affiliation(s)
- Jiru Han
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- Department of Medical Biology, The University of Melbourne, Melbourne, Australia
| | - Jacob E. Munro
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- Department of Medical Biology, The University of Melbourne, Melbourne, Australia
| | - Anthony Kocoski
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- Department of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
| | - Alyssa E. Barry
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- Department of Medical Biology, The University of Melbourne, Melbourne, Australia
- Disease Elimination Program, Burnet Institute, Melbourne, Australia
- IMPACT Institute for Innovation in Mental and Physical Health and Clinical Translation, Deakin University, Geelong, Australia
| | - Melanie Bahlo
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- Department of Medical Biology, The University of Melbourne, Melbourne, Australia
- * E-mail:
| |
Collapse
|
44
|
Brouard JS, Bissonnette N. Variant Calling from RNA-seq Data Using the GATK Joint Genotyping Workflow. Methods Mol Biol 2022; 2493:205-233. [PMID: 35751817 DOI: 10.1007/978-1-0716-2293-3_13] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The Genome Analysis Toolkit (GATK) developed at the Broad Institute provides state-of-the-art pipelines for germline and somatic variant discovery and genotyping. Unfortunately, the fully validated GATK pipeline for calling variant on RNAseq data is a Per-sample workflow that does not include the recent improvements seen in modern workflows, especially the possibility to perform joint genotyping analysis. Here, we describe how modern GATK commands from distinct workflows can be combined to call variants on RNAseq samples. We provide a detailed tutorial that starts with raw RNAseq reads and ends with filtered variants, of which some were shown to be associated with bovine paratuberculosis.
Collapse
|
45
|
Karimi MR, Karimi AH, Abolmaali S, Sadeghi M, Schmitz U. Prospects and challenges of cancer systems medicine: from genes to disease networks. Brief Bioinform 2021; 23:6361045. [PMID: 34471925 PMCID: PMC8769701 DOI: 10.1093/bib/bbab343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 08/02/2021] [Accepted: 08/03/2021] [Indexed: 12/20/2022] Open
Abstract
It is becoming evident that holistic perspectives toward cancer are crucial in deciphering the overwhelming complexity of tumors. Single-layer analysis of genome-wide data has greatly contributed to our understanding of cellular systems and their perturbations. However, fundamental gaps in our knowledge persist and hamper the design of effective interventions. It is becoming more apparent than ever, that cancer should not only be viewed as a disease of the genome but as a disease of the cellular system. Integrative multilayer approaches are emerging as vigorous assets in our endeavors to achieve systemic views on cancer biology. Herein, we provide a comprehensive review of the approaches, methods and technologies that can serve to achieve systemic perspectives of cancer. We start with genome-wide single-layer approaches of omics analyses of cellular systems and move on to multilayer integrative approaches in which in-depth descriptions of proteogenomics and network-based data analysis are provided. Proteogenomics is a remarkable example of how the integration of multiple levels of information can reduce our blind spots and increase the accuracy and reliability of our interpretations and network-based data analysis is a major approach for data interpretation and a robust scaffold for data integration and modeling. Overall, this review aims to increase cross-field awareness of the approaches and challenges regarding the omics-based study of cancer and to facilitate the necessary shift toward holistic approaches.
Collapse
Affiliation(s)
| | | | | | - Mehdi Sadeghi
- Department of Cell & Molecular Biology, Semnan University, Semnan, Iran
| | - Ulf Schmitz
- Department of Molecular & Cell Biology, James Cook University, Townsville, QLD 4811, Australia
| |
Collapse
|
46
|
Ahmed Z, Renart EG, Mishra D, Zeeshan S. JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping. FEBS Open Bio 2021; 11:2441-2452. [PMID: 34370400 PMCID: PMC8409305 DOI: 10.1002/2211-5463.13261] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 07/18/2021] [Accepted: 08/02/2021] [Indexed: 01/07/2023] Open
Abstract
Whole genome and exome sequencing (WGS/WES) are the most popular next‐generation sequencing (NGS) methodologies and are at present often used to detect rare and common genetic variants of clinical significance. We emphasize that automated sequence data processing, management, and visualization should be an indispensable component of modern WGS and WES data analysis for sequence assembly, variant detection (SNPs, SVs), imputation, and resolution of haplotypes. In this manuscript, we present a newly developed findable, accessible, interoperable, and reusable (FAIR) bioinformatics‐genomics pipeline Java based Whole Genome/Exome Sequence Data Processing Pipeline (JWES) for efficient variant discovery and interpretation, and big data modeling and visualization. JWES is a cross‐platform, user‐friendly, product line application, that entails three modules: (a) data processing, (b) storage, and (c) visualization. The data processing module performs a series of different tasks for variant calling, the data storage module efficiently manages high‐volume gene‐variant data, and the data visualization module supports variant data interpretation with Circos graphs. The performance of JWES was tested and validated in‐house with different experiments, using Microsoft Windows, macOS Big Sur, and UNIX operating systems. JWES is an open‐source and freely available pipeline, allowing scientists to take full advantage of all the computing resources available, without requiring much computer science knowledge. We have successfully applied JWES for processing, management, and gene‐variant discovery, annotation, prediction, and genotyping of WGS and WES data to analyze variable complex disorders. In summary, we report the performance of JWES with some reproducible case studies, using open access and in‐house generated, high‐quality datasets.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA.,Department of Medicine, Rutgers Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, New Brunswick, NJ, USA
| | - Eduard Gibert Renart
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| | - Deepshikha Mishra
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| |
Collapse
|
47
|
Identification of cancer-related mutations in human pluripotent stem cells using RNA-seq analysis. Nat Protoc 2021; 16:4522-4537. [PMID: 34363070 DOI: 10.1038/s41596-021-00591-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 06/16/2021] [Indexed: 01/10/2023]
Abstract
Human pluripotent stem cells (hPSCs) are known to acquire genetic aberrations during in vitro propagation. In addition to recurrent chromosomal aberrations, it has recently been shown that these cells also gain point mutations in cancer-related genes, predominantly in TP53. The need for routine quality control of hPSCs is critical for both basic research and clinical applications. Here we discuss the relevance of detecting mutations for various hPSCs applications, and present a detailed protocol to identify cancer-related point mutations using data from RNA sequencing, an assay commonly performed during the growth and differentiation of hPSCs. In this protocol, we describe how to process and align the sequencing data, analyze it and conservatively interpret the results in order to generate an accurate estimation of mutations in tumor-related genes. This pipeline is designed to work in high throughput and is available as a software container at https://github.com/elyadlezmi/RNA2CM . The protocol requires minimal command-line skills and can be carried out in 1-2 d.
Collapse
|
48
|
Ahmed Z, Renart EG, Zeeshan S. Genomics pipelines to investigate susceptibility in whole genome and exome sequenced data for variant discovery, annotation, prediction and genotyping. PeerJ 2021; 9:e11724. [PMID: 34395068 PMCID: PMC8320519 DOI: 10.7717/peerj.11724] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 06/14/2021] [Indexed: 12/12/2022] Open
Abstract
Over the last few decades, genomics is leading toward audacious future, and has been changing our views about conducting biomedical research, studying diseases, and understanding diversity in our society across the human species. The whole genome and exome sequencing (WGS/WES) are two of the most popular next-generation sequencing (NGS) methodologies that are currently being used to detect genetic variations of clinical significance. Investigating WGS/WES data for the variant discovery and genotyping is based on the nexus of different data analytic applications. Although several bioinformatics applications have been developed, and many of those are freely available and published. Timely finding and interpreting genetic variants are still challenging tasks among diagnostic laboratories and clinicians. In this study, we are interested in understanding, evaluating, and reporting the current state of solutions available to process the NGS data of variable lengths and types for the identification of variants, alleles, and haplotypes. Residing within the scope, we consulted high quality peer reviewed literature published in last 10 years. We were focused on the standalone and networked bioinformatics applications proposed to efficiently process WGS and WES data, and support downstream analysis for gene-variant discovery, annotation, prediction, and interpretation. We have discussed our findings in this manuscript, which include but not are limited to the set of operations, workflow, data handling, involved tools, technologies and algorithms and limitations of the assessed applications.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA.,Department of Medicine, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| | - Eduard Gibert Renart
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| | - Saman Zeeshan
- Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| |
Collapse
|
49
|
de Jong TV, Kim P, Guryev V, Mulligan MK, Williams RW, Redei EE, Chen H. Whole genome sequencing of nearly isogenic WMI and WLI inbred rats identifies genes potentially involved in depression and stress reactivity. Sci Rep 2021; 11:14774. [PMID: 34285244 PMCID: PMC8292482 DOI: 10.1038/s41598-021-92993-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 06/17/2021] [Indexed: 02/06/2023] Open
Abstract
The WMI and WLI inbred rats were generated from the stress-prone, and not yet fully inbred, Wistar Kyoto (WKY) strain. These were selected using bi-directional selection for immobility in the forced swim test and were then sib-mated for over 38 generations. Despite the low level of genetic diversity among WKY progenitors, the WMI substrain is significantly more vulnerable to stress relative to the counter-selected WLI strain. Here we quantify numbers and classes of genomic sequence variants distinguishing these substrains with the long term goal of uncovering functional and behavioral polymorphism that modulate sensitivity to stress and depression-like phenotypes. DNA from WLI and WMI was sequenced using Illumina xTen, IonTorrent, and 10X Chromium linked-read platforms to obtain a combined coverage of ~ 100X for each strain. We identified 4,296 high quality homozygous SNPs and indels between the WMI and WLI. We detected high impact variants in genes previously implicated in depression (e.g. Gnat2), depression-like behavior (e.g. Prlr, Nlrp1a), other psychiatric disease (e.g. Pou6f2, Kdm5a, Reep3, Wdfy3), and responses to psychological stressors (e.g. Pigr). High coverage sequencing data confirm that the two substrains are nearly coisogenic. Nonetheless, the small number of sequence variants contributes to numerous well characterized differences including depression-like behavior, stress reactivity, and addiction related phenotypes. These selected substrains are an ideal resource for forward and reverse genetic studies using a reduced complexity cross.
Collapse
Affiliation(s)
| | - Panjun Kim
- University of Tennessee Health Science Center, Memphis, TN, USA
| | - Victor Guryev
- European Research Institute for the Biology of Ageing, University of Groningen, Groningen, The Netherlands
| | | | | | - Eva E Redei
- Northwestern University - Chicago, Chicago, IL, USA
| | - Hao Chen
- University of Tennessee Health Science Center, Memphis, TN, USA.
| |
Collapse
|
50
|
Kuburas A, Mason BN, Hing B, Wattiez AS, Reis AS, Sowers LP, Moldovan Loomis C, Garcia-Martinez LF, Russo AF. PACAP Induces Light Aversion in Mice by an Inheritable Mechanism Independent of CGRP. J Neurosci 2021; 41:4697-4715. [PMID: 33846231 PMCID: PMC8260237 DOI: 10.1523/jneurosci.2200-20.2021] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 02/26/2021] [Accepted: 03/27/2021] [Indexed: 01/18/2023] Open
Abstract
The neuropeptides CGRP (calcitonin gene-related peptide) and PACAP (pituitary adenylate cyclase-activating polypeptide) have emerged as mediators of migraine, yet the potential overlap of their mechanisms remains unknown. Infusion of PACAP, like CGRP, can cause migraine in people, and both peptides share similar vasodilatory and nociceptive functions. In this study, we have used light aversion in mice as a surrogate for migraine-like photophobia to compare CGRP and PACAP and ask whether CGRP or PACAP actions were dependent on each other. Similar to CGRP, PACAP induced light aversion in outbred CD-1 mice. The light aversion was accompanied by increased resting in the dark, but not anxiety in a light-independent open field assay. Unexpectedly, about one-third of the CD-1 mice did not respond to PACAP, which was not seen with CGRP. The responder and nonresponder phenotypes were stable, inheritable, and not sex linked, although there was a trend for greater responses among male mice. RNA-sequencing analysis of trigeminal ganglia yielded hierarchical clustering of responder and nonresponder mice and revealed a number of candidate genes, including greater expression of the Trpc5 and Kcnk12 ion channels and glycoprotein hormones and receptors in a subset of male responder mice. Importantly, an anti-PACAP monoclonal antibody could block PACAP-induced light aversion but not CGRP-induced light aversion. Conversely, an anti-CGRP antibody could not block PACAP-induced light aversion. Thus, we propose that CGRP and PACAP act by independent convergent pathways that cause a migraine-like symptom in mice.SIGNIFICANCE STATEMENT The relationship between the neuropeptides CGRP (calcitonin gene-related peptide) and PACAP (pituitary adenylate cyclase-activating polypeptide) in migraine is relevant given that both peptides can induce migraine in people, yet to date only drugs that target CGRP are available. Using an outbred strain of mice, we were able to show that most, but not all, mice respond to PACAP in a preclinical photophobia assay. Our finding that CGRP and PACAP monoclonal antibodies do not cross-inhibit the other peptide indicates that CGRP and PACAP actions are independent and suggests that PACAP-targeted drugs may be effective in patients who do not respond to CGRP-based therapeutics.
Collapse
Affiliation(s)
- Adisa Kuburas
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
| | - Bianca N Mason
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
- Molecular and Cellular Biology Program, University of Iowa, Iowa City, Iowa 52242
| | - Benjamin Hing
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
| | - Anne-Sophie Wattiez
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
| | - Alyssa S Reis
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
| | - Levi P Sowers
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
- Center for the Prevention and Treatment of Visual Loss, Veterans Affairs Health Care System, Iowa City, Iowa 52246
| | | | | | - Andrew F Russo
- Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, Iowa 52242
- Department of Neurology, University of Iowa, Iowa City, Iowa 52242
- Center for the Prevention and Treatment of Visual Loss, Veterans Affairs Health Care System, Iowa City, Iowa 52246
| |
Collapse
|