1
|
Zhou X, Wu X, Pei C, He M, Chu M, Guo X, Liang C, Bao P, Yan P. Integrative analysis of Iso-Seq and RNA-seq data reveals transcriptome complexity and differential isoform in skin tissues of different hair length Yak. BMC Genomics 2024; 25:498. [PMID: 38773419 PMCID: PMC11106907 DOI: 10.1186/s12864-024-10345-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2024] [Accepted: 04/25/2024] [Indexed: 05/23/2024] Open
Abstract
BACKGROUND The hair follicle development process is regulated by sophisticated genes and signaling networks, and the hair grows from the hair follicle. The Tianzhu white yak population exhibits differences in hair length, especially on the forehead and shoulder region. However, the genetic mechanism is still unclear. Isoform sequencing (Iso-seq) technology with advantages in long reads sequencing. Hence, we combined the Iso-seq and RNA-seq methods to investigate the transcript complexity and difference between long-haired yak (LHY) and normal-haired yak (NHY). RESULTS The hair length measurement result showed a significant difference between LHY and NHY on the forehead and the shoulder (P-value < 0.001). The skin samples from the forehead and the shoulder of LHY and NHY were pooled for isoform sequencing (Iso-seq). We obtained numerous long transcripts, including novel isoforms, long non-coding RNA, alternative splicing events, and alternative polyadenylation events. Combined with RNA-seq data, we performed differential isoforms (DEIs) analysis between LHY and NHY. We found that some hair follicle and skin development-related DEIs, like BMP4, KRT2, IGF2R, and COL1A2 in the forehead skin; BMP1, KRT1, FGF5, COL2A1, and IGFBP5 in the shoulder skin. Enrichment analysis revealed that DEIs in both two comparable groups significantly participated in skin and hair follicle development-related pathways, like ECM-receptor interaction, focal adhesion, and PI3K-Akt signaling pathways. The results indicated that the hair follicle development of Tianzhu white yak may influence the hair length difference. Besides, the protein-protein interaction (PPI) network of DEIs showed COL2A1 and COL3A1 exhibited a high degree of centrality, and these two genes were suggested as potential candidates for the hair length growth of Tianzhu white yak. CONCLUSIONS The results provided a comprehensive analysis of the transcriptome complexity and identified differential transcripts that enhance our understanding of the molecular mechanisms underlying the variation in hair length growth in Tianzhu white yak.
Collapse
Affiliation(s)
- Xuelan Zhou
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China
| | - Xiaoyun Wu
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China
| | - Chengfang Pei
- Animal Husbandry Technology Promotion Station of Tianzhu County, 733000, Wuwei, P.R. China
| | - Meilan He
- Animal Husbandry Technology Promotion Station of Tianzhu County, 733000, Wuwei, P.R. China
| | - Min Chu
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China
| | - Xian Guo
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China
| | - Chunnian Liang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China
| | - Pengjia Bao
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China.
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China.
| | - Ping Yan
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, 730050, Lanzhou, P.R. China.
- Key Laboratory of Yak Breeding in Gansu Province, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, 730050, Lanzhou, P.R. China.
| |
Collapse
|
2
|
Lee DJ, Kim JH, Lee TH, Park ME, Ahn BO, Lee SJ, Cho JY, Kim CK. Selection of Catechin Biosynthesis-Related Genes and Functional Analysis from Chromosome-Level Genome Assembly in C. sinensis L. Variety 'Sangmok'. Int J Mol Sci 2024; 25:3634. [PMID: 38612446 PMCID: PMC11011610 DOI: 10.3390/ijms25073634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/14/2024] [Accepted: 03/18/2024] [Indexed: 04/14/2024] Open
Abstract
Camellia is an important plant genus that includes well-known species such as C. sinensis, C. oleifera, and C. japonica. The C. sinensis cultivar 'Sangmok', one of Korea's standard types of tea landraces, is a small evergreen tree or shrub. Genome annotation has shown that Korean tea plants have special and unique benefits and superior components, such as catechin. The genome of Camellia sinensis cultivar 'Sangmok' was assembled on the chromosome level, with a length of 2678.62 Mbp and GC content of 38.16%. Further, 15 chromosome-scale scaffolds comprising 82.43% of the assembly (BUSCO completeness, 94.3%) were identified. Analysis of 68,151 protein-coding genes showed an average of 5.003 exons per gene. Among 82,481 coding sequences, the majority (99.06%) were annotated by Uniprot/Swiss-Prot. Further analysis revealed that 'Sangmok' is closely related to C. sinensis, with a divergence time of 60 million years ago. A total of 3336 exclusive gene families in 'Sangmok' were revealed by gene ontology analysis to play roles in auxin transport and cellular response mechanisms. By comparing these exclusive genes with 551 similar catechin genes, 17 'Sangmok'-specific catechin genes were identified by qRT-PCR, including those involved in phytoalexin biosynthesis and related to cytochrome P450. The 'Sangmok' genome exhibited distinctive genes compared to those of related species. This comprehensive genomic investigation enhances our understanding of the genetic architecture of 'Sangmok' and its specialized functions. The findings contribute valuable insights into the evolutionary and functional aspects of this plant species.
Collapse
Affiliation(s)
- Dong-Jun Lee
- Genomics Division, National Institute of Agricultural Sciences (NAS), Jeonju 54874, Republic of Korea; (D.-J.L.)
| | - Jin-Hyun Kim
- Genomics Division, National Institute of Agricultural Sciences (NAS), Jeonju 54874, Republic of Korea; (D.-J.L.)
| | - Tae-Ho Lee
- Genomics Division, National Institute of Agricultural Sciences (NAS), Jeonju 54874, Republic of Korea; (D.-J.L.)
| | - Myung-Eun Park
- Genomics Division, National Institute of Agricultural Sciences (NAS), Jeonju 54874, Republic of Korea; (D.-J.L.)
| | - Byung-Ohg Ahn
- National Agrobiodiversity Center, National Institute of Agricultural Sciences (NAS), Jeonju 54874, Republic of Korea
| | - So-Jin Lee
- Research Institute of Climate Change and Agriculture (RICCA), Jeju-si 63240, Republic of Korea
| | - Jeong-Yong Cho
- Department of Food Science and Biotechnology, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Chang-Kug Kim
- Genomics Division, National Institute of Agricultural Sciences (NAS), Jeonju 54874, Republic of Korea; (D.-J.L.)
| |
Collapse
|
3
|
Wang W, Li W, Liu W, Wang Z, Xie B, Yang X, Tang Z. Exploring Multi-Tissue Alternative Splicing and Skeletal Muscle Metabolism Regulation in Obese- and Lean-Type Pigs. Genes (Basel) 2024; 15:196. [PMID: 38397185 PMCID: PMC10888101 DOI: 10.3390/genes15020196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 01/26/2024] [Accepted: 01/28/2024] [Indexed: 02/25/2024] Open
Abstract
Alternative splicing (AS) is a crucial mechanism in post-transcriptional regulation, contributing significantly to the diversity of the transcriptome and proteome. In this study, we performed a comprehensive AS profile in nine tissues obtained from Duroc (lean-type) and Luchuan (obese-type) pigs. Notably, 94,990 AS events from 14,393 genes were identified. Among these AS events, it was observed that 80% belonged to the skipped exon (SE) type. Functional enrichment analysis showed that genes with more than ten AS events were closely associated with tissue-specific functions. Additionally, the analysis of overlap between differentially alternative splicing genes (DSGs) and differentially expressed genes (DEGs) revealed the highest number of overlapped genes in the heart and skeletal muscle. The novelty of our study is that it identified and validated three genes (PYGM, MAPK11 and CAMK2B) in the glucagon signaling pathway, and their alternative splicing differences were highly significant across two pig breeds. In conclusion, our study offers novel insights into the molecular regulation of diverse tissue physiologies and the phenotypic differences between obese- and lean-type pigs, which are helpful for pig breeding.
Collapse
Affiliation(s)
- Wei Wang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education and Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan 430070, China;
- Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Foshan 528226, China; (W.L.); (W.L.); (Z.W.)
| | - Wangchang Li
- Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Foshan 528226, China; (W.L.); (W.L.); (Z.W.)
- Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science & Technology, Guangxi University, Nanning 530004, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Weiwei Liu
- Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Foshan 528226, China; (W.L.); (W.L.); (Z.W.)
- Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science & Technology, Guangxi University, Nanning 530004, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Zishuai Wang
- Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Foshan 528226, China; (W.L.); (W.L.); (Z.W.)
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Bingkun Xie
- Animal Husbandry Research Institute, Guangxi Vocational University of Agriculture, Nanning 530001, China;
| | - Xiaogan Yang
- Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science & Technology, Guangxi University, Nanning 530004, China
| | - Zhonglin Tang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education and Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan 430070, China;
- Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Foshan 528226, China; (W.L.); (W.L.); (Z.W.)
- Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science & Technology, Guangxi University, Nanning 530004, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Animal Husbandry Research Institute, Guangxi Vocational University of Agriculture, Nanning 530001, China;
| |
Collapse
|
4
|
Beiki H, Murdoch BM, Park CA, Kern C, Kontechy D, Becker G, Rincon G, Jiang H, Zhou H, Thorne J, Koltes JE, Michal JJ, Davenport K, Rijnkels M, Ross PJ, Hu R, Corum S, McKay S, Smith TPL, Liu W, Ma W, Zhang X, Xu X, Han X, Jiang Z, Hu ZL, Reecy JM. Enhanced bovine genome annotation through integration of transcriptomics and epi-transcriptomics datasets facilitates genomic biology. Gigascience 2024; 13:giae019. [PMID: 38626724 PMCID: PMC11020238 DOI: 10.1093/gigascience/giae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 07/29/2023] [Accepted: 03/27/2024] [Indexed: 04/18/2024] Open
Abstract
BACKGROUND The accurate identification of the functional elements in the bovine genome is a fundamental requirement for high-quality analysis of data informing both genome biology and genomic selection. Functional annotation of the bovine genome was performed to identify a more complete catalog of transcript isoforms across bovine tissues. RESULTS A total of 160,820 unique transcripts (50% protein coding) representing 34,882 unique genes (60% protein coding) were identified across tissues. Among them, 118,563 transcripts (73% of the total) were structurally validated by independent datasets (PacBio isoform sequencing data, Oxford Nanopore Technologies sequencing data, de novo assembled transcripts from RNA sequencing data) and comparison with Ensembl and NCBI gene sets. In addition, all transcripts were supported by extensive data from different technologies such as whole transcriptome termini site sequencing, RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression, chromatin immunoprecipitation sequencing, and assay for transposase-accessible chromatin using sequencing. A large proportion of identified transcripts (69%) were unannotated, of which 86% were produced by annotated genes and 14% by unannotated genes. A median of two 5' untranslated regions were expressed per gene. Around 50% of protein-coding genes in each tissue were bifunctional and transcribed both coding and noncoding isoforms. Furthermore, we identified 3,744 genes that functioned as noncoding genes in fetal tissues but as protein-coding genes in adult tissues. Our new bovine genome annotation extended more than 11,000 annotated gene borders compared to Ensembl or NCBI annotations. The resulting bovine transcriptome was integrated with publicly available quantitative trait loci data to study tissue-tissue interconnection involved in different traits and construct the first bovine trait similarity network. CONCLUSIONS These validated results show significant improvement over current bovine genome annotations.
Collapse
Affiliation(s)
- Hamid Beiki
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA
| | - Brenda M Murdoch
- Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
| | - Carissa A Park
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA
| | - Chandlar Kern
- Department of Animal Science, Pennsylvania State University, PA 16802, USA
| | - Denise Kontechy
- Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
| | - Gabrielle Becker
- Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
| | | | - Honglin Jiang
- Department of Animal and Poultry Sciences, Virginia Tech, VA 24060, USA
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, CA 95616, USA
| | - Jacob Thorne
- Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
| | - James E Koltes
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA
| | - Jennifer J Michal
- Department of Animal Science, Washington State University, WA 99164, USA
| | - Kimberly Davenport
- Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
| | - Monique Rijnkels
- Department of Veterinary Integrative Biosciences, Texas A&M University, TX 77843, USA
| | - Pablo J Ross
- Department of Animal Science, University of California, Davis, CA 95616, USA
| | - Rui Hu
- Department of Animal and Poultry Sciences, Virginia Tech, VA 24060, USA
| | - Sarah Corum
- Zoetis, Parsippany-Troy Hills, NJ 07054, USA
| | | | | | - Wansheng Liu
- Department of Animal Science, Pennsylvania State University, PA 16802, USA
| | - Wenzhi Ma
- Department of Animal Science, Pennsylvania State University, PA 16802, USA
| | - Xiaohui Zhang
- Department of Animal Science, Washington State University, WA 99164, USA
| | - Xiaoqing Xu
- Department of Animal Science, University of California, Davis, CA 95616, USA
| | - Xuelei Han
- Department of Animal Science, Washington State University, WA 99164, USA
| | - Zhihua Jiang
- Department of Animal Science, Washington State University, WA 99164, USA
| | - Zhi-Liang Hu
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA
| | - James M Reecy
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA
| |
Collapse
|
5
|
Liu Z, Zhang X, Huang L, Huo H, Wang P, Li W, Dai H, Yang F, Fu G, Zhao G, Sun YH, Huo J. Long- and short-read RNA sequencing from five reproductive organs of boar. Sci Data 2023; 10:678. [PMID: 37798273 PMCID: PMC10556096 DOI: 10.1038/s41597-023-02595-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 09/25/2023] [Indexed: 10/07/2023] Open
Abstract
The production of semen in boars involves multiple reproductive glands, including the testis (Tes), epididymis (Epi), vesicular gland (VG), prostate gland (PG), and bulbourethral gland (BG). However, previous studies on boar reproduction primarily focused on the testis, with little attention paid to the other glands. Here, we integrated single-molecule long-read sequencing with short-read sequencing to characterize the RNA landscape from five glands of Banna mini-pig inbred line (BMI) and Diannan small-ear pigs (DSE). We identified 110,996 full-length isoforms from 22,298 genes, and classified the alternative splicing (AS) events in these five glands. Transcriptome-wide variation analysis indicated that the number of single nucleotide polymorphisms (SNPs) in five tissues of BMI was significantly lower than that in the non-inbred pig, DSE, revealing the effect of inbreeding on BMI. Additionally, we performed small-RNA sequencing and identified 299 novel miRNAs across all glands. Overall, our findings provide a comprehensive overview of the RNA landscape within these five glands, paving the path for future investigations on reproductive biology and the impact of inbreeding on pig transcriptome.
Collapse
Affiliation(s)
- Zhipeng Liu
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
| | - Xia Zhang
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
- College of Life Science, Lyuliang University, Lvliang, 033001, Shanxi, China
| | - Libin Huang
- Department of Biology, College of Science, Northeastern University, Boston, Massachusetts, 02115, USA
| | - Hailong Huo
- Yunnan Open University, Kunming, 650500, Yunnan, China
| | - Pei Wang
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
| | - Weizhen Li
- College of Veterinary Medicine, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
| | - Hongmei Dai
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
| | - Fuhua Yang
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
| | - Guowen Fu
- College of Veterinary Medicine, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
| | - Guiying Zhao
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China.
| | - Yu H Sun
- Department of Biology, University of Rochester, Rochester, New York, 14627, USA.
| | - Jinlong Huo
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China.
- Department of Biology, University of Rochester, Rochester, New York, 14627, USA.
| |
Collapse
|
6
|
Xia Y, Jin Z, Zhang C, Ouyang L, Dong Y, Li J, Guo L, Jing B, Shi Y, Miao S, Xi R. TAGET: a toolkit for analyzing full-length transcripts from long-read sequencing. Nat Commun 2023; 14:5935. [PMID: 37741817 PMCID: PMC10518008 DOI: 10.1038/s41467-023-41649-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 09/13/2023] [Indexed: 09/25/2023] Open
Abstract
Single-molecule Real-time Isoform Sequencing (Iso-seq) of transcriptomes by PacBio can generate very long and accurate reads, thus providing an ideal platform for full-length transcriptome analysis. We present an integrated computational toolkit named TAGET for Iso-seq full-length transcript data analyses, including transcript alignment, annotation, gene fusion detection, and quantification analyses such as differential expression gene analysis and differential isoform usage analysis. We evaluate the performance of TAGET using a public Iso-seq dataset and newly sequenced Iso-seq datasets from tumor patients. TAGET gives significantly more precise novel splice site prediction and enables more accurate novel isoform and gene fusion discoveries, as validated by experimental validations and comparisons with RNA-seq data. We identify and experimentally validate a differential isoform usage gene ECM1, and further show that its isoform ECM1b may be a tumor-suppressor in laryngocarcinoma. Our results demonstrate that TAGET provides a valuable computational toolkit and can be applied to many full-length transcriptome studies.
Collapse
Affiliation(s)
- Yuchao Xia
- College of Science, Beijing Information Science and Technology University, 100192, Beijing, China
- Beijing GeneX Health Co.,Ltd, 100195, Beijing, China
| | - Zijie Jin
- Peking University International Cancer Institute, Health Science Center, Peking University, 100191, Beijing, China
- School of Mathematical Sciences, Peking University, 100871, Beijing, China
| | | | - Linkun Ouyang
- Academy for Advanced Interdisciplinary Studies, Peking University, 100871, Beijing, China
| | - Yuhao Dong
- Beijing GeneX Health Co.,Ltd, 100195, Beijing, China
| | - Juan Li
- Department of Biomedical Engineering, College of Future Technology, Peking University, 100871, Beijing, China
| | - Lvze Guo
- Beijing GeneX Health Co.,Ltd, 100195, Beijing, China
| | - Biyang Jing
- Beijing GeneX Health Co.,Ltd, 100195, Beijing, China
| | - Yang Shi
- BeiGene (Beijing) Co., Ltd., Beijing, China
| | - Susheng Miao
- Department of Head and Neck Surgery, Harbin Medical University Cancer Hospital, 150081, Harbin, China.
| | - Ruibin Xi
- School of Mathematical Sciences, Peking University, 100871, Beijing, China.
- Academy for Advanced Interdisciplinary Studies, Peking University, 100871, Beijing, China.
- Center for Statistical Science, Peking University, 100871, Beijing, China.
| |
Collapse
|
7
|
Zhao P, Gu L, Gao Y, Pan Z, Liu L, Li X, Zhou H, Yu D, Han X, Qian L, Liu GE, Fang L, Wang Z. Young SINEs in pig genomes impact gene regulation, genetic diversity, and complex traits. Commun Biol 2023; 6:894. [PMID: 37652983 PMCID: PMC10471783 DOI: 10.1038/s42003-023-05234-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 08/09/2023] [Indexed: 09/02/2023] Open
Abstract
Transposable elements (TEs) are a major source of genetic polymorphisms and play a role in chromatin architecture, gene regulatory networks, and genomic evolution. However, their functional role in pigs and contributions to complex traits are largely unknown. We created a catalog of TEs (n = 3,087,929) in pigs and found that young SINEs were predominantly silenced by histone modifications, DNA methylation, and decreased accessibility. However, some transcripts from active young SINEs showed high tissue-specificity, as confirmed by analyzing 3570 RNA-seq samples. We also detected 211,067 dimorphic SINEs in 374 individuals, including 340 population-specific ones associated with local adaptation. Mapping these dimorphic SINEs to genome-wide associations of 97 complex traits in pigs, we found 54 candidate genes (e.g., ANK2 and VRTN) that might be mediated by TEs. Our findings highlight the important roles of young SINEs and provide a supplement for genotype-to-phenotype associations and modern breeding in pigs.
Collapse
Affiliation(s)
- Pengju Zhao
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Lihong Gu
- Institute of Animal Science & Veterinary Medicine, Hainan Academy of Agricultural Sciences, No. 14 Xingdan Road, Haikou, 571100, China
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Zhangyuan Pan
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Lei Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Xingzheng Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Dongyou Yu
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Xinyan Han
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Lichun Qian
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark.
| | - Zhengguang Wang
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China.
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China.
| |
Collapse
|
8
|
Jiang YF, Wang S, Wang CL, Xu RH, Wang WW, Jiang Y, Wang MS, Jiang L, Dai LH, Wang JR, Chu XH, Zeng YQ, Fang LZ, Wu DD, Zhang Q, Ding XD. Pangenome obtained by long-read sequencing of 11 genomes reveal hidden functional structural variants in pigs. iScience 2023; 26:106119. [PMID: 36852268 PMCID: PMC9958381 DOI: 10.1016/j.isci.2023.106119] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 12/21/2022] [Accepted: 01/30/2023] [Indexed: 02/05/2023] Open
Abstract
Long-read sequencing (LRS) facilitates both the genome assembly and the discovery of structural variants (SVs). Here, we built a graph-based pig pangenome by incorporating 11 LRS genomes with an average of 94.01% BUSCO completeness score, revealing 206-Mb novel sequences. We discovered 183,352 nonredundant SVs (63% novel), representing 12.12% of the reference genome. By genotyping SVs in an additional 196 short-read sequencing samples, we identified thousands of population stratified SVs. Particularly, we detected 7,568 Tibetan specific SVs, some of which demonstrate significant population differentiation between Tibetan and low-altitude pigs, which might be associated with the high-altitude hypoxia adaptation in Tibetan pigs. Further integrating functional genomic data, the most promising candidate genes within the SVs that might contribute to the high-altitude hypoxia adaptation were discovered. Overall, our study generates a benchmark pangenome resource for illustrating the important roles of SVs in adaptive evolution, domestication, and genetic improvement of agronomic traits in pigs.
Collapse
Affiliation(s)
- Yi-Fan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Sheng Wang
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Chong-Long Wang
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Ru-Hai Xu
- Key Laboratory of Animal Genetics and Breeding of Zhejiang Province, Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Wen-Wen Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian 271001, China
| | - Yao Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Ming-Shan Wang
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Li Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Li-He Dai
- Key Laboratory of Animal Genetics and Breeding of Zhejiang Province, Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Jie-Ru Wang
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Xiao-Hong Chu
- Key Laboratory of Animal Genetics and Breeding of Zhejiang Province, Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Yong-Qing Zeng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian 271001, China
| | - Ling-Zhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian 271001, China
| | - Xiang-Dong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| |
Collapse
|
9
|
Li J, Guan D, Halstead MM, Islas-Trejo AD, Goszczynski DE, Ernst CW, Cheng H, Ross P, Zhou H. Transcriptome annotation of 17 porcine tissues using nanopore sequencing technology. Anim Genet 2023; 54:35-44. [PMID: 36385508 DOI: 10.1111/age.13274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 10/20/2022] [Accepted: 11/01/2022] [Indexed: 11/18/2022]
Abstract
The annotation of animal genomes plays an important role in elucidating molecular mechanisms behind the genetic control of economically important traits. Here, we employed long-read sequencing technology, Oxford Nanopore Technology, to annotate the pig transcriptome across 17 tissues from two Yorkshire littermate pigs. More than 9.8 million reads were obtained from a single flow cell, and 69 781 unique transcripts at 50 108 loci were identified. Of these transcripts, 16 255 were found to be novel isoforms, and 22 344 were found at loci that were novel and unannotated in the Ensembl (release 102) and NCBI (release 106) annotations. Novel transcripts were mostly expressed in cerebellum, followed by lung, liver, spleen, and hypothalamus. By comparing the unannotated transcripts to existing databases, there were 21 285 (95.3%) transcripts matched to the NT database (v5) and 13 676 (61.2%) matched to the NR database (v5). Moreover, there were 4324 (19.4%) transcripts matched to the SwissProt database (v5), corresponding to 11 356 proteins. Tissue-specific gene expression analyses showed that 9749 transcripts were highly tissue-specific, and cerebellum contained the most tissue-specific transcripts. As the same samples were used for the annotation of cis-regulatory elements in the pig genome, the transcriptome annotation generated by this study provides an additional and complementary annotation resource for the Functional Annotation of Animal Genomes effort to comprehensively annotate the pig genome.
Collapse
Affiliation(s)
- Jinghui Li
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Dailu Guan
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Michelle M Halstead
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Alma D Islas-Trejo
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Daniel E Goszczynski
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Catherine W Ernst
- Department of Animal Science, Michigan State University, East Lansing, Michigan, USA
| | - Hao Cheng
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Pablo Ross
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Huaijun Zhou
- Department of Animal Science, University of California Davis, Davis, California, USA
| |
Collapse
|
10
|
Jia D, Jin C, Gong S, Wang X, Wu T. RNA-Seq and Iso-Seq Reveal the Important Role of COMT and CCoAOMT Genes in Accumulation of Scopoletin in Noni ( Morinda citrifolia). Genes (Basel) 2022; 13:1993. [PMID: 36360230 PMCID: PMC9689816 DOI: 10.3390/genes13111993] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 10/27/2022] [Indexed: 11/24/2023] Open
Abstract
Scopoletin, the main component of clinical drugs and the functional component of health products, is highly abundant in noni fruit (Morinda citrifolia). Multiple enzyme genes regulate scopoletin accumulation. In the present study, differentially expressed genes of noni were analyzed by RNA sequencing (RNA-Seq) and the full-length genes by isoform-sequencing (Iso-Seq) to find the critical genes in the scopoletin accumulation mechanism pathway. A total of 32,682 full-length nonchimeric reads (FLNC) were obtained, out of which 16,620 non-redundant transcripts were validated. Based on KEGG (Kyoto Encyclopedia of Genes and Genomes) annotation and differential expression analysis, two differentially expressed genes, caffeic acid 3-O-methyltransferase (COMT) and caffeoyl-CoA O-methyltransferase (CCoAOMT), were found in the scopoletin accumulation pathway of noni. Real-time quantitative polymerase chain reaction (q-PCR), phylogenetic tree analysis, gene expression analysis, and the change in scopoletin content confirmed that these two proteins are important in this pathway. Based on these results, the current study supposed that COMT and CCoAOMT play a significant role in the accumulation of scopoletin in noni fruit, and COMT (gene number: gene 7446, gene 8422, and gene 6794) and CCoAOMT (gene number: gene 12,084) were more significant. These results provide the importance of COMT and CCoAOMT and a basis for further understanding the accumulation mechanism of scopoletin in noni.
Collapse
Affiliation(s)
| | | | | | | | - Tian Wu
- Southwest Landscape Architecture Engineering Research Center of State Forestry Administration, Landscape Architecture and Horticulture Science School, Southwest Forestry University, Kunming 650000, China
| |
Collapse
|
11
|
Guan D, Halstead MM, Islas-Trejo AD, Goszczynski DE, Cheng HH, Ross PJ, Zhou H. Prediction of transcript isoforms in 19 chicken tissues by Oxford Nanopore long-read sequencing. Front Genet 2022; 13:997460. [PMID: 36246588 PMCID: PMC9561881 DOI: 10.3389/fgene.2022.997460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 08/30/2022] [Indexed: 11/22/2022] Open
Abstract
To identify and annotate transcript isoforms in the chicken genome, we generated Nanopore long-read sequencing data from 68 samples that encompassed 19 diverse tissues collected from experimental adult male and female White Leghorn chickens. More than 23.8 million reads with mean read length of 790 bases and average quality of 18.2 were generated. The annotation and subsequent filtering resulted in the identification of 55,382 transcripts at 40,547 loci with mean length of 1,700 bases. We predicted 30,967 coding transcripts at 19,461 loci, and 16,495 lncRNA transcripts at 15,512 loci. Compared to existing reference annotations, we found ∼52% of annotated transcripts could be partially or fully matched while ∼47% were novel. Seventy percent of novel transcripts were potentially transcribed from lncRNA loci. Based on our annotation, we quantified transcript expression across tissues and found two brain tissues (i.e., cerebellum and cortex) expressed the highest number of transcripts and loci. Furthermore, ∼22% of the transcripts displayed tissue specificity with the reproductive tissues (i.e., testis and ovary) exhibiting the most tissue-specific transcripts. Despite our wide sampling, ∼20% of Ensembl reference loci were not detected. This suggests that deeper sequencing and additional samples that include different breeds, cell types, developmental stages, and physiological conditions, are needed to fully annotate the chicken genome. The application of Nanopore sequencing in this study demonstrates the usefulness of long-read data in discovering additional novel loci (e.g., lncRNA loci) and resolving complex transcripts (e.g., the longest transcript for the TTN locus).
Collapse
Affiliation(s)
- Dailu Guan
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Michelle M. Halstead
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Alma D. Islas-Trejo
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Daniel E. Goszczynski
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Hans H. Cheng
- USDA, ARS, USNPRC, Avian Disease and Oncology Laboratory, East Lansing, MI, United States
| | - Pablo J. Ross
- Department of Animal Science, University of California Davis, Davis, CA, United States
- *Correspondence: Pablo J. Ross, ; Huaijun Zhou,
| | - Huaijun Zhou
- Department of Animal Science, University of California Davis, Davis, CA, United States
- *Correspondence: Pablo J. Ross, ; Huaijun Zhou,
| |
Collapse
|
12
|
Wiarda JE, Trachsel JM, Sivasankaran SK, Tuggle CK, Loving CL. Intestinal single-cell atlas reveals novel lymphocytes in pigs with similarities to human cells. Life Sci Alliance 2022; 5:5/10/e202201442. [PMID: 35995567 PMCID: PMC9396248 DOI: 10.26508/lsa.202201442] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 08/04/2022] [Accepted: 08/04/2022] [Indexed: 12/13/2022] Open
Abstract
Single-cell RNA sequencing of porcine ileal lymphocytes reveals similarities to human cells and discovery of porcine intestinal innate lymphoid cells. Lymphocytes can heavily influence intestinal health, but resolving intestinal lymphocyte function is challenging as the intestine contains a vastly heterogeneous mixture of cells. Pigs are an advantageous biomedical model, but deeper understanding of intestinal lymphocytes is warranted to improve model utility. Twenty-six cell types were identified in the porcine ileum by single-cell RNA sequencing and further compared with cells in human and murine ileum. Though general consensus of cell subsets across species was revealed, some porcine-specific lymphocyte subsets were identified. Differential tissue dissection and in situ analyses conferred spatial context, revealing similar locations of lymphocyte subsets in Peyer’s patches and epithelium in pig-to-human comparisons. Like humans, activated and effector lymphocytes were abundant in the ileum but not periphery of pigs, suggesting tissue-specific and/or activation-associated gene expression. Gene signatures for peripheral and ileal innate lymphoid cells newly discovered in pigs were defined and highlighted similarities to human innate lymphoid cells. Overall, we reveal novel lymphocyte subsets in pigs and highlight utility of pigs for intestinal research applications.
Collapse
Affiliation(s)
- Jayne E Wiarda
- Food Safety and Enteric Pathogens Research Unit, National Animal Disease Center, Agricultural Research Service, United States Department of Agriculture, Ames, IA, USA.,Immunobiology Graduate Program, Iowa State University, Ames, IA, USA.,Oak Ridge Institute for Science and Education, Agricultural Research Service Participation Program, Oak Ridge, TN, USA
| | - Julian M Trachsel
- Food Safety and Enteric Pathogens Research Unit, National Animal Disease Center, Agricultural Research Service, United States Department of Agriculture, Ames, IA, USA
| | - Sathesh K Sivasankaran
- Food Safety and Enteric Pathogens Research Unit, National Animal Disease Center, Agricultural Research Service, United States Department of Agriculture, Ames, IA, USA.,Genome Informatics Facility, Iowa State University, Ames, IA, USA
| | | | - Crystal L Loving
- Food Safety and Enteric Pathogens Research Unit, National Animal Disease Center, Agricultural Research Service, United States Department of Agriculture, Ames, IA, USA
| |
Collapse
|
13
|
Sage SE, Nicholson P, Leeb T, Gerber V, Jagannathan V. Long-Read Transcriptome of Equine Bronchoalveolar Cells. Genes (Basel) 2022; 13:genes13101722. [PMID: 36292607 PMCID: PMC9602388 DOI: 10.3390/genes13101722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 09/09/2022] [Accepted: 09/21/2022] [Indexed: 11/16/2022] Open
Abstract
We used Pacific Biosciences long-read isoform sequencing to generate full-length transcript sequences in equine bronchoalveolar lavage fluid (BALF) cells. Our dataset consisted of 313,563 HiFi reads comprising 805 Mb of polished sequence information. The resulting equine BALF transcriptome consisted of 14,234 full-length transcript isoforms originating from 7017 unique genes. These genes consisted of 6880 previously annotated genes and 137 novel genes. We identified 3428 novel transcripts in addition to 10,806 previously known transcripts. These included transcripts absent from existing genome annotations, transcripts mapping to putative novel (unannotated) genes and fusion transcripts incorporating exons from multiple genes. We provide transcript-level data for equine BALF cells as a resource to the scientific community.
Collapse
Affiliation(s)
- Sophie Elena Sage
- Swiss Institute of Equine Medicine, Department of Clinical Veterinary Medicine, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland
| | - Pamela Nicholson
- Next Generation Sequencing Platform, University of Bern, 3001 Bern, Switzerland
| | - Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland
| | - Vinzenz Gerber
- Swiss Institute of Equine Medicine, Department of Clinical Veterinary Medicine, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland
| | - Vidhya Jagannathan
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland
- Correspondence:
| |
Collapse
|
14
|
Li S, Wang J, Li J, Yue M, Liu C, Ma L, Liu Y. Integrative analysis of transcriptome complexity in pig granulosa cells by long-read isoform sequencing. PeerJ 2022; 10:e13446. [PMID: 35637716 PMCID: PMC9147391 DOI: 10.7717/peerj.13446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 04/26/2022] [Indexed: 01/14/2023] Open
Abstract
Background In intensive and large-scale farms, abnormal estradiol levels in sows can cause reproductive disorders. The high incidence rate of reproductive disturbance will induce the elimination of productive sows in large quantities, and the poor management will bring great losses to the pig farms. The change in estradiol level has an important effect on follicular development and estrus of sows. To solve this practical problem and improve the productive capacity of sows, it is significant to further clarify the regulatory mechanism of estradiol synthesis in porcine granulosa cells (GCs). The most important function of granulosa cells is to synthesize estradiol. Thus, the studies about the complex transcriptome in porcine GCs are significant. As for precursor-messenger RNAs (pre-mRNAs), their post-transcriptional modification, such as alternative polyadenylation (APA) and alternative splicing (AS), together with long non-coding RNAs (lncRNAs), may regulate the functions of granulosa cells. However, the above modification events and their function are unclear within pig granulosa cells. Methods Combined PacBio long-read isoform sequencing (Iso-Seq) was conducted in this work for generating porcine granulosa cells' transcriptomic data. We discovered new transcripts and possible gene loci via comparison against reference genome. Later, combined Iso-Seq data were adopted to uncover those post-transcriptional modifications such as APA or AS, together with lncRNA within porcine granulosa cells. For confirming that the Iso-Seq data were reliable, we chose four AS genes and analyzed them through RT-PCR. Results The present article illustrated that pig GCs had a complex transcriptome, which gave rise to 8,793 APA, 3,465 AS events, 703 candidate new gene loci, as well as 92 lncRNAs. The results of this study revealed the complex transcriptome in pig GCs. It provided a basis for the interpretation of the molecular mechanism in GCs.
Collapse
Affiliation(s)
- Shuxin Li
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| | - Jiarui Wang
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| | - Jiale Li
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| | - Meihong Yue
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| | - Chuncheng Liu
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| | - Libing Ma
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| | - Ying Liu
- School of Life Science and Technology, Inner Mongolia University of Science & Technology, Baotou, Inner Mongolia, China
| |
Collapse
|
15
|
Shu Z, Wang L, Wang J, Zhang L, Hou X, Yan H, Wang L. Integrative Analysis of Nanopore and Illumina Sequencing Reveals Alternative Splicing Complexity in Pig Longissimus Dorsi Muscle. Front Genet 2022; 13:877646. [PMID: 35480309 PMCID: PMC9035893 DOI: 10.3389/fgene.2022.877646] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 03/22/2022] [Indexed: 11/13/2022] Open
Abstract
Alternative splicing (AS) is a key step in the post-transcriptional regulation of gene expression that can affect intramuscular fat (IMF). In this study, longissimus dorsi muscles from 30 pigs in high- and low- IMF groups were used to perform Oxford Nanopore Technologies (ONT) full-length sequencing and Illumina strand-specific RNA-seq. A total of 43,688 full-length transcripts were identified, with 4,322 novel genes and 30,795 novel transcripts. Using AStalavista, a total of 14,728 AS events were detected in the longissimus dorsi muscle. About 17.79% of the genes produced splicing isoforms, in which exon skipping was the most frequent AS event. By analyzing the expression differences of mRNAs and splicing isoforms, we found that differentially expressed mRNAs with splicing isoforms could participate in skeletal muscle development and fatty acid metabolism, which might determine muscle-related traits. SERBP1, MYL1, TNNT3, and TNNT1 were identified with multiple splicing isoforms, with significant differences in expression. AS events occurring in IFI6 and GADD45G may cause significant differences in gene expression. Other AS events, such as ONT.15153.3, may regulate the function of ART1 by regulating the expression of different transcripts. Moreover, co-expression and protein-protein interaction (PPI) analysis indicated that several genes (MRPL27, AAR2, PYGM, PSMD4, SCNM1, and HNRNPDL) may be related to intramuscular fat. The splicing isoforms investigated in our research provide a reference for the study of alternative splicing regulation of intramuscular fat deposition.
Collapse
|
16
|
Yuan Z, Ge L, Zhang W, Lv X, Wang S, Cao X, Sun W. Preliminary Results about Lamb Meat Tenderness Based on the Study of Novel Isoforms and Alternative Splicing Regulation Pathways Using Iso-seq, RNA-seq and CTCF ChIP-seq Data. Foods 2022; 11:foods11081068. [PMID: 35454655 PMCID: PMC9025809 DOI: 10.3390/foods11081068] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Revised: 03/29/2022] [Accepted: 04/04/2022] [Indexed: 01/27/2023] Open
Abstract
Tenderness is an important indicator of meat quality. Novel isoforms associated with meat tenderness and the role of the CCCTC-binding factor (CTCF) in regulating alternative splicing to produce isoforms in sheep are largely unknown. The current project studied six sheep from two crossbred populations (Dorper × Hu × Hu, DHH and Dorper × Dorper × Hu, DDH) with divergent meat tenderness. Pooled Iso-seq data were used to annotate the sheep genomes. Then, the updated genome annotation and six RNA-seq data were combined to identify differentially expressed isoforms (DEIs) in muscles between DHH and DDH. These data were also combined with peaks detected from CTCF ChIP-seq data to investigate the regulatory role of CTCF for the alternative splicing. As a result, a total of 624 DEIs were identified between DDH and DHH. For example, isoform 7.524.18 transcribed from CAPN3 may be associated with meat tenderness. In addition, a total of 86 genes were overlapped between genes with transcribed DEIs and genes in differential peaks identified by CTCF ChIP-seq. Among these overlapped genes, ANKRD23 produces different isoforms which may be regulated by CTCF via methylation. As preliminary research, our results identified novel isoforms associated with meat tenderness and revealed the possible regulating mechanisms of alternative splicing to produce isoforms.
Collapse
Affiliation(s)
- Zehu Yuan
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou 225000, China
| | - Ling Ge
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225000, China
| | - Weibo Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225000, China
| | - Xiaoyang Lv
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou 225000, China
| | - Shanhe Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225000, China
| | - Xiukai Cao
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou 225000, China
| | - Wei Sun
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou 225000, China
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225000, China
| |
Collapse
|
17
|
Ross EM, Sanjana H, Nguyen LT, Cheng Y, Moore SS, Hayes BJ. Extensive Variation in Gene Expression is Revealed in 13 Fertility-Related Genes Using RNA-Seq, ISO-Seq, and CAGE-Seq From Brahman Cattle. Front Genet 2022; 13:784663. [PMID: 35401673 PMCID: PMC8990236 DOI: 10.3389/fgene.2022.784663] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 01/24/2022] [Indexed: 11/13/2022] Open
Abstract
Fertility is a key driver of economic profitability in cattle production. A number of studies have identified genes associated with fertility using genome wide association studies and differential gene expression analysis; however, the genes themselves are poorly characterized in cattle. Here, we selected 13 genes from the literature which have previously been shown to have strong evidence for an association with fertility in Brahman cattle (Bos taurus indicus) or closely related breeds. We examine the expression variation of the 13 genes that are associated with cattle fertility using RNA-seq, CAGE-seq, and ISO-seq data from 11 different tissue samples from an adult Brahman cow and a Brahman fetus. Tissues examined include blood, liver, lung, kidney, muscle, spleen, ovary, and uterus from the cow and liver and lung from the fetus. The analysis revealed several novel isoforms, including seven from SERPINA7. The use of three expression characterization methodologies (5′ cap selected ISO-seq, CAGE-seq, and RNA-seq) allowed the identification of isoforms that varied in their length of 5′ and 3′ untranslated regions, variation otherwise undetectable (collapsed as degraded RNA) in generic isoform identification pipelines. The combinations of different sequencing technologies allowed us to overcome the limitations of relatively low sequence depth in the ISO-seq data. The lower sequence depth of the ISO-seq data was also reflected in the lack of observed expression of some genes that were observed in the CAGE-seq and RNA-seq data from the same tissue. We identified allele specific expression that was tissue-specific in AR, IGF1, SOX9, STAT3, and TAF9B. Finally, we characterized an exon of TAF9B as partially nested within the neighboring gene phosphoglycerate kinase 1. As this study only examined two animals, even more transcriptional variation may be present in a genetically diverse population. This analysis reveals the large amount of transcriptional variation within mammalian fertility genes and illuminates the fact that the transcriptional landscape cannot be fully characterized using a single technology alone.
Collapse
Affiliation(s)
- Elizabeth M. Ross
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
- *Correspondence: Elizabeth M. Ross,
| | - Hari Sanjana
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Loan T. Nguyen
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - YuanYuan Cheng
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, Australia
| | - Stephen S. Moore
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Ben J. Hayes
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| |
Collapse
|
18
|
Healey HM, Bassham S, Cresko WA. Single-cell Iso-Sequencing enables rapid genome annotation for scRNAseq analysis. Genetics 2022; 220:iyac017. [PMID: 35143654 PMCID: PMC8893252 DOI: 10.1093/genetics/iyac017] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 01/24/2022] [Indexed: 12/02/2022] Open
Abstract
Single-cell RNA sequencing is a powerful technique that continues to expand across various biological applications. However, incomplete 3'-UTR annotations can impede single-cell analysis resulting in genes that are partially or completely uncounted. Performing single-cell RNA sequencing with incomplete 3'-UTR annotations can hinder the identification of cell identities and gene expression patterns and lead to erroneous biological inferences. We demonstrate that performing single-cell isoform sequencing in tandem with single-cell RNA sequencing can rapidly improve 3'-UTR annotations. Using threespine stickleback fish (Gasterosteus aculeatus), we show that gene models resulting from a minimal embryonic single-cell isoform sequencing dataset retained 26.1% greater single-cell RNA sequencing reads than gene models from Ensembl alone. Furthermore, pooling our single-cell sequencing isoforms with a previously published adult bulk Iso-Seq dataset from stickleback, and merging the annotation with the Ensembl gene models, resulted in a marginal improvement (+0.8%) over the single-cell isoform sequencing only dataset. In addition, isoforms identified by single-cell isoform sequencing included thousands of new splicing variants. The improved gene models obtained using single-cell isoform sequencing led to successful identification of cell types and increased the reads identified of many genes in our single-cell RNA sequencing stickleback dataset. Our work illuminates single-cell isoform sequencing as a cost-effective and efficient mechanism to rapidly annotate genomes for single-cell RNA sequencing.
Collapse
Affiliation(s)
- Hope M Healey
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
| | - Susan Bassham
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
| | - William A Cresko
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Presidential Initiative in Data Science, University of Oregon, Eugene, OR 97403, USA
| |
Collapse
|
19
|
Karlsson M, Sjöstedt E, Oksvold P, Sivertsson Å, Huang J, Álvez MB, Arif M, Li X, Lin L, Yu J, Ma T, Xu F, Han P, Jiang H, Mardinoglu A, Zhang C, von Feilitzen K, Xu X, Wang J, Yang H, Bolund L, Zhong W, Fagerberg L, Lindskog C, Pontén F, Mulder J, Luo Y, Uhlen M. Genome-wide annotation of protein-coding genes in pig. BMC Biol 2022; 20:25. [PMID: 35073880 PMCID: PMC8788080 DOI: 10.1186/s12915-022-01229-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 01/07/2022] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND There is a need for functional genome-wide annotation of the protein-coding genes to get a deeper understanding of mammalian biology. Here, a new annotation strategy is introduced based on dimensionality reduction and density-based clustering of whole-body co-expression patterns. This strategy has been used to explore the gene expression landscape in pig, and we present a whole-body map of all protein-coding genes in all major pig tissues and organs. RESULTS An open-access pig expression map ( www.rnaatlas.org ) is presented based on the expression of 350 samples across 98 well-defined pig tissues divided into 44 tissue groups. A new UMAP-based classification scheme is introduced, in which all protein-coding genes are stratified into tissue expression clusters based on body-wide expression profiles. The distribution and tissue specificity of all 22,342 protein-coding pig genes are presented. CONCLUSIONS Here, we present a new genome-wide annotation strategy based on dimensionality reduction and density-based clustering. A genome-wide resource of the transcriptome map across all major tissues and organs in pig is presented, and the data is available as an open-access resource ( www.rnaatlas.org ), including a comparison to the expression of human orthologs.
Collapse
Affiliation(s)
- Max Karlsson
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Evelina Sjöstedt
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Per Oksvold
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Åsa Sivertsson
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Jinrong Huang
- BGI-Shenzhen, Shenzhen, China
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, Qingdao, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - María Bueno Álvez
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Muhammad Arif
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Xiangyu Li
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Lin Lin
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | - Jiaying Yu
- BGI-Shenzhen, Shenzhen, China
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, Qingdao, China
| | - Tao Ma
- MGI, BGI-Shenzhen, Shenzhen, China
| | - Fengping Xu
- BGI-Shenzhen, Shenzhen, China
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, Qingdao, China
| | - Peng Han
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, Qingdao, China
| | | | - Adil Mardinoglu
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Cheng Zhang
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Kalle von Feilitzen
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, China
| | | | | | - Lars Bolund
- BGI-Shenzhen, Shenzhen, China
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, Qingdao, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Wen Zhong
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Linn Fagerberg
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Cecilia Lindskog
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Fredrik Pontén
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Jan Mulder
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Yonglun Luo
- BGI-Shenzhen, Shenzhen, China
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, Qingdao, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | - Mathias Uhlen
- Department of Protein Science, Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden.
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.
| |
Collapse
|
20
|
Hao W, Yang Z, Sun Y, Li J, Zhang D, Liu D, Yang X. Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents. Biomolecules 2022; 12:biom12020154. [PMID: 35204660 PMCID: PMC8961525 DOI: 10.3390/biom12020154] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 01/11/2022] [Accepted: 01/14/2022] [Indexed: 02/06/2023] Open
Abstract
Meat quality is one of the most important economic traits in pig breeding and production. Intramuscular fat (IMF) is a major factor that improves meat quality. To better understand the alternative splicing (AS) events underlying meat quality, long-read isoform sequencing (Iso-seq) was used to identify differential (D)AS events between the longissimus thoracis (LT) and semitendinosus (ST), which differ in IMF content, together with short-read RNA-seq. Through Iso-seq analysis, we identified a total of 56,789 novel transcripts covering protein-coding genes, lncRNA, and fusion transcripts that were not previously annotated in pigs. We also identified 456,965 AS events, among which 3930 were DAS events, corresponding to 2364 unique genes. Through integrative analysis of Iso-seq and RNA-seq, we identified 1174 differentially expressed genes (DEGs), among which 122 were DAS genes, i.e., DE-DAS genes. There are 12 overlapped pathways between the top 20 DEGs and DE-DAS genes, as revealed by KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis, indicating that DE-DAS genes play important roles in the differential phenotype of LT and ST. Further analysis showed that upregulated DE-DAS genes are more important than downregulated ones in IMF deposition. Fatty acid degradation and the PPAR (peroxisome proliferator-activated receptor) signaling pathway were found to be the most important pathways regulating the differential fat deposition of the two muscles. The results update the existing porcine genome annotations and provide data for the in-depth exploration of the mechanisms underlying meat quality and IMF deposition.
Collapse
Affiliation(s)
- Wanjun Hao
- College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China; (W.H.); (Z.Y.); (Y.S.); (J.L.)
| | - Zewei Yang
- College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China; (W.H.); (Z.Y.); (Y.S.); (J.L.)
| | - Yuanlu Sun
- College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China; (W.H.); (Z.Y.); (Y.S.); (J.L.)
| | - Jiaxin Li
- College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China; (W.H.); (Z.Y.); (Y.S.); (J.L.)
| | - Dongjie Zhang
- Institute of Animal Husbandry, Heilongjiang Academy of Agricultural Sciences, Harbin 150086, China;
| | - Di Liu
- Institute of Animal Husbandry, Heilongjiang Academy of Agricultural Sciences, Harbin 150086, China;
- Correspondence: (D.L.); (X.Y.); Tel.: +86-451-8667-7458 (D.L.); +86-451-5519-1738 (X.Y.)
| | - Xiuqin Yang
- College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China; (W.H.); (Z.Y.); (Y.S.); (J.L.)
- Correspondence: (D.L.); (X.Y.); Tel.: +86-451-8667-7458 (D.L.); +86-451-5519-1738 (X.Y.)
| |
Collapse
|
21
|
Al Kadi M, Jung N, Okuzaki D. UNAGI: Yeast Transcriptome Reconstruction and Gene Discovery Using Nanopore Sequencing. Methods Mol Biol 2022; 2477:79-89. [PMID: 35524113 DOI: 10.1007/978-1-0716-2257-5_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Computational approaches are the main approaches used in genome annotation. However, accuracy is low. Untranslated regions are not identified, complex isoforms are not predicted correctly and discovery rate of noncoding RNA is low. RNA-seq has revolutionized transcriptome reconstruction over the last decade. However, fragmentation included in cDNA sequencing leads to information loss, requiring transcripts to be assembled and reconstructed, thus affecting the accuracy of reconstructed transcriptome. Recently, long-read sequencing has been introduced with technologies such as Oxford Nanopore sequencing. cDNA is sequenced directly without fragmentation producing long reads that don't need to be assembled keeping the transcript structure intact and increasing the accuracy of transcriptome reconstruction.Here we present a protocol and a pipeline to reconstruct the transcriptome of compact genomes including yeasts. It involves generating full-length cDNA and using Oxford Nanopore ligation-based sequencing kit to sequence multiple samples in the same run. The pipeline (1) strands the generated long reads, (2) corrects the reads by mapping them to the reference genome, (3) identifies transcripts including 5'UTR and 3'UTR, (4) profiles the isoforms, filtering out artifacts resulting from low accuracy in sequencing, and (5) improves accuracy of provided annotations. Using long reads improves the accuracy of transcriptome reconstruction and helps in discovering a significant number of novel RNAs.
Collapse
Affiliation(s)
- Mohamad Al Kadi
- Department of Bacterial Infections, Research Institute for Microbial Diseases, Osaka University, Osaka, Japan
| | - Nicolas Jung
- Department of Infection Metagenomics, Research Institute for Microbial Diseases, Osaka University, Osaka, Japan
| | - Daisuke Okuzaki
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Osaka, Japan.
- Single Cell Genomics, Human Immunology, WPI Immunology Frontier Research Center, Osaka University, Osaka, Japan.
- Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, Osaka, Japan.
| |
Collapse
|
22
|
Treitli SC, Peña-Diaz P, Hałakuc P, Karnkowska A, Hampl V. High quality genome assembly of the amitochondriate eukaryote Monocercomonoides exilis. Microb Genom 2021; 7. [PMID: 34951395 PMCID: PMC8767320 DOI: 10.1099/mgen.0.000745] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Monocercomonoides exilis is considered the first known eukaryote to completely lack mitochondria. This conclusion is based primarily on a genomic and transcriptomic study which failed to identify any mitochondrial hallmark proteins. However, the available genome assembly has limited contiguity and around 1.5 % of the genome sequence is represented by unknown bases. To improve the contiguity, we re-sequenced the genome and transcriptome of M. exilis using Oxford Nanopore Technology (ONT). The resulting draft genome is assembled in 101 contigs with an N50 value of 1.38 Mbp, almost 20 times higher than the previously published assembly. Using a newly generated ONT transcriptome, we further improve the gene prediction and add high quality untranslated region (UTR) annotations, in which we identify two putative polyadenylation signals present in the 3′UTR regions and characterise the Kozak sequence in the 5′UTR regions. All these improvements are reflected by higher BUSCO genome completeness values. Regardless of an overall more complete genome assembly without missing bases and a better gene prediction, we still failed to identify any mitochondrial hallmark genes, thus further supporting the hypothesis on the absence of mitochondrion.
Collapse
Affiliation(s)
- Sebastian Cristian Treitli
- Department of Parasitology, Faculty of Science, Charles University, BIOCEV, Průmyslová 595, 252 42 Vestec, Czech Republic
| | - Priscila Peña-Diaz
- Department of Parasitology, Faculty of Science, Charles University, BIOCEV, Průmyslová 595, 252 42 Vestec, Czech Republic
| | - Paweł Hałakuc
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Anna Karnkowska
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Vladimír Hampl
- Department of Parasitology, Faculty of Science, Charles University, BIOCEV, Průmyslová 595, 252 42 Vestec, Czech Republic
| |
Collapse
|
23
|
Le Page L, Gillespie A, Schwartz JC, Prawits LM, Schlerka A, Farrell CP, Hammond JA, Baldwin CL, Telfer JC, Hammer SE. Subpopulations of swine γδ T cells defined by TCRγ and WC1 gene expression. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2021; 125:104214. [PMID: 34329647 DOI: 10.1016/j.dci.2021.104214] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 07/24/2021] [Accepted: 07/24/2021] [Indexed: 06/13/2023]
Abstract
γδ T cells constitute a major portion of lymphocytes in the blood of both ruminants and swine. Subpopulations of swine γδ T cells have been distinguished by CD2 and CD8α expression. However, it was not clear if they have distinct expression profiles of their T-cell receptor (TCR) or WC1 genes. Identifying receptor expression will contribute to understanding the functional differences between these subpopulations and their contributions to immune protection. Here, we annotated three genomic assemblies of the swine TCRγ gene locus finding four gene cassettes containing C, J and V genes, although some haplotypes carried a null TRGC gene (TRGC4). Genes in the TRGC1 cassette were homologs of bovine TRGC5 cassette while the others were not homologous to bovine genes. Here we evaluated three principal populations of γδ T cells (CD2+/SWC5-, CD2-/SWC5+, and CD2-/SWC5-). Both CD2- subpopulations transcribed WC1 co-receptor genes, albeit with different patterns of gene expression but CD2+ cells did not. All subpopulations transcribed TCR genes from all four cassettes, although there were differences in expression levels. Finally, the CD2+ and CD2- γδ T-cell populations differed in their representation in various organs and tissues, presumably at least partially reflective of different ligand specificities for their receptors.
Collapse
Affiliation(s)
- Lauren Le Page
- Department of Veterinary & Animal Sciences, University of Massachusetts, Amherst, MA, USA
| | - Alexandria Gillespie
- Department of Veterinary & Animal Sciences, University of Massachusetts, Amherst, MA, USA
| | | | - Lisa-Maria Prawits
- Institute of Immunology, Department of Pathobiology, University of Veterinary Medicine, Vienna, Austria
| | - Angela Schlerka
- Institute of Immunology, Department of Pathobiology, University of Veterinary Medicine, Vienna, Austria
| | - Colin P Farrell
- Division of Hematology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | - Cynthia L Baldwin
- Department of Veterinary & Animal Sciences, University of Massachusetts, Amherst, MA, USA
| | - Janice C Telfer
- Department of Veterinary & Animal Sciences, University of Massachusetts, Amherst, MA, USA
| | - Sabine E Hammer
- Institute of Immunology, Department of Pathobiology, University of Veterinary Medicine, Vienna, Austria.
| |
Collapse
|
24
|
Shields EJ, Sorida M, Sheng L, Sieriebriennikov B, Ding L, Bonasio R. Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain. BMC Biol 2021; 19:254. [PMID: 34838024 PMCID: PMC8626913 DOI: 10.1186/s12915-021-01188-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Accepted: 11/10/2021] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Functional genomic analyses rely on high-quality genome assemblies and annotations. Highly contiguous genome assemblies have become available for a variety of species, but accurate and complete annotation of gene models, inclusive of alternative splice isoforms and transcription start and termination sites, remains difficult with traditional approaches. RESULTS Here, we utilized full-length isoform sequencing (Iso-Seq), a long-read RNA sequencing technology, to obtain a comprehensive annotation of the transcriptome of the ant Harpegnathos saltator. The improved genome annotations include additional splice isoforms and extended 3' untranslated regions for more than 4000 genes. Reanalysis of RNA-seq experiments using these annotations revealed several genes with caste-specific differential expression and tissue- or caste-specific splicing patterns that were missed in previous analyses. The extended 3' untranslated regions afforded great improvements in the analysis of existing single-cell RNA-seq data, resulting in the recovery of the transcriptomes of 18% more cells. The deeper single-cell transcriptomes obtained with these new annotations allowed us to identify additional markers for several cell types in the ant brain, as well as genes differentially expressed across castes in specific cell types. CONCLUSIONS Our results demonstrate that Iso-Seq is an efficient and effective approach to improve genome annotations and maximize the amount of information that can be obtained from existing and future genomic datasets in Harpegnathos and other organisms.
Collapse
Affiliation(s)
- Emily J Shields
- Epigenetics Institute and Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Urology and Institute of Neuropathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Masato Sorida
- Epigenetics Institute and Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Lihong Sheng
- Epigenetics Institute and Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Bogdan Sieriebriennikov
- Department of Biology, New York University, New York, NY, USA
- Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, USA
| | - Long Ding
- Department of Biology, New York University, New York, NY, USA
| | - Roberto Bonasio
- Epigenetics Institute and Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
| |
Collapse
|
25
|
Lagarrigue S, Lorthiois M, Degalez F, Gilot D, Derrien T. LncRNAs in domesticated animals: from dog to livestock species. Mamm Genome 2021; 33:248-270. [PMID: 34773482 PMCID: PMC9114084 DOI: 10.1007/s00335-021-09928-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 10/19/2021] [Indexed: 11/29/2022]
Abstract
Animal genomes are pervasively transcribed into multiple RNA molecules, of which many will not be translated into proteins. One major component of this transcribed non-coding genome is the long non-coding RNAs (lncRNAs), which are defined as transcripts longer than 200 nucleotides with low coding-potential capabilities. Domestic animals constitute a unique resource for studying the genetic and epigenetic basis of phenotypic variations involving protein-coding and non-coding RNAs, such as lncRNAs. This review presents the current knowledge regarding transcriptome-based catalogues of lncRNAs in major domesticated animals (pets and livestock species), covering a broad phylogenetic scale (from dogs to chicken), and in comparison with human and mouse lncRNA catalogues. Furthermore, we describe different methods to extract known or discover novel lncRNAs and explore comparative genomics approaches to strengthen the annotation of lncRNAs. We then detail different strategies contributing to a better understanding of lncRNA functions, from genetic studies such as GWAS to molecular biology experiments and give some case examples in domestic animals. Finally, we discuss the limitations of current lncRNA annotations and suggest research directions to improve them and their functional characterisation.
Collapse
Affiliation(s)
| | - Matthias Lorthiois
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, 2 av Prof Leon Bernard, F-35000, Rennes, France
| | - Fabien Degalez
- INRAE, INSTITUT AGRO, PEGASE UMR 1348, 35590, Saint-Gilles, France
| | - David Gilot
- CLCC Eugène Marquis, INSERM, Université Rennes, UMR_S 1242, 35000, Rennes, France
| | - Thomas Derrien
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, 2 av Prof Leon Bernard, F-35000, Rennes, France.
| |
Collapse
|
26
|
Yuan Z, Ge L, Sun J, Zhang W, Wang S, Cao X, Sun W. Integrative analysis of Iso-Seq and RNA-seq data reveals transcriptome complexity and differentially expressed transcripts in sheep tail fat. PeerJ 2021; 9:e12454. [PMID: 34760406 PMCID: PMC8571958 DOI: 10.7717/peerj.12454] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 10/18/2021] [Indexed: 01/22/2023] Open
Abstract
Background Nowadays, both customers and producers prefer thin-tailed fat sheep. To effectively breed for this phenotype, it is important to identify candidate genes and uncover the genetic mechanism related to tail fat deposition in sheep. Accumulating evidence suggesting that post-transcriptional modification events of precursor-messenger RNA (pre-mRNA), including alternative splicing (AS) and alternative polyadenylation (APA), may regulate tail fat deposition in sheep. Differentially expressed transcripts (DETs) analysis is a way to identify candidate genes related to tail fat deposition. However, due to the technological limitation, post-transcriptional modification events in the tail fat of sheep and DETs between thin-tailed and fat-tailed sheep remains unclear. Methods In the present study, we applied pooled PacBio isoform sequencing (Iso-Seq) to generate transcriptomic data of tail fat tissue from six sheep (three thin-tailed sheep and three fat-tailed sheep). By comparing with reference genome, potential gene loci and novel transcripts were identified. Post-transcriptional modification events, including AS and APA, and lncRNA in sheep tail fat were uncovered using pooled Iso-Seq data. Combining Iso-Seq data with six RNA-sequencing (RNA-Seq) data, DETs between thin- and fat-tailed sheep were identified. Protein protein interaction (PPI) network, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were implemented to investigate the potential functions of DETs. Results In the present study, we revealed the transcriptomic complexity of the tail fat of sheep, result in 9,001 potential novel gene loci, 17,834 AS events, 5,791 APA events, and 3,764 lncRNAs. Combining Iso-Seq data with RNA-Seq data, we identified hundreds of DETs between thin- and fat-tailed sheep. Among them, 21 differentially expressed lncRNAs, such as ENSOART00020036299, ENSOART00020033641, ENSOART00020024562, ENSOART00020003848 and 9.53.1 may regulate tail fat deposition. Many novel transcripts were identified as DETs, including 15.527.13 (DGAT2), 13.624.23 (ACSS2), 11.689.28 (ACLY), 11.689.18 (ACLY), 11.689.14 (ACLY), 11.660.12 (ACLY), 22.289.6 (SCD), 22.289.3 (SCD) and 22.289.14 (SCD). Most of the identified DETs have been enriched in GO and KEGG pathways related to extracellular matrix (ECM). Our result revealed the transcriptome complexity and identified many candidate transcripts in tail fat, which could enhance the understanding of molecular mechanisms behind tail fat deposition.
Collapse
Affiliation(s)
- Zehu Yuan
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou, China
| | - Ling Ge
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Jingyi Sun
- College of Veterinary Medicine, Yangzhou University, Yangzhou, China
| | - Weibo Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Shanhe Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Xiukai Cao
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou, China
| | - Wei Sun
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou, China.,College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| |
Collapse
|
27
|
Chang T, An B, Liang M, Duan X, Du L, Cai W, Zhu B, Gao X, Chen Y, Xu L, Zhang L, Gao H, Li J. PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle. Front Genet 2021; 12:664974. [PMID: 34527015 PMCID: PMC8437344 DOI: 10.3389/fgene.2021.664974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 08/06/2021] [Indexed: 12/02/2022] Open
Abstract
Cattle (Bos taurus) is one of the most widely distributed livestock species in the world, and provides us with high-quality milk and meat which have a huge impact on the quality of human life. Therefore, accurate and complete transcriptome and genome annotation are of great value to the research of cattle breeding. In this study, we used error-corrected PacBio single-molecule real-time (SMRT) data to perform whole-transcriptome profiling in cattle. Then, 22.5 Gb of subreads was generated, including 381,423 circular consensus sequences (CCSs), among which 276,295 full-length non-chimeric (FLNC) sequences were identified. After correction by Illumina short reads, we obtained 22,353 error-corrected isoforms. A total of 305 alternative splicing (AS) events and 3,795 alternative polyadenylation (APA) sites were detected by transcriptome structural analysis. Furthermore, we identified 457 novel genes, 120 putative transcription factors (TFs), and 569 novel long non-coding RNAs (lncRNAs). Taken together, this research improves our understanding and provides new insights into the complexity of full-length transcripts in cattle.
Collapse
Affiliation(s)
- Tianpeng Chang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Bingxing An
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Mang Liang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xinghai Duan
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Lili Du
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Wentao Cai
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
28
|
Lim KS, Cheng J, Putz A, Dong Q, Bai X, Beiki H, Tuggle CK, Dyck MK, Canada PG, Fortin F, Harding JCS, Plastow GS, Dekkers JCM. Quantitative analysis of the blood transcriptome of young healthy pigs and its relationship with subsequent disease resilience. BMC Genomics 2021; 22:614. [PMID: 34384354 PMCID: PMC8361860 DOI: 10.1186/s12864-021-07912-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 07/26/2021] [Indexed: 12/13/2022] Open
Abstract
Background Disease resilience, which is the ability of an animal to maintain performance under disease, is important for pigs in commercial herds, where they are exposed to various pathogens. Our objective was to investigate population-level gene expression profiles in the blood of 912 healthy F1 barrows at ~ 27 days of age for associations with performance and health before and after their exposure to a natural polymicrobial disease challenge at ~ 43 days of age. Results Most significant (q < 0.20) associations of the level of expression of individual genes in blood of young healthy pigs were identified for concurrent growth rate and subjective health scores prior to the challenge, and for mortality, a combined mortality-treatment trait, and feed conversion rate after the challenge. Gene set enrichment analyses revealed three groups of gene ontology biological process terms that were related to disease resilience: 1) immune and stress response-related terms were enriched among genes whose increased expression was unfavorably associated with both pre- and post-challenge traits, 2) heme-related terms were enriched among genes that had favorable associations with both pre- and post-challenge traits, and 3) terms related to protein localization and viral gene expression were enriched among genes that were associated with reduced performance and health traits after but not before the challenge. Conclusions Gene expression profiles in blood from young healthy piglets provide insight into their performance when exposed to disease and other stressors. The expression of genes involved in stress response, heme metabolism, and baseline expression of host genes related to virus propagation were found to be associated with host response to disease. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07912-8.
Collapse
Affiliation(s)
- Kyu-Sang Lim
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | - Jian Cheng
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | - Austin Putz
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA.,Swine Business Unit, Hendrix Genetics, Boxmeer, 5831, CK, The Netherlands
| | - Qian Dong
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA.,Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA
| | - Xuechun Bai
- Department of Agriculture, Food and Nutritional Science, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Hamid Beiki
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | | | - Michael K Dyck
- Department of Agriculture, Food and Nutritional Science, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Pig Gen Canada
- PigGen Canada Research Consortium, Guelph, Ontario, N1H4G8, Canada
| | - Frederic Fortin
- Centre de Développement du Porc du Québec Inc. (CDPQ), Québec City, QC, G1V 4M6, Canada
| | - John C S Harding
- Department of Large Animal Clinical Sciences, University of Saskatchewan, Saskatoon, SK, S7N 5A2, Canada
| | - Graham S Plastow
- Department of Agriculture, Food and Nutritional Science, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Jack C M Dekkers
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA.
| |
Collapse
|
29
|
Callahan BJ, Grinevich D, Thakur S, Balamotis MA, Yehezkel TB. Ultra-accurate microbial amplicon sequencing with synthetic long reads. MICROBIOME 2021; 9:130. [PMID: 34090540 PMCID: PMC8179091 DOI: 10.1186/s40168-021-01072-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 04/06/2021] [Indexed: 05/08/2023]
Abstract
BACKGROUND Out of the many pathogenic bacterial species that are known, only a fraction are readily identifiable directly from a complex microbial community using standard next generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge. METHODS Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads. RESULTS LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, which exceeds the accuracy reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq sequencing maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens. CONCLUSIONS The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate and long microbiome sequencing reads using standard short read sequencers will accelerate the building of quality microbial sequence databases and removes a significant hurdle on the path to precision microbial genomics. Video abstract.
Collapse
Affiliation(s)
- Benjamin J. Callahan
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC USA
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC USA
| | - Dmitry Grinevich
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC USA
| | - Siddhartha Thakur
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC USA
| | | | | |
Collapse
|
30
|
Halstead MM, Islas-Trejo A, Goszczynski DE, Medrano JF, Zhou H, Ross PJ. Large-Scale Multiplexing Permits Full-Length Transcriptome Annotation of 32 Bovine Tissues From a Single Nanopore Flow Cell. Front Genet 2021; 12:664260. [PMID: 34093657 PMCID: PMC8173071 DOI: 10.3389/fgene.2021.664260] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/06/2021] [Indexed: 12/18/2022] Open
Abstract
A comprehensive annotation of transcript isoforms in domesticated species is lacking. Especially considering that transcriptome complexity and splicing patterns are not well-conserved between species, this presents a substantial obstacle to genomic selection programs that seek to improve production, disease resistance, and reproduction. Recent advances in long-read sequencing technology have made it possible to directly extrapolate the structure of full-length transcripts without the need for transcript reconstruction. In this study, we demonstrate the power of long-read sequencing for transcriptome annotation by coupling Oxford Nanopore Technology (ONT) with large-scale multiplexing of 93 samples, comprising 32 tissues collected from adult male and female Hereford cattle. More than 30 million uniquely mapping full-length reads were obtained from a single ONT flow cell, and used to identify and characterize the expression dynamics of 99,044 transcript isoforms at 31,824 loci. Of these predicted transcripts, 21% exactly matched a reference transcript, and 61% were novel isoforms of reference genes, substantially increasing the ratio of transcript variants per gene, and suggesting that the complexity of the bovine transcriptome is comparable to that in humans. Over 7,000 transcript isoforms were extremely tissue-specific, and 61% of these were attributed to testis, which exhibited the most complex transcriptome of all interrogated tissues. Despite profiling over 30 tissues, transcription was only detected at about 60% of reference loci. Consequently, additional studies will be necessary to continue characterizing the bovine transcriptome in additional cell types, developmental stages, and physiological conditions. However, by here demonstrating the power of ONT sequencing coupled with large-scale multiplexing, the task of exhaustively annotating the bovine transcriptome - or any mammalian transcriptome - appears significantly more feasible.
Collapse
Affiliation(s)
| | | | | | | | | | - Pablo J. Ross
- Department of Animal Science, University of California, Davis, Davis, CA, United States
| |
Collapse
|
31
|
Yu X, Yu K, Chen B, Liao Z, Qin Z, Yao Q, Huang Y, Liang J, Huang W. Nanopore long-read RNAseq reveals regulatory mechanisms of thermally variable reef environments promoting heat tolerance of scleractinian coral Pocillopora damicornis. ENVIRONMENTAL RESEARCH 2021; 195:110782. [PMID: 33503412 DOI: 10.1016/j.envres.2021.110782] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 01/15/2021] [Accepted: 01/19/2021] [Indexed: 06/12/2023]
Abstract
Some scleractinian corals exhibit high thermal adaptability to climate changes, although the mechanism of their adaptation is unclear. This study investigated the adaptability of scleractinian coral Pocillopora damicornis to thermally variable reef environments by applying a nanopore-based RNA sequencing method to characterize different transcription responses that promote heat tolerance of P. damicornis. We identified 1414 novel genes and optimized 6256 mis-annotated loci. Based on full-length transcriptome data, we identified complex alternative polyadenylation and alternative splicing events, which can improve our understanding of the genome annotation and gene structures of P. damicornis. Furthermore, we constructed differentially expressed lncRNA-mRNA co-expression networks, which may play a crucial role in the P. damicornis thermal adaptive response. KEGG function enrichment analysis revealed that P. damicornis from the high-temperature pool had a lower metabolic rate than that from the low-temperature pool. We hypothesize that metabolic readjustment, in the form of a lower metabolic rate, positively correlated with increased heat tolerance in P. damicornis in thermally variable reef environments. Our study provides novel insights into lncRNAs that promote thermally tolerance of scleractinian corals in the thermally variable reef environment, suggesting potential mechanisms for their adaptation to global warming in the future.
Collapse
Affiliation(s)
- Xiaopeng Yu
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Kefu Yu
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China; Southern Marine Science and Engineering Guangdong Laboratory, Zhuhai, China.
| | - Biao Chen
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Zhiheng Liao
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Zhenjun Qin
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Qiucui Yao
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Yanhua Huang
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Jiayuan Liang
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| | - Wen Huang
- Guangxi Laboratory on the Study of Coral Reefs in the South China Sea, Coral Reef Research Center of China, School of Marine Sciences, Guangxi University, Nanning, China
| |
Collapse
|
32
|
Dong Q, Lunney JK, Lim KS, Nguyen Y, Hess AS, Beiki H, Rowland RRR, Walker K, Reecy JM, Tuggle CK, Dekkers JCM. Gene expression in tonsils in swine following infection with porcine reproductive and respiratory syndrome virus. BMC Vet Res 2021; 17:88. [PMID: 33618723 PMCID: PMC7901068 DOI: 10.1186/s12917-021-02785-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 02/02/2021] [Indexed: 12/18/2022] Open
Abstract
Background Porcine reproductive and respiratory syndrome (PRRS) is a threat to pig production worldwide. Our objective was to understand mechanisms of persistence of PRRS virus (PRRSV) in tonsil. Transcriptome data from tonsil samples collected at 42 days post infection (dpi) were generated by RNA-seq and NanoString on 51 pigs that were selected to contrast the two PRRSV isolates used, NVSL and KS06, high and low tonsil viral level at 42 dpi, and the favorable and unfavorable genotypes at a genetic marker (WUR) for the putative PRRSV resistance gene GBP5. Results The number of differentially expressed genes (DEGs) differed markedly between models with and without accounting for cell-type enrichments (CE) in the samples that were predicted from the RNA-seq data. This indicates that differences in cell composition in tissues that consist of multiple cell types, such as tonsil, can have a large impact on observed differences in gene expression. Based on both the NanoString and the RNA-seq data, KS06-infected pigs showed greater activation, or less inhibition, of immune response in tonsils at 42 dpi than NVSL-infected pigs, with and without accounting for CE. This suggests that the NVSL virus may be better than the KS06 virus at evading host immune response and persists in tonsils by weakening, or preventing, host immune responses. Pigs with high viral levels showed larger CE of immune cells than low viral level pigs, potentially to trigger stronger immune responses. Presence of high tonsil virus was associated with a stronger immune response, especially innate immune response through interferon signaling, but these differences were not significant when accounting for CE. Genotype at WUR was associated with different effects on immune response in tonsils of pigs during the persistence stage, depending on viral isolate and tonsil viral level. Conclusions Results of this study provide insights into the effects of PRRSV isolate, tonsil viral level, and WUR genotype on host immune response and into potential mechanisms of PRRSV persistence in tonsils that could be targeted to improve strategies to reduce viral rebreaks. Finally, to understand transcriptome responses in tissues that consist of multiple cell types, it is important to consider differences in cell composition. Supplementary Information The online version contains supplementary material available at 10.1186/s12917-021-02785-1.
Collapse
Affiliation(s)
- Qian Dong
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | | | - Kyu-Sang Lim
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | - Yet Nguyen
- Department of Statistics, Iowa State University, Ames, Iowa, 50011, USA
| | - Andrew S Hess
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | - Hamid Beiki
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | - Raymond R R Rowland
- College of Veterinary Medicine, University of Illinois at Urbana-Champaign, Urbana, IL, 61802, USA
| | | | - James M Reecy
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA
| | | | - Jack C M Dekkers
- Department of Animal Science, Iowa State University, Ames, Iowa, 50011, USA.
| |
Collapse
|
33
|
Donovan LL, Johansen JV, Ros NF, Jaberi E, Linnet K, Johansen SS, Ozenne B, Issazadeh-Navikas S, Hansen HD, Knudsen GM. Effects of a single dose of psilocybin on behaviour, brain 5-HT 2A receptor occupancy and gene expression in the pig. Eur Neuropsychopharmacol 2021; 42:1-11. [PMID: 33288378 DOI: 10.1016/j.euroneuro.2020.11.013] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 11/12/2020] [Accepted: 11/16/2020] [Indexed: 12/20/2022]
Abstract
Psilocybin has in some studies shown promise as treatment of major depressive disorder and psilocybin therapy was in 2019 twice designated as breakthrough therapy by the U.S. Food and Drug Administration (FDA). A very particular feature is that ingestion of just a single dose of psilocybin is associated with lasting changes in personality and mood. The underlying molecular mechanism behind its effect is, however, unknown. In a translational pig model, we here present the effects of a single dose of psilocybin on pig behaviour, receptor occupancy and gene expression in the brain. An acute i.v. injection of 0.08 mg/kg psilocybin to awake female pigs induced characteristic behavioural changes in terms of headshakes, scratching and rubbing, lasting around 20 min. A similar dose was associated with a cerebral 5-HT2A receptor occupancy of 67%, as determined by positron emission tomography, and plasma psilocin levels were comparable to what in humans is associated with an intense psychedelic experience. We found that 19 genes were differentially expressed in prefrontal cortex one day after psilocybin injection, and 3 genes after 1 week. Gene Set Enrichment Analysis demonstrated that multiple immunological pathways were regulated 1 week after psilocybin exposure. This provides a framework for future investigations of the lasting molecular mechanisms induced by a single dose of psilocybin. In the light of an ongoing debate as to whether psilocybin is a safe treatment for depression and other mental illnesses, it is reassuring that our data suggest that any effects on gene expression are very modest.
Collapse
Affiliation(s)
- Lene Lundgaard Donovan
- Neurobiology Research Unit 8057 and The Center for Experimental Medicine Neuropharmacology, Copenhagen University Hospital Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen O, Denmark; Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Jens Vilstrup Johansen
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Nídia Fernandez Ros
- Neurobiology Research Unit 8057 and The Center for Experimental Medicine Neuropharmacology, Copenhagen University Hospital Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen O, Denmark
| | - Elham Jaberi
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Kristian Linnet
- Department of Forensic Medicine, Section of Forensic Chemistry, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Sys Stybe Johansen
- Department of Forensic Medicine, Section of Forensic Chemistry, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Brice Ozenne
- Neurobiology Research Unit 8057 and The Center for Experimental Medicine Neuropharmacology, Copenhagen University Hospital Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen O, Denmark; Department of Public Health, Section of Biostatistics, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Shohreh Issazadeh-Navikas
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Hanne Demant Hansen
- Neurobiology Research Unit 8057 and The Center for Experimental Medicine Neuropharmacology, Copenhagen University Hospital Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen O, Denmark
| | - Gitte Moos Knudsen
- Neurobiology Research Unit 8057 and The Center for Experimental Medicine Neuropharmacology, Copenhagen University Hospital Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen O, Denmark; Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark.
| |
Collapse
|
34
|
Tracing selection signatures in the pig genome gives evidence for selective pressures on a unique curly hair phenotype in Mangalitza. Sci Rep 2020; 10:22142. [PMID: 33335158 PMCID: PMC7747725 DOI: 10.1038/s41598-020-79037-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 12/02/2020] [Indexed: 12/30/2022] Open
Abstract
Selection for desirable traits and breed-specific phenotypes has left distinctive footprints in the genome of pigs. As representative of a breed with strong selective traces aiming for robustness, health and performance, the Mangalitza pig, a native curly-haired pig breed from Hungary, was investigated in this study. Whole genome sequencing and SNP chip genotyping was performed to detect runs of homozygosity (ROH) in Mangalitza and Mangalitza-crossbreeds. We identified breed specific ROH regions harboring genes associated with the development of the curly hair type and further characteristics of this breed. Further analysis of two matings of Mangalitza with straight-coated pig breeds confirmed an autosomal dominant inheritance of curly hair. Subsequent scanning of the genome for variant effects on this trait revealed two variants potentially affecting hair follicle development and differentiation. Validation in a large sample set as well as in imputed SNP data confirmed these variants to be Mangalitza-specific. Herein, we demonstrated how strong artificial selection has shaped the genome in Mangalitza pigs and left traces in the form of selection signatures. This knowledge on genomic variation promoting unique phenotypes like curly hair provides an important resource for futures studies unraveling genetic effects for special characteristics in livestock.
Collapse
|
35
|
Feng W, Zhao P, Zheng X, Hu Z, Liu J. Profiling Novel Alternative Splicing within Multiple Tissues Provides Useful Insights into Porcine Genome Annotation. Genes (Basel) 2020; 11:genes11121405. [PMID: 33255998 PMCID: PMC7760890 DOI: 10.3390/genes11121405] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 11/24/2020] [Accepted: 11/24/2020] [Indexed: 12/22/2022] Open
Abstract
Alternative splicing (AS) is a process during gene expression that results in a single gene coding for different protein variants. AS contributes to transcriptome and proteome diversity. In order to characterize AS in pigs, genome-wide transcripts and AS events were detected using RNA sequencing of 34 different tissues in Duroc pigs. In total, 138,403 AS events and 29,270 expressed genes were identified. An alternative donor site was the most common AS form and accounted for 44% of the total AS events. The percentage of the other three AS forms (exon skipping, alternative acceptor site, and intron retention) was approximately 19%. The results showed that the most common AS events involving alternative donor sites could produce different transcripts or proteins that affect the biological processes. The expression of genes with tissue-specific AS events showed that gene functions were consistent with tissue functions. AS increased proteome diversity and resulted in novel proteins that gained or lost important functional domains. In summary, these findings extend porcine genome annotation and highlight roles that AS could play in determining tissue identity.
Collapse
|
36
|
Salavati M, Caulton A, Clark R, Gazova I, Smith TPL, Worley KC, Cockett NE, Archibald AL, Clarke SM, Murdoch BM, Clark EL. Global Analysis of Transcription Start Sites in the New Ovine Reference Genome ( Oar rambouillet v1.0). Front Genet 2020; 11:580580. [PMID: 33193703 PMCID: PMC7645153 DOI: 10.3389/fgene.2020.580580] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Accepted: 09/09/2020] [Indexed: 02/04/2023] Open
Abstract
The overall aim of the Ovine FAANG project is to provide a comprehensive annotation of the new highly contiguous sheep reference genome sequence (Oar rambouillet v1.0). Mapping of transcription start sites (TSS) is a key first step in understanding transcript regulation and diversity. Using 56 tissue samples collected from the reference ewe Benz2616, we have performed a global analysis of TSS and TSS-Enhancer clusters using Cap Analysis Gene Expression (CAGE) sequencing. CAGE measures RNA expression by 5' cap-trapping and has been specifically designed to allow the characterization of TSS within promoters to single-nucleotide resolution. We have adapted an analysis pipeline that uses TagDust2 for clean-up and trimming, Bowtie2 for mapping, CAGEfightR for clustering, and the Integrative Genomics Viewer (IGV) for visualization. Mapping of CAGE tags indicated that the expression levels of CAGE tag clusters varied across tissues. Expression profiles across tissues were validated using corresponding polyA+ mRNA-Seq data from the same samples. After removal of CAGE tags with <10 read counts, 39.3% of TSS overlapped with 5' ends of 31,113 transcripts that had been previously annotated by NCBI (out of a total of 56,308 from the NCBI annotation). For 25,195 of the transcripts, previously annotated by NCBI, no TSS meeting stringent criteria were identified. A further 14.7% of TSS mapped to within 50 bp of annotated promoter regions. Intersecting these predicted TSS regions with annotated promoter regions (±50 bp) revealed 46% of the predicted TSS were "novel" and previously un-annotated. Using whole-genome bisulfite sequencing data from the same tissues, we were able to determine that a proportion of these "novel" TSS were hypo-methylated (32.2%) indicating that they are likely to be reproducible rather than "noise". This global analysis of TSS in sheep will significantly enhance the annotation of gene models in the new ovine reference assembly. Our analyses provide one of the highest resolution annotations of transcript regulation and diversity in a livestock species to date.
Collapse
Affiliation(s)
- Mazdak Salavati
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
- Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | - Alex Caulton
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
- Genetics Otago, Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Richard Clark
- Genetics Core, Edinburgh Clinical Research Facility, The University of Edinburgh, Edinburgh, United Kingdom
| | - Iveta Gazova
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
- MRC Human Genetics Unit, The University of Edinburgh, Edinburgh, United Kingdom
| | - Timothy P. L. Smith
- USDA, Agricultural Research Service, U.S. Meat Animal Research Center, Clay Center, NE, United States
| | - Kim C. Worley
- Baylor College of Medicine, Houston, TX, United States
| | - Noelle E. Cockett
- Department of Animal, Dairy and Veterinary Sciences, Utah State University, Logan, UT, United States
| | - Alan L. Archibald
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
| | | | - Brenda M. Murdoch
- Department of Animal, Veterinary and Food Sciences, University of Idaho, Moscow, ID, United States
| | - Emily L. Clark
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
- Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| |
Collapse
|
37
|
Müller T, Boileau E, Talyan S, Kehr D, Varadi K, Busch M, Most P, Krijgsveld J, Dieterich C. Updated and enhanced pig cardiac transcriptome based on long-read RNA sequencing and proteomics. J Mol Cell Cardiol 2020; 150:23-31. [PMID: 33049256 DOI: 10.1016/j.yjmcc.2020.10.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 09/29/2020] [Accepted: 10/05/2020] [Indexed: 12/14/2022]
Abstract
Clinically translatable large animal models have become indispensable for cardiovascular research, clinically relevant proof of concept studies and for novel therapeutic interventions. In particular, the pig has emerged as an essential cardiovascular disease model, because its heart, circulatory system, and blood supply are anatomically and functionally similar to that of humans. Currently, molecular and omics-based studies in the pig are hampered by the incompleteness of the genome and the lack of diversity of the corresponding transcriptome annotation. Here, we employed Nanopore long-read sequencing and in-depth proteomics on top of Illumina RNA-seq to enhance the pig cardiac transcriptome annotation. We assembled 15,926 transcripts, stratified into coding and non-coding, and validated our results by complementary mass spectrometry. A manual review of several gene loci, which are associated with cardiac function, corroborated the utility of our enhanced annotation. All our data are available for download and are provided as tracks for integration in genome browsers. We deem this resource as highly valuable for molecular research in an increasingly relevant large animal model.
Collapse
Affiliation(s)
- Torsten Müller
- German Cancer Research Center (DKFZ), Functional and Structural Genomics, Im Neuenheimer Feld 581, 69120 Heidelberg, Germany; Heidelberg University, Medical Faculty, Im Neuenheimer Feld 672, 69120 Heidelberg, Germany
| | - Etienne Boileau
- Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, University Hospital Heidelberg, Heidelberg, Germany; German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany
| | - Sweta Talyan
- Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, University Hospital Heidelberg, Heidelberg, Germany; German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany
| | - Dorothea Kehr
- German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany; Division of Molecular and Translational Cardiology, Department of Internal Medicine III, Heidelberg University Hospital, INF 410, 69120 Heidelberg, Germany
| | - Karl Varadi
- German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany; Division of Molecular and Translational Cardiology, Department of Internal Medicine III, Heidelberg University Hospital, INF 410, 69120 Heidelberg, Germany
| | - Martin Busch
- German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany; Division of Molecular and Translational Cardiology, Department of Internal Medicine III, Heidelberg University Hospital, INF 410, 69120 Heidelberg, Germany
| | - Patrick Most
- German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany; Division of Molecular and Translational Cardiology, Department of Internal Medicine III, Heidelberg University Hospital, INF 410, 69120 Heidelberg, Germany; Center for Translational Medicine, Jefferson University, 1020 Locust Street, Philadelphia, PA 19107, USA
| | - Jeroen Krijgsveld
- German Cancer Research Center (DKFZ), Functional and Structural Genomics, Im Neuenheimer Feld 581, 69120 Heidelberg, Germany; Heidelberg University, Medical Faculty, Im Neuenheimer Feld 672, 69120 Heidelberg, Germany
| | - Christoph Dieterich
- Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, University Hospital Heidelberg, Heidelberg, Germany; German Center for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, 69120 Heidelberg, Germany.
| |
Collapse
|
38
|
Hu H, Jia Q, Xi J, Zhou B, Li Z. Integrated analysis of lncRNA, miRNA and mRNA reveals novel insights into the fertility regulation of large white sows. BMC Genomics 2020; 21:636. [PMID: 32928107 PMCID: PMC7490888 DOI: 10.1186/s12864-020-07055-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 09/06/2020] [Indexed: 01/20/2023] Open
Abstract
Background Improving sow fertility is extremely important as it can lead to increased reproductive efficiency and thus profitability for swine producers. There are considerable differences in fertility rates among individual animals, but the underlying molecular mechanisms remain unclear. In this study, by using different types of RNA libraries, we investigated the complete transcriptome of ovarian tissue during the luteal (L) and follicular (F) phases of the estrous cycle in Large White pigs with high (H) and low (L) fecundity, and performed a comprehensive analysis of long noncoding RNAs (lncRNAs), mRNAs and micro RNAs (miRNAs) from 16 samples by combining RNA sequencing (RNA-seq) with bioinformatics. Results In total, 24,447 lncRNAs, 27,370 mRNAs, and 216 known miRNAs were identified in ovarian tissues. The genomic features of lncRNAs, such as length distribution and number of exons, were further analyzed. We selected a threshold of P < 0.05 and |log2 (fold change)| ≥ 1 to obtain the differentially expressed lncRNAs, miRNAs and mRNAs by pairwise comparison (LH vs. LL, FH vs. FL). Bioinformatics analysis of these differentially expressed RNAs revealed multiple significantly enriched pathways (P < 0.05) that were closely involved in the reproductive process, such as ovarian steroidogenesis, lysosome, steroid biosynthesis, and the estrogen and GnRH signaling pathways. Moreover, bioinformatics screening of differentially expressed miRNAs that share common miRNA response elements (MREs) with lncRNAs and their downstream mRNA targets were performed. Finally, we constructed lncRNA–miRNA–mRNA regulation networks. The key genes in these networks were verified by Reverse Transcription Real-time Quantitative PCR (RT-qRCR), which were consistent with the results from RNA-Seq data. Conclusions These results provide further insights into the fertility of pigs andcan contribute to further experimental investigation of the functions of these genes.
Collapse
Affiliation(s)
- Huiyan Hu
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, Hebei Agricultural University, Lekai South Street No. 2596, Baoding, 071000, Hebei, China
| | - Qing Jia
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, Hebei Agricultural University, Lekai South Street No. 2596, Baoding, 071000, Hebei, China. .,Engineering Research Center for Agriculture in Hebei Mountainous Areas, Baoding, 071000, Hebei, China.
| | - Jianzhong Xi
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, Hebei Agricultural University, Lekai South Street No. 2596, Baoding, 071000, Hebei, China
| | - Bo Zhou
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, Hebei Agricultural University, Lekai South Street No. 2596, Baoding, 071000, Hebei, China
| | - Zhiqiang Li
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, Hebei Agricultural University, Lekai South Street No. 2596, Baoding, 071000, Hebei, China
| |
Collapse
|
39
|
Warr A, Affara N, Aken B, Beiki H, Bickhart DM, Billis K, Chow W, Eory L, Finlayson HA, Flicek P, Girón CG, Griffin DK, Hall R, Hannum G, Hourlier T, Howe K, Hume DA, Izuogu O, Kim K, Koren S, Liu H, Manchanda N, Martin FJ, Nonneman DJ, O'Connor RE, Phillippy AM, Rohrer GA, Rosen BD, Rund LA, Sargent CA, Schook LB, Schroeder SG, Schwartz AS, Skinner BM, Talbot R, Tseng E, Tuggle CK, Watson M, Smith TPL, Archibald AL. An improved pig reference genome sequence to enable pig genetics and genomics research. Gigascience 2020; 9:5858065. [PMID: 32543654 PMCID: PMC7448572 DOI: 10.1093/gigascience/giaa051] [Citation(s) in RCA: 153] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 03/12/2020] [Accepted: 04/22/2020] [Indexed: 01/05/2023] Open
Abstract
Background The domestic pig (Sus scrofa) is important both as a food source and
as a biomedical model given its similarity in size, anatomy, physiology, metabolism,
pathology, and pharmacology to humans. The draft reference genome (Sscrofa10.2) of a
purebred Duroc female pig established using older clone-based sequencing methods was
incomplete, and unresolved redundancies, short-range order and orientation errors, and
associated misassembled genes limited its utility. Results We present 2 annotated highly contiguous chromosome-level genome assemblies created
with more recent long-read technologies and a whole-genome shotgun strategy, 1 for the
same Duroc female (Sscrofa11.1) and 1 for an outbred, composite-breed male (USMARCv1.0).
Both assemblies are of substantially higher (>90-fold) continuity and accuracy than
Sscrofa10.2. Conclusions These highly contiguous assemblies plus annotation of a further 11 short-read
assemblies provide an unprecedented view of the genetic make-up of this important
agricultural and biomedical model species. We propose that the improved Duroc assembly
(Sscrofa11.1) become the reference genome for genomic research in pigs.
Collapse
Affiliation(s)
- Amanda Warr
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Nabeel Affara
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QP, UK
| | - Bronwen Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Hamid Beiki
- Department of Animal Science, 2255 Kildee Hall, Iowa State University, Ames, IA 50011-3150, USA
| | - Derek M Bickhart
- Dairy Forage Research Center, USDA-ARS, 1925 Linden Drive, Madison, WI 53706, USA
| | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - William Chow
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Lel Eory
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Heather A Finlayson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Carlos G Girón
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Darren K Griffin
- School of Biosciences, University of Kent, Giles Lane, Canterbury CT2 7NJ, UK
| | - Richard Hall
- Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA
| | | | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Kerstin Howe
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - David A Hume
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.,Mater Research Institute-University of Queensland, Translational Research Institute, Brisbane QLD 4104, Australia
| | - Osagie Izuogu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Kristi Kim
- Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Haibou Liu
- Department of Animal Science, 2255 Kildee Hall, Iowa State University, Ames, IA 50011-3150, USA
| | - Nancy Manchanda
- Bioinformatics and Computational Biology Program, Iowa State University, 2014 Molecular Biology Building, Ames, IA 50011, USA
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Dan J Nonneman
- USDA-ARS U.S. Meat Animal Research Center, 844 Road 313, Clay Center, NE 68933, USA
| | - Rebecca E O'Connor
- School of Biosciences, University of Kent, Giles Lane, Canterbury CT2 7NJ, UK
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Gary A Rohrer
- USDA-ARS U.S. Meat Animal Research Center, 844 Road 313, Clay Center, NE 68933, USA
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, USDA-ARS, 10300 Baltimore Avenue, Beltsville, MD 20705-2350, USA
| | - Laurie A Rund
- Department of Animal Sciences, University of Illinois, 1201 West Gregory Drive, Urbana, IL 61801, USA
| | - Carole A Sargent
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QP, UK
| | - Lawrence B Schook
- Department of Animal Sciences, University of Illinois, 1201 West Gregory Drive, Urbana, IL 61801, USA
| | - Steven G Schroeder
- Animal Genomics and Improvement Laboratory, USDA-ARS, 10300 Baltimore Avenue, Beltsville, MD 20705-2350, USA
| | | | - Ben M Skinner
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QP, UK
| | - Richard Talbot
- Edinburgh Genomics, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK
| | - Elizabeth Tseng
- Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA
| | - Christopher K Tuggle
- Department of Animal Science, 2255 Kildee Hall, Iowa State University, Ames, IA 50011-3150, USA.,Bioinformatics and Computational Biology Program, Iowa State University, 2014 Molecular Biology Building, Ames, IA 50011, USA
| | - Mick Watson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Timothy P L Smith
- USDA-ARS U.S. Meat Animal Research Center, 844 Road 313, Clay Center, NE 68933, USA
| | - Alan L Archibald
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| |
Collapse
|
40
|
Huang T, Yang M, Dong K, Xu M, Liu J, Chen Z, Zhu S, Chen W, Yin J, Jin K, Deng Y, Guan Z, Huang X, Yang J, Han R, Yao M. A transcriptional landscape of 28 porcine tissues obtained by super deepSAGE sequencing. BMC Genomics 2020; 21:229. [PMID: 32171242 PMCID: PMC7071599 DOI: 10.1186/s12864-020-6628-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Accepted: 02/26/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Gene expression regulators identified in transcriptome profiling experiments may serve as ideal targets for genetic manipulations in farm animals. RESULTS In this study, we developed a gene expression profile of 76,000+ unique transcripts for 224 porcine samples from 28 tissues collected from 32 animals using Super deepSAGE technology. Excellent sequencing depth was achieved for each multiplexed library, and replicated samples from the same tissues clustered together, demonstrating the high quality of Super deepSAGE data. Comparison with previous research indicated that our results not only have good reproducibility but also have greatly extended the coverage of the sample types as well as the number of genes. Clustering analysis revealed ten groups of genes showing distinct expression patterns among these samples. Our analysis of over-represented binding motifs identified 41 regulators, and we demonstrated a potential application of this dataset in infectious diseases and immune biology research by identifying an LPS-dependent transcription factor, runt-related transcription factor 1 (RUNX1), in peripheral blood mononuclear cells (PBMCs). The selected genes are specifically responsible for the transcription of toll-like receptor 2 (TLR2), lymphocyte-specific protein tyrosine kinase (LCK), and vav1 oncogene (VAV1), which belong to the T and B cell signaling pathways. CONCLUSIONS The Super deepSAGE technology and tissue-differential expression profiles are valuable resources for investigating the porcine gene expression regulation. The identified RUNX1 target genes belong to the T and B cell signaling pathways, making them novel potential targets for the diagnosis and therapy of bacterial infections and other immune disorders.
Collapse
Affiliation(s)
- Tinghua Huang
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Min Yang
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Kaihui Dong
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Mingjiang Xu
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Jinhui Liu
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Zhi Chen
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Shijia Zhu
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Wang Chen
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Jun Yin
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Kai Jin
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Yu Deng
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Zhou Guan
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Xiali Huang
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Jun Yang
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Rongxun Han
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China
| | - Min Yao
- College of Animal Science, Yangtze University, Jingzhou, 434025, Hubei, China.
| |
Collapse
|
41
|
Summers KM, Bush SJ, Wu C, Su AI, Muriuki C, Clark EL, Finlayson HA, Eory L, Waddell LA, Talbot R, Archibald AL, Hume DA. Functional Annotation of the Transcriptome of the Pig, Sus scrofa, Based Upon Network Analysis of an RNAseq Transcriptional Atlas. Front Genet 2020; 10:1355. [PMID: 32117413 PMCID: PMC7034361 DOI: 10.3389/fgene.2019.01355] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Accepted: 12/11/2019] [Indexed: 12/15/2022] Open
Abstract
The domestic pig (Sus scrofa) is both an economically important livestock species and a model for biomedical research. Two highly contiguous pig reference genomes have recently been released. To support functional annotation of the pig genomes and comparative analysis with large human transcriptomic data sets, we aimed to create a pig gene expression atlas. To achieve this objective, we extended a previous approach developed for the chicken. We downloaded RNAseq data sets from public repositories, down-sampled to a common depth, and quantified expression against a reference transcriptome using the mRNA quantitation tool, Kallisto. We then used the network analysis tool Graphia to identify clusters of transcripts that were coexpressed across the merged data set. Consistent with the principle of guilt-by-association, we identified coexpression clusters that were highly tissue or cell-type restricted and contained transcription factors that have previously been implicated in lineage determination. Other clusters were enriched for transcripts associated with biological processes, such as the cell cycle and oxidative phosphorylation. The same approach was used to identify coexpression clusters within RNAseq data from multiple individual liver and brain samples, highlighting cell type, process, and region-specific gene expression. Evidence of conserved expression can add confidence to assignment of orthology between pig and human genes. Many transcripts currently identified as novel genes with ENSSSCG or LOC IDs were found to be coexpressed with annotated neighbouring transcripts in the same orientation, indicating they may be products of the same transcriptional unit. The meta-analytic approach to utilising public RNAseq data is extendable to include new data sets and new species and provides a framework to support the Functional Annotation of Animals Genomes (FAANG) initiative.
Collapse
Affiliation(s)
- Kim M. Summers
- Mater Research Institute-University of Queensland, Translational Research Institute, Woolloongabba, QLD, Australia
| | - Stephen J. Bush
- Nuffield Department of Clinical Medicine, University of Oxford, Oxford, United Kingdom
| | - Chunlei Wu
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, United States
| | - Andrew I. Su
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, United States
| | - Charity Muriuki
- The Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | - Emily L. Clark
- The Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | | | - Lel Eory
- The Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | - Lindsey A. Waddell
- The Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | - Richard Talbot
- The Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | - Alan L. Archibald
- The Roslin Institute, University of Edinburgh, Midlothian, United Kingdom
| | - David A. Hume
- Mater Research Institute-University of Queensland, Translational Research Institute, Woolloongabba, QLD, Australia
| |
Collapse
|