1
|
Qiao G, Xu P, Guo T, He X, Yue Y, Yang B. Genome-wide detection of structural variation in some sheep breeds using whole-genome long-read sequencing data. J Anim Breed Genet 2024; 141:403-414. [PMID: 38247268 DOI: 10.1111/jbg.12846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/21/2023] [Accepted: 12/29/2023] [Indexed: 01/23/2024]
Abstract
Genomic structural variants (SVs) constitute a significant proportion of genetic variation in the genome. The rapid development of long-reads sequencing has facilitated the detection of long-fragment SVs. There is no published study to detect SVs using long-read data from sheep. We applied a long-read mapping approach to detect SVs and characterized a total of 30,771 insertions, deletions, inversions and translocations. We identified 716, 916, 842 and 303 specific SVs in Southdown sheep, Alpine merino sheep, Qilian White Tibetan sheep and Oula sheep, respectively. We annotated these SVs and found that these SV-related genes were primarily enriched in the well-established pathways involved in the regulation of the immune system, growth and development and environmental adaptability. We detected and annotated SVs based on NGS resequencing data to validate the accuracy based on third-generation detection. Moreover, five candidate SVs were verified using the PCR method in 50 sheep. Our study is the first to use a long-reads sequencing approach to construct a novel structural variation map in sheep. We have completed a preliminary exploration of the potential effects of SVs on sheep.
Collapse
Affiliation(s)
- Guoyan Qiao
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
- College of Ecological Agriculture and Animal Husbandry, Qinghai Communications Technical College, Xining, China
| | - Pan Xu
- State Key Laboratory of Grassland Agro-Ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, China
| | - Tingting Guo
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Xue He
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Yaojing Yue
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Bohui Yang
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| |
Collapse
|
2
|
Shi K, Dong H, Du H, Li Y, Zhou L, Liang C, Şakiroğlu M, Wang Z. The chromosome-level assembly of the wild diploid alfalfa genome provides insights into the full landscape of genomic variations between cultivated and wild alfalfa. PLANT BIOTECHNOLOGY JOURNAL 2024; 22:1757-1772. [PMID: 38288521 PMCID: PMC11123407 DOI: 10.1111/pbi.14300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 11/22/2023] [Accepted: 01/15/2024] [Indexed: 05/25/2024]
Abstract
Alfalfa (Medicago sativa L.) is one of the most important forage legumes in the world, including autotetraploid (M. sativa ssp. sativa) and diploid alfalfa (M. sativa ssp. caerulea, progenitor of autotetraploid alfalfa). Here, we reported a high-quality genome of ZW0012 (diploid alfalfa, 769 Mb, contig N50 = 5.5 Mb), which was grouped into the Northern group in population structure analysis, suggesting that our genome assembly filled a major gap among the members of M. sativa complex. During polyploidization, large phenotypic differences occurred between diploids and tetraploids, and the genetic information underlying its massive phenotypic variations remains largely unexplored. Extensive structural variations (SVs) were identified between ZW0012 and XinJiangDaYe (an autotetraploid alfalfa with released genome). We identified 71 ZW0012-specific PAV genes and 1296 XinJiangDaYe-specific PAV genes, mainly involved in defence response, cell growth, and photosynthesis. We have verified the positive roles of MsNCR1 (a XinJiangDaYe-specific PAV gene) in nodulation using an Agrobacterium rhizobia-mediated transgenic method. We also demonstrated that MsSKIP23_1 and MsFBL23_1 (two XinJiangDaYe-specific PAV genes) regulated leaf size by transient overexpression and virus-induced gene silencing analysis. Our study provides a high-quality reference genome of an important diploid alfalfa germplasm and a valuable resource of variation landscape between diploid and autotetraploid, which will facilitate the functional gene discovery and molecular-based breeding for the cultivars in the future.
Collapse
Affiliation(s)
- Kun Shi
- College of Grassland Science and TechnologycChina Agricultural UniversityBeijingChina
| | - Hongbin Dong
- College of Grassland Science and TechnologycChina Agricultural UniversityBeijingChina
| | - Huilong Du
- School of Life Sciences, Institute of Life Sciences and Green DevelopmentHebei UniversityBaodingChina
| | - Yuxian Li
- School of Life SciencesNorth China University of Science and TechnologyTangshanChina
| | - Le Zhou
- College of Grassland Science and TechnologycChina Agricultural UniversityBeijingChina
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
| | - Muhammet Şakiroğlu
- Department of BioengineeringAdana AlparslanTürkeş Science and Technology UniversityAdanaTurkey
| | - Zan Wang
- College of Grassland Science and TechnologycChina Agricultural UniversityBeijingChina
| |
Collapse
|
3
|
Fernandez-Muñoz JM, Guerrero-Gimenez ME, Ciocca LA, Germanó MJ, Zoppino FCM. Mutational landscape of HSP family on human breast cancer. Sci Rep 2024; 14:12471. [PMID: 38816397 PMCID: PMC11139924 DOI: 10.1038/s41598-024-61807-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 05/09/2024] [Indexed: 06/01/2024] Open
Abstract
Breast cancer (BRCA) is a prevalent malignancy with the highest incidence among females. BRCA can be categorized into five intrinsic molecular subtypes (LumA, LumB, HER2, Basal, and Normal), each characterized by varying molecular and clinical features determined by the expression of intrinsic genes (PAM50). The Heat Shock Protein (HSP) family is composed of 95 genes evolutionary conservated, they have critical roles in proteostasis in both normal and cancerous processes. Many studies have linked HSP to the development and spread of cancer. They modulate the activity of multiple proteins expressed by oncogenes and anti-oncogenes through a range of interactions. In this study, we evaluate the mutational changes that HSP undergoes in BRCA mainly from the TCGA database. We observe that Copy Number Variations (CNV) are the more frequent events analyzed surpassing the occurrence of point mutations, indels, and translation start site mutations. The Basal subtype showcased the highest count of amplified CNV, including subtype-specific changes, whereas the Luminals tumors accumulated the greatest number of deletion CNV. Meanwhile, the HER2 subtype exhibited a comparatively lower frequency of CNV alterations when compared to the other subtypes. This study integrates CNV and expression data, finding associations between these two variables and the influence of CNV on the deregulation of HSP expression. To enhance the role of HSP as a risk predictor in BRCA, we succeeded in identifying CNV profiles as a prognostic marker. We included Artificial Intelligence to improve the clustering of patients, and we achieved a molecular CNV signature as a significant risk factor independent of known classic markers, including molecular subtypes PAM50. This research enhances the comprehension of HSP DNA alterations in BRCA and its relation with predicting the risk of affected individuals providing insights to develop guide personalized treatment strategies.
Collapse
Affiliation(s)
- Juan Manuel Fernandez-Muñoz
- Laboratory of Data Science and Genomics, IMBECU CONICET UNCuyo, 5500, Mendoza, Argentina
- Medicine School, National University of Cuyo, 5500, Mendoza, Argentina
| | - Martin Eduardo Guerrero-Gimenez
- Laboratory of Data Science and Genomics, IMBECU CONICET UNCuyo, 5500, Mendoza, Argentina
- Medicine School, National University of Cuyo, 5500, Mendoza, Argentina
| | | | - María José Germanó
- Laboratory of Data Science and Genomics, IMBECU CONICET UNCuyo, 5500, Mendoza, Argentina
- Medicine School, National University of Cuyo, 5500, Mendoza, Argentina
| | - Felipe Carlos Martin Zoppino
- Laboratory of Data Science and Genomics, IMBECU CONICET UNCuyo, 5500, Mendoza, Argentina.
- Medicine School, National University of Cuyo, 5500, Mendoza, Argentina.
| |
Collapse
|
4
|
Malamon JS, Farrell JJ, Xia LC, Dombroski BA, Das RG, Way J, Kuzma AB, Valladares O, Leung YY, Scanlon AJ, Lopez IAB, Brehony J, Worley KC, Zhang NR, Wang LS, Farrer LA, Schellenberg GD, Lee WP, Vardarajan BN. A comparative study of structural variant calling in WGS from Alzheimer's disease families. Life Sci Alliance 2024; 7:e202302181. [PMID: 38418088 PMCID: PMC10902710 DOI: 10.26508/lsa.202302181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 02/07/2024] [Accepted: 02/08/2024] [Indexed: 03/01/2024] Open
Abstract
Detecting structural variants (SVs) in whole-genome sequencing poses significant challenges. We present a protocol for variant calling, merging, genotyping, sensitivity analysis, and laboratory validation for generating a high-quality SV call set in whole-genome sequencing from the Alzheimer's Disease Sequencing Project comprising 578 individuals from 111 families. Employing two complementary pipelines, Scalpel and Parliament, for SV/indel calling, we assessed sensitivity through sample replicates (N = 9) with in silico variant spike-ins. We developed a novel metric, D-score, to evaluate caller specificity for deletions. The accuracy of deletions was evaluated by Sanger sequencing. We generated a high-quality call set of 152,301 deletions of diverse sizes. Sanger sequencing validated 114 of 146 detected deletions (78.1%). Scalpel excelled in accuracy for deletions ≤100 bp, whereas Parliament was optimal for deletions >900 bp. Overall, 83.0% and 72.5% of calls by Scalpel and Parliament were validated, respectively, including all 11 deletions called by both Parliament and Scalpel between 101 and 900 bp. Our flexible protocol successfully generated a high-quality deletion call set and a truth set of Sanger sequencing-validated deletions with precise breakpoints spanning 1-17,000 bp.
Collapse
Affiliation(s)
- John S Malamon
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - John J Farrell
- Biomedical Genetics Section, Department of Medicine, Boston University School of Medicine, Boston University, Boston, MA, USA
| | - Li Charlie Xia
- https://ror.org/03mtd9a03 Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - Beth A Dombroski
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Rueben G Das
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Jessica Way
- Broad Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Amanda B Kuzma
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Otto Valladares
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Yuk Yee Leung
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Allison J Scanlon
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Irving Antonio Barrera Lopez
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Jack Brehony
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Kim C Worley
- https://ror.org/02pttbw34 Human Genome Sequencing Center, and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Nancy R Zhang
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - Li-San Wang
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Lindsay A Farrer
- Biomedical Genetics Section, Department of Medicine, Boston University School of Medicine, Boston University, Boston, MA, USA
- Departments of Neurology and Ophthalmology, Boston University School of Medicine, Boston University, Boston, MA, USA
- Departments of Epidemiology and Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Gerard D Schellenberg
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Wan-Ping Lee
- Department of Pathology and Laboratory Medicine, Penn Neurodegeneration Genomics Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Badri N Vardarajan
- https://ror.org/01esghr10 Gertrude H. Sergievsky Center and Taub Institute of Aging Brain, Department of Neurology, Columbia University Medical Center, New York, NY, USA
| |
Collapse
|
5
|
Rodrigues Alves Barbosa V, Maroilley T, Diao C, Colvin-James L, Perrier R, Tarailo-Graovac M. Single variant, yet "double trouble": TSC and KBG syndrome because of a large de novo inversion. Life Sci Alliance 2024; 7:e202302115. [PMID: 38253421 PMCID: PMC10803213 DOI: 10.26508/lsa.202302115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 01/12/2024] [Accepted: 01/16/2024] [Indexed: 01/24/2024] Open
Abstract
Despite the advances in high-throughput sequencing, many rare disease patients remain undiagnosed. In particular, the patients with well-defined clinical phenotypes and established clinical diagnosis, yet missing or partial genetic diagnosis, may hold a clue to more complex genetic mechanisms of a disease that could be missed by available clinical tests. Here, we report a patient with a clinical diagnosis of Tuberous sclerosis, combined with unusual secondary features, but negative clinical tests including TSC1 and TSC2 Short-read whole-genome sequencing combined with advanced bioinformatics analyses were successful in uncovering a de novo pericentric 87-Mb inversion with breakpoints in TSC2 and ANKRD11, which explains the TSC clinical diagnosis, and confirms a second underlying monogenic disorder, KBG syndrome. Our findings illustrate how complex variants, such as large inversions, may be missed by clinical tests and further highlight the importance of well-defined clinical diagnoses in uncovering complex molecular mechanisms of a disease, such as complex variants and "double trouble" effects.
Collapse
Affiliation(s)
- Victoria Rodrigues Alves Barbosa
- https://ror.org/03yjb2x39 Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Department of Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Tatiana Maroilley
- https://ror.org/03yjb2x39 Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Department of Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Catherine Diao
- https://ror.org/03yjb2x39 Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Department of Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Leslie Colvin-James
- https://ror.org/03yjb2x39 Department of Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Renee Perrier
- https://ror.org/03yjb2x39 Department of Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Maja Tarailo-Graovac
- https://ror.org/03yjb2x39 Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Department of Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada
- https://ror.org/03yjb2x39 Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| |
Collapse
|
6
|
Yeo NKW, Lim CK, Yaung KN, Khoo NKH, Arkachaisri T, Albani S, Yeo JG. Genetic interrogation for sequence and copy number variants in systemic lupus erythematosus. Front Genet 2024; 15:1341272. [PMID: 38501057 PMCID: PMC10944961 DOI: 10.3389/fgene.2024.1341272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 02/20/2024] [Indexed: 03/20/2024] Open
Abstract
Early-onset systemic lupus erythematosus presents with a more severe disease and is associated with a greater genetic burden, especially in patients from Black, Asian or Hispanic ancestries. Next-generation sequencing techniques, notably whole exome sequencing, have been extensively used in genomic interrogation studies to identify causal disease variants that are increasingly implicated in the development of autoimmunity. This Review discusses the known casual variants of polygenic and monogenic systemic lupus erythematosus and its implications under certain genetic disparities while suggesting an age-based sequencing strategy to aid in clinical diagnostics and patient management for improved patient care.
Collapse
Affiliation(s)
- Nicholas Kim-Wah Yeo
- Translational Immunology Institute, SingHealth Duke-NUS Academic Medical Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
| | - Che Kang Lim
- Duke-NUS Medical School, Singapore, Singapore
- Department of Clinical Translation Research, Singapore General Hospital, Singapore, Singapore
| | - Katherine Nay Yaung
- Translational Immunology Institute, SingHealth Duke-NUS Academic Medical Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
| | - Nicholas Kim Huat Khoo
- Translational Immunology Institute, SingHealth Duke-NUS Academic Medical Centre, Singapore, Singapore
| | - Thaschawee Arkachaisri
- Translational Immunology Institute, SingHealth Duke-NUS Academic Medical Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
- Rheumatology and Immunology Service, KK Women's and Children's Hospital, Singapore, Singapore
| | - Salvatore Albani
- Translational Immunology Institute, SingHealth Duke-NUS Academic Medical Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
- Rheumatology and Immunology Service, KK Women's and Children's Hospital, Singapore, Singapore
| | - Joo Guan Yeo
- Translational Immunology Institute, SingHealth Duke-NUS Academic Medical Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
- Rheumatology and Immunology Service, KK Women's and Children's Hospital, Singapore, Singapore
| |
Collapse
|
7
|
Poot M. Methods of Detection and Mechanisms of Origin of Complex Structural Genome Variations. Methods Mol Biol 2024; 2825:39-65. [PMID: 38913302 DOI: 10.1007/978-1-0716-3946-7_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/25/2024]
Abstract
Based on classical karyotyping, structural genome variations (SVs) have generally been considered to be either "simple" (with one or two breakpoints) or "complex" (with more than two breakpoints). Studying the breakpoints of SVs at nucleotide resolution revealed additional, subtle structural variations, such that even "simple" SVs turned out to be "complex." Genome-wide sequencing methods, such as fosmid and paired-end mapping, short-read and long-read whole genome sequencing, and single-molecule optical mapping, also indicated that the number of SVs per individual was considerably larger than expected from karyotyping and high-resolution chromosomal array-based studies. Interestingly, SVs were detected in studies of cohorts of individuals without clinical phenotypes. The common denominator of all SVs appears to be a failure to accurately repair DNA double-strand breaks (DSBs) or to halt cell cycle progression if DSBs persist. This review discusses the various DSB response mechanisms during the mitotic cell cycle and during meiosis and their regulation. Emphasis is given to the molecular mechanisms involved in the formation of translocations, deletions, duplications, and inversions during or shortly after meiosis I. Recently, CRISPR-Cas9 studies have provided unexpected insights into the formation of translocations and chromothripsis by both breakage-fusion-bridge and micronucleus-dependent mechanisms.
Collapse
Affiliation(s)
- Martin Poot
- Department of Human Genetics, University of Wuerzburg, Wuerzburg, Germany
| |
Collapse
|
8
|
Ye R, Wang A, Bu B, Luo P, Deng W, Zhang X, Yin S. Viral oncogenes, viruses, and cancer: a third-generation sequencing perspective on viral integration into the human genome. Front Oncol 2023; 13:1333812. [PMID: 38188304 PMCID: PMC10768168 DOI: 10.3389/fonc.2023.1333812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 12/06/2023] [Indexed: 01/09/2024] Open
Abstract
The link between viruses and cancer has intrigued scientists for decades. Certain viruses have been shown to be vital in the development of various cancers by integrating viral DNA into the host genome and activating viral oncogenes. These viruses include the Human Papillomavirus (HPV), Hepatitis B and C Viruses (HBV and HCV), Epstein-Barr Virus (EBV), and Human T-Cell Leukemia Virus (HTLV-1), which are all linked to the development of a myriad of human cancers. Third-generation sequencing technologies have revolutionized our ability to study viral integration events at unprecedented resolution in recent years. They offer long sequencing capabilities along with the ability to map viral integration sites, assess host gene expression, and track clonal evolution in cancer cells. Recently, researchers have been exploring the application of Oxford Nanopore Technologies (ONT) nanopore sequencing and Pacific BioSciences (PacBio) single-molecule real-time (SMRT) sequencing in cancer research. As viral integration is crucial to the development of cancer via viruses, third-generation sequencing would provide a novel approach to studying the relationship interlinking viral oncogenes, viruses, and cancer. This review article explores the molecular mechanisms underlying viral oncogenesis, the role of viruses in cancer development, and the impact of third-generation sequencing on our understanding of viral integration into the human genome.
Collapse
Affiliation(s)
- Ruichen Ye
- Department of Pathology, Albert Einstein College of Medicine, Bronx, NY, United States
- Einstein Pathology Single-cell & Bioinformatics Laboratory, Bronx, NY, United States
- Stony Brook University, Stony Brook, NY, United States
| | - Angelina Wang
- Tufts Friedman School of Nutrition, Boston, MA, United States
| | - Brady Bu
- Horace Mann School, Bronx, NY, United States
| | - Pengxiang Luo
- Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Wenjun Deng
- Clinical Proteomics Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, United States
| | - Xinyi Zhang
- Department of Respiratory Diseases, The Second Affiliated Hospital of Nanchang University, Nanchang, China
| | - Shanye Yin
- Department of Pathology, Albert Einstein College of Medicine, Bronx, NY, United States
- Einstein Pathology Single-cell & Bioinformatics Laboratory, Bronx, NY, United States
| |
Collapse
|
9
|
Louw N, Carstens N, Lombard Z. Incorporating CNV analysis improves the yield of exome sequencing for rare monogenic disorders-an important consideration for resource-constrained settings. Front Genet 2023; 14:1277784. [PMID: 38155715 PMCID: PMC10753787 DOI: 10.3389/fgene.2023.1277784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 11/22/2023] [Indexed: 12/30/2023] Open
Abstract
Exome sequencing (ES) is a recommended first-tier diagnostic test for many rare monogenic diseases. It allows for the detection of both single-nucleotide variants (SNVs) and copy number variants (CNVs) in coding exonic regions of the genome in a single test, and this dual analysis is a valuable approach, especially in limited resource settings. Single-nucleotide variants are well studied; however, the incorporation of copy number variant analysis tools into variant calling pipelines has not been implemented yet as a routine diagnostic test, and chromosomal microarray is still more widely used to detect copy number variants. Research shows that combined single and copy number variant analysis can lead to a diagnostic yield of up to 58%, increasing the yield with as much as 18% from the single-nucleotide variant only pipeline. Importantly, this is achieved with the consideration of computational costs only, without incurring any additional sequencing costs. This mini review provides an overview of copy number variant analysis from exome data and what the current recommendations are for this type of analysis. We also present an overview on rare monogenic disease research standard practices in resource-limited settings. We present evidence that integrating copy number variant detection tools into a standard exome sequencing analysis pipeline improves diagnostic yield and should be considered a significantly beneficial addition, with relatively low-cost implications. Routine implementation in underrepresented populations and limited resource settings will promote generation and sharing of CNV datasets and provide momentum to build core centers for this niche within genomic medicine.
Collapse
Affiliation(s)
- Nadja Louw
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Nadia Carstens
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Genomics Platform, South African Medical Research Council, Cape Town, South Africa
| | - Zané Lombard
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | | |
Collapse
|
10
|
Meng X, Wang M, Luo M, Sun L, Yan Q, Liu Y. Systematic evaluation of multiple NGS platforms for structural variants detection. J Biol Chem 2023; 299:105436. [PMID: 37944616 PMCID: PMC10724692 DOI: 10.1016/j.jbc.2023.105436] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 10/29/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023] Open
Abstract
Structural variations (SV) are critical genome changes affecting human diseases. Although many hybridization-based methods exist, evaluating SVs through next-generation sequencing (NGS) data is still necessary for broader research exploration. Here, we comprehensively compared the performance of 16 SV callers and multiple NGS platforms using NA12878 whole genome sequencing (WGS) datasets. The results indicated that several SV callers performed well relatively, such as Manta, GRIDSS, LUMPY, TARDIS, FermiKit, and Wham. Meanwhile, all NGS platforms have a similar performance using a single software. Additionally, we found that the source of undetected SVs was mostly from long reads datasets, therefore, the more appropriate strategy for accurate SV detection will be an integration of long and shorter reads in the future. At present, in the period of NGS as a mainstream method in bioinformatics, our study would provide helpful and comprehensive guidelines for specific categories of SV research.
Collapse
Affiliation(s)
- Xuan Meng
- School of Medicine, Southern University of Science and Technology, Shenzhen, China
| | - Miao Wang
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Mingjie Luo
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Lei Sun
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Qin Yan
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Yongfeng Liu
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China.
| |
Collapse
|
11
|
Kosuthova K, Solc R. Inversions on human chromosomes. Am J Med Genet A 2023; 191:672-683. [PMID: 36495134 DOI: 10.1002/ajmg.a.63063] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/13/2022] [Accepted: 11/17/2022] [Indexed: 12/14/2022]
Abstract
Human chromosome inversions are types of balanced structural variations, making them difficult to analyze. Thanks to PEM (paired-end sequencing and mapping), there has been tremendous progress in studying inversions. Inversions play an important role as an evolutionary factor, contributing to the formation of gonosomes, speciation of chimpanzees and humans, and inv17q21.3 or inv8p23.1 exhibit the features of natural selection. Both inversions have been related to pathogenic phenotype by directly affecting a gene structure (e.g., inv5p15.1q14.1), regulating gene expression (e.g., inv7q21.3q35) and by predisposing to other secondary arrangements (e.g., inv7q11.23). A polymorphism of human inversions is documented by the InvFEST database (a database that stores information about clinical predictions, validations, frequency of inversions, etc.), but only a small fraction of these inversions is validated, and a detailed analysis is complicated by the frequent location of breakpoints within regions of repetitive sequences.
Collapse
Affiliation(s)
- Klara Kosuthova
- Department of Anthropology and Human Genetics, Faculty of Science, Charles University, Prague, Czech Republic
| | - Roman Solc
- Department of Anthropology and Human Genetics, Faculty of Science, Charles University, Prague, Czech Republic
| |
Collapse
|
12
|
Fetit R, Barbato MI, Theil T, Pratt T, Price DJ. 16p11.2 deletion accelerates subpallial maturation and increases variability in human iPSC-derived ventral telencephalic organoids. Development 2023; 150:dev201227. [PMID: 36826401 PMCID: PMC10110424 DOI: 10.1242/dev.201227] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 01/19/2023] [Indexed: 02/25/2023]
Abstract
Inhibitory interneurons regulate cortical circuit activity, and their dysfunction has been implicated in autism spectrum disorder (ASD). 16p11.2 microdeletions are genetically linked to 1% of ASD cases. However, few studies investigate the effects of this microdeletion on interneuron development. Using ventral telencephalic organoids derived from human induced pluripotent stem cells, we have investigated the effect of this microdeletion on organoid size, progenitor proliferation and organisation into neural rosettes, ganglionic eminence marker expression at early developmental timepoints, and expression of the neuronal marker NEUN at later stages. At early stages, deletion organoids exhibited greater variations in size with concomitant increases in relative neural rosette area and the expression of the ventral telencephalic marker COUPTFII, with increased variability in these properties. Cell cycle analysis revealed an increase in total cell cycle length caused primarily by an elongated G1 phase, the duration of which also varied more than normal. At later stages, deletion organoids increased their NEUN expression. We propose that 16p11.2 microdeletions increase developmental variability and may contribute to ASD aetiology by lengthening the cell cycle of ventral progenitors, promoting premature differentiation into interneurons.
Collapse
Affiliation(s)
- Rana Fetit
- Simons Initiative for the Developing Brain, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
- Centre for Discovery Brain Sciences, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
| | - Michela Ilaria Barbato
- Simons Initiative for the Developing Brain, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
- Centre for Discovery Brain Sciences, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
| | - Thomas Theil
- Simons Initiative for the Developing Brain, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
- Centre for Discovery Brain Sciences, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
| | - Thomas Pratt
- Simons Initiative for the Developing Brain, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
- Centre for Discovery Brain Sciences, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
| | - David J. Price
- Simons Initiative for the Developing Brain, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
- Centre for Discovery Brain Sciences, Hugh Robson Building, Edinburgh Medical School Biomedical Sciences, The University of Edinburgh, Edinburgh EH8 9XD, UK
| |
Collapse
|
13
|
Characterization of the immunoglobulin lambda chain locus from diverse populations reveals extensive genetic variation. Genes Immun 2023; 24:21-31. [PMID: 36539592 PMCID: PMC10041605 DOI: 10.1038/s41435-022-00188-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 10/07/2022] [Accepted: 10/13/2022] [Indexed: 12/24/2022]
Abstract
Immunoglobulins (IGs), crucial components of the adaptive immune system, are encoded by three genomic loci. However, the complexity of the IG loci severely limits the effective use of short read sequencing, limiting our knowledge of population diversity in these loci. We leveraged existing long read whole-genome sequencing (WGS) data, fosmid technology, and IG targeted single-molecule, real-time (SMRT) long-read sequencing (IG-Cap) to create haplotype-resolved assemblies of the IG Lambda (IGL) locus from 6 ethnically diverse individuals. In addition, we generated 10 diploid assemblies of IGL from a diverse cohort of individuals utilizing IG-Cap. From these 16 individuals, we identified significant allelic diversity, including 36 novel IGLV alleles. In addition, we observed highly elevated single nucleotide variation (SNV) in IGLV genes relative to IGL intergenic and genomic background SNV density. By comparing SNV calls between our high quality assemblies and existing short read datasets from the same individuals, we show a high propensity for false-positives in the short read datasets. Finally, for the first time, we nucleotide-resolved common 5-10 Kb duplications in the IGLC region that contain functional IGLJ and IGLC genes. Together these data represent a significant advancement in our understanding of genetic variation and population diversity in the IGL locus.
Collapse
|
14
|
Giovenale AMG, Ruotolo G, Soriano AA, Turco EM, Rotundo G, Casamassa A, D’Anzi A, Vescovi AL, Rosati J. Deepening the understanding of CNVs on chromosome 15q11-13 by using hiPSCs: An overview. Front Cell Dev Biol 2023; 10:1107881. [PMID: 36684422 PMCID: PMC9852989 DOI: 10.3389/fcell.2022.1107881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 12/16/2022] [Indexed: 01/09/2023] Open
Abstract
The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is widely expressed in the central and peripheral nervous systems. This receptor is implicated in both brain development and adult neurogenesis thanks to its ability to mediate acetylcholine stimulus (Ach). Copy number variations (CNVs) of CHRNA7 gene have been identified in humans and are genetically linked to cognitive impairments associated with multiple disorders, including schizophrenia, bipolar disorder, epilepsy, Alzheimer's disease, and others. Currently, α7 receptor analysis has been commonly performed in animal models due to the impossibility of direct investigation of the living human brain. But the use of model systems has shown that there are very large differences between humans and mice when researchers must study the CNVs and, in particular, the CNV of chromosome 15q13.3 where the CHRNA7 gene is present. In fact, human beings present genomic alterations as well as the presence of genes of recent origin that are not present in other model systems as well as they show a very heterogeneous symptomatology that is associated with both their genetic background and the environment where they live. To date, the induced pluripotent stem cells, obtained from patients carrying CNV in CHRNA7 gene, are a good in vitro model for studying the association of the α7 receptor to human diseases. In this review, we will outline the current state of hiPSCs technology applications in neurological diseases caused by CNVs in CHRNA7 gene. Furthermore, we will discuss some weaknesses that emerge from the overall analysis of the published articles.
Collapse
Affiliation(s)
- Angela Maria Giada Giovenale
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
| | - Giorgia Ruotolo
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
| | - Amata Amy Soriano
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy
| | - Elisa Maria Turco
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy
| | - Giovannina Rotundo
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy
| | - Alessia Casamassa
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy
| | - Angela D’Anzi
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy
| | - Angelo Luigi Vescovi
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy,*Correspondence: Jessica Rosati, ; Angelo Luigi Vescovi,
| | - Jessica Rosati
- Cellular Reprogramming Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy,*Correspondence: Jessica Rosati, ; Angelo Luigi Vescovi,
| |
Collapse
|
15
|
Remnants of SIRE1 retrotransposons in human genome? J Genet 2022. [DOI: 10.1007/s12041-022-01398-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
16
|
Abstract
In cancer, complex genome rearrangements and other structural alterations, including the amplification of oncogenes on circular extrachromosomal DNA (ecDNA) elements, drive the formation and progression of tumors. ecDNA is a particularly challenging structural alteration. By untethering oncogenes from chromosomal constraints, it elevates oncogene copy number, drives intratumoral genetic heterogeneity, promotes rapid tumor evolution, and results in treatment resistance. The profound changes in DNA shape and nuclear architecture generated by ecDNA alter the transcriptional landscape of tumors by catalyzing new types of regulatory interactions that do not occur on chromosomes. The current suite of tools for interrogating cancer genomes is well suited for deciphering sequence but has limited ability to resolve the complex changes in DNA structure and dynamics that ecDNA generates. Here, we review the challenges of resolving ecDNA form and function and discuss the emerging tool kit for deciphering ecDNA architecture and spatial organization, including what has been learned to date about how this dramatic change in shape alters tumor development, progression, and drug resistance.
Collapse
Affiliation(s)
- Vineet Bafna
- Department of Computer Science and Engineering and Halıcıoğlu Data Science Institute, University of California, San Diego, La Jolla, California, USA;
| | - Paul S Mischel
- Department of Pathology and ChEM-H, Stanford University School of Medicine, Stanford, California, USA;
| |
Collapse
|
17
|
Rooney K, Sadikovic B. DNA Methylation Episignatures in Neurodevelopmental Disorders Associated with Large Structural Copy Number Variants: Clinical Implications. Int J Mol Sci 2022; 23:ijms23147862. [PMID: 35887210 PMCID: PMC9324454 DOI: 10.3390/ijms23147862] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 07/11/2022] [Accepted: 07/14/2022] [Indexed: 02/06/2023] Open
Abstract
Large structural chromosomal deletions and duplications, referred to as copy number variants (CNVs), play a role in the pathogenesis of neurodevelopmental disorders (NDDs) through effects on gene dosage. This review focuses on our current understanding of genomic disorders that arise from large structural chromosome rearrangements in patients with NDDs, as well as difficulties in overlap of clinical presentation and molecular diagnosis. We discuss the implications of epigenetics, specifically DNA methylation (DNAm), in NDDs and genomic disorders, and consider the implications and clinical impact of copy number and genomic DNAm testing in patients with suspected genetic NDDs. We summarize evidence of global methylation episignatures in CNV-associated disorders that can be used in the diagnostic pathway and may provide insights into the molecular pathogenesis of genomic disorders. Finally, we discuss the potential for combining CNV and DNAm assessment into a single diagnostic assay.
Collapse
Affiliation(s)
- Kathleen Rooney
- Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada;
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada
| | - Bekim Sadikovic
- Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada;
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada
- Correspondence: ; Tel.: +1-519-685-8500 (ext. 53074)
| |
Collapse
|
18
|
Otto M, Zheng Y, Wiehe T. Recombination, selection and the evolution of tandem gene arrays. Genetics 2022; 221:6572811. [PMID: 35460227 PMCID: PMC9252282 DOI: 10.1093/genetics/iyac052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 03/17/2022] [Indexed: 11/16/2022] Open
Abstract
Multigene families—immunity genes or sensory receptors, for instance—are often subject to diversifying selection. Allelic diversity may be favored not only through balancing or frequency-dependent selection at individual loci but also by associating different alleles in multicopy gene families. Using a combination of analytical calculations and simulations, we explored a population genetic model of epistatic selection and unequal recombination, where a trade-off exists between the benefit of allelic diversity and the cost of copy abundance. Starting from the neutral case, where we showed that gene copy number is Gamma distributed at equilibrium, we derived also the mean and shape of the limiting distribution under selection. Considering a more general model, which includes variable population size and population substructure, we explored by simulations mean fitness and some summary statistics of the copy number distribution. We determined the relative effects of selection, recombination, and demographic parameters in maintaining allelic diversity and shaping the mean fitness of a population. One way to control the variance of copy number is by lowering the rate of unequal recombination. Indeed, when encoding recombination by a rate modifier locus, we observe exactly this prediction. Finally, we analyzed the empirical copy number distribution of 3 genes in human and estimated recombination and selection parameters of our model.
Collapse
Affiliation(s)
- Moritz Otto
- Institut für Genetik, Universität zu Köln, Zülpicher Straße 47a, 50674 Köln, Germany
| | - Yichen Zheng
- Institut für Genetik, Universität zu Köln, Zülpicher Straße 47a, 50674 Köln, Germany
| | - Thomas Wiehe
- Institut für Genetik, Universität zu Köln, Zülpicher Straße 47a, 50674 Köln, Germany
| |
Collapse
|
19
|
Hu T, Li J, Long M, Wu J, Zhang Z, Xie F, Zhao J, Yang H, Song Q, Lian S, Shi J, Guo X, Yuan D, Lang D, Yu G, Liang B, Zhou X, Ishibashi T, Fan X, Yu W, Wang D, Wang Y, Peng IF, Wang S. Detection of Structural Variations and Fusion Genes in Breast Cancer Samples Using Third-Generation Sequencing. Front Cell Dev Biol 2022; 10:854640. [PMID: 35493102 PMCID: PMC9043247 DOI: 10.3389/fcell.2022.854640] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 03/23/2022] [Indexed: 11/16/2022] Open
Abstract
Background: Structural variations (SVs) are common genetic alterations in the human genome that could cause different phenotypes and diseases, including cancer. However, the detection of structural variations using the second-generation sequencing was limited by its short read length, which restrained our understanding of structural variations. Methods: In this study, we developed a 28-gene panel for long-read sequencing and employed it to Oxford Nanopore Technologies and Pacific Biosciences platforms. We analyzed structural variations in the 28 breast cancer-related genes through long-read genomic and transcriptomic sequencing of tumor, para-tumor, and blood samples in 19 breast cancer patients. Results: Our results showed that some somatic SVs were recurring among the selected genes, though the majority of them occurred in the non-exonic region. We found evidence supporting the existence of hotspot regions for SVs, which extended our previous understanding that they exist only for single nucleotide variations. Conclusion: In conclusion, we employed long-read genomic and transcriptomic sequencing to identify SVs from breast cancer patients and proved that this approach holds great potential in clinical application.
Collapse
Affiliation(s)
- Taobo Hu
- Department of Breast Surgery, Peking University People’s Hospital, Beijing, China
| | - Jingjing Li
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
- GrandOmics Inc., Beijing, China
| | - Mengping Long
- Department of Pathology, Peking University Cancer Hospital, Beijing, China
| | - Jinbo Wu
- Department of Breast Surgery, Peking University People’s Hospital, Beijing, China
| | - Zhen Zhang
- Department of Statistics, The Chinese University of Hong Kong, Sha Tin, China
| | - Fei Xie
- Department of Breast Surgery, Peking University People’s Hospital, Beijing, China
| | - Jin Zhao
- Department of Breast Surgery, Peking University People’s Hospital, Beijing, China
| | - Houpu Yang
- Department of Breast Surgery, Peking University People’s Hospital, Beijing, China
| | - Qianqian Song
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Sheng Lian
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong SAR, China
| | - Jiandong Shi
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong SAR, China
| | | | | | | | | | - Baosheng Liang
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Xiaohua Zhou
- Department of Biostatistics, School of Public Health, Peking University, Beijing, China
| | - Toyotaka Ishibashi
- Division of Life Science, Hong Kong University of Science and Technology, Kowloon, Hong Kong SAR, China
| | - Xiaodan Fan
- Department of Statistics, The Chinese University of Hong Kong, Sha Tin, China
| | - Weichuan Yu
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong SAR, China
| | | | - Yang Wang
- GrandOmics Inc., Beijing, China
- *Correspondence: Yang Wang, ; I-Feng Peng, ; Shu Wang,
| | - I-Feng Peng
- GrandOmics Inc., Beijing, China
- *Correspondence: Yang Wang, ; I-Feng Peng, ; Shu Wang,
| | - Shu Wang
- Department of Breast Surgery, Peking University People’s Hospital, Beijing, China
- *Correspondence: Yang Wang, ; I-Feng Peng, ; Shu Wang,
| |
Collapse
|
20
|
Giles Doran C, Pennington SR. Copy number alteration signatures as biomarkers in cancer: a review. Biomark Med 2022; 16:371-386. [PMID: 35195030 DOI: 10.2217/bmm-2021-0476] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Within certain cancers, extensive copy number alterations (CNAs) contribute to a complex and heterogenic genomic profile. This makes it difficult to understand and unravel the distinct molecular dynamics shaping the disease while preventing clinically effective patient stratification. CNA signature analysis represents a novel genomic stratification tool for probing this complexity, offering an intricate framework for deriving CNA patterns at the molecular level. This allows the underlying genomic mechanisms of specific cancers to be revealed, leading to the potential identification of therapeutic targets and prognostic associations. This review outlines the molecular and methodological basis of CNA signatures and focuses on recent advances highlighting their clinical utility, limitations and prospective future as novel diagnostic and prognostic cancer biomarkers.
Collapse
Affiliation(s)
- Conor Giles Doran
- UCD Conway Institute, School of Medicine, University College Dublin, Belfield, Dublin 4, Ireland
| | - Stephen R Pennington
- UCD Conway Institute, School of Medicine, University College Dublin, Belfield, Dublin 4, Ireland
| |
Collapse
|
21
|
Frequency and clinical significance of chromosomal inversions prenatally diagnosed by second trimester amniocentesis. Sci Rep 2022; 12:2215. [PMID: 35140290 PMCID: PMC8828714 DOI: 10.1038/s41598-022-06024-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Accepted: 01/07/2022] [Indexed: 11/09/2022] Open
Abstract
To compare the frequency and clinical significance of familial and de novo chromosomal inversions during prenatal diagnosis. This was a retrospective study of inversions diagnosed prenatally in an Asian population by applying conventional GTG-banding to amniocyte cultures. Data from 2005 to 2019 were extracted from a single-center laboratory database. The types, frequencies, and inheritance patterns of multiple inversions were analyzed. Pericentric variant inversions of chromosome 9 or Y were excluded. In total, 56 (0.27%) fetuses with inversions were identified in the 15-year database of 21,120 confirmative diagnostic procedures. Pericentric and paracentric inversions accounted for 62.5% (35/56) and 37.5% of the inversions, respectively. Familial inversions accounted for nearly 90% of cases, and de novo mutation was identified in two pericentric and two paracentric cases. Inversions were most frequently identified on chromosomes 1 and 2 (16.1% of all inversions), followed by chromosomes 6, 7, and 10 (8.9% of all cases). The indications for invasive testing were as follows: advanced maternal age (67.3%), abnormal ultrasound findings (2.1%), abnormal serum aneuploidy screening (20.4%), and other indications (10.2%). The mode of inheritance was available for 67.9% of cases (38/56), with 89.5% of inversions being inherited (34/38). A slight preponderance of inheritance in female fetuses was observed. Three patients with inherited inversions opted for termination (two had severe central nervous system lesions and one had thalassemia major). Gestation continued for 53 fetuses, who exhibited no structural defects at birth or significant developmental problems a year after birth. Our study indicates that approximately 90% of prenatally diagnosed inversions involve familial inheritance, are spreading, and behave like founder effect mutations in this isolated population on an island. This finding can help to alleviate anxiety during prenatal counseling, which further underscores the importance of parental chromosomal analysis, further genetic studies, and appropriate counseling in cases where a nonfamilial inversion is diagnosed.
Collapse
|
22
|
Nam H, Lee IH, Sa JK, Kim SS, Pyeon HJ, Lee KH, Lee K, Lee SH, Joo KM. Effects of Long-Term In Vitro Expansion on Genetic Stability and Tumor Formation Capacity of Stem Cells. Stem Cell Rev Rep 2021; 18:241-257. [PMID: 34738209 DOI: 10.1007/s12015-021-10290-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/18/2021] [Indexed: 12/30/2022]
Abstract
Stem cell therapeutics are emerging as novel alternative treatments for various neurodegenerative diseases based on their regenerative potentials. However, stem cell transplantation might have side effects such as tumor formation that limit their clinical applications. Especially, in vitro expansion of stem cells might provoke genetic instability and tumorigenic potential. To address this issue, we analyzed genomic alterations of adult human multipotent neural cells (ahMNCs), a type of human adult neural stem cells, after a long-term in vitro culture process (passage 15) using sensitive analysis techniques including karyotyping, array comparative genomic hybridization (aCGH), and whole exome sequencing (WES). Although karyotyping did not find any major abnormalities in chromosomal number or structure, diverse copy number variations (CNVs) and genetic mutations were detected by aCGH and WES in all five independent ahMNCs. However, the number of CNVs and genetic mutations did not increase and many of them did not persist as in vitro culture progressed. Although most observed CNVs and genetic mutations were not shared by all five ahMNCs, nonsynonymous missense mutations at MUC4 were found in three out of five long-term cultured ahMNC lines. The genetic instability did not confer in vivo tumorigenic potential to ahMNCs. Collectively, these results indicate that, although genetic instability can be induced by long-term in vitro expansion of stem cells, it is not sufficient to fully exert tumor formation capacity of stem cells. Other functional effects of such genetic instability need to be further elucidated.
Collapse
Affiliation(s)
- Hyun Nam
- Department of Anatomy & Cell Biology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Suwon, Gyeonggi-do, 16419, South Korea.,Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea.,Stem Cell and Regenerative Medicine Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, 06351, South Korea
| | - In-Hee Lee
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, 02115, USA
| | - Jason K Sa
- Department of Biomedical Sciences, Korea University College of Medicine, Seoul, South Korea
| | - Sung Soo Kim
- Department of Anatomy & Cell Biology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Suwon, Gyeonggi-do, 16419, South Korea.,Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea.,Stem Cell and Regenerative Medicine Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, 06351, South Korea
| | - Hee-Jang Pyeon
- Department of Anatomy & Cell Biology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Suwon, Gyeonggi-do, 16419, South Korea.,Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea.,Stem Cell and Regenerative Medicine Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, 06351, South Korea
| | - Kee Hang Lee
- Department of Anatomy & Cell Biology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Suwon, Gyeonggi-do, 16419, South Korea.,Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea.,Stem Cell and Regenerative Medicine Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, 06351, South Korea
| | - Kyunghoon Lee
- Department of Anatomy & Cell Biology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Suwon, Gyeonggi-do, 16419, South Korea.,Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea.,Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul, 06351, South Korea
| | - Sun-Ho Lee
- Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea. .,Stem Cell and Regenerative Medicine Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, 06351, South Korea. .,Biomedical Institute for Convergence at Sungkyunkwan University (BICS), Sungkyunkwan University, Suwon, 16419, South Korea. .,Department of Neurosurgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, South Korea.
| | - Kyeung Min Joo
- Department of Anatomy & Cell Biology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Suwon, Gyeonggi-do, 16419, South Korea. .,Single Cell Network Research Center, Sungkyunkwan University School of Medicine, Suwon, 16419, South Korea. .,Stem Cell and Regenerative Medicine Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, 06351, South Korea. .,Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul, 06351, South Korea. .,Department of Neurosurgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, South Korea.
| |
Collapse
|
23
|
Glessner JT, Hou X, Zhong C, Zhang J, Khan M, Brand F, Krawitz P, Sleiman PMA, Hakonarson H, Wei Z. DeepCNV: a deep learning approach for authenticating copy number variations. Brief Bioinform 2021; 22:bbaa381. [PMID: 33429424 PMCID: PMC8681111 DOI: 10.1093/bib/bbaa381] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 11/24/2020] [Accepted: 11/26/2020] [Indexed: 12/14/2022] Open
Abstract
Copy number variations (CNVs) are an important class of variations contributing to the pathogenesis of many disease phenotypes. Detecting CNVs from genomic data remains difficult, and the most currently applied methods suffer from an unacceptably high false positive rate. A common practice is to have human experts manually review original CNV calls for filtering false positives before further downstream analysis or experimental validation. Here, we propose DeepCNV, a deep learning-based tool, intended to replace human experts when validating CNV calls, focusing on the calls made by one of the most accurate CNV callers, PennCNV. The sophistication of the deep neural network algorithm is enriched with over 10 000 expert-scored samples that are split into training and testing sets. Variant confidence, especially for CNVs, is a main roadblock impeding the progress of linking CNVs with the disease. We show that DeepCNV adds to the confidence of the CNV calls with an optimal area under the receiver operating characteristic curve of 0.909, exceeding other machine learning methods. The superiority of DeepCNV was also benchmarked and confirmed using an experimental wet-lab validation dataset. We conclude that the improvement obtained by DeepCNV results in significantly fewer false positive results and failures to replicate the CNV association results.
Collapse
Affiliation(s)
- Joseph T Glessner
- Center for Applied Genomics, Department of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Perelman School of Medicine, Department of Pediatrics, University of Pennsylvania, Philadelphia, PA 19102, USA
| | - Xiurui Hou
- Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA
| | - Cheng Zhong
- Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA
| | | | - Munir Khan
- Center for Applied Genomics, Department of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Perelman School of Medicine, Department of Pediatrics, University of Pennsylvania, Philadelphia, PA 19102, USA
| | | | | | - Patrick M A Sleiman
- Perelman School of Medicine, Department of Pediatrics, University of Pennsylvania, Philadelphia, PA 19102, USA
| | - Hakon Hakonarson
- Perelman School of Medicine, Department of Pediatrics, University of Pennsylvania, Philadelphia, PA 19102, USA
| | - Zhi Wei
- Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA
| |
Collapse
|
24
|
Mostovoy Y, Yilmaz F, Chow SK, Chu C, Lin C, Geiger EA, Meeks NJL, Chatfield KC, Coughlin CR, Surti U, Kwok PY, Shaikh TH. Genomic regions associated with microdeletion/microduplication syndromes exhibit extreme diversity of structural variation. Genetics 2021; 217:6066166. [PMID: 33724415 DOI: 10.1093/genetics/iyaa038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2020] [Accepted: 12/18/2020] [Indexed: 11/12/2022] Open
Abstract
Segmental duplications (SDs) are a class of long, repetitive DNA elements whose paralogs share a high level of sequence similarity with each other. SDs mediate chromosomal rearrangements that lead to structural variation in the general population as well as genomic disorders associated with multiple congenital anomalies, including the 7q11.23 (Williams-Beuren Syndrome, WBS), 15q13.3, and 16p12.2 microdeletion syndromes. Population-level characterization of SDs has generally been lacking because most techniques used for analyzing these complex regions are both labor and cost intensive. In this study, we have used a high-throughput technique to genotype complex structural variation with a single molecule, long-range optical mapping approach. We characterized SDs and identified novel structural variants (SVs) at 7q11.23, 15q13.3, and 16p12.2 using optical mapping data from 154 phenotypically normal individuals from 26 populations comprising five super-populations. We detected several novel SVs for each locus, some of which had significantly different prevalence between populations. Additionally, we localized the microdeletion breakpoints to specific paralogous duplicons located within complex SDs in two patients with WBS, one patient with 15q13.3, and one patient with 16p12.2 microdeletion syndromes. The population-level data presented here highlights the extreme diversity of large and complex SVs within SD-containing regions. The approach we outline will greatly facilitate the investigation of the role of inter-SD structural variation as a driver of chromosomal rearrangements and genomic disorders.
Collapse
Affiliation(s)
- Yulia Mostovoy
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Feyza Yilmaz
- Department of Integrative Biology, University of Colorado Denver, Denver, CO 80204, USA.,Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Stephen K Chow
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Catherine Chu
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Chin Lin
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Elizabeth A Geiger
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Naomi J L Meeks
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Kathryn C Chatfield
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA.,Department of Pediatrics, Section of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Curtis R Coughlin
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Urvashi Surti
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA.,Department of Dermatology, UCSF School of Medicine, San Francisco, CA 94143, USA.,Institute for Human Genetics, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Tamim H Shaikh
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| |
Collapse
|
25
|
Huang Y, Huang W, Meng Z, Braz GT, Li Y, Wang K, Wang H, Lai J, Jiang J, Dong Z, Jin W. Megabase-scale presence-absence variation with Tripsacum origin was under selection during maize domestication and adaptation. Genome Biol 2021; 22:237. [PMID: 34416918 PMCID: PMC8377971 DOI: 10.1186/s13059-021-02448-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 08/02/2021] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Structural variants (SVs) significantly drive genome diversity and environmental adaptation for diverse species. Unlike the prevalent small SVs (< kilobase-scale) in higher eukaryotes, large-size SVs rarely exist in the genome, but they function as one of the key evolutionary forces for speciation and adaptation. RESULTS In this study, we discover and characterize several megabase-scale presence-absence variations (PAVs) in the maize genome. Surprisingly, we identify a 3.2 Mb PAV fragment that shows high integrity and is present as complete presence or absence in the natural diversity panel. This PAV is embedded within the nucleolus organizer region (NOR), where the suppressed recombination is found to maintain the PAV against the evolutionary variation. Interestingly, by analyzing the sequence of this PAV, we not only reveal the domestication trace from teosinte to modern maize, but also the footprints of its origin from Tripsacum, shedding light on a previously unknown contribution from Tripsacum to the speciation of Zea species. The functional consequence of the Tripsacum segment migration is also investigated, and environmental fitness conferred by the PAV may explain the whole segment as a selection target during maize domestication and improvement. CONCLUSIONS These findings provide a novel perspective that Tripsacum contributes to Zea speciation, and also instantiate a strategy for evolutionary and functional analysis of the "fossil" structure variations during genome evolution and speciation.
Collapse
Affiliation(s)
- Yumin Huang
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Wei Huang
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Zhuang Meng
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps (MOE), Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
| | - Guilherme Tomaz Braz
- Department of Plant Biology, Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Yunfei Li
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Kai Wang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps (MOE), Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
| | - Hai Wang
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Jinsheng Lai
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Jiming Jiang
- Department of Plant Biology, Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Zhaobin Dong
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China.
| | - Weiwei Jin
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
26
|
Trost B, Loureiro LO, Scherer SW. Discovery of genomic variation across a generation. Hum Mol Genet 2021; 30:R174-R186. [PMID: 34296264 PMCID: PMC8490016 DOI: 10.1093/hmg/ddab209] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 07/09/2021] [Accepted: 07/19/2021] [Indexed: 11/12/2022] Open
Abstract
Over the past 30 years (the timespan of a generation), advances in genomics technologies have revealed tremendous and unexpected variation in the human genome and have provided increasingly accurate answers to long-standing questions of how much genetic variation exists in human populations and to what degree the DNA complement changes between parents and offspring. Tracking the characteristics of these inherited and spontaneous (or de novo) variations has been the basis of the study of human genetic disease. From genome-wide microarray and next-generation sequencing scans, we now know that each human genome contains over 3 million single nucleotide variants when compared with the ~ 3 billion base pairs in the human reference genome, along with roughly an order of magnitude more DNA—approximately 30 megabase pairs (Mb)—being ‘structurally variable’, mostly in the form of indels and copy number changes. Additional large-scale variations include balanced inversions (average of 18 Mb) and complex, difficult-to-resolve alterations. Collectively, ~1% of an individual’s genome will differ from the human reference sequence. When comparing across a generation, fewer than 100 new genetic variants are typically detected in the euchromatic portion of a child’s genome. Driven by increasingly higher-resolution and higher-throughput sequencing technologies, newer and more accurate databases of genetic variation (for instance, more comprehensive structural variation data and phasing of combinations of variants along chromosomes) of worldwide populations will emerge to underpin the next era of discovery in human molecular genetics.
Collapse
Affiliation(s)
- Brett Trost
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Livia O Loureiro
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Stephen W Scherer
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada.,McLaughlin Centre and Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| |
Collapse
|
27
|
Fetit R, Hillary RF, Price DJ, Lawrie SM. The neuropathology of autism: A systematic review of post-mortem studies of autism and related disorders. Neurosci Biobehav Rev 2021; 129:35-62. [PMID: 34273379 DOI: 10.1016/j.neubiorev.2021.07.014] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/13/2021] [Accepted: 07/10/2021] [Indexed: 02/07/2023]
Abstract
Post-mortem studies allow for the direct investigation of brain tissue in those with autism and related disorders. Several review articles have focused on aspects of post-mortem abnormalities but none has brought together the entire post-mortem literature. Here, we systematically review the evidence from post-mortem studies of autism, and of related disorders that present with autistic features. The literature consists of a small body of studies with small sample sizes, but several remarkably consistent findings are evident. Cortical layering is largely undisturbed, but there are consistent reductions in minicolumn numbers and aberrant myelination. Transcriptomics repeatedly implicate abberant synaptic, metabolic, proliferation, apoptosis and immune pathways. Sufficient replicated evidence is available to implicate non-coding RNA, aberrant epigenetic profiles, GABAergic, glutamatergic and glial dysfunction in autism pathogenesis. Overall, the cerebellum and frontal cortex are most consistently implicated, sometimes revealing distinct region-specific alterations. The literature on related disorders such as Rett syndrome, Fragile X and copy number variations (CNVs) predisposing to autism is particularly small and inconclusive. Larger studies, matched for gender, developmental stage, co-morbidities and drug treatment are required.
Collapse
Affiliation(s)
- Rana Fetit
- Simons Initiative for the Developing Brain, University of Edinburgh, Hugh Robson Building, George Square, Edinburgh, EH8 9XD, UK.
| | - Robert F Hillary
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - David J Price
- Simons Initiative for the Developing Brain, University of Edinburgh, Hugh Robson Building, George Square, Edinburgh, EH8 9XD, UK
| | - Stephen M Lawrie
- Division of Psychiatry, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, EH10 5HF, UK; Patrick Wild Centre, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, EH10 5HF, UK
| |
Collapse
|
28
|
Li M, Yin F, Song L, Mao X, Li F, Fan C, Zuo X, Xia Q. Nucleic Acid Tests for Clinical Translation. Chem Rev 2021; 121:10469-10558. [PMID: 34254782 DOI: 10.1021/acs.chemrev.1c00241] [Citation(s) in RCA: 78] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Nucleic acids, including deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), are natural biopolymers composed of nucleotides that store, transmit, and express genetic information. Overexpressed or underexpressed as well as mutated nucleic acids have been implicated in many diseases. Therefore, nucleic acid tests (NATs) are extremely important. Inspired by intracellular DNA replication and RNA transcription, in vitro NATs have been extensively developed to improve the detection specificity, sensitivity, and simplicity. The principles of NATs can be in general classified into three categories: nucleic acid hybridization, thermal-cycle or isothermal amplification, and signal amplification. Driven by pressing needs in clinical diagnosis and prevention of infectious diseases, NATs have evolved to be a rapidly advancing field. During the past ten years, an explosive increase of research interest in both basic research and clinical translation has been witnessed. In this review, we aim to provide comprehensive coverage of the progress to analyze nucleic acids, use nucleic acids as recognition probes, construct detection devices based on nucleic acids, and utilize nucleic acids in clinical diagnosis and other important fields. We also discuss the new frontiers in the field and the challenges to be addressed.
Collapse
Affiliation(s)
- Min Li
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Fangfei Yin
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Lu Song
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.,Division of Physical Biology, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, Shanghai 201800, China
| | - Xiuhai Mao
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Fan Li
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Chunhai Fan
- School of Chemistry and Chemical Engineering, Frontiers Science Center for Transformative Molecules and National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xiaolei Zuo
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.,School of Chemistry and Chemical Engineering, Frontiers Science Center for Transformative Molecules and National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Qiang Xia
- Institute of Molecular Medicine, Department of Liver Surgery, Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| |
Collapse
|
29
|
Meshcheryakova A, Pietschmann P, Zimmermann P, Rogozin IB, Mechtcheriakova D. AID and APOBECs as Multifaceted Intrinsic Virus-Restricting Factors: Emerging Concepts in the Light of COVID-19. Front Immunol 2021; 12:690416. [PMID: 34276680 PMCID: PMC8282206 DOI: 10.3389/fimmu.2021.690416] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 06/07/2021] [Indexed: 12/23/2022] Open
Abstract
The AID (activation-induced cytidine deaminase)/APOBEC (apolipoprotein B mRNA editing enzyme catalytic subunit) family with its multifaceted mode of action emerges as potent intrinsic host antiviral system that acts against a variety of DNA and RNA viruses including coronaviruses. All family members are cytosine-to-uracil deaminases that either have a profound role in driving a strong and specific humoral immune response (AID) or restricting the virus itself by a plethora of mechanisms (APOBECs). In this article, we highlight some of the key aspects apparently linking the AID/APOBECs and SARS-CoV-2. Among those is our discovery that APOBEC4 shows high expression in cell types and anatomical parts targeted by SARS-CoV-2. Additional focus is given by us to the lymphoid structures and AID as the master regulator of germinal center reactions, which result in antibody production by plasma and memory B cells. We propose the dissection of the AID/APOBECs gene signature towards decisive determinants of the patient-specific and/or the patient group-specific antiviral response. Finally, the patient-specific mapping of the AID/APOBEC polymorphisms should be considered in the light of COVID-19.
Collapse
Affiliation(s)
- Anastasia Meshcheryakova
- Department of Pathophysiology and Allergy Research, Center of Pathophysiology, Infectiology and Immunology, Medical University of Vienna, Vienna, Austria
| | - Peter Pietschmann
- Department of Pathophysiology and Allergy Research, Center of Pathophysiology, Infectiology and Immunology, Medical University of Vienna, Vienna, Austria
| | | | - Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Diana Mechtcheriakova
- Department of Pathophysiology and Allergy Research, Center of Pathophysiology, Infectiology and Immunology, Medical University of Vienna, Vienna, Austria
| |
Collapse
|
30
|
Sivaprakasam B, Sadagopan P. Development of shiny dashboard application for “genome-wide association study on analysis of SNPs injected in Homo sapiens genome (snips-HsG)”. GENE REPORTS 2021. [DOI: 10.1016/j.genrep.2021.101033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
31
|
Trajkova S, Di Gregorio E, Ferrero GB, Carli D, Pavinato L, Delplancq G, Kuentz P, Brusco A. New Insights into Potocki-Shaffer Syndrome: Report of Two Novel Cases and Literature Review. Brain Sci 2020; 10:brainsci10110788. [PMID: 33126574 PMCID: PMC7693731 DOI: 10.3390/brainsci10110788] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/16/2020] [Accepted: 10/27/2020] [Indexed: 12/24/2022] Open
Abstract
Potocki-Shaffer syndrome (PSS) is a rare non-recurrent contiguous gene deletion syndrome involving chromosome 11p11.2. Current literature implies a minimal region with haploinsufficiency of three genes, ALX4 (parietal foramina), EXT2 (multiple exostoses), and PHF21A (craniofacial anomalies, and intellectual disability). The rest of the PSS phenotype is still not associated with a specific gene. We report a systematic review of the literature and included two novel cases. Because deletions are highly variable in size, we defined three groups of patients considering the PSS-genes involved. We found 23 full PSS cases (ALX4, EXT2, and PHF21A), 14 cases with EXT2-ALX4, and three with PHF21A only. Among the latter, we describe a novel male child showing developmental delay, café-au-lait spots, liner postnatal overgrowth and West-like epileptic encephalopathy. We suggest PSS cases may have epileptic spasms early in life, and PHF21A is likely to be the causative gene. Given their subtle presentation these may be overlooked and if left untreated could lead to a severe type or deterioration in the developmental plateau. If our hypothesis is correct, a timely therapy may ameliorate PSS phenotype and improve patients’ outcomes. Our analysis also shows PHF21A is a candidate for the overgrowth phenotype.
Collapse
Affiliation(s)
- Slavica Trajkova
- Department of Medical Sciences, University of Torino, 10126 Turin, Italy; (S.T.); (L.P.)
| | - Eleonora Di Gregorio
- Medical Genetics Unit, Città della Salute e della Scienza, University Hospital, 10126 Turin, Italy; (E.D.)
| | - Giovanni Battista Ferrero
- Department of Public Health and Paediatrics, University of Torino, 10126 Turin, Italy; (G.B.F.); (D.C.)
| | - Diana Carli
- Department of Public Health and Paediatrics, University of Torino, 10126 Turin, Italy; (G.B.F.); (D.C.)
| | - Lisa Pavinato
- Department of Medical Sciences, University of Torino, 10126 Turin, Italy; (S.T.); (L.P.)
| | - Geoffroy Delplancq
- Centre de Génétique Humaine, Université de Franche-Comté, 25000 Besançon, France; (G.D.)
- Service de Pédiatrie, CHU, 25000 Besançon, France
| | - Paul Kuentz
- Oncobiologie Génétique Bioinformatique, PCBio, Centre Hospitalier Universitaire de Besançon, 25000 Besançon, France; (P.K.)
- UMR-Inserm 1231 GAD, Génétique des Anomalies du développement, Université de Bourgogne Franche-Comté, 21000 Dijon, France
- Fédération Hospitalo-Universitaire Médecine Translationnelle et Anomalies du Développement (FHU TRANSLAD), Centre Hospitalier Universitaire de Dijon et Université de Bourgogne Franche-Comté, 21000 Dijon, France
| | - Alfredo Brusco
- Department of Medical Sciences, University of Torino, 10126 Turin, Italy; (S.T.); (L.P.)
- Medical Genetics Unit, Città della Salute e della Scienza, University Hospital, 10126 Turin, Italy; (E.D.)
- Correspondence: (A.B.)
| |
Collapse
|
32
|
Single-cell strand sequencing of a macaque genome reveals multiple nested inversions and breakpoint reuse during primate evolution. Genome Res 2020; 30:1680-1693. [PMID: 33093070 PMCID: PMC7605249 DOI: 10.1101/gr.265322.120] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Accepted: 09/02/2020] [Indexed: 12/14/2022]
Abstract
Rhesus macaque is an Old World monkey that shared a common ancestor with human ∼25 Myr ago and is an important animal model for human disease studies. A deep understanding of its genetics is therefore required for both biomedical and evolutionary studies. Among structural variants, inversions represent a driving force in speciation and play an important role in disease predisposition. Here we generated a genome-wide map of inversions between human and macaque, combining single-cell strand sequencing with cytogenetics. We identified 375 total inversions between 859 bp and 92 Mbp, increasing by eightfold the number of previously reported inversions. Among these, 19 inversions flanked by segmental duplications overlap with recurrent copy number variants associated with neurocognitive disorders. Evolutionary analyses show that in 17 out of 19 cases, the Hominidae orientation of these disease-associated regions is always derived. This suggests that duplicated sequences likely played a fundamental role in generating inversions in humans and great apes, creating architectures that nowadays predispose these regions to disease-associated genetic instability. Finally, we identified 861 genes mapping at 156 inversions breakpoints, with some showing evidence of differential expression in human and macaque cell lines, thus highlighting candidates that might have contributed to the evolution of species-specific features. This study depicts the most accurate fine-scale map of inversions between human and macaque using a two-pronged integrative approach, such as single-cell strand sequencing and cytogenetics, and represents a valuable resource toward understanding of the biology and evolution of primate species.
Collapse
|
33
|
Sun C, Kovacs P, Guiu-Jurado E. Genetics of Obesity in East Asians. Front Genet 2020; 11:575049. [PMID: 33193685 PMCID: PMC7606890 DOI: 10.3389/fgene.2020.575049] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 09/17/2020] [Indexed: 12/31/2022] Open
Abstract
Obesity has become a public health problem worldwide. Compared with Europe, people in Asia tend to suffer from type 2 diabetes with a lower body mass index (BMI). Genome-wide association studies (GWASs) have identified over 750 loci associated with obesity. Although the majority of GWAS results were conducted in individuals of European ancestry, a recent GWAS in individuals of Asian ancestry has made a significant contribution to the identification of obesity susceptibility loci. Indeed, owing to the multifactorial character of obesity with a strong environmental component, the revealed loci may have distinct contributions in different ancestral genetic backgrounds and in different environments as presented through diet and exercise among other factors. Uncovering novel, yet unrevealed genes in non-European ancestries may further contribute to explaining the missing heritability for BMI. In this review, we aimed to summarize recent advances in obesity genetics in individuals of Asian ancestry. We therefore compared proposed mechanisms underlying susceptibility loci for obesity associated with individuals of European and Asian ancestries and discussed whether known genetic variants might explain ethnic differences in obesity risk. We further acknowledged that GWAS implemented in individuals of Asian ancestries have not only validated the potential role of previously specified obesity susceptibility loci but also exposed novel ones, which have been missed in the initial genetic studies in individuals of European ancestries. Thus, multi-ethnic studies have a great potential not only to contribute to a better understanding of the complex etiology of human obesity but also potentially of ethnic differences in the prevalence of obesity, which may ultimately pave new avenues in more targeted and personalized obesity treatments.
Collapse
Affiliation(s)
| | - Peter Kovacs
- Medical Department III – Endocrinology, Nephrology, Rheumatology, University of Leipzig Medical Center, Leipzig, Germany
| | | |
Collapse
|
34
|
Ortiz-Prado E, Simbaña-Rivera K, Gómez-Barreno L, Tamariz L, Lister A, Baca JC, Norris A, Adana-Diaz L. Potential research ethics violations against an indigenous tribe in Ecuador: a mixed methods approach. BMC Med Ethics 2020; 21:100. [PMID: 33069227 PMCID: PMC7568418 DOI: 10.1186/s12910-020-00542-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Accepted: 10/06/2020] [Indexed: 01/22/2023] Open
Abstract
Background Biomedical and ethnographic studies among indigenous people are common practice in health and geographical research. Prior health research misconduct has been documented, particularly when obtaining genetic material. The objective of this study was to crossmatch previously published data with the perceptions of the Waorani peoples about the trading of their genetic material and other biological samples. Methods We conducted a mixed methods study design using a tailored 15-item questionnaire in 72 participants and in-depth interviews in 55 participants belonging to 20 Waorani communities about their experiences and perceptions of participating in biomedical research projects. Additionally, we conducted a systematic review of the literature in order to crossmatch the published results of studies stating the approval of an ethics committee and individual consent within their work. Results A total of 40 men (60%) and 32 women (40%), with a mean age of 57 ± 15 years agreed to be interviewed for inclusion. Five main categories around the violation of good clinical practices were identified, concerning the obtention of blood samples from a recently contacted Waorani native community within the Amazonian region of Ecuador. These themes are related to the lack of adequate communication between community members and researchers as well as the voluntariness to participate in health research. Additionally, over 40 years, a total of 38 manuscripts related to the use of biological samples in Waorani indigenous people were published. The majority of the studies (68%) did not state within their article obtaining research ethics board approval, and 71% did not report obtaining the informed consent of the participants prior to the execution of the project. Conclusion Clinical Research on the Waorani community in the Ecuadorian Amazon basin has been performed on several occasions. Unfortunately, the majority of these projects did not follow the appropriate ethical and professional standards in either reporting the results or fulfilling them. The results of our investigation suggest that biological material, including genetic material, has been used by researchers globally, with some omitting the minimum information required to guarantee transparency and good clinical practices. We highlight the importance of stating ethics within research to avoid breaches in research transparency.
Collapse
Affiliation(s)
- Esteban Ortiz-Prado
- One Health Research Group, Faculty of Medicine, Universidad de Las Americas, Ecuador Calle de los Colimes y Avenida De los Granados, Quito, 170137, Ecuador.
| | - Katherine Simbaña-Rivera
- One Health Research Group, Faculty of Medicine, Universidad de Las Americas, Ecuador Calle de los Colimes y Avenida De los Granados, Quito, 170137, Ecuador
| | - Lenin Gómez-Barreno
- One Health Research Group, Faculty of Medicine, Universidad de Las Americas, Ecuador Calle de los Colimes y Avenida De los Granados, Quito, 170137, Ecuador
| | - Leonardo Tamariz
- Division of Population Health and Computational Medicine, University of Miami, Florida, USA
| | - Alex Lister
- Public Health Program, Faculty of Medicine, University of Southampton, Southampton, England
| | - Juan Carlos Baca
- Grassland Group, Technical University of Munich, Munich, Germany
| | | | - Lila Adana-Diaz
- Faculty of Psychology, Universidad de Las Americas, Quito, Ecuador
| |
Collapse
|
35
|
Novel InDels of GHR, GHRH, GHRHR and Their Association with Growth Traits in Seven Chinese Sheep Breeds. Animals (Basel) 2020; 10:ani10101883. [PMID: 33076416 PMCID: PMC7602648 DOI: 10.3390/ani10101883] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 10/08/2020] [Accepted: 10/12/2020] [Indexed: 12/18/2022] Open
Abstract
The GH growth axis plays an important role in the growth and development of animals and runs through the whole life of animals. Many studies have shown that molecular mutations in key genes of the GH axis will affect the growth and development of animals. The purpose of this study was to explore the distribution characteristics of InDels of GHR, GHRH, and GHRHR in seven Chinese sheep populations, and to further explore the relationship between InDels and sheep growth traits. GHR showed high variation in Chinese sheep, and GHR-53 showed the highest minimum allele frequency (MAF). There was only one InDel mutation site in both GHRH and GHRHR. The genotype frequencies of Hu sheep (HS), Tong sheep (TS), and Lanzhou fat-tail sheep (LFTS) were quite different from other breeds. The association between GHR, GHRH, and GHRHR InDels and body size traits in seven varieties were analyzed. The results showed that there was no significant relationship between GHRH and body size traits in the seven sheep populations. There was a positive association between GHR-21 and hip height of LFSH (p < 0.05). GHR-43 reduced body height and chest depth of Small tail han sheep (STHS) and hip width of TS. GHR-44 significantly affected the body weight of HS, the body height of STHS and the head depth of TS. GHR-53 significantly reduced cannon girth of HS, chest of STHS and forehead width of TS. GHRHR-2 significantly reduced the body weight of LFHS. To sum up, this study revealed the effects of GHR, GHRH, and GHRHR InDels on sheep phenotypic traits, which indicated their potential application prospects in the genetic improvement of mutton sheep.
Collapse
|
36
|
Guo J, Cao K, Deng C, Li Y, Zhu G, Fang W, Chen C, Wang X, Wu J, Guan L, Wu S, Guo W, Yao JL, Fei Z, Wang L. An integrated peach genome structural variation map uncovers genes associated with fruit traits. Genome Biol 2020; 21:258. [PMID: 33023652 PMCID: PMC7539501 DOI: 10.1186/s13059-020-02169-y] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Accepted: 09/23/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Genome structural variations (SVs) have been associated with key traits in a wide range of agronomically important species; however, SV profiles of peach and their functional impacts remain largely unexplored. RESULTS Here, we present an integrated map of 202,273 SVs from 336 peach genomes. A substantial number of SVs have been selected during peach domestication and improvement, which together affect 2268 genes. Genome-wide association studies of 26 agronomic traits using these SVs identify a number of candidate causal variants. A 9-bp insertion in Prupe.4G186800, which encodes a NAC transcription factor, is shown to be associated with early fruit maturity, and a 487-bp deletion in the promoter of PpMYB10.1 is associated with flesh color around the stone. In addition, a 1.67 Mb inversion is highly associated with fruit shape, and a gene adjacent to the inversion breakpoint, PpOFP1, regulates flat shape formation. CONCLUSIONS The integrated peach SV map and the identified candidate genes and variants represent valuable resources for future genomic research and breeding in peach.
Collapse
Affiliation(s)
- Jian Guo
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
- College of Horticulture & Forestry Sciences, Huazhong Agricultural University, Wuhan, China
| | - Ke Cao
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Cecilia Deng
- The New Zealand Institute for Plant & Food Research Limited, Private Bag 92169, Auckland, 1142, New Zealand
| | - Yong Li
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Gengrui Zhu
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Weichao Fang
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Changwen Chen
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Xinwei Wang
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Jinlong Wu
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Liping Guan
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Shan Wu
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, USA
| | - Wenwu Guo
- College of Horticulture & Forestry Sciences, Huazhong Agricultural University, Wuhan, China
| | - Jia-Long Yao
- The New Zealand Institute for Plant & Food Research Limited, Private Bag 92169, Auckland, 1142, New Zealand.
| | - Zhangjun Fei
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, USA.
- US Department of Agriculture-Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, USA.
| | - Lirong Wang
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China.
| |
Collapse
|
37
|
Hall A, Bandres-Ciga S, Diez-Fairen M, Quinn JP, Billingsley KJ. Genetic Risk Profiling in Parkinson's Disease and Utilizing Genetics to Gain Insight into Disease-Related Biological Pathways. Int J Mol Sci 2020; 21:E7332. [PMID: 33020390 PMCID: PMC7584037 DOI: 10.3390/ijms21197332] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 09/30/2020] [Accepted: 10/01/2020] [Indexed: 12/18/2022] Open
Abstract
Parkinson's disease (PD) is a complex disorder underpinned by both environmental and genetic factors. The latter only began to be understood around two decades ago, but since then great inroads have rapidly been made into deconvoluting the genetic component of PD. In particular, recent large-scale projects such as genome-wide association (GWA) studies have provided insight into the genetic risk factors associated with genetically ''complex'' PD (PD that cannot readily be attributed to single deleterious mutations). Here, we discuss the plethora of genetic information provided by PD GWA studies and how this may be utilized to generate polygenic risk scores (PRS), which may be used in the prediction of risk and trajectory of PD. We also comment on how pathway-specific genetic profiling can be used to gain insight into PD-related biological pathways, and how this may be further utilized to nominate causal PD genes and potentially druggable therapeutic targets. Finally, we outline the current limits of our understanding of PD genetics and the potential contribution of variation currently uncaptured in genetic studies, focusing here on uncatalogued structural variants.
Collapse
Affiliation(s)
- Ashley Hall
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular & Integrative Biology, University of Liverpool, L69 7BE, UK; (A.H.); (J.P.Q.)
| | - Sara Bandres-Ciga
- Molecular Genetics Section, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA;
| | - Monica Diez-Fairen
- Neurogenetics Group, University Hospital MutuaTerrassa, Sant Antoni 19, 08221 Terrassa, Barcelona, Spain;
| | - John P. Quinn
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular & Integrative Biology, University of Liverpool, L69 7BE, UK; (A.H.); (J.P.Q.)
| | - Kimberley J. Billingsley
- Molecular Genetics Section, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA;
| |
Collapse
|
38
|
Wohlers I, Künstner A, Munz M, Olbrich M, Fähnrich A, Calonga-Solís V, Ma C, Hirose M, El-Mosallamy S, Salama M, Busch H, Ibrahim S. An integrated personal and population-based Egyptian genome reference. Nat Commun 2020; 11:4719. [PMID: 32948767 PMCID: PMC7501257 DOI: 10.1038/s41467-020-17964-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 07/24/2020] [Indexed: 02/05/2023] Open
Abstract
A small number of de novo assembled human genomes have been reported to date, and few have been complemented with population-based genetic variation, which is particularly important for North Africa, a region underrepresented in current genome-wide references. Here, we combine long- and short-read whole-genome sequencing data with recent assembly approaches into a de novo assembly of an Egyptian genome. The assembly demonstrates well-balanced quality metrics and is complemented with variant phasing via linked reads into haploblocks, which we associate with gene expression changes in blood. To construct an Egyptian genome reference, we identify genome-wide genetic variation within a cohort of 110 Egyptian individuals. We show that differences in allele frequencies and linkage disequilibrium between Egyptians and Europeans may compromise the transferability of European ancestry-based genetic disease risk and polygenic scores, substantiating the need for multi-ethnic genome references. Thus, the Egyptian genome reference will be a valuable resource for precision medicine.
Collapse
Affiliation(s)
- Inken Wohlers
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Axel Künstner
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Matthias Munz
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Michael Olbrich
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Anke Fähnrich
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Verónica Calonga-Solís
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
- Department of Genetics, Federal University of Paraná (UFPR), Centro Politécnico, Jardim das Américas, 81531-990, Curitiba, Brazil
| | - Caixia Ma
- Novogene (UK) Company Limited, 25 Cambridge Science Park, Milton Road, CB4 0FW, Cambridge, UK
| | - Misa Hirose
- Genetics Division, Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany
| | - Shaaban El-Mosallamy
- Medical Experimental Research Center (MERC), Mansoura University, Elgomhouria St., Dakahlia Governorate, 35516, Mansoura, Egypt
| | - Mohamed Salama
- Medical Experimental Research Center (MERC), Mansoura University, Elgomhouria St., Dakahlia Governorate, 35516, Mansoura, Egypt
- Institute of Global Health and Human Ecology, The American University in Cairo, AUC avenue, 11835, Cairo, Egypt
| | - Hauke Busch
- Medical Systems Biology Division, Lübeck Institute of Experimental Dermatology and Institute for Cardiogenetics, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany.
| | - Saleh Ibrahim
- Genetics Division, Lübeck Institute of Experimental Dermatology, University of Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany.
| |
Collapse
|
39
|
Jia H, Wei H, Zhu D, Ma J, Yang H, Wang R, Feng X. PASA: Identifying More Credible Structural Variants of Hedou12. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1493-1503. [PMID: 31425044 DOI: 10.1109/tcbb.2019.2934463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Although plenty of structural variant detecting approaches for human genomes can be looked up in the literatures, little has been acknowledged on the effectiveness of those structural variant softwares for plant genomes. Moreover, it has been demonstrated frequent occurrences for those structural variant detecting softwares to find too many false structural variants. In this paper, we devote to detect deletions, insertions, and inversions, in total of three kinds of structural variants occurring in Hedou12 genome in contrast to Williams82 genome. To find more potential structural variants, we try to develop new principles to detect discordant and split read map sets supporting structural variants. Aiming to enhance the precision of structural variant detections, we propose two new sequencing characteristic based probability models, which use the sequencing parameters of Hedou12 genome as well as the parameters for Hedou12 paired-end reads to be aligned onto Williams82, to evaluate the probability for a potential structural variant to occur in. To remove the false members from those potential structural variants, we propose a set cover problem model to describe formally on which potential structural variants it should accept to achieve as high as possible a probability summation. This will achieve a solution with more credible structural variants, which can be verified by comparing with DELLY version 0.5.8 and LUMPY version 0.2.2.3. Our algorithm has been verified to be able to find deletions, insertions, and inversions in Hedou12 in contrast to Williams82 DELLY as well as LUMPY fails to find.
Collapse
|
40
|
Harris CJ, Waters AM, Tracy ET, Christison-Lagay E, Baertshiger RM, Ehrlich P, Abdessalam S, Aldrink JH, Rhee DS, Dasgupta R, Rodeberg DA, Lautz TB. Precision oncology: A primer for pediatric surgeons from the APSA cancer committee. J Pediatr Surg 2020; 55:1706-1713. [PMID: 31718869 DOI: 10.1016/j.jpedsurg.2019.10.017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 10/01/2019] [Accepted: 10/02/2019] [Indexed: 01/17/2023]
Abstract
Although most children with cancer can be cured of their disease, a subset of patients with adverse tumor types or biological features, and those with relapsed or refractory disease have significantly worse prognosis. Furthermore, current cytotoxic therapy is associated with significant late effects. Precision oncology, using molecular therapeutics targeted against unique genetic features of the patient's tumor, offers the potential to transform the multimodal therapy for these patients. Potentiated by advances in sequencing technology and molecular therapeutic development, and accelerated by large-scale multi-institutional basket trials, the field of pediatric precision oncology has entered the mainstream. These novel therapeutics have important implications for surgical decision making, as well as pre- and postoperative care. This review summarizes the current state of precision medicine in pediatric oncology including the active North American and European precision oncology clinical trials. LEVEL OF EVIDENCE: Treatment study Level V.
Collapse
Affiliation(s)
- Courtney J Harris
- Department of Surgery, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Division of Pediatric Surgery, Ann and Robert H. Lurie Children's Hospital of Chicago, Chicago, IL, USA
| | - Alicia M Waters
- Division of Pediatric Surgery, Department of Surgery, University of Alabama at Birmingham, Children's of Alabama
| | - Elisabeth T Tracy
- Division of Pediatric Surgery, Department of Surgery, Duke University Medical Center, Durham, NC, USA
| | - Emily Christison-Lagay
- Division of Pediatric Surgery, Department of Surgery, Yale-New Haven Children's Hospital, Yale School of Medicine, New Haven, CT
| | - Reto M Baertshiger
- Division of Pediatric Surgery, Department of Surgery, Dartmouth Hitchcock Medical Center, Lebanon, NH, USA
| | - Peter Ehrlich
- Section of Pediatric Surgery, Department of Surgery University of Michigan School of Medicine, Ann Arbor, MI
| | - Shahab Abdessalam
- Division of Pediatric Surgery, Boys Town National Research Hospital, Omaha, NE
| | - Jennifer H Aldrink
- Division of Pediatric Surgery, Department of Surgery, Nationwide Children's Hospital, The Ohio State University College of Medicine, Columbus, OH
| | - Daniel S Rhee
- Division of Pediatric Surgery, Department of Surgery, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Roshni Dasgupta
- Division of Pediatric Surgery, Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - David A Rodeberg
- Division of Pediatric Surgery, Department of Surgery, East Carolina University, Greenville, NC
| | - Timothy B Lautz
- Department of Surgery, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Division of Pediatric Surgery, Ann and Robert H. Lurie Children's Hospital of Chicago, Chicago, IL, USA.
| |
Collapse
|
41
|
Recurrent inversion toggling and great ape genome evolution. Nat Genet 2020; 52:849-858. [PMID: 32541924 PMCID: PMC7415573 DOI: 10.1038/s41588-020-0646-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 05/15/2020] [Indexed: 01/14/2023]
Abstract
Inversions play an important role in disease and evolution but are difficult to characterize because their breakpoints map to large repeats. We increased by sixfold the number (n = 1,069) of previously reported great ape inversions by using single-cell DNA template strand and long-read sequencing. We find that the X chromosome is most enriched (2.5-fold) for inversions, on the basis of its size and duplication content. There is an excess of differentially expressed primate genes near the breakpoints of large (>100 kilobases (kb)) inversions but not smaller events. We show that when great ape lineage-specific duplications emerge, they preferentially (approximately 75%) occur in an inverted orientation compared to that at their ancestral locus. We construct megabase-pair scale haplotypes for individual chromosomes and identify 23 genomic regions that have recurrently toggled between a direct and an inverted state over 15 million years. The direct orientation is most frequently the derived state for human polymorphisms that predispose to recurrent copy number variants associated with neurodevelopmental disease.
Collapse
|
42
|
Soylev A, Le TM, Amini H, Alkan C, Hormozdiari F. Discovery of tandem and interspersed segmental duplications using high-throughput sequencing. Bioinformatics 2020; 35:3923-3930. [PMID: 30937433 DOI: 10.1093/bioinformatics/btz237] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Revised: 01/20/2019] [Accepted: 03/29/2019] [Indexed: 01/01/2023] Open
Abstract
MOTIVATION Several algorithms have been developed that use high-throughput sequencing technology to characterize structural variations (SVs). Most of the existing approaches focus on detecting relatively simple types of SVs such as insertions, deletions and short inversions. In fact, complex SVs are of crucial importance and several have been associated with genomic disorders. To better understand the contribution of complex SVs to human disease, we need new algorithms to accurately discover and genotype such variants. Additionally, due to similar sequencing signatures, inverted duplications or gene conversion events that include inverted segmental duplications are often characterized as simple inversions, likewise, duplications and gene conversions in direct orientation may be called as simple deletions. Therefore, there is still a need for accurate algorithms to fully characterize complex SVs and thus improve calling accuracy of more simple variants. RESULTS We developed novel algorithms to accurately characterize tandem, direct and inverted interspersed segmental duplications using short read whole genome sequencing datasets. We integrated these methods to our TARDIS tool, which is now capable of detecting various types of SVs using multiple sequence signatures such as read pair, read depth and split read. We evaluated the prediction performance of our algorithms through several experiments using both simulated and real datasets. In the simulation experiments, using a 30× coverage TARDIS achieved 96% sensitivity with only 4% false discovery rate. For experiments that involve real data, we used two haploid genomes (CHM1 and CHM13) and one human genome (NA12878) from the Illumina Platinum Genomes set. Comparison of our results with orthogonal PacBio call sets from the same genomes revealed higher accuracy for TARDIS than state-of-the-art methods. Furthermore, we showed a surprisingly low false discovery rate of our approach for discovery of tandem, direct and inverted interspersed segmental duplications prediction on CHM1 (<5% for the top 50 predictions). AVAILABILITY AND IMPLEMENTATION TARDIS source code is available at https://github.com/BilkentCompGen/tardis, and a corresponding Docker image is available at https://hub.docker.com/r/alkanlab/tardis/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Arda Soylev
- Department of Computer Engineering, Bilkent University, Ankara.,Department of Computer Engineering, Konya Food and Agriculture University, Konya, Turkey
| | - Thong Minh Le
- UC-Davis Genome Center, University of California, Davis, CA, USA.,Department of Computer Science, University of California, Davis, CA, USA
| | - Hajar Amini
- Department of Neurology, School of Medicine, University of California, Davis, CA, USA
| | - Can Alkan
- Department of Computer Engineering, Bilkent University, Ankara.,Bilkent-Hacettepe Health Sciences and Technologies Program, Ankara, Turkey.,Department of Computer Science, ETH Zürich, Zurich, Switzerland
| | - Fereydoun Hormozdiari
- UC-Davis Genome Center, University of California, Davis, CA, USA.,Department of Biochemistry and Molecular Medicine, University of California, Davis, CA, USA.,MIND Institute, University of California, Davis, CA, USA
| |
Collapse
|
43
|
Wang Z, Guo J, Guo Y, Yang Y, Teng T, Yu Q, Wang T, Zhou M, Zhu Q, Wang W, Zhang Q, Yang H. Genome-Wide Detection of CNVs and Association With Body Weight in Sheep Based on 600K SNP Arrays. Front Genet 2020; 11:558. [PMID: 32582291 PMCID: PMC7297042 DOI: 10.3389/fgene.2020.00558] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 05/07/2020] [Indexed: 01/30/2023] Open
Abstract
Copy number variations (CNVs) are important genomic structural variations and can give rise to significant phenotypic diversity. Herein, we used high-density 600K SNP arrays to detect CNVs in two synthetic lines of sheep (DS and SHH) and in Hu sheep (a local Chinese breed). A total of 919 CNV regions (CNVRs) were detected with a total length of 48.17 Mb, accounting for 1.96% of the sheep genome. These CNVRs consisted of 730 gains, 102 losses, and 87 complex CNVRs. These CNVRs were significantly enriched in the segmental duplication (SD) region. A CNVR-based cluster analysis of the three breeds revealed that the DS and SHH breeds share a close genetic relationship. Functional analysis revealed that some genes in these CNVRs were also significantly enriched in the olfactory transduction pathway (oas04740), including members of the OR gene family such as OR6C76, OR4Q2, and OR4K14. Using association analyses and previous gene annotations, we determined that a subset of identified genes was likely to be associated with body weight, including FOXF2, MAPK12, MAP3K11, STRBP, and C14orf132. Together, these results offer valuable information that will guide future efforts to explore the genetic basis for body weight in sheep.
Collapse
Affiliation(s)
- Zhipeng Wang
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, Harbin, China
| | - Jing Guo
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, Harbin, China
| | - Yuanyuan Guo
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, Harbin, China
| | - Yonglin Yang
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Science, Shihezi, China
| | - Teng Teng
- Institute of Animal Nutrition, Northeast Agricultural University, Harbin, China
| | - Qian Yu
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Science, Shihezi, China
| | - Tao Wang
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, Harbin, China
| | - Meng Zhou
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, Harbin, China
| | - Qiusi Zhu
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, Harbin, China
| | - Wenwen Wang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Shandong Agricultural University, Tai'an, China
| | - Qin Zhang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Shandong Agricultural University, Tai'an, China
| | - Hua Yang
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Science, Shihezi, China
| |
Collapse
|
44
|
Zeng T, Zhang D, Li Y, Li C, Liu X, Shi Y, Song Y, Li Y, Wang T. Identification of genomic insertion and flanking sequences of the transgenic drought-tolerant maize line "SbSNAC1-382" using the single-molecule real-time (SMRT) sequencing method. PLoS One 2020; 15:e0226455. [PMID: 32275664 PMCID: PMC7147794 DOI: 10.1371/journal.pone.0226455] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Accepted: 03/22/2020] [Indexed: 11/22/2022] Open
Abstract
Safety assessment of genetically modified (GM) crops is crucial at the product-development phase before GM crops are placed on the market. Determining characteristics of sequences flanking exogenous insertion sequences is essential for the safety assessment and marketing of transgenic crops. In this study, we used genome walking and whole-genome sequencing (WGS) to identify the flanking sequence characteristics of the SbSNAC1 transgenic drought-tolerant maize line "SbSNAC1-382", but both of the two methods failed. Then, we constructed a genomic fosmid library of the transgenic maize line, which contained 4.18×105 clones with an average insertion fragment of 35 kb, covering 5.85 times the maize genome. Subsequently, three positive clones were screened by pairs of specific primers, and one of the three positive clones was sequenced by using single-molecule real-time (SMRT) sequencing technology. More than 1.95 Gb sequence data (~105× coverage) for the sequenced clone were generated. The junction reads mapped to the boundaries of T-DNA, and the flanking sequences in the transgenic line were identified by comparing all sequencing reads with the maize reference genome and the sequence of the transgenic vector. Furthermore, the putative insertion loci and flanking sequences were confirmed by PCR amplification and Sanger sequencing. The results indicated that two copies of the exogenous T-DNA fragments were inserted at the same genomic site, and the exogenous T-DNA fragments were integrated at the position of Chromosome 5 from 177155650 to 177155696 in the transgenic line 382. In this study, we demonstrated the successful application of the SMRT technology for the characterization of genomic insertion and flanking sequences.
Collapse
Affiliation(s)
- Tingru Zeng
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Dengfeng Zhang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yongxiang Li
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Chunhui Li
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xuyang Liu
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yunsu Shi
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yanchun Song
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yu Li
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Tianyu Wang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
45
|
Karaoğlanoğlu F, Ricketts C, Ebren E, Rasekh ME, Hajirasouliha I, Alkan C. VALOR2: characterization of large-scale structural variants using linked-reads. Genome Biol 2020; 21:72. [PMID: 32192518 PMCID: PMC7083023 DOI: 10.1186/s13059-020-01975-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 02/24/2020] [Indexed: 12/31/2022] Open
Abstract
Most existing methods for structural variant detection focus on discovery and genotyping of deletions, insertions, and mobile elements. Detection of balanced structural variants with no gain or loss of genomic segments, for example, inversions and translocations, is a particularly challenging task. Furthermore, there are very few algorithms to predict the insertion locus of large interspersed segmental duplications and characterize translocations. Here, we propose novel algorithms to characterize large interspersed segmental duplications, inversions, deletions, and translocations using linked-read sequencing data. We redesign our earlier algorithm, VALOR, and implement our new algorithms in a new software package, called VALOR2.
Collapse
Affiliation(s)
- Fatih Karaoğlanoğlu
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
| | - Camir Ricketts
- Tri-Institutional Computational Biology & Medicine Program, Cornell University, 1300 York Ave, New York, 10065 NY USA
- Department of Physiology and Biophysics, Institute for Computational Biomedicine, Weill Cornell Medicine, 1300 York Ave, New York, 10065 NY USA
| | - Ezgi Ebren
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
| | - Marzieh Eslami Rasekh
- Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, 02215 MA USA
| | - Iman Hajirasouliha
- Department of Physiology and Biophysics, Institute for Computational Biomedicine, Weill Cornell Medicine, 1300 York Ave, New York, 10065 NY USA
- Englander Institute for Precision Medicine, The Meyer Cancer Center, Weill Cornell Medicine, 1300 York Ave, New York, 10065 NY USA
| | - Can Alkan
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
- Bilkent-Hacettepe Health Sciences and Technologies Program, Bilkent University, Ankara, 06800 Turkey
| |
Collapse
|
46
|
Abstract
Identifying structural variation (SV) is essential for genome interpretation but has been historically difficult due to limitations inherent to available genome technologies. Detection methods that use ensemble algorithms and emerging sequencing technologies have enabled the discovery of thousands of SVs, uncovering information about their ubiquity, relationship to disease and possible effects on biological mechanisms. Given the variability in SV type and size, along with unique detection biases of emerging genomic platforms, multiplatform discovery is necessary to resolve the full spectrum of variation. Here, we review modern approaches for investigating SVs and proffer that, moving forwards, studies integrating biological information with detection will be necessary to comprehensively understand the impact of SV in the human genome.
Collapse
Affiliation(s)
- Steve S Ho
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Alexander E Urban
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Ryan E Mills
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
47
|
Yin D, Ji C, Song Q, Zhang W, Zhang X, Zhao K, Chen CY, Wang C, He G, Liang Z, Ma X, Li Z, Tang Y, Wang Y, Li K, Ning L, Zhang H, Zhao K, Li X, Yu H, Lei Y, Wang M, Ma L, Zheng H, Zhang Y, Zhang J, Hu W, Chen ZJ. Comparison of Arachis monticola with Diploid and Cultivated Tetraploid Genomes Reveals Asymmetric Subgenome Evolution and Improvement of Peanut. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2020; 7:1901672. [PMID: 32099754 PMCID: PMC7029647 DOI: 10.1002/advs.201901672] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 10/16/2019] [Indexed: 05/05/2023]
Abstract
Like many important crops, peanut is a polyploid that underwent polyploidization, evolution, and domestication. The wild allotetraploid peanut species Arachis monticola (A. monticola) is an important and unique link from the wild diploid species to cultivated tetraploid species in the Arachis lineage. However, little is known about A. monticola and its role in the evolution and domestication of this important crop. A fully annotated sequence of ≈2.6 Gb A. monticola genome and comparative genomics of the Arachis species is reported. Genomic reconstruction of 17 wild diploids from AA, BB, EE, KK, and CC groups and 30 tetraploids demonstrates a monophyletic origin of A and B subgenomes in allotetraploid peanuts. The wild and cultivated tetraploids undergo asymmetric subgenome evolution, including homoeologous exchanges, homoeolog expression bias, and structural variation (SV), leading to subgenome functional divergence during peanut domestication. Significantly, SV-associated homoeologs tend to show expression bias and correlation with pod size increase from diploids to wild and cultivated tetraploids. Moreover, genomic analysis of disease resistance genes shows the unique alleles present in the wild peanut can be introduced into breeding programs to improve some resistance traits in the cultivated peanuts. These genomic resources are valuable for studying polyploid genome evolution, domestication, and improvement of peanut production and resistance.
Collapse
Affiliation(s)
- Dongmei Yin
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Changmian Ji
- Biomarker Technologies CorporationBeijing101300China
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off‐Season Reproduction RegionsInstitute of Tropical Bioscience and BiotechnologyChinese Academy of Tropical Agricultural SciencesHaikou571101China
| | - Qingxin Song
- State Key Laboratory of Crop Genetics and Germplasm EnhancementNanjing Agricultural UniversityNanjing210095China
- Department of Molecular Biosciences and Center for Computational Biology and BioinformaticsThe University of Texas at AustinAustin78705USA
| | - Wanke Zhang
- State Key Lab of Plant GenomicsInstitute of Genetics and Developmental BiologyINASEEDChinese Academy of SciencesBeijing100101China
| | - Xingguo Zhang
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Kunkun Zhao
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | | | | | - Guohao He
- Department of Agricultural and Environmental SciencesTuskegee UniversityTuskegeeAL36088USA
| | - Zhe Liang
- Centre for Organismal StudiesUniversity of HeidelbergD‐69120HeidelbergGermany
| | - Xingli Ma
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Zhongfeng Li
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Yueyi Tang
- Shandong Peanut Research InstituteQingdao266000China
| | - Yuejun Wang
- National Key Laboratory of Plant Molecular GeneticsCenter for Excellence in Molecular Plant SciencesInstitute of Plant Physiology and EcologyShanghai Institutes for Biological SciencesChinese Academy of SciencesShanghai200032China
| | - Ke Li
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Longlong Ning
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Hui Zhang
- College of AgricultureAuburn UniversityAuburnAL36849USA
| | - Kai Zhao
- College of AgronomyHenan Agricultural UniversityZhengzhou450002China
| | - Xuming Li
- Biomarker Technologies CorporationBeijing101300China
| | - Haiyan Yu
- Biomarker Technologies CorporationBeijing101300China
| | - Yan Lei
- Biomarker Technologies CorporationBeijing101300China
| | | | - Liming Ma
- Biomarker Technologies CorporationBeijing101300China
| | - Hongkun Zheng
- Biomarker Technologies CorporationBeijing101300China
| | - Yijing Zhang
- National Key Laboratory of Plant Molecular GeneticsCenter for Excellence in Molecular Plant SciencesInstitute of Plant Physiology and EcologyShanghai Institutes for Biological SciencesChinese Academy of SciencesShanghai200032China
| | - Jinsong Zhang
- State Key Lab of Plant GenomicsInstitute of Genetics and Developmental BiologyINASEEDChinese Academy of SciencesBeijing100101China
| | - Wei Hu
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off‐Season Reproduction RegionsInstitute of Tropical Bioscience and BiotechnologyChinese Academy of Tropical Agricultural SciencesHaikou571101China
| | - Z. Jeffrey Chen
- State Key Laboratory of Crop Genetics and Germplasm EnhancementNanjing Agricultural UniversityNanjing210095China
- Department of Molecular Biosciences and Center for Computational Biology and BioinformaticsThe University of Texas at AustinAustin78705USA
| |
Collapse
|
48
|
Rare copy number variants in over 100,000 European ancestry subjects reveal multiple disease associations. Nat Commun 2020; 11:255. [PMID: 31937769 PMCID: PMC6959272 DOI: 10.1038/s41467-019-13624-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Accepted: 11/14/2019] [Indexed: 01/05/2023] Open
Abstract
Copy number variants (CNVs) are suggested to have a widespread impact on the human genome and phenotypes. To understand the role of CNVs across human diseases, we examine the CNV genomic landscape of 100,028 unrelated individuals of European ancestry, using SNP and CGH array datasets. We observe an average CNV burden of ~650 kb, identifying a total of 11,314 deletion, 5625 duplication, and 2746 homozygous deletion CNV regions (CNVRs). In all, 13.7% are unreported, 58.6% overlap with at least one gene, and 32.8% interrupt coding exons. These CNVRs are significantly more likely to overlap OMIM genes (2.94-fold), GWAS loci (1.52-fold), and non-coding RNAs (1.44-fold), compared with random distribution (P < 1 × 10−3). We uncover CNV associations with four major disease categories, including autoimmune, cardio-metabolic, oncologic, and neurological/psychiatric diseases, and identify several drug-repurposing opportunities. Our results demonstrate robust frequency definition for large-scale rare variant association studies, identify CNVs associated with major disease categories, and illustrate the pleiotropic impact of CNVs in human disease. Associations of copy number variations (CNVs) with complex traits are challenging to study because of their low frequency. Here, the authors analyse SNP array and array comparative genomic hybridization data of 100,028 individuals and report their associations with immune-related, cardiometabolic and neuropsychiatric diseases as well as cancer.
Collapse
|
49
|
Li L, Hu B, Li X, Li L. Characterization of mTERF family in allotetraploid peanut and their expression levels in response to dehydration stress. BIOTECHNOL BIOTEC EQ 2020. [DOI: 10.1080/13102818.2020.1825121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Affiliation(s)
- Limei Li
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Science, South China Normal University, Guangzhou, Guangdong, PR China
| | - Bo Hu
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Science, South China Normal University, Guangzhou, Guangdong, PR China
| | - Xiaoyun Li
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Science, South China Normal University, Guangzhou, Guangdong, PR China
| | - Ling Li
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Science, South China Normal University, Guangzhou, Guangdong, PR China
| |
Collapse
|
50
|
Dai Z, Li T, Li J, Han Z, Pan Y, Tang S, Diao X, Luo M. High-throughput long paired-end sequencing of a Fosmid library by PacBio. PLANT METHODS 2019; 15:142. [PMID: 31788019 PMCID: PMC6878638 DOI: 10.1186/s13007-019-0525-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 11/12/2019] [Indexed: 06/10/2023]
Abstract
BACKGROUND Large insert paired-end sequencing technologies are important tools for assembling genomes, delineating associated breakpoints and detecting structural rearrangements. To facilitate the comprehensive detection of inter- and intra-chromosomal structural rearrangements or variants (SVs) and complex genome assembly with long repeats and segmental duplications, we developed a new method based on single-molecule real-time synthesis sequencing technology for generating long paired-end sequences of large insert DNA libraries. RESULTS A Fosmid vector, pHZAUFOS3, was developed with the following new features: (1) two 18-bp non-palindromic I-SceI sites flank the cloning site, and another two sites are present in the skeleton of the vector, allowing long DNA inserts (and the long paired-ends in this paper) to be recovered as single fragments and the vector (~ 8 kb) to be fragmented into 2-3 kb fragments by I-SceI digestion and therefore was effectively removed from the long paired-ends (5-10 kb); (2) the chloramphenicol (Cm) resistance gene and replicon (oriV), necessary for colony growth, are located near the two sides of the cloning site, helping to increase the proportion of the paired-end fragments to single-end fragments in the paired-end libraries. Paired-end libraries were constructed by ligating the size-selected, mechanically sheared pooled Fosmid DNA fragments to the Ampicillin (Amp) resistance gene fragment and screening the colonies with Cm and Amp. We tested this method on yeast and Setaria italica Yugu1. Fosmid-size paired-ends with an average length longer than 2 kb for each end were generated. The N50 scaffold lengths of the de novo assemblies of the yeast and S. italica Yugu1 genomes were significantly improved. Five large and five small structural rearrangements or assembly errors spanning tens of bp to tens of kb were identified in S. italica Yugu1 including deletions, inversions, duplications and translocations. CONCLUSIONS We developed a new method for long paired-end sequencing of large insert libraries, which can efficiently improve the quality of de novo genome assembly and identify large and small structural rearrangements or assembly errors.
Collapse
Affiliation(s)
- Zhaozhao Dai
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 China
| | - Tong Li
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 China
| | - Jiadong Li
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 China
| | - Zhifei Han
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 China
| | - Yonglong Pan
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 China
| | - Sha Tang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 10081 China
| | - Xianmin Diao
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 10081 China
| | - Meizhong Luo
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 China
| |
Collapse
|