Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zheng GX, Lau BT, Schnall-Levin M, Jarosz M, Bell JM, Hindson CM, Kyriazopoulou-Panagiotopoulou S, Masquelier DA, Merrill L, Terry JM, Mudivarti PA, Wyatt PW, Bharadwaj R, Makarewicz AJ, Li Y, Belgrader P, Price AD, Lowe AJ, Marks P, Vurens GM, Hardenbol P, Montesclaros L, Luo M, Greenfield L, Wong A, Birch DE, Short SW, Bjornson KP, Patel P, Hopmans ES, Wood C, Kaur S, Lockwood GK, Stafford D, Delaney JP, Wu I, Ordonez HS, Grimes SM, Greer S, Lee JY, Belhocine K, Giorda KM, Heaton WH, McDermott GP, Bent ZW, Meschi F, Kondov NO, Wilson R, Bernate JA, Gauby S, Kindwall A, Bermejo C, Fehr AN, Chan A, Saxonov S, Ness KD, Hindson BJ, Ji HP. Haplotyping germline and cancer genomes with high-throughput linked-read sequencing. Nat Biotechnol 2016;34:303-11. [PMID: 26829319 DOI: 10.1038/nbt.3432] [Citation(s) in RCA: 438] [Impact Index Per Article: 54.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2015] [Accepted: 11/12/2015] [Indexed: 01/13/2023]

For:	Zheng GX, Lau BT, Schnall-Levin M, Jarosz M, Bell JM, Hindson CM, Kyriazopoulou-Panagiotopoulou S, Masquelier DA, Merrill L, Terry JM, Mudivarti PA, Wyatt PW, Bharadwaj R, Makarewicz AJ, Li Y, Belgrader P, Price AD, Lowe AJ, Marks P, Vurens GM, Hardenbol P, Montesclaros L, Luo M, Greenfield L, Wong A, Birch DE, Short SW, Bjornson KP, Patel P, Hopmans ES, Wood C, Kaur S, Lockwood GK, Stafford D, Delaney JP, Wu I, Ordonez HS, Grimes SM, Greer S, Lee JY, Belhocine K, Giorda KM, Heaton WH, McDermott GP, Bent ZW, Meschi F, Kondov NO, Wilson R, Bernate JA, Gauby S, Kindwall A, Bermejo C, Fehr AN, Chan A, Saxonov S, Ness KD, Hindson BJ, Ji HP. Haplotyping germline and cancer genomes with high-throughput linked-read sequencing. Nat Biotechnol 2016;34:303-11. [PMID: 26829319 DOI: 10.1038/nbt.3432] [Citation(s) in RCA: 438] [Impact Index Per Article: 54.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2015] [Accepted: 11/12/2015] [Indexed: 01/13/2023]

Number

Cited by Other Article(s)

101

Espejo Valle-Inclan J, Besselink NJ, de Bruijn E, Cameron DL, Ebler J, Kutzera J, van Lieshout S, Marschall T, Nelen M, Priestley P, Renkens I, Roemer MG, van Roosmalen MJ, Wenger AM, Ylstra B, Fijneman RJ, Kloosterman WP, Cuppen E. A multi-platform reference for somatic structural variation detection. CELL GENOMICS 2022;2:100139. [PMID: 36778136 PMCID: PMC9903816 DOI: 10.1016/j.xgen.2022.100139] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Revised: 05/06/2021] [Accepted: 05/06/2022] [Indexed: 10/18/2022]

Affiliation(s)

Jose Espejo Valle-Inclan Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Nicolle J.M. Besselink Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Ewart de Bruijn Hartwig Medical Foundation, Amsterdam, the Netherlands
Daniel L. Cameron Hartwig Medical Foundation, Amsterdam, the Netherlands,3Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC, Australia
Jana Ebler Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Joachim Kutzera Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Stef van Lieshout Hartwig Medical Foundation, Amsterdam, the Netherlands
Tobias Marschall Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Marcel Nelen Department of Human Genetics, Radboud UMC, Nijmegen, the Netherlands
Peter Priestley Hartwig Medical Foundation, Amsterdam, the Netherlands
Ivo Renkens Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Margaretha G.M. Roemer Department of Pathology, Amsterdam UMC, Vrije Universiteit Amsterdam, Cancer Center Amsterdam, Amsterdam, the Netherlands
Markus J. van Roosmalen Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Aaron M. Wenger Pacific Biosciences, Menlo Park, CA, USA
Bauke Ylstra Department of Pathology, Amsterdam UMC, Vrije Universiteit Amsterdam, Cancer Center Amsterdam, Amsterdam, the Netherlands
Remond J.A. Fijneman Department of Pathology, Netherlands Cancer Institute, Amsterdam, the Netherlands
Wigard P. Kloosterman Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands,∗Corresponding author
Edwin Cuppen Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands,2Hartwig Medical Foundation, Amsterdam, the Netherlands,∗∗Corresponding author

Collapse

102

Linked-read whole-genome sequencing resolves common and private structural variants in multiple myeloma. Blood Adv 2022;6:5009-5023. [PMID: 35675515 PMCID: PMC9631623 DOI: 10.1182/bloodadvances.2021006720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 05/31/2022] [Indexed: 01/18/2023] Open

Abstract

Linked-read WGS can be performed without DNA purification and allows for resolution of the diverse structural variants found in MM.

Linked-read WGS can, as a standalone assay, provide comprehensive genetics in myeloma and other diseases with complex genomes.

Multiple myeloma (MM) is an incurable and aggressive plasma cell malignancy characterized by a complex karyotype with multiple structural variants (SVs) and copy-number variations (CNVs). Linked-read whole-genome sequencing (lrWGS) allows for refined detection and reconstruction of SVs by providing long-range genetic information from standard short-read sequencing. This makes lrWGS an attractive solution for capturing the full genomic complexity of MM. Here we show that high-quality lrWGS data can be generated from low numbers of cells subjected to fluorescence-activated cell sorting (FACS) without DNA purification. Using this protocol, we analyzed MM cells after FACS from 37 patients with MM using lrWGS. We found high concordance between lrWGS and fluorescence in situ hybridization (FISH) for the detection of recurrent translocations and CNVs. Outside of the regions investigated by FISH, we identified >150 additional SVs and CNVs across the cohort. Analysis of the lrWGS data allowed for resolution of the structure of diverse SVs affecting the MYC and t(11;14) loci, causing the duplication of genes and gene regulatory elements. In addition, we identified private SVs causing the dysregulation of genes recurrently involved in translocations with the IGH locus and show that these can alter the molecular classification of MM. Overall, we conclude that lrWGS allows for the detection of aberrations critical for MM prognostics and provides a feasible route for providing comprehensive genetics. Implementing lrWGS could provide more accurate clinical prognostics, facilitate genomic medicine initiatives, and greatly improve the stratification of patients included in clinical trials.

Collapse

103

Chiu R, Rajan-Babu IS, Birol I, Friedman JM. Linked-read sequencing for detecting short tandem repeat expansions. Sci Rep 2022;12:9352. [PMID: 35672336 PMCID: PMC9174224 DOI: 10.1038/s41598-022-13024-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 05/19/2022] [Indexed: 11/09/2022] Open

104

Smith SE, Huang W, Tiamani K, Unterer M, Khan Mirzaei M, Deng L. Emerging technologies in the study of the virome. Curr Opin Virol 2022;54:101231. [DOI: 10.1016/j.coviro.2022.101231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 04/16/2022] [Accepted: 04/19/2022] [Indexed: 11/03/2022]

105

Bhat GR, Sethi I, Rah B, Kumar R, Afroze D. Innovative in Silico Approaches for Characterization of Genes and Proteins. Front Genet 2022;13:865182. [PMID: 35664302 PMCID: PMC9159363 DOI: 10.3389/fgene.2022.865182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Accepted: 04/11/2022] [Indexed: 11/13/2022] Open

Abstract

Bioinformatics is an amalgamation of biology, mathematics and computer science. It is a science which gathers the information from biology in terms of molecules and applies the informatic techniques to the gathered information for understanding and organizing the data in a useful manner. With the help of bioinformatics, the experimental data generated is stored in several databases available online like nucleotide database, protein databases, GENBANK and others. The data stored in these databases is used as reference for experimental evaluation and validation. Till now several online tools have been developed to analyze the genomic, transcriptomic, proteomics, epigenomics and metabolomics data. Some of them include Human Splicing Finder (HSF), Exonic Splicing Enhancer Mutation taster, and others. A number of SNPs are observed in the non-coding, intronic regions and play a role in the regulation of genes, which may or may not directly impose an effect on the protein expression. Many mutations are thought to influence the splicing mechanism by affecting the existing splice sites or creating a new sites. To predict the effect of mutation (SNP) on splicing mechanism/signal, HSF was developed. Thus, the tool is helpful in predicting the effect of mutations on splicing signals and can provide data even for better understanding of the intronic mutations that can be further validated experimentally. Additionally, rapid advancement in proteomics have steered researchers to organize the study of protein structure, function, relationships, and dynamics in space and time. Thus the effective integration of all of these technological interventions will eventually lead to steering up of next-generation systems biology, which will provide valuable biological insights in the field of research, diagnostic, therapeutic and development of personalized medicine.

Collapse

106

Gao Y, Ma L, Liu GE. Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods. Genes (Basel) 2022;13:828. [PMID: 35627213 PMCID: PMC9142105 DOI: 10.3390/genes13050828] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 05/01/2022] [Accepted: 05/04/2022] [Indexed: 02/01/2023] Open

107

Deng S, Feng Y, Pauklin S. 3D chromatin architecture and transcription regulation in cancer. J Hematol Oncol 2022;15:49. [PMID: 35509102 PMCID: PMC9069733 DOI: 10.1186/s13045-022-01271-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 04/21/2022] [Indexed: 12/18/2022] Open

108

Shearman JR, Naktang C, Sonthirod C, Kongkachana W, U-Thoomporn S, Jomchai N, Maknual C, Yamprasai S, Promchoo W, Ruang-Areerate P, Pootakham W, Tangphatsornruang S. Assembly of a hybrid mangrove, Bruguiera hainesii, and its two ancestral contributors, Bruguiera cylindrica and Bruguiera gymnorhiza. Genomics 2022;114:110382. [PMID: 35526741 DOI: 10.1016/j.ygeno.2022.110382] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 04/19/2022] [Accepted: 05/02/2022] [Indexed: 01/14/2023]

Affiliation(s)

Jeremy R Shearman National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Chaiwat Naktang National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Chutima Sonthirod National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Wasitthee Kongkachana National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Sonicha U-Thoomporn National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Nukoon Jomchai National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Chatree Maknual Department of Marine and Coastal Resources, 120 The Government Complex, Chaengwatthana Rd., Thung Song Hong, Bangkok 10210, Thailand
Suchart Yamprasai Department of Marine and Coastal Resources, 120 The Government Complex, Chaengwatthana Rd., Thung Song Hong, Bangkok 10210, Thailand
Waratthaya Promchoo Department of Marine and Coastal Resources, 120 The Government Complex, Chaengwatthana Rd., Thung Song Hong, Bangkok 10210, Thailand
Panthita Ruang-Areerate National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Wirulda Pootakham National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand
Sithichoke Tangphatsornruang National Omics Center, National Science and Technology Development Agency, 111 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani 12120, Thailand.

Collapse

109

Chen J, Zhong J, He X, Li X, Ni P, Safner T, Šprem N, Han J. The de novo assembly of a European wild boar genome revealed unique patterns of chromosomal structural variations and segmental duplications. Anim Genet 2022;53:281-292. [PMID: 35238061 PMCID: PMC9314987 DOI: 10.1111/age.13181] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 02/12/2022] [Accepted: 02/12/2022] [Indexed: 02/05/2023]

Abstract

The rapid progress of sequencing technology has greatly facilitated the de novo genome assembly of pig breeds. However, the assembly of the wild boar genome is still lacking, hampering our understanding of chromosomal and genomic evolution during domestication from wild boars into domestic pigs. Here, we sequenced and de novo assembled a European wild boar genome (ASM2165605v1) using the long‐range information provided by 10× Linked‐Reads sequencing. We achieved a high‐quality assembly with contig N50 of 26.09 Mb. Additionally, 1.64% of the contigs (222) with lengths from 107.65 kb to 75.36 Mb covered 90.3% of the total genome size of ASM2165605v1 (~2.5 Gb). Mapping analysis revealed that the contigs can fill 24.73% (93/376) of the gaps present in the orthologous regions of the updated pig reference genome (Sscrofa11.1). We further improved the contigs into chromosome level with a reference‐assistant scaffolding method. Using the ‘assembly‐to‐assembly’ approach, we identified intra‐chromosomal large structural variations (SVs, length >1 kb) between ASM2165605v1 and Sscrofa11.1 assemblies. Interestingly, we found that the number of SV events on the X chromosome deviated significantly from the linear models fitting autosomes (R² > 0.64, p < 0.001). Specifically, deletions and insertions were deficient on the X chromosome by 66.14 and 58.41% respectively, whereas duplications and inversions were excessive on the X chromosome by 71.96 and 107.61% respectively. We further used the large segmental duplications (SDs, >1 kb) events as a proxy to understand the large‐scale inter‐chromosomal evolution, by resolving parental‐derived relationships for SD pairs. We revealed a significant excess of SD movements from the X chromosome to autosomes (p < 0.001), consistent with the expectation of meiotic sex chromosome inactivation. Enrichment analyses indicated that the genes within derived SD copies on autosomes were significantly related to biological processes involving nervous system, lipid biosynthesis and sperm motility (p < 0.01). Together, our analyses of the de novo assembly of ASM2165605v1 provides insight into the SVs between European wild boar and domestic pig, in addition to the ongoing process of meiotic sex chromosome inactivation in driving inter‐chromosomal interaction between the sex chromosome and autosomes.

Collapse

110

Leng Z, Li L, Zhou X, Dong G, Chen S, Shang G, Kou H, Yang B, Liu H. Novel Insights into the Stemness and Immune Privilege of Mesenchymal Stem Cells from Human Wharton Jelly by Single-Cell RNA Sequencing. Med Sci Monit 2022;28:e934660. [PMID: 35153292 PMCID: PMC8855628 DOI: 10.12659/msm.934660] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 10/24/2021] [Indexed: 11/23/2022] Open

Abstract

BACKGROUND Fundamental and clinical interest in mesenchymal stem cells (MSCs) has risen dramatically over the past 3 decades. The immunomodulatory and differentiation abilities are the main mechanisms in vitro and in vivo. However, increasing evidence casts doubt on the stemness and immunogenicity of MSCs. MATERIAL AND METHODS We conducted a high-throughput 10x RNA sequencing and Smart-seq2 scRNA-seq analysis to reveal gene expression of Wharton jelly MSCs (WJ-MSCs) at a single-cell level. Multipotent differentiation, subpopulations, marker genes, human leucocyte antigen (HLA) gene expression, and cell cluster trajectory analysis were evaluated. RESULTS The WJ-MSCs had considerable heterogeneity between cells in terms of gene expression. They highly, partially, and hardly expressed genes related to mesodermal differentiation, endodermal differentiation, and ectodermal differentiation, respectively. Some cells seem to be bipotent or unipotent stem cells. Further, Monocle and cell cluster trajectory analysis demonstrated that 1 of the 3 divided clusters performed as stem cells, accounting for 12.6% of the population. The marker genes for a stem cell cluster were CRIM1, GLS, PLOD2, NEXN, ACTR2, FN1, MBNL1, LMOD1, COL3A1, NCL, SEC62, EPRS, COL5A2, COL8A1, and VCAN. In addition, the MSCs also highly, partially, and hardly expressed HLA-I antigen genes, HLA-II genes, and the HLA-G gene, respectively, indicating that MSCs probably have immunogenicity. A Kyoto Encyclopedia of Genes and Genomes pathway analysis of the 3 clusters demonstrated that they were mainly connected with viral infectious diseases, cancer, and endocrine and metabolic disorders. The most expressed transcription factors were zf-C2H2, HMG/HMGY, and Homeobox. CONCLUSIONS We found that only a subpopulation of WJ-MSCs are real stem cells and WJ-MSCs probably do not have immune privilege.

Collapse

111

Mueller JC, Botero-Delgadillo E, Espíndola-Hernández P, Gilsenan C, Ewels P, Gruselius J, Kempenaers B. Local selection signals in the genome of Blue tits emphasize regulatory and neuronal evolution. Mol Ecol 2022;31:1504-1514. [PMID: 34995389 DOI: 10.1111/mec.16345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 11/18/2021] [Accepted: 12/15/2021] [Indexed: 11/30/2022]

112

Yuan Y. Applications of Optical Mapping for Plant Genome Assembly and Structural Variation Detection. Methods Mol Biol 2022;2443:245-257. [PMID: 35037210 DOI: 10.1007/978-1-0716-2067-0_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

113

Visendi P. De Novo Assembly of Linked Reads Using Supernova 2.0. Methods Mol Biol 2022;2443:233-243. [PMID: 35037209 DOI: 10.1007/978-1-0716-2067-0_12] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

114

Tran TM, Kim SC, Modavi C, Abate AR. Robotic automation of droplet microfluidics. BIOMICROFLUIDICS 2022;16:014102. [PMID: 35145570 PMCID: PMC8816516 DOI: 10.1063/5.0064265] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 11/23/2021] [Indexed: 06/14/2023]

115

Athanasopoulou K, Boti MA, Adamopoulos PG, Skourou PC, Scorilas A. Third-Generation Sequencing: The Spearhead towards the Radical Transformation of Modern Genomics. Life (Basel) 2021;12:life12010030. [PMID: 35054423 PMCID: PMC8780579 DOI: 10.3390/life12010030] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/20/2021] [Accepted: 12/23/2021] [Indexed: 12/14/2022] Open

116

Prunier J, Carrier A, Gilbert I, Poisson W, Albert V, Taillon J, Bourret V, Côté SD, Droit A, Robert C. CNVs with adaptive potential in Rangifer tarandus: genome architecture and new annotated assembly. Life Sci Alliance 2021;5:5/3/e202101207. [PMID: 34911809 PMCID: PMC8711850 DOI: 10.26508/lsa.202101207] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 11/29/2021] [Accepted: 11/29/2021] [Indexed: 01/13/2023] Open

117

Xiong X, Kelkar YD, Geden CJ, Zhang C, Wang Y, Jongepier E, Martinson EO, Verhulst EC, Gadau J, Werren JH, Wang X. Long-Read Assembly and Annotation of the Parasitoid Wasp Muscidifurax raptorellus, a Biological Control Agent for Filth Flies. Front Genet 2021;12:748135. [PMID: 34868218 PMCID: PMC8633841 DOI: 10.3389/fgene.2021.748135] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 10/04/2021] [Indexed: 12/30/2022] Open

118

Tarabichi M, Demeulemeester J, Verfaillie A, Flanagan AM, Van Loo P, Konopka T. A pan-cancer landscape of somatic mutations in non-unique regions of the human genome. Nat Biotechnol 2021;39:1589-1596. [PMID: 34282324 PMCID: PMC7612106 DOI: 10.1038/s41587-021-00971-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 06/02/2021] [Indexed: 12/27/2022]

119

Miller DB, Piccolo SR. trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios. BMC Bioinformatics 2021;22:559. [PMID: 34809557 PMCID: PMC8607709 DOI: 10.1186/s12859-021-04470-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 11/08/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

When analyzing DNA sequence data of an individual, knowing which nucleotide was inherited from each parent can be beneficial when trying to identify certain types of DNA variants. Mendelian inheritance logic can be used to accurately phase (haplotype) the majority (67-83%) of an individual's heterozygous nucleotide positions when genotypes are available for both parents (trio). However, when all members of a trio are heterozygous at a position, Mendelian inheritance logic cannot be used to phase. For such positions, a computational phasing algorithm can be used. Existing phasing algorithms use a haplotype reference panel, sequencing reads, and/or parental genotypes to phase an individual; however, they are limited in that they can only phase certain types of variants, require a specific genotype build, require large amounts of storage capacity, and/or require long run times. We created trioPhaser to address these challenges.

RESULTS

trioPhaser uses gVCF files from an individual and their parents as initial input, and then outputs a phased VCF file. Input trio data are first phased using Mendelian inheritance logic. Then, the positions that cannot be phased using inheritance information alone are phased by the SHAPEIT4 phasing algorithm. Using whole-genome sequencing data of 52 trios, we show that trioPhaser, on average, increases the total number of phased positions by 21.0% and 10.5%, respectively, when compared to the number of positions that SHAPEIT4 or Mendelian inheritance logic can phase when either is used alone. In addition, we show that the accuracy of the phased calls output by trioPhaser are similar to linked-read and read-backed phasing.

CONCLUSION

trioPhaser is a containerized software tool that uses both Mendelian inheritance logic and SHAPEIT4 to phase trios when gVCF files are available. By implementing both phasing methods, more variant positions are phased compared to what either method is able to phase alone.

Collapse

120

Wu C, Yin Y, Zhu L, Zhang Y, Li YZ. Metagenomic sequencing-driven multidisciplinary approaches to shed light on the untapped microbial natural products. Drug Discov Today 2021;27:730-742. [PMID: 34775105 DOI: 10.1016/j.drudis.2021.11.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Revised: 10/07/2021] [Accepted: 11/08/2021] [Indexed: 11/17/2022]

121

Yu Y, Chen L, Miao X, Li SC. SpecHap: a diploid phasing algorithm based on spectral graph theory. Nucleic Acids Res 2021;49:e114. [PMID: 34403470 PMCID: PMC8565328 DOI: 10.1093/nar/gkab709] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 07/25/2021] [Accepted: 08/02/2021] [Indexed: 11/30/2022] Open

122

Jia W, Xu C, Li SC. Resolving complex structures at oncovirus integration loci with conjugate graph. Brief Bioinform 2021;22:6359003. [PMID: 34463709 DOI: 10.1093/bib/bbab359] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 08/10/2021] [Accepted: 08/12/2021] [Indexed: 01/10/2023] Open

123

Abbasi A, Alexandrov LB. Significance and limitations of the use of next-generation sequencing technologies for detecting mutational signatures. DNA Repair (Amst) 2021;107:103200. [PMID: 34411908 PMCID: PMC9478565 DOI: 10.1016/j.dnarep.2021.103200] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 07/30/2021] [Accepted: 08/03/2021] [Indexed: 12/13/2022]

124

Bodrug-Schepers A, Stralis-Pavese N, Buerstmayr H, Dohm JC, Himmelbauer H. Quinoa genome assembly employing genomic variation for guided scaffolding. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021;134:3577-3594. [PMID: 34365519 PMCID: PMC8519820 DOI: 10.1007/s00122-021-03915-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 07/06/2021] [Indexed: 06/13/2023]

Abstract

We propose to use the natural variation between individuals of a population for genome assembly scaffolding. In today's genome projects, multiple accessions get sequenced, leading to variant catalogs. Using such information to improve genome assemblies is attractive both cost-wise as well as scientifically, because the value of an assembly increases with its contiguity. We conclude that haplotype information is a valuable resource to group and order contigs toward the generation of pseudomolecules. Quinoa (Chenopodium quinoa) has been under cultivation in Latin America for more than 7500 years. Recently, quinoa has gained increasing attention due to its stress resistance and its nutritional value. We generated a novel quinoa genome assembly for the Bolivian accession CHEN125 using PacBio long-read sequencing data (assembly size 1.32 Gbp, initial N50 size 608 kbp). Next, we re-sequenced 50 quinoa accessions from Peru and Bolivia. This set of accessions differed at 4.4 million single-nucleotide variant (SNV) positions compared to CHEN125 (1.4 million SNV positions on average per accession). We show how to exploit variation in accessions that are distantly related to establish a genome-wide ordered set of contigs for guided scaffolding of a reference assembly. The method is based on detecting shared haplotypes and their expected continuity throughout the genome (i.e., the effect of linkage disequilibrium), as an extension of what is expected in mapping populations where only a few haplotypes are present. We test the approach using Arabidopsis thaliana data from different populations. After applying the method on our CHEN125 quinoa assembly we validated the results with mate-pairs, genetic markers, and another quinoa assembly originating from a Chilean cultivar. We show consistency between these information sources and the haplotype-based relations as determined by us and obtain an improved assembly with an N50 size of 1079 kbp and ordered contig groups of up to 39.7 Mbp. We conclude that haplotype information in distantly related individuals of the same species is a valuable resource to group and order contigs according to their adjacency in the genome toward the generation of pseudomolecules.

Collapse

125

Hill BM, Bisht K, Atkins GR, Gomez AA, Rumbaugh KP, Wakeman CA, Brown AMV. Lysis-Hi-C as a method to study polymicrobial communities and eDNA. Mol Ecol Resour 2021;22:1029-1042. [PMID: 34669257 PMCID: PMC9215119 DOI: 10.1111/1755-0998.13535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 10/06/2021] [Accepted: 10/11/2021] [Indexed: 11/30/2022]

Abstract

Microbes interact in natural communities in a spatially structured manner, particularly in biofilms and polymicrobial infections. While next generation sequencing approaches provide powerful insights into diversity, metabolic capacity, and mutational profiles of these communities, they generally fail to recover in situ spatial proximity between distinct genotypes in the interactome. Hi‐C is a promising method that has assisted in analysing complex microbiomes, by creating chromatin cross‐links in cells, that aid in identifying adjacent DNA, to improve de novo assembly. This study explored a modified Hi‐C approach involving an initial lysis phase prior to DNA cross‐linking, to test whether adjacent cell chromatin can be cross‐linked, anticipating that this could provide a new avenue for study of spatial‐mutational dynamics in structured microbial communities. An artificial polymicrobial mixture of Pseudomonas aeruginosa, Staphylococcus aureus, and Escherichia coli was lysed for 1–18 h, then prepared for Hi‐C. A murine biofilm infection model was treated with sonication, mechanical lysis, or chemical lysis before Hi‐C. Bioinformatic analyses of resulting Hi‐C interspecies chromatin links showed that while microbial species differed from one another, generally lysis significantly increased links between species and increased the distance of Hi‐C links within species, while also increasing novel plasmid‐chromosome links. The success of this modified lysis‐Hi‐C protocol in creating extracellular DNA links is a promising first step toward a new lysis‐Hi‐C based method to recover genotypic microgeography in polymicrobial communities, with potential future applications in diseases with localized resistance, such as cystic fibrosis lung infections and chronic diabetic ulcers.

Collapse

126

Dias GB, Aldossary AM, El-Shafie HAF, Alhoshani FM, Al-Fageeh MB, Bergman CM, Manee MM. Complete mitochondrial genome of the longhorn date palm stem borer Jebusaea hammerschmidtii (Reiche, 1878). Mitochondrial DNA B Resour 2021;6:3214-3216. [PMID: 34676292 PMCID: PMC8525966 DOI: 10.1080/23802359.2021.1989334] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 08/05/2021] [Indexed: 12/02/2022] Open

127

Miller DB, Robison R, Piccolo SR. Toward a methodology for evaluating DNA variants in nuclear families. PLoS One 2021;16:e0258375. [PMID: 34624066 PMCID: PMC8500447 DOI: 10.1371/journal.pone.0258375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 09/27/2021] [Indexed: 11/22/2022] Open

Abstract

The genetic underpinnings of most pediatric-cancer cases are unknown. Population-based studies use large sample sizes but have accounted for only a small proportion of the estimated heritability of pediatric cancers. Pedigree-based studies are infeasible for most human populations. One alternative is to collect genetic data from a single nuclear family and use inheritance patterns within the family to filter candidate variants. This approach can be applied to common and rare variants, including those that are private to a given family or to an affected individual. We evaluated this approach using genetic data from three nuclear families with 5, 4, and 7 children, respectively. Only one child in each nuclear family had been diagnosed with cancer, and neither parent had been affected. Diagnoses for the affected children were benign low-grade astrocytoma, Wilms tumor (stage 2), and Burkitt's lymphoma, respectively. We used whole-genome sequencing to profile normal cells from each family member and a linked-read technology for genomic phasing. For initial variant filtering, we used global minor allele frequencies, deleteriousness scores, and functional-impact annotations. Next, we used genetic variation in the unaffected siblings as a guide to filter the remaining variants. As a way to evaluate our ability to detect variant(s) that may be relevant to disease status, the corresponding author blinded the primary author to affected status; the primary author then assigned a risk score to each child. Based on this evidence, the primary author predicted which child had been affected in each family. The primary author's prediction was correct for the child who had been diagnosed with a Wilms tumor; the child with Burkitt's lymphoma had the second-highest risk score among the seven children in that family. This study demonstrates a methodology for filtering and evaluating candidate genomic variants and genes within nuclear families that may merit further exploration.

Collapse

128

Arias CF, Dikow RB, McMillan WO, De León LF. De Novo Genome Assembly of the Electric Fish Brachyhypopomus occidentalis (Hypopomidae, Gymnotiformes). Genome Biol Evol 2021;13:6377337. [PMID: 34581791 PMCID: PMC8536545 DOI: 10.1093/gbe/evab223] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/21/2021] [Indexed: 11/20/2022] Open

129

Westfall AK, Telemeco RS, Grizante MB, Waits DS, Clark AD, Simpson DY, Klabacka RL, Sullivan AP, Perry GH, Sears MW, Cox CL, Cox RM, Gifford ME, John-Alder HB, Langkilde T, Angilletta MJ, Leaché AD, Tollis M, Kusumi K, Schwartz TS. A chromosome-level genome assembly for the eastern fence lizard (Sceloporus undulatus), a reptile model for physiological and evolutionary ecology. Gigascience 2021;10:6380105. [PMID: 34599334 PMCID: PMC8486681 DOI: 10.1093/gigascience/giab066] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Revised: 04/16/2021] [Accepted: 09/07/2021] [Indexed: 12/15/2022] Open

Affiliation(s)

Aundrea K Westfall Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
Rory S Telemeco Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA.,Department of Biology, California State University Fresno, Fresno, CA 93740, USA
Mariana B Grizante School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
Damien S Waits Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
Amanda D Clark Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
Dasia Y Simpson Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
Randy L Klabacka Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
Alexis P Sullivan Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
George H Perry Department of Biology, Pennsylvania State University, University Park, PA 16802, USA.,Department of Anthropology, Pennsylvania State University, University Park, PA 16802, USA.,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
Michael W Sears Department of Biological Sciences, Clemson University, Clemson, SC 29634, USA
Christian L Cox Department of Biology, Georgia Southern University, Statesboro, GA 30460, USA.,Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
Robert M Cox Department of Biology, University of Virginia, Charlottesville, VA 22904, USA
Matthew E Gifford Department of Biology, University of Central Arkansas, Conway, AR 72035, USA
Henry B John-Alder Department of Ecology, Evolution, and Natural Resources, Rutgers University, New Brunswick, NJ 08901, USA
Tracy Langkilde Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
Michael J Angilletta School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
Adam D Leaché Department of Biology, University of Washington, Seattle, WA 98195, USA.,Burke Museum of Natural History and Culture, University of Washington, Seattle, WA 98195, USA
Marc Tollis School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA.,School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ 86011, USA
Kenro Kusumi School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
Tonia S Schwartz Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA

Collapse

130

Morisse P, Lemaitre C, Legeai F. LRez: a C++ API and toolkit for analyzing and managing Linked-Reads data. BIOINFORMATICS ADVANCES 2021;1:vbab022. [PMID: 36700107 PMCID: PMC9710615 DOI: 10.1093/bioadv/vbab022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 09/09/2021] [Accepted: 09/20/2021] [Indexed: 01/28/2023]

131

Freire R, Weisweiler M, Guerreiro R, Baig N, Hüttel B, Obeng-Hinneh E, Renner J, Hartje S, Muders K, Truberg B, Rosen A, Prigge V, Bruckmüller J, Lübeck J, Stich B. Chromosome-scale reference genome assembly of a diploid potato clone derived from an elite variety. G3-GENES GENOMES GENETICS 2021;11:6371871. [PMID: 34534288 PMCID: PMC8664475 DOI: 10.1093/g3journal/jkab330] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 09/08/2021] [Indexed: 01/27/2023]

132

Sène MA, Kiesslich S, Djambazian H, Ragoussis J, Xia Y, Kamen AA. Haplotype-resolved de novo assembly of the Vero cell line genome. NPJ Vaccines 2021;6:106. [PMID: 34417462 PMCID: PMC8379168 DOI: 10.1038/s41541-021-00358-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 07/12/2021] [Indexed: 01/13/2023] Open

133

Gao X, Mo W, Shi J, Song N, Liang P, Chen J, Shi Y, Guo W, Li X, Yang X, Xin B, Zhao H, Song W, Lai J. HITAC-seq enables high-throughput cost-effective sequencing of plasmids and DNA fragments with identity. J Genet Genomics 2021;48:671-680. [PMID: 34417123 DOI: 10.1016/j.jgg.2021.05.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 05/03/2021] [Accepted: 05/13/2021] [Indexed: 01/13/2023]

Affiliation(s)

Xiang Gao State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Weipeng Mo State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Junpeng Shi State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Ning Song State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Pei Liang Department of Microbiology and Immunology, College of Biological Sciences, China Agricultural University, Beijing 100193, PR China
Jian Chen State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Yiting Shi State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, PR China
Weilong Guo Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, PR China
Xinchen Li State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Xiaohong Yang State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China; Center for Crop Functional Genomics and Molecular Breeding, China Agricultural University, Beijing 100193, PR China
Beibei Xin State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Haiming Zhao State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Weibin Song State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China
Jinsheng Lai State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, PR China; Center for Crop Functional Genomics and Molecular Breeding, China Agricultural University, Beijing 100193, PR China.

Collapse

134

Kwak SH, Powe CE, Jang SS, Callahan MJ, Bernstein SN, Lee SM, Kang S, Park KS, Jang HC, Florez JC, Kim JI, Chae JH. Sequencing Cell-free Fetal DNA in Pregnant Women With GCK-MODY. J Clin Endocrinol Metab 2021;106:2678-2689. [PMID: 34406393 PMCID: PMC8660061 DOI: 10.1210/clinem/dgab265] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Indexed: 11/19/2022]

Affiliation(s)

Soo Heon Kwak Department of Internal Medicine, Seoul National University Hospital, Seoul 03080, Korea Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Camille E Powe Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA Diabetes Unit, Endocrine Division, Massachusetts General Hospital, Boston, MA 02114-2696, USA Harvard Medical School, Boston, MA 02115, USA
Se Song Jang Department of Pediatrics, Seoul National University Children’s Hospital, Seoul 03080, Korea
Michael J Callahan Diabetes Unit, Endocrine Division, Massachusetts General Hospital, Boston, MA 02114-2696, USA
Sarah N Bernstein Harvard Medical School, Boston, MA 02115, USA Department of Obstetrics and Gynecology, Division of Maternal Fetal Medicine, Massachusetts General Hospital, Boston, MA 02114-2696, USA
Seung Mi Lee Department of Obstetrics and Gynecology, Seoul National University Hospital, Seoul 03080, Korea
Sunyoung Kang Department of Internal Medicine, Seoul National University Hospital, Seoul 03080, Korea Department of Internal Medicine, Seoul National University College of Medicine, Seoul 03080, Korea
Kyong Soo Park Department of Internal Medicine, Seoul National University Hospital, Seoul 03080, Korea Department of Internal Medicine, Seoul National University College of Medicine, Seoul 03080, Korea Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 03080, Korea
Hak C Jang Department of Internal Medicine, Seoul National University College of Medicine, Seoul 03080, Korea Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam 13620, Korea
Jose C Florez Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA Diabetes Unit, Endocrine Division, Massachusetts General Hospital, Boston, MA 02114-2696, USA Harvard Medical School, Boston, MA 02115, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114-2696, USA
Jong-Il Kim Department of Biochemistry and Molecular Biology, Seoul National University College of Medicine, Seoul 03080, Korea Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul 03080, Korea
Jong Hee Chae Department of Pediatrics, Seoul National University Children’s Hospital, Seoul 03080, Korea Department of Genomic Medicine, Seoul National University Hospital, Seoul 03080, Korea

Collapse

135

Hiltunen M, Ryberg M, Johannesson H. ARBitR: an overlap-aware genome assembly scaffolder for linked reads. Bioinformatics 2021;37:2203-2205. [PMID: 33216122 PMCID: PMC8352505 DOI: 10.1093/bioinformatics/btaa975] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 10/22/2020] [Accepted: 11/10/2020] [Indexed: 12/02/2022] Open

136

Tedersoo L, Albertsen M, Anslan S, Callahan B. Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology. Appl Environ Microbiol 2021;87:e0062621. [PMID: 34132589 PMCID: PMC8357291 DOI: 10.1128/aem.00626-21] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

137

Sakamoto Y, Zaha S, Suzuki Y, Seki M, Suzuki A. Application of long-read sequencing to the detection of structural variants in human cancer genomes. Comput Struct Biotechnol J 2021;19:4207-4216. [PMID: 34527193 PMCID: PMC8350331 DOI: 10.1016/j.csbj.2021.07.030] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 07/20/2021] [Accepted: 07/25/2021] [Indexed: 01/02/2023] Open

138

Musunuri R, Arora K, Corvelo A, Shah M, Shelton J, Zody MC, Narzisi G. Somatic variant analysis of linked-reads sequencing data with Lancet. Bioinformatics 2021;37:1918-1919. [PMID: 33241313 DOI: 10.1093/bioinformatics/btaa888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 09/03/2020] [Accepted: 10/02/2020] [Indexed: 11/14/2022] Open

139

Xu Z, Dixon JR. Genome reconstruction and haplotype phasing using chromosome conformation capture methodologies. Brief Funct Genomics 2021;19:139-150. [PMID: 31875884 DOI: 10.1093/bfgp/elz026] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 09/06/2019] [Accepted: 09/15/2019] [Indexed: 12/22/2022] Open

140

Tan KT, Kim H, Carrot-Zhang J, Zhang Y, Kim WJ, Kugener G, Wala JA, Howard TP, Chi YY, Beroukhim R, Li H, Ha G, Alper SL, Perlman EJ, Mullen EA, Hahn WC, Meyerson M, Hong AL. Haplotype-resolved germline and somatic alterations in renal medullary carcinomas. Genome Med 2021;13:114. [PMID: 34261517 PMCID: PMC8281718 DOI: 10.1186/s13073-021-00929-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 06/25/2021] [Indexed: 11/10/2022] Open

Affiliation(s)

Kar-Tong Tan Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Hyunji Kim Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Jian Carrot-Zhang Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Yuxiang Zhang Department of Genetics, Harvard Medical School, Boston, MA, USA
Won Jun Kim Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA
Guillaume Kugener Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jeremiah A Wala Department of Medicine, University of California San Francisco, San Francisco, CA, USA
Thomas P Howard Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA
Yueh-Yun Chi Department of Pediatrics, University of Southern California, Los Angeles, CA, USA
Rameen Beroukhim Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA
Heng Li Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA
Gavin Ha Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Seth L Alper Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
Elizabeth J Perlman Department of Pathology, Northwestern University, Chicago, IL, USA
Elizabeth A Mullen Department of Hematology and Oncology, Boston Children's Hospital, Boston, MA, USA Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
William C Hahn Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA. Broad Institute of MIT and Harvard, Cambridge, MA, USA.
Matthew Meyerson Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA. Broad Institute of MIT and Harvard, Cambridge, MA, USA. Department of Genetics, Harvard Medical School, Boston, MA, USA.
Andrew L Hong Department of Pediatrics, Emory University, Atlanta, GA, USA. Aflac Center for Cancer and Blood Disorders, Children's Healthcare of Atlanta, Atlanta, GA, USA.

Collapse

141

Comparative Genomics of Clinical Isolates of the Emerging Tick-Borne Pathogen Neoehrlichia mikurensis. Microorganisms 2021;9:microorganisms9071488. [PMID: 34361922 PMCID: PMC8303192 DOI: 10.3390/microorganisms9071488] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 07/07/2021] [Accepted: 07/08/2021] [Indexed: 11/17/2022] Open

142

Lin B, Hui J, Mao H. Nanopore Technology and Its Applications in Gene Sequencing. BIOSENSORS-BASEL 2021;11:bios11070214. [PMID: 34208844 PMCID: PMC8301755 DOI: 10.3390/bios11070214] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 06/22/2021] [Accepted: 06/25/2021] [Indexed: 12/14/2022]

143

Meier JI, Salazar PA, Kučka M, Davies RW, Dréau A, Aldás I, Box Power O, Nadeau NJ, Bridle JR, Rolian C, Barton NH, McMillan WO, Jiggins CD, Chan YF. Haplotype tagging reveals parallel formation of hybrid races in two butterfly species. Proc Natl Acad Sci U S A 2021;118:e2015005118. [PMID: 34155138 PMCID: PMC8237668 DOI: 10.1073/pnas.2015005118] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

144

Liu YH, Grubbs GL, Zhang L, Fang X, Dill DL, Sidow A, Zhou X. Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads. BIOINFORMATICS ADVANCES 2021;1:vbab007. [PMID: 36700103 PMCID: PMC9710574 DOI: 10.1093/bioadv/vbab007] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Revised: 06/07/2021] [Accepted: 06/14/2021] [Indexed: 01/28/2023]

145

Yang Y, Huang L, Xu C, Qi L, Wu Z, Li J, Chen H, Wu Y, Fu T, Zhu H, Saand MA, Li J, Liu L, Fan H, Zhou H, Qin W. Chromosome-scale genome assembly of areca palm (Areca catechu). Mol Ecol Resour 2021;21:2504-2519. [PMID: 34133844 DOI: 10.1111/1755-0998.13446] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 06/08/2021] [Accepted: 06/11/2021] [Indexed: 11/28/2022]

Affiliation(s)

Yaodong Yang Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Liyun Huang Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Chunyan Xu BGI Genomics, BGI-Shenzhen, Shenzhen, China
Lan Qi Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Zhangyan Wu BGI Genomics, BGI-Shenzhen, Shenzhen, China
Jia Li Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Haixin Chen BGI Genomics, BGI-Shenzhen, Shenzhen, China
Yi Wu Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Tao Fu BGI Genomics, BGI-Shenzhen, Shenzhen, China
Hui Zhu Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Mumtaz Ali Saand Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Jing Li Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Liyun Liu Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Haikou Fan Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Huanqi Zhou Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
Weiquan Qin Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China

Collapse

146

Callahan BJ, Grinevich D, Thakur S, Balamotis MA, Yehezkel TB. Ultra-accurate microbial amplicon sequencing with synthetic long reads. MICROBIOME 2021;9:130. [PMID: 34090540 PMCID: PMC8179091 DOI: 10.1186/s40168-021-01072-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 04/06/2021] [Indexed: 05/08/2023]

Abstract

BACKGROUND

Out of the many pathogenic bacterial species that are known, only a fraction are readily identifiable directly from a complex microbial community using standard next generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge.

METHODS

Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads.

RESULTS

LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, which exceeds the accuracy reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq sequencing maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens.

CONCLUSIONS

The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate and long microbiome sequencing reads using standard short read sequencers will accelerate the building of quality microbial sequence databases and removes a significant hurdle on the path to precision microbial genomics. Video abstract.

Collapse

147

Sun W, Modica S, Dong H, Wolfrum C. Plasticity and heterogeneity of thermogenic adipose tissue. Nat Metab 2021;3:751-761. [PMID: 34158657 DOI: 10.1038/s42255-021-00417-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 05/19/2021] [Indexed: 12/13/2022]

148

Srivastava K, Fratzscher AS, Lan B, Flegel WA. Cataloguing experimentally confirmed 80.7 kb-long ACKR1 haplotypes from the 1000 Genomes Project database. BMC Bioinformatics 2021;22:273. [PMID: 34039276 PMCID: PMC8150616 DOI: 10.1186/s12859-021-04169-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Accepted: 05/04/2021] [Indexed: 12/18/2022] Open

Abstract

Background

Clinically effective and safe genotyping relies on correct reference sequences, often represented by haplotypes. The 1000 Genomes Project recorded individual genotypes across 26 different populations and, using computerized genotype phasing, reported haplotype data. In contrast, we identified long reference sequences by analyzing the homozygous genomic regions in this online database, a concept that has rarely been reported since next generation sequencing data became available.

Study design and methods

Phased genotype data for a 80.6 kb region of chromosome 1 was downloaded for all 2,504 unrelated individuals of the 1000 Genome Project Phase 3 cohort. The data was centered on the ACKR1 gene and bordered by the CADM3 and FCER1A genes. Individuals with heterozygosity at a single site or with complete homozygosity allowed unambiguous assignment of an ACKR1 haplotype. A computer algorithm was developed for extracting these haplotypes from the 1000 Genome Project in an automated fashion. A manual analysis validated the data extracted by the algorithm.

Results

We confirmed 902 ACKR1 haplotypes of varying lengths, the longest at 80,584 nucleotides and shortest at 1,901 nucleotides. The combined length of haplotype sequences comprised 19,895,388 nucleotides with a median of 16,014 nucleotides. Based on our approach, all haplotypes can be considered experimentally confirmed and not affected by the known errors of computerized genotype phasing.

Conclusions

Tracts of homozygosity can provide definitive reference sequences for any gene. They are particularly useful when observed in unrelated individuals of large scale sequence databases. As a proof of principle, we explored the 1000 Genomes Project database for ACKR1 gene data and mined long haplotypes. These haplotypes are useful for high throughput analysis with next generation sequencing. Our approach is scalable, using automated bioinformatics tools, and can be applied to any gene.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-021-04169-6.

Collapse

149

Mukhtar M, Sargazi S, Barani M, Madry H, Rahdar A, Cucchiarini M. Application of Nanotechnology for Sensitive Detection of Low-Abundance Single-Nucleotide Variations in Genomic DNA: A Review. NANOMATERIALS (BASEL, SWITZERLAND) 2021;11:1384. [PMID: 34073904 PMCID: PMC8225127 DOI: 10.3390/nano11061384] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 05/20/2021] [Accepted: 05/21/2021] [Indexed: 01/02/2023]

150

Wu CY, Lau BT, Kim HS, Sathe A, Grimes SM, Ji HP, Zhang NR. Integrative single-cell analysis of allele-specific copy number alterations and chromatin accessibility in cancer. Nat Biotechnol 2021;39:1259-1269. [PMID: 34017141 DOI: 10.1038/s41587-021-00911-w] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Accepted: 04/01/2021] [Indexed: 12/12/2022]