Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhao S, Agafonov O, Azab A, Stokowy T, Hovig E. Accuracy and efficiency of germline variant calling pipelines for human genome data. Sci Rep 2020;10:20222. [PMID: 33214604 PMCID: PMC7678823 DOI: 10.1038/s41598-020-77218-4] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 11/02/2020] [Indexed: 12/30/2022] Open

For:	Zhao S, Agafonov O, Azab A, Stokowy T, Hovig E. Accuracy and efficiency of germline variant calling pipelines for human genome data. Sci Rep 2020;10:20222. [PMID: 33214604 PMCID: PMC7678823 DOI: 10.1038/s41598-020-77218-4] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 11/02/2020] [Indexed: 12/30/2022] Open

Number

Cited by Other Article(s)

Servati M, Vaccaro CN, Diller EE, Pellegrino Da Silva R, Mafra F, Cao S, Stanley KB, Cohen-Gadol AA, Parker JG. Metabolic Insight into Glioma Heterogeneity: Mapping Whole Exome Sequencing to In Vivo Imaging with Stereotactic Localization and Deep Learning. Metabolites 2024;14:337. [PMID: 38921472 PMCID: PMC11205750 DOI: 10.3390/metabo14060337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 06/07/2024] [Accepted: 06/12/2024] [Indexed: 06/27/2024] Open

Abstract

Intratumoral heterogeneity (ITH) complicates the diagnosis and treatment of glioma, partly due to the diverse metabolic profiles driven by underlying genomic alterations. While multiparametric imaging enhances the characterization of ITH by capturing both spatial and functional variations, it falls short in directly assessing the metabolic activities that underpin these phenotypic differences. This gap stems from the challenge of integrating easily accessible, colocated pathology and detailed genomic data with metabolic insights. This study presents a multifaceted approach combining stereotactic biopsy with standard clinical open-craniotomy for sample collection, voxel-wise analysis of MR images, regression-based GAM, and whole-exome sequencing. This work aims to demonstrate the potential of machine learning algorithms to predict variations in cellular and molecular tumor characteristics. This retrospective study enrolled ten treatment-naïve patients with radiologically confirmed glioma. Each patient underwent a multiparametric MR scan (T1W, T1W-CE, T2W, T2W-FLAIR, DWI) prior to surgery. During standard craniotomy, at least 1 stereotactic biopsy was collected from each patient, with screenshots of the sample locations saved for spatial registration to pre-surgical MR data. Whole-exome sequencing was performed on flash-frozen tumor samples, prioritizing the signatures of five glioma-related genes: IDH1, TP53, EGFR, PIK3CA, and NF1. Regression was implemented with a GAM using a univariate shape function for each predictor. Standard receiver operating characteristic (ROC) analyses were used to evaluate detection, with AUC (area under curve) calculated for each gene target and MR contrast combination. Mean AUC for five gene targets and 31 MR contrast combinations was 0.75 ± 0.11; individual AUCs were as high as 0.96 for both IDH1 and TP53 with T2W-FLAIR and ADC, and 0.99 for EGFR with T2W and ADC. These results suggest the possibility of predicting exome-wide mutation events from noninvasive, in vivo imaging by combining stereotactic localization of glioma samples and a semi-parametric deep learning method. The genomic alterations identified, particularly in IDH1, TP53, EGFR, PIK3CA, and NF1, are known to play pivotal roles in metabolic pathways driving glioma heterogeneity. Our methodology, therefore, indirectly sheds light on the metabolic landscape of glioma through the lens of these critical genomic markers, suggesting a complex interplay between tumor genomics and metabolism. This approach holds potential for refining targeted therapy by better addressing the genomic heterogeneity of glioma tumors.

Collapse

Aisagbonhi O, Ghlichloo I, Hong DS, Roma A, Fadare O, Eskander R, Saenz C, Fisch KM, Song W. Comprehensive next-generation sequencing identifies novel putative pathogenic or likely pathogenic germline variants in patients with concurrent tubo-ovarian and endometrial serous and endometrioid carcinomas or precursors. Gynecol Oncol 2024;187:241-248. [PMID: 38833993 DOI: 10.1016/j.ygyno.2024.05.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 05/22/2024] [Accepted: 05/23/2024] [Indexed: 06/06/2024]

Kalleberg J, Rissman J, Schnabel RD. Overcoming Limitations to Deep Learning in Domesticated Animals with TrioTrain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.15.589602. [PMID: 38659907 PMCID: PMC11042298 DOI: 10.1101/2024.04.15.589602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]

Kosugi S, Terao C. Comparative evaluation of SNVs, indels, and structural variations detected with short- and long-read sequencing data. Hum Genome Var 2024;11:18. [PMID: 38632226 PMCID: PMC11024196 DOI: 10.1038/s41439-024-00276-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/12/2024] [Accepted: 03/20/2024] [Indexed: 04/19/2024] Open

Ergun MA, Cinal O, Bakışlı B, Emül AA, Baysan M. COSAP: Comparative Sequencing Analysis Platform. BMC Bioinformatics 2024;25:130. [PMID: 38532317 DOI: 10.1186/s12859-024-05756-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 03/20/2024] [Indexed: 03/28/2024] Open

Abstract

BACKGROUND

Recent improvements in sequencing technologies enabled detailed profiling of genomic features. These technologies mostly rely on short reads which are merged and compared to reference genome for variant identification. These operations should be done with computers due to the size and complexity of the data. The need for analysis software resulted in many programs for mapping, variant calling and annotation steps. Currently, most programs are either expensive enterprise software with proprietary code which makes access and verification very difficult or open-access programs that are mostly based on command-line operations without user interfaces and extensive documentation. Moreover, a high level of disagreement is observed among popular mapping and variant calling algorithms in multiple studies, which makes relying on a single algorithm unreliable. User-friendly open-source software tools that offer comparative analysis are an important need considering the growth of sequencing technologies.

RESULTS

Here, we propose Comparative Sequencing Analysis Platform (COSAP), an open-source platform that provides popular sequencing algorithms for SNV, indel, structural variant calling, copy number variation, microsatellite instability and fusion analysis and their annotations. COSAP is packed with a fully functional user-friendly web interface and a backend server which allows full independent deployment for both individual and institutional scales. COSAP is developed as a workflow management system and designed to enhance cooperation among scientists with different backgrounds. It is publicly available at https://cosap.bio and https://github.com/MBaysanLab/cosap/ . The source code of the frontend and backend services can be found at https://github.com/MBaysanLab/cosap-webapi/ and https://github.com/MBaysanLab/cosap_frontend/ respectively. All services are packed as Docker containers as well. Pipelines that combine algorithms can be customized and new algorithms can be added with minimal coding through modular structure.

CONCLUSIONS

COSAP simplifies and speeds up the process of DNA sequencing analyses providing commonly used algorithms for SNV, indel, structural variant calling, copy number variation, microsatellite instability and fusion analysis as well as their annotations. COSAP is packed with a fully functional user-friendly web interface and a backend server which allows full independent deployment for both individual and institutional scales. Standardized implementations of popular algorithms in a modular platform make comparisons much easier to assess the impact of alternative pipelines which is crucial in establishing reproducibility of sequencing analyses.

Collapse

Fasaludeen A, McTague A, Jose M, Banerjee M, Sundaram S, Madhusoodanan UK, Radhakrishnan A, Menon RN. Genetic variant interpretation for the neurologist - A pragmatic approach in the next-generation sequencing era in childhood epilepsy. Epilepsy Res 2024;201:107341. [PMID: 38447235 DOI: 10.1016/j.eplepsyres.2024.107341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 02/14/2024] [Accepted: 02/29/2024] [Indexed: 03/08/2024]

Kvapilova K, Misenko P, Radvanszky J, Brzon O, Budis J, Gazdarica J, Pos O, Korabecna M, Kasny M, Szemes T, Kvapil P, Paces J, Kozmik Z. Validated WGS and WES protocols proved saliva-derived gDNA as an equivalent to blood-derived gDNA for clinical and population genomic analyses. BMC Genomics 2024;25:187. [PMID: 38365587 PMCID: PMC10873937 DOI: 10.1186/s12864-024-10080-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 02/02/2024] [Indexed: 02/18/2024] Open

Abstract

BACKGROUND

Whole exome sequencing (WES) and whole genome sequencing (WGS) have become standard methods in human clinical diagnostics as well as in population genomics (POPGEN). Blood-derived genomic DNA (gDNA) is routinely used in the clinical environment. Conversely, many POPGEN studies and commercial tests benefit from easy saliva sampling. Here, we evaluated the quality of variant call sets and the level of genotype concordance of single nucleotide variants (SNVs) and small insertions and deletions (indels) for WES and WGS using paired blood- and saliva-derived gDNA isolates employing genomic reference-based validated protocols.

METHODS

The genomic reference standard Coriell NA12878 was repeatedly analyzed using optimized WES and WGS protocols, and data calls were compared with the truth dataset published by the Genome in a Bottle Consortium. gDNA was extracted from the paired blood and saliva samples of 10 participants and processed using the same protocols. A comparison of paired blood-saliva call sets was performed in the context of WGS and WES genomic reference-based technical validation results.

RESULTS

The quality pattern of called variants obtained from genomic-reference-based technical replicates correlates with data calls of paired blood-saliva-derived samples in all levels of tested examinations despite a higher rate of non-human contamination found in the saliva samples. The F1 score of 10 blood-to-saliva-derived comparisons ranged between 0.8030-0.9998 for SNVs and between 0.8883-0.9991 for small-indels in the case of the WGS protocol, and between 0.8643-0.999 for SNVs and between 0.7781-1.000 for small-indels in the case of the WES protocol.

CONCLUSION

Saliva may be considered an equivalent material to blood for genetic analysis for both WGS and WES under strict protocol conditions. The accuracy of sequencing metrics and variant-detection accuracy is not affected by choosing saliva as the gDNA source instead of blood but much more significantly by the genomic context, variant types, and the sequencing technology used.

Collapse

Affiliation(s)

Katerina Kvapilova Faculty of Science, Charles University, Albertov 6, Prague, 128 00, Czech Republic. Institute of Applied Biotechnologies a.s, Služeb 4, Prague, 108 00, Czech Republic.
Pavol Misenko Geneton s.r.o, Ilkovičova 8, Bratislava, 841 04, Slovakia
Jan Radvanszky Geneton s.r.o, Ilkovičova 8, Bratislava, 841 04, Slovakia Institute of Clinical and Translational Research, Biomedical Research Center of the Slovak Academy of Sciences, Dúbravská Cesta 9, Bratislava, 845 05, Slovakia Department of Molecular Biology, Faculty of Natural Sciences, Comenius University, Ilkovičova 3278/6, Karlova Ves, Bratislava, 841 04, Slovakia Comenius University Science Park, Comenius University, Ilkovičova 8, Karlova Ves, Bratislava, 841 04, Slovakia
Ondrej Brzon Institute of Applied Biotechnologies a.s, Služeb 4, Prague, 108 00, Czech Republic
Jaroslav Budis Geneton s.r.o, Ilkovičova 8, Bratislava, 841 04, Slovakia Comenius University Science Park, Comenius University, Ilkovičova 8, Karlova Ves, Bratislava, 841 04, Slovakia Slovak Centre for Scientific and Technical Information, Staré Mesto, Lamačská Cesta 8A, Bratislava, 811 04, Slovakia
Juraj Gazdarica Geneton s.r.o, Ilkovičova 8, Bratislava, 841 04, Slovakia Comenius University Science Park, Comenius University, Ilkovičova 8, Karlova Ves, Bratislava, 841 04, Slovakia Slovak Centre for Scientific and Technical Information, Staré Mesto, Lamačská Cesta 8A, Bratislava, 811 04, Slovakia
Ondrej Pos Geneton s.r.o, Ilkovičova 8, Bratislava, 841 04, Slovakia Comenius University Science Park, Comenius University, Ilkovičova 8, Karlova Ves, Bratislava, 841 04, Slovakia
Marie Korabecna Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, Albertov 4, Prague, 128 00, Czech Republic
Martin Kasny Institute of Applied Biotechnologies a.s, Služeb 4, Prague, 108 00, Czech Republic Department of Botany and Zoology, Faculty of Science, Masaryk University, Kotlářská 2, Brno, 611 37, Czech Republic
Tomas Szemes Geneton s.r.o, Ilkovičova 8, Bratislava, 841 04, Slovakia Department of Molecular Biology, Faculty of Natural Sciences, Comenius University, Ilkovičova 3278/6, Karlova Ves, Bratislava, 841 04, Slovakia Comenius University Science Park, Comenius University, Ilkovičova 8, Karlova Ves, Bratislava, 841 04, Slovakia
Petr Kvapil Institute of Applied Biotechnologies a.s, Služeb 4, Prague, 108 00, Czech Republic
Jan Paces Laboratory of Genomics and Bioinformatics, Institute of Molecular Genetics of the Czech Academy of Sciences, Vídeňská 1083, Prague, 142 20, Czech Republic
Zbynek Kozmik Laboratory of Transcriptional Regulation, Institute of Molecular Genetics of the Czech Academy of Sciences, Vídeňská 1083, Prague, 142 20, Czech Republic

Collapse

Schobers G, Derks R, den Ouden A, Swinkels H, van Reeuwijk J, Bosgoed E, Lugtenberg D, Sun SM, Corominas Galbany J, Weiss M, Blok MJ, Olde Keizer RACM, Hofste T, Hellebrekers D, de Leeuw N, Stegmann A, Kamsteeg EJ, Paulussen ADC, Ligtenberg MJL, Bradley XZ, Peden J, Gutierrez A, Pullen A, Payne T, Gilissen C, van den Wijngaard A, Brunner HG, Nelen M, Yntema HG, Vissers LELM. Genome sequencing as a generic diagnostic strategy for rare disease. Genome Med 2024;16:32. [PMID: 38355605 PMCID: PMC10868087 DOI: 10.1186/s13073-024-01301-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 02/02/2024] [Indexed: 02/16/2024] Open

Abstract

BACKGROUND

To diagnose the full spectrum of hereditary and congenital diseases, genetic laboratories use many different workflows, ranging from karyotyping to exome sequencing. A single generic high-throughput workflow would greatly increase efficiency. We assessed whether genome sequencing (GS) can replace these existing workflows aimed at germline genetic diagnosis for rare disease.

METHODS

We performed short-read GS (NovaSeq™6000; 150 bp paired-end reads, 37 × mean coverage) on 1000 cases with 1271 known clinically relevant variants, identified across different workflows, representative of our tertiary diagnostic centers. Variants were categorized into small variants (single nucleotide variants and indels < 50 bp), large variants (copy number variants and short tandem repeats) and other variants (structural variants and aneuploidies). Variant calling format files were queried per variant, from which workflow-specific true positive rates (TPRs) for detection were determined. A TPR of ≥ 98% was considered the threshold for transition to GS. A GS-first scenario was generated for our laboratory, using diagnostic efficacy and predicted false negative as primary outcome measures. As input, we modeled the diagnostic path for all 24,570 individuals referred in 2022, combining the clinical referral, the transition of the underlying workflow(s) to GS, and the variant type(s) to be detected.

RESULTS

Overall, 95% (1206/1271) of variants were detected. Detection rates differed per variant category: small variants in 96% (826/860), large variants in 93% (341/366), and other variants in 87% (39/45). TPRs varied between workflows (79-100%), with 7/10 being replaceable by GS. Models for our laboratory indicate that a GS-first strategy would be feasible for 84.9% of clinical referrals (750/883), translating to 71% of all individuals (17,444/24,570) receiving GS as their primary test. An estimated false negative rate of 0.3% could be expected.

CONCLUSIONS

GS can capture clinically relevant germline variants in a 'GS-first strategy' for the majority of clinical indications in a genetics diagnostic lab.

Collapse

Affiliation(s)

Gaby Schobers Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Ronny Derks Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Amber den Ouden Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Hilde Swinkels Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Jeroen van Reeuwijk Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Ermanno Bosgoed Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Dorien Lugtenberg Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Su Ming Sun Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Jordi Corominas Galbany Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Marjan Weiss Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Marinus J Blok Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Richelle A C M Olde Keizer Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Tom Hofste Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Debby Hellebrekers Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Nicole de Leeuw Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Alexander Stegmann Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Erik-Jan Kamsteeg Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Aimee D C Paulussen Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Marjolijn J L Ligtenberg Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Xiangqun Zheng Bradley Illumina Inc., Cambridge, UK
John Peden Illumina Inc., Cambridge, UK
Alejandra Gutierrez Illumina Inc., Cambridge, UK
Adam Pullen Illumina Inc., Cambridge, UK
Tom Payne Illumina Inc., Cambridge, UK
Christian Gilissen Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Arthur van den Wijngaard Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Han G Brunner Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, Netherlands
Marcel Nelen Department of Human Genetics, Radboudumc, Nijmegen, Netherlands
Helger G Yntema Department of Human Genetics, Radboudumc, Nijmegen, Netherlands Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands
Lisenka E L M Vissers Department of Human Genetics, Radboudumc, Nijmegen, Netherlands. Research Institute for Medical Innovation, Radboudumc, Nijmegen, Netherlands.

Collapse

Petrin AL, Machado-Paula L, Hinkle A, Hovey L, Awotoye W, Chimenti M, Darbro B, Ribeiro-Bicudo LA, Dabdoub SM, Peter T, Murray J, Van Otterloo E, Rengasamy Venugopalan S, Moreno-Uribe LM. Whole genome sequencing of a family with autosomal dominant features within the oculoauriculovertebral spectrum. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.02.07.24301824. [PMID: 38370836 PMCID: PMC10871465 DOI: 10.1101/2024.02.07.24301824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]

Abstract

Background

Oculoauriculovertebral Spectrum (OAVS) encompasses a wide variety of anomalies on derivatives from the first and second pharyngeal arches including macrostomia, hemifacial microsomia, micrognathia, preauricular tags, ocular and vertebral anomalies. We present the genetic findings of a large three-generation family with multiple members affected with macrostomia, preauricular tags and uni- or bilateral ptosis following an autosomal dominant segregation pattern.

Methods

We generated whole genome sequencing data for the proband, affected parent and unaffected paternal grandparent followed by Sanger sequencing on 23 family members for the top 10 candidate genes: KCND2, PDGFRA, CASP9, NCOA3, WNT10A, SIX1, MTF1, KDR/VEGFR2, LRRK1, and TRIM2. We performed parent and sibling-based transmission disequilibrium tests and burden analysis to explore segregation and burden of candidate gene mutations. Bioinformatic analyses investigated the biological connection between genes and the abnormal phenotypes.

Results

Overall, rare missense mutations in SIX1, KDR/VEGFR2, and PDGFRA showed the best evidence of segregation with the OAV phenotypes in this family. When considering affection with any of the 3 OAVS phenotypes as an outcome, parent-TDTs and sib-TDTs (unadjusted p-values) found that SIX1 (p=0.025, p=0.052), followed by PDGFRA (p=0.180, p=0.069) and KDR/VEGFR2 (p=0.180, p=0.069) have the strongest associations in this family. Burden analysis via a penalized linear mixed model identified SIX1 (RC=0.87) and PDGFRA (RC=0.98) as having the strongest association with OAVS severity. Using phenotype-specific ogfrautcomes, sib-TDTs identified associations between (1) SIX1 with uni- or bilateral ptosis (p=0.049) and ear tags (p=0.01), (2) PDGFRA and KDR/VEGFR2 with ear tags (both p<0.01).

Conclusion

Our study reports the genomic findings of a large family with multiple individuals affected with OAVS phenotypes with autosomal dominant inheritance. Our findings narrow down to three potential candidate genes, SIX1, PDGFRA, and KDR/VEGFR2. Among these, SIX1 has been previously associated with OAVS ear malformations and it is co-expressed with EYA1 during ear development. Attempts to strengthen the genotype-phenotype co-relation underlying the OAVS of phenotypes are essential to discover the etiological factors leading to this complex and burdensome condition as well as for family counseling and prevention efforts.

Collapse

Brancato V, Esposito G, Coppola L, Cavaliere C, Mirabelli P, Scapicchio C, Borgheresi R, Neri E, Salvatore M, Aiello M. Standardizing digital biobanks: integrating imaging, genomic, and clinical data for precision medicine. J Transl Med 2024;22:136. [PMID: 38317237 PMCID: PMC10845786 DOI: 10.1186/s12967-024-04891-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 01/14/2024] [Indexed: 02/07/2024] Open

Abstract

Advancements in data acquisition and computational methods are generating a large amount of heterogeneous biomedical data from diagnostic domains such as clinical imaging, pathology, and next-generation sequencing (NGS), which help characterize individual differences in patients. However, this information needs to be available and suitable to promote and support scientific research and technological development, supporting the effective adoption of the precision medicine approach in clinical practice. Digital biobanks can catalyze this process, facilitating the sharing of curated and standardized imaging data, clinical, pathological and molecular data, crucial to enable the development of a comprehensive and personalized data-driven diagnostic approach in disease management and fostering the development of computational predictive models. This work aims to frame this perspective, first by evaluating the state of standardization of individual diagnostic domains and then by identifying challenges and proposing a possible solution towards an integrative approach that can guarantee the suitability of information that can be shared through a digital biobank. Our analysis of the state of the art shows the presence and use of reference standards in biobanks and, generally, digital repositories for each specific domain. Despite this, standardization to guarantee the integration and reproducibility of the numerical descriptors generated by each domain, e.g. radiomic, pathomic and -omic features, is still an open challenge. Based on specific use cases and scenarios, an integration model, based on the JSON format, is proposed that can help address this problem. Ultimately, this work shows how, with specific standardization and promotion efforts, the digital biobank model can become an enabling technology for the comprehensive study of diseases and the effective development of data-driven technologies at the service of precision medicine.

Collapse

Charron P, Kang M. VariantDetective: an accurate all-in-one pipeline for detecting consensus bacterial SNPs and SVs. Bioinformatics 2024;40:btae066. [PMID: 38366603 PMCID: PMC10898327 DOI: 10.1093/bioinformatics/btae066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 01/16/2024] [Accepted: 02/14/2024] [Indexed: 02/18/2024] Open

Ha YJ, Kang S, Kim J, Kim J, Jo SY, Kim S. Comprehensive benchmarking and guidelines of mosaic variant calling strategies. Nat Methods 2023;20:2058-2067. [PMID: 37828153 PMCID: PMC10703685 DOI: 10.1038/s41592-023-02043-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 09/12/2023] [Indexed: 10/14/2023]

Park H, Gim J. A comparative investigation of single nucleotide variant calling for a personal non-Caucasian sequencing sample. Genes Genomics 2023;45:1527-1536. [PMID: 37651066 DOI: 10.1007/s13258-023-01439-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 08/04/2023] [Indexed: 09/01/2023]

Abstract

BACKGROUND

Dropping cost and increasing clinical application of whole genome sequencing (WGS) lead a necessity of efficient (accurate and rapid) variant calling procedures from a personal WGS data (n = 1). A number of variant calling pipelines have been introduced utilizing the human genome reference GRCh38 as a reference and a benchmark dataset called 'NA12878', which are both 'standard' but limited ethnic origin. Considering the nature of variant calling algorithms and recent updates in sequencing protocol, however, it is necessary to revisit the efficiency of the current best pipelines for a personal WGS data from diverse ethnicity.

OBJECTIVE

We discuss the most efficient practices for variant calling of a personal WGS reads, with a particular emphasis on whether (1) ethnic match or mismatch between the reference genome and a WGS data produces a distinct result and more importantly (2) there is an ethnic-specific optimal workflow.

METHODS

Here, we generate an appropriate WGS data, DNA array, and sufficient number of Sanger validated variants from a single Korean subject to perform such a comprehensive comparison. We applied this WGS reads and the 'NA12878' reads to 8 different variant calling pipelines with 2 different reference genomes (GRCh38 and KOREF, a Korean reference genome) to which the WGS reads from different ethnic origins are aligned.

RESULTS

We evaluated the performance of the pipelines with the matched array genotype data and Sanger sequencing validation and demonstrated that: regardless to the ethnic match/mismatch (1) Novoalign-GATK4 showed the most efficient performance with the exceptional calls in MHC region; (2) the overall performance was better with GRCh38, while a significant difference in recall was observed. In addition, we found it is largely reduced computing cost maintaining performance to remove 'markduplication' step with PCR-free WGS data.

CONCLUSION

For variant calling of a personal PCR-free WGS data, regardless of ethnicity consideration, we recommend the use of the Novoalign + GATK4 with GRCh38 and without 'markduplication'.

Collapse

Xiang X, Lu B, Song D, Li J, Shu K, Pu D. Evaluating the performance of low-frequency variant calling tools for the detection of variants from short-read deep sequencing data. Sci Rep 2023;13:20444. [PMID: 37993475 PMCID: PMC10665316 DOI: 10.1038/s41598-023-47135-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/09/2023] [Indexed: 11/24/2023] Open

Abstract

Detection of low-frequency variants with high accuracy plays an important role in biomedical research and clinical practice. However, it is challenging to do so with next-generation sequencing (NGS) approaches due to the high error rates of NGS. To accurately distinguish low-level true variants from these errors, many statistical variants calling tools for calling low-frequency variants have been proposed, but a systematic performance comparison of these tools has not yet been performed. Here, we evaluated four raw-reads-based variant callers (SiNVICT, outLyzer, Pisces, and LoFreq) and four UMI-based variant callers (DeepSNVMiner, MAGERI, smCounter2, and UMI-VarCal) considering their capability to call single nucleotide variants (SNVs) with allelic frequency as low as 0.025% in deep sequencing data. We analyzed a total of 54 simulated data with various sequencing depths and variant allele frequencies (VAFs), two reference data, and Horizon Tru-Q sample data. The results showed that the UMI-based callers, except smCounter2, outperformed the raw-reads-based callers regarding detection limit. Sequencing depth had almost no effect on the UMI-based callers but significantly influenced on the raw-reads-based callers. Regardless of the sequencing depth, MAGERI showed the fastest analysis, while smCounter2 consistently took the longest to finish the variant calling process. Overall, DeepSNVMiner and UMI-VarCal performed the best with considerably good sensitivity and precision of 88%, 100%, and 84%, 100%, respectively. In conclusion, the UMI-based callers, except smCounter2, outperformed the raw-reads-based callers in terms of sensitivity and precision. We recommend using DeepSNVMiner and UMI-VarCal for low-frequency variant detection. The results provide important information regarding future directions for reliable low-frequency variant detection and algorithm development, which is critical in genetics-based medical research and clinical applications.

Collapse

Yi D, Nam JW, Jeong H. Toward the functional interpretation of somatic structural variations: bulk- and single-cell approaches. Brief Bioinform 2023;24:bbad297. [PMID: 37587831 PMCID: PMC10516374 DOI: 10.1093/bib/bbad297] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 07/05/2023] [Accepted: 07/23/2023] [Indexed: 08/18/2023] Open

Nguyen BQT, Tran TPD, Nguyen HT, Nguyen TN, Pham TMQ, Nguyen HTP, Tran DH, Nguyen V, Tran TS, Pham TVN, Le MT, Phan MD, Giang H, Nguyen HN, Tran LS. Improvement in neoantigen prediction via integration of RNA sequencing data for variant calling. Front Immunol 2023;14:1251603. [PMID: 37731488 PMCID: PMC10507271 DOI: 10.3389/fimmu.2023.1251603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Accepted: 08/17/2023] [Indexed: 09/22/2023] Open

Grossi A, Rusmini M, Cusano R, Massidda M, Santamaria G, Napoli F, Angelelli A, Fava D, Uva P, Ceccherini I, Maghnie M. Whole genome sequencing in ROHHAD trios proved inconclusive: what's beyond? Front Genet 2023;14:1031074. [PMID: 37609037 PMCID: PMC10440434 DOI: 10.3389/fgene.2023.1031074] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 07/27/2023] [Indexed: 08/24/2023] Open

Alganmi N, Abusamra H. Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools. PLoS One 2023;18:e0288371. [PMID: 37535628 PMCID: PMC10399881 DOI: 10.1371/journal.pone.0288371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Accepted: 06/26/2023] [Indexed: 08/05/2023] Open

Wilton R, Szalay AS. Short-read aligner performance in germline variant identification. Bioinformatics 2023;39:btad480. [PMID: 37527006 PMCID: PMC10421969 DOI: 10.1093/bioinformatics/btad480] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 06/01/2023] [Accepted: 07/31/2023] [Indexed: 08/03/2023] Open

O'Connell KA, Yosufzai ZB, Campbell RA, Lobb CJ, Engelken HT, Gorrell LM, Carlson TB, Catana JJ, Mikdadi D, Bonazzi VR, Klenk JA. Accelerating genomic workflows using NVIDIA Parabricks. BMC Bioinformatics 2023;24:221. [PMID: 37259021 PMCID: PMC10230726 DOI: 10.1186/s12859-023-05292-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 04/15/2023] [Indexed: 06/02/2023] Open

Abstract

BACKGROUND

As genome sequencing becomes better integrated into scientific research, government policy, and personalized medicine, the primary challenge for researchers is shifting from generating raw data to analyzing these vast datasets. Although much work has been done to reduce compute times using various configurations of traditional CPU computing infrastructures, Graphics Processing Units (GPUs) offer opportunities to accelerate genomic workflows by orders of magnitude. Here we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, including two germline callers (HaplotypeCaller and DeepVariant) and four somatic callers (Mutect2, Muse, LoFreq, SomaticSniper).

RESULTS

We achieved up to 65 × acceleration with germline variant callers, bringing HaplotypeCaller runtimes down from 36 h to 33 min on AWS, 35 min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the number of GPUs and computing platforms. On cloud platforms, GPU-accelerated germline callers resulted in cost savings compared with CPU runs, whereas some somatic callers were more expensive than CPU runs because their GPU acceleration was not sufficient to overcome the increased GPU cost.

CONCLUSIONS

Germline variant callers scaled well with the number of GPUs across platforms, whereas somatic variant callers exhibited more variation in the number of GPUs with the fastest runtimes, suggesting that, at least with the version of Parabricks used here, these workflows are less GPU optimized and require benchmarking on the platform of choice before being deployed at production scales. Our study demonstrates that GPUs can be used to greatly accelerate genomic workflows, thus bringing closer to grasp urgent societal advances in the areas of biosurveillance and personalized medicine.

Collapse

Zhai Y, Bardel C, Vallée M, Iwaz J, Roy P. Performance comparisons between clustering models for reconstructing NGS results from technical replicates. Front Genet 2023;14:1148147. [PMID: 37007945 PMCID: PMC10060969 DOI: 10.3389/fgene.2023.1148147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 03/06/2023] [Indexed: 03/18/2023] Open

Park H, Gim J. A comparative investigation of variant calling and genotyping for a single non-Caucasian whole genome. RESEARCH SQUARE 2023:rs.3.rs-2580940. [PMID: 36945432 PMCID: PMC10029055 DOI: 10.21203/rs.3.rs-2580940/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]

Nagasaki M, Sekiya Y, Asakura A, Teraoka R, Otokozawa R, Hashimoto H, Kawaguchi T, Fukazawa K, Inadomi Y, Murata KT, Ohkawa Y, Yamaguchi I, Mizuhara T, Tokunaga K, Sekiya Y, Hanawa T, Yamada R, Matsuda F. Design and implementation of a hybrid cloud system for large-scale human genomic research. Hum Genome Var 2023;10:6. [PMID: 36755016 PMCID: PMC9908893 DOI: 10.1038/s41439-023-00231-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 12/20/2022] [Accepted: 12/21/2022] [Indexed: 02/10/2023] Open

Affiliation(s)

Masao Nagasaki Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan. Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Yayoi Sekiya Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan
Akihiro Asakura Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan
Ryo Teraoka Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan
Ryoko Otokozawa Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan
Hiroki Hashimoto Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan
Takahisa Kawaguchi Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Keiichiro Fukazawa Academic Center for Computing and Media Studies, Kyoto University, Kyoto, Japan
Yuichi Inadomi Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Ken T Murata ICT Testbed Research and Development Promotion Center National Institute of Information and Communications Technology (NICT), Tokyo, Japan
Yasuyuki Ohkawa Division of Transcriptomics, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Japan
Izumi Yamaguchi Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Takamichi Mizuhara CLEALINK TECHNOLOGY Co., Ltd, Kyoto, Japan
Katsushi Tokunaga Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Yuji Sekiya Information Technology Center, The University of Tokyo, Chiba, Japan
Toshihiro Hanawa Information Technology Center, The University of Tokyo, Chiba, Japan
Ryo Yamada Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Fumihiko Matsuda Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan

Collapse

Bai H, Zhang X, Bush WS. Pharmacogenomic and Statistical Analysis. Methods Mol Biol 2023;2629:305-330. [PMID: 36929083 DOI: 10.1007/978-1-0716-2986-4_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]

Betschart RO, Thiéry A, Aguilera-Garcia D, Zoche M, Moch H, Twerenbold R, Zeller T, Blankenberg S, Ziegler A. Comparison of calling pipelines for whole genome sequencing: an empirical study demonstrating the importance of mapping and alignment. Sci Rep 2022;12:21502. [PMID: 36513709 PMCID: PMC9748128 DOI: 10.1038/s41598-022-26181-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 12/12/2022] [Indexed: 12/14/2022] Open

Affiliation(s)

Raphael O. Betschart Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265 Davos Wolfgang, Switzerland
Alexandre Thiéry Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265 Davos Wolfgang, Switzerland
Domingo Aguilera-Garcia grid.412004.30000 0004 0478 9977Institute of Pathology and Molecular Pathology, University Hospital Zurich, Schmelzbergstrasse 12, 8091 Zurich, Switzerland
Martin Zoche grid.412004.30000 0004 0478 9977Institute of Pathology and Molecular Pathology, University Hospital Zurich, Schmelzbergstrasse 12, 8091 Zurich, Switzerland
Holger Moch grid.412004.30000 0004 0478 9977Institute of Pathology and Molecular Pathology, University Hospital Zurich, Schmelzbergstrasse 12, 8091 Zurich, Switzerland
Raphael Twerenbold grid.13648.380000 0001 2180 3484Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,4grid.13648.380000 0001 2180 3484University Center of Cardiovascular Research Hamburg, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,5grid.452396.f0000 0004 5937 5237German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Tanja Zeller grid.13648.380000 0001 2180 3484Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,4grid.13648.380000 0001 2180 3484University Center of Cardiovascular Research Hamburg, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,5grid.452396.f0000 0004 5937 5237German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Stefan Blankenberg Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265 Davos Wolfgang, Switzerland ,3grid.13648.380000 0001 2180 3484Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,4grid.13648.380000 0001 2180 3484University Center of Cardiovascular Research Hamburg, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,5grid.452396.f0000 0004 5937 5237German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Andreas Ziegler Cardio-CARE, Medizincampus Davos, Herman-Burchard-Str. 1, 7265 Davos Wolfgang, Switzerland ,3grid.13648.380000 0001 2180 3484Department of Cardiology, University Heart & Vascular Center, University Medical Center Hamburg Eppendorf, Martinistr. 52, 20251 Hamburg, Germany ,6School Mathematics, Statistics and Computer Science, Scottsville, Private Bag X01, Pietermaritzburg, 3209 South Africa

Collapse

Chen J, Ying L, Zeng L, Li C, Jia Y, Yang H, Yang G. The novel compound heterozygous rare variants may impact positively selected regions of TUBGCP6, a microcephaly associated gene. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.1059477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open

In vitro germ cell induction from fertile and infertile monozygotic twin research participants. Cell Rep Med 2022;3:100782. [PMID: 36260988 PMCID: PMC9589117 DOI: 10.1016/j.xcrm.2022.100782] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 07/23/2022] [Accepted: 09/22/2022] [Indexed: 11/08/2022]

Zhang K, Yu L, Lin G, Li J. A multi-laboratory assessment of clinical exome sequencing for detection of hereditary disease variants: 4441 ClinVar variants for clinical genomic test development and validation. Clin Chim Acta 2022;535:99-107. [PMID: 35985503 DOI: 10.1016/j.cca.2022.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 08/01/2022] [Accepted: 08/05/2022] [Indexed: 11/30/2022]

Abstract

BACKGROUND AND AIMS

Whole-exome sequencing (WES) technology has become an essential tool in the clinical diagnostic for rare genetic disorders, however, the issues that reduce testing precision, sensitivity, and concordance are not clear under routine testing conditions. The study is to systematically evaluate the comparability of clinical WES testing results in laboratories under routine conditions.

METHODS

We designed a multi-laboratory study across 24 participating laboratories in China. We assessed sequencing quality across capture methods and sequencing platforms, benchmarked the impact of coverage and callable regions on detecting single nucleotide variants (SNVs), small insertions and deletions (Indels) under the same computational approaches, and compared the sensitivity, precision and reproducibility on detecting mutations across laboratories.

RESULTS

High inter-laboratory variability on variants detection were found across participating laboratories. Sample DNA concentration and sequencing evenness are two major variables that lead to the coverage variation. The difference in bioinformatics tools and computational settings affect the sensitivity and precision of the final output. Besides, copy-number variants (CNVs) identification is less reproducible than SNVs and Indels in the WES testing. We also compiled a list of 4441 low coverage ClinVar variants of 1176 genes from this study, which can be used as a source for creating in silico and synthetic DNA reference materials for clinical genetic disorder detection.

CONCLUSIONS

The considerable inter-laboratory variability seen in both sequencing coverage evenness and variants detection highlights the urgent need to improve the precision, sensitivity and comparability of the results generated across different laboratories. The list of low coverage variants can have important implications for the development and validation of clinical genetic disorder tests by laboratories. This study also serves to best practice inform guidelines for detecting clinical genetic disorders by exome sequencing.

Collapse

Mohr DW, Gaughran SJ, Paschall J, Naguib A, Pang AWC, Dudchenko O, Aiden EL, Church DM, Scott AF. A Chromosome-Length Assembly of the Hawaiian Monk Seal (Neomonachus schauinslandi): A History of “Genetic Purging” and Genomic Stability. Genes (Basel) 2022;13:genes13071270. [PMID: 35886053 PMCID: PMC9323584 DOI: 10.3390/genes13071270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 06/29/2022] [Accepted: 07/07/2022] [Indexed: 12/04/2022] Open

Protocol for unbiased, consolidated variant calling from whole exome sequencing data. STAR Protoc 2022;3:101418. [PMID: 35669050 PMCID: PMC9163752 DOI: 10.1016/j.xpro.2022.101418] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

The human "contaminome": bacterial, viral, and computational contamination in whole genome sequences from 1000 families. Sci Rep 2022;12:9863. [PMID: 35701436 PMCID: PMC9198055 DOI: 10.1038/s41598-022-13269-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 05/18/2022] [Indexed: 01/11/2023] Open

Schmidt J, Berghaus S, Blessing F, Herbeck H, Blessing J, Schierack P, Rödiger S, Roggenbuck D, Wenzel F. Genotyping of familial Mediterranean fever gene (MEFV)-Single nucleotide polymorphism-Comparison of Nanopore with conventional Sanger sequencing. PLoS One 2022;17:e0265622. [PMID: 35298548 PMCID: PMC8929590 DOI: 10.1371/journal.pone.0265622] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 03/04/2022] [Indexed: 11/18/2022] Open

Acosta-Uribe J, Aguillón D, Cochran JN, Giraldo M, Madrigal L, Killingsworth BW, Singhal R, Labib S, Alzate D, Velilla L, Moreno S, García GP, Saldarriaga A, Piedrahita F, Hincapié L, López HE, Perumal N, Morelo L, Vallejo D, Solano JM, Reiman EM, Surace EI, Itzcovich T, Allegri R, Sánchez-Valle R, Villegas-Lanau A, White CL, Matallana D, Myers RM, Browning SR, Lopera F, Kosik KS. A neurodegenerative disease landscape of rare mutations in Colombia due to founder effects. Genome Med 2022;14:27. [PMID: 35260199 PMCID: PMC8902761 DOI: 10.1186/s13073-022-01035-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 02/26/2022] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The Colombian population, as well as those in other Latin American regions, arose from a recent tri-continental admixture among Native Americans, Spanish invaders, and enslaved Africans, all of whom passed through a population bottleneck due to widespread infectious diseases that left small isolated local settlements. As a result, the current population reflects multiple founder effects derived from diverse ancestries.

METHODS

We characterized the role of admixture and founder effects on the origination of the mutational landscape that led to neurodegenerative disorders under these historical circumstances. Genomes from 900 Colombian individuals with Alzheimer's disease (AD) [n = 376], frontotemporal lobar degeneration-motor neuron disease continuum (FTLD-MND) [n = 197], early-onset dementia not otherwise specified (EOD) [n = 73], and healthy participants [n = 254] were analyzed. We examined their global and local ancestry proportions and screened this cohort for deleterious variants in disease-causing and risk-conferring genes.

RESULTS

We identified 21 pathogenic variants in AD-FTLD related genes, and PSEN1 harbored the majority (11 pathogenic variants). Variants were identified from all three continental ancestries. TREM2 heterozygous and homozygous variants were the most common among AD risk genes (102 carriers), a point of interest because the disease risk conferred by these variants differed according to ancestry. Several gene variants that have a known association with MND in European populations had FTLD phenotypes on a Native American haplotype. Consistent with founder effects, identity by descent among carriers of the same variant was frequent.

CONCLUSIONS

Colombian demography with multiple mini-bottlenecks probably enhanced the detection of founder events and left a proportionally higher frequency of rare variants derived from the ancestral populations. These findings demonstrate the role of genomically defined ancestry in phenotypic disease expression, a phenotypic range of different rare mutations in the same gene, and further emphasize the importance of inclusiveness in genetic studies.

Collapse

Affiliation(s)

Juliana Acosta-Uribe Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
David Aguillón Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
J Nicholas Cochran HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
Margarita Giraldo Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia Instituto Neurológico de Colombia (INDEC), Medellín, Colombia
Lucía Madrigal Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Bradley W Killingsworth Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
Rijul Singhal Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
Sarah Labib Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
Diana Alzate Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Lina Velilla Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Sonia Moreno Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Gloria P García Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Amanda Saldarriaga Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Francisco Piedrahita Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Liliana Hincapié Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Hugo E López Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Nithesh Perumal Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
Leonilde Morelo Department of Internal Medicine, School of Medicine, Universidad del Sinú, Montería, Colombia
Dionis Vallejo Department of Neurology, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Juan Marcos Solano Department of Neurology, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Eric M Reiman Banner Alzheimer's Institute, Phoenix, AZ, USA
Ezequiel I Surace Laboratorio de Enfermedades Neurodegenerativas (Fleni-CONICET), Buenos Aires, Argentina
Tatiana Itzcovich Laboratorio de Enfermedades Neurodegenerativas (Fleni-CONICET), Buenos Aires, Argentina
Ricardo Allegri Centro de Memoria y Envejecimiento (Fleni-CONICET), Buenos Aires, Argentina
Raquel Sánchez-Valle Alzheimer's Disease and Other Cognitive Disorders Unit, Hospital Clínic de Barcelona, IDIBAPS and University of Barcelona, Barcelona, Spain
Andrés Villegas-Lanau Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
Charles L White Neuropathology Section, Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX, USA
Diana Matallana Instituto de Envejecimiento, Department of Psychiatry, School of Medicine, Pontifical Xaverian University, Bogotá, Colombia Department of Mental Health, Hospital Universitario Santa Fe de Bogotá, Bogotá, Colombia
Richard M Myers HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
Sharon R Browning Department of Biostatistics, University of Washington, Seattle, WA, USA
Francisco Lopera Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia.
Kenneth S Kosik Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA.

Collapse

Pillay NS, Ross OA, Christoffels A, Bardien S. Current Status of Next-Generation Sequencing Approaches for Candidate Gene Discovery in Familial Parkinson´s Disease. Front Genet 2022;13:781816. [PMID: 35299952 PMCID: PMC8921601 DOI: 10.3389/fgene.2022.781816] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 01/12/2022] [Indexed: 11/13/2022] Open

Barbitoff YA, Abasov R, Tvorogova VE, Glotov AS, Predeus AV. Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery. BMC Genomics 2022;23:155. [PMID: 35193511 PMCID: PMC8862519 DOI: 10.1186/s12864-022-08365-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 02/03/2022] [Indexed: 12/30/2022] Open

Abstract

BACKGROUND

Accurate variant detection in the coding regions of the human genome is a key requirement for molecular diagnostics of Mendelian disorders. Efficiency of variant discovery from next-generation sequencing (NGS) data depends on multiple factors, including reproducible coverage biases of NGS methods and the performance of read alignment and variant calling software. Although variant caller benchmarks are published constantly, no previous publications have leveraged the full extent of available gold standard whole-genome (WGS) and whole-exome (WES) sequencing datasets.

RESULTS

In this work, we systematically evaluated the performance of 4 popular short read aligners (Bowtie2, BWA, Isaac, and Novoalign) and 9 novel and well-established variant calling and filtering methods (Clair3, DeepVariant, Octopus, GATK, FreeBayes, and Strelka2) using a set of 14 "gold standard" WES and WGS datasets available from Genome In A Bottle (GIAB) consortium. Additionally, we have indirectly evaluated each pipeline's performance using a set of 6 non-GIAB samples of African and Russian ethnicity. In our benchmark, Bowtie2 performed significantly worse than other aligners, suggesting it should not be used for medical variant calling. When other aligners were considered, the accuracy of variant discovery mostly depended on the variant caller and not the read aligner. Among the tested variant callers, DeepVariant consistently showed the best performance and the highest robustness. Other actively developed tools, such as Clair3, Octopus, and Strelka2, also performed well, although their efficiency had greater dependence on the quality and type of the input data. We have also compared the consistency of variant calls in GIAB and non-GIAB samples. With few important caveats, best-performing tools have shown little evidence of overfitting.

CONCLUSIONS

The results show surprisingly large differences in the performance of cutting-edge tools even in high confidence regions of the coding genome. This highlights the importance of regular benchmarking of quickly evolving tools and pipelines. We also discuss the need for a more diverse set of gold standard genomes that would include samples of African, Hispanic, or mixed ancestry. Additionally, there is also a need for better variant caller assessment in the repetitive regions of the coding genome.

Collapse

Wang N, Lysenkov V, Orte K, Kairisto V, Aakko J, Khan S, Elo LL. Tool evaluation for the detection of variably sized indels from next generation whole genome and targeted sequencing data. PLoS Comput Biol 2022;18:e1009269. [PMID: 35176018 PMCID: PMC8916674 DOI: 10.1371/journal.pcbi.1009269] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 03/11/2022] [Accepted: 01/30/2022] [Indexed: 11/18/2022] Open

Establishment of reference standards for multifaceted mosaic variant analysis. Sci Data 2022;9:35. [PMID: 35115554 PMCID: PMC8813952 DOI: 10.1038/s41597-022-01133-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 12/20/2021] [Indexed: 11/21/2022] Open

Sahraeian SME, Fang LT, Karagiannis K, Moos M, Smith S, Santana-Quintero L, Xiao C, Colgan M, Hong H, Mohiyuddin M, Xiao W. Achieving robust somatic mutation detection with deep learning models derived from reference data sets of a cancer sample. Genome Biol 2022;23:12. [PMID: 34996510 PMCID: PMC8740374 DOI: 10.1186/s13059-021-02592-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 12/28/2021] [Indexed: 12/13/2022] Open

Yeh CH, Chou YJ, Tsai TH, Hsu PWC, Li CH, Chan YH, Tsai SF, Ng SC, Chou KM, Lin YC, Juan YH, Fu TC, Lai CC, Sytwu HK, Tsai TF. Artificial-Intelligence-Assisted Discovery of Genetic Factors for Precision Medicine of Antiplatelet Therapy in Diabetic Peripheral Artery Disease. Biomedicines 2022;10:biomedicines10010116. [PMID: 35052795 PMCID: PMC8773099 DOI: 10.3390/biomedicines10010116] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Revised: 12/30/2021] [Accepted: 01/04/2022] [Indexed: 12/15/2022] Open

Affiliation(s)

Chi-Hsiao Yeh Department of Thoracic and Cardiovascular Surgery, Chang Gung Memorial Hospital, Taoyuan 333, Taiwan; College of Medicine, Chang Gung University, Taoyuan 333, Taiwan; (Y.-C.L.); (Y.-H.J.); (T.-C.F.) Community Medicine Research Center, Chang Gung Memorial Hospital, Keelung 204, Taiwan
Yi-Ju Chou Institute of Molecular and Genomic Medicine, National Health Research Institutes, Miaoli 350, Taiwan; (Y.-J.C.); (P.W.-C.H.); (S.-F.T.)
Tsung-Hsien Tsai Advanced Tech BU, Acer Inc., New Taipei City 221, Taiwan; (T.-H.T.); (C.-H.L.); (Y.-H.C.)
Paul Wei-Che Hsu Institute of Molecular and Genomic Medicine, National Health Research Institutes, Miaoli 350, Taiwan; (Y.-J.C.); (P.W.-C.H.); (S.-F.T.)
Chun-Hsien Li Advanced Tech BU, Acer Inc., New Taipei City 221, Taiwan; (T.-H.T.); (C.-H.L.); (Y.-H.C.)
Yun-Hsuan Chan Advanced Tech BU, Acer Inc., New Taipei City 221, Taiwan; (T.-H.T.); (C.-H.L.); (Y.-H.C.)
Shih-Feng Tsai Institute of Molecular and Genomic Medicine, National Health Research Institutes, Miaoli 350, Taiwan; (Y.-J.C.); (P.W.-C.H.); (S.-F.T.)
Soh-Ching Ng Department of Internal Medicine, Division of Endocrinology and Metabolism, Chang Gung Memorial Hospital, Keelung 204, Taiwan; (S.-C.N.); (K.-M.C.)
Kuei-Mei Chou Department of Internal Medicine, Division of Endocrinology and Metabolism, Chang Gung Memorial Hospital, Keelung 204, Taiwan; (S.-C.N.); (K.-M.C.)
Yu-Ching Lin College of Medicine, Chang Gung University, Taoyuan 333, Taiwan; (Y.-C.L.); (Y.-H.J.); (T.-C.F.) Department of Medical Imaging and Intervention, Chang Gung Memorial Hospital, Keelung 204, Taiwan
Yu-Hsiang Juan College of Medicine, Chang Gung University, Taoyuan 333, Taiwan; (Y.-C.L.); (Y.-H.J.); (T.-C.F.) Department of Medical Imaging and Intervention, Chang Gung Memorial Hospital, Keelung 204, Taiwan
Tieh-Cheng Fu College of Medicine, Chang Gung University, Taoyuan 333, Taiwan; (Y.-C.L.); (Y.-H.J.); (T.-C.F.) Department of Physical Medicine and Rehabilitation, Chang Gung Memorial Hospital, Keelung 204, Taiwan
Chi-Chun Lai College of Medicine, Chang Gung University, Taoyuan 333, Taiwan; (Y.-C.L.); (Y.-H.J.); (T.-C.F.) Community Medicine Research Center, Chang Gung Memorial Hospital, Keelung 204, Taiwan Department of Ophthalmology, Chang Gung Memorial Hospital, Keelung 204, Taiwan Correspondence: (C.-C.L.); (H.-K.S.); (T.-F.T.); Tel.: +886-2-24313131 (ext. 6101) (C.-C.L.); +886-37-206166 (ext. 31010) (H.-K.S.); +886-2-28267293 (T.-F.T.)
Huey-Kang Sytwu National Institute of Infectious Diseases and Vaccinology, National Health Research Institutes, Miaoli 350, Taiwan National Defense Medical Center, Department & Graduate Institute of Microbiology and Immunology, Taipei 114, Taiwan Correspondence: (C.-C.L.); (H.-K.S.); (T.-F.T.); Tel.: +886-2-24313131 (ext. 6101) (C.-C.L.); +886-37-206166 (ext. 31010) (H.-K.S.); +886-2-28267293 (T.-F.T.)
Ting-Fen Tsai Institute of Molecular and Genomic Medicine, National Health Research Institutes, Miaoli 350, Taiwan; (Y.-J.C.); (P.W.-C.H.); (S.-F.T.) Departments of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, Taipei 112, Taiwan Center for Healthy Longevity and Aging Sciences, National Yang Ming Chiao Tung University, Taipei 112, Taiwan Correspondence: (C.-C.L.); (H.-K.S.); (T.-F.T.); Tel.: +886-2-24313131 (ext. 6101) (C.-C.L.); +886-37-206166 (ext. 31010) (H.-K.S.); +886-2-28267293 (T.-F.T.)

Collapse

The correctness of large scale analysis of genomic data. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES 2021. [DOI: 10.2478/fcds-2021-0024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Yan YH, Chen SX, Cheng LY, Rodriguez AY, Tang R, Cabrera K, Zhang DY. Confirming putative variants at ≤ 5% allele frequency using allele enrichment and Sanger sequencing. Sci Rep 2021;11:11640. [PMID: 34079006 PMCID: PMC8172533 DOI: 10.1038/s41598-021-91142-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Accepted: 05/21/2021] [Indexed: 12/19/2022] Open

Prins BP, Leitsalu L, Pärna K, Fischer K, Metspalu A, Haller T, Snieder H. Advances in Genomic Discovery and Implications for Personalized Prevention and Medicine: Estonia as Example. J Pers Med 2021;11:jpm11050358. [PMID: 33946982 PMCID: PMC8145318 DOI: 10.3390/jpm11050358] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 04/19/2021] [Accepted: 04/25/2021] [Indexed: 02/07/2023] Open

Giles HH, Hegde MR, Lyon E, Stanley CM, Kerr ID, Garlapow ME, Eggington JM. The Science and Art of Clinical Genetic Variant Classification and Its Impact on Test Accuracy. Annu Rev Genomics Hum Genet 2021;22:285-307. [PMID: 33900788 DOI: 10.1146/annurev-genom-121620-082709] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]