1
Hazra S, Moulick D, Mukherjee A, Sahib S, Chowardhara B, Majumdar A, Upadhyay MK, Yadav P, Roy P, Santra SC, Mandal S, Nandy S, Dey A. Evaluation of efficacy of non-coding RNA in abiotic stress management of field crops: Current status and future prospective. Plant Physiology and Biochemistry: PPB 2023; 203:107940. [PMID: 37738864 DOI: 10.1016/j.plaphy.2023.107940] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/14/2023] [Revised: 07/23/2023] [Accepted: 08/04/2023] [Indexed: 09/24/2023]
Abstract
Abiotic stresses are responsible for major losses in crop yield all over the world. Stresses generate harmful reactive oxygen species (ROS), which can impair cellular processes in plants. Plants have therefore evolved antioxidant systems as a defence against stress-induced damage. The frequency of abiotic stressors has increased several-fold due to the climate change experienced in recent times and projected for the future. This has aggravated the risk of yield losses and threatened global food security. Non-coding RNAs (ncRNAs) are transcribed from the part of the eukaryotic genome that does not code for proteins. However, they have recently been found to play a crucial role in the responses of plants to both abiotic and biotic stresses. There are different types of ncRNAs, for example miRNAs and lncRNAs, which have the potential to regulate the expression of stress-related genes at the levels of transcription, post-transcription, and translation. The lncRNAs can also exert epigenetic effects on their target genes by altering histone modification status and chromatin organization. The current review attempts to deliver a comprehensive account of the role of ncRNAs in the regulation of plants' abiotic stress responses through ROS homeostasis. The potential applications of ncRNAs in the amelioration of abiotic stresses in field crops have also been evaluated.
Affiliation(s)
- Swati Hazra
- Sharda School of Agricultural Sciences, Sharda University, Greater Noida, Uttar Pradesh 201310, India.
- Debojyoti Moulick
- Department of Environmental Science, University of Kalyani, Nadia, West Bengal 741235, India.
- Synudeen Sahib
- S. S. Cottage, Njarackal, P.O.: Perinad, Kollam, 691601, Kerala, India.
- Bhaben Chowardhara
- Department of Botany, Faculty of Science and Technology, Arunachal University of Studies, Arunachal Pradesh 792103, India.
- Arnab Majumdar
- Department of Earth Sciences, Indian Institute of Science Education and Research (IISER) Kolkata, West Bengal 741246, India.
- Munish Kumar Upadhyay
- Department of Civil Engineering, Indian Institute of Technology Kanpur, Uttar Pradesh 208016, India.
- Poonam Yadav
- Institute of Environment and Sustainable Development, Banaras Hindu University, Varanasi, Uttar Pradesh 221005, India.
- Priyabrata Roy
- Department of Molecular Biology and Biotechnology, University of Kalyani, West Bengal 741235, India.
- Subhas Chandra Santra
- Department of Environmental Science, University of Kalyani, Nadia, West Bengal 741235, India.
- Sayanti Mandal
- Department of Biotechnology, Dr. D. Y. Patil Arts, Commerce & Science College (affiliated to Savitribai Phule Pune University), Sant Tukaram Nagar, Pimpri, Pune, Maharashtra-411018, India.
- Samapika Nandy
- School of Pharmacy, Graphic Era Hill University, Bell Road, Clement Town, Dehradun, 248002, Uttarakhand, India; Department of Botany, Vedanta College, 33A Shiv Krishna Daw Lane, Kolkata-700054, India.
- Abhijit Dey
- Department of Life Sciences, Presidency University, Kolkata, West Bengal 700073, India.
2
Tay Fernandez CG, Bayer PE, Petereit J, Varshney R, Batley J, Edwards D. The conservation of gene models can support genome annotation. The Plant Genome 2023; 16:e20377. [PMID: 37602500 DOI: 10.1002/tpg2.20377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/16/2023] [Revised: 07/19/2023] [Accepted: 07/24/2023] [Indexed: 08/22/2023]
Abstract
Many genome annotations include false-positive gene models, leading to errors in phylogenetic and comparative studies. Here, we propose a method to support gene model prediction based on evolutionary conservation and use it to identify potentially erroneous annotations. Using this method, we developed a set of 15,345 representative gene models from 12 legume assemblies that can be used to support genome annotations for other legumes.
Affiliation(s)
- Cassandria G Tay Fernandez
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, Western Australia, Australia
- Philipp E Bayer
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, Western Australia, Australia
- Jakob Petereit
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, Western Australia, Australia
- Rajeev Varshney
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Western Australia, Australia
- Centre of Excellence in Genomics & Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, Telangana, India
- Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, Western Australia, Australia
- David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, Western Australia, Australia
3
Abstract
Within the next decade, the genomes of 1.8 million eukaryotic species will be sequenced. Identifying genes in these sequences is essential to understand the biology of the species. This is challenging due to the transcriptional complexity of eukaryotic genomes, which encode hundreds of thousands of transcripts of multiple types. Among these, a small set of protein-coding mRNAs play a disproportionately large role in defining phenotypes. Due to their sequence conservation, orthology can be established, making it possible to define the universal catalog of eukaryotic protein-coding genes. This catalog should substantially contribute to uncovering the genomic events underlying the emergence of eukaryotic phenotypes. This piece briefly reviews the basics of protein-coding gene prediction, discusses challenges in finalizing annotation of the human genome, and proposes strategies for producing annotations across the eukaryotic Tree of Life. This lays the groundwork for obtaining the catalog of all genes: the Earth's code of life.
Affiliation(s)
- Roderic Guigó
- Bioinformatics and Genomics, Center for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Catalonia
- Universitat Pompeu Fabra (UPF), Barcelona, Catalonia
4
Mbebi AJ, Nikoloski Z. Gene regulatory network inference using mixed-norms regularized multivariate model with covariance selection. PLoS Comput Biol 2023; 19:e1010832. [PMID: 37523414 PMCID: PMC10414675 DOI: 10.1371/journal.pcbi.1010832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2022] [Revised: 08/10/2023] [Accepted: 07/11/2023] [Indexed: 08/02/2023] Open
Abstract
Despite extensive research efforts, reconstruction of gene regulatory networks (GRNs) from transcriptomics data remains a pressing challenge in systems biology. While non-linear approaches for reconstruction of GRNs show improved performance over simpler alternatives, we do not yet understand whether joint modelling of multiple target genes may improve performance, even under linearity assumptions. To address this problem, we propose two novel approaches that cast the GRN reconstruction problem as a blend between regularized multivariate regression and graphical models that combine the L2,1-norm with classical regularization techniques. We used data and networks from the DREAM5 challenge to show that the proposed models provide consistently good performance in comparison to contenders whose performance varies with data sets from simulation and experiments from the model unicellular organisms Escherichia coli and Saccharomyces cerevisiae. Since the models' formulation facilitates the prediction of master regulators, we also used the resulting findings to identify master regulators over all data sets as well as their plasticity across different environments. Our results demonstrate that the identified master regulators are in line with experimental evidence from the model bacterium E. coli. Together, our study demonstrates that simultaneous modelling of several target genes results in improved inference of GRNs and can be used as an alternative in different applications.
Affiliation(s)
- Alain J. Mbebi
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, Germany
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Germany
- Zoran Nikoloski
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, Germany
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Germany
5
Deshpande D, Chhugani K, Chang Y, Karlsberg A, Loeffler C, Zhang J, Muszyńska A, Munteanu V, Yang H, Rotman J, Tao L, Balliu B, Tseng E, Eskin E, Zhao F, Mohammadi P, Łabaj PP, Mangul S. RNA-seq data science: From raw data to effective interpretation. Front Genet 2023; 14:997383. [PMID: 36999049 PMCID: PMC10043755 DOI: 10.3389/fgene.2023.997383] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Received: 07/18/2022] [Accepted: 02/24/2023] [Indexed: 03/14/2023] Open
Abstract
RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation. The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon.
Affiliation(s)
- Dhrithi Deshpande
- Department of Pharmacology and Pharmaceutical Sciences, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Karishma Chhugani
- Department of Pharmacology and Pharmaceutical Sciences, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Yutong Chang
- Department of Pharmacology and Pharmaceutical Sciences, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Aaron Karlsberg
- Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Caitlin Loeffler
- Department of Computer Science, University of California, Los Angeles, CA, United States
- Jinyang Zhang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
- Agata Muszyńska
- Małopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Institute of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Gliwice, Poland
- Viorel Munteanu
- Department of Computers, Informatics and Microelectronics, Technical University of Moldova, Chisinau, Moldova
- Harry Yang
- Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, CA, United States
- Jeremy Rotman
- Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Laura Tao
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, CHS, Los Angeles, CA, United States
- Brunilda Balliu
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, CHS, Los Angeles, CA, United States
- Eleazar Eskin
- Department of Computer Science, University of California, Los Angeles, CA, United States
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, CHS, Los Angeles, CA, United States
- Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, United States
- Fangqing Zhao
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
- Pejman Mohammadi
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, United States
- Paweł P. Łabaj
- Małopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Department of Biotechnology, Boku University Vienna, Vienna, Austria
- Serghei Mangul
- Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Department of Quantitative and Computational Biology, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA, United States
- *Correspondence: Serghei Mangul
6
Moore DS, Lickliter R. Development as explanation: Understanding phenotypic stability and variability after the failure of genetic determinism. Progress in Biophysics and Molecular Biology 2023; 178:72-77. [PMID: 36682588 DOI: 10.1016/j.pbiomolbio.2023.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2022] [Accepted: 01/09/2023] [Indexed: 01/21/2023]
Abstract
In the predominantly gene-centered view of 20th century biology, the relationship between genotype and phenotype was essentially a relationship between cause and effect, between a plan and a product. Abandoning the idea of genes as inherited instructions or blueprints for phenotypes raises the question of how to best account for observed phenotypic stability and variability within and across generations of a population. We argue that the processes responsible for phenotypic stability and the processes responsible for phenotypic variability are one and the same, namely, the dynamics of development. This argument proposes that stability of phenotypic form is found not because of the transmission of genotypes, genetic programs, or the transfer of internal blueprints, but because similar internal and external conditions (collectively conceptualized as resources of development) can be reliably reconstituted in each generation. Variability of phenotypic form, which is an indispensable feature of any evolving system, relies on these same resources, but because the internal and external conditions of development are not reconstituted identically in succeeding generations, these conditions, and the phenotypes to which they give rise, will always be characterized by at least some variability.
Affiliation(s)
- David S Moore
- Pitzer College, Psychology Field Group, 1050 N. Mills Avenue, Claremont, CA, 91711, USA.
- Robert Lickliter
- Department of Psychology, Florida International University, 12000 SW 8th Street, Miami, FL, 33199, USA.
7
Is RNA the working genome in eukaryotes? The 60 year evolution of a conceptual challenge. Exp Cell Res 2023; 424:113493. [PMID: 36746314 DOI: 10.1016/j.yexcr.2023.113493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/26/2022] [Revised: 01/23/2023] [Accepted: 01/26/2023] [Indexed: 02/05/2023]
Abstract
About 80 years ago, in 1943, after a century of biochemical and genetic research, DNA was established as the carrier of genetic information. At the onset of Molecular Biology around 1960, the genome of living organisms embodied 3 basic, still unknown paradigms: its composition, organisation and expression. Between 1980 and 1990, its replication was understood, and ideas about its 3D-organisation were suggested and finally confirmed by 2010. The basic mechanisms of gene expression in higher organisms, the synthesis of precursor RNAs and their processing into functional RNAs, were also discovered about 60 years ago in 1961/62. However, some aspects were debated then and are still debated now, although the latest results in post-genomic research have confirmed the basic principles. When my history-essay was published in 2003, describing the discovery of RNA processing 40 years earlier, the main facts were not yet generally confirmed or acknowledged. The processing of pre-rRNA to 28S and 18S rRNA was clearly demonstrated, confirmed by others and generally accepted as a fact. However, the "giant" size of pre-mRNA, 10-100 kb long, and pervasive DNA transcription were still to be confirmed by post-genomic methods. It was found, surprisingly, that up to 90% of DNA is transcribed in the life cycle of eukaryotic organisms, thus showing that pervasive transcription was the general rule. In this essay, we shall take a journey through the 60-year history of evolving paradigms of gene expression which followed the emergence of Molecular Biology, and we will also evoke some of the "folklore" in research throughout this period. Most important was the growing recognition that although the genome is encoded in DNA, the Working Genome in eukaryotic organisms is RNA.
8
Petrzilek J, Pasulka J, Malik R, Horvat F, Kataruka S, Fulka H, Svoboda P. De novo emergence, existence, and demise of a protein-coding gene in murids. BMC Biol 2022; 20:272. [PMID: 36482406 PMCID: PMC9733328 DOI: 10.1186/s12915-022-01470-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/30/2022] [Accepted: 11/15/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND: Genes, principal units of genetic information, vary in complexity and evolutionary history. Less-complex genes (e.g., long non-coding RNA (lncRNA) expressing genes) readily emerge de novo from non-genic sequences and have high evolutionary turnover. Genesis of a gene may be facilitated by adoption of functional genic sequences from retrotransposon insertions. However, protein-coding sequences in extant genomes rarely lack any connection to an ancestral protein-coding sequence. RESULTS: We describe the remarkable evolution of the murine gene D6Ertd527e and its orthologs in the rodent Muroidea superfamily. D6Ertd527e emerged in a common ancestor of mice and hamsters, most likely as a lncRNA-expressing gene. A major contributing factor was a long terminal repeat (LTR) retrotransposon insertion carrying an oocyte-specific promoter and a 5' terminal exon of the gene. The gene survived as an oocyte-specific lncRNA in several extant rodents, while in some others the gene or its expression was lost. In the ancestral lineage of Mus musculus, the gene acquired protein-coding capacity, where the bulk of the coding sequence formed through CAG (AGC) trinucleotide repeat expansion and duplications. These events generated a cytoplasmic serine-rich maternal protein. Knock-out of D6Ertd527e in mice has a small but detectable effect on fertility and the maternal transcriptome. CONCLUSIONS: While this evolving gene does not show a clear function in laboratory mice, its documented evolutionary history in Muroidea during the last ~40 million years provides a textbook example of how several common mutation events can support de novo gene formation, evolution of protein-coding capacity, as well as a gene's demise.
Affiliation(s)
- Jan Petrzilek
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic; Present address: Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
- Josef Pasulka
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic
- Radek Malik
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic
- Filip Horvat
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic; Bioinformatics Group, Division of Biology, Faculty of Science, University of Zagreb, Horvatovac 102a, 10000 Zagreb, Croatia
- Shubhangini Kataruka
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic; Present address: Department of Genetics, Yale School of Medicine, New Haven, CT 06510 USA
- Helena Fulka
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic; Current address: Institute of Experimental Medicine of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic
- Petr Svoboda
- Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, 142 20 Prague 4, Czech Republic
9
De Houwer J, Hughes S. Learning in Individual Organisms, Genes, Machines, and Groups: A New Way of Defining and Relating Learning in Different Systems. Perspectives on Psychological Science 2022; 18:649-663. [PMID: 36257050 DOI: 10.1177/17456916221114886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]
Abstract
Learning is a central concept in many scientific disciplines. Communication about research on learning is, however, hampered by the fact that different researchers define learning in different ways. In this article, we introduce the extended functional definition of learning that can be used across scientific disciplines. We provide examples of how the definition can be applied to individual organisms, genes, machines, and groups. Using the extended functional definition (a) reveals a heuristic framework for research that can be applied across scientific disciplines, (b) allows researchers to engage in intersystem analyses that relate the behavior and learning of different systems, and (c) clarifies how learning differs from other phenomena such as (changes in) behavior, damaging systems, and programming systems.
Affiliation(s)
- Jan De Houwer
- Department of Experimental Clinical and Health Psychology, Ghent University
- Sean Hughes
- Department of Experimental Clinical and Health Psychology, Ghent University
10
James D, Bonam CM. Biogeographic ancestry information facilitates genetic racial essentialism: Consequences for race-based judgments. Journal of Applied Social Psychology 2022. [DOI: 10.1111/jasp.12932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022]
Affiliation(s)
- Drexler James
- Department of Psychology, University of Minnesota, Twin Cities, Minneapolis, Minnesota, USA
- Courtney M. Bonam
- Psychology Department, Critical Race and Ethnic Studies, University of California, Santa Cruz, Santa Cruz, California, USA
11
Vihinen M. Individual Genetic Heterogeneity. Genes (Basel) 2022; 13:1626. [PMID: 36140794 PMCID: PMC9498725 DOI: 10.3390/genes13091626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/12/2022] [Revised: 08/25/2022] [Accepted: 09/08/2022] [Indexed: 11/28/2022] Open
Abstract
Genetic variation has been widely covered in the literature, but not from the perspective of an individual in any species. Here, a synthesis of genetic concepts and variations relevant for individual genetic constitution is provided. All the different levels of genetic information and variation are covered, ranging from whether an organism is unmixed or hybrid, has variations in genome, chromosomes, and more locally in DNA regions, to epigenetic variants or alterations in selfish genetic elements. Genetic constitution and heterogeneity of microbiota are highly relevant for the health and wellbeing of an individual. Mutation rates vary widely for variation types, e.g., due to the sequence context. Genetic information guides numerous aspects in organisms. Types of inheritance, whether Mendelian or non-Mendelian, zygosity, sexual reproduction, and sex determination are covered. Functions of DNA and functional effects of variations are introduced, along with mechanisms that reduce and modulate functional effects, including TARAR countermeasures and intraindividual genetic conflict. TARAR countermeasures for tolerance, avoidance, repair, attenuation, and resistance are essential for life, integrity of genetic information, and gene expression. The genetic composition, effects of variations, and their expression are also considered in diseases and personalized medicine. The text synthesizes knowledge and insight on individual genetic heterogeneity and organizes and systematizes the central concepts.
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22184 Lund, Sweden
12
Abstract
Pharmacogenomics is increasingly important to guide objective, safe, and effective individualised prescribing. Personalised prescribing has revolutionised treatments in the past decade, allowing clinicians to maximise drug efficacy and minimise adverse effects based on a person's genetic profile. Opioids, the gold standard for cancer pain relief, are among the commonest medications prescribed in palliative care practice. This narrative review examines the literature surrounding opioid pharmacogenomics and its applicability to the palliative care cancer population. There is currently limited intersection between the fields of palliative care and pharmacogenomics, but growing evidence presents a need to build linkages between the two disciplines. Pharmacogenomic evidence guiding opioid prescribing is currently available for codeine and tramadol, and relates to CYP2D6 gene variants. However, these medications are prescribed less commonly for pain in palliative care. Research is accelerating with other opioids, where oxycodone (CYP2D6) and methadone (CYP2B6, ABCB1) already have moderate evidence of an association in terms of drug metabolism and downstream analgesic response and side effects. OPRM1 and COMT are receiving increasing attention and have implications for all opioids, with changes in opioid dosage requirements observed, but they have not yet been studied widely enough to be considered clinically actionable. Current evidence indicates that incorporation of pharmacogenomic testing into opioid prescribing practice should focus on the CYP2D6 gene and its actionable variants. Although opioid pharmacogenomic tests are not widely used in clinical practice, the progressively reducing costs and rapid turnover mean greater accessibility and affordability to patients, and thus clinicians will be increasingly asked to provide guidance in this area. The upsurge in pharmacogenomic research will likely discover more actionable gene variants to expand international guidelines to impact opioid prescribing. This rapidly expanding area requires consideration and monitoring by clinicians in order for key findings with clinical implications to be accessible, meaningfully interpretable and communicated.
13
Athanasouli M, Rödelsperger C. Analysis of repeat elements in the Pristionchus pacificus genome reveals an ancient invasion by horizontally transferred transposons. BMC Genomics 2022; 23:523. [PMID: 35854227 PMCID: PMC9297572 DOI: 10.1186/s12864-022-08731-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/29/2022] [Accepted: 07/01/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: Repetitive sequences and mobile elements make up considerable fractions of individual genomes. While transposition events can be detrimental for organismal fitness, repetitive sequences form an enormous reservoir for molecular innovation. In this study, we aim to add repetitive elements to the annotation of the Pristionchus pacificus genome and assess their impact on novel gene formation. RESULTS: Different computational approaches define up to 24% of the P. pacificus genome as repetitive sequences. While retroelements are more frequently found at the chromosome arms, DNA transposons are distributed more evenly. We found multiple DNA transposons, as well as LTR and LINE elements with abundant evidence of expression as single-exon transcripts. When testing whether transposons disproportionately contribute towards new gene formation, we found that roughly 10-20% of genes across all age classes overlap transposable elements with the strongest trend being an enrichment of low complexity regions among the oldest genes. Finally, we characterized a horizontal gene transfer of Zisupton elements into diplogastrid nematodes. These DNA transposons invaded nematodes from eukaryotic donor species and experienced a recent burst of activity in the P. pacificus lineage. CONCLUSIONS: The comprehensive annotation of repetitive elements in the P. pacificus genome builds a resource for future functional genomic analyses as well as for more detailed investigations of molecular innovations.
Affiliation(s)
- Marina Athanasouli
- Max Planck Institute for Biology, Department for Integrative Evolutionary Biology, Max-Planck-Ring 9, 72076, Tübingen, Germany
- Christian Rödelsperger
- Max Planck Institute for Biology, Department for Integrative Evolutionary Biology, Max-Planck-Ring 9, 72076, Tübingen, Germany
14
|
Liu D, Li J, Hao W, Lin X, Xia J, Zhu J, Yang S, Yang X. Chimeric RNA TNNI2-ACTA1-V1 Regulates Cell Proliferation by Regulating the Expression of NCOA3. Front Vet Sci 2022; 9:895190. [PMID: 35898549 PMCID: PMC9309209 DOI: 10.3389/fvets.2022.895190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2022] [Accepted: 06/15/2022] [Indexed: 11/13/2022]
Abstract
Chimeric RNA is a crucial target for tumor diagnosis and drug therapy, and it also has a unique biological role in normal tissues. TNNI2-ACTA1-V1 (TA-V1), a chimeric RNA discovered by our laboratory in porcine muscle tissue, can inhibit the proliferation of Porcine Skeletal Muscle Satellite Cells (PSCs). The regulatory mechanism of TA-V1 in PSCs remains unclear, but we speculate that NCOA3, DDR2 and RDX may be the target genes of TA-V1. In this study, we explored the effects of NCOA3, DDR2 and RDX on cell viability and cell proliferation by CCK-8 assay, EdU staining and flow cytometry. Furthermore, the regulatory pathway of proliferation in PSCs mediated by TA-V1 through NCOA3 or CyclinD1 was elucidated by co-transfection and co-immunoprecipitation (Co-IP). The results revealed that overexpression of NCOA3 significantly increased cell viability and the expression level of CyclinD1, and also promoted cell proliferation by shifting cells from the G1 phase to the S phase. In addition, inhibiting the expression of NCOA3 substantially reduced cell viability and inhibited cell proliferation. Overexpression of DDR2 and RDX had no significant effect on cell viability and proliferation. Co-transfection experiments showed that NCOA3 could rescue the proliferation inhibition of PSCs caused by TA-V1. Co-IP assay indicated that TA-V1 directly interacts with NCOA3. Our study explores the hypothesis that TA-V1 directly regulates NCOA3, indirectly regulating CyclinD1, thereby regulating PSC proliferation. We provide new putative mechanisms of porcine skeletal muscle growth and lay the foundation for the study of chimeric RNA in normal tissues.
|
15
|
Evaluating Plant Gene Models Using Machine Learning. PLANTS 2022; 11:plants11121619. [PMID: 35736770 PMCID: PMC9230120 DOI: 10.3390/plants11121619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/17/2022] [Revised: 06/12/2022] [Accepted: 06/17/2022] [Indexed: 11/28/2022]
Abstract
Gene models are regions of the genome that can be transcribed into RNA and translated to proteins, or belong to a class of non-coding RNA genes. The prediction of gene models is a complex process that can be unreliable, leading to false positive annotations. To help support the calling of confident conserved gene models and minimize false positives arising during gene model prediction, we have developed Truegene, a machine learning approach to classify potential low confidence gene models using 14 gene and 41 protein-based characteristics. Amino acid and nucleotide sequence-based features were calculated for conserved (high confidence) and non-conserved (low confidence) annotated genes from the published Pisum sativum Cameor genome. These features were used to train eXtreme Gradient Boost (XGBoost) classifier models to predict whether a gene model is likely to be real. The optimized models demonstrated a prediction accuracy ranging from 87% to 90% and an F-1 score of 0.91–0.94. We used SHapley Additive exPlanations (SHAP) and feature importance plots to identify the features that contribute to the model predictions, and we show that protein and gene-based features can be used to build accurate models for gene prediction that have applications in supporting future gene annotation processes.
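The abstract above reports both accuracy and an F-1 score. As a reminder of how the two metrics relate, here is a minimal sketch computing both from confusion-matrix counts; the counts are hypothetical and are not taken from the Truegene evaluation.

```python
def accuracy_and_f1(tp, fp, tn, fn):
    """Return (accuracy, F1) from binary confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)   # share of predicted positives that are real
    recall = tp / (tp + fn)      # share of real positives that were found
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


# Hypothetical counts for illustration only.
acc, f1 = accuracy_and_f1(tp=90, fp=10, tn=80, fn=20)
```

Because F1 ignores true negatives, it can exceed accuracy when the positive class is predicted well, which is consistent with an F-1 score above the reported accuracy range.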
|
16
|
Khan A, Saha G, Pal RK. Controlling the Effects of External Perturbations on a Gene Regulatory Network Using Proportional-Integral-Derivative Controller. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1531-1544. [PMID: 33206608 DOI: 10.1109/tcbb.2020.3039038] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 06/11/2023]
Abstract
Gene regulatory networks are biologically robust, which imparts resilience to living systems against most external perturbations affecting them. However, there is a limit to this robustness, and disturbances beyond this limit can impart unwanted signalling on one or more master regulators in a network. Certain disturbances may affect the functioning of other constituent genes of the same network. In most cases, this phenomenon can have some effect on the functioning of the living organism. In this investigation, we have proposed a methodology to mitigate the effects of external perturbations on a genetic network using a proportional-integral-derivative controller. The proposed controller has been used to perturb one or more of the other unaffected master regulators such that the most affected gene/s of the network revert to their normal state. The only required condition for this type of manoeuvring is that there should be multiple master regulators in a network. The proposed technique has been tested on a 10-gene DREAM4 benchmark network and also on a larger 20-gene network, where only downregulation has been considered due to data constraints. Simulation results indicate that the most vulnerable genes can be reverted to their normal expression levels in 10 out of the 16 simulations performed.
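For readers unfamiliar with the controller named above, the following is a minimal sketch of a discrete proportional-integral-derivative loop. It drives a toy one-variable "expression level" back to a setpoint under a constant disturbance. The plant model, gains, and disturbance value are assumptions chosen for illustration and do not reproduce the authors' network model.

```python
class PID:
    """Discrete proportional-integral-derivative controller."""

    def __init__(self, kp, ki, kd, setpoint):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint
        self.integral = 0.0
        self.prev_error = None  # no derivative term on the first step

    def update(self, measurement, dt):
        error = self.setpoint - measurement
        self.integral += error * dt
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def simulate(steps=3000, dt=0.01):
    """Euler simulation of d(level)/dt = -level + control - 0.3 (toy model)."""
    pid = PID(kp=2.0, ki=1.0, kd=0.05, setpoint=1.0)  # arbitrary gains
    level = 0.2  # perturbed starting expression level (hypothetical)
    for _ in range(steps):
        control = pid.update(level, dt)
        level += dt * (-level + control - 0.3)  # -0.3 is a constant disturbance
    return level


final_level = simulate()
```

The integral term is what removes the steady-state offset caused by the constant disturbance; a proportional-only controller would settle below the setpoint in this toy system.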
|
17
|
Oiwa NN, Li K, Cordeiro CE, Heermann DW. Prediction and comparative analysis of CTCF binding sites based on a first principle approach. Phys Biol 2022; 19. [PMID: 35290214 DOI: 10.1088/1478-3975/ac5dca] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/28/2021] [Accepted: 03/09/2022] [Indexed: 11/12/2022]
Abstract
We calculated the patterns for the CCCTC transcription factor (CTCF) binding sites across many genomes based on a first-principles approach. The first-principles method was validated on the human as well as on the mouse genome. The predicted human CTCF binding sites are consistent with the consensus sequence, ChIP-seq data for the K562 cell, nucleosome positions for the IMR90 cell, as well as the CTCF binding sites in the mouse HOXA gene. The analysis of Homo sapiens, Mus musculus, Sus scrofa, Capra hircus and Drosophila melanogaster whole genomes shows that binding sites are organized in cluster-like groups, where two consecutive sites obey a power law with coefficient ranging from 0.3292 ± 0.0068 to 0.5409 ± 0.0064; the distance between these groups varies from 18.08 ± 0.52 kbp to 42.1 ± 2.0 kbp. The genome of Aedes aegypti does not show a power law, but 19.9% of binding sites are 144 ± 4 and 287 ± 5 bp apart from each other. We ran negative tests, confirming the under-representation of CTCF binding sites in the Caenorhabditis elegans, Plasmodium falciparum and Arabidopsis thaliana complete genomes.
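The abstract does not state how the power-law coefficient was estimated. One generic approach, shown here only as a sketch on synthetic data, is a least-squares line fit in log-log space, where a relation y = c·x^(-alpha) becomes log y = log c - alpha·log x.

```python
import math


def fit_power_law(xs, ys):
    """Least-squares fit of log(y) = log(c) - alpha * log(x).

    Returns (alpha, c). All xs and ys must be positive.
    """
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)


# Synthetic data generated from y = 1000 * x^(-0.5); not from the paper.
xs = [10, 20, 50, 100, 200, 500]
ys = [1000 * x ** -0.5 for x in xs]
alpha, c = fit_power_law(xs, ys)
```

Log-log regression is simple but can be biased on noisy empirical distributions, where maximum-likelihood estimators are often preferred.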
Affiliation(s)
- Nestor Norio Oiwa
  - Theoretical Physics, Heidelberg University, Philosophenweg 19, Heidelberg, Baden-Württemberg, 69120, Germany
- Kunhe Li
  - Theoretical Physics, Heidelberg University, Philosophenweg 19, Heidelberg, 69117, Germany
- Claudette E Cordeiro
  - Department of Physics, Universidade Federal Fluminense, Avenida Atlantica s/n, Gragoatal, Niteroi, Rio de Janeiro, 24220-900, Brazil
- Dieter W Heermann
  - Theoretical Physics, Heidelberg University, Philosophenweg 19, Heidelberg, 69120, Germany
|
18
|
Bellazzi F. The emergence of the postgenomic gene. EUROPEAN JOURNAL FOR PHILOSOPHY OF SCIENCE 2022; 12:17. [PMID: 35222747 PMCID: PMC8847258 DOI: 10.1007/s13194-022-00446-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/26/2021] [Accepted: 01/06/2022] [Indexed: 06/14/2023]
Abstract
The identity and the existence of genes have been challenged by postgenomic discoveries. Specifically, the consideration of molecular and cellular phenomena in which genes are embedded has proved relevant for their understanding. In response to these challenges, I will argue that the complexity of genetic phenomena supports the weak emergence of genes from the DNA. In Section 2, I will set out what genes are taken to be in the postgenomic world. In Section 3, I will present the relevant account of emergence. I consider weak emergence as in Franklin and Knox (Studies for the History and Philosophy of Modern Physics, 64, 68-78, 2018), for which a phenomenon is emergent when it displays novelty and robustness. In Section 4, I will argue that genes are weakly emergent since they are novel, improving explanations, and robust with respect to some perturbations. Then, I will conclude in Section 5 that genes' emergence is a way to allow genes' flexibility and context dependency, without compromising their existence.
|
19
|
Li J, Singh U, Bhandary P, Campbell J, Arendsee Z, Seetharam AS, Wurtele ES. Foster thy young: enhanced prediction of orphan genes in assembled genomes. Nucleic Acids Res 2021; 50:e37. [PMID: 34928390 PMCID: PMC9023268 DOI: 10.1093/nar/gkab1238] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 08/27/2021] [Revised: 10/22/2021] [Accepted: 12/02/2021] [Indexed: 02/06/2023]
Abstract
Proteins encoded by newly-emerged genes ('orphan genes') share no sequence similarity with proteins in any other species. They provide organisms with a reservoir of genetic elements to quickly respond to changing selection pressures. Here, we systematically assess the ability of five gene prediction pipelines to accurately predict genes in genomes according to phylostratal origin. BRAKER and MAKER are existing, popular ab initio tools that infer gene structures by machine learning. Direct Inference is an evidence-based pipeline we developed to predict gene structures from alignments of RNA-Seq data. The BIND pipeline integrates ab initio predictions of BRAKER and Direct Inference; MIND combines Direct Inference and MAKER predictions. We use highly-curated Arabidopsis and yeast annotations as gold-standard benchmarks, and cross-validate in rice. Each pipeline under-predicts orphan genes (as few as 11 percent, under one prediction scenario). Increasing RNA-Seq diversity greatly improves prediction efficacy. The combined methods (BIND and MIND) yield the best predictions overall, with BIND identifying 68% of annotated orphan genes and 99% of ancient genes, and giving the highest sensitivity score regardless of dataset in Arabidopsis. We provide a lightweight, flexible, reproducible, and well-documented solution to improve gene prediction.
Affiliation(s)
- Jing Li
  - Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA; Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA; Genetics and Genomics Graduate Program, Iowa State University, Ames, IA 50014, USA
- Urminder Singh
  - Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA; Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA; Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
- Priyanka Bhandary
  - Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA; Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA; Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
- Jacqueline Campbell
  - Corn Insects and Crop Genetics Research Unit, US Department of Agriculture Agriculture Research Service, Ames, IA 50014, USA
- Zebulun Arendsee
  - Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA; Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA; Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
- Arun S Seetharam
  - Genome Informatics Facility, Iowa State University, Ames, IA 50014, USA
- Eve Syrkin Wurtele
  - Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA; Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA; Genetics and Genomics Graduate Program, Iowa State University, Ames, IA 50014, USA; Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
|
20
|
Zhou H, Tang W, Yang J, Peng J, Guo J, Fan C. MicroRNA-Related Strategies to Improve Cardiac Function in Heart Failure. Front Cardiovasc Med 2021; 8:773083. [PMID: 34869689 PMCID: PMC8639862 DOI: 10.3389/fcvm.2021.773083] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 09/09/2021] [Accepted: 10/25/2021] [Indexed: 12/18/2022]
Abstract
Heart failure (HF) describes a group of manifestations caused by the failure of heart function as a pump that supports blood flow through the body. MicroRNAs (miRNAs), as one type of non-coding RNA molecule, have crucial roles in the etiology of HF. Accordingly, miRNAs related to HF may represent potential novel therapeutic targets. In this review, we first discuss the different roles of miRNAs in the development and diseases of the heart. We then outline commonly used miRNA chemical modifications and delivery systems. Further, we summarize the opportunities and challenges for HF-related miRNA therapeutic targets, and discuss the first clinical trial of an antisense drug (CDR132L) in patients with HF. Finally, we outline current and future challenges and potential new directions for miRNA-based therapeutics for HF.
Affiliation(s)
- Huatao Zhou
  - Department of Cardiovascular Surgery, The Second Xiangya Hospital, Central South University, Changsha, China
- Weijie Tang
  - Department of Cardiovascular Surgery, The Second Xiangya Hospital, Central South University, Changsha, China
- Jinfu Yang
  - Department of Cardiovascular Surgery, The Second Xiangya Hospital, Central South University, Changsha, China; Department of Pharmacology, Hunan Provincial Key Laboratory of Cardiovascular Research, Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, China
- Jun Peng
  - Department of Pharmacology, Hunan Provincial Key Laboratory of Cardiovascular Research, Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, China
- Jianjun Guo
  - Hunan Fangsheng Pharmaceutical Co., Ltd. Changsha, China
- Chengming Fan
  - Department of Cardiovascular Surgery, The Second Xiangya Hospital, Central South University, Changsha, China; Department of Pharmacology, Hunan Provincial Key Laboratory of Cardiovascular Research, Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, China; Hunan Fangsheng Pharmaceutical Co., Ltd. Changsha, China
|
21
|
Schreiter T, Gieseler RK, Vílchez-Vargas R, Jauregui R, Sowa JP, Klein-Scory S, Broering R, Croner RS, Treckmann JW, Link A, Canbay A. Transcriptome-Wide Analysis of Human Liver Reveals Age-Related Differences in the Expression of Select Functional Gene Clusters and Evidence for a PPP1R10-Governed 'Aging Cascade'. Pharmaceutics 2021; 13:pharmaceutics13122009. [PMID: 34959291 PMCID: PMC8709089 DOI: 10.3390/pharmaceutics13122009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 10/24/2021] [Revised: 11/17/2021] [Accepted: 11/21/2021] [Indexed: 12/27/2022]
Abstract
A transcriptome-wide analysis of human liver for demonstrating differences between young and old humans has not yet been performed. However, identifying major age-related alterations in hepatic gene expression may pinpoint ontogenetic shifts with important hepatic and systemic consequences, provide novel pharmacogenetic information, offer clues to efficiently counteract symptoms of old age, and improve the overarching understanding of individual decline. Next-generation sequencing (NGS) data analyzed by the Mann-Whitney nonparametric test and Ensemble Feature Selection (EFS) bioinformatics identified 44 transcripts among 60,617 total and 19,986 protein-encoding transcripts that significantly (p = 0.0003 to 0.0464) and strikingly (EFS score > 0.3: 16 transcripts; EFS score > 0.2: 28 transcripts) differ between young and old livers. Most of these age-related transcripts were assigned to the categories 'regulome', 'inflammaging', 'regeneration', and 'pharmacogenes'. NGS results were confirmed by quantitative real-time polymerase chain reaction. Our results have important implications for the areas of ontogeny/aging and the age-dependent increase in major liver diseases. Finally, we present a broadly substantiated and testable hypothesis on a genetically governed 'aging cascade', wherein PPP1R10 acts as a putative ontogenetic master regulator, prominently flanked by IGFALS and DUSP1. This transcriptome-wide analysis of human liver offers potential clues towards developing safer and improved therapeutic interventions against major liver diseases and increased insights into key mechanisms underlying aging.
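The Mann-Whitney test named above compares two groups by rank rather than by raw value. A minimal sketch of the U statistic, with average ranks for ties, is given below. The expression values are hypothetical, and the p-value step (which depends on the distribution of U) is left out.

```python
def mann_whitney_u(a, b):
    """Return (U_a, U_b) for two samples, using average ranks for ties."""
    pooled = sorted([(v, 0) for v in a] + [(v, 1) for v in b])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        average_rank = (i + 1 + j) / 2  # mean of 1-based ranks i+1 .. j
        for k in range(i, j):
            ranks[k] = average_rank
        i = j
    rank_sum_a = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    n_a, n_b = len(a), len(b)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical expression values for one transcript in two age groups.
young = [1.2, 2.3, 2.9, 3.1]
old = [3.5, 4.0, 4.4, 5.2]
u_young, u_old = mann_whitney_u(young, old)
```

A U value of zero for one group means every value in that group ranks below every value in the other, the most extreme separation possible for the given sample sizes.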
Affiliation(s)
- Thomas Schreiter
  - Department of Medicine, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany; (T.S.); (R.K.G.); (J.-P.S.); (S.K.-S.)
  - Laboratory of Immunology & Molecular Biology, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany
- Robert K. Gieseler
  - Department of Medicine, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany; (T.S.); (R.K.G.); (J.-P.S.); (S.K.-S.)
  - Laboratory of Immunology & Molecular Biology, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany
- Ramiro Vílchez-Vargas
  - Department of Gastroenterology, Hepatology, and Infectious Diseases, Medical Faculty, Otto-von-Guericke University, 39120 Magdeburg, Germany; (R.V.-V.); (A.L.)
- Ruy Jauregui
  - Data Science Grasslands, Grasslands Research Centre, AgResearch, Palmerston North 4410, New Zealand
- Jan-Peter Sowa
  - Department of Medicine, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany; (T.S.); (R.K.G.); (J.-P.S.); (S.K.-S.)
  - Laboratory of Immunology & Molecular Biology, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany
- Susanne Klein-Scory
  - Department of Medicine, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany; (T.S.); (R.K.G.); (J.-P.S.); (S.K.-S.)
  - Laboratory of Immunology & Molecular Biology, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany
- Ruth Broering
  - Department of Gastroenterology and Hepatology, University Hospital Essen, University of Duisburg-Essen, 45147 Essen, Germany
- Roland S. Croner
  - Department of General, Visceral, Vascular and Transplantation Surgery, Medical Faculty, Otto-von-Guericke University, 39120 Magdeburg, Germany
- Jürgen W. Treckmann
  - Department of General, Visceral and Transplantation Surgery, University Hospital Essen, University of Duisburg-Essen, 45147 Essen, Germany
- Alexander Link
  - Department of Gastroenterology, Hepatology, and Infectious Diseases, Medical Faculty, Otto-von-Guericke University, 39120 Magdeburg, Germany; (R.V.-V.); (A.L.)
- Ali Canbay
  - Department of Medicine, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany; (T.S.); (R.K.G.); (J.-P.S.); (S.K.-S.)
  - Section of Hepatology and Gastroenterology, University Hospital Knappschaftskrankenhaus Bochum, Ruhr University Bochum, 44892 Bochum, Germany
  - Correspondence: Tel.: +49-234-299-3401
|
22
|
Duch W. Memetics and neural models of conspiracy theories. PATTERNS 2021; 2:100353. [PMID: 34820645 PMCID: PMC8600249 DOI: 10.1016/j.patter.2021.100353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/24/2022]
|
23
|
Tierney BT, Szymanski E, Henriksen JR, Kostic AD, Patel CJ. Using Cartesian Doubt To Build a Sequencing-Based View of Microbiology. mSystems 2021; 6:e0057421. [PMID: 34636670 PMCID: PMC8510522 DOI: 10.1128/msystems.00574-21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/07/2021] [Accepted: 09/23/2021] [Indexed: 12/13/2022]
Abstract
The technological leap of DNA sequencing generated a tension between modern metagenomics and historical microbiology. We are forcibly harmonizing the output of a modern tool with centuries of experimental knowledge derived from culture-based microbiology. As a thought experiment, we borrow the notion of Cartesian doubt from philosopher Rene Descartes, who used doubt to build a philosophical framework from his incorrigible statement that "I think therefore I am." We aim to cast away preconceived notions and conceptualize microorganisms through the lens of metagenomic sequencing alone. Specifically, we propose funding and building analysis and engineering methods that neither search for nor rely on the assumption of independent genomes bound by lipid barriers containing discrete functional roles and taxonomies. We propose that a view of microbial communities based in sequencing will engender novel insights into metagenomic structure and may capture functional biology not reflected within the current paradigm.
Affiliation(s)
- Braden T. Tierney
  - Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
  - Section on Pathophysiology and Molecular Pharmacology, Joslin Diabetes Center, Boston, Massachusetts, USA
  - Section on Islet Cell and Regenerative Biology, Joslin Diabetes Center, Boston, Massachusetts, USA
  - Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA
- Erika Szymanski
  - Department of English, Colorado State University, Fort Collins, Colorado, USA
- Aleksandar D. Kostic
  - Section on Pathophysiology and Molecular Pharmacology, Joslin Diabetes Center, Boston, Massachusetts, USA
  - Section on Islet Cell and Regenerative Biology, Joslin Diabetes Center, Boston, Massachusetts, USA
  - Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA
- Chirag J. Patel
  - Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
|
24
|
García S. A, Casamayor JC. On how to generalize species-specific conceptual schemes to generate a species-independent Conceptual Schema of the Genome. BMC Bioinformatics 2021; 22:353. [PMID: 34592923 PMCID: PMC8482561 DOI: 10.1186/s12859-021-04237-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/25/2021] [Accepted: 06/04/2021] [Indexed: 11/10/2022]
Abstract
BACKGROUND Understanding the genome, with all of its components and intrinsic relationships, is a great challenge. Conceptual modeling techniques have been used as a means to face this challenge. The heterogeneity and idiosyncrasy of genomic use cases mean that conceptual modeling techniques are used to generate conceptual schemes that focus on too specific scenarios (i.e., they are species-specific conceptual schemes). Our research group developed two different conceptual schemes. The first one is the Conceptual Schema of the Human Genome, which is intended to improve Precision Medicine and genetic diagnosis. The second one is the Conceptual Schema of the Citrus Genome, which is intended to identify the genetic cause of relevant phenotypes in the agri-food field. METHODS Our two conceptual schemes have been ontologically compared to identify their similarities and differences. Based on this comparison, several changes have been performed in the Conceptual Schema of the Human Genome in order to obtain the first version of a species-independent Conceptual Schema of the Genome. Identifying the different genome information items used in each genomic case study has been essential in achieving our goal. The changes needed to provide an expanded, more generic version of the Conceptual Schema of the Human Genome are analyzed and discussed. RESULTS This work presents a new CS called the Conceptual Schema of the Genome that is ready to be adapted to any specific working genome-based context (i.e., species-independent). CONCLUSION The generated Conceptual Schema of the Genome works as a global, generic element from which conceptual views can be created in order to work with any specific species. This first working version can be used in the human use case, in the citrus use case, and, potentially, in more use cases of other species.
Affiliation(s)
- Alberto García S.
  - PROS Research Center, Universitat Politècnica de València, Camino de Vera, Valencia, Spain
- Juan Carlos Casamayor
  - PROS Research Center, Universitat Politècnica de València, Camino de Vera, Valencia, Spain
|
25
|
Van Regenmortel MHV. Design in biology and rational design in vaccinology: A conceptual analysis. Methods 2021; 195:120-127. [PMID: 34352372 DOI: 10.1016/j.ymeth.2021.07.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/18/2021] [Revised: 07/20/2021] [Accepted: 07/29/2021] [Indexed: 10/20/2022]
Abstract
This review discusses the philosophical foundations of what used to be called "the scientific method" and is nowadays often known as the scientific attitude. It used to be believed that scientific theories and methods aimed at the truth, especially in the case of physics, chemistry and astronomy, because these sciences were able to develop numerous scientific laws that made it possible to understand and predict many physical phenomena. The situation is different in the case of the biological sciences, which deal with highly complex living organisms made up of huge numbers of constituents that undergo continuous dynamic processes; this leads to novel emergent properties in organisms that cannot be predicted because they are not present in the constituents before they have interacted with each other. This is one of the reasons why there are no universal scientific laws in biology. Furthermore, all scientific theories can only achieve a restricted level of predictive success because they remain valid only under the limited range of conditions that were used for establishing the theory in the first place. Many theories that used to be accepted were subsequently shown to be false, demonstrating that scientific theories always remain tentative and can never be proven beyond any doubt. It is ironic that as scientists have finally accepted that approximate truths are perfectly adequate and that absolute truth is an illusion, a new irrational sociological phenomenon called Post-Truth, conveyed by social media, the Internet and fake news, has developed in the Western world and is convincing millions of people that truth simply does not exist. Misleading information is circulated with the intention to deceive, and science denialism is promoted by denying the remarkable achievements of science and technology during the last centuries.
Although the concept of intentional design is widely used to describe the methods that biologists use to make discoveries and inventions, it will be argued that the term is not appropriate for explaining the appearance of life on our planet nor for describing the scientific creativity of scientific investigators. The term rational for describing the development of new vaccines is also unjustified. Because the analysis of the COVID-19 pandemic requires contributions from biomedical and psycho-socioeconomic sciences, one scientific method alone would be insufficient for combatting the pandemic.
|
26
|
Ahmed Z, Renart EG, Zeeshan S. Genomics pipelines to investigate susceptibility in whole genome and exome sequenced data for variant discovery, annotation, prediction and genotyping. PeerJ 2021; 9:e11724. [PMID: 34395068 PMCID: PMC8320519 DOI: 10.7717/peerj.11724] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 03/29/2021] [Accepted: 06/14/2021] [Indexed: 12/12/2022]
Abstract
Over the last few decades, genomics has been leading toward an audacious future, and has been changing our views about conducting biomedical research, studying diseases, and understanding diversity in our society across the human species. Whole genome and exome sequencing (WGS/WES) are two of the most popular next-generation sequencing (NGS) methodologies currently being used to detect genetic variations of clinical significance. Investigating WGS/WES data for variant discovery and genotyping is based on the nexus of different data analytic applications. Although several bioinformatics applications have been developed, and many of those are freely available and published, timely finding and interpreting genetic variants are still challenging tasks among diagnostic laboratories and clinicians. In this study, we are interested in understanding, evaluating, and reporting the current state of solutions available to process NGS data of variable lengths and types for the identification of variants, alleles, and haplotypes. Within this scope, we consulted high-quality peer-reviewed literature published in the last 10 years. We focused on the standalone and networked bioinformatics applications proposed to efficiently process WGS and WES data, and support downstream analysis for gene-variant discovery, annotation, prediction, and interpretation. We have discussed our findings in this manuscript, which include but are not limited to the set of operations, workflow, data handling, involved tools, technologies and algorithms, and limitations of the assessed applications.
Affiliation(s)
- Zeeshan Ahmed
  - Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA; Department of Medicine, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
- Eduard Gibert Renart
  - Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
- Saman Zeeshan
  - Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
|
27
|
Gene expression regulates metabolite homeostasis during the Crabtree effect: Implications for the adaptation and evolution of Metabolism. Proc Natl Acad Sci U S A 2021; 118:2014013118. [PMID: 33372135 DOI: 10.1073/pnas.2014013118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 12/13/2022]
Abstract
A key issue in both molecular and evolutionary biology has been to define the roles of genes and phenotypes in the adaptation of organisms to environmental changes. The dominant view has been that an organism's metabolic adaptations are driven by gene expression and that gene mutations, independent of the starting phenotype, are responsible for the evolution of new metabolic phenotypes. We propose an alternate hypothesis, in which the phenotype and genotype together determine metabolic adaptation both in the lifetime of the organism and in the evolutionary selection of adaptive metabolic traits. We tested this hypothesis by flux-balance and metabolic-control analysis of the relative roles of the starting phenotype and gene expression in regulating the metabolic adaptations during the Crabtree effect in yeast, when they are switched from a low- to high-glucose environment. Critical for successful short-term adaptation was the ability of the glycogen/trehalose shunt to balance the glycolytic pathway. The role of later gene expression of new isoforms of glycolytic enzymes, rather than flux control, was to provide additional homeostatic mechanisms allowing an increase in the amount and efficiency of adenosine triphosphate and product formation while maintaining glycolytic balance. We further showed that homeostatic mechanisms, by allowing increased phenotypic plasticity, could have played an important role in guiding the evolution of the Crabtree effect. Although our findings are specific to Crabtree yeast, they are likely to be broadly found because of the well-recognized similarities in glucose metabolism across kingdoms and phyla from yeast to humans.
|
28
|
Micaglio E, Locati ET, Monasky MM, Romani F, Heilbron F, Pappone C. Role of Pharmacogenetics in Adverse Drug Reactions: An Update towards Personalized Medicine. Front Pharmacol 2021; 12:651720. [PMID: 33995067 PMCID: PMC8120428 DOI: 10.3389/fphar.2021.651720] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 01/10/2021] [Accepted: 03/29/2021] [Indexed: 12/28/2022]
Abstract
Adverse drug reactions (ADRs) are an important and frequent cause of morbidity and mortality. ADRs can be related to a variety of drugs, including anticonvulsants, anaesthetics, antibiotics, antiretrovirals, anticancer drugs, and antiarrhythmics, and can involve every organ or apparatus. The causes of ADRs are still poorly understood due to their clinical heterogeneity and complexity. In this scenario, genetic predisposition toward ADRs is an emerging issue, not only in anticancer chemotherapy, but also in many other fields of medicine, including hemolytic anemia due to glucose-6-phosphate dehydrogenase (G6PD) deficiency, aplastic anemia, porphyria, malignant hyperthermia, epidermal tissue necrosis (Lyell's Syndrome and Stevens-Johnson Syndrome), epilepsy, thyroid diseases, diabetes, Long QT and Brugada Syndromes. The role of genetic mutations in the pathogenesis of ADRs has been shown for both dose-dependent and dose-independent reactions. In this review, we present an update of the genetic background of ADRs, with phenotypic manifestations involving blood, muscles, heart, thyroid, liver, and skin disorders. This review aims to illustrate the growing usefulness of genetics both to prevent ADRs and to optimize the safe therapeutic use of many common drugs. From this perspective, ADRs could become an untoward "stress test," leading to new diagnoses of genetically determined diseases. Thus, the wider use of pharmacogenetic testing in the work-up of ADRs will lead to new clinical diagnoses of previously unsuspected diseases and to improved safety and efficacy of therapies. Improving the genotype-phenotype correlation through new lab techniques and the implementation of artificial intelligence may in the future lead to personalized medicine that is able to predict ADRs and consequently to choose the appropriate compound and dosage for each patient.
Affiliation(s)
- Emanuele Micaglio
- Arrhythmology and Electrophysiology Department, IRCCS Policlinico San Donato, Milan, Italy
- Emanuela T Locati
- Arrhythmology and Electrophysiology Department, IRCCS Policlinico San Donato, Milan, Italy
- Michelle M Monasky
- Arrhythmology and Electrophysiology Department, IRCCS Policlinico San Donato, Milan, Italy
- Federico Romani
- Arrhythmology and Electrophysiology Department, IRCCS Policlinico San Donato, Milan, Italy; Vita-Salute San Raffaele University, (Vita-Salute University) for Federico Romani, Milan, Italy
- Carlo Pappone
- Arrhythmology and Electrophysiology Department, IRCCS Policlinico San Donato, Milan, Italy; Vita-Salute San Raffaele University, (Vita-Salute University) for Federico Romani, Milan, Italy
|
29
|
Hounkpe BW, Chenou F, de Lima F, De Paula E. HRT Atlas v1.0 database: redefining human and mouse housekeeping genes and candidate reference transcripts by mining massive RNA-seq datasets. Nucleic Acids Res 2021; 49:D947-D955. [PMID: 32663312 PMCID: PMC7778946 DOI: 10.1093/nar/gkaa609] [Citation(s) in RCA: 100] [Impact Index Per Article: 33.3] [Received: 06/19/2020] [Accepted: 07/08/2020] [Indexed: 12/18/2022]
Abstract
Housekeeping (HK) genes are constitutively expressed genes that are required for the maintenance of basic cellular functions. Despite their importance in the calibration of gene expression, as well as in the understanding of many genomic and evolutionary features, important discrepancies have been observed in studies that previously identified these genes. Here, we present the Housekeeping and Reference Transcript Atlas (HRT Atlas v1.0, www.housekeeping.unicamp.br), a web-based database which addresses some of the previously observed limitations in the identification of these genes, and offers a more accurate database of human and mouse HK genes and transcripts. The database was generated by mining massive human and mouse RNA-seq data sets, including 11 281 and 507 high-quality RNA-seq samples from 52 human non-disease tissues/cells and 14 healthy tissues/cells of C57BL/6 wild type mouse, respectively. Users can visualize the expression and download lists of 2158 human HK transcripts from 2176 HK genes and 3024 mouse HK transcripts from 3277 mouse HK genes. HRT Atlas also offers the most stable and suitable tissue-selective candidate reference transcripts for normalization of qPCR experiments. Specific primers and predicted modifiers of gene expression for some of these HK transcripts are also proposed. HRT Atlas has also been integrated with a regulatory elements resource from the Epiregio server.
Affiliation(s)
- Francine Chenou
- School of Medical Sciences, University of Campinas, Campinas, SP, Brazil
- Franciele de Lima
- School of Medical Sciences, University of Campinas, Campinas, SP, Brazil
- Erich Vinicius De Paula
- School of Medical Sciences, University of Campinas, Campinas, SP, Brazil
- Hematology and Hemotherapy Center, University of Campinas, Campinas, SP, Brazil
|
30
|
Harnessing the Complexity of Data in Precision Oncology. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11668-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022]
|
31
|
Bogard B, Francastel C, Hubé F. Multiple information carried by RNAs: total eclipse or a light at the end of the tunnel? RNA Biol 2020; 17:1707-1720. [PMID: 32559119 PMCID: PMC7714488 DOI: 10.1080/15476286.2020.1783868] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 03/18/2020] [Revised: 06/06/2020] [Accepted: 06/12/2020] [Indexed: 12/14/2022]
Abstract
The findings that an RNA is not necessarily either coding or non-coding, or that a precursor RNA can produce different types of mature RNAs, whether coding or non-coding, long or short, have challenged the dichotomous view of the RNA world almost 15 years ago. Since then, and despite an increasing number of studies, the diversity of information that can be conveyed by RNAs is rarely searched for, and when it is known, it remains largely overlooked in further functional studies. Here, we provide an update with prominent examples of multiple functions that are carried by the same RNA or are produced by the same precursor RNA, to emphasize their biological relevance in most living organisms. An important consequence is that the overall function of their locus of origin results from the balance between various RNA species with distinct functions and fates. The consideration of the molecular basis of this multiplicity of information is obviously crucial for downstream functional studies when the targeted functional molecule is often not the one that is believed.
Affiliation(s)
- Baptiste Bogard
- Université De Paris, Epigenetics and Cell Fate, CNRS, Paris, France
- Florent Hubé
- Université De Paris, Epigenetics and Cell Fate, CNRS, Paris, France
|
32
|
Abstract
Since its appearance, Evolutionary Developmental Biology (EvoDevo) has been called an emerging research program, a new paradigm, a new interdisciplinary field, or even a revolution. Behind these formulas, there is the awareness that something is changing in biology. EvoDevo is characterized by a variety of accounts and by an expanding theoretical framework. From an epistemological point of view, what is the relationship between EvoDevo and previous biological tradition? Is EvoDevo the carrier of a new message about how to conceive evolution and development? Furthermore, is it necessary to rethink the way we look at both of these processes? EvoDevo represents the attempt to synthesize two logics, that of evolution and that of development, and the way we conceive one affects the other. This synthesis is far from being fulfilled, but an adequate theory of development may represent a further step towards this achievement. In this article, an epistemological analysis of EvoDevo is presented, with particular attention paid to the relations to the Extended Evolutionary Synthesis (EES) and the Standard Evolutionary Synthesis (SET).
|
33
|
Schnable JC. Genes and gene models, an important distinction. THE NEW PHYTOLOGIST 2020; 228:50-55. [PMID: 31241760 DOI: 10.1111/nph.16011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 02/18/2019] [Accepted: 06/07/2019] [Indexed: 05/22/2023]
Abstract
Genome sequencing has fundamentally changed how plant biologists think about genes. All or nearly all genes can ultimately be associated with a gene model. However, many gene models appear to play little or no role in the traits of an organism. A range of structural, molecular, population and evolutionary features all show a separation between genes with known phenotypes and the overall set of annotated gene models. These different features could be combined to develop models to distinguish the genes that determine the traits of plants from the subset of other annotated gene models which are unlikely to play a role in doing so. Efforts to identify the subset of annotated gene models likely involved in specifying the characteristics of plants would aid a wide range of researchers.
Affiliation(s)
- James C Schnable
- Department of Agronomy and Horticulture and Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE, 68588, USA
|
34
|
Ni WJ, Xie F, Leng XM. Terminus-Associated Non-coding RNAs: Trash or Treasure? Front Genet 2020; 11:552444. [PMID: 33101379 PMCID: PMC7522407 DOI: 10.3389/fgene.2020.552444] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 04/16/2020] [Accepted: 08/25/2020] [Indexed: 12/13/2022]
Abstract
3′ untranslated regions (3′ UTRs) of protein-coding genes are well known for their important roles in determining the fate of mRNAs in diverse processes, including trafficking, stabilization, translation, and RNA–protein interactions. However, non-coding RNAs (ncRNAs) scattered around 3′ termini of the protein-coding genes, here referred to as terminus-associated non-coding RNAs (TANRs), have not attracted wide attention in RNA research. Indeed, whether TANRs are transcriptional noise, degraded mRNA products, alternative 3′ UTRs, or functional molecules has remained unclear for a long time. As a new category of ncRNAs, TANRs are widespread, abundant, and conserved in diverse eukaryotes. The biogenesis of TANRs mainly follows the same promoter model, the RNA-dependent RNA polymerase activity-dependent model, or the independent promoter model. Functional studies of TANRs suggested that they are significantly involved in the versatile regulation of gene expression. For instance, at the transcriptional level, they can lead to transcriptional interference, induce the formation of gene loops, and participate in transcriptional termination. Furthermore, at the posttranscriptional level, they can act as microRNA sponges, and guide cleavage or modification of target RNAs. Here, we review current knowledge of the potential role of TANRs in the modulation of gene expression. In this review, we comprehensively summarize the current state of knowledge about TANRs, and discuss TANR nomenclature, relation to ncRNAs, cross-talk biogenesis pathways and potential functions. We further outline directions of future studies of TANRs, to promote investigations of this emerging and enigmatic category of RNA.
Affiliation(s)
- Wen-Juan Ni
- School of Basic Medicine, Gannan Medical University, Ganzhou, China
- Fuhua Xie
- School of Basic Medicine, Gannan Medical University, Ganzhou, China
- Xiao-Min Leng
- School of Basic Medicine, Gannan Medical University, Ganzhou, China
|
35
|
Gutiérrez-Flores J, Hernández-Lemus E, Cortés-Guzmán F, Ramos E. Do weak interactions affect the biological behavior of DNA? A DFT study of CpG island-like chains. J Mol Model 2020; 26:266. [PMID: 32918237 DOI: 10.1007/s00894-020-04501-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 04/06/2020] [Accepted: 08/03/2020] [Indexed: 01/06/2023]
Abstract
The origin, stability, and contribution to the formation of noncovalent interactions, such as hydrogen bonds and π-π stacking, have already been widely discussed. However, there are few discussions about the relevance of these weak interactions in DNA performance. In this work, we seek to shed light on the effect of hydrogen bonds and π-π stacking interactions on the biological behavior of DNA through the description of these intermolecular forces in CpG island-like (GC-rich) chains. Furthermore, we made some comparisons with TATA box-like (TA-rich) chains in order to describe hydrogen bond and π-π stacking interactions as a function of the DNA sequence. For hydrogen bonds, we found no significant effect related to the number of base pairs, whereas for π-π stacking interactions, the energy tended to decrease as the number of base pairs increased. We observed anticooperative effects for both hydrogen bonds and π-π stacking interactions. These results are in contrast with those of TATA box-like chains, since cooperative and additive effects were found for hydrogen bonds and π-π stacking, respectively. Based on the chemical hardness and density of states, we can conclude that proteins may interact more easily with GC-rich chains. We conclude that, regardless of the chain length, a protein could interact more easily with these genomic regions because the π-π stacking energies did not increase as a function of the number of base pairs, providing, for the first time, an approximation of the influence of noncovalent interactions on DNA behavior. We did all this work by means of the DFT framework included in the DMol3 code (M06-L/DNP). Graphical abstract: Cartoon representation of how noncovalent interactions affect the interaction of DNA with a protein, i.e., how hydrogen bond and π-π stacking interactions influence the biological behavior of DNA.
Affiliation(s)
- Jorge Gutiérrez-Flores
- Instituto de Investigaciones en Materiales, Universidad Nacional Autónoma de México, Circuito Exterior s/n, Ciudad Universitaria, Coyoacán, 04510, CDMX, México
- Fernando Cortés-Guzmán
- Instituto de Química, Universidad Nacional Autónoma de México, Circuito Exterior s/n, Ciudad Universitaria, Coyoacán, 04510, CDMX, México
- Estrella Ramos
- Instituto de Investigaciones en Materiales, Universidad Nacional Autónoma de México, Circuito Exterior s/n, Ciudad Universitaria, Coyoacán, 04510, CDMX, México
|
36
|
Kimura T. [Non-coding Natural Antisense RNA: Mechanisms of Action in the Regulation of Target Gene Expression and Its Clinical Implications]. YAKUGAKU ZASSHI 2020; 140:687-700. [PMID: 32378673 DOI: 10.1248/yakushi.20-00002] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Indexed: 11/22/2022]
Abstract
Recent advances in high-throughput technologies have revealed that 75% of the human genome is transcribed to RNA, whereas only 3% of transcripts are translated into proteins. Consequently, many long non-coding RNAs (lncRNAs) have been identified, which has improved our understanding of the complexity of biological processes. LncRNAs comprise multiple classes of RNA transcripts that regulate the transcription, stability and translation of protein-coding genes in a genome. Natural antisense transcripts (NATs) form one such class, and the GENCODE v30 catalog contains 16193 lncRNA loci, of which 5611 are antisense loci. This review outlines our emerging understanding of lncRNAs, with a particular focus on how lncRNAs regulate gene expression using interferon-α1 (IFN-α1) mRNA and its antisense partner IFN-α1 antisense (as)RNA as an example. We have identified and characterized the asRNA that determines post-transcriptional IFN-α1 mRNA levels. IFN-α1 asRNA stabilizes IFN-α1 mRNA by cytoplasmic sense-antisense duplex formation, which may enhance the accessibility of an RNA stabilizer protein or decrease the affinity of an RNA decay factor for the RNA. IFN-α1 asRNA can also act as competing molecules in the competing endogenous (ce)RNA network with other members of the IFNA multigene family mRNAs/asRNAs, and other cellular mRNA transcripts. Furthermore, antisense oligoribonucleotides representing functional domains of IFN-α1 asRNA inhibit influenza virus proliferation in the respiratory tract of virus-infected animals. Thus, these findings support, at least in part, the rationale that dissecting the activity of NAT on gene expression regulation promises to reveal previously unanticipated biology, with potential to provide new therapeutic approaches to diseases.
Affiliation(s)
- Tominori Kimura
- Laboratory of Microbiology and Cell Biology, Department of Pharmacy, College of Pharmaceutical Sciences, Ritsumeikan University
|
37
|
Abstract
The questionable state of psychology as a science has been pointed out repeatedly over the last hundred years. Programs to overcome the obvious limitations of psychology have sometimes also been proposed, so far in vain. Zagaria and coauthors (this issue) bring the subject up again. They demonstrate that psychology today is characterized by the incoherence of definitions of core constructs and a lack of consensus in the scientific community. The authors also suggest that psychology would do better by adopting a research program of a specific form of evolutionary psychology. In this paper I show, mostly on the basis of my earlier works on the same subject, that the shortcomings of psychology today go much deeper than the authors of the target article have discussed. Psychology today is characterized by fundamental epistemological and methodological problems. As the same shortcomings characterize the version of evolutionary psychology advocated by Zagaria and coauthors, it is not the best candidate to ground the future of psychology. I suggest that psychology lacks a unifying psychology of a specific kind, whose basic principles were outlined by Vygotsky almost a century ago.
|
38
|
Abstract
Our understanding of the human genome has continuously expanded since its draft publication in 2001. Over the years, novel assays have allowed us to progressively overlay layers of knowledge above the raw sequence of A's, T's, G's, and C's. The reference human genome sequence is now a complex knowledge base maintained under the shared stewardship of multiple specialist communities. Its complexity stems from the fact that it is simultaneously a template for transcription, a record of evolution, a vehicle for genetics, and a functional molecule. In short, the human genome serves as a frame of reference at the intersection of a diversity of scientific fields. In recent years, the progressive fall in sequencing costs has given increasing importance to the quality of the human reference genome, as hundreds of thousands of individuals are being sequenced yearly, often for clinical applications. Also, novel sequencing-based assays shed light on novel functions of the genome, especially with respect to gene expression regulation. Keeping the human genome annotation up to date and accurate is therefore an ongoing partnership between reference annotation projects and the greater community worldwide.
Affiliation(s)
- Daniel R Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton CB10 1SD, United Kingdom
- Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton CB10 1SD, United Kingdom
- Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton CB10 1SD, United Kingdom
|
39
|
Yong SY, Raben TG, Lello L, Hsu SDH. Genetic architecture of complex traits and disease risk predictors. Sci Rep 2020; 10:12055. [PMID: 32694572 PMCID: PMC7374622 DOI: 10.1038/s41598-020-68881-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 02/25/2020] [Accepted: 06/30/2020] [Indexed: 01/30/2023]
Abstract
Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity, i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied, a large amount of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits; i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.
Affiliation(s)
- Soke Yuen Yong
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA
- Timothy G Raben
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA
- Louis Lello
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA; Genomic Prediction, North Brunswick, NJ, USA
- Stephen D H Hsu
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA; Genomic Prediction, North Brunswick, NJ, USA
|
40
|
Stanford BC, Clake DJ, Morris MR, Rogers SM. The power and limitations of gene expression pathway analyses toward predicting population response to environmental stressors. Evol Appl 2020; 13:1166-1182. [PMID: 32684953 PMCID: PMC7359838 DOI: 10.1111/eva.12935] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 11/20/2019] [Revised: 02/03/2020] [Accepted: 02/05/2020] [Indexed: 12/16/2022]
Abstract
Rapid environmental changes impact the global distribution and abundance of species, highlighting the urgency to understand and predict how populations will respond. The analysis of differentially expressed genes has elucidated areas of the genome involved in adaptive divergence to past and present environmental change. Such studies however have been hampered by large numbers of differentially expressed genes and limited knowledge of how these genes work in conjunction with each other. Recent methods (broadly termed "pathway analyses") have emerged that aim to group genes that behave in a coordinated fashion to a factor of interest. These methods aid in functional annotation and uncovering biological pathways, thereby collapsing complex datasets into more manageable units, providing more nuanced understandings of both the organism-level effects of modified gene expression, and the targets of adaptive divergence. Here, we reanalyze a dataset that investigated temperature-induced changes in gene expression in marine-adapted and freshwater-adapted threespine stickleback (Gasterosteus aculeatus), using Weighted Gene Co-expression Network Analysis (WGCNA) with PANTHER Gene Ontology (GO)-Slim overrepresentation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Six modules exhibited a conserved response and six a divergent response between marine and freshwater stickleback when acclimated to 7°C or 22°C. One divergent module showed freshwater-specific response to temperature, and the remaining divergent modules showed differences in height of reaction norms. PPARAa, a transcription factor that regulates fatty acid metabolism and has been implicated in adaptive divergence, was located in a module that had higher expression at 7°C and in freshwater stickleback. This updated methodology revealed patterns that were not found in the original publication. 
Although such methods hold promise toward predicting population response to environmental stressors, many limitations remain, particularly with regard to module expression representation, database resources, and cross-database integration.
Affiliation(s)
- Danielle J. Clake
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
- Sean M. Rogers
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
- Bamfield Marine Sciences Centre, Bamfield, BC, Canada
|
41
|
Mahood EH, Kruse LH, Moghe GD. Machine learning: A powerful tool for gene function prediction in plants. APPLICATIONS IN PLANT SCIENCES 2020; 8:e11376. [PMID: 32765975 PMCID: PMC7394712 DOI: 10.1002/aps3.11376] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Received: 10/01/2019] [Accepted: 03/19/2020] [Indexed: 05/06/2023]
Abstract
Recent advances in sequencing and informatic technologies have led to a deluge of publicly available genomic data. While it is now relatively easy to sequence, assemble, and identify genic regions in diploid plant genomes, functional annotation of these genes is still a challenge. Over the past decade, there has been a steady increase in studies utilizing machine learning algorithms for various aspects of functional prediction, because these algorithms are able to integrate large amounts of heterogeneous data and detect patterns inconspicuous through rule-based approaches. The goal of this review is to introduce experimental plant biologists to machine learning, by describing how it is currently being used in gene function prediction to gain novel biological insights. In this review, we discuss specific applications of machine learning in identifying structural features in sequenced genomes, predicting interactions between different cellular components, and predicting gene function and organismal phenotypes. Finally, we also propose strategies for stimulating functional discovery using machine learning-based approaches in plants.
Affiliation(s)
- Elizabeth H. Mahood
- Plant Biology Section, School of Integrative Plant Sciences, Cornell University, Ithaca, New York 14853, USA
- Lars H. Kruse
- Plant Biology Section, School of Integrative Plant Sciences, Cornell University, Ithaca, New York 14853, USA
- Gaurav D. Moghe
- Plant Biology Section, School of Integrative Plant Sciences, Cornell University, Ithaca, New York 14853, USA
|
42
|
Sharma B, Taganna J. Genome-wide analysis of the U-box E3 ubiquitin ligase enzyme gene family in tomato. Sci Rep 2020; 10:9581. [PMID: 32533036 PMCID: PMC7293263 DOI: 10.1038/s41598-020-66553-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Received: 01/14/2020] [Accepted: 05/18/2020] [Indexed: 12/15/2022]
Abstract
E3 ubiquitin ligases are central modifiers of plant signaling pathways that act by targeting proteins to the degradation pathway. U-box E3 ubiquitin ligases are a distinct class of E3 ligases that utilize intramolecular interactions for their scaffold stabilization. U-box E3 ubiquitin ligases are more prevalent in plants than in animals. However, the evolutionary aspects, genetic organization, and functional fate of the U-box E3 gene family in plant development, especially in tomato, are not well understood. In the present study, we have performed an in-silico genome-wide analysis of the U-box E3 ubiquitin ligase gene family in Solanum lycopersicum. We have identified 62 U-box genes with the U-box/Ub Fusion Degradation 2 (UFD2) domain. The chromosomal localization, phylogenetic analysis, gene structure, motifs, gene duplication, syntenic regions, promoter, physicochemical properties, and ontology were investigated. The U-box gene family showed significant conservation of the U-box domain throughout the gene family. Noticeable functional transitions were discerned among duplicated genes. The gene expression profiles of U-box E3 family members show involvement in abiotic and biotic stress signaling as well as hormonal pathways. We found remarkable participation of the U-box gene family in vegetative and reproductive tissue development. It is predicted to actively regulate flowering time and endosperm formation. Our study provides a comprehensive picture of the distribution, structural features, promoter elements, evolutionary relationships, and gene expression of the U-box gene family in tomato. We predict crucial participation of the U-box gene family in tomato plant development and stress responses.
Affiliation(s)
- Bhaskar Sharma
- TERI School of Advanced Studies, 10 Institutional Area, Vasant Kunj, New Delhi, Delhi, 110070, India.
- School of Life and Environmental Sciences, Faculty of Science, Engineering, and Built Environment, Deakin University, Geelong, VIC-3220, Australia.
- Joemar Taganna
- SciBiz Informatics, 2/F Unit 3 CFI Building, Maharlika Highway, Brgy. Guindapunan, Palo, Leyte, 6501, Philippines
|
43
|
Evolution of novel genes in three-spined stickleback populations. Heredity (Edinb) 2020; 125:50-59. [PMID: 32499660 PMCID: PMC7413265 DOI: 10.1038/s41437-020-0319-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 11/24/2019] [Revised: 04/27/2020] [Accepted: 04/30/2020] [Indexed: 12/22/2022] Open
Abstract
Eukaryotic genomes frequently acquire new protein-coding genes which may significantly impact an organism’s fitness. Novel genes can be created, for example, by duplication of large genomic regions or de novo, from previously non-coding DNA. Either way, creation of a novel transcript is an essential early step during novel gene emergence. Most studies on the gain-and-loss dynamics of novel genes so far have compared genomes between species, constraining analyses to genes that have remained fixed over long time scales. However, the importance of novel genes for rapid adaptation among populations has recently been shown. Therefore, since little is known about the evolutionary dynamics of transcripts across natural populations, we here study transcriptomes from several tissues and nine geographically distinct populations of an ecological model species, the three-spined stickleback. Our findings suggest that novel genes typically start out as transcripts with low expression and high tissue specificity. Early expression regulation appears to be mediated by gene-body methylation. Although most new and narrowly expressed genes are rapidly lost, those that survive and subsequently spread through populations tend to gain broader and higher expression levels. The properties of the encoded proteins, such as disorder and aggregation propensity, hardly change. Correspondingly, young novel genes are not preferentially under positive selection but older novel genes more often overlap with FST outlier regions. Taken together, expression of the surviving novel genes is rapidly regulated, probably via epigenetic mechanisms, while structural properties of encoded proteins are non-debilitating and might only change much later.
|
44
|
Understanding biochemistry: structure and function of nucleic acids. Essays Biochem 2020; 63:433-456. [PMID: 31652314 PMCID: PMC6822018 DOI: 10.1042/ebc20180038] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Received: 07/11/2019] [Revised: 08/22/2019] [Accepted: 09/02/2019] [Indexed: 11/17/2022]
Abstract
Nucleic acids, deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), carry genetic information which is read in cells to make the RNA and proteins by which living things function. The well-known structure of the DNA double helix allows this information to be copied and passed on to the next generation. In this article we summarise the structure and function of nucleic acids. The article includes a historical perspective and summarises some of the early work which led to our understanding of this important molecule and how it functions; many of these pioneering scientists were awarded Nobel Prizes for their work. We explain the structure of the DNA molecule, how it is packaged into chromosomes and how it is replicated prior to cell division. We look at how the concept of the gene has developed since the term was first coined and how DNA is copied into RNA (transcription) and translated into protein (translation).
|
45
|
Qu J, Zhang J, Zellmer L, He Y, Liu S, Wang C, Yuan C, Xu N, Huang H, Liao DJ. About three-fourths of mouse proteins unexpectedly appear at a low position of SDS-PAGE, often as additional isoforms, questioning whether all protein isoforms have been eliminated in gene-knockout cells or organisms. Protein Sci 2020; 29:978-990. [PMID: 31930537 DOI: 10.1002/pro.3823] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 07/10/2019] [Revised: 01/01/2020] [Accepted: 01/05/2020] [Indexed: 01/08/2023]
Abstract
Most genes in evolutionarily complex genomes are expressed as multiple protein isoforms, but there is not yet any simple high-throughput approach to identify these isoforms. Using an oversimplified top-down LC-MS/MS strategy, we detected, around the 26-kD position of SDS-PAGE, proteins produced from 782 genes in a Cdk4-/- mouse embryonic fibroblast cell line. Interestingly, only 213 (27.24%, about one-fourth) of these 782 genes have proteins with a theoretical molecular mass (TMM) within 10% of 26 kD, that is, between 23 and 29 kD, the range set as allowed variation in SDS-PAGE. These 213 proteins are considered as the wild type (WT). The remaining three-fourths includes proteins from 66 (8.44%) genes with a TMM smaller than 23 kD and proteins from 503 (64.32%, nearly two-thirds) genes with a TMM larger than 29 kD; these proteins are categorized into a larger-group or a smaller-group, respectively, for their appearance at a higher or lower position of SDS-PAGE. For instance, at this 26-kD position we detected proteins from the Rps27a, Snrpf, Hist1h4a, and Rps25 genes, whose proteins' TMM is 8.6, 9.7, 11.4, and 13.7 kD, respectively, and detected proteins from the Plelc1 and Prkdc genes, whose largest isoform is 533.9 and 471.1 kD, respectively. We extrapolate that many of the proteins migrating unexpectedly in SDS-PAGE may be isoforms besides the WT protein. Moreover, we also detected a Cdk4 protein in this Cdk4-/- cell line, which raises the question of whether some other gene-knockout cells or organisms show similar incompleteness of the knockout.
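The group percentages quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is not from the paper: it only recomputes each group's share of the 782 detected genes from the counts given above, and the variable names and group labels are illustrative assumptions.

```python
# Gene counts quoted in the abstract (proteins detected near the 26-kD position).
TOTAL_GENES = 782
counts = {
    "TMM within 23-29 kD (WT)": 213,
    "TMM below 23 kD": 66,
    "TMM above 29 kD": 503,
}

# The three groups should partition the full set of detected genes.
assert sum(counts.values()) == TOTAL_GENES

# Share of each group in percent, rounded to two decimals as in the abstract.
shares = {label: round(100 * n / TOTAL_GENES, 2) for label, n in counts.items()}

# Combined share of the two groups that migrate away from their expected position.
unexpected = round(100 * (counts["TMM below 23 kD"] + counts["TMM above 29 kD"]) / TOTAL_GENES, 2)

for label, pct in shares.items():
    print(f"{label}: {counts[label]} genes, {pct}%")
print(f"Outside the 23-29 kD window: {unexpected}%")
```

The combined share of the two out-of-window groups comes to a little under 73%, which is what the title rounds to "about three-fourths".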
Affiliation(s)
- Jiayuan Qu
- Department of Biochemistry, China Three Gorges University, Yichang, Hubei Province, China
- Ju Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- Lucas Zellmer
- Masonic Cancer Center, University of Minnesota, Minneapolis, Minnesota
- Yan He
- Key Lab of Endemic and Ethnic Diseases of The Ministry of Education of China in Guizhou Medical University, Guiyang, Guizhou Province, P. R. China
- Siqi Liu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- Chengfu Yuan
- Department of Biochemistry, China Three Gorges University, Yichang, Hubei Province, China
- Ningzhi Xu
- National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
- Hai Huang
- Center for Clinical Laboratories, The Affiliated Hospital of Guizhou Medical University, Guiyang, Guizhou Province, China
- Dezhong J Liao
- Laboratory for Core Facilities, The Second Hospital, Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou Province, China
|
46
|
Wheeler NR, Benchek P, Kunkle BW, Hamilton-Nelson KL, Warfe M, Fondran JR, Haines JL, Bush WS. Hadoop and PySpark for reproducibility and scalability of genomic sequencing studies. Pacific Symposium on Biocomputing 2020; 25:523-534. [PMID: 31797624 PMCID: PMC6956992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]
Abstract
Modern genomic studies are rapidly growing in scale, and the analytical approaches used to analyze genomic data are increasing in complexity. Genomic data management poses logistic and computational challenges, and analyses are increasingly reliant on genomic annotation resources that create their own data management and versioning issues. As a result, genomic datasets are increasingly handled in ways that limit the rigor and reproducibility of many analyses. In this work, we examine the use of the Spark infrastructure for the management, access, and analysis of genomic data in comparison to traditional genomic workflows on typical cluster environments. We validate the framework by reproducing previously published results from the Alzheimer's Disease Sequencing Project. Using the framework and analyses designed using Jupyter notebooks, Spark provides improved workflows, reduces user-driven data partitioning, and enhances the portability and reproducibility of distributed analyses required for large-scale genomic studies.
Affiliation(s)
- Nicholas R. Wheeler
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
- Penelope Benchek
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
- Brian W. Kunkle
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, 1501 NW 10th Ave, Miami, FL 33136, USA
- Kara L. Hamilton-Nelson
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, 1501 NW 10th Ave, Miami, FL 33136, USA
- Mike Warfe
- Cleveland Institute for Computational Biology, Center for Advanced Research Computing, University Technology, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
- Jeremy R. Fondran
- Cleveland Institute for Computational Biology, Center for Advanced Research Computing, University Technology, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
- Jonathan L. Haines
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
- William S. Bush
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
|
47
|
Van Regenmortel MH. Truth in science and in molecular recognition, post‐truth in human affairs. J Mol Recognit 2019; 33:e2827. [DOI: 10.1002/jmr.2827] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Indexed: 11/09/2022]
|
48
|
Rödelsperger C, Prabh N, Sommer RJ. New Gene Origin and Deep Taxon Phylogenomics: Opportunities and Challenges. Trends Genet 2019; 35:914-922. [DOI: 10.1016/j.tig.2019.08.007] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 06/27/2019] [Revised: 08/07/2019] [Accepted: 08/29/2019] [Indexed: 01/22/2023]
|
49
|
Prabhakar AR, Sreeja G, Naik SV. DNA finger printing of S. Mutans present in the saliva of caries active children and those associated with intellectual disability - An RAPD analysis. Saudi Dent J 2019; 31:424-430. [PMID: 31700219 PMCID: PMC6823829 DOI: 10.1016/j.sdentj.2019.04.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 12/19/2018] [Revised: 04/15/2019] [Accepted: 04/18/2019] [Indexed: 10/30/2022] Open
Abstract
Aim: The aim of this study is to evaluate and compare the diversity of S. mutans genotypes with respect to caries activity among normal children and intellectually disabled children, which would enable the clinician to plan better strategies for early caries detection, management and prevention.
Materials and methods: Genotyping of S. mutans was done by collecting saliva samples from 40 caries-active children (20 normal children and 20 children with intellectual disability) and performing random amplified polymorphic DNA analysis using three arbitrary primers (P1, P2, P3). Random amplified polymorphic DNA (RAPD) is preferred because of its reliability and reproducibility in generating genetic fingerprints of Streptococcus isolates.
Results: The bacterial count in Group I showed a mean of 111.6500, followed by Group II with a mean of 102.6500. Therefore, the difference in the number of bacterial counts was not significant between the two groups (p < 0.001). Genotype encoding Primer 1 was present in almost 82.5% of the total population of both groups. Genotype encoding Primer 2 was present in 95% of the total population, whereas genotype encoding Primer 3 was present in 20% of children with intellectual disability and 95% of normal children.
Interpretation and conclusion: There was no significant difference between the S. mutans count of normal caries-active children and that of caries-active children with intellectual disability, but there was a significant difference in the distribution of S. mutans genotypes between the two groups.
Affiliation(s)
- A R Prabhakar
- Department of Pedodontics and Preventive Dentistry, Bapuji Dental College and Hospital, Davangere, Karnataka 577004, India
- Gudla Sreeja
- Department of Pedodontics and Preventive Dentistry, Bapuji Dental College and Hospital, Davangere, Karnataka 577004, India
- Saraswatthi V Naik
- Department of Pedodontics and Preventive Dentistry, Bapuji Dental College and Hospital, Davangere, Karnataka 577004, India
|
50
|
Lee H, Zhang Z, Krause HM. Long Noncoding RNAs and Repetitive Elements: Junk or Intimate Evolutionary Partners? Trends Genet 2019; 35:892-902. [PMID: 31662190 DOI: 10.1016/j.tig.2019.09.006] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Received: 06/26/2019] [Revised: 08/22/2019] [Accepted: 09/13/2019] [Indexed: 12/27/2022]
Abstract
Our recent ability to sequence entire genomes, along with all of their transcribed RNAs, has led to the surprising finding that only ∼1% of the human genome is used to encode proteins. This finding has led to vigorous debate over the functional importance of the transcribed but untranslated portions of the genome. Currently, scientists tend to assume coding genes are functional until proven not to be, while the opposite is true for noncoding genes. This review takes a new look at the evidence for and against widespread noncoding gene functionality. We focus in particular on long noncoding RNA (noncoding RNAs longer than 200 nucleotides) genes and their 'junk' associates, transposable elements, and satellite repeats. Taken together, the suggestion put forward is that more of this junk DNA may be functional than nonfunctional and that noncoding RNAs and transposable elements act symbiotically to drive evolution.
Affiliation(s)
- Hyunmin Lee
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Zhaolei Zhang
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada; Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Henry M Krause
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Computer Science, University of Toronto, Toronto, ON, Canada.
|