1
|
Ricardo PC, Arias MC, de Souza Araujo N. Decoding bee cleptoparasitism through comparative transcriptomics of Coelioxoides waltheriae and its host Tetrapedia diversipes. Sci Rep 2024; 14:12361. [PMID: 38811580 PMCID: PMC11137135 DOI: 10.1038/s41598-024-56261-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 03/04/2024] [Indexed: 05/31/2024] Open
Abstract
Cleptoparasitism, also known as brood parasitism, is a widespread strategy among bee species in which the parasite lays eggs into the nests of the host species. Even though this behavior has significant ecological implications for the dynamics of several species, little is known about the molecular pathways associated with cleptoparasitism. To shed some light on this issue, we used gene expression data to perform a comparative analysis between two solitary neotropical bees: Coelioxoides waltheriae, an obligate parasite, and their specific host Tetrapedia diversipes. We found that ortholog genes involved in signal transduction, sensory perception, learning, and memory formation were differentially expressed between the cleptoparasite and the host. We hypothesize that these genes and their associated molecular pathways are engaged in cleptoparasitism-related processes and, hence, are appealing subjects for further investigation into functional and evolutionary aspects of cleptoparasitism in bees.
Collapse
Affiliation(s)
- Paulo Cseri Ricardo
- Departamento de Genética e Biologia Evolutiva - Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil.
| | - Maria Cristina Arias
- Departamento de Genética e Biologia Evolutiva - Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | | |
Collapse
|
2
|
Sharma N, Raman H, Wheeler D, Kalenahalli Y, Sharma R. Data-driven approaches to improve water-use efficiency and drought resistance in crop plants. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2023; 336:111852. [PMID: 37659733 DOI: 10.1016/j.plantsci.2023.111852] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 08/23/2023] [Accepted: 08/29/2023] [Indexed: 09/04/2023]
Abstract
With the increasing population, there lies a pressing demand for food, feed and fibre, while the changing climatic conditions pose severe challenges for agricultural production worldwide. Water is the lifeline for crop production; thus, enhancing crop water-use efficiency (WUE) and improving drought resistance in crop varieties are crucial for overcoming these challenges. Genetically-driven improvements in yield, WUE and drought tolerance traits can buffer the worst effects of climate change on crop production in dry areas. While traditional crop breeding approaches have delivered impressive results in increasing yield, the methods remain time-consuming and are often limited by the existing allelic variation present in the germplasm. Significant advances in breeding and high-throughput omics technologies in parallel with smart agriculture practices have created avenues to dramatically speed up the process of trait improvement by leveraging the vast volumes of genomic and phenotypic data. For example, individual genome and pan-genome assemblies, along with transcriptomic, metabolomic and proteomic data from germplasm collections, characterised at phenotypic levels, could be utilised to identify marker-trait associations and superior haplotypes for crop genetic improvement. In addition, these omics approaches enable the identification of genes involved in pathways leading to the expression of a trait, thereby providing an understanding of the genetic, physiological and biochemical basis of trait variation. These data-driven gene discoveries and validation approaches are essential for crop improvement pipelines, including genomic breeding, speed breeding and gene editing. Herein, we provide an overview of prospects presented using big data-driven approaches (including artificial intelligence and machine learning) to harness new genetic gains for breeding programs and develop drought-tolerant crop varieties with favourable WUE and high-yield potential traits.
Collapse
Affiliation(s)
- Niharika Sharma
- NSW Department of Primary Industries, Orange Agricultural Institute, Orange, NSW 2800, Australia.
| | - Harsh Raman
- NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW 2650, Australia
| | - David Wheeler
- NSW Department of Primary Industries, Orange Agricultural Institute, Orange, NSW 2800, Australia
| | - Yogendra Kalenahalli
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, Telangana 502324, India
| | - Rita Sharma
- Department of Biological Sciences, BITS Pilani, Pilani Campus, Rajasthan 333031, India
| |
Collapse
|
3
|
Dlamini SB, Saunders CJ, Laguette MJN, Gibbon A, Gamieldien J, Collins M, September AV. Application of an in silico approach identifies a genetic locus within ITGB2, and its interactions with HSPG2 and FGF9, to be associated with anterior cruciate ligament rupture risk. Eur J Sport Sci 2023; 23:2098-2108. [PMID: 36680346 DOI: 10.1080/17461391.2023.2171906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
We developed a Biomedical Knowledge Graph model that is phenotype and biological function-aware through integrating knowledge from multiple domains in a Neo4j, graph database. All known human genes were assessed through the model to identify potential new risk genes for anterior cruciate ligament (ACL) ruptures and Achilles tendinopathy (AT). Genes were prioritised and explored in a case-control study comparing participants with ACL ruptures (ACL-R), including a sub-group with non-contact mechanism injuries (ACL-NON), to uninjured control individuals (CON). After gene filtering, 3376 genes, including 411 genes identified through previous whole exome sequencing, were found to be potentially linked to AT and ACL ruptures. Four variants were prioritised: HSPG2:rs2291826A/G, HSPG2:rs2291827G/A, ITGB2:rs2230528C/T and FGF9:rs2274296C/T. The rs2230528 CC genotype was over-represented in the CON group compared to ACL-R (p < 0.001) and ACL-NON (p < 0.001) and the TT genotype and T allele were over-represented in the ACL-R group and ACL-NON compared to CON (p < 0.001) group. Several significant differences in distributions were noted for the gene-gene interactions: (HSPG2:rs2291826, rs2291827 and ITGB2:rs2230528) and (ITGB2:rs2230528 and FGF9:rs2297429). This study substantiates the efficiency of using a prior knowledge-driven in silico approach to identify candidate genes linked to tendon and ACL injuries. Our biomedical knowledge graph identified and, with further testing, highlighted novel associations of the ITGB2 gene which has not been explored in a genetic case control association study, with ACL rupture risk. We thus recommend a multistep approach including bioinformatics in conjunction with next generation sequencing technology to improve the discovery potential of genomics technologies in musculoskeletal soft tissue injuries.HighlightsA biomedical knowledge graph was modelled for musculoskeletal soft tissue injuries to efficiently identify candidate genes for genetic susceptibility analyses.The biomedical knowledge graph and sequencing data identified potential biologically relevant variants to explore susceptibility to common tendon and ligament injuries. Specifically genetic variants within the ITGB2 and FGF9 genes were associated with ACL risk.Novel allele combinations (HSPG2-ITGB2 and ITGB2-FGF9) showcase the potential effect of ITGB2 in influencing risk of ACL rupture.
Collapse
Affiliation(s)
- Senanile B Dlamini
- Division of Physiological Sciences, Department of Human Biology, University of Cape Town, Cape Town, South Africa
- Department of Human Biology, Health through Physical Activity Lifestyle and Sport Research Centre (HPALS), Newlands, South Africa
| | - Colleen J Saunders
- Division of Emergency Medicine, Department of Surgery, University of Cape Town, Cape Town, South Africa
- South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa
| | - Mary-Jessica N Laguette
- Division of Physiological Sciences, Department of Human Biology, University of Cape Town, Cape Town, South Africa
- Department of Human Biology, Health through Physical Activity Lifestyle and Sport Research Centre (HPALS), Newlands, South Africa
| | - Andrea Gibbon
- Division of Physiological Sciences, Department of Human Biology, University of Cape Town, Cape Town, South Africa
| | - Junaid Gamieldien
- South African National Bioinformatics Institute, University of the Western Cape, Cape Town, South Africa
| | - Malcolm Collins
- Division of Physiological Sciences, Department of Human Biology, University of Cape Town, Cape Town, South Africa
- Department of Human Biology, Health through Physical Activity Lifestyle and Sport Research Centre (HPALS), Newlands, South Africa
- Department of Human Biology, International Federation of Sports Medicine (FIMS) Collaborative Centre of Sports Medicine, University of Cape Town, Newlands, South Africa
| | - Alison V September
- Division of Physiological Sciences, Department of Human Biology, University of Cape Town, Cape Town, South Africa
- Department of Human Biology, Health through Physical Activity Lifestyle and Sport Research Centre (HPALS), Newlands, South Africa
- Department of Human Biology, International Federation of Sports Medicine (FIMS) Collaborative Centre of Sports Medicine, University of Cape Town, Newlands, South Africa
| |
Collapse
|
4
|
Youn J, Rai N, Tagkopoulos I. Knowledge integration and decision support for accelerated discovery of antibiotic resistance genes. Nat Commun 2022; 13:2360. [PMID: 35487919 PMCID: PMC9055065 DOI: 10.1038/s41467-022-29993-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 03/04/2022] [Indexed: 11/09/2022] Open
Abstract
We present a machine learning framework to automate knowledge discovery through knowledge graph construction, inconsistency resolution, and iterative link prediction. By incorporating knowledge from 10 publicly available sources, we construct an Escherichia coli antibiotic resistance knowledge graph with 651,758 triples from 23 triple types after resolving 236 sets of inconsistencies. Iteratively applying link prediction to this graph and wet-lab validation of the generated hypotheses reveal 15 antibiotic resistant E. coli genes, with 6 of them never associated with antibiotic resistance for any microbe. Iterative link prediction leads to a performance improvement and more findings. The probability of positive findings highly correlates with experimentally validated findings (R2 = 0.94). We also identify 5 homologs in Salmonella enterica that are all validated to confer resistance to antibiotics. This work demonstrates how evidence-driven decisions are a step toward automating knowledge discovery with high confidence and accelerated pace, thereby substituting traditional time-consuming and expensive methods.
Collapse
Affiliation(s)
- Jason Youn
- Department of Computer Science, University of California, Davis, CA, 95616, USA
- Genome Center, University of California, Davis, CA, 95616, USA
- USDA/NSF AI Institute for Next Generation Food Systems (AIFS), University of California, Davis, CA, 95616, USA
| | - Navneet Rai
- Department of Computer Science, University of California, Davis, CA, 95616, USA
- Genome Center, University of California, Davis, CA, 95616, USA
- USDA/NSF AI Institute for Next Generation Food Systems (AIFS), University of California, Davis, CA, 95616, USA
| | - Ilias Tagkopoulos
- Department of Computer Science, University of California, Davis, CA, 95616, USA.
- Genome Center, University of California, Davis, CA, 95616, USA.
- USDA/NSF AI Institute for Next Generation Food Systems (AIFS), University of California, Davis, CA, 95616, USA.
| |
Collapse
|
5
|
Lenz AR, Galán-Vásquez E, Balbinot E, de Abreu FP, Souza de Oliveira N, da Rosa LO, de Avila e Silva S, Camassola M, Dillon AJP, Perez-Rueda E. Gene Regulatory Networks of Penicillium echinulatum 2HH and Penicillium oxalicum 114-2 Inferred by a Computational Biology Approach. Front Microbiol 2020; 11:588263. [PMID: 33193246 PMCID: PMC7652724 DOI: 10.3389/fmicb.2020.588263] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 09/23/2020] [Indexed: 11/29/2022] Open
Abstract
Penicillium echinulatum 2HH and Penicillium oxalicum 114-2 are well-known cellulase fungal producers. However, few studies addressing global mechanisms for gene regulation of these two important organisms are available so far. A recent finding that the 2HH wild-type is closely related to P. oxalicum leads to a combined study of these two species. Firstly, we provide a global gene regulatory network for P. echinulatum 2HH and P. oxalicum 114-2, based on TF-TG orthology relationships, considering three related species with well-known regulatory interactions combined with TFBSs prediction. The network was then analyzed in terms of topology, identifying TFs as hubs, and modules. Based on this approach, we explore numerous identified modules, such as the expression of cellulolytic and xylanolytic systems, where XlnR plays a key role in positive regulation of the xylanolytic system. It also regulates positively the cellulolytic system by acting indirectly through the cellodextrin induction system. This remarkable finding suggests that the XlnR-dependent cellulolytic and xylanolytic regulatory systems are probably conserved in both P. echinulatum and P. oxalicum. Finally, we explore the functional congruency on the genes clustered in terms of communities, where the genes related to cellular nitrogen, compound metabolic process and macromolecule metabolic process were the most abundant. Therefore, our approach allows us to confer a degree of accuracy regarding the existence of each inferred interaction.
Collapse
Affiliation(s)
- Alexandre Rafael Lenz
- Unidad Académica Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de Mexico, Mérida, Mexico
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
- Departamento de Ciências Exatas e da Terra, Universidade do Estado da Bahia, Salvador, Brazil
| | - Edgardo Galán-Vásquez
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, Instituto de Investigaciones en Matemàticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de Mexico, Ciudad Universitaria, Mexico
| | - Eduardo Balbinot
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Fernanda Pessi de Abreu
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Nikael Souza de Oliveira
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Letícia Osório da Rosa
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Scheila de Avila e Silva
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Marli Camassola
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Aldo José Pinheiro Dillon
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Ernesto Perez-Rueda
- Unidad Académica Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de Mexico, Mérida, Mexico
- Facultad de Ciencias, Centro de Genómica y Bioinformática, Universidad Mayor, Santiago, Chile
| |
Collapse
|
6
|
Pourkheirandish M, Golicz AA, Bhalla PL, Singh MB. Global Role of Crop Genomics in the Face of Climate Change. FRONTIERS IN PLANT SCIENCE 2020; 11:922. [PMID: 32765541 PMCID: PMC7378793 DOI: 10.3389/fpls.2020.00922] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 06/05/2020] [Indexed: 05/05/2023]
Abstract
The development of climate change resilient crops is necessary if we are to meet the challenge of feeding the growing world's population. We must be able to increase food production despite the projected decrease in arable land and unpredictable environmental conditions. This review summarizes the technological and conceptual advances that have the potential to transform plant breeding, help overcome the challenges of climate change, and initiate the next plant breeding revolution. Recent developments in genomics in combination with high-throughput and precision phenotyping facilitate the identification of genes controlling critical agronomic traits. The discovery of these genes can now be paired with genome editing techniques to rapidly develop climate change resilient crops, including plants with better biotic and abiotic stress tolerance and enhanced nutritional value. Utilizing the genetic potential of crop wild relatives (CWRs) enables the domestication of new species and the generation of synthetic polyploids. The high-quality crop plant genome assemblies and annotations provide new, exciting research targets, including long non-coding RNAs (lncRNAs) and cis-regulatory regions. Metagenomic studies give insights into plant-microbiome interactions and guide selection of optimal soils for plant cultivation. Together, all these advances will allow breeders to produce improved, resilient crops in relatively short timeframes meeting the demands of the growing population and changing climate.
Collapse
Affiliation(s)
| | | | | | - Mohan B. Singh
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Parkville, VIC, Australia
| |
Collapse
|
7
|
Genetic Basis of Maize Resistance to Multiple Insect Pests: Integrated Genome-Wide Comparative Mapping and Candidate Gene Prioritization. Genes (Basel) 2020; 11:genes11060689. [PMID: 32599710 PMCID: PMC7349181 DOI: 10.3390/genes11060689] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 05/30/2020] [Accepted: 06/01/2020] [Indexed: 01/01/2023] Open
Abstract
Several species of herbivores feed on maize in field and storage setups, making the development of multiple insect resistance a critical breeding target. In this study, an association mapping panel of 341 tropical maize lines was evaluated in three field environments for resistance to fall armyworm (FAW), whilst bulked grains were subjected to a maize weevil (MW) bioassay and genotyped with Diversity Array Technology's single nucleotide polymorphisms (SNPs) markers. A multi-locus genome-wide association study (GWAS) revealed 62 quantitative trait nucleotides (QTNs) associated with FAW and MW resistance traits on all 10 maize chromosomes, of which, 47 and 31 were discovered at stringent Bonferroni genome-wide significance levels of 0.05 and 0.01, respectively, and located within or close to multiple insect resistance genomic regions (MIRGRs) concerning FAW, SB, and MW. Sixteen QTNs influenced multiple traits, of which, six were associated with resistance to both FAW and MW, suggesting a pleiotropic genetic control. Functional prioritization of candidate genes (CGs) located within 10-30 kb of the QTNs revealed 64 putative GWAS-based CGs (GbCGs) showing evidence of involvement in plant defense mechanisms. Only one GbCG was associated with each of the five of the six combined resistance QTNs, thus reinforcing the pleiotropy hypothesis. In addition, through in silico co-functional network inferences, an additional 107 network-based CGs (NbCGs), biologically connected to the 64 GbCGs, and differentially expressed under biotic or abiotic stress, were revealed within MIRGRs. The provided multiple insect resistance physical map should contribute to the development of combined insect resistance in maize.
Collapse
|
8
|
Scheben A, Chan CKK, Mansueto L, Mauleon R, Larmande P, Alexandrov N, Wing RA, McNally KL, Quesneville H, Edwards D. Progress in single-access information systems for wheat and rice crop improvement. Brief Bioinform 2020; 20:565-571. [PMID: 29659709 DOI: 10.1093/bib/bby016] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Improving productivity of the staple crops wheat and rice is essential to feed the growing global population, particularly in the context of a changing climate. However, current rates of yield gain are insufficient to support the predicted population growth. New approaches are required to accelerate the breeding process, and many of these are driven by the application of large-scale crop data. To leverage the substantial volumes and types of data that can be applied for precision breeding, the wheat and rice research communities are working towards the development of integrated systems to access and standardize the dispersed, heterogeneous available data. Here, we outline the initiatives of the International Wheat Information System (WheatIS) and the International Rice Informatics Consortium (IRIC) to establish Web-based single-access systems and data mining tools to make the available resources more accessible, drive discovery and accelerate the production of new crop varieties. We discuss the progress of WheatIS and IRIC towards unifying specialized wheat and rice databases and building custom software platforms to manage and interrogate these data. Single-access crop information systems will strengthen scientific collaboration, optimize the use of public research funds and help achieve the required yield gains in the two most important global food crops.
Collapse
Affiliation(s)
- Armin Scheben
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, 6009 Perth, WA, Australia
| | - Chon-Kit Kenneth Chan
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, 6009 Perth, WA, Australia
| | - Locedie Mansueto
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, The Philippines
| | - Ramil Mauleon
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, The Philippines
| | - Pierre Larmande
- IRD, UMR DIADE (Plant Diversity Adaptation and Development Research unit) , 911 Avenue Agropolis, 34394 Montpellier, France
| | - Nickolai Alexandrov
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, The Philippines
| | - Rod A Wing
- Arizona Genomics Institute, University of Arizona, Tucson, Arizona, USA
| | - Kenneth L McNally
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, The Philippines
| | | | - David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, 6009 Perth, WA, Australia
| |
Collapse
|
9
|
Weighill D, Tschaplinski TJ, Tuskan GA, Jacobson D. Data Integration in Poplar: 'Omics Layers and Integration Strategies. Front Genet 2019; 10:874. [PMID: 31608114 PMCID: PMC6773870 DOI: 10.3389/fgene.2019.00874] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2019] [Accepted: 08/20/2019] [Indexed: 12/20/2022] Open
Abstract
Populus trichocarpa is an important biofuel feedstock that has been the target of extensive research and is emerging as a model organism for plants, especially woody perennials. This research has generated several large ‘omics datasets. However, only few studies in Populus have attempted to integrate various data types. This review will summarize various ‘omics data layers, focusing on their application in Populus species. Subsequently, network and signal processing techniques for the integration and analysis of these data types will be discussed, with particular reference to examples in Populus.
Collapse
Affiliation(s)
- Deborah Weighill
- The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Knoxville, TN, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Timothy J Tschaplinski
- The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Knoxville, TN, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Gerald A Tuskan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Daniel Jacobson
- The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Knoxville, TN, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| |
Collapse
|
10
|
Zolotareva O, Kleine M. A Survey of Gene Prioritization Tools for Mendelian and Complex Human Diseases. J Integr Bioinform 2019; 16:/j/jib.ahead-of-print/jib-2018-0069/jib-2018-0069.xml. [PMID: 31494632 PMCID: PMC7074139 DOI: 10.1515/jib-2018-0069] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Accepted: 07/12/2019] [Indexed: 12/16/2022] Open
Abstract
Modern high-throughput experiments provide us with numerous potential associations between genes and diseases. Experimental validation of all the discovered associations, let alone all the possible interactions between them, is time-consuming and expensive. To facilitate the discovery of causative genes, various approaches for prioritization of genes according to their relevance for a given disease have been developed. In this article, we explain the gene prioritization problem and provide an overview of computational tools for gene prioritization. Among about a hundred of published gene prioritization tools, we select and briefly describe 14 most up-to-date and user-friendly. Also, we discuss the advantages and disadvantages of existing tools, challenges of their validation, and the directions for future research.
Collapse
Affiliation(s)
- Olga Zolotareva
- Bielefeld University, Faculty of Technology and Center for Biotechnology, International Research Training Group "Computational Methods for the Analysis of the Diversity and Dynamics of Genomes" and Genome Informatics, Universitätsstraße 25, Bielefeld, Germany
| | - Maren Kleine
- Bielefeld University, Faculty of Technology, Bioinformatics/Medical Informatics Department, Universitätsstraße 25, Bielefeld, Germany
| |
Collapse
|
11
|
Guala D, Ogris C, Müller N, Sonnhammer ELL. Genome-wide functional association networks: background, data & state-of-the-art resources. Brief Bioinform 2019; 21:1224-1237. [PMID: 31281921 PMCID: PMC7373183 DOI: 10.1093/bib/bbz064] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Revised: 04/29/2019] [Accepted: 05/04/2019] [Indexed: 02/06/2023] Open
Abstract
The vast amount of experimental data from recent advances in the field of high-throughput biology begs for integration into more complex data structures such as genome-wide functional association networks. Such networks have been used for elucidation of the interplay of intra-cellular molecules to make advances ranging from the basic science understanding of evolutionary processes to the more translational field of precision medicine. The allure of the field has resulted in rapid growth of the number of available network resources, each with unique attributes exploitable to answer different biological questions. Unfortunately, the high volume of network resources makes it impossible for the intended user to select an appropriate tool for their particular research question. The aim of this paper is to provide an overview of the underlying data and representative network resources as well as to mention methods of integration, allowing a customized approach to resource selection. Additionally, this report will provide a primer for researchers venturing into the field of network integration.
Collapse
Affiliation(s)
- Dimitri Guala
- Science for Life Laboratory, Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Christoph Ogris
- Computational Cell Maps, Institute of Computational Biology, Helmholtz Center Munich, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | - Nikola Müller
- Computational Cell Maps, Institute of Computational Biology, Helmholtz Center Munich, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | - Erik L L Sonnhammer
- Science for Life Laboratory, Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm University, Box 1031, 17121 Solna, Sweden
| |
Collapse
|
12
|
Rasheed A, Xia X. From markers to genome-based breeding in wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:767-784. [PMID: 30673804 DOI: 10.1007/s00122-019-03286-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 01/16/2019] [Indexed: 05/22/2023]
Abstract
Recent technological advances in wheat genomics provide new opportunities to uncover genetic variation in traits of breeding interest and enable genome-based breeding to deliver wheat cultivars for the projected food requirements for 2050. There has been tremendous progress in development of whole-genome sequencing resources in wheat and its progenitor species during the last 5 years. High-throughput genotyping is now possible in wheat not only for routine gene introgression but also for high-density genome-wide genotyping. This is a major transition phase to enable genome-based breeding to achieve progressive genetic gains to parallel to projected wheat production demands. These advances have intrigued wheat researchers to practice less pursued analytical approaches which were not practiced due to the short history of genome sequence availability. Such approaches have been successful in gene discovery and breeding applications in other crops and animals for which genome sequences have been available for much longer. These strategies include, (i) environmental genome-wide association studies in wheat genetic resources stored in genbanks to identify genes for local adaptation by using agroclimatic traits as phenotypes, (ii) haplotype-based analyses to improve the statistical power and resolution of genomic selection and gene mapping experiments, (iii) new breeding strategies for genome-based prediction of heterosis patterns in wheat, and (iv) ultimate use of genomics information to develop more efficient and robust genome-wide genotyping platforms to precisely predict higher yield potential and stability with greater precision. Genome-based breeding has potential to achieve the ultimate objective of ensuring sustainable wheat production through developing high yielding, climate-resilient wheat cultivars with high nutritional quality.
Collapse
Affiliation(s)
- Awais Rasheed
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China
- International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, 12 Zhongguancun South Street, Beijing, 100081, China
- Department of Plant Sciences, Quaid-i-Azam University, Islamabad, 45320, Pakistan
| | - Xianchun Xia
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China.
| |
Collapse
|
13
|
Abstract
Gene discovery and government regulation are bottlenecks for the widespread adoption of genome-edited crops. We propose a culture of sharing and integrating crop data to accelerate the discovery and prioritization of candidate genes, as well as a strong engagement with governments and the public to address environmental and health concerns and to achieve appropriate regulatory standards.
Collapse
Affiliation(s)
- Armin Scheben
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, 6009, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, 6009, Australia.
| |
Collapse
|
14
|
Kedaigle A, Fraenkel E. Turning omics data into therapeutic insights. Curr Opin Pharmacol 2018; 42:95-101. [PMID: 30149217 PMCID: PMC6204089 DOI: 10.1016/j.coph.2018.08.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 07/18/2018] [Accepted: 08/09/2018] [Indexed: 12/30/2022]
Abstract
Omics technologies have made it easier and cheaper to evaluate thousands of biological molecules at once. These advances have led to novel therapies approved for use in the clinic, elucidated the mechanisms behind disease-associated mutations, led to increased accuracy in disease subtyping and personalized medicine, and revealed novel uses and treatment regimes for existing drugs through drug repurposing and pharmacology studies. In this review, we summarize some of these milestones and discuss the potential of integrative analyses that combine multiple data types for further advances.
Collapse
Affiliation(s)
- Amanda Kedaigle
- Computational & Systems Biology Program and the Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Ernest Fraenkel
- Computational & Systems Biology Program and the Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.
| |
Collapse
|
15
|
Brandizi M, Singh A, Rawlings C, Hassani-Pak K. Towards FAIRer Biological Knowledge Networks Using a Hybrid Linked Data and Graph Database Approach. J Integr Bioinform 2018; 15:/j/jib.ahead-of-print/jib-2018-0023/jib-2018-0023.xml. [PMID: 30085931 PMCID: PMC6340125 DOI: 10.1515/jib-2018-0023] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2018] [Accepted: 06/07/2018] [Indexed: 01/01/2023] Open
Abstract
The speed and accuracy of new scientific discoveries – be it by humans or artificial intelligence – depends on the quality of the underlying data and on the technology to connect, search and share the data efficiently. In recent years, we have seen the rise of graph databases and semi-formal data models such as knowledge graphs to facilitate software approaches to scientific discovery. These approaches extend work based on formalised models, such as the Semantic Web. In this paper, we present our developments to connect, search and share data about genome-scale knowledge networks (GSKN). We have developed a simple application ontology based on OWL/RDF with mappings to standard schemas. We are employing the ontology to power data access services like resolvable URIs, SPARQL endpoints, JSON-LD web APIs and Neo4j-based knowledge graphs. We demonstrate how the proposed ontology and graph databases considerably improve search and access to interoperable and reusable biological knowledge (i.e. the FAIRness data principles).
Collapse
Affiliation(s)
- Marco Brandizi
- Rothamsted Research, Computational and Analytical Sciences Department, Harpenden, AL5 2JQ, UK
| | - Ajit Singh
- Rothamsted Research, Computational and Analytical Sciences Department, Harpenden, AL5 2JQ, UK
| | - Christopher Rawlings
- Rothamsted Research, Computational and Analytical Sciences Department, Harpenden, AL5 2JQ, UK
| | - Keywan Hassani-Pak
- Rothamsted Research, Computational and Analytical Sciences Department, Harpenden, AL5 2JQ, UK
| |
Collapse
|