1
|
da Silva JEH, de Carvalho PC, Camata JJ, de Oliveira IL, Bernardino HS. A Data-Distribution and Successive Spline Points based discretization approach for evolving gene regulatory networks from scRNA-Seq time-series data using Cartesian Genetic Programming. Biosystems 2024; 236:105126. [PMID: 38278505 DOI: 10.1016/j.biosystems.2024.105126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 11/18/2023] [Accepted: 01/19/2024] [Indexed: 01/28/2024]
Abstract
The inference of gene regulatory networks (GRNs) is a widely addressed problem in Systems Biology. GRNs can be modeled as Boolean networks, which is the simplest approach for this task. However, Boolean models need binarized data. Several approaches have been developed for the discretization of gene expression data (GED). Also, the advance of data extraction technologies, such as single-cell RNA-Sequencing (scRNA-Seq), provides a new vision of gene expression and brings new challenges for dealing with its specificities, such as a large occurrence of zero data. This work proposes a new discretization approach for dealing with scRNA-Seq time-series data, named Distribution and Successive Spline Points Discretization (DSSPD), which considers the data distribution and a proper preprocessing step. Here, Cartesian Genetic Programming (CGP) is used to infer GRNs using the results of DSSPD. The proposal is compared with CGP with the standard data handling and five state-of-the-art algorithms on curated models and experimental data. The results show that the proposal improves the results of CGP in all tested cases and outperforms the state-of-the-art algorithms in most cases.
Collapse
Affiliation(s)
| | | | - José J Camata
- Universidade Federal de Juiz de Fora, Juiz de Fora, MG, Brazil.
| | | | | |
Collapse
|
2
|
Genetic effects on longitudinal cognitive decline during the early stages of Alzheimer's disease. Sci Rep 2021; 11:19853. [PMID: 34615922 PMCID: PMC8494841 DOI: 10.1038/s41598-021-99310-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 09/22/2021] [Indexed: 11/08/2022] Open
Abstract
Cognitive decline in early-stage Alzheimer's disease (AD) may depend on genetic variability. In the Swedish BioFINDER study, we used polygenic scores (PGS) (for AD, intelligence, and educational attainment) to predict longitudinal cognitive change (measured by mini-mental state examination (MMSE) [primary outcome] and other cognitive tests) over a mean of 4.2 years. We included 260 β-amyloid (Aβ) negative cognitively unimpaired (CU) individuals, 121 Aβ-positive CU (preclinical AD), 50 Aβ-negative mild cognitive impairment (MCI) patients, and 127 Aβ-positive MCI patients (prodromal AD). Statistical significance was determined at Bonferroni corrected p value < 0.05. The PGS for intelligence (beta = 0.1, p = 2.9e-02) was protective against decline in MMSE in CU and MCI participants regardless of Aβ status. The polygenic risk score for AD (beta = - 0.12, p = 9.4e-03) was correlated with the rate of change in MMSE and was partially mediated by Aβ-pathology (mediation effect 20%). There was no effect of education PGS on cognitive measures. Genetic variants associated with intelligence mitigate cognitive decline independent of Aβ-pathology, while effects of genetic variants associated with AD are partly mediated by Aβ-pathology.
Collapse
|
3
|
Ong Q, Nguyen P, Thao NP, Le L. Bioinformatics Approach in Plant Genomic Research. Curr Genomics 2016; 17:368-78. [PMID: 27499685 PMCID: PMC4955030 DOI: 10.2174/1389202917666160331202956] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Revised: 09/11/2015] [Accepted: 09/18/2015] [Indexed: 11/22/2022] Open
Abstract
The advance in genomics technology leads to the dramatic change in plant biology research. Plant biologists now easily access to enormous genomic data to deeply study plant high-density genetic variation at molecular level. Therefore, fully understanding and well manipulating bioinformatics tools to manage and analyze these data are essential in current plant genome research. Many plant genome databases have been established and continued expanding recently. Meanwhile, analytical methods based on bioinformatics are also well developed in many aspects of plant genomic research including comparative genomic analysis, phylogenomics and evolutionary analysis, and genome-wide association study. However, constantly upgrading in computational infrastructures, such as high capacity data storage and high performing analysis software, is the real challenge for plant genome research. This review paper focuses on challenges and opportunities which knowledge and skills in bioinformatics can bring to plant scientists in present plant genomics era as well as future aspects in critical need for effective tools to facilitate the translation of knowledge from new sequencing data to enhancement of plant productivity.
Collapse
Affiliation(s)
- Quang Ong
- Plant Abiotic Stress Research Group, Ton Duc Thang University, Ho Chi Minh City, Vietnam
- Faculty of Applied Sciences, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Phuc Nguyen
- School of Biotechnology, International University, Vietnam National University, Ho Chi Minh City, Vietnam
| | - Nguyen Phuong Thao
- School of Biotechnology, International University, Vietnam National University, Ho Chi Minh City, Vietnam
| | - Ly Le
- School of Biotechnology, International University, Vietnam National University, Ho Chi Minh City, Vietnam
| |
Collapse
|
4
|
Pandey MK, Roorkiwal M, Singh VK, Ramalingam A, Kudapa H, Thudi M, Chitikineni A, Rathore A, Varshney RK. Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects. FRONTIERS IN PLANT SCIENCE 2016; 7:455. [PMID: 27199998 PMCID: PMC4852475 DOI: 10.3389/fpls.2016.00455] [Citation(s) in RCA: 99] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 03/24/2016] [Indexed: 05/19/2023]
Abstract
Legumes play a vital role in ensuring global nutritional food security and improving soil quality through nitrogen fixation. Accelerated higher genetic gains is required to meet the demand of ever increasing global population. In recent years, speedy developments have been witnessed in legume genomics due to advancements in next-generation sequencing (NGS) and high-throughput genotyping technologies. Reference genome sequences for many legume crops have been reported in the last 5 years. The availability of the draft genome sequences and re-sequencing of elite genotypes for several important legume crops have made it possible to identify structural variations at large scale. Availability of large-scale genomic resources and low-cost and high-throughput genotyping technologies are enhancing the efficiency and resolution of genetic mapping and marker-trait association studies. Most importantly, deployment of molecular breeding approaches has resulted in development of improved lines in some legume crops such as chickpea and groundnut. In order to support genomics-driven crop improvement at a fast pace, the deployment of breeder-friendly genomics and decision support tools seems appear to be critical in breeding programs in developing countries. This review provides an overview of emerging genomics and informatics tools/approaches that will be the key driving force for accelerating genomics-assisted breeding and ultimately ensuring nutritional and food security in developing countries.
Collapse
Affiliation(s)
- Manish K. Pandey
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Manish Roorkiwal
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Vikas K. Singh
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Abirami Ramalingam
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Himabindu Kudapa
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Mahendar Thudi
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Anu Chitikineni
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Abhishek Rathore
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
| | - Rajeev K. Varshney
- International Crops Research Institute for the Semi-Arid TropicsHyderabad, India
- The University of Western AustraliaCrawley, WA, Australia
| |
Collapse
|
5
|
Hashmi U, Shafqat S, Khan F, Majid M, Hussain H, Kazi AG, John R, Ahmad P. Plant exomics: concepts, applications and methodologies in crop improvement. PLANT SIGNALING & BEHAVIOR 2015; 10:e976152. [PMID: 25482786 PMCID: PMC4622497 DOI: 10.4161/15592324.2014.976152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2014] [Revised: 08/17/2014] [Accepted: 08/18/2014] [Indexed: 05/17/2023]
Abstract
Molecular breeding has a crucial role in improvement of crops. Conventional breeding techniques have failed to ameliorate food production. Next generation sequencing has established new concepts of molecular breeding. Exome sequencing has proven to be a significant tool for assessing natural evolution in plants, studying host pathogen interactions and betterment of crop production as exons assist in interpretation of allelic variation with respect to their phenotype. This review covers the platforms for exome sequencing, next generation sequencing technologies that have revolutionized exome sequencing and led toward development of third generation sequencing. Also discussed in this review are the uses of these sequencing technologies to improve wheat, rice and cotton yield and how these technologies are used in exploring the biodiversity of crops, providing better understanding of plant-host pathogen interaction and assessing the process of natural evolution in crops and it also covers how exome sequencing identifies the gene pool involved in symbiotic and other co-existential systems. Furthermore, we conclude how integration of other methodologies including whole genome sequencing, proteomics, transcriptomics and metabolomics with plant exomics covers the areas which are left untouched with exomics alone and in the end how these integration will transform the future of crops.
Collapse
Key Words
- BAC, bacterial artificial chromosome
- BGR, bacterial grain rot
- CBOL, consortium for 860 the barcode of life
- ETI, effector-triggered immunity
- HPRT, hypoxanthineguanine phosphoribosyl transferase
- MMs, molecular markers
- NGS, next generation sequencing
- NITSR, nuclear internal transcribed spacer region
- OPC, open promoter complex
- QTL, quantitative trait locus
- SMRT, single molecule real time
- SNPs, single nucleotide poly-morphisms
- SOLiD, sequencing by oligonucleotide ligation and detection
- WES, whole exome sequencing
- WGS, whole genome sequencing
- WGS, whole genome shotgun
- biodiversity
- crop improvement
- dNMPs, deoxyribosenucleoside monophosphates
- exome sequencing
- plant biotechnology
- plant-host pathogen interactions
Collapse
Affiliation(s)
- Uzair Hashmi
- Atta ur Rahman School of Applied Biosciences; National University of Sciences and Technology; Islamabad, Pakistan
| | - Samia Shafqat
- Atta ur Rahman School of Applied Biosciences; National University of Sciences and Technology; Islamabad, Pakistan
| | - Faria Khan
- Atta ur Rahman School of Applied Biosciences; National University of Sciences and Technology; Islamabad, Pakistan
| | - Misbah Majid
- Atta ur Rahman School of Applied Biosciences; National University of Sciences and Technology; Islamabad, Pakistan
| | - Harris Hussain
- Atta ur Rahman School of Applied Biosciences; National University of Sciences and Technology; Islamabad, Pakistan
| | - Alvina Gul Kazi
- Atta ur Rahman School of Applied Biosciences; National University of Sciences and Technology; Islamabad, Pakistan
| | - Riffat John
- Department of Botany; University of Kashmir; Jammu and Kashmir, India
| | - Parvaiz Ahmad
- Department of Botany; S.P. College Srinagar; Jammu and Kashmir, India
- Correspondence to: Parvaiz Ahmad;
| |
Collapse
|
6
|
Fan WL, Ng CS, Chen CF, Lu MYJ, Chen YH, Liu CJ, Wu SM, Chen CK, Chen JJ, Mao CT, Lai YT, Lo WS, Chang WH, Li WH. Genome-wide patterns of genetic variation in two domestic chickens. Genome Biol Evol 2013; 5:1376-92. [PMID: 23814129 PMCID: PMC3730349 DOI: 10.1093/gbe/evt097] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Domestic chickens are excellent models for investigating the genetic basis of phenotypic diversity, as numerous phenotypic changes in physiology, morphology, and behavior in chickens have been artificially selected. Genomic study is required to study genome-wide patterns of DNA variation for dissecting the genetic basis of phenotypic traits. We sequenced the genomes of the Silkie and the Taiwanese native chicken L2 at ∼23- and 25-fold average coverage depth, respectively, using Illumina sequencing. The reads were mapped onto the chicken reference genome (including 5.1% Ns) to 92.32% genome coverage for the two breeds. Using a stringent filter, we identified ∼7.6 million single-nucleotide polymorphisms (SNPs) and 8,839 copy number variations (CNVs) in the mapped regions; 42% of the SNPs have not found in other chickens before. Among the 68,906 SNPs annotated in the chicken sequence assembly, 27,852 were nonsynonymous SNPs located in 13,537 genes. We also identified hundreds of shared and divergent structural and copy number variants in intronic and intergenic regions and in coding regions in the two breeds. Functional enrichments of identified genetic variants were discussed. Radical nsSNP-containing immunity genes were enriched in the QTL regions associated with some economic traits for both breeds. Moreover, genetic changes involved in selective sweeps were detected. From the selective sweeps identified in our two breeds, several genes associated with growth, appetite, and metabolic regulation were identified. Our study provides a framework for genetic and genomic research of domestic chickens and facilitates the domestic chicken as an avian model for genomic, biomedical, and evolutionary studies.
Collapse
Affiliation(s)
- Wen-Lang Fan
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Kitchen JL, Allaby RG. Systems Modeling at Multiple Levels of Regulation: Linking Systems and Genetic Networks to Spatially Explicit Plant Populations. PLANTS (BASEL, SWITZERLAND) 2013; 2:16-49. [PMID: 27137364 PMCID: PMC4844292 DOI: 10.3390/plants2010016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 12/21/2012] [Accepted: 01/16/2013] [Indexed: 11/16/2022]
Abstract
Selection and adaptation of individuals to their underlying environments are highly dynamical processes, encompassing interactions between the individual and its seasonally changing environment, synergistic or antagonistic interactions between individuals and interactions amongst the regulatory genes within the individual. Plants are useful organisms to study within systems modeling because their sedentary nature simplifies interactions between individuals and the environment, and many important plant processes such as germination or flowering are dependent on annual cycles which can be disrupted by climate behavior. Sedentism makes plants relevant candidates for spatially explicit modeling that is tied in with dynamical environments. We propose that in order to fully understand the complexities behind plant adaptation, a system that couples aspects from systems biology with population and landscape genetics is required. A suitable system could be represented by spatially explicit individual-based models where the virtual individuals are located within time-variable heterogeneous environments and contain mutable regulatory gene networks. These networks could directly interact with the environment, and should provide a useful approach to studying plant adaptation.
Collapse
Affiliation(s)
- James L Kitchen
- School of Life Sciences, University of Warwick, Coventry, CV4 7AL, UK
| | - Robin G Allaby
- School of Life Sciences, University of Warwick, Coventry, CV4 7AL, UK.
| |
Collapse
|