5251
|
Refining the clustering coefficient for analysis of social and neural network data. SOCIAL NETWORK ANALYSIS AND MINING 2016. [DOI: 10.1007/s13278-016-0361-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
5252
|
Li J, Lin X, Teng Y, Qi S, Xiao D, Zhang J, Kang Y. A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization. PLoS One 2016; 11:e0159457. [PMID: 27415759 PMCID: PMC4944959 DOI: 10.1371/journal.pone.0159457] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 07/01/2016] [Indexed: 12/31/2022] Open
Abstract
Identification of disease-causing genes is a fundamental challenge for human health studies. The phenotypic similarity among diseases may reflect the interactions at the molecular level, and phenotype comparison can be used to predict disease candidate genes. Online Mendelian Inheritance in Man (OMIM) is a database of human genetic diseases and related genes that has become an authoritative source of disease phenotypes. However, disease phenotypes have been described by free text; thus, standardization of phenotypic descriptions is needed before diseases can be compared. Several disease phenotype networks have been established in OMIM using different standardization methods. Two of these networks are important for phenotypic similarity analysis: the first and most commonly used network (mimMiner) is standardized by medical subject heading, and the other network (resnikHPO) is the first to be standardized by human phenotype ontology. This paper comprehensively evaluates for the first time the accuracy of these two networks in gene prioritization based on protein–protein interactions using large-scale, leave-one-out cross-validation experiments. The results show that both networks can effectively prioritize disease-causing genes, and the approach that relates two diseases using a logistic function improves prioritization performance. Tanimoto, one of four methods for normalizing resnikHPO, generates a symmetric network and it performs similarly to mimMiner. Furthermore, an integration of these two networks outperforms either network alone in gene prioritization, indicating that these two disease networks are complementary.
Collapse
Affiliation(s)
- Jianhua Li
- Department of Biomedical Informatics, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
- Key Laboratory of Medical Image Computing of Northeastern University, Ministry of Education, Shenyang, Liaoning, China
| | - Xiaoyan Lin
- Department of Biomedical Informatics, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Yueyang Teng
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Shouliang Qi
- Key Laboratory of Medical Image Computing of Northeastern University, Ministry of Education, Shenyang, Liaoning, China
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Dayu Xiao
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Jianying Zhang
- Department of Biomedical Informatics, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
- Border Biomedical Research Center, Department of Biological Sciences, The University of Texas at El Paso, El Paso, Texas, United States of America
| | - Yan Kang
- Key Laboratory of Medical Image Computing of Northeastern University, Ministry of Education, Shenyang, Liaoning, China
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
- * E-mail:
| |
Collapse
|
5253
|
Schäfer RA, Voß B. VisualGraphX: interactive graph visualization within Galaxy. Bioinformatics 2016; 32:3525-3527. [PMID: 27412091 DOI: 10.1093/bioinformatics/btw414] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Revised: 06/06/2016] [Accepted: 06/23/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION We developed VisualGraphX, a web-based, interactive visualization tool for large-scale graphs. Current graph visualization tools that follow the rich-internet paradigm lack an interactive and scalable visualization of graph-based data. VisualGraphX aims to provide a universal graph visualization tool that empowers the users to efficiently explore the data for themselves at a large scale. It is available as a visualization plugin for the Galaxy platform, such that VisualGraphX can be integrated into custom analysis pipelines. AVAILABILITY AND IMPLEMENTATION VisualGraphX has been released as a visualization plugin for the Galaxy platform under AFL 3.0 and is available with instructions and application data at http://gitlab.com/comptrans/VisualGraphX/ CONTACT: bjoern.voss@ibvt.uni-stuttgart.de.
Collapse
Affiliation(s)
- Richard A Schäfer
- Institute of Biochemical Engineering, Computational Biology, University of Stuttgart, Allmandring, 31, 70569 Stuttgart, Germany
| | - Björn Voß
- Institute of Biochemical Engineering, Computational Biology, University of Stuttgart, Allmandring, 31, 70569 Stuttgart, Germany
| |
Collapse
|
5254
|
Steegenga WT, Mischke M, Lute C, Boekschoten MV, Lendvai A, Pruis MGM, Verkade HJ, van de Heijning BJM, Boekhorst J, Timmerman HM, Plösch T, Müller M, Hooiveld GJEJ. Maternal exposure to a Western-style diet causes differences in intestinal microbiota composition and gene expression of suckling mouse pups. Mol Nutr Food Res 2016; 61. [PMID: 27129739 PMCID: PMC5215441 DOI: 10.1002/mnfr.201600141] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2016] [Revised: 03/25/2016] [Accepted: 04/13/2016] [Indexed: 12/14/2022]
Abstract
Scope The long‐lasting consequences of nutritional programming during the early phase of life have become increasingly evident. The effects of maternal nutrition on the developing intestine are still underexplored. Methods and results In this study, we observed (1) altered microbiota composition of the colonic luminal content, and (2) differential gene expression in the intestinal wall in 2‐week‐old mouse pups born from dams exposed to a Western‐style (WS) diet during the perinatal period. A sexually dimorphic effect was found for the differentially expressed genes in the offspring of WS diet‐exposed dams but no differences between male and female pups were found for the microbiota composition. Integrative analysis of the microbiota and gene expression data revealed that the maternal WS diet independently affected gene expression and microbiota composition. However, the abundance of bacterial families not affected by the WS diet (Bacteroidaceae, Porphyromonadaceae, and Lachnospiraceae) correlated with the expression of genes playing a key role in intestinal development and functioning (e.g. Pitx2 and Ace2). Conclusion Our data reveal that maternal consumption of a WS diet during the perinatal period alters both gene expression and microbiota composition in the intestinal tract of 2‐week‐old offspring.
Collapse
Affiliation(s)
- Wilma T Steegenga
- Nutrition, Metabolism, and Genomics Group, Division of Human Nutrition, Wageningen University, Wageningen, The Netherlands
| | - Mona Mischke
- Nutrition, Metabolism, and Genomics Group, Division of Human Nutrition, Wageningen University, Wageningen, The Netherlands
| | - Carolien Lute
- Nutrition, Metabolism, and Genomics Group, Division of Human Nutrition, Wageningen University, Wageningen, The Netherlands
| | - Mark V Boekschoten
- Nutrition, Metabolism, and Genomics Group, Division of Human Nutrition, Wageningen University, Wageningen, The Netherlands
| | - Agnes Lendvai
- Center for Liver, Digestive and Metabolic Diseases, Department of Pediatrics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Maurien G M Pruis
- Center for Liver, Digestive and Metabolic Diseases, Department of Pediatrics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Henkjan J Verkade
- Center for Liver, Digestive and Metabolic Diseases, Department of Pediatrics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | | | | | | | - Torsten Plösch
- Department of Obstetrics and Gynaecology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Michael Müller
- Nutrigenomics and Systems Nutrition, Norwich Medical School, University of East Anglia, Norwich, UK
| | - Guido J E J Hooiveld
- Nutrition, Metabolism, and Genomics Group, Division of Human Nutrition, Wageningen University, Wageningen, The Netherlands
| |
Collapse
|
5255
|
Genetic susceptibility to dyslipidemia and incidence of cardiovascular disease depending on a diet quality index in the Malmö Diet and Cancer cohort. GENES AND NUTRITION 2016; 11:20. [PMID: 27551321 PMCID: PMC4968442 DOI: 10.1186/s12263-016-0536-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 06/27/2016] [Indexed: 02/08/2023]
Abstract
BACKGROUND By taking diet quality into account, we may clarify the relationship between genetically elevated triglycerides (TG) and low-density lipoprotein-cholesterol (LDL-C), and better understand the inconsistent results regarding genetically elevated high-density lipoprotein-cholesterol (HDL-C), and cardiovascular disease (CVD) risk. METHODS We included 24,799 participants (62 % women, age 44-74 years) from the Malmö Diet and Cancer cohort. During a mean follow-up time of 15 years, 3068 incident CVD cases (1814 coronary and 1254 ischemic stroke) were identified. Genetic risk scores (GRSs) were constructed by combining 80 validated genetic variants associated with higher TG and LDL-C or lower HDL-C. The participants' dietary intake, assessed by a modified diet history method, was ranked according to a diet quality index that included six dietary components: saturated fat, polyunsaturated fat, fish, fiber, fruit and vegetables, and sucrose. RESULTS The GRSLDL-C (P = 5 × 10(-6)) and GRSHDL-C (P = 0.02) but not GRSTG (P = 0.08) were significantly associated with CVD risk. No significant interaction between the GRSs and diet quality was observed on CVD risk (P > 0.39). A high compared to a low diet quality attenuated the association between GRSLDL-C and the risk of incident ischemic stroke (P interaction = 0.01). CONCLUSION We found some evidence of an interaction between diet quality and GRSLDL-C on ischemic stroke.
Collapse
|
5256
|
Meher PK, Sahu TK, Rao AR. Identification of species based on DNA barcode using k-mer feature vector and Random forest classifier. Gene 2016; 592:316-24. [PMID: 27393648 DOI: 10.1016/j.gene.2016.07.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Revised: 07/02/2016] [Accepted: 07/04/2016] [Indexed: 11/17/2022]
Abstract
DNA barcoding is a molecular diagnostic method that allows automated and accurate identification of species based on a short and standardized fragment of DNA. To this end, an attempt has been made in this study to develop a computational approach for identifying the species by comparing its barcode with the barcode sequence of known species present in the reference library. Each barcode sequence was first mapped onto a numeric feature vector based on k-mer frequencies and then Random forest methodology was employed on the transformed dataset for species identification. The proposed approach outperformed similarity-based, tree-based, diagnostic-based approaches and found comparable with existing supervised learning based approaches in terms of species identification success rate, while compared using real and simulated datasets. Based on the proposed approach, an online web interface SPIDBAR has also been developed and made freely available at http://cabgrid.res.in:8080/spidbar/ for species identification by the taxonomists.
Collapse
Affiliation(s)
- Prabina Kumar Meher
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi 110012, India.
| | - Tanmaya Kumar Sahu
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi 110012, India.
| | - A R Rao
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi 110012, India.
| |
Collapse
|
5257
|
Irvin MR, Rotroff DM, Aslibekyan S, Zhi D, Hidalgo B, Motsinger A, Marvel S, Srinivasasainagendra V, Claas SA, Buse JB, Straka RJ, Ordovas JM, Borecki IB, Guo X, Chen IYD, Rotter JI, Wagner MJ, Arnett DK. A genome-wide study of lipid response to fenofibrate in Caucasians: a combined analysis of the GOLDN and ACCORD studies. Pharmacogenet Genomics 2016; 26:324-33. [PMID: 27002377 PMCID: PMC4986826 DOI: 10.1097/fpc.0000000000000219] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
BACKGROUND Fibrates are commonly prescribed for hypertriglyceridemia, but they also lower LDL cholesterol and increase HDL cholesterol. Large interindividual variations in lipid response suggest that some patients may benefit more than others and genetic studies could help identify such patients. METHODS We carried out the first genome-wide association study of lipid response to fenofibrate using data from two well-characterized clinical trials: the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) Study and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Study. Genome-wide association study data from both studies were imputed to the 1000 Genomes CEU reference panel (phase 1). Lipid response was modeled as the log ratio of the post-treatment lipid level to the pretreatment level. Linear mixed models (GOLDN, N=813 from 173 families) and linear regression models (ACCORD, N=781) adjusted for pretreatment lipid level, demographic variables, clinical covariates, and ancestry were used to evaluate the association of genetic markers with lipid response. Among Caucasians, the results were combined using inverse-variance weighted fixed-effects meta-analyses. The main findings from the meta-analyses were examined in other ethnic groups from the HyperTG study (N=267 Hispanics) and ACCORD (N=83 Hispanics, 138 African Americans). RESULTS A known lipid locus harboring the pre-B-cell leukemia homeobox 4 (PBX4) gene on chromosome 19 is important for LDL cholesterol response to fenofibrate (smallest P=1.5×10). The main results replicated with nominal statistical significance in Hispanics from ACCORD (P<0.05). CONCLUSION Future research should evaluate the usefulness of this locus to refine clinical strategies for lipid-lowering treatments.
Collapse
Affiliation(s)
- Marguerite R Irvin
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham AL 35294, USA
| | - Daniel M Rotroff
- Statistics Department, North Carolina State University, Raleigh, NC 27695, USA
| | - Stella Aslibekyan
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham AL 35294, USA
| | - Degui Zhi
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham AL 35294, USA
| | - Bertha Hidalgo
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham AL 35294, USA
| | - Alison Motsinger
- Statistics Department, North Carolina State University, Raleigh, NC 27695, USA
| | - Skylar Marvel
- Statistics Department, North Carolina State University, Raleigh, NC 27695, USA
| | | | - Steven A Claas
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham AL 35294, USA
| | - John B. Buse
- Diabetes Center for Research, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill NC 27599, USA
| | - Robert J Straka
- Department of Experimental and Clinical Pharmacology, University of Minnesota, Minneapolis, MN 55455, USA
| | - Jose M Ordovas
- Jean Mayer USDA Human Nutrition Research Center on Aging, Tufts University, Boston, MA 02115, USA
| | - Ingrid B Borecki
- Division of Statistical Genomics, Department of Genetics, Washington University in St Louis, St Louis, MO 63108, USA
| | - Xiuqing Guo
- Laboratory of Statistical and Mathematical Genetics, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
| | - Ida YD Chen
- Laboratory for Biochemistry, Molecular Phenotyping, and Microarray, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
| | - Jerome I Rotter
- Institute for Translational Genomics and Population Sciences, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
| | - Michael J Wagner
- Center for Pharmacogenomics and Individualized Therapy, University of North Carolina at Chapel Hill, Chapel Hill NC 27599, USA
| | - Donna K Arnett
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham AL 35294, USA
| |
Collapse
|
5258
|
Richesson RL, Sun J, Pathak J, Kho AN, Denny JC. Clinical phenotyping in selected national networks: demonstrating the need for high-throughput, portable, and computational methods. Artif Intell Med 2016; 71:57-61. [PMID: 27506131 PMCID: PMC5480212 DOI: 10.1016/j.artmed.2016.05.005] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Accepted: 05/30/2016] [Indexed: 12/22/2022]
Abstract
OBJECTIVE The combination of phenomic data from electronic health records (EHR) and clinical data repositories with dense biological data has enabled genomic and pharmacogenomic discovery, a first step toward precision medicine. Computational methods for the identification of clinical phenotypes from EHR data will advance our understanding of disease risk and drug response, and support the practice of precision medicine on a national scale. METHODS Based on our experience within three national research networks, we summarize the broad approaches to clinical phenotyping and highlight the important role of these networks in the progression of high-throughput phenotyping and precision medicine. We provide supporting literature in the form of a non-systematic review. RESULTS The practice of clinical phenotyping is evolving to meet the growing demand for scalable, portable, and data driven methods and tools. The resources required for traditional phenotyping algorithms from expert defined rules are significant. In contrast, machine learning approaches that rely on data patterns will require fewer clinical domain experts and resources. CONCLUSIONS Machine learning approaches that generate phenotype definitions from patient features and clinical profiles will result in truly computational phenotypes, derived from data rather than experts. Research networks and phenotype developers should cooperate to develop methods, collaboration platforms, and data standards that will enable computational phenotyping and truly modernize biomedical research and precision medicine.
Collapse
Affiliation(s)
- Rachel L Richesson
- Duke University School of Nursing, 311 Trent Drive, Durham, NC 27710 USA.
| | - Jimeng Sun
- School of Computational Science and Engineering, Georgia Institute of Technology, 266 Ferst Drive, Atlanta, GA 30313, USA.
| | - Jyotishman Pathak
- Department of Health Sciences Research, 200 1st Street SW, Mayo Clinic, Rochester, MN, 55905, USA.
| | - Abel N Kho
- Departments of Medicine and Preventive Medicine, Northwestern University, 633 N St. Clair St. 20th floor. Chicago IL 60611, USA.
| | - Joshua C Denny
- Departments of Biomedical Informatics and Medicine, Vanderbilt University, 2525 West End Ave, Suite 672, Nashville, TN 37203, USA.
| |
Collapse
|
5259
|
Krishnan ML, Wang Z, Silver M, Boardman JP, Ball G, Counsell SJ, Walley AJ, Montana G, Edwards AD. Possible relationship between common genetic variation and white matter development in a pilot study of preterm infants. Brain Behav 2016; 6:e00434. [PMID: 27110435 PMCID: PMC4821839 DOI: 10.1002/brb3.434] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Revised: 12/16/2015] [Accepted: 12/19/2015] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND The consequences of preterm birth are a major public health concern with high rates of ensuing multisystem morbidity, and uncertain biological mechanisms. Common genetic variation may mediate vulnerability to the insult of prematurity and provide opportunities to predict and modify risk. OBJECTIVE To gain novel biological and therapeutic insights from the integrated analysis of magnetic resonance imaging and genetic data, informed by prior knowledge. METHODS We apply our previously validated pathway-based statistical method and a novel network-based method to discover sources of common genetic variation associated with imaging features indicative of structural brain damage. RESULTS Lipid pathways were highly ranked by Pathways Sparse Reduced Rank Regression in a model examining the effect of prematurity, and PPAR (peroxisome proliferator-activated receptor) signaling was the highest ranked pathway once degree of prematurity was accounted for. Within the PPAR pathway, five genes were found by Graph Guided Group Lasso to be highly associated with the phenotype: aquaporin 7 (AQP7), malic enzyme 1, NADP(+)-dependent, cytosolic (ME1), perilipin 1 (PLIN1), solute carrier family 27 (fatty acid transporter), member 1 (SLC27A1), and acetyl-CoA acyltransferase 1 (ACAA1). Expression of four of these (ACAA1, AQP7, ME1, and SLC27A1) is controlled by a common transcription factor, early growth response 4 (EGR-4). CONCLUSIONS This suggests an important role for lipid pathways in influencing development of white matter in preterm infants, and in particular a significant role for interindividual genetic variation in PPAR signaling.
Collapse
Affiliation(s)
- Michelle L Krishnan
- Centre for the Developing Brain King's College London St Thomas' Hospital London SE1 7EH UK
| | - Zi Wang
- Department of Biomedical Engineering King's College London St Thomas' Hospital London SE1 7EH UK
| | - Matt Silver
- Department of Population Health London School of Hygiene and Tropical Medicine London WC1E 7HT UK
| | - James P Boardman
- MRC Centre for Reproductive Health University of Edinburgh Edinburgh EH16 4TJ UK
| | - Gareth Ball
- Centre for the Developing Brain King's College London St Thomas' Hospital London SE1 7EH UK
| | - Serena J Counsell
- Centre for the Developing Brain King's College London St Thomas' Hospital London SE1 7EH UK
| | - Andrew J Walley
- School of Public Health Faculty of Medicine Imperial College London Norfolk Place London W2 1PG UK
| | - Giovanni Montana
- Department of Biomedical Engineering King's College London St Thomas' Hospital London SE1 7EH UK
| | - Anthony David Edwards
- Centre for the Developing Brain King's College London St Thomas' Hospital London SE1 7EH UK
| |
Collapse
|
5260
|
Ung MH, Liu CC, Cheng C. Integrative analysis of cancer genes in a functional interactome. Sci Rep 2016; 6:29228. [PMID: 27356765 PMCID: PMC4928112 DOI: 10.1038/srep29228] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Accepted: 06/16/2016] [Indexed: 11/09/2022] Open
Abstract
The post-genomic era has resulted in the accumulation of high-throughput cancer data from a vast array of genomic technologies including next-generation sequencing and microarray. As such, the large amounts of germline variant and somatic mutation data that have been generated from GWAS and sequencing projects, respectively, show great promise in providing a systems-level view of these genetic aberrations. In this study, we analyze publicly available GWAS, somatic mutation, and drug target data derived from large databanks using a network-based approach that incorporates directed edge information under a randomized network hypothesis testing procedure. We show that these three classes of disease-associated nodes exhibit non-random topological characteristics in the context of a functional interactome. Specifically, we show that drug targets tend to lie upstream of somatic mutations and disease susceptibility germline variants. In addition, we introduce a new approach to measuring hierarchy between drug targets, somatic mutants, and disease susceptibility genes by utilizing directionality and path length information. Overall, our results provide new insight into the intrinsic relationships between these node classes that broaden our understanding of cancer. In addition, our results align with current knowledge on the therapeutic actionability of GWAS and somatic mutant nodes, while demonstrating relationships between node classes from a global network perspective.
Collapse
Affiliation(s)
- Matthew H Ung
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, 03755 USA.,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03755 USA
| | - Chun-Chi Liu
- Institute of Genomics and Bioinformatics, National Chung Hsing University, Taiwan
| | - Chao Cheng
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, 03755 USA.,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03755 USA.,Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03766 USA
| |
Collapse
|
5261
|
Doppler M, Kluger B, Bueschl C, Schneider C, Krska R, Delcambre S, Hiller K, Lemmens M, Schuhmacher R. Stable Isotope-Assisted Evaluation of Different Extraction Solvents for Untargeted Metabolomics of Plants. Int J Mol Sci 2016; 17:ijms17071017. [PMID: 27367667 PMCID: PMC4964393 DOI: 10.3390/ijms17071017] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Revised: 06/13/2016] [Accepted: 06/21/2016] [Indexed: 12/21/2022] Open
Abstract
The evaluation of extraction protocols for untargeted metabolomics approaches is still difficult. We have applied a novel stable isotope-assisted workflow for untargeted LC-HRMS-based plant metabolomics , which allows for the first time every detected feature to be considered for method evaluation. The efficiency and complementarity of commonly used extraction solvents, namely 1 + 3 (v/v) mixtures of water and selected organic solvents (methanol, acetonitrile or methanol/acetonitrile 1 + 1 (v/v)), with and without the addition of 0.1% (v/v) formic acid were compared. Four different wheat organs were sampled, extracted and analysed by LC-HRMS. Data evaluation was performed with the in-house-developed MetExtract II software and R. With all tested solvents a total of 871 metabolites were extracted in ear, 785 in stem, 733 in leaf and 517 in root samples, respectively. Between 48% (stem) and 57% (ear) of the metabolites detected in a particular organ were found with all extraction mixtures, and 127 of 996 metabolites were consistently shared between all extraction agent/organ combinations. In aqueous methanol, acidification with formic acid led to pronounced pH dependency regarding the precision of metabolite abundance and the number of detectable metabolites, whereas extracts of acetonitrile-containing mixtures were less affected. Moreover, methanol and acetonitrile have been found to be complementary with respect to extraction efficiency. Interestingly, the beneficial properties of both solvents can be combined by the use of a water-methanol-acetonitrile mixture for global metabolite extraction instead of aqueous methanol or aqueous acetonitrile alone.
Collapse
Affiliation(s)
- Maria Doppler
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| | - Bernhard Kluger
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| | - Christoph Bueschl
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| | - Christina Schneider
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| | - Rudolf Krska
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| | - Sylvie Delcambre
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg Campus Belval, Avenue du Swing 6, 4367 Esch-Belval, Luxembourg.
| | - Karsten Hiller
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg Campus Belval, Avenue du Swing 6, 4367 Esch-Belval, Luxembourg.
| | - Marc Lemmens
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| | - Rainer Schuhmacher
- Center for Analytical Chemistry, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
- Institute for Biotechnology in Plant Production, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Vienna (BOKU), Konrad-Lorenz-Strasse 20, 3430 Tulln, Austria.
| |
Collapse
|
5262
|
Masson N, Pesenti M, Dormal V. Impact of optokinetic stimulation on mental arithmetic. PSYCHOLOGICAL RESEARCH 2016; 81:840-849. [DOI: 10.1007/s00426-016-0784-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Accepted: 06/19/2016] [Indexed: 11/29/2022]
|
5263
|
Tishchenko I, Milioli HH, Riveros C, Moscato P. Extensive Transcriptomic and Genomic Analysis Provides New Insights about Luminal Breast Cancers. PLoS One 2016; 11:e0158259. [PMID: 27341628 PMCID: PMC4920434 DOI: 10.1371/journal.pone.0158259] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2016] [Accepted: 06/13/2016] [Indexed: 12/19/2022] Open
Abstract
Despite constituting approximately two thirds of all breast cancers, the luminal A and B tumours are poorly classified at both clinical and molecular levels. There are contradictory reports on the nature of these subtypes: some define them as intrinsic entities, others as a continuum. With the aim of addressing these uncertainties and identifying molecular signatures of patients at risk, we conducted a comprehensive transcriptomic and genomic analysis of 2,425 luminal breast cancer samples. Our results indicate that the separation between the molecular luminal A and B subtypes—per definition—is not associated with intrinsic characteristics evident in the differentiation between other subtypes. Moreover, t-SNE and MST-kNN clustering approaches based on 10,000 probes, associated with luminal tumour initiation and/or development, revealed the close connections between luminal A and B tumours, with no evidence of a clear boundary between them. Thus, we considered all luminal tumours as a single heterogeneous group for analysis purposes. We first stratified luminal tumours into two distinct groups by their HER2 gene cluster co-expression: HER2-amplified luminal and ordinary-luminal. The former group is associated with distinct transcriptomic and genomic profiles, and poor prognosis; it comprises approximately 8% of all luminal cases. For the remaining ordinary-luminal tumours we further identified the molecular signature correlated with disease outcomes, exhibiting an approximately continuous gene expression range from low to high risk. Thus, we employed four virtual quantiles to segregate the groups of patients. The clinico-pathological characteristics and ratios of genomic aberrations are concordant with the variations in gene expression profiles, hinting at a progressive staging. The comparison with the current separation into luminal A and B subtypes revealed a substantially improved survival stratification. Concluding, we suggest a review of the definition of luminal A and B subtypes. A proposition for a revisited delineation is provided in this study.
Collapse
Affiliation(s)
- Inna Tishchenko
- Information-based Medicine Program, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
- School of Electrical Engineering and Computer Science, The University of Newcastle, Callaghan, NSW, Australia
| | - Heloisa Helena Milioli
- Information-based Medicine Program, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
- School of Environmental and Life Science, The University of Newcastle, Callaghan, NSW, Australia
| | - Carlos Riveros
- CReDITSS Unit, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
| | - Pablo Moscato
- Information-based Medicine Program, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
- School of Electrical Engineering and Computer Science, The University of Newcastle, Callaghan, NSW, Australia
- * E-mail:
| |
Collapse
|
5264
|
Greene CS, Voight BF. Pathway and network-based strategies to translate genetic discoveries into effective therapies. Hum Mol Genet 2016; 25:R94-R98. [PMID: 27340225 DOI: 10.1093/hmg/ddw160] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Accepted: 05/19/2016] [Indexed: 11/13/2022] Open
Abstract
One way to design a drug is to attempt to phenocopy a genetic variant that is known to have the desired effect. In general, drugs that are supported by genetic associations progress further in the development pipeline. However, the number of associations that are candidates for development into drugs is limited because many associations are in non-coding regions or difficult to target genes. Approaches that overlay information from pathway databases or biological networks can expand the potential target list. In cases where the initial variant is not targetable or there is no variant with the desired effect, this may reveal new means to target a disease. In this review, we discuss recent examples in the domain of pathway and network-based drug repositioning from genetic associations. We highlight important caveats and challenges for the field, and we discuss opportunities for further development.
Collapse
Affiliation(s)
- Casey S Greene
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine Institute for Translational Medicine and Therapeutics, Perelman School of Medicine
| | - Benjamin F Voight
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine Institute for Translational Medicine and Therapeutics, Perelman School of Medicine Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19103 USA
| |
Collapse
|
5265
|
Chinese herbal medicine for constipation: zheng-based associations among herbs, formulae, proprietary medicines, and herb-drug interactions. Chin Med 2016; 11:28. [PMID: 27347002 PMCID: PMC4919884 DOI: 10.1186/s13020-016-0099-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Accepted: 05/27/2016] [Indexed: 12/11/2022] Open
Abstract
Background As current symptomatic treatments of constipation are still unsatisfactory, an increasing number of patients seek help from Chinese medicine (CM), particularly Chinese herbal medicine (CHM). This study aimed to review the most frequently used CHM herbs and formulae, proprietary CHMs, and herb–drug interactions for functional constipation using zheng (syndrome)-based differentiation, and to determine the current practice of zheng-based CHM treatments for functional constipation. Methods We developed a search strategy to include all the related clinical studies of CHM for constipation and set inclusion and exclusion criteria as studies on subjects with constipation of all ages and both sexes, using objective measures from laboratory or imaging techniques. The interventions included single herbs, CM classical formulae, CM new formulae, and Chinese herb-derived products and combination products. The clinical study types included were quasi- or randomized controlled trials, observational clinical studies, case series or case reports, and other types of appropriate research methods. The data concerning study design, sample size, mode of recruitment, sampling and diagnostic procedure, inclusion and exclusion criteria, and participants’ characteristics (including age, sex, and duration of constipation). CM patterns, CM treatment principles, treatment regimen, and CM treatment outcomes were recorded. Results A total of 29,832 relevant records were found, of which 8541 were duplicate records and 20,639 were excluded for reasons of irrelevance. The full text of 965 articles was retrieved for detailed assessment, following which 480 articles were excluded for various reasons. From the included articles, we retrieved 190 different CM zheng diagnoses from 485 individual studies. The most common zheng was dual deficiency of qi and blood (N = 48), which was diagnosed in 948 out of 15,740 subjects. The most frequently used classical formula was Ma-Zi-Ren-Wan (MZRW) (N = 75) and the most frequently used proprietary CHM was Run-Chang-Wan (N = 87). The most frequently used combined medication was Da Huang with sodium bicarbonate tablets (frequency across all studies, n = 23), followed by Fan Xie Ye with lactulose oral solution (n = 8), Ma-Ren-Ruan-Jiao-Nang with lactulose oral solution (n = 6) and Liu-Wei-An-Xiao-Jiao-Nang (n = 6) with mosapride citrate tablets. Conclusion This study examined the use of CHM for constipation and summarized the herbs, formulae, proprietary medicines, and herb–drug interactions application. These data indicated there were limited information about herb-drug interactions and adverse effects of CHM and further randomized controlled trials with strict design are necessary. Electronic supplementary material The online version of this article (doi:10.1186/s13020-016-0099-4) contains supplementary material, which is available to authorized users.
Collapse
|
5266
|
Kurzbach D. Network representation of protein interactions: Theory of graph description and analysis. Protein Sci 2016; 25:1617-27. [PMID: 27272236 DOI: 10.1002/pro.2963] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2016] [Accepted: 05/31/2016] [Indexed: 12/12/2022]
Abstract
A methodological framework is presented for the graph theoretical interpretation of NMR data of protein interactions. The proposed analysis generalizes the idea of network representations of protein structures by expanding it to protein interactions. This approach is based on regularization of residue-resolved NMR relaxation times and chemical shift data and subsequent construction of an adjacency matrix that represents the underlying protein interaction as a graph or network. The network nodes represent protein residues. Two nodes are connected if two residues are functionally correlated during the protein interaction event. The analysis of the resulting network enables the quantification of the importance of each amino acid of a protein for its interactions. Furthermore, the determination of the pattern of correlations between residues yields insights into the functional architecture of an interaction. This is of special interest for intrinsically disordered proteins, since the structural (three-dimensional) architecture of these proteins and their complexes is difficult to determine. The power of the proposed methodology is demonstrated at the example of the interaction between the intrinsically disordered protein osteopontin and its natural ligand heparin.
Collapse
Affiliation(s)
- Dennis Kurzbach
- Departement de Chimie, Ecole Normale Superieure, PSL Research University, UPMC Univ Paris 06, CNRS, Laboratoire des Biomolecules (LBM), 24 rue Lhomond, 75005 Paris, France
| |
Collapse
|
5267
|
Cairns J, Freire-Pritchett P, Wingett SW, Várnai C, Dimond A, Plagnol V, Zerbino D, Schoenfelder S, Javierre BM, Osborne C, Fraser P, Spivakov M. CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data. Genome Biol 2016; 17:127. [PMID: 27306882 PMCID: PMC4908757 DOI: 10.1186/s13059-016-0992-2] [Citation(s) in RCA: 273] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 05/25/2016] [Indexed: 12/14/2022] Open
Abstract
Capture Hi-C (CHi-C) is a method for profiling chromosomal interactions involving targeted regions of interest, such as gene promoters, globally and at high resolution. Signal detection in CHi-C data involves a number of statistical challenges that are not observed when using other Hi-C-like techniques. We present a background model and algorithms for normalisation and multiple testing that are specifically adapted to CHi-C experiments. We implement these procedures in CHiCAGO ( http://regulatorygenomicsgroup.org/chicago ), an open-source package for robust interaction detection in CHi-C. We validate CHiCAGO by showing that promoter-interacting regions detected with this method are enriched for regulatory features and disease-associated SNPs.
Collapse
Affiliation(s)
- Jonathan Cairns
- Nuclear Dynamics Programme, Babraham Institute, Cambridge, UK
| | | | - Steven W Wingett
- Nuclear Dynamics Programme, Babraham Institute, Cambridge, UK
- Bioinformatics Group, Babraham Institute, Cambridge, UK
| | - Csilla Várnai
- Nuclear Dynamics Programme, Babraham Institute, Cambridge, UK
| | - Andrew Dimond
- Nuclear Dynamics Programme, Babraham Institute, Cambridge, UK
| | | | - Daniel Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | | | | | - Cameron Osborne
- Department of Medical and Molecular Genetics, King's College, London, UK
| | - Peter Fraser
- Nuclear Dynamics Programme, Babraham Institute, Cambridge, UK
| | | |
Collapse
|
5268
|
Xie T, Yang QY, Wang XT, McLysaght A, Zhang HY. Spatial Colocalization of Human Ohnolog Pairs Acts to Maintain Dosage-Balance. Mol Biol Evol 2016; 33:2368-75. [PMID: 27297469 PMCID: PMC4989111 DOI: 10.1093/molbev/msw108] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Ohnologs -paralogous gene pairs generated by whole genome duplication- are enriched for dosage sensitive genes, that is, genes that have a phenotype due to copy number changes. Dosage sensitive genes frequently occur in the same metabolic pathway and in physically interacting proteins. Accumulating evidence reveals that functionally related genes tend to co-localize in the three-dimensional (3D) arrangement of chromosomes. We query whether the spatial distribution of ohnologs has implications for their dosage balance. We analyzed the colocalization frequency of ohnologs based on chromatin interaction datasets of seven human cell lines and found that ohnolog pairs exhibit higher spatial proximity in 3D nuclear organization than other paralog pairs and than randomly chosen ohnologs in the genome. We also found that colocalized ohnologs are more resistant to copy number variations and more likely to be disease-associated genes, which indicates a stronger dosage balance in ohnologs with high spatial proximity. This phenomenon is further supported by the stronger similarity of gene co-expression and of gene ontology terms of colocalized ohnologs. In addition, for a large fraction of ohnologs, the spatial colocalization is conserved in mouse cells, suggestive of functional constraint on their 3D positioning in the nucleus.
Collapse
Affiliation(s)
- Ting Xie
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan P. R. China
| | - Qing-Yong Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan P. R. China
| | - Xiao-Tao Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan P. R. China
| | - Aoife McLysaght
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
| | - Hong-Yu Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan P. R. China
| |
Collapse
|
5269
|
Ulyantsev VI, Kazakov SV, Dubinkina VB, Tyakht AV, Alexeev DG. MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data. Bioinformatics 2016; 32:2760-7. [PMID: 27259541 DOI: 10.1093/bioinformatics/btw312] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Accepted: 05/16/2016] [Indexed: 02/02/2023] Open
Abstract
MOTIVATION High-throughput metagenomic sequencing has revolutionized our view on the structure and metabolic potential of microbial communities. However, analysis of metagenomic composition is often complicated by the high complexity of the community and the lack of related reference genomic sequences. As a start point for comparative metagenomic analysis, the researchers require efficient means for assessing pairwise similarity of the metagenomes (beta-diversity). A number of approaches were used to address this task, however, most of them have inherent disadvantages that limit their scope of applicability. For instance, the reference-based methods poorly perform on metagenomes from previously unstudied niches, while composition-based methods appear to be too abstract for straightforward interpretation and do not allow to identify the differentially abundant features. RESULTS We developed MetaFast, an approach that allows to represent a shotgun metagenome from an arbitrary environment as a modified de Bruijn graph consisting of simplified components. For multiple metagenomes, the resulting representation is used to obtain a pairwise similarity matrix. The dimensional structure of the metagenomic components preserved in our algorithm reflects the inherent subspecies-level diversity of microbiota. The method is computationally efficient and especially promising for an analysis of metagenomes from novel environmental niches. AVAILABILITY AND IMPLEMENTATION Source code and binaries are freely available for download at https://github.com/ctlab/metafast The code is written in Java and is platform independent (tested on Linux and Windows x86_64). CONTACT ulyantsev@rain.ifmo.ru SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Veronika B Dubinkina
- Federal Research and Clinical Centre of Physical-Chemical Medicine, Moscow, Russian Federation Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russian Federation
| | - Alexander V Tyakht
- Federal Research and Clinical Centre of Physical-Chemical Medicine, Moscow, Russian Federation Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russian Federation
| | - Dmitry G Alexeev
- Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russian Federation
| |
Collapse
|
5270
|
Zhou PZ, Zhu YM, Zou GH, Sun YX, Xiu XL, Huang X, Zhang QH. Relationship Between Glucocorticoids and Insulin Resistance in Healthy Individuals. Med Sci Monit 2016; 22:1887-94. [PMID: 27258456 PMCID: PMC4913831 DOI: 10.12659/msm.895251] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Background The aim of this study was to determine the correlation between glucocorticoids (GCs) and insulin resistance (IR) in healthy individuals by conducting a systematic meta-analysis. Material/Methods A systematic literature review was conducted using 9 electronic databases. Only case-control studies investigating fasting plasma glucose (FPG) and IR were enrolled based on strictly established selection criteria. Statistical analyses were performed by Stata software, version 12.0 (Stata Corporation, College Station, Texas, USA). Results Among 496 initially retrieved articles, only 6 papers published in English were eventually included in this meta-analysis. A total of 201 healthy individuals (105 in GC group and 96 in control group) were included in the 6 studies. In 4 of these 6 studies, dexamethasone was used, and in the other 2 studies prednisolone was given. This meta-analysis revealed that the FPG, fasting insulin (FINS), and homeostasis model assessment of insulin resistance (HOMA-IR) levels in the GC group were all significantly higher than that in the control group (FPG: SMD=2.65, 95%CI=1.42~3.88, P<0.001; FINS: SMD=2.48, 95%CI=1.01~3.95, P=0.001; HOMA-IR: SMD=38.30, 95%CI=24.38~52.22, P<0.001). Conclusions In conclusion, our present study revealed that therapies using GCs might result in elevated FPG, FINS, and HOMA-IR, and thereby contribute to IR in healthy individuals.
Collapse
Affiliation(s)
- Peng-Zhen Zhou
- Department of Reproductive Medicine, Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong, China (mainland)
| | - Yong-Mei Zhu
- Department of Pediatric Orthopaedic, Yantaishan Hospital, Yantai, Shandong, China (mainland)
| | - Guang-Hui Zou
- Department of Reproductive Medicine, Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong, China (mainland)
| | - Yu-Xia Sun
- Department of Reproductive Medicine, Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong, China (mainland)
| | - Xiao-Lin Xiu
- Department of Reproductive Medicine, Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong, China (mainland)
| | - Xin Huang
- Department of Reproductive Medicine, Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong, China (mainland)
| | - Qun-Hui Zhang
- Department of Rheumatism and Immunology, Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong, China (mainland)
| |
Collapse
|
5271
|
Boeven PHG, Longin CFH, Würschum T. A unified framework for hybrid breeding and the establishment of heterotic groups in wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2016; 129:1231-45. [PMID: 26956559 DOI: 10.1007/s00122-016-2699-x] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Accepted: 02/18/2016] [Indexed: 05/26/2023]
Abstract
Global wheat genetic diversity can be used in a unified framework to support and accelerate hybrid breeding and the development of heterotic groups in wheat. Hybrid wheat breeding has great potential to increase the global wheat grain yield level particularly in view of the increasing abiotic and biotic stress challenges as well as variable climatic conditions. For the long-term success of hybrid wheat breeding and the maximum exploitation of heterosis, high-yielding heterotic patterns must be established. Here, we propose a unified framework for hybrid breeding and the establishment of heterotic groups in autogamous crops and exemplify it for hybrid wheat breeding in Germany. A key component is the establishment of genetic distance between heterotic groups and in this context, we assessed genetic diversity in a global collection of 1110 winter wheat varieties released during the past decades in 35 countries but with a focus on European origin. Our analyses revealed the absence of major population structure but nevertheless suggest genetically distinct subgroups with potential for hybrid wheat breeding. Taking our molecular results and additional phenotypic data together, we propose how global genetic diversity can be used to accelerate and support reciprocal recurrent selection for the development of genetically distinct heterotic groups in hybrid wheat breeding.
Collapse
Affiliation(s)
- Philipp H G Boeven
- State Plant Breeding Institute, University of Hohenheim, 70593, Stuttgart, Germany
| | - C Friedrich H Longin
- State Plant Breeding Institute, University of Hohenheim, 70593, Stuttgart, Germany
| | - Tobias Würschum
- State Plant Breeding Institute, University of Hohenheim, 70593, Stuttgart, Germany.
| |
Collapse
|
5272
|
Wang P, Chen Y, Lü J, Wang Q, Yu X. Graphical Features of Functional Genes in Human Protein Interaction Network. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2016; 10:707-20. [PMID: 26841412 DOI: 10.1109/tbcas.2015.2487299] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
With the completion of the human genome project, it is feasible to investigate large-scale human protein interaction network (HPIN) with complex networks theory. Proteins are encoded by genes. Essential, viable, disease, conserved, housekeeping (HK) and tissue-enriched (TE) genes are functional genes, which are organized and functioned via interaction networks. Based on up-to-date data from various databases or literature, two large-scale HPINs and six subnetworks are constructed. We illustrate that the HPINs and most of the subnetworks are sparse, small-world, scale-free, disassortative and with hierarchical modularity. Among the six subnetworks, essential, disease and HK subnetworks are more densely connected than the others. Statistical analysis on the topological structures of the HPIN reveals that the lethal, the conserved, the HK and the TE genes are with hallmark graphical features. Receiver operating characteristic (ROC) curves indicate that the essential genes can be distinguished from the viable ones with accuracy as high as almost 70%. Closeness, semi-local and eigenvector centralities can distinguish the HK genes from the TE ones with accuracy around 82%. Furthermore, the Venn diagram, cluster dendgrams and classifications of disease genes reveal that some classes of disease genes are with hallmark graphical features, especially for cancer genes, HK disease genes and TE disease genes. The findings facilitate the identification of some functional genes via topological structures. The investigations shed some light on the characteristics of the compete interactome, which have potential implications in networked medicine and biological network control.
Collapse
|
5273
|
Ghatak A, Chaturvedi P, Nagler M, Roustan V, Lyon D, Bachmann G, Postl W, Schröfl A, Desai N, Varshney RK, Weckwerth W. Comprehensive tissue-specific proteome analysis of drought stress responses in Pennisetum glaucum (L.) R. Br. (Pearl millet). J Proteomics 2016; 143:122-135. [DOI: 10.1016/j.jprot.2016.02.032] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Revised: 02/21/2016] [Accepted: 02/26/2016] [Indexed: 01/07/2023]
|
5274
|
Oetjens MT, Bush WS, Denny JC, Birdwell K, Kodaman N, Verma A, Dilks HH, Pendergrass SA, Ritchie MD, Crawford DC. Evidence for extensive pleiotropy among pharmacogenes. Pharmacogenomics 2016; 17:853-66. [PMID: 27249515 DOI: 10.2217/pgs-2015-0007] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
AIM We sought to identify potential pleiotropy involving pharmacogenes. METHODS We tested 184 functional variants in 34 pharmacogenes for associations using a custom grouping of International Classification and Disease, Ninth Revision billing codes extracted from deidentified electronic health records of 6892 patients. RESULTS We replicated several associations including ABCG2 (rs2231142) and gout (p = 1.73 × 10(-7); odds ratio [OR]: 1.73; 95% CI: 1.40-2.12); and SLCO1B1 (rs4149056) and jaundice (p = 2.50 × 10(-4); OR: 1.67; 95% CI: 1.27-2.20). CONCLUSION In this systematic screen for phenotypic associations with functional variants, several novel genotype-phenotype combinations also achieved phenome-wide significance, including SLC15A2 rs1143672 and renal osteodystrophy (p = 2.67 × 10(-) (6); OR: 0.61; 95% CI: 0.49-0.75).
Collapse
Affiliation(s)
- Matthew T Oetjens
- Center for Human Genetics Research, Vanderbilt University, Nashville, TN 37232, USA
| | - William S Bush
- Department of Epidemiology & Biostatistics, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Joshua C Denny
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37203, USA
| | - Kelly Birdwell
- Department of Medicine, Vanderbilt University, Nashville, TN 37232, USA
| | - Nuri Kodaman
- Center for Human Genetics Research, Vanderbilt University, Nashville, TN 37232, USA
| | - Anurag Verma
- Center for Systems Genomics, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Holli H Dilks
- Sarah Cannon Research Institute, Nashville, TN 37203 USA
| | - Sarah A Pendergrass
- Center for Systems Genomics, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Marylyn D Ritchie
- Center for Systems Genomics, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Dana C Crawford
- Department of Epidemiology & Biostatistics, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
5275
|
Nath A, Subbiah K. Probing an optimal class distribution for enhancing prediction and feature characterization of plant virus-encoded RNA-silencing suppressors. 3 Biotech 2016; 6:93. [PMID: 28330163 PMCID: PMC4801844 DOI: 10.1007/s13205-016-0410-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Accepted: 03/03/2016] [Indexed: 10/28/2022] Open
Abstract
To counter the host RNA silencing defense mechanism, many plant viruses encode RNA silencing suppressor proteins. These groups of proteins share very low sequence and structural similarities among them, which consequently hamper their annotation using sequence similarity-based search methods. Alternatively the machine learning-based methods can become a suitable choice, but the optimal performance through machine learning-based methods is being affected by various factors such as class imbalance, incomplete learning, selection of inappropriate features, etc. In this paper, we have proposed a novel approach to deal with the class imbalance problem by finding the optimal class distribution for enhancing the prediction accuracy for the RNA silencing suppressors. The optimal class distribution was obtained using different resampling techniques with varying degrees of class distribution starting from natural distribution to ideal distribution, i.e., equal distribution. The experimental results support the fact that optimal class distribution plays an important role to achieve near perfect learning. The best prediction results are obtained with Sequential Minimal Optimization (SMO) learning algorithm. We could achieve a sensitivity of 98.5 %, specificity of 92.6 % with an overall accuracy of 95.3 % on a tenfold cross validation and is further validated using leave one out cross validation test. It was also observed that the machine learning models trained on oversampled training sets using synthetic minority oversampling technique (SMOTE) have relatively performed better than on both randomly undersampled and imbalanced training data sets. Further, we have characterized the important discriminatory sequence features of RNA-silencing suppressors which distinguish these groups of proteins from other protein families.
Collapse
|
5276
|
Milioli HH. Life as an early career researcher: interview with Heloisa Helena Milioli. Future Sci OA 2016; 2:FSO128. [PMID: 28031973 PMCID: PMC5137908 DOI: 10.4155/fsoa-2016-0033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2016] [Indexed: 11/17/2022] Open
Abstract
Heloisa Helena Milioli speaks to Francesca Lake, Managing Editor: Heloisa received a BSc degree in Biological Sciences (2008) from the Universidade Federal de Santa Catarina (Brazil) and obtained a MSc degree in Genetics (2011) from Universidade Federal do Paraná (Brazil). In 2011 and 2012, she worked as a lecturer and tutor in the Department of Cell Biology, Embryology and Genetics (Universidade Federal de Santa Catarina). She moved to Australia in 2012 to obtain her PhD in Biological Sciences, with emphasis on Bioinformatics, from The University of Newcastle. Her doctoral work brings together new considerations in the breast cancer field by combining novel bioinformatics approaches with the study of intrinsic subtypes. She has been applying advanced methods and sophisticated algorithms in unconventional computer architecture for the molecular classification of breast cancer based on the genomic (single nucleotide polymorphisms, circulating nucleic acids and copy number variations) and transcriptomic (gene expression and miRNA) signatures. Fundamental research will allow her to identify biomarkers of use in translational medicine for the diagnosis, prognosis and disease management focused on group-based tailored therapies.
Collapse
Affiliation(s)
- Heloisa Helena Milioli
- Priority Research Centre for Bioinformatics, Biomarker Discovery & Information-Based Medicine, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
- School of Environmental & Life Science, The University of Newcastle, Callaghan, NSW, Australia
| |
Collapse
|
5277
|
Thomas J, Seo D, Sael L. Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders. Int J Mol Sci 2016; 17:ijms17060862. [PMID: 27258269 PMCID: PMC4926396 DOI: 10.3390/ijms17060862] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Revised: 05/10/2016] [Accepted: 05/24/2016] [Indexed: 01/03/2023] Open
Abstract
How can complex relationships among molecular or clinico-pathological entities of neurological disorders be represented and analyzed? Graphs seem to be the current answer to the question no matter the type of information: molecular data, brain images or neural signals. We review a wide spectrum of graph representation and graph analysis methods and their application in the study of both the genomic level and the phenotypic level of the neurological disorder. We find numerous research works that create, process and analyze graphs formed from one or a few data types to gain an understanding of specific aspects of the neurological disorders. Furthermore, with the increasing number of data of various types becoming available for neurological disorders, we find that integrative analysis approaches that combine several types of data are being recognized as a way to gain a global understanding of the diseases. Although there are still not many integrative analyses of graphs due to the complexity in analysis, multi-layer graph analysis is a promising framework that can incorporate various data types. We describe and discuss the benefits of the multi-layer graph framework for studies of neurological disease.
Collapse
Affiliation(s)
- Jaya Thomas
- Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA.
- Department of Computer Science, State University New York Korea, Incheon 406-840, Korea.
| | - Dongmin Seo
- Korea Institute of Science and Technology Information, 245 Daehak-ro, Yuseong-gu, Daejeon 34141, Korea.
| | - Lee Sael
- Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA.
- Department of Computer Science, State University New York Korea, Incheon 406-840, Korea.
| |
Collapse
|
5278
|
Ferguson JF, Allayee H, Gerszten RE, Ideraabdullah F, Kris-Etherton PM, Ordovás JM, Rimm EB, Wang TJ, Bennett BJ. Nutrigenomics, the Microbiome, and Gene-Environment Interactions: New Directions in Cardiovascular Disease Research, Prevention, and Treatment: A Scientific Statement From the American Heart Association. CIRCULATION. CARDIOVASCULAR GENETICS 2016; 9:291-313. [PMID: 27095829 PMCID: PMC7829062 DOI: 10.1161/hcg.0000000000000030] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Cardiometabolic diseases are the leading cause of death worldwide and are strongly linked to both genetic and nutritional factors. The field of nutrigenomics encompasses multiple approaches aimed at understanding the effects of diet on health or disease development, including nutrigenetic studies investigating the relationship between genetic variants and diet in modulating cardiometabolic risk, as well as the effects of dietary components on multiple "omic" measures, including transcriptomics, metabolomics, proteomics, lipidomics, epigenetic modifications, and the microbiome. Here, we describe the current state of the field of nutrigenomics with respect to cardiometabolic disease research and outline a direction for the integration of multiple omics techniques in future nutrigenomic studies aimed at understanding mechanisms and developing new therapeutic options for cardiometabolic disease treatment and prevention.
Collapse
|
5279
|
|
5280
|
Su M, Huang G, Zhang Q, Wang X, Li C, Tao Y, Zhang S, Lai J, Yang C, Wang Y. The LEA protein, ABR, is regulated by ABI5 and involved in dark-induced leaf senescence in Arabidopsis thaliana. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2016; 247:93-103. [PMID: 27095403 DOI: 10.1016/j.plantsci.2016.03.009] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2015] [Revised: 03/16/2016] [Accepted: 03/23/2016] [Indexed: 05/23/2023]
Abstract
The phytohormone abscisic acid (ABA) modulates plant growth and developmental processes such as leaf senescence. In this study, we investigated the role of the Arabidopsis late embryogenesis abundant (LEA) protein ABR (ABA-response protein) in delaying dark-induced leaf senescence. The ABR gene was up-regulated by treatment with ABA, NaCl and mannitol, as well as by light deprivation. In the dark, abr mutant plants displayed a premature leaf senescence phenotype, and various senescence-associated indicators, such as an increase in chlorophyll degradation and membrane leakiness, were enhanced, whereas 35S:ABR/abr transgenic lines showed a marked delay in dark-induced leaf senescence phenotypes. In vitro and in vivo assays showed that ABI5 bind to the ABR promoter, indicating that ABI5 directly regulates the expression of ABR. The disruption of ABI5 function in abr abi5-1 plants abolished the senescence-accelerating phenotype of the abr mutant, demonstrating that ABI5 is epistatic to ABR. In summary, these results highlight the important role that ABR, which is negatively regulated by ABI5, plays in delaying dark-induced leaf senescence.
Collapse
Affiliation(s)
- Mengying Su
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Gan Huang
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Qing Zhang
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Xiao Wang
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Chunxin Li
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Yujin Tao
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Shengchun Zhang
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Jianbin Lai
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Chengwei Yang
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| | - Yaqin Wang
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou 510631, PR China.
| |
Collapse
|
5281
|
Hernandez L, Kim MK, Lyle LT, Bunch KP, House CD, Ning F, Noonan AM, Annunziata CM. Characterization of ovarian cancer cell lines as in vivo models for preclinical studies. Gynecol Oncol 2016; 142:332-40. [PMID: 27235858 DOI: 10.1016/j.ygyno.2016.05.028] [Citation(s) in RCA: 74] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Revised: 05/10/2016] [Accepted: 05/24/2016] [Indexed: 12/11/2022]
Abstract
OBJECTIVE The value of cell lines for pre-clinical work lies in choosing those with similar characteristics. Selection of cell lines is typically based on patient history, histological subtype at diagnosis, mutation patterns, or signaling pathways. Although recent studies established consensus regarding molecular characteristics of ovarian cancer cell lines, data on in vivo tumorigenicity remains only sporadically available, impeding translation of in vitro work to xenograft models. METHODS We introduced 18 ovarian cancer cell lines into athymic nude mice through subcutaneous, intraperitoneal, and ovary intrabursal routes, and observed tumor development over 6weeks. We also profiled cell line gene expression and identified differentially expressed gene sets based on their ability to form tumors in the subcutaneous or intraperitoneal locations. Representative cell lines were further subjected to proteomic analyses. RESULTS Ovarian cancer cell lines showed variable ability to grow in mice when implanted subcutaneous, intraperitoneal, or intrabursal. While some cell lines grew well in both SC and IP locations, others showed a strong propensity to grow in one location only. Gene expression profiles suggested that cell lines showing preference for IP growth had gene expression patterns more similar to primary tumors. CONCLUSIONS We report the tumorigenicity of 17 human ovarian cancer cell lines and one mouse cell line in three distinct anatomical locations, and associated gene networks. Growth patterns and histopathology, linked to molecular characteristics, provide a valuable resource to the research community, and better guide the choice of cell lines for in vitro studies to translate efficiently into xenograft testing.
Collapse
Affiliation(s)
- Lidia Hernandez
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - Marianne K Kim
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - L Tiffany Lyle
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - Kristen P Bunch
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - Carrie D House
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - Franklin Ning
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - Anne M Noonan
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States
| | - Christina M Annunziata
- Women's Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, United States.
| |
Collapse
|
5282
|
Zeng Z, Jiang X, Neapolitan R. Discovering causal interactions using Bayesian network scoring and information gain. BMC Bioinformatics 2016; 17:221. [PMID: 27230078 PMCID: PMC4880828 DOI: 10.1186/s12859-016-1084-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Accepted: 05/14/2016] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND The problem of learning causal influences from data has recently attracted much attention. Standard statistical methods can have difficulty learning discrete causes, which interacting to affect a target, because the assumptions in these methods often do not model discrete causal relationships well. An important task then is to learn such interactions from data. Motivated by the problem of learning epistatic interactions from datasets developed in genome-wide association studies (GWAS), researchers conceived new methods for learning discrete interactions. However, many of these methods do not differentiate a model representing a true interaction from a model representing non-interacting causes with strong individual affects. The recent algorithm MBS-IGain addresses this difficulty by using Bayesian network learning and information gain to discover interactions from high-dimensional datasets. However, MBS-IGain requires marginal effects to detect interactions containing more than two causes. If the dataset is not high-dimensional, we can avoid this shortcoming by doing an exhaustive search. RESULTS We develop Exhaustive-IGain, which is like MBS-IGain but does an exhaustive search. We compare the performance of Exhaustive-IGain to MBS-IGain using low-dimensional simulated datasets based on interactions with marginal effects and ones based on interactions without marginal effects. Their performance is similar on the datasets based on marginal effects. However, Exhaustive-IGain compellingly outperforms MBS-IGain on the datasets based on 3 and 4-cause interactions without marginal effects. We apply Exhaustive-IGain to investigate how clinical variables interact to affect breast cancer survival, and obtain results that agree with judgements of a breast cancer oncologist. CONCLUSIONS We conclude that the combined use of information gain and Bayesian network scoring enables us to discover higher order interactions with no marginal effects if we perform an exhaustive search. We further conclude that Exhaustive-IGain can be effective when applied to real data.
Collapse
Affiliation(s)
- Zexian Zeng
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Xia Jiang
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Richard Neapolitan
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
| |
Collapse
|
5283
|
Abstract
Next-generation sequencing (NGS) technologies have rapidly evolved in the last 5 years, leading to the generation of millions of short reads in a single run. Consequently, various sequence alignment algorithms have been developed to compare these reads to an appropriate reference in order to perform important downstream analysis. SOAP2 from the SOAP series is one of the most commonly used alignment programs to handle NGS data, and it efficiently does so using low computer memory usage and fast alignment speed. This chapter describes the protocol used to align short reads to a reference genome using SOAP2, and highlights the significance of using the in-built command-line options to tune the behavior of the algorithm according to the inputs and the desired results.
Collapse
|
5284
|
Koychev I, Joyce D, Barkus E, Ettinger U, Schmechtig A, Dourish CT, Dawson GR, Craig KJ, Deakin JFW. Cognitive and oculomotor performance in subjects with low and high schizotypy: implications for translational drug development studies. Transl Psychiatry 2016; 6:C. [PMID: 27187233 PMCID: PMC5070057 DOI: 10.1038/tp.2016.64] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Revised: 02/17/2016] [Accepted: 02/19/2016] [Indexed: 12/14/2022] Open
Abstract
The development of drugs to improve cognition in patients with schizophrenia is a major unmet clinical need. A number of promising compounds failed in recent clinical trials, a pattern linked to poor translation between preclinical and clinical stages of drug development. Seeking proof of efficacy in early Phase 1 studies in surrogate patient populations (for example, high schizotypy individuals where subtle cognitive impairment is present) has been suggested as a strategy to reduce attrition in the later stages of drug development. However, there is little agreement regarding the pattern of distribution of schizotypal features in the general population, creating uncertainty regarding the optimal control group that should be included in prospective trials. We aimed to address this question by comparing the performance of groups derived from the general population with low, average and high schizotypy scores over a range of cognitive and oculomotor tasks. We found that tasks dependent on frontal inhibitory mechanisms (N-Back working memory and anti-saccade oculomotor tasks), as well as a smooth-pursuit oculomotor task were sensitive to differences in the schizotypy phenotype. In these tasks the cognitive performance of 'low schizotypes' was significantly different from 'high schizotypes' with 'average schizotypes' having an intermediate performance. These results indicate that for evaluating putative cognition enhancers for treating schizophrenia in early-drug development studies the maximum schizotypy effect would be achieved using a design that compares low and high schizotypes.
Collapse
Affiliation(s)
- I Koychev
- Department of Community-Based Psychiatry, Neuroscience and Psychiatry Unit, The University of Manchester, School of Community-Based Medicine, Manchester, UK
- Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, UK
| | - D Joyce
- Cognition, Schizophrenia and Imaging Lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, Denmark Hill, London
| | - E Barkus
- Department of Psychology, University of Wollongong, Wollongong, New South Wales, Australia
- Department of Psychiatry, School of Community-Based Medicine, The University of Manchester, Manchester, UK
| | - U Ettinger
- Department of Psychology, University of Bonn, Bonn, Germany
| | - A Schmechtig
- Department of Neuroimaging, Institute of Psychiatry, King's College London, London, UK
| | - C T Dourish
- P1vital, Manor House, Howbery Park, Wallingford, UK
| | - G R Dawson
- P1vital, Manor House, Howbery Park, Wallingford, UK
| | - K J Craig
- P1vital, Manor House, Howbery Park, Wallingford, UK
| | - J F W Deakin
- Department of Community-Based Psychiatry, Neuroscience and Psychiatry Unit, The University of Manchester, School of Community-Based Medicine, Manchester, UK
| |
Collapse
|
5285
|
CINOEDV: a co-information based method for detecting and visualizing n-order epistatic interactions. BMC Bioinformatics 2016; 17:214. [PMID: 27184783 PMCID: PMC4869388 DOI: 10.1186/s12859-016-1076-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 05/07/2016] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Detecting and visualizing nonlinear interaction effects of single nucleotide polymorphisms (SNPs) or epistatic interactions are important topics in bioinformatics since they play an important role in unraveling the mystery of "missing heritability". However, related studies are almost limited to pairwise epistatic interactions due to their methodological and computational challenges. RESULTS We develop CINOEDV (Co-Information based N-Order Epistasis Detector and Visualizer) for the detection and visualization of epistatic interactions of their orders from 1 to n (n ≥ 2). CINOEDV is composed of two stages, namely, detecting stage and visualizing stage. In detecting stage, co-information based measures are employed to quantify association effects of n-order SNP combinations to the phenotype, and two types of search strategies are introduced to identify n-order epistatic interactions: an exhaustive search and a particle swarm optimization based search. In visualizing stage, all detected n-order epistatic interactions are used to construct a hypergraph, where a real vertex represents the main effect of a SNP and a virtual vertex denotes the interaction effect of an n-order epistatic interaction. By deeply analyzing the constructed hypergraph, some hidden clues for better understanding the underlying genetic architecture of complex diseases could be revealed. CONCLUSIONS Experiments of CINOEDV and its comparison with existing state-of-the-art methods are performed on both simulation data sets and a real data set of age-related macular degeneration. Results demonstrate that CINOEDV is promising in detecting and visualizing n-order epistatic interactions. CINOEDV is implemented in R and is freely available from R CRAN: http://cran.r-project.org and https://sourceforge.net/projects/cinoedv/files/ .
Collapse
|
5286
|
Zhang Y, Wong YS, Deng J, Anton C, Gabos S, Zhang W, Huang DY, Jin C. Machine learning algorithms for mode-of-action classification in toxicity assessment. BioData Min 2016; 9:19. [PMID: 27182283 PMCID: PMC4866020 DOI: 10.1186/s13040-016-0098-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 04/30/2016] [Indexed: 12/29/2022] Open
Abstract
Background Real Time Cell Analysis (RTCA) technology is used to monitor cellular changes continuously over the entire exposure period. Combining with different testing concentrations, the profiles have potential in probing the mode of action (MOA) of the testing substances. Results In this paper, we present machine learning approaches for MOA assessment. Computational tools based on artificial neural network (ANN) and support vector machine (SVM) are developed to analyze the time-concentration response curves (TCRCs) of human cell lines responding to tested chemicals. The techniques are capable of learning data from given TCRCs with known MOA information and then making MOA classification for the unknown toxicity. A novel data processing step based on wavelet transform is introduced to extract important features from the original TCRC data. From the dose response curves, time interval leading to higher classification success rate can be selected as input to enhance the performance of the machine learning algorithm. This is particularly helpful when handling cases with limited and imbalanced data. The validation of the proposed method is demonstrated by the supervised learning algorithm applied to the exposure data of HepG2 cell line to 63 chemicals with 11 concentrations in each test case. Classification success rate in the range of 85 to 95 % are obtained using SVM for MOA classification with two clusters to cases up to four clusters. Conclusions Wavelet transform is capable of capturing important features of TCRCs for MOA classification. The proposed SVM scheme incorporated with wavelet transform has a great potential for large scale MOA classification and high-through output chemical screening. Electronic supplementary material The online version of this article (doi:10.1186/s13040-016-0098-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yile Zhang
- Department of Mathematical and Statistical Science, University of Alberta, T6G 2G1, Edmonton, Canada
| | - Yau Shu Wong
- Department of Mathematical and Statistical Science, University of Alberta, T6G 2G1, Edmonton, Canada
| | - Jian Deng
- Department of Mathematical and Statistical Science, University of Alberta, T6G 2G1, Edmonton, Canada
| | - Cristina Anton
- Department of Mathematics and Statistics, Grant MacEwan University, T5P 2P7, Edmonton, Canada
| | - Stephan Gabos
- Department of Laboratory Medicine and Pathology, University of Alberta, T6G 2B7, Edmonton, Canada
| | | | - Dorothy Yu Huang
- Alberta Centre for Toxicology, University of Calgary, T2N 4N1, Calgary, Canada
| | - Can Jin
- AACEA Biosciences Inc, San Diego, 92121 USA
| |
Collapse
|
5287
|
Jeanson L, Thomas L, Copin B, Coste A, Sermet-Gaudelus I, Dastot-Le Moal F, Duquesnoy P, Montantin G, Collot N, Tissier S, Papon JF, Clement A, Louis B, Escudier E, Amselem S, Legendre M. Mutations in GAS8, a Gene Encoding a Nexin-Dynein Regulatory Complex Subunit, Cause Primary Ciliary Dyskinesia with Axonemal Disorganization. Hum Mutat 2016; 37:776-85. [PMID: 27120127 DOI: 10.1002/humu.23005] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Accepted: 04/10/2016] [Indexed: 12/12/2022]
Abstract
Primary ciliary dyskinesia (PCD) is an autosomal recessive disease characterized by chronic respiratory infections of the upper and lower airways, hypofertility, and, in approximately half of the cases, situs inversus. This complex phenotype results from defects in motile cilia and sperm flagella. Among the numerous genes involved in PCD, very few-including CCDC39 and CCDC40-carry mutations that lead to a disorganization of ciliary axonemes with microtubule misalignment. Focusing on this particular phenotype, we identified bi-allelic loss-of-function mutations in GAS8, a gene that encodes a subunit of the nexin-dynein regulatory complex (N-DRC) orthologous to DRC4 of the flagellated alga Chlamydomonas reinhardtii. Unlike the majority of PCD patients, individuals with GAS8 mutations have motile cilia, which, as documented by high-speed videomicroscopy, display a subtle beating pattern defect characterized by slightly reduced bending amplitude. Immunofluorescence studies performed on patients' respiratory cilia revealed that GAS8 is not required for the proper expression of CCDC39 and CCDC40. Rather, mutations in GAS8 affect the subcellular localization of another N-DRC subunit called DRC3. Overall, this study, which identifies GAS8 as a PCD gene, unveils the key importance of the corresponding protein in N-DRC integrity and in the proper alignment of axonemal microtubules in humans.
Collapse
Affiliation(s)
- Ludovic Jeanson
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France
| | - Lucie Thomas
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France
| | - Bruno Copin
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - André Coste
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Intercommunal et Groupe Hospitalier Henri Mondor - Albert Chenevier, Service d'Oto-Rhino-Laryngologie et de Chirurgie Cervico-Faciale, Créteil, F-94000, France
| | - Isabelle Sermet-Gaudelus
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Necker, Service de Pneumo-Allergologie Pédiatrique, Paris, F-75015, France
| | - Florence Dastot-Le Moal
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - Philippe Duquesnoy
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France
| | - Guy Montantin
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - Nathalie Collot
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - Sylvie Tissier
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - Jean-François Papon
- Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Bicêtre, Service d'Oto-Rhino-Laryngologie et de Chirurgie Cervico-Maxillo-Faciale, Le Kremlin-Bicêtre, F-94275, France
| | - Annick Clement
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France.,Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Pneumologie Pédiatrique, Centre National de Référence des Maladies Respiratoires Rares, Paris, F-75012, France
| | - Bruno Louis
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S955, Equipe 13, Univ Paris Est, Créteil, F-94000, France
| | - Estelle Escudier
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France.,Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - Serge Amselem
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France.,Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| | - Marie Legendre
- Institut National de la Santé Et de la Recherche Médicale (INSERM), UMR_S933, Sorbonne Universités, UPMC Univ Paris 06, Paris, F-75012, France.,Assistance Publique - Hôpitaux de Paris (AP-HP), Hôpital Armand Trousseau, Service de Génétique et Embryologie Médicales, Paris, F-75012, France
| |
Collapse
|
5288
|
Zabinski JW, Garcia-Vargas G, Rubio-Andrade M, Fry RC, Gibson JM. Advancing Dose-Response Assessment Methods for Environmental Regulatory Impact Analysis: A Bayesian Belief Network Approach Applied to Inorganic Arsenic. ENVIRONMENTAL SCIENCE & TECHNOLOGY LETTERS 2016; 3:200-204. [PMID: 27747248 PMCID: PMC5063306 DOI: 10.1021/acs.estlett.6b00076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Dose-response functions used in regulatory risk assessment are based on studies of whole organisms and fail to incorporate genetic and metabolomic data. Bayesian belief networks (BBNs) could provide a powerful framework for incorporating such data, but no prior research has examined this possibility. To address this gap, we develop a BBN-based model predicting birthweight at gestational age from arsenic exposure via drinking water and maternal metabolic indicators using a cohort of 200 pregnant women from an arsenic-endemic region of Mexico. We compare BBN predictions to those of prevailing slope-factor and reference-dose approaches. The BBN outperforms prevailing approaches in balancing false-positive and false-negative rates. Whereas the slope-factor approach had 2% sensitivity and 99% specificity and the reference-dose approach had 100% sensitivity and 0% specificity, the BBN's sensitivity and specificity were 71% and 30%, respectively. BBNs offer a promising opportunity to advance health risk assessment by incorporating modern genetic and metabolomic data.
Collapse
|
5289
|
Meng X, Jiang R, Lin D, Bustillo J, Jones T, Chen J, Yu Q, Du Y, Zhang Y, Jiang T, Sui J, Calhoun VD. Predicting individualized clinical measures by a generalized prediction framework and multimodal fusion of MRI data. Neuroimage 2016; 145:218-229. [PMID: 27177764 DOI: 10.1016/j.neuroimage.2016.05.026] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Revised: 04/13/2016] [Accepted: 05/07/2016] [Indexed: 12/24/2022] Open
Abstract
Neuroimaging techniques have greatly enhanced the understanding of neurodiversity (human brain variation across individuals) in both health and disease. The ultimate goal of using brain imaging biomarkers is to perform individualized predictions. Here we proposed a generalized framework that can predict explicit values of the targeted measures by taking advantage of joint information from multiple modalities. This framework also enables whole brain voxel-wise searching by combining multivariate techniques such as ReliefF, clustering, correlation-based feature selection and multiple regression models, which is more flexible and can achieve better prediction performance than alternative atlas-based methods. For 50 healthy controls and 47 schizophrenia patients, three kinds of features derived from resting-state fMRI (fALFF), sMRI (gray matter) and DTI (fractional anisotropy) were extracted and fed into a regression model, achieving high prediction for both cognitive scores (MCCB composite r=0.7033, MCCB social cognition r=0.7084) and symptomatic scores (positive and negative syndrome scale [PANSS] positive r=0.7785, PANSS negative r=0.7804). Moreover, the brain areas likely responsible for cognitive deficits of schizophrenia, including middle temporal gyrus, dorsolateral prefrontal cortex, striatum, cuneus and cerebellum, were located with different weights, as well as regions predicting PANSS symptoms, including thalamus, striatum and inferior parietal lobule, pinpointing the potential neuromarkers. Finally, compared to a single modality, multimodal combination achieves higher prediction accuracy and enables individualized prediction on multiple clinical measures. There is more work to be done, but the current results highlight the potential utility of multimodal brain imaging biomarkers to eventually inform clinical decision-making.
Collapse
Affiliation(s)
- Xing Meng
- Brainnetome Center and National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Rongtao Jiang
- Brainnetome Center and National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Dongdong Lin
- The Mind Research Network and Lovelace Biomedical and Environmental Research Institute, Albuquerque, NM 87106, USA
| | - Juan Bustillo
- Dept. of Psychiatry and Neuroscience, University of New Mexico, Albuquerque, NM 87131, USA
| | - Thomas Jones
- Dept. of Psychiatry and Neuroscience, University of New Mexico, Albuquerque, NM 87131, USA
| | - Jiayu Chen
- The Mind Research Network and Lovelace Biomedical and Environmental Research Institute, Albuquerque, NM 87106, USA
| | - Qingbao Yu
- The Mind Research Network and Lovelace Biomedical and Environmental Research Institute, Albuquerque, NM 87106, USA
| | - Yuhui Du
- The Mind Research Network and Lovelace Biomedical and Environmental Research Institute, Albuquerque, NM 87106, USA
| | - Yu Zhang
- Brainnetome Center and National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Tianzi Jiang
- Brainnetome Center and National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; CAS Center for Excellence in Brain Science, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Jing Sui
- Brainnetome Center and National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; The Mind Research Network and Lovelace Biomedical and Environmental Research Institute, Albuquerque, NM 87106, USA; CAS Center for Excellence in Brain Science, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China.
| | - Vince D Calhoun
- The Mind Research Network and Lovelace Biomedical and Environmental Research Institute, Albuquerque, NM 87106, USA; Dept. of Psychiatry and Neuroscience, University of New Mexico, Albuquerque, NM 87131, USA; Dept. of Electronic and Computer Engineering, University of New Mexico, Albuquerque, NM 87131, USA
| |
Collapse
|
5290
|
Li R, Dudek SM, Kim D, Hall MA, Bradford Y, Peissig PL, Brilliant MH, Linneman JG, McCarty CA, Bao L, Ritchie MD. Identification of genetic interaction networks via an evolutionary algorithm evolved Bayesian network. BioData Min 2016; 9:18. [PMID: 27168765 PMCID: PMC4862166 DOI: 10.1186/s13040-016-0094-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 04/18/2016] [Indexed: 12/01/2022] Open
Abstract
Background The future of medicine is moving towards the phase of precision medicine, with the goal to prevent and treat diseases by taking inter-individual variability into account. A large part of the variability lies in our genetic makeup. With the fast paced improvement of high-throughput methods for genome sequencing, a tremendous amount of genetics data have already been generated. The next hurdle for precision medicine is to have sufficient computational tools for analyzing large sets of data. Genome-Wide Association Studies (GWAS) have been the primary method to assess the relationship between single nucleotide polymorphisms (SNPs) and disease traits. While GWAS is sufficient in finding individual SNPs with strong main effects, it does not capture potential interactions among multiple SNPs. In many traits, a large proportion of variation remain unexplained by using main effects alone, leaving the door open for exploring the role of genetic interactions. However, identifying genetic interactions in large-scale genomics data poses a challenge even for modern computing. Results For this study, we present a new algorithm, Grammatical Evolution Bayesian Network (GEBN) that utilizes Bayesian Networks to identify interactions in the data, and at the same time, uses an evolutionary algorithm to reduce the computational cost associated with network optimization. GEBN excelled in simulation studies where the data contained main effects and interaction effects. We also applied GEBN to a Type 2 diabetes (T2D) dataset obtained from the Marshfield Personalized Medicine Research Project (PMRP). We were able to identify genetic interactions for T2D cases and controls and use information from those interactions to classify T2D samples. We obtained an average testing area under the curve (AUC) of 86.8 %. We also identified several interacting genes such as INADL and LPP that are known to be associated with T2D. Conclusions Developing the computational tools to explore genetic associations beyond main effects remains a critically important challenge in human genetics. Methods, such as GEBN, demonstrate the utility of considering genetic interactions, as they likely explain some of the missing heritability.
Collapse
Affiliation(s)
- Ruowang Li
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Scott M Dudek
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Dokyoon Kim
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Molly A Hall
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Yuki Bradford
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Peggy L Peissig
- Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, Marshfield, Wisconsin USA
| | - Murray H Brilliant
- Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, Marshfield, Wisconsin USA
| | - James G Linneman
- Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, Marshfield, Wisconsin USA
| | | | - Le Bao
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Marylyn D Ritchie
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA ; Biomedical & Translational Informatics, Geisinger Health System, Danville, Pennsylvania USA
| |
Collapse
|
5291
|
Pappalardo F, Fichera E, Paparone N, Lombardo A, Pennisi M, Russo G, Leotta M, Pappalardo F, Pedretti A, De Fiore F, Motta S. A computational model to predict the immune system activation by citrus-derived vaccine adjuvants. Bioinformatics 2016; 32:2672-80. [PMID: 27162187 DOI: 10.1093/bioinformatics/btw293] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Accepted: 04/20/2016] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION Vaccines represent the most effective and cost-efficient weapons against a wide range of diseases. Nowadays new generation vaccines based on subunit antigens reduce adverse effects in high risk individuals. However, vaccine antigens are often poor immunogens when administered alone. Adjuvants represent a good strategy to overcome such hurdles, indeed they are able to: enhance the immune response; allow antigens sparing; accelerate the specific immune response; and increase vaccine efficacy in vulnerable groups such as newborns, elderly or immuno-compromised people. However, due to safety concerns and adverse reactions, there are only a few adjuvants approved for use in humans. Moreover, in practice current adjuvants sometimes fail to confer adequate stimulation. Hence, there is an imperative need to develop novel adjuvants that overcome the limitations of the currently available licensed adjuvants. RESULTS We developed a computational framework that provides a complete pipeline capable of predicting the best citrus-derived adjuvants for enhancing the immune system response using, as a target disease model, influenza A infection. In silico simulations suggested a good immune efficacy of specific citrus-derived adjuvant (Beta Sitosterol) that was then confirmed in vivoAvailability: The model is available visiting the following URL: http://vaima.dmi.unict.it/AdjSim CONTACT francesco.pappalardo@unict.it; fp@francescopappalardo.net.
Collapse
Affiliation(s)
| | - Epifanio Fichera
- Etna Biotech S.R.L, via Vincenzo Lancia, 57 - Zona Industriale Blocco Palma 1
| | - Nicoletta Paparone
- Parco Scientifico E Tecnologico Della Sicilia, via Vincenzo Lancia, 57 - Zona Industriale Blocco Palma 1
| | - Alessandro Lombardo
- Parco Scientifico E Tecnologico Della Sicilia, via Vincenzo Lancia, 57 - Zona Industriale Blocco Palma 1
| | - Marzio Pennisi
- Department of Mathematics and Computer Science, University of Catania
| | - Giulia Russo
- Department of Biomedical and Biotechnological Sciences, University of Catania, Catania, Italy
| | - Marco Leotta
- Department of Drug Sciences, University of Catania
| | - Francesco Pappalardo
- Parco Scientifico E Tecnologico Della Sicilia, via Vincenzo Lancia, 57 - Zona Industriale Blocco Palma 1
| | | | | | - Santo Motta
- Department of Mathematics and Computer Science, University of Catania
| |
Collapse
|
5292
|
Rahmani B, Zimmermann MT, Grill DE, Kennedy RB, Oberg AL, White BC, Poland GA, McKinney BA. Recursive Indirect-Paths Modularity (RIP-M) for Detecting Community Structure in RNA-Seq Co-expression Networks. Front Genet 2016; 7:80. [PMID: 27242890 PMCID: PMC4861003 DOI: 10.3389/fgene.2016.00080] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 04/20/2016] [Indexed: 11/16/2022] Open
Abstract
Clusters of genes in co-expression networks are commonly used as functional units for gene set enrichment detection and increasingly as features (attribute construction) for statistical inference and sample classification. One of the practical challenges of clustering for these purposes is to identify an optimal partition of the network where the individual clusters are neither too large, prohibiting interpretation, nor too small, precluding general inference. Newman Modularity is a spectral clustering algorithm that automatically finds the number of clusters, but for many biological networks the cluster sizes are suboptimal. In this work, we generalize Newman Modularity to incorporate information from indirect paths in RNA-Seq co-expression networks. We implement a merge-and-split algorithm that allows the user to constrain the range of cluster sizes: large enough to capture genes in relevant pathways, yet small enough to resolve distinct functions. We investigate the properties of our recursive indirect-pathways modularity (RIP-M) and compare it with other clustering methods using simulated co-expression networks and RNA-seq data from an influenza vaccine response study. RIP-M had higher cluster assignment accuracy than Newman Modularity for finding clusters in simulated co-expression networks for all scenarios, and RIP-M had comparable accuracy to Weighted Gene Correlation Network Analysis (WGCNA). RIP-M was more accurate than WGCNA for modest hard thresholds and comparable for high, while WGCNA was slightly more accurate for soft thresholds. In the vaccine study data, RIP-M and WGCNA enriched for a comparable number of immunologically relevant pathways.
Collapse
Affiliation(s)
- Bahareh Rahmani
- Tandy School of Computer Science, University of Tulsa Tulsa, OK, USA
| | - Michael T Zimmermann
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo ClinicRochester, MN, USA; Mayo Clinic Vaccine Research Group, Mayo ClinicRochester, MN, USA
| | - Diane E Grill
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo ClinicRochester, MN, USA; Mayo Clinic Vaccine Research Group, Mayo ClinicRochester, MN, USA
| | | | - Ann L Oberg
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo ClinicRochester, MN, USA; Mayo Clinic Vaccine Research Group, Mayo ClinicRochester, MN, USA
| | - Bill C White
- Tandy School of Computer Science, University of Tulsa Tulsa, OK, USA
| | - Gregory A Poland
- Mayo Clinic Vaccine Research Group, Mayo Clinic Rochester, MN, USA
| | - Brett A McKinney
- Tandy School of Computer Science, University of TulsaTulsa, OK, USA; Department of Mathematics, University of TulsaTulsa, OK USA
| |
Collapse
|
5293
|
A Comprehensive In Silico Analysis on the Structural and Functional Impact of SNPs in the Congenital Heart Defects Associated with NKX2-5 Gene-A Molecular Dynamic Simulation Approach. PLoS One 2016; 11:e0153999. [PMID: 27152669 PMCID: PMC4859487 DOI: 10.1371/journal.pone.0153999] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 03/21/2016] [Indexed: 11/23/2022] Open
Abstract
Congenital heart defects (CHD) presented as structural defects in the heart and blood vessels during birth contribute an important cause of childhood morbidity and mortality worldwide. Many Single nucletotide polymorphisms (SNPs) in different genes have been associated with various types of congenital heart defects. NKX 2–5 gene is one among them, which encodes a homeobox-containing transcription factor that plays a crucial role during the initial phases of heart formation and development. Mutations in this gene could cause different types of congenital heart defects, including Atrial septal defect (ASD), Atrial ventricular block (AVB), Tetralogy of fallot and ventricular septal defect. This highlights the importance of studying the impact of different SNPs found within this gene that might cause structural and functional modification of its encoded protein. In this study, we retrieved SNPs from the database (dbSNP), followed by identification of potentially deleterious Non-synonymous single nucleotide polymorphisms (nsSNPs) and prediction of their effect on proteins by computational screening using SIFT and Polyphen. Furthermore, we have carried out molecular dynamic simulation (MDS) in order to uncover the SNPs that would cause the most structural damage to the protein altering its biological function. The most important SNP that was found using our approach was rs137852685 R161P, which was predicted to cause the most damage to the structural features of the protein. Mapping nsSNPs in genes such as NKX 2–5 would provide valuable information about individuals carrying these polymorphisms, where such variations could be used as diagnostic markers.
Collapse
|
5294
|
Arakelyan A, Nersisyan L, Petrek M, Löffler-Wirth H, Binder H. Cartography of Pathway Signal Perturbations Identifies Distinct Molecular Pathomechanisms in Malignant and Chronic Lung Diseases. Front Genet 2016; 7:79. [PMID: 27200087 PMCID: PMC4859092 DOI: 10.3389/fgene.2016.00079] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Accepted: 04/20/2016] [Indexed: 12/16/2022] Open
Abstract
Lung diseases are described by a wide variety of developmental mechanisms and clinical manifestations. Accurate classification and diagnosis of lung diseases are the bases for development of effective treatments. While extensive studies are conducted toward characterization of various lung diseases at molecular level, no systematic approach has been developed so far. Here we have applied a methodology for pathway-centered mining of high throughput gene expression data to describe a wide range of lung diseases in the light of shared and specific pathway activity profiles. We have applied an algorithm combining a Pathway Signal Flow (PSF) algorithm for estimation of pathway activity deregulation states in lung diseases and malignancies, and a Self Organizing Maps algorithm for classification and clustering of the pathway activity profiles. The analysis results allowed clearly distinguish between cancer and non-cancer lung diseases. Lung cancers were characterized by pathways implicated in cell proliferation, metabolism, while non-malignant lung diseases were characterized by deregulations in pathways involved in immune/inflammatory response and fibrotic tissue remodeling. In contrast to lung malignancies, chronic lung diseases had relatively heterogeneous pathway deregulation profiles. We identified three groups of interstitial lung diseases and showed that the development of characteristic pathological processes, such as fibrosis, can be initiated by deregulations in different signaling pathways. In conclusion, this paper describes the pathobiology of lung diseases from systems viewpoint using pathway centered high-dimensional data mining approach. Our results contribute largely to current understanding of pathological events in lung cancers and non-malignant lung diseases. Moreover, this paper provides new insight into molecular mechanisms of a number of interstitial lung diseases that have been studied to a lesser extent.
Collapse
Affiliation(s)
- Arsen Arakelyan
- Group of Bioinformatics, Institute of Molecular Biology, National Academy of SciencesYerevan, Armenia; College of Science and Engineering, American University of ArmeniaYerevan, Armenia
| | - Lilit Nersisyan
- Group of Bioinformatics, Institute of Molecular Biology, National Academy of SciencesYerevan, Armenia; College of Science and Engineering, American University of ArmeniaYerevan, Armenia
| | - Martin Petrek
- Laboratory of Immunogenomics, Department of Pathological Physiology, Faculty of Medicine and Dentistry, Institute of Molecular and Translational Medicine, Palacky University Olomouc Olomouc, Czech Republic
| | - Henry Löffler-Wirth
- Interdisciplinary Centre for Bioinformatics, University of Leipzig Leipzig, Germany
| | - Hans Binder
- Interdisciplinary Centre for Bioinformatics, University of Leipzig Leipzig, Germany
| |
Collapse
|
5295
|
DensityMap: a genome viewer for illustrating the densities of features. BMC Bioinformatics 2016; 17:204. [PMID: 27153821 PMCID: PMC4858867 DOI: 10.1186/s12859-016-1055-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Accepted: 04/13/2016] [Indexed: 12/13/2022] Open
Abstract
Background Several tools are available for visualizing genomic data. Some, such as Gbrowse and Jbrowse, are very efficient for small genomic regions, but they are not suitable for entire genomes. Others, like Phenogram and CViT, can be used to visualise whole genomes, but are not designed to display very dense genomic features (eg: interspersed repeats). We have therefore developed DensityMap, a lightweight Perl program that can display the densities of several features (genes, ncRNA, cpg, etc.) along chromosomes on the scale of the whole genome. A critical advantage of DensityMap is that it uses GFF annotation files directly to compute the densities of features without needing additional information from the user. The resulting picture is readily configurable, and the colour scales used can be customized for a best fit to the data plotted. Results DensityMap runs on Linux architecture with few requirements so that users can easily and quickly visualize the distributions and densities of genomic features for an entire genome. The input is GFF3-formated data representing chromosomes (linkage groups or pseudomolecules) and sets of features which are used to calculate representations in density maps. In practise, DensityMap uses a tilling window to compute the density of one or more features and the number of bases covered by these features along chromosomes. The densities are represented by colour scales that can be customized to highlight critical points. DensityMap can compare the distributions of features; it calculates several chromosomal density maps in a single image, each of which describes a different genomic feature. It can also use the genome nucleotide sequence to compute and plot a density map of the GC content along chromosomes. Conclusions DensityMap is a compact, easily-used tool for displaying the distribution and density of all types of genomic features within a genome. It is flexible enough to visualize the densities of several types of features in a single representation. The images produced are readily configurable and their SVG format ensures that they can be edited. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1055-0) contains supplementary material, which is available to authorized users.
Collapse
|
5296
|
Oetjens MT, Brown-Gentry K, Goodloe R, Dilks HH, Crawford DC. Population Stratification in the Context of Diverse Epidemiologic Surveys Sans Genome-Wide Data. Front Genet 2016; 7:76. [PMID: 27200085 PMCID: PMC4858524 DOI: 10.3389/fgene.2016.00076] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 04/18/2016] [Indexed: 01/01/2023] Open
Abstract
Population stratification or confounding by genetic ancestry is a potential cause of false associations in genetic association studies. Estimation of and adjustment for genetic ancestry has become common practice thanks in part to the availability of ancestry informative markers on genome-wide association study (GWAS) arrays. While array data is now widespread, these data are not ubiquitous as several large epidemiologic and clinic-based studies lack genome-wide data. One such large epidemiologic-based study lacking genome-wide data accessible to investigators is the National Health and Nutrition Examination Surveys (NHANES), population-based cross-sectional surveys of Americans linked to demographic, health, and lifestyle data conducted by the Centers for Disease Control and Prevention. DNA samples (n = 14,998) were extracted from biospecimens from consented NHANES participants between 1991-1994 (NHANES III, phase 2) and 1999-2002 and represent three major self-identified racial/ethnic groups: non-Hispanic whites (n = 6,634), non-Hispanic blacks (n = 3,458), and Mexican Americans (n = 3,950). We as the Epidemiologic Architecture for Genes Linked to Environment study genotyped candidate gene and GWAS-identified index variants in NHANES as part of the larger Population Architecture using Genomics and Epidemiology I study for collaborative genetic association studies. To enable basic quality control such as estimation of genetic ancestry to control for population stratification in NHANES san genome-wide data, we outline here strategies that use limited genetic data to identify the markers optimal for characterizing genetic ancestry. From among 411 and 295 autosomal SNPs available in NHANES III and NHANES 1999-2002, we demonstrate that markers with ancestry information can be identified to estimate global ancestry. Despite limited resolution, global genetic ancestry is highly correlated with self-identified race for the majority of participants, although less so for ethnicity. Overall, the strategies outlined here for a large epidemiologic study can be applied to other datasets accessible for genotype-phenotype studies but are sans genome-wide data.
Collapse
Affiliation(s)
- Matthew T. Oetjens
- Center for Human Genetics Research Vanderbilt University, NashvilleTN, USA
| | | | - Robert Goodloe
- Center for Human Genetics Research Vanderbilt University, NashvilleTN, USA
| | | | - Dana C. Crawford
- Department of Epidemiology and Biostatistics, Institute for Computational Biology, Case Western Reserve University, ClevelandOH, USA
| |
Collapse
|
5297
|
Bouyioukos C, Bucchini F, Elati M, Képès F. GREAT: a web portal for Genome Regulatory Architecture Tools. Nucleic Acids Res 2016; 44:W77-82. [PMID: 27151196 PMCID: PMC4987929 DOI: 10.1093/nar/gkw384] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Accepted: 04/26/2016] [Indexed: 11/15/2022] Open
Abstract
GREAT (Genome REgulatory Architecture Tools) is a novel web portal for tools designed to generate user-friendly and biologically useful analysis of genome architecture and regulation. The online tools of GREAT are freely accessible and compatible with essentially any operating system which runs a modern browser. GREAT is based on the analysis of genome layout -defined as the respective positioning of co-functional genes- and its relation with chromosome architecture and gene expression. GREAT tools allow users to systematically detect regular patterns along co-functional genomic features in an automatic way consisting of three individual steps and respective interactive visualizations. In addition to the complete analysis of regularities, GREAT tools enable the use of periodicity and position information for improving the prediction of transcription factor binding sites using a multi-view machine learning approach. The outcome of this integrative approach features a multivariate analysis of the interplay between the location of a gene and its regulatory sequence. GREAT results are plotted in web interactive graphs and are available for download either as individual plots, self-contained interactive pages or as machine readable tables for downstream analysis. The GREAT portal can be reached at the following URL https://absynth.issb.genopole.fr/GREAT and each individual GREAT tool is available for downloading.
Collapse
Affiliation(s)
- Costas Bouyioukos
- iSSB, CNRS, Genopole, UEVE, Université Paris-Saclay, 5 rue Henri Desbruères, Évry 91030 Cedex, France
| | - François Bucchini
- iSSB, CNRS, Genopole, UEVE, Université Paris-Saclay, 5 rue Henri Desbruères, Évry 91030 Cedex, France
| | - Mohamed Elati
- iSSB, CNRS, Genopole, UEVE, Université Paris-Saclay, 5 rue Henri Desbruères, Évry 91030 Cedex, France
| | - François Képès
- iSSB, CNRS, Genopole, UEVE, Université Paris-Saclay, 5 rue Henri Desbruères, Évry 91030 Cedex, France
| |
Collapse
|
5298
|
Urnikyte A, Domarkiene I, Stoma S, Ambrozaityte L, Uktveryte I, Meskiene R, Kasiulevičius V, Burokiene N, Kučinskas V. CNV analysis in the Lithuanian population. BMC Genet 2016; 17:64. [PMID: 27142071 PMCID: PMC4855864 DOI: 10.1186/s12863-016-0373-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 04/22/2016] [Indexed: 12/13/2022] Open
Abstract
Background Although copy number variation (CNV) has received much attention, knowledge about the characteristics of CNVs such as occurrence rate and distribution in the genome between populations and within the same population is still insufficient. In this study, Illumina 770 K HumanOmniExpress-12 v1.0 (and v1.1) arrays were used to examine the diversity and distribution of CNVs in 286 unrelated individuals from the two main ethnolinguistic groups of the Lithuanian population (Aukštaičiai and Žemaičiai) (see Additional file 3). For primary data analysis, the Illumina GenomeStudio™ Genotyping Module v1.9 and two algorithms, cnvPartition 3.2.0 and QuantiSNP 2.0, were used to identify high-confidence CNVs. Results A total of 478 autosomal CNVs were detected by both algorithms, and those were clustered in 87 copy number variation regions (CNVRs), spanning ~12.5 Mb of the genome (see Table 1). At least 8.6 % of the CNVRs were unique and had not been reported in the Database of Genomic Variants. Most CNVRs (57.5 %) were rare, with a frequency of <1 %, whereas common CNVRs with at least 5 % frequency made up only 1.1 % of all CNVRs identified. About 49 % of non-singleton CNVRs were shared between Aukštaičiai and Žemaičiai, and the remaining CNVRs were specific to each group. Many of the CNVs detected (66 %) overlapped with known UCSC gene regions. Conclusions The ethnolinguistic groups of the Lithuanian population could not be differentiated based on CNV profiles, which may reflect their geographical proximity and suggest the homogeneity of the Lithuanian population. In addition, putative novel CNVs unique to the Lithuanian population were identified. The results of our study enhance the CNV map of the Lithuanian population. Electronic supplementary material The online version of this article (doi:10.1186/s12863-016-0373-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- A Urnikyte
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania.
| | - I Domarkiene
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - S Stoma
- Master of Science (MSc), Bioinformatics student, VU University Amsterdam, Amsterdam, Netherlands
| | - L Ambrozaityte
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - I Uktveryte
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - R Meskiene
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - V Kasiulevičius
- Clinics of Internal Diseases, Family Medicine and Oncology, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - N Burokiene
- Clinics of Internal Diseases, Family Medicine and Oncology, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - V Kučinskas
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| |
Collapse
|
5299
|
Gómez-Vela F, Barranco CD, Díaz-Díaz N. Incorporating biological knowledge for construction of fuzzy networks of gene associations. Appl Soft Comput 2016. [DOI: 10.1016/j.asoc.2016.01.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
5300
|
Liu X, Tang S, Jia G, Schnable JC, Su H, Tang C, Zhi H, Diao X. The C-terminal motif of SiAGO1b is required for the regulation of growth, development and stress responses in foxtail millet (Setaria italica (L.) P. Beauv). JOURNAL OF EXPERIMENTAL BOTANY 2016; 67:3237-49. [PMID: 27045099 PMCID: PMC4892719 DOI: 10.1093/jxb/erw135] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Foxtail millet (Setaria italica (L.) P. Beauv), which belongs to the Panicoideae tribe of the Poaceae, is an important grain crop widely grown in Northern China and India. It is currently developing into a novel model species for functional genomics of the Panicoideae as a result of its fully available reference genome sequence, small diploid genome (2n=18, ~510Mb), short life cycle, small stature and prolific seed production. Argonaute 1 (AGO1), belonging to the argonaute (AGO) protein family, recruits small RNAs and regulates plant growth and development. Here, we characterized an AGO1 mutant (siago1b) in foxtail millet, which was induced by ethyl methanesulfonate treatment. The mutant exhibited pleiotropic developmental defects, including dwarfing stem, narrow and rolled leaves, smaller panicles and lower rates of seed setting. Map-based cloning analysis demonstrated that these phenotypic variations were attributed to a C-A transversion, and a 7-bp deletion in the C-terminus of the SiAGO1b gene in siago1b Yeast two-hybrid assays and BiFC experiments revealed that the mutated region was an essential functional motif for the interaction between SiAGO1b and SiHYL1. Furthermore, 1598 differentially expressed genes were detected via RNA-seq-based comparison of SiAGO1b and wild-type plants, which revealed that SiAGO1b mutation influenced multiple biological processes, including energy metabolism, cell growth, programmed death and abiotic stress responses in foxtail millet. This study may provide a better understanding of the mechanisms by which SiAGO1b regulates the growth and development of crops.
Collapse
Affiliation(s)
- Xiaotong Liu
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Sha Tang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Guanqing Jia
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - James C Schnable
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China. Agronomy & Horticulture, University of Nebraska-Lincoln, Beadle Center E207, Lincoln, NE 68583-0660, USA
| | - Haixia Su
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Chanjuan Tang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Hui Zhi
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China.
| | - Xianmin Diao
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China.
| |
Collapse
|