101
|
Seif Y, Palsson BØ. Path to improving the life cycle and quality of genome-scale models of metabolism. Cell Syst 2021; 12:842-859. [PMID: 34555324 PMCID: PMC8480436 DOI: 10.1016/j.cels.2021.06.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 02/17/2021] [Accepted: 06/23/2021] [Indexed: 11/28/2022]
Abstract
Genome-scale models of metabolism (GEMs) are key computational tools for the systems-level study of metabolic networks. Here, we describe the "GEM life cycle," which we subdivide into four stages: inception, maturation, specialization, and amalgamation. We show how different types of GEM reconstruction workflows fit in each stage and proceed to highlight two fundamental bottlenecks for GEM quality improvement: GEM maturation and content removal. We identify common characteristics contributing to increasing quality of maturing GEMs drawing from past independent GEM maturation efforts. We then shed some much-needed light on the latent and unrecognized but pervasive issue of content removal, demonstrating the substantial effects of model pruning on its solution space. Finally, we propose a novel framework for content removal and associated confidence-level assignment which will help guide future GEM development efforts, reduce duplication of effort across groups, potentially aid automated reconstruction platforms, and boost the reproducibility of model development.
Collapse
|
102
|
Griffith D, Holehouse AS. PARROT is a flexible recurrent neural network framework for analysis of large protein datasets. eLife 2021; 10:e70576. [PMID: 34533455 PMCID: PMC8448528 DOI: 10.7554/elife.70576] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 09/06/2021] [Indexed: 11/29/2022] Open
Abstract
The rise of high-throughput experiments has transformed how scientists approach biological questions. The ubiquity of large-scale assays that can test thousands of samples in a day has necessitated the development of new computational approaches to interpret this data. Among these tools, machine learning approaches are increasingly being utilized due to their ability to infer complex nonlinear patterns from high-dimensional data. Despite their effectiveness, machine learning (and in particular deep learning) approaches are not always accessible or easy to implement for those with limited computational expertise. Here we present PARROT, a general framework for training and applying deep learning-based predictors on large protein datasets. Using an internal recurrent neural network architecture, PARROT is capable of tackling both classification and regression tasks while only requiring raw protein sequences as input. We showcase the potential uses of PARROT on three diverse machine learning tasks: predicting phosphorylation sites, predicting transcriptional activation function of peptides generated by high-throughput reporter assays, and predicting the fibrillization propensity of amyloid beta with data generated by deep mutational scanning. Through these examples, we demonstrate that PARROT is easy to use, performs comparably to state-of-the-art computational tools, and is applicable for a wide array of biological problems.
Collapse
|
103
|
Hu D, Yang J, Qi Y, Li B, Li K, Mok KM. Metagenomic Analysis of Fecal Archaea, Bacteria, Eukaryota, and Virus in Przewalski's Horses Following Anthelmintic Treatment. Front Vet Sci 2021; 8:708512. [PMID: 34490397 PMCID: PMC8416479 DOI: 10.3389/fvets.2021.708512] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 07/07/2021] [Indexed: 12/25/2022] Open
Abstract
Intestinal microbiota is involved in immune response and metabolism of the host. The frequent use of anthelmintic compounds for parasite expulsion causes disturbance to the equine intestinal microbiota. However, most studies were on the effects of such treatment on the intestinal bacterial microbes; none is on the entire microbial community including archaea and eukaryotic and viral community in equine animals. This study is the first to explore the differences of the microbial community composition and structure in Przewalski's horses prior to and following anthelmintic treatment, and to determine the corresponding changes of their functional attributes based on metagenomic sequencing. Results showed that in archaea, the methanogen of Euryarchaeota was the dominant phylum. Under this phylum, anthelmintic treatment increased the Methanobrevibacter genus and decreased the Methanocorpusculum genus and two other dominant archaea species, Methanocorpusculum labreanum and Methanocorpusculum bavaricum. In bacteria, Firmicutes and Bacteroidetes were the dominant phyla. Anthelmintic treatment increased the genera of Clostridium and Eubacterium and decreased those of Bacteroides and Prevotella and dominant bacteria species. These altered genera were associated with immunity and digestion. In eukaryota, anthelmintic treatment also changed the genera related to digestion and substantially decreased the relative abundances of identified species. In virus, anthelmintic treatment increased the genus of unclassified_d__Viruses and decreased those of unclassified_f__Siphoviridae and unclassified_f__Myoviridae. Most of the identified viral species were classified into phage, which were more sensitive to anthelmintic treatment than other viruses. Furthermore, anthelmintic treatment was found to increase the number of pathogens related to some clinical diseases in horses. The COG and KEGG function analysis showed that the intestinal microbiota of Przewalski's horse mainly participated in the carbohydrate and amino acid metabolism. The anthelmintic treatment did not change their overall function; however, it displaced the population of the functional microbes involved in each function or pathway. These results provide a complete view on the changes caused by anthelmintic treatment in the intestinal microbiota of the Przewalski's horses.
Collapse
|
104
|
Reed CJ, Hutinet G, de Crécy-Lagard V. Comparative Genomic Analysis of the DUF34 Protein Family Suggests Role as a Metal Ion Chaperone or Insertase. Biomolecules 2021; 11:1282. [PMID: 34572495 PMCID: PMC8469502 DOI: 10.3390/biom11091282] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/20/2021] [Accepted: 08/24/2021] [Indexed: 12/12/2022] Open
Abstract
Members of the DUF34 (domain of unknown function 34) family, also known as the NIF3 protein superfamily, are ubiquitous across superkingdoms. Proteins of this family have been widely annotated as "GTP cyclohydrolase I type 2" through electronic propagation based on one study. Here, the annotation status of this protein family was examined through a comprehensive literature review and integrative bioinformatic analyses that revealed varied pleiotropic associations and phenotypes. This analysis combined with functional complementation studies strongly challenges the current annotation and suggests that DUF34 family members may serve as metal ion insertases, chaperones, or metallocofactor maturases. This general molecular function could explain how DUF34 subgroups participate in highly diversified pathways such as cell differentiation, metal ion homeostasis, pathogen virulence, redox, and universal stress responses.
Collapse
|
105
|
Saha S, Cooksey AM, Childers AK, Poelchau MF, McCarthy FM. Workflows for Rapid Functional Annotation of Diverse Arthropod Genomes. INSECTS 2021; 12:748. [PMID: 34442314 PMCID: PMC8397112 DOI: 10.3390/insects12080748] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/16/2021] [Accepted: 08/11/2021] [Indexed: 12/03/2022]
Abstract
Genome sequencing of a diverse array of arthropod genomes is already underway, and these genomes will be used to study human health, agriculture, biodiversity, and ecology. These new genomes are intended to serve as community resources and provide the foundational information required to apply 'omics technologies to a more diverse set of species. However, biologists require genome annotation to use these genomes and derive a better understanding of complex biological systems. Genome annotation incorporates two related, but distinct, processes: Demarcating genes and other elements present in genome sequences (structural annotation); and associating a function with genetic elements (functional annotation). While there are well-established and freely available workflows for structural annotation of gene identification in newly assembled genomes, workflows for providing the functional annotation required to support functional genomics studies are less well understood. Genome-scale functional annotation is required for functional modeling (enrichment, networks, etc.). A first-pass genome-wide functional annotation effort can rapidly identify under-represented gene sets for focused community annotation efforts. We present an open-source, open access, and containerized pipeline for genome-scale functional annotation of insect proteomes and apply it to various arthropod species. We show that the performance of the predictions is consistent across a set of arthropod genomes with varying assembly and annotation quality.
Collapse
|
106
|
Yengo L, Yang J, Keller MC, Goddard ME, Wray NR, Visscher PM. Genomic partitioning of inbreeding depression in humans. Am J Hum Genet 2021; 108:1488-1501. [PMID: 34214457 DOI: 10.1016/j.ajhg.2021.06.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 06/01/2021] [Indexed: 02/05/2023] Open
Abstract
Across species, offspring of related individuals often exhibit significant reduction in fitness-related traits, known as inbreeding depression (ID), yet the genetic and molecular basis for ID remains elusive. Here, we develop a method to quantify enrichment of ID within specific genomic annotations and apply it to human data. We analyzed the phenomes and genomes of ∼350,000 unrelated participants of the UK Biobank and found, on average of over 11 traits, significant enrichment of ID within genomic regions with high recombination rates (>21-fold; p < 10-5), with conserved function across species (>19-fold; p < 10-4), and within regulatory elements such as DNase I hypersensitive sites (∼5-fold; p = 8.9 × 10-7). We also quantified enrichment of ID within trait-associated regions and found suggestive evidence that genomic regions contributing to additive genetic variance in the population are enriched for ID signal. We find strong correlations between functional enrichment of SNP-based heritability and that of ID (r = 0.8, standard error: 0.1). These findings provide empirical evidence that ID is most likely due to many partially recessive deleterious alleles in low linkage disequilibrium regions of the genome. Our study suggests that functional characterization of ID may further elucidate the genetic architectures and biological mechanisms underlying complex traits and diseases.
Collapse
|
107
|
Xiao Q, Li Z, Qu M, Xu W, Su Z, Yang J. LjaFGD: Lonicera japonica functional genomics database. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2021; 63:1422-1436. [PMID: 33982879 DOI: 10.1111/jipb.13112] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 05/09/2021] [Indexed: 06/12/2023]
Abstract
Lonicera japonica Thunb., a traditional Chinese herb, has been used for treating human diseases for thousands of years. Recently, the genome of L. japonica has been decoded, providing valuable information for research into gene function. However, no comprehensive database for gene functional analysis and mining is available for L. japonica. We therefore constructed LjaFGD (www.gzybioinformatics.cn/LjaFGD and bioinformatics.cau.edu.cn/LjaFGD), a database for analyzing and comparing gene function in L. japonica. We constructed a gene co-expression network based on 77 RNA-seq samples, and then annotated genes of L. japonica by alignment against protein sequences from public databases. We also introduced several tools for gene functional analysis, including Blast, motif analysis, gene set enrichment analysis, heatmap analysis, and JBrowse. Our co-expression network revealed that MYB and WRKY transcription factor family genes were co-expressed with genes encoding key enzymes in the biosynthesis of chlorogenic acid and luteolin in L. japonica. We used flavonol synthase 1 (LjFLS1) as an example to show the reliability and applicability of our database. LjaFGD and its various associated tools will provide researchers with an accessible platform for retrieving functional information on L. japonica genes to further biological discovery.
Collapse
|
108
|
Harris BD, Crow M, Fischer S, Gillis J. Single-cell co-expression analysis reveals that transcriptional modules are shared across cell types in the brain. Cell Syst 2021; 12:748-756.e3. [PMID: 34015329 PMCID: PMC8298279 DOI: 10.1016/j.cels.2021.04.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 02/11/2021] [Accepted: 04/23/2021] [Indexed: 12/27/2022]
Abstract
Gene-gene relationships are commonly measured via the co-variation of gene expression across samples, also known as gene co-expression. Because shared expression patterns are thought to reflect shared function, co-expression networks describe functional relationships between genes, including co-regulation. However, the heterogeneity of cell types in bulk RNA-seq samples creates connections in co-expression networks that potentially obscure co-regulatory modules. The brain initiative cell census network (BICCN) single-cell RNA sequencing (scRNA-seq) datasets provide an unparalleled opportunity to understand how gene-gene relationships shape cell identity. Comparison of the BICCN data (500,000 cells/nuclei across 7 BICCN datasets) with that of bulk RNA-seq networks (2,000 mouse brain samples across 52 studies) reveals a consistent topology reflecting a shared co-regulatory signal. Differential signals between broad cell classes persist in driving variation at finer levels, indicating that convergent regulatory processes affect cell phenotype at multiple scales.
Collapse
|
109
|
Minegishi R, Gotoh O, Tanaka N, Maruyama R, Chang JT, Mori S. A method of sample-wise region-set enrichment analysis for DNA methylomics. Epigenomics 2021; 13:1081-1093. [PMID: 34241544 DOI: 10.2217/epi-2021-0065] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Aim: Gene set analysis has commonly been used to interpret DNA methylome data. However, summarizing the DNA methylation level of a gene is challenging due to variability in the number, density and methylation levels of CpG sites, and the numerous intergenic CpGs. Instead, we propose to use region sets to annotate the DNA methylome. Methods: We developed single sample region-set enrichment analysis for DNA methylome (methyl-ssRSEA) to conduct sample-wise, region-set enrichment analysis. Results: Methyl-ssRSEA can handle both microarray- and sequencing-based platforms and reproducibly recover the known biology from the methylation profiles of peripheral blood cells and breast cancers. The performance was superior to existing tools for region-set analysis in discriminating blood cell types. Conclusion: Methyl-ssRSEA offers a novel way to functionally interpret the DNA methylome in the cell.
Collapse
|
110
|
Baxter JS, Johnson N, Tomczyk K, Gillespie A, Maguire S, Brough R, Fachal L, Michailidou K, Bolla MK, Wang Q, Dennis J, Ahearn TU, Andrulis IL, Anton-Culver H, Antonenkova NN, Arndt V, Aronson KJ, Augustinsson A, Becher H, Beckmann MW, Behrens S, Benitez J, Bermisheva M, Bogdanova NV, Bojesen SE, Brenner H, Brucker SY, Cai Q, Campa D, Canzian F, Castelao JE, Chan TL, Chang-Claude J, Chanock SJ, Chenevix-Trench G, Choi JY, Clarke CL, Colonna S, Conroy DM, Couch FJ, Cox A, Cross SS, Czene K, Daly MB, Devilee P, Dörk T, Dossus L, Dwek M, Eccles DM, Ekici AB, Eliassen AH, Engel C, Fasching PA, Figueroa J, Flyger H, Gago-Dominguez M, Gao C, García-Closas M, García-Sáenz JA, Ghoussaini M, Giles GG, Goldberg MS, González-Neira A, Guénel P, Gündert M, Haeberle L, Hahnen E, Haiman CA, Hall P, Hamann U, Hartman M, Hatse S, Hauke J, Hollestelle A, Hoppe R, Hopper JL, Hou MF, Ito H, Iwasaki M, Jager A, Jakubowska A, Janni W, John EM, Joseph V, Jung A, Kaaks R, Kang D, Keeman R, Khusnutdinova E, Kim SW, Kosma VM, Kraft P, Kristensen VN, Kubelka-Sabit K, Kurian AW, Kwong A, Lacey JV, Lambrechts D, Larson NL, Larsson SC, Le Marchand L, Lejbkowicz F, Li J, Long J, Lophatananon A, Lubiński J, Mannermaa A, Manoochehri M, Manoukian S, Margolin S, Matsuo K, Mavroudis D, Mayes R, Menon U, Milne RL, Mohd Taib NA, Muir K, Muranen TA, Murphy RA, Nevanlinna H, O'Brien KM, Offit K, Olson JE, Olsson H, Park SK, Park-Simon TW, Patel AV, Peterlongo P, Peto J, Plaseska-Karanfilska D, Presneau N, Pylkäs K, Rack B, Rennert G, Romero A, Ruebner M, Rüdiger T, Saloustros E, Sandler DP, Sawyer EJ, Schmidt MK, Schmutzler RK, Schneeweiss A, Schoemaker MJ, Shah M, Shen CY, Shu XO, Simard J, Southey MC, Stone J, Surowy H, Swerdlow AJ, Tamimi RM, Tapper WJ, Taylor JA, Teo SH, Teras LR, Terry MB, Toland AE, Tomlinson I, Truong T, Tseng CC, Untch M, Vachon CM, van den Ouweland AMW, Wang SS, Weinberg CR, Wendt C, Winham SJ, Winqvist R, Wolk A, Wu AH, Yamaji T, Zheng W, Ziogas A, Pharoah PDP, Dunning AM, Easton DF, Pettitt SJ, Lord CJ, Haider S, Orr N, Fletcher O. Functional annotation of the 2q35 breast cancer risk locus implicates a structural variant in influencing activity of a long-range enhancer element. Am J Hum Genet 2021; 108:1190-1203. [PMID: 34146516 PMCID: PMC8322933 DOI: 10.1016/j.ajhg.2021.05.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 05/25/2021] [Indexed: 12/21/2022] Open
Abstract
A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74-0.81, p = 3.1 × 10-31).
Collapse
|
111
|
De Novo Sporophyte Transcriptome Assembly and Functional Annotation in the Endangered Fern Species Vandenboschia speciosa (Willd.) G. Kunkel. Genes (Basel) 2021; 12:genes12071017. [PMID: 34208974 PMCID: PMC8304985 DOI: 10.3390/genes12071017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 06/25/2021] [Accepted: 06/28/2021] [Indexed: 11/17/2022] Open
Abstract
We sequenced the sporophyte transcriptome of Killarney fern (Vandenboschia speciosa (Willd.) G. Kunkel). In addition to being a rare endangered Macaronesian-European endemism, this species has a huge genome (10.52 Gb) as well as particular biological features and extreme ecological requirements. These characteristics, together with the systematic position of ferns among vascular plants, make it of high interest for evolutionary, conservation and functional genomics studies. The transcriptome was constructed de novo and contained 36,430 transcripts, of which 17,706 had valid BLAST hits. A total of 19,539 transcripts showed at least one of the 7362 GO terms assigned to the transcriptome, whereas 6547 transcripts showed at least one of the 1359 KEGG assigned terms. A prospective analysis of functional annotation results provided relevant insights on genes involved in important functions such as growth and development as well as physiological adaptations. In this context, a catalogue of genes involved in the genetic control of plant development, during the vegetative to reproductive transition, in stress response as well as genes coding for transcription factors is given. Altogether, this study provides a first step towards understanding the gene expression of a significant fern species and the in silico functional and comparative analyses reported here provide important data and insights for further comparative evolutionary studies in ferns and land plants in general.
Collapse
|
112
|
Identification of Sex-Related Genes from the Three-Spot Swimming Crab Portunus sanguinolentus and Comparative Analysis with the Crucifix Crab Charybdis feriatus. Animals (Basel) 2021; 11:ani11071946. [PMID: 34209957 PMCID: PMC8300171 DOI: 10.3390/ani11071946] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Revised: 06/12/2021] [Accepted: 06/22/2021] [Indexed: 11/17/2022] Open
Abstract
Simple Summary Crabs within the family Portunidae are important marine species in both aquaculture and fishery sectors. The current aquaculture status of most portunids still relies on wild-caught fisheries due to the lack of essential knowledge regarding their reproductive biology and underlying governing mechanism. In the present study, we compared the differentially expressed genes (DEGs) between the different sexes of Portunus sanguinolentus based on their gonadal transcriptome profiles and subsequently contrasted them with the gonadal DEGs of Charybdis feriatus, the other member of the family Portunidae. In total, 40,964 DEGs between the ovaries and testes of P. sanguinolentus were uncovered, with 27,578 up-regulated and 13,386 down-regulated in females. After comparison, C. feriatus has approximately 63.5% of genes in common with P. sanguinolentus, with 62.6% showing similar expression patterns. Interestingly, the DMRT gene was specifically expressed in male P. sanguinolentus, while its homologous gene—doublesex (DSX)—was specifically expressed in male C. feriatus. The DEGs obtained from the gonadal transcriptome of P. sanguinolentus are a beneficial resource for future genetic and genomic research in P. sanguinolentus and its close species. The transcriptomic comparison analysis might provide references for better understanding the sex determination and differentiation mechanisms among portunids. Abstract Crabs within the family Portunidae are important marine species in both aquaculture and fishery sectors. The current aquaculture status of most portunids, however, still relies on wild-caught fisheries due to the lack of essential knowledge regarding their reproductive biology and underlying governing mechanism. With the advancement of sequencing technology, transcriptome sequencing has been progressively used to understand various physiological processes, especially on non-model organisms. In the present study, we compared the differentially expressed genes (DEGs) between sexes of Portunus sanguinolentus based on their gonadal transcriptome profiles and subsequently contrasted them with the gonadal DEGs of Charybdis feriatus, the other member of Family Portunidae. In total, 40,964 DEGs between ovaries and testes were uncovered, with 27,578 up- and 13,386 down-regulated in females. Among those, some sex-related DEGs were identified, including a dmrt-like (DMRT) gene which was specifically expressed in males. C. feriatus has approximately 63.5% of genes common with P. sanguinolentus, with 62.6% showing similar expression patterns. Interestingly, the DMRT gene was specifically expressed in male P. sanguinolentus while its homologous gene—doublesex (DSX)—was specifically expressed in male C. feriatus. The DEGs obtained from the gonadal transcriptome of P. sanguinolentus are a beneficial resource for future genetic and genomic research in P. sanguinolentus and its close species. The transcriptomic comparison analysis might provide references for better understanding the sex determination and differentiation mechanisms among portunids.
Collapse
|
113
|
Harrison PW, Sokolov A, Nayak A, Fan J, Zerbino D, Cochrane G, Flicek P. The FAANG Data Portal: Global, Open-Access, "FAIR", and Richly Validated Genotype to Phenotype Data for High-Quality Functional Annotation of Animal Genomes. Front Genet 2021; 12:639238. [PMID: 34220930 PMCID: PMC8248360 DOI: 10.3389/fgene.2021.639238] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 05/04/2021] [Indexed: 11/13/2022] Open
Abstract
The Functional Annotation of ANimal Genomes (FAANG) project is a worldwide coordinated action creating high-quality functional annotation of farmed and companion animal genomes. The generation of a rich genome-to-phenome resource and supporting informatic infrastructure advances the scope of comparative genomics and furthers the understanding of functional elements. The project also provides terrestrial and aquatic animal agriculture community powerful resources for supporting improvements to farmed animal production, disease resistance, and genetic diversity. The FAANG Data Portal (https://data.faang.org) ensures Findable, Accessible, Interoperable and Reusable (FAIR) open access to the wealth of sample, sequencing, and analysis data produced by an ever-growing number of FAANG consortia. It is developed and maintained by the FAANG Data Coordination Centre (DCC) at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI). FAANG projects produce a standardised set of multi-omic assays with resulting data placed into a range of specialised open data archives. To ensure this data is easily findable and accessible by the community, the portal automatically identifies and collates all submitted FAANG data into a single easily searchable resource. The Data Portal supports direct download from the multiple underlying archives to enable seamless access to all FAANG data from within the portal itself. The portal provides a range of predefined filters, powerful predictive search, and a catalogue of sampling and analysis protocols and automatically identifies publications associated with any dataset. To ensure all FAANG data submissions are high-quality, the portal includes powerful contextual metadata validation and data submissions brokering to the underlying EMBL-EBI archives. The portal will incorporate extensive new technical infrastructure to effectively deliver and standardise FAANG's shift to single-cellomics, cell atlases, pangenomes, and novel phenotypic prediction models. The Data Portal plays a key role for FAANG by supporting high-quality functional annotation of animal genomes, through open FAIR sharing of data, complete with standardised rich metadata. Future Data Portal features developed by the DCC will support new technological developments for continued improvement for FAANG projects.
Collapse
|
114
|
Chitneedi PK, Weikard R, Arranz JJ, Martínez-Valladares M, Kuehn C, Gutiérrez-Gil B. Identification of Regulatory Functions of LncRNAs Associated With T. circumcincta Infection in Adult Sheep. Front Genet 2021; 12:685341. [PMID: 34194481 PMCID: PMC8236958 DOI: 10.3389/fgene.2021.685341] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 05/20/2021] [Indexed: 12/13/2022] Open
Abstract
Several recent studies have demonstrated the role of long non-coding RNAs (lncRNAs) in regulating the defense mechanism against parasite infections, but no studies are available that investigated their relevance for immune response to nematode infection in sheep. Thus, the aim of the current study was to (i) detect putative lncRNAs that are expressed in the abomasal lymph node of adult sheep after an experimental infection with the gastrointestinal nematode (GIN) Teladorsagia circumcincta and (ii) to elucidate their potential functional role associated with the differential host immune response. We hypothesized that putative lncRNAs differentially expressed (DE) between samples from animals that differ in resistance to infection may play a significant regulatory role in response to nematode infection in adult sheep. To obtain further support for our hypothesis, we performed co-expression and functional gene enrichment analyses with the differentially expressed lncRNAs (DE lncRNAs). In a conservative approach, we included for this predictive analysis only those lncRNAs that are confirmed and supported by documentation of expression in gastrointestinal tissues in the current sheep gene atlas. We identified 9,105 putative lncRNA transcripts corresponding to 7,124 gene loci. Of these, 457 were differentially expressed lncRNA loci (DELs) with 683 lncRNA transcripts. Based on a gene co-expression analysis via weighted gene co-expression network analysis, 12 gene network modules (GNMs) were found significantly correlated with at least one of 10 selected target DE lncRNAs. Based on the principle of “guilt-by-association,” the DE genes from each of the three most significantly correlated GNMs were subjected to a gene enrichment analysis. The significant pathways associated with DE lncRNAs included ERK5 Signaling, SAPK/JNK Signaling, RhoGDI Signaling, EIF2 Signaling, Regulation of eIF4 and p70S6K Signaling and Oxidative Phosphorylation pathways. They belong to signaling pathway categories like Cellular Growth, Proliferation and Development, Cellular Stress and Injury, Intracellular and Second Messenger Signaling and Apoptosis. Overall, this lncRNA study conducted in adult sheep after GIN infection provided first insights into the potential functional role of lncRNAs in the differential host response to nematode infection.
Collapse
|
115
|
Satyam R, Bhardwaj T, Goel S, Jha NK, Jha SK, Nand P, Ruokolainen J, Kamal MA, Kesari KK. miRNAs in SARS-CoV 2: A Spoke in the Wheel of Pathogenesis. Curr Pharm Des 2021; 27:1628-1641. [PMID: 33023438 DOI: 10.2174/1381612826999201001200529] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 07/28/2020] [Accepted: 08/04/2020] [Indexed: 02/08/2023]
Abstract
INTRODUCTION The rapid emergence of Severe Acute Respiratory Syndrome coronavirus 2 (SARS-- CoV-2) has resulted in an increased mortality rate across the globe. However, the underlying mechanism of SARS-CoV-2 altering human immune response is still elusive. The existing literature on miRNA mediated pathogenesis of RNA virus viz. Dengue virus, West Nile virus, etc. raises a suspicion that miRNA encoded by SARS-CoV-2 might facilitate virus replication and regulate the host's gene expression at the post-transcriptional level. METHODS We investigated this possibility via computational prediction of putative miRNAs encoded by the SARS-CoV-2 genome using a novel systematic pipeline that predicts putative mature-miRNA and their targeted genes transcripts. To trace down if viral-miRNAs targeted the genes critical to the immune pathway, we assessed whether mature miRNA transcripts exhibit effective hybridization with the 3'UTR region of human gene transcripts. Conversely, we also tried to study human miRNA-mediated viral gene regulation to get insight into the miRNA mediated offense and defense mechanism of virus and its host organisms in toto. RESULTS Our analysis led us to shortlist six putative miRNAs that target, majorly, genes related to cell proliferation/ differentiation/signaling, and senescence. Nonetheless, they also target immune-related genes that directly/ indirectly orchestrate immune pathways like TNF (Tumor Necrosis Factor) signaling and Chemokine signaling pathways putatively serving as the nucleus to cytokine storms. CONCLUSION Besides, these six miRNAs were found to be conserved so far across 80 complete genomes of SARS-CoV-2 (NCBI Virus, last assessed 12 April 2020) including Indian strains that are also targeted by 7 human miRNAs and can, therefore, be exploited to develop MicroRNA-Attenuated Vaccines.
Collapse
|
116
|
de Mattos Pereira L, de Jezuz MPG, Rangel AR, Baldasso BD, Zaluski AB, Graeff-Teixeira C, Morassutti AL. De novo transcriptome reveals blood coagulation/antithrombin factors and infection mechanisms in Angiostrongylus cantonensis adult worms. Parasitology 2021; 148:857-870. [PMID: 33729108 PMCID: PMC11010222 DOI: 10.1017/s0031182021000469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Revised: 02/17/2021] [Accepted: 03/05/2021] [Indexed: 11/06/2022]
Abstract
Angiostrongylus cantonensis is the main aetiological agent of eosinophilic meningoencephalitis in humans. Several outbreaks have been documented around the world, cementing its status as an emerging global public health concern. As a result, new strategies for the diagnosis, prophylaxis and treatment of cerebral angiostrongyliasis are urgently needed. In this study, we report on the de novo assembly of the A. cantonensis transcriptome, its full functional annotation and a reconstruction of complete metabolic pathways. All results are available at AngiostrongylusDB (http://angiostrongylus.lad.pucrs.br/admin/welcome). The aim of this study was to identify the active genes and metabolic pathways involved in the mechanisms of infection and survival inside Rattus norvegicus. Among 389 metabolic mapped pathways, the blood coagulation/antithrombin pathways of heparan sulphate/heparin are highlighted. Moreover, we identified genes codified to GP63 (leishmanolysin), CALR (calreticulin), ACE (peptidyl-dipeptidase A), myoglobin and vWD (von Willebrand factor type D domain protein) involved in the infection invasion and survival of the parasite. The large dataset of functional annotations provided and the full-length transcripts identified in this research may facilitate future functional genomics studies and provides a basis for the development of new techniques for the diagnosis, prevention and treatment of cerebral angiostrongyliasis.
Collapse
|
117
|
Azmi N, Othman N. Entamoeba histolytica: Proteomics Bioinformatics Reveal Predictive Functions and Protein-Protein Interactions of Differentially Abundant Membrane and Cytosolic Proteins. MEMBRANES 2021; 11:membranes11060376. [PMID: 34063994 PMCID: PMC8224062 DOI: 10.3390/membranes11060376] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 05/18/2021] [Accepted: 05/18/2021] [Indexed: 11/16/2022]
Abstract
Amoebiasis is caused by Entamoeba histolytica and ranked second for parasitic diseases causing death after malaria. E. histolytica membrane and cytosolic proteins play important roles in the pathogenesis. Our previous study had shown several cytosolic proteins were found in the membrane fraction. Therefore, this study aimed to quantify the differential abundance of membrane and cytosolic proteins in membrane versus cytosolic fractions and analyze their predicted functions and interaction. Previous LC-ESI-MS/MS data were analyzed by PERSEUS software for the differentially abundant proteins, then they were classified into their functional annotations and the protein networks were summarized using PantherDB and STRiNG, respectively. The results showed 24 (44.4%) out of the 54 proteins that increased in abundance were membrane proteins and 30 were cytosolic proteins. Meanwhile, 45 cytosolic proteins were found to decrease in abundance. Functional analysis showed differential abundance proteins involved in the molecular function, biological process, and cellular component with 18.88%, 33.04% and, 48.07%, respectively. The STRiNG server predicted that the decreased abundance proteins had more protein-protein network interactions compared to increased abundance proteins. Overall, this study has confirmed the presence of the differentially abundant membrane and cytosolic proteins and provided the predictive functions and interactions between them.
Collapse
|
118
|
Rauchmann BS, Sadlon A, Perneczky R. Soluble TREM2 and Inflammatory Proteins in Alzheimer's Disease Cerebrospinal Fluid. J Alzheimers Dis 2021; 73:1615-1626. [PMID: 31958095 DOI: 10.3233/jad-191120] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The present study explores the associations of soluble TREM2, an important regulator of microglial activity linked to Alzheimer's disease (AD), with other known inflammatory proteins in cerebrospinal fluid (CSF). We studied 303 participants, including 89 controls, 135 mild cognitive impairment, and 79 AD dementia patients. Using established CSF biomarkers, subjects were classified according to the National Institute on Aging-Alzheimer's Association research framework, which groups markers into those of amyloid-β deposition (A), tau pathology (T), and neurodegeneration (N). TNFR1, TNFR2, TGF-β1, TGFβ2, IL-9, TNF-α, ICAM1, and VCAM1 showed significant concentration differences between the ATN groups, with higher concentrations in more advanced disease categories. sTREM2 was positively associated with the pro-inflammatory proteins TNF-α, TNFR1, TNFR2, ICAM1, VCAM1, and IP-10 and negatively with IL-21; also, positive associations with the anti-inflammatory proteins TGFβ1, IL-10, and IL-9 were found. Pathway enrichment analysis highlighted the involvement of sTREM2 in key functional clusters including immunoglobulin and cytokine production and cellular response to lipopolysaccharides, cytokines, and steroid hormones. Our work provides further evidence in support of TREM2 as amarker of neuroinflammatory response in AD.
Collapse
|
119
|
Shmookler Reis RJ, Atluri R, Balasubramaniam M, Johnson J, Ganne A, Ayyadevara S. "Protein aggregates" contain RNA and DNA, entrapped by misfolded proteins but largely rescued by slowing translational elongation. Aging Cell 2021; 20:e13326. [PMID: 33788386 PMCID: PMC8135009 DOI: 10.1111/acel.13326] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 01/12/2021] [Accepted: 02/01/2021] [Indexed: 01/03/2023] Open
Abstract
All neurodegenerative diseases feature aggregates, which usually contain disease-specific diagnostic proteins; non-protein constituents, however, have rarely been explored. Aggregates from SY5Y-APPSw neuroblastoma, a cell model of familial Alzheimer's disease, were crosslinked and sequences of linked peptides identified. We constructed a normalized "contactome" comprising 11 subnetworks, centered on 24 high-connectivity hubs. Remarkably, all 24 are nucleic acid-binding proteins. This led us to isolate and sequence RNA and DNA from Alzheimer's and control aggregates. RNA fragments were mapped to the human genome by RNA-seq and DNA by ChIP-seq. Nearly all aggregate RNA sequences mapped to specific genes, whereas DNA fragments were predominantly intergenic. These nucleic acid mappings are all significantly nonrandom, making an artifactual origin extremely unlikely. RNA (mostly cytoplasmic) exceeded DNA (chiefly nuclear) by twofold to fivefold. RNA fragments recovered from AD tissue were ~1.5-to 2.5-fold more abundant than those recovered from control tissue, similar to the increase in protein. Aggregate abundances of specific RNA sequences were strikingly differential between cultured SY5Y-APPSw glioblastoma cells expressing APOE3 vs. APOE4, consistent with APOE4 competition for E-box/CLEAR motifs. We identified many G-quadruplex and viral sequences within RNA and DNA of aggregates, suggesting that sequestration of viral genomes may have driven the evolution of disordered nucleic acid-binding proteins. After RNA-interference knockdown of the translational-procession factor EEF2 to suppress translation in SY5Y-APPSw cells, the RNA content of aggregates declined by >90%, while reducing protein content by only 30% and altering DNA content by ≤10%. This implies that cotranslational misfolding of nascent proteins may ensnare polysomes into aggregates, accounting for most of their RNA content.
Collapse
|
120
|
Peterson D, Li T, Calvo AM, Yin Y. Categorization of Orthologous Gene Clusters in 92 Ascomycota Genomes Reveals Functions Important for Phytopathogenicity. J Fungi (Basel) 2021; 7:337. [PMID: 33925458 PMCID: PMC8146833 DOI: 10.3390/jof7050337] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 04/23/2021] [Accepted: 04/24/2021] [Indexed: 01/01/2023] Open
Abstract
Phytopathogenic Ascomycota are responsible for substantial economic losses each year, destroying valuable crops. The present study aims to provide new insights into phytopathogenicity in Ascomycota from a comparative genomic perspective. This has been achieved by categorizing orthologous gene groups (orthogroups) from 68 phytopathogenic and 24 non-phytopathogenic Ascomycota genomes into three classes: Core, (pathogen or non-pathogen) group-specific, and genome-specific accessory orthogroups. We found that (i) ~20% orthogroups are group-specific and accessory in the 92 Ascomycota genomes, (ii) phytopathogenicity is not phylogenetically determined, (iii) group-specific orthogroups have more enriched functional terms than accessory orthogroups and this trend is particularly evident in phytopathogenic fungi, (iv) secreted proteins with signal peptides and horizontal gene transfers (HGTs) are the two functional terms that show the highest occurrence and significance in group-specific orthogroups, (v) a number of other functional terms are also identified to have higher significance and occurrence in group-specific orthogroups. Overall, our comparative genomics analysis determined positive enrichment existing between orthogroup classes and revealed a prediction of what genomic characteristics make an Ascomycete phytopathogenic. We conclude that genes shared by multiple phytopathogenic genomes are more important for phytopathogenicity than those that are unique in each genome.
Collapse
|
121
|
Rabbi MF, Akter SA, Hasan MJ, Amin A. In Silico Characterization of a Hypothetical Protein from Shigella dysenteriae ATCC 12039 Reveals a Pathogenesis-Related Protein of the Type-VI Secretion System. Bioinform Biol Insights 2021; 15:11779322211011140. [PMID: 33994781 PMCID: PMC8076777 DOI: 10.1177/11779322211011140] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 03/30/2021] [Indexed: 11/17/2022] Open
Abstract
Shigellosis caused by Shigella dysenteriae is a major public health concern worldwide, particularly in developing countries. The bacterial genome is known, but there are many hypothetical proteins whose functions are yet to be discovered. A hypothetical protein (accession no. WP_128879999.1, 161 residues) of S. dysenteriae ATCC 12039 strain was selected in this study for comprehensive structural and functional analysis. Subcellular localization and different physicochemical properties of this hypothetical protein were estimated indicating it as a stable, soluble, and extracellular protein. Functional annotation tools, such as NCBI-CD Search, Pfam, and InterProScan, predicted our target protein to be an amidase effector protein 4 (Tae4) of type-VI secretion system (T6SS). Multiple sequence alignment of the homologous sequences coincided with previous findings. Random coil was found to be predominant in secondary structure. Three-dimensional (3D) structure of the protein was obtained using homology modeling method by SWISS-MODEL server using a template protein (PDB ID: 4J30) of 80.12% sequence identity. The 3D structure became more stable after YASARA energy minimization and was validated by several quality assessment tools like PROCHECK, QMEAN, Verify3D, and ERRAT. Superimposition of the target with the template protein by UCSF Chimera generated RMSD value of 0.115 Å, suggesting a reliable 3D structure. The active site of the modeled structure was predicted and visualized by CASTp server and PyMOL. Interestingly, similar binding affinity and key interacting residues were found for the target protein and a Salmonella enterica Tae4 protein with the ligand L-Ala D-Glu-mDAP by molecular docking analysis. Protein-protein docking was also performed between the target protein and hemolysin coregulated protein 1 of T6SS. Finally, the protein was found to be a unique protein of S. dysenteriae nonhomologous to human by comparative genomics approach indicating a potential therapeutic target. Most pathogens harboring T6SS in their system pose a significant threat to the human health. Many T6SSs and their effectors are associated with interbacterial competition, pathogenesis, and virulency; however, relationships between these effectors and pathogenicity of S. dysenteriae are yet to be determined. The study findings provide a lucrative platform for future antibacterial treatment.
Collapse
|
122
|
Functional annotation of lncRNA in high-throughput screening. Essays Biochem 2021; 65:761-773. [PMID: 33835127 PMCID: PMC8564734 DOI: 10.1042/ebc20200061] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/25/2021] [Accepted: 03/15/2021] [Indexed: 12/17/2022]
Abstract
Recent efforts on the characterization of long non-coding RNAs (lncRNAs) revealed their functional roles in modulating diverse cellular processes. These include pluripotency maintenance, lineage commitment, carcinogenesis, and pathogenesis of various diseases. By interacting with DNA, RNA and protein, lncRNAs mediate multifaceted mechanisms to regulate transcription, RNA processing, RNA interference and translation. Of more than 173000 discovered lncRNAs, the majority remain functionally unknown. The cell type-specific expression and localization of the lncRNA also suggest potential distinct functions of lncRNAs across different cell types. This highlights the niche of identifying functional lncRNAs in different biological processes and diseases through high-throughput (HTP) screening. This review summarizes the current work performed and perspectives on HTP screening of functional lncRNAs where different technologies, platforms, cellular responses and the downstream analyses are discussed. We hope to provide a better picture in applying different technologies to facilitate functional annotation of lncRNA efficiently.
Collapse
|
123
|
Song X, Yang T, Zhang X, Yuan Y, Yan X, Wei Y, Zhang J, Zhou C. Comparison of the Microsatellite Distribution Patterns in the Genomes of Euarchontoglires at the Taxonomic Level. Front Genet 2021; 12:622724. [PMID: 33719337 PMCID: PMC7953163 DOI: 10.3389/fgene.2021.622724] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 02/05/2021] [Indexed: 02/05/2023] Open
Abstract
Microsatellite or simple sequence repeat (SSR) instability within genes can induce genetic variation. The SSR signatures remain largely unknown in different clades within Euarchontoglires, one of the most successful mammalian radiations. Here, we conducted a genome-wide characterization of microsatellite distribution patterns at different taxonomic levels in 153 Euarchontoglires genomes. Our results showed that the abundance and density of the SSRs were significantly positively correlated with primate genome size, but no significant relationship with the genome size of rodents was found. Furthermore, a higher level of complexity for perfect SSR (P-SSR) attributes was observed in rodents than in primates. The most frequent type of P-SSR was the mononucleotide P-SSR in the genomes of primates, tree shrews, and colugos, while mononucleotide or dinucleotide motif types were dominant in the genomes of rodents and lagomorphs. Furthermore, (A)n was the most abundant motif in primate genomes, but (A)n, (AC)n, or (AG)n was the most abundant motif in rodent genomes which even varied within the same genus. The GC content and the repeat copy numbers of P-SSRs varied in different species when compared at different taxonomic levels, reflecting underlying differences in SSR mutation processes. Notably, the CDSs containing P-SSRs were categorized by functions and pathways using Gene Ontology and Kyoto Encyclopedia of Genes and Genomes annotations, highlighting their roles in transcription regulation. Generally, this work will aid future studies of the functional roles of the taxonomic features of microsatellites during the evolution of mammals in Euarchontoglires.
Collapse
|
124
|
Zhang YM, Liu XZ, Zhou XJ, Liu LJ, Shi SF, Hou P, Lv JC, Zhang H. A Functional Variant rs3093023 in CCR6 Is Associated With IgA Nephropathy by Regulating Th17 Cells in a North Han Chinese Population. Front Immunol 2021; 12:600598. [PMID: 33717080 PMCID: PMC7946973 DOI: 10.3389/fimmu.2021.600598] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Accepted: 02/01/2021] [Indexed: 12/12/2022] Open
Abstract
C-C chemokine receptor 6 (CCR6) is a susceptibility gene of various immune-related diseases, which was suggested to be shared with immunoglobulin A nephropathy (IgAN). In this study, we aimed to identify the functional variants. First, we analyzed the associations of CCR6 common and rare variants detected by multi-platform chips with IgAN susceptibility using imputation and identified 68 significantly associated common variants located in the regulatory region. Among them, rs3093023 showed both statistical significance (rs3093023-A, odds ratio [OR] = 1.15, P = 2.00 × 10−2) and the expression quantitative trait loci (eQTL) effect (P = 1.45 × 10−3). It was independently replicated (rs3093023-A, OR = 1.18, P = 5.56 × 10−3) and the association was reinforced in the meta-analysis (rs3093023-A, OR = 1.17, P = 6.14 × 10−7). Although rs3093023 was in a strong linkage disequilibrium with the reported CCR6 functional variant dinucleotide polymorphism, CCR6DNP, the alleles of rs3093023 (G>A) rather than of CCR6DNP were shown differential nuclear protein binding effect by electrophoretic mobility shift assay. The RegulomeDB and JASPAR databases predicted Pou2f1 as the potential transcription factor, which was negatively associated with CCR6 mRNA (r = −0.60, P = 3.94 × 10−9). At the mRNA level, the eQTL effect of CCR6 was validated (P = 4.39 × 10−2), and CCR6 was positively associated with the expression of CCR4 and IL-17A rather than that of CXCR3 and IFNG. At the protein level, a higher CCR6+ cell ratio was observed in a risk allele dose-dependent manner in lymphocytes (P = 3.57 × 10−2), CD3+ T cells (P = 4.54 × 10−2), and CD4+ T cells (P = 1.32 × 10−2), but not in CD8+ T cells. Clinical-pathological analysis showed that rs3093023 risk allele was significantly associated with diastolic blood pressure, serum creatinine, and high ratio of tubular atrophy/interstitial fibrosis. Overall, the rs3093023 was prioritized as the function variant in CCR6, which may contribute to IgAN susceptibility by regulating Th17 cells.
Collapse
|
125
|
Frontiers Production Office. Erratum: Full Transcriptome Analysis of Callus Suspension Culture System of Bletilla striata. Front Genet 2021; 11:636385. [PMID: 33643373 PMCID: PMC7905436 DOI: 10.3389/fgene.2020.636385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 12/01/2020] [Indexed: 11/13/2022] Open
Abstract
[This corrects the article DOI: 10.3389/fgene.2020.00995.].
Collapse
|