1
|
A revamped rat reference genome improves the discovery of genetic diversity in laboratory rats. CELL GENOMICS 2024; 4:100527. [PMID: 38537634 PMCID: PMC11019364 DOI: 10.1016/j.xgen.2024.100527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 12/26/2023] [Accepted: 02/29/2024] [Indexed: 04/09/2024]
Abstract
The seventh iteration of the reference genome assembly for Rattus norvegicus-mRatBN7.2-corrects numerous misplaced segments and reduces base-level errors by approximately 9-fold and increases contiguity by 290-fold compared with its predecessor. Gene annotations are now more complete, improving the mapping precision of genomic, transcriptomic, and proteomics datasets. We jointly analyzed 163 short-read whole-genome sequencing datasets representing 120 laboratory rat strains and substrains using mRatBN7.2. We defined ∼20.0 million sequence variations, of which 18,700 are predicted to potentially impact the function of 6,677 genes. We also generated a new rat genetic map from 1,893 heterogeneous stock rats and annotated transcription start sites and alternative polyadenylation sites. The mRatBN7.2 assembly, along with the extensive analysis of genomic variations among rat strains, enhances our understanding of the rat genome, providing researchers with an expanded resource for studies involving rats.
Collapse
|
2
|
A revamped rat reference genome improves the discovery of genetic diversity in laboratory rats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.13.536694. [PMID: 37214860 PMCID: PMC10197727 DOI: 10.1101/2023.04.13.536694] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The seventh iteration of the reference genome assembly for Rattus norvegicus-mRatBN7.2-corrects numerous misplaced segments and reduces base-level errors by approximately 9-fold and increases contiguity by 290-fold compared to its predecessor. Gene annotations are now more complete, significantly improving the mapping precision of genomic, transcriptomic, and proteomics data sets. We jointly analyzed 163 short-read whole genome sequencing datasets representing 120 laboratory rat strains and substrains using mRatBN7.2. We defined ~20.0 million sequence variations, of which 18.7 thousand are predicted to potentially impact the function of 6,677 genes. We also generated a new rat genetic map from 1,893 heterogeneous stock rats and annotated transcription start sites and alternative polyadenylation sites. The mRatBN7.2 assembly, along with the extensive analysis of genomic variations among rat strains, enhances our understanding of the rat genome, providing researchers with an expanded resource for studies involving rats.
Collapse
|
3
|
Imputed genomes of historical horses provide insights into modern breeding. iScience 2023; 26:107104. [PMID: 37416458 PMCID: PMC10319840 DOI: 10.1016/j.isci.2023.107104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 04/25/2023] [Accepted: 06/08/2023] [Indexed: 07/08/2023] Open
Abstract
Historical genomes can provide important insights into recent genomic changes in horses, especially the development of modern breeds. In this study, we characterized 8.7 million genomic variants from a panel of 430 horses from 73 breeds, including newly sequenced genomes from 20 Clydesdales and 10 Shire horses. We used this modern genomic variation to impute the genomes of four historically important horses, consisting of publicly available genomes from 2 Przewalski's horses, 1 Thoroughbred, and a newly sequenced Clydesdale. Using these historical genomes, we identified modern horses with higher genetic similarity to those in the past and unveiled increased inbreeding in recent times. We genotyped variants associated with appearance and behavior to uncover previously unknown characteristics of these important historical horses. Overall, we provide insights into the history of Thoroughbred and Clydesdale breeds and highlight genomic changes in the endangered Przewalski's horse following a century of captive breeding.
Collapse
|
4
|
Functional annotation of the animal genomes: An integrated annotation resource for the horse. PLoS Genet 2023; 19:e1010468. [PMID: 36862752 PMCID: PMC10013926 DOI: 10.1371/journal.pgen.1010468] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 03/14/2023] [Accepted: 01/28/2023] [Indexed: 03/03/2023] Open
Abstract
The genomic sequence of the horse has been available since 2009, providing critical resources for discovering important genomic variants regarding both animal health and population structures. However, to fully understand the functional implications of these variants, detailed annotation of the horse genome is required. Due to the limited availability of functional data for the equine genome, as well as the technical limitations of short-read RNA-seq, existing annotation of the equine genome contains limited information about important aspects of gene regulation, such as alternate isoforms and regulatory elements, which are either not transcribed or transcribed at a very low level. To solve above problems, the Functional Annotation of the Animal Genomes (FAANG) project proposed a systemic approach to tissue collection, phenotyping, and data generation, adopting the blueprint laid out by the Encyclopedia of DNA Elements (ENCODE) project. Here we detail the first comprehensive overview of gene expression and regulation in the horse, presenting 39,625 novel transcripts, 84,613 candidate cis-regulatory elements (CRE) and their target genes, 332,115 open chromatin regions genome wide across a diverse set of tissues. We showed substantial concordance between chromatin accessibility, chromatin states in different genic features and gene expression. This comprehensive and expanded set of genomics resources will provide the equine research community ample opportunities for studies of complex traits in the horse.
Collapse
|
5
|
Evidence for origin of Lavender Foal Syndrome among Egyptian Arabian horses in Egypt. Equine Vet J 2022; 55:487-493. [PMID: 35665534 DOI: 10.1111/evj.13604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 05/14/2022] [Indexed: 11/30/2022]
Abstract
BACKGROUND Lavender Foal Syndrome (LFS) is a fatal hereditary condition that is inherited in an autosomal recessive pattern. This detrimental mutation is more common in Arabian foals of Egyptian origin than foals from other bloodlines. Heterozygous horses are carriers of the LFS trait and appear normal, while recessive homozygous foals died shortly after birth due to serious complications. In Egypt, in 2014, an Egyptian foal died after manifestations of neurological signs and abnormal coat colour as LFS signs. Therefore, it is important to identify LFS carriers in the population of Arabian horses in Egypt and to encourage improvement of the Arabian horse industry in Egypt by constructing a breeding system based on genetic background in order to avoid mating between carriers and reduce financial losses from deaths of affected foals. OBJECTIVES To establish a PCR-based test for detecting the MYO5A gene mutation causing LFS in the registered Arabian horse population in Egypt prior to breeding. STUDY DESIGN Cross sectional survey (n = 170) plus targeted sampling (n = 30). METHODS A total of 200 samples were collected from an Arabian farm in Egypt and some of them were traced for LFS based on the farm records. The LFS genotypes were identified using the PCR-RFLP technique, fragment analysis followed by sequence analysis. RESULTS The overall mutated allele and genotype frequencies (N/L) were 0.08 and 16%, respectively. CONCLUSION The observed frequency of heterozygotes suggests foals affected with LFS will be produced among Arabian horses in Egypt. Therefore, screening of the entire population for this mutation should be undertaken in the breeding program. This article is protected by copyright. All rights reserved.
Collapse
|
6
|
Decoding the Equine Genome: Lessons from ENCODE. Genes (Basel) 2021; 12:genes12111707. [PMID: 34828313 PMCID: PMC8625040 DOI: 10.3390/genes12111707] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/24/2021] [Accepted: 10/26/2021] [Indexed: 12/23/2022] Open
Abstract
The horse reference genome assemblies, EquCab2.0 and EquCab3.0, have enabled great advancements in the equine genomics field, from tools to novel discoveries. However, significant gaps of knowledge regarding genome function remain, hindering the study of complex traits in horses. In an effort to address these gaps and with inspiration from the Encyclopedia of DNA Elements (ENCODE) project, the equine Functional Annotation of Animal Genome (FAANG) initiative was proposed to bridge the gap between genome and gene expression, providing further insights into functional regulation within the horse genome. Three years after launching the initiative, the equine FAANG group has generated data from more than 400 experiments using over 50 tissues, targeting a variety of regulatory features of the equine genome. In this review, we examine how valuable lessons learned from the ENCODE project informed our decisions in the equine FAANG project. We report the current state of the equine FAANG project and discuss how FAANG can serve as a template for future expansion of functional annotation in the equine genome and be used as a reference for studies of complex traits in horse. A well-annotated reference functional atlas will also help advance equine genetics in the pan-genome and precision medicine era.
Collapse
|
7
|
Using triallelic SNPs for determining parentage in North American yak ( Bos grunniens) and estimating cattle ( B. taurus) introgression. F1000Res 2020; 9:1096. [PMID: 33163159 PMCID: PMC7605211 DOI: 10.12688/f1000research.25803.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/16/2020] [Indexed: 11/20/2022] Open
Abstract
Background: Genetic testing for pedigree accuracy is critical for managing genetic diversity in North American (NA) yak ( Bos grunniens), a population expanded mostly from imported zoological park specimens. DNA testing also enhances species conservation by identifying recent B. taurus F1 hybrid ancestors (within three generations). Biallelic single nucleotide polymorphisms (SNPs) can accomplish either task, but increases the marker count and costs necessary to achieve both. Our aim was to identify novel, multifunctional, triallelic yak SNPs (tySNPs), with each having two alleles for yak parentage testing, and a third allele for identifying recent cattle introgression. Methods: Genome sequences were aligned to the cattle UMD3.1 assembly and SNPs were screened for 1) heterozygosity in a NA and a Chinese yak, 2) a third allele at high frequency in cattle, and 3) flanking sequences conserved in both species. Subsequently, tySNPs were filtered for unique alignment to the haplotype-resolved F1 yak assembly. Allele frequencies were estimated in a subset of 87 tySNPs by genotyping 170 NA yak. Results: We identified 610 autosomal tySNPs, distributed in 441 clusters with 5 Mb average genome spacing. The average NA yak minor allele frequency was high (0.296), while average introgressed cattle alleles were low (0.004). In simulations with tySNPs, 28 were sufficient for globally-unique animal identification (P I=5.81x10 -12), 87 were able to exclude 19 random bulls from parentage at the 99% level without using the dam's genotype (P E=5.3x10 -4), and 87 were able to detect F1 hybridization events after three generations of yak backcrosses (1/16th B. taurus germplasm). Conclusions: Identifying animals, determining parentage and detecting recent hybridization events was efficient with as few as 87 tySNPs. A similar triallelic approach could be used with other bottlenecked Bos species that hybridize with cattle, such as NA plains bison ( B. bison).
Collapse
|
8
|
Transcriptomic analysis reveals the key regulators and molecular mechanisms underlying myometrial activation during equine placentitis†. Biol Reprod 2020; 102:1306-1325. [PMID: 32065222 DOI: 10.1093/biolre/ioaa020] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Revised: 01/30/2020] [Accepted: 02/14/2020] [Indexed: 01/06/2023] Open
Abstract
The key event in placentitis-induced preterm labor is myometrial activation with the subsequent initiation of labor. However, the molecular mechanisms underlying myometrial activation are not fully understood in the mares. Therefore, the equine myometrial transcriptome was characterized during placentitis (290.0 ± 1.52 days of GA, n = 5) and the prepartum period (330 days of GA, n = 3) in comparison to normal pregnant mares (289.8 ± 2.18 days of GA, n = 4). Transcriptome analysis identified 596 and 290 DEGs in the myometrium during placentitis and the prepartum period, respectively, with 138 DEGs in common. The placentitis DEGs included eight genes (MMP1, MMP8, S100A9, S100A8, PI3, APOBEC3Z1B, RETN, and CXCL2) that are exclusively expressed in the inflamed myometrium. Pathway analysis elucidated that inflammatory signaling, Toll-like receptor signaling, and apoptosis pathways dominate myometrial activation during placentitis. The prepartum myometrium was associated with overexpression of inflammatory signaling, oxidative stress, and 5-hydroxytryptamine degradation. Gene ontology enrichment analysis identified several chemoattractant factors in the myometrium during placentitis and prepartum period, including CCL2, CXCL1, CXCL3, and CXCL6 in common. Upstream regulator analysis revealed 19 potential upstream regulators in placentitis dataset including transcription regulators (E2F1, FOXM1, HIF1A, JUNB, NFKB1A, and STAT1), transmembrane receptors (FAS, ICAM1, SELP, TLR2, and TYROBP), growth factors (HGF and TGFB3), enzymes (PTGS2 and PRKCP), and others (S100A8, S100A9, CD44, and C5AR1). Additionally, three upstream regulators (STAT3, EGR1, and F2R) were identified in the prepartum dataset. These findings revealed the key regulators and pathways underlying myometrial activation during placentitis, which aid in understanding the disease and facilitate the development of efficacious therapies.
Collapse
|
9
|
Abstract
Abstract
The molecular determination of complex traits related to animal production, health, and performance remains elusive. In response, an international effort (Functional Annotation of ANimal Genomes, or FAANG) was initiated with the goal of identifying functional elements of the genome across domestic animal species. Toward this goal, the equine FAANG community has developed a biobank of over 80 tissues, four fluid types, and nine microbiome samples collected from two adult Thoroughbred mares as a resource for functional annotation of the horse genome. Full clinical phenotyping and careful histologic evaluation was performed on each tissue to allow for correlation of any observed pathologies and cell composition with sequencing results. Whole-genome sequencing (WGS) of each horse is complete as is RNA-sequencing (mRNA and smRNA) and analysis from eight prioritized tissues (liver, lung, hoof lamina, heart, longissimus dorsi muscle, ovary, parietal cortex, and adipose) as well as from 30 additional tissues sequenced with support from members of the community (known as the “Adopt-a-tissue” initiative). All sequencing data are now publicly available. Chromatin shearing and antibody concentrations have been optimized for ChIP-seq to characterize the major histone modification marks (H3K4me1, H3K4me3, H3K27me3, H3K27ac) in the eight prioritized tissues and spleen. Data collection and analyses are nearly complete. Assays to identify genomic insulators denoted by CCCTC-binding sites are also underway as is optimization of ATAC-seq, to characterize open chromatin in select tissues. The biobank’s use has extended to include chromatin run-on and sequencing (ChRO-seq) assays allowing for a novel comparison with ChIP-seq peaks. Further extensions of the biobank include keratinocyte cell culture, centromere mapping, karyotype analyses, methylation profiles, and microbiome characterization. These data provide a valuable baseline of genome function in the healthy, adult Thoroughbred mare and will allow for an improved understanding of and continuing research on tissue-, developmental-, and disease-associated genome regulation.
Collapse
|
10
|
Next generation sequencing analysis of soy glyceollins and 17-β estradiol: Effects on transcript abundance in the female mouse brain. Mol Cell Endocrinol 2018; 471:15-21. [PMID: 28483703 DOI: 10.1016/j.mce.2017.05.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Revised: 04/07/2017] [Accepted: 05/04/2017] [Indexed: 02/06/2023]
Abstract
Glyceollins (Glys) are produced by soy plants in response to stress and are known for their anti-estrogenic activity both in vivo and in vitro in cancer cell lines as well as peripheral tissues. Glys can also exhibit non-estrogen receptor (ER) mediated effects. The effects of Glys on gene expression in the brain are still unclear. For this study, 17-β estradiol (E2) or placebo slow-release pellets were implanted into ovariectomized CFW mice followed by 11 days of exposure to either Glys or vehicle i.p. injections. We then examined the female mouse brain transcriptome using paired-end RNA sequencing (RNA-Seq) on the Illumina GAIIx platform. The goal of this study was to compare and contrast the results obtained from RNA-Seq with the results from our previous whole brain microarray experiment, which indicated that Glys potentially act through both ER-mediated and non-ER-mediated mechanisms, exhibiting a gene expression profile distinct from E2-treated groups. Our results suggest that the transcripts regulated by both E2 and Glys alone or in combination annotated to similar pathway maps and networks in both microarray and RNA-Seq experiments. Additionally, unlike our microarray data analysis, RNA-Seq enabled the detection of treatment effects on low expression transcripts of interest (e.g., prolactin and growth hormone). Collectively, our results suggest that depending on the gene, Glys can regulate expression independently of E2 action, similarly to E2, or oppose E2's effects in the female mouse brain.
Collapse
|
11
|
Exclusion of adrenoceptor alpha 2 variants in a horse insensitive to medetomidine. Anim Genet 2018; 49:141. [PMID: 29359424 DOI: 10.1111/age.12636] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
12
|
Developing a 670k genotyping array to tag ~2M SNPs across 24 horse breeds. BMC Genomics 2017; 18:565. [PMID: 28750625 PMCID: PMC5530493 DOI: 10.1186/s12864-017-3943-8] [Citation(s) in RCA: 75] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 07/13/2017] [Indexed: 12/30/2022] Open
Abstract
Background To date, genome-scale analyses in the domestic horse have been limited by suboptimal single nucleotide polymorphism (SNP) density and uneven genomic coverage of the current SNP genotyping arrays. The recent availability of whole genome sequences has created the opportunity to develop a next generation, high-density equine SNP array. Results Using whole genome sequence from 153 individuals representing 24 distinct breeds collated by the equine genomics community, we cataloged over 23 million de novo discovered genetic variants. Leveraging genotype data from individuals with both whole genome sequence, and genotypes from lower-density, legacy SNP arrays, a subset of ~5 million high-quality, high-density array candidate SNPs were selected based on breed representation and uniform spacing across the genome. Considering probe design recommendations from a commercial vendor (Affymetrix, now Thermo Fisher Scientific) a set of ~2 million SNPs were selected for a next-generation high-density SNP chip (MNEc2M). Genotype data were generated using the MNEc2M array from a cohort of 332 horses from 20 breeds and a lower-density array, consisting of ~670 thousand SNPs (MNEc670k), was designed for genotype imputation. Conclusions Here, we document the steps taken to design both the MNEc2M and MNEc670k arrays, report genomic and technical properties of these genotyping platforms, and demonstrate the imputation capabilities of these tools for the domestic horse. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3943-8) contains supplementary material, which is available to authorized users.
Collapse
|
13
|
Allelic Variation in CXCL16 Determines CD3+ T Lymphocyte Susceptibility to Equine Arteritis Virus Infection and Establishment of Long-Term Carrier State in the Stallion. PLoS Genet 2016; 12:e1006467. [PMID: 27930647 PMCID: PMC5145142 DOI: 10.1371/journal.pgen.1006467] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Accepted: 11/05/2016] [Indexed: 12/25/2022] Open
Abstract
Equine arteritis virus (EAV) is the causative agent of equine viral arteritis (EVA), a respiratory, systemic, and reproductive disease of horses and other equid species. Following natural infection, 10-70% of the infected stallions can become persistently infected and continue to shed EAV in their semen for periods ranging from several months to life. Recently, we reported that some stallions possess a subpopulation(s) of CD3+ T lymphocytes that are susceptible to in vitro EAV infection and that this phenotypic trait is associated with long-term carrier status following exposure to the virus. In contrast, stallions not possessing the CD3+ T lymphocyte susceptible phenotype are at less risk of becoming long-term virus carriers. A genome wide association study (GWAS) using the Illumina Equine SNP50 chip revealed that the ability of EAV to infect CD3+ T lymphocytes and establish long-term carrier status in stallions correlated with a region within equine chromosome 11. Here we identified the gene and mutations responsible for these phenotypes. Specifically, the work implicated three allelic variants of the equine orthologue of CXCL16 (EqCXCL16) that differ by four non-synonymous nucleotide substitutions (XM_00154756; c.715 A → T, c.801 G → C, c.804 T → A/G, c.810 G → A) within exon 1. This resulted in four amino acid changes with EqCXCL16S (XP_001504806.1) having Phe, His, Ile and Lys as compared to EqCXL16R having Tyr, Asp, Phe, and Glu at 40, 49, 50, and 52, respectively. Two alleles (EqCXCL16Sa, EqCXCL16Sb) encoded identical protein products that correlated strongly with long-term EAV persistence in stallions (P<0.000001) and are required for in vitro CD3+ T lymphocyte susceptibility to EAV infection. The third (EqCXCL16R) was associated with in vitro CD3+ T lymphocyte resistance to EAV infection and a significantly lower probability for establishment of the long-term carrier state (viral persistence) in the male reproductive tract. EqCXCL16Sa and EqCXCL16Sb exert a dominant mode of inheritance. Most importantly, the protein isoform EqCXCL16S but not EqCXCL16R can function as an EAV cellular receptor. Although both molecules have equal chemoattractant potential, EqCXCL16S has significantly higher scavenger receptor and adhesion properties compared to EqCXCL16R.
Collapse
|
14
|
Health status, perceptions and needs of Hispanics in rural Shelbyville, Kentucky. J Immigr Minor Health 2016; 17:148-55. [PMID: 24014190 DOI: 10.1007/s10903-013-9907-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
This cross-sectional study was completed to characterize the health status, perceptions and needs of Hispanics in Shelbyville, KY, USA. Community Health Workers interviewed 668 Hispanic residents in Shelbyville, KY, USA. Data were collected from 2009 to 2010 and analyzed from 2011 until present. Hispanic immigrants from Mexico and other Central American countries completed the survey. The most common self-reported diseases were allergies, asthma, diabetes, lung disease and cardiovascular disease. High blood pressure and diabetes were the two most common diagnoses among insured, older females. Health education, disease prevention and nutrition were the top health concerns among participants. Deficits in health care infrastructure for this largely transient community may compromise their ability to meet health care needs and concerns. Similar issues may be faced by other disadvantaged Hispanic communities in the continental US and likely to be influenced by anticipated provisions of the Patient Protection and Affordable Care Act.
Collapse
|
15
|
Observations on macrolide resistance and susceptibility testing performance in field isolates collected from clinical bovine respiratory disease cases. Vet Microbiol 2016; 192:186-193. [PMID: 27527782 DOI: 10.1016/j.vetmic.2016.07.021] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Revised: 07/12/2016] [Accepted: 07/25/2016] [Indexed: 12/21/2022]
Abstract
The objectives of this study were; first, to describe gamithromycin susceptibility of Mannheimia haemolytica, Pasteurella multocida, and Histophilus somni isolated from cattle diagnosed with bovine respiratory disease (BRD) and previously treated with either gamithromycin for control of BRD (mass medication=MM) or sham-saline injected (control=CON); second, to describe the macrolide resistance genes present in genetically typed M. haemolytica isolates; third, use whole-genome sequencing (WGS) to correlate the phenotypic resistance and genetic determinants for resistance among M. haemolytica isolates. M. haemolytica (n=276), P. multocida (n=253), and H. somni (n=78) were isolated from feedlot cattle diagnosed with BRD. Gamithromycin susceptibility was determined by broth microdilution. Whole-genome sequencing was utilized to determine the presence/absence of macrolide resistance genes and to genetically type M. haemolytica. Generalized linear mixed models were built for analysis. There was not a significant difference between MM and CON groups in regards to the likelihood of culturing a resistant isolate of M. haemolytica or P. multocida. The likelihood of culturing a resistant isolate of M. haemolytica differed significantly by state of origin in this study. A single M. haemolytica genetic subtype was associated with an over whelming majority of the observed resistance. H. somni isolation counts were low and statistical models would not converge. Phenotypic resistance was predicted with high sensitivity and specificity by WGS. Additional studies to elucidate the relationships between phenotypic expression of resistance/genetic determinants for resistance and clinical response to antimicrobials are necessary to inform judicious use of antimicrobials in the context of relieving animal disease and suffering.
Collapse
|
16
|
Characterization of an APC Promoter 1B deletion in a Patient Diagnosed with Familial Adenomatous Polyposis via Whole Genome Shotgun Sequencing. F1000Res 2015. [PMID: 26213617 PMCID: PMC4505784 DOI: 10.12688/f1000research.6636.1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Recently, deletions have been identified and published as causal for Familial Adenomatous Polyposis in the 1B promoter region of the APC gene. Those deletions were measured using multiplex ligation-dependent probe amplification. Here, we present and characterize an ~11kb deletion identified by whole genome shotgun sequencing. The deletion occurred in a patient diagnosed with Familial Adenomatous Polyposis, and was located on chr5, between bases 112,034,824 and 112,045,845, fully encompassing the 1B promoter region of the APC gene. Results are presented here that include the sequence evidence supporting the presence of the deletion as well as base level characterization of the deletion site. These results demonstrate the capacity of whole genome sequencing for the detection of large structural variants in single individuals.
Collapse
|
17
|
Mapping whole genome shotgun sequence and variant calling in mammalian species without their reference genomes. F1000Res 2013; 2:244. [PMID: 25075278 PMCID: PMC4103496 DOI: 10.12688/f1000research.2-244.v2] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/04/2014] [Indexed: 01/20/2023] Open
Abstract
Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these species have provided unique insights into mammalian gene function. However, the number of species with reference genomes is small compared to those needed for studying molecular evolutionary relationships in the tree of life. For example, among the even-toed ungulates there are approximately 300 species whose phylogenetic relationships have been calculated in the 10k trees project. Only six of these have reference genomes: cattle, swine, sheep, goat, water buffalo, and bison. Although reference sequences will eventually be developed for additional hoof stock, the resources in terms of time, money, infrastructure and expertise required to develop a quality reference genome may be unattainable for most species for at least another decade. In this work we mapped 35 Gb of next generation sequence data of a Katahdin sheep to its own species' reference genome ( Ovis aries Oar3.1) and to that of a species that diverged 15 to 30 million years ago ( Bos taurus UMD3.1). In total, 56% of reads covered 76% of UMD3.1 to an average depth of 6.8 reads per site, 83 million variants were identified, of which 78 million were homozygous and likely represent interspecies nucleotide differences. Excluding repeat regions and sex chromosomes, nearly 3.7 million heterozygous sites were identified in this animal vs. bovine UMD3.1, representing polymorphisms occurring in sheep. Of these, 41% could be readily mapped to orthologous positions in ovine Oar3.1 with 80% corroborated as heterozygous. These variant sites, identified via interspecies mapping could be used for comparative genomics, disease association studies, and ultimately to understand mammalian gene function.
Collapse
|
18
|
Mapping whole genome shotgun sequence and variant calling in mammalian species without their reference genomes. F1000Res 2013; 2:244. [PMID: 25075278 PMCID: PMC4103496 DOI: 10.12688/f1000research.2-244.v1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/05/2013] [Indexed: 05/28/2024] Open
Abstract
Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these species have provided unique insights into mammalian gene function. However, the number of species with reference genomes is small compared to those needed for studying molecular evolutionary relationships in the tree of life. For example, among the even-toed ungulates there are approximately 300 species whose phylogenetic relationships have been calculated in the 10k trees project. Only six of these have reference genomes: cattle, swine, sheep, goat, water buffalo, and bison. Although reference sequences will eventually be developed for additional hoof stock, the resources in terms of time, money, infrastructure and expertise required to develop a quality reference genome may be unattainable for most species for at least another decade. In this work we mapped 35 Gb of next generation sequence data of a Katahdin sheep to its own species' reference genome ( Ovis aries Oar3.1) and to that of a species that diverged 15 to 30 million years ago ( Bos taurus UMD3.1). In total, 56% of reads covered 76% of UMD3.1 to an average depth of 6.8 reads per site, 83 million variants were identified, of which 78 million were homozygous and likely represent interspecies nucleotide differences. Excluding genome repeat regions and sex chromosomes, approximately 3.7 million heterozygous sites were identified in this animal vs. bovine UMD3.1, representing polymorphisms occurring in sheep. Of these, 41% could be readily mapped to orthologous positions in ovine Oar3.1 with 80% corroborated as heterozygous. These variant sites, identified via interspecies mapping could be used for comparative genomics, disease association studies, and ultimately to understand mammalian gene function.
Collapse
|
19
|
Assessment of genetic variation for the LINE-1 retrotransposon from next generation sequence data. BMC Bioinformatics 2010; 11 Suppl 9:S12. [PMID: 21044359 PMCID: PMC2967742 DOI: 10.1186/1471-2105-11-s9-s12] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Background In humans, copies of the Long Interspersed Nuclear Element 1 (LINE-1) retrotransposon comprise 21% of the reference genome, and have been shown to modulate expression and produce novel splice isoforms of transcripts from genes that span or neighbor the LINE-1 insertion site. Results In this work, newly released pilot data from the 1000 Genomes Project is analyzed to detect previously unreported full length insertions of the retrotransposon LINE-1. By direct analysis of the sequence data, we have identified 22 previously unreported LINE-1 insertion sites within the sequence data reported for a mother/father/daughter trio. Conclusions It is demonstrated here that next generation sequencing data, as well as emerging high quality datasets from individual genome projects allow us to assess the amount of heterogeneity with respect to the LINE-1 retrotransposon amongst humans, and provide us with a wealth of testable hypotheses as to the impact that this diversity may have on the health of individuals and populations.
Collapse
|
20
|
Abstract
BACKGROUND Prostate cancer (PCa) incidence and mortality are disproportionately high among African-American (AA) men. Its detection and perhaps its disparities could be improved through the identification of genetic susceptibility biomarkers within essential biological pathways. Interactions among highly variant genes, central to angiogenesis, may modulate susceptibility for prostate cancer, as previous demonstrated. This study evaluates the interplay among three highly variant genes (i.e., IL-10, TGFbetaR-1, VEGF), their receptors and their influence on PCa within a case-control study consisting of an under-served population. METHODS This study evaluated single gene and joint modifying effects on PCa risk in a case-control study comprised of 859 AA men (193 cases and 666 controls) using TaqMan qPCR. Interaction among polymorphic IL-10, TGFbetaR-1 and VEGF was analyzed using conventional logistic regression analysis (LR) models, multi-dimensionality reduction (MDR) and interaction entropy graphs. Symbolic modeling allowed validation of gene-gene interaction findings identified by MDR. RESULTS No significant single gene effects were demonstrated in relation to PCa risk. However, carriers of the VEGF 2482T allele had a threefold increase in the risk of developing aggressive PCa. The presence of VEGF 2482T combined with VEGFR IVS6 + 54 loci were highly significant for the risk of PCa based on MDR and symbolic modeling analyses. These findings were substantiated by 1,000-fold cross validation permutation testing (P = 0.04), respectively. CONCLUSION These findings suggest the inheritance of VEGF and VEGFR IVS6 + 54 sequence variants may jointly modify PCa susceptibility through their influence on angiogenesis. Larger sub-population studies are needed to validate these findings and evaluate whether the VEGF-VEGR axis may serve as predictors of disease prognosis and ultimately clinical response to available treatment strategies.
Collapse
|
21
|
A MicroRNA gene is hosted in an intron of a schizophrenia-susceptibility gene. Schizophr Res 2009; 109:86-9. [PMID: 19264453 PMCID: PMC2664103 DOI: 10.1016/j.schres.2009.01.022] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/10/2008] [Revised: 01/26/2009] [Accepted: 01/27/2009] [Indexed: 10/21/2022]
Abstract
Schizophrenia (SZ) is a neuropsychiatric disorder that affects about 1% of the adult population. Numerous genes have been implicated in SZ susceptibility. MicroRNAs (miRNA) are small RNA molecules that regulate the translation of mRNAs via interactions with their 3' untranslated regions. Identification of known miRNA targets on all human genes indicated that miRNA-346 targets SZ susceptibility genes listed in the SchizophreniaGene database twice as frequently as expected relative to other genes in the genome. The gene encoding this miRNA, miR-346, is located in intron 2 of the glutamate receptor ionotropic delta 1 (GRID1) gene, which has been previously implicated in SZ susceptibility. We used quantitative real-time PCR to determine the expression levels of miR-346 and GRID1 using brain RNA samples from the Stanley Array Collection, Stanley Medical Research Institute. Expression of both miR-346 and GRID1 is lower in SZ patients than that in normal controls (P=0.017 and 0.086, respectively). However, the expression of miR-346 and GRID1 is less correlated in SZ patients than in bipolar patients or in normal controls. This study implicates the importance of a miRNA in SZ.
Collapse
|
22
|
Epigenetic control of mammalian LINE-1 retrotransposon by retinoblastoma proteins. Mutat Res 2009; 665:20-8. [PMID: 19427507 DOI: 10.1016/j.mrfmmm.2009.02.011] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2009] [Revised: 02/20/2009] [Accepted: 02/23/2009] [Indexed: 02/07/2023]
Abstract
Long interspersed nuclear elements (LINEs or L1 elements) are targeted for epigenetic silencing during early embryonic development and remain inactive in most cells and tissues. Here we show that E2F-Rb family complexes participate in L1 elements epigenetic regulation via nucleosomal histone modifications and recruitment of histone deacetylases (HDACs) HDAC1 and HDAC2. Our experiments demonstrated that (i) Rb and E2F interact with human and mouse L1 elements, (ii) L1 elements are deficient in both heterochromatin-associated histone marks H3 tri methyl K9 and H4 tri methyl K20 in Rb family triple knock out (Rb, p107, and p130) fibroblasts (TKO), (iii) L1 promoter exhibits increased histone H3 acetylation in the absence of HDAC1 and HDAC2 recruitment, (iv) L1 expression in TKO fibroblasts is upregulated compared to wild type counterparts, (v) L1 expression increases in the presence of the HDAC inhibitor TSA. On the basis of these findings we propose a model in which L1 sequences throughout the genome serve as centers for heterochromatin formation in an Rb family-dependent manner. As such, Rb proteins and L1 elements may play key roles in heterochromatin formation beyond pericentromeric chromosomal regions. These findings describe a novel mechanism of L1 reactivation in mammalian cells mediated by failure of corepressor protein recruitment by Rb, loss of histone epigenetic marks, heterochromatin formation, and increased histone H3 acetylation.
Collapse
|
23
|
Abstract
Intracellular proteins with a carboxy-terminal transmembrane domain and the amino-terminus oriented toward the cytosol are known as 'tail-anchored' proteins. Tail-anchored proteins have been of considerable interest because several important classes of proteins, including the vesicle-targeting/fusion proteins known as SNAREs and the apoptosis-related proteins of the Bcl-2 family, among others, utilize this unique membrane-anchoring motif. Here, we use a bioinformatic technique to develop a comprehensive list of potentially tail-anchored proteins in the human genome. Our final list contains 411 entries derived from 325 unique genes. We also analyzed both known and predicted tail-anchored proteins with respect to the amino acid composition of the transmembrane segments. This analysis revealed a distinctive composition of the membrane anchor in SNARE proteins.
Collapse
|
24
|
Computational and biological inference of gene regulatory networks of the LINE-1 retrotransposon. Genomics 2007; 90:176-85. [PMID: 17521869 PMCID: PMC2065750 DOI: 10.1016/j.ygeno.2007.04.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2007] [Accepted: 04/05/2007] [Indexed: 01/21/2023]
Abstract
Computational approaches were used to define structural and functional determinants of a putative genetic regulatory network of murine LINE-1 (long interspersed nuclear element-1), an active mammalian retrotransposon that uses RNA intermediates to populate new sites throughout the genome. Polymerase (RNA) II polypeptide E AI845735 and mouse DNA homologous to Drosophila per fragment M12039 were identified as primary attractors. siRNA knockdown of the aryl hydrocarbon receptor NM_013464 modulated gene expression within the network, including LINE-1, Sgpl1, Sdcbp, and Mgst1. Genes within the network did not exhibit physical proximity and instead were dispersed throughout the genome. The potential impact of individual members of the network on the global dynamical behavior of LINE-1 was examined from a theoretical and empirical framework.
Collapse
|
25
|
Validation of a mathematical model of gene transcription in aggregated cellular systems: application to l1 retrotransposition. J Comput Biol 2007; 14:339-49. [PMID: 17563315 DOI: 10.1089/cmb.2006.0125] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We present a methodology aimed at partial validation and accuracy-precision assessment of a mathematical model of gene transcription at the cellular level. The method is based on the analysis of time-series measurements aggregated over a large number of cells. Such measurements are typically obtained via reverse transcriptase-polymerase chain reaction (RT-PCR) experiments. The validation procedure presented herein uses as an example data on L1 retrotransposon gene in HeLa cells. The procedure compares model predicted values with the RT-PCR data for L1 by means of the standard Bayesian statistical techniques with the help of modern Markov-Chain Monte-Carlo methodology.
Collapse
|
26
|
A stochastic model of gene transcription: an application to L1 retrotransposition events. J Theor Biol 2006; 242:101-16. [PMID: 16624324 DOI: 10.1016/j.jtbi.2006.02.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2005] [Revised: 02/04/2006] [Accepted: 02/07/2006] [Indexed: 10/24/2022]
Abstract
A simplified mathematical model of gene transcription is presented based on a system of coupled chemical reactions and a corresponding set of stochastic equations similar to those used in enzyme kinetics theory. The quasi-stationary distribution for the model is derived and its usefulness illustrated with an example of model parameters estimation using sparse time course data on L1 retrotransposon expression kinetics. The issue of model validation is also discussed and a simple validation procedure for the estimated model is devised. The procedure compares model predicted values with the laboratory data via the standard Bayesian techniques with the help of modern Markov-Chain Monte-Carlo methodology.
Collapse
|
27
|
Abstract
Two large-scale yeast two-hybrid screens were undertaken to identify protein-protein interactions between full-length open reading frames predicted from the Saccharomyces cerevisiae genome sequence. In one approach, we constructed a protein array of about 6,000 yeast transformants, with each transformant expressing one of the open reading frames as a fusion to an activation domain. This array was screened by a simple and automated procedure for 192 yeast proteins, with positive responses identified by their positions in the array. In a second approach, we pooled cells expressing one of about 6,000 activation domain fusions to generate a library. We used a high-throughput screening procedure to screen nearly all of the 6,000 predicted yeast proteins, expressed as Gal4 DNA-binding domain fusion proteins, against the library, and characterized positives by sequence analysis. These approaches resulted in the detection of 957 putative interactions involving 1,004 S. cerevisiae proteins. These data reveal interactions that place functionally unclassified proteins in a biological context, interactions between proteins involved in the same biological function, and interactions that link biological functions together into larger cellular processes. The results of these screens are shown here.
Collapse
|
28
|
Untangling the physical contributions to instantaneous normal mode approximations: Inhomogeneous broadening, motional narrowing, and energy relaxation. J Chem Phys 1998. [DOI: 10.1063/1.476157] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
29
|
A molecular dynamics analysis of resonance emission: Optical dephasing and inhomogeneous broadening of CH3I in CH4and Ar. J Chem Phys 1996. [DOI: 10.1063/1.471245] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
30
|
A molecular dynamics study of electronic absorption line broadening in high‐pressure nonpolar gases. J Chem Phys 1995. [DOI: 10.1063/1.470290] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|