1
|
Zeng C, Takeda A, Sekine K, Osato N, Fukunaga T, Hamada M. Bioinformatics Approaches for Determining the Functional Impact of Repetitive Elements on Non-coding RNAs. Methods Mol Biol 2022; 2509:315-340. [PMID: 35796972 DOI: 10.1007/978-1-0716-2380-0_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
With a large number of annotated non-coding RNAs (ncRNAs), repetitive sequences are found to constitute functional components (termed as repetitive elements) in ncRNAs that perform specific biological functions. Bioinformatics analysis is a powerful tool for improving our understanding of the role of repetitive elements in ncRNAs. This chapter summarizes recent findings that reveal the role of repetitive elements in ncRNAs. Furthermore, relevant bioinformatics approaches are systematically reviewed, which promises to provide valuable resources for studying the functional impact of repetitive elements on ncRNAs.
Collapse
Affiliation(s)
- Chao Zeng
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan.
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, Japan.
| | - Atsushi Takeda
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan
| | - Kotaro Sekine
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan
| | - Naoki Osato
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Tokyo, Japan
| | - Michiaki Hamada
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan.
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, Japan.
| |
Collapse
|
2
|
Abstract
Background Although recent studies have revealed the genome-wide distribution of R-loops, our understanding of R-loop formation is still limited. Genomes are known to have a large number of repetitive elements. Emerging evidence suggests that these sequences may play an important regulatory role. However, few studies have investigated the effect of repetitive elements on R-loop formation. Results We found different repetitive elements related to R-loop formation in various species. By controlling length and genomic distributions, we observed that satellite, long interspersed nuclear elements (LINEs), and DNA transposons were each specifically enriched for R-loops in humans, fruit flies, and Arabidopsis thaliana, respectively. R-loops also tended to arise in regions of low-complexity or simple repeats across species. We also found that the repetitive elements associated with R-loop formation differ according to developmental stage. For instance, LINEs and long terminal repeat retrotransposons (LTRs) are more likely to contain R-loops in embryos (fruit fly) and then turn out to be low-complexity and simple repeats in post-developmental S2 cells. Conclusions Our results indicate that repetitive elements may have species-specific or development-specific regulatory effects on R-loop formation. This work advances our understanding of repetitive elements and R-loop biology. Supplementary Information The online version contains supplementary material available at (10.1186/s13100-021-00231-5).
Collapse
Affiliation(s)
- Chao Zeng
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, 63-520, 3-4-1, Okubo Shinjuku-ku, Tokyo, 169-8555, Japan. .,Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1 Okubo Shinjuku-ku, Tokyo, 169-8555, Japan.
| | - Masahiro Onoguchi
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, 63-520, 3-4-1, Okubo Shinjuku-ku, Tokyo, 169-8555, Japan.,Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1 Okubo Shinjuku-ku, Tokyo, 169-8555, Japan
| | - Michiaki Hamada
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, 63-520, 3-4-1, Okubo Shinjuku-ku, Tokyo, 169-8555, Japan. .,Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1 Okubo Shinjuku-ku, Tokyo, 169-8555, Japan. .,Institute for Medical-oriented Structural Biology, Waseda University, 2-2, Wakamatsu-cho Shinjuku-ku, Tokyo, 162-8480, Japan. .,Graduate School of Medicine, Nippon Medical School, 1-1-5, Sendagi, Bunkyo-ku, Tokyo, 113-8602, Japan.
| |
Collapse
|
3
|
Qu Z, Nong W, Yu Y, Baril T, Yip HY, Hayward A, Hui JHL. Genome of the four-finger threadfin Eleutheronema tetradactylum (Perciforms: Polynemidae). BMC Genomics 2020; 21:726. [PMID: 33076831 PMCID: PMC7574432 DOI: 10.1186/s12864-020-07145-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Accepted: 10/12/2020] [Indexed: 12/02/2022] Open
Abstract
Background Teleost fish play important roles in aquatic ecosystems and aquaculture. Threadfins (Perciformes: Polynemidae) show a range of interesting biology, and are of considerable importance for both wild fisheries and aquaculture. Additionally, the four-finger threadfin Eleutheronema tetradactylum is of conservation relevance since its populations are considered to be in rapid decline and it is classified as endangered. However, no genomic resources are currently available for the threadfin family Polynemidae. Results We sequenced and assembled the first threadfin fish genome, the four-finger threadfin E. tetradactylum. We provide a genome assembly for E. tetradactylum with high contiguity (scaffold N50 = 56.3 kb) and high BUSCO completeness at 96.5%. The assembled genome size of E. tetradactylum is just 610.5 Mb, making it the second smallest perciform genome assembled to date. Just 9.07–10.91% of the genome sequence was found to consist of repetitive elements (standard RepeatMasker analysis vs custom analysis), making this the lowest repeat content identified to date for any perciform fish. A total of 37,683 protein-coding genes were annotated, and we include analyses of developmental transcription factors, including the Hox, ParaHox, and Sox families. MicroRNA genes were also annotated and compared with other chordate lineages, elucidating the gains and losses of chordate microRNAs. Conclusions The four-finger threadfin E. tetradactylum genome presented here represents the first available genome sequence for the ecologically, biologically, and commercially important clade of threadfin fish. Our findings provide a useful genomic resource for future research into the interesting biology and evolution of this valuable group of food fish. Supplementary information Supplementary information accompanies this paper at 10.1186/s12864-020-07145-1.
Collapse
Affiliation(s)
- Zhe Qu
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China.
| | - Wenyan Nong
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Yifei Yu
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, Exeter, TR10 9FE, UK
| | - Ho Yin Yip
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, Exeter, TR10 9FE, UK.
| | - Jerome H L Hui
- School of Life Sciences, Simon F.S. Li Marine Science Laboratory, State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, China.
| |
Collapse
|
4
|
Lu H, Cui X, Zhao Y, Magwanga RO, Li P, Cai X, Zhou Z, Wang X, Liu Y, Xu Y, Hou Y, Peng R, Wang K, Liu F. Identification of a genome-specific repetitive element in the Gossypium D genome. PeerJ 2020; 8:e8344. [PMID: 31915591 PMCID: PMC6944119 DOI: 10.7717/peerj.8344] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 12/04/2019] [Indexed: 01/15/2023] Open
Abstract
The activity of genome-specific repetitive sequences is the main cause of genome variation between Gossypium A and D genomes. Through comparative analysis of the two genomes, we retrieved a repetitive element termed ICRd motif, which appears frequently in the diploid Gossypium raimondii (D5) genome but rarely in the diploid Gossypium arboreum (A2) genome. We further explored the existence of the ICRd motif in chromosomes of G. raimondii, G. arboreum, and two tetraploid (AADD) cotton species, Gossypium hirsutum and Gossypium barbadense, by fluorescence in situ hybridization (FISH), and observed that the ICRd motif exists in the D5 and D-subgenomes but not in the A2 and A-subgenomes. The ICRd motif comprises two components, a variable tandem repeat (TR) region and a conservative sequence (CS). The two constituents each have hundreds of repeats that evenly distribute across 13 chromosomes of the D5genome. The ICRd motif (and its repeats) was revealed as the common conservative region harbored by ancient Long Terminal Repeat Retrotransposons. Identification and investigation of the ICRd motif promotes the study of A and D genome differences, facilitates research on Gossypium genome evolution, and provides assistance to subgenome identification and genome assembling.
Collapse
Affiliation(s)
- Hejun Lu
- Gembloux Agro-Bio Tech, University of Liège, Gembloux, Namur, Belgium.,Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Xinglei Cui
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Yanyan Zhao
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Richard Odongo Magwanga
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China.,School of Biological and Physical Sciences (SBPS), Jaramogi Oginga Odinga University of Science and Technology (JOOUST), Bondo-Kenya, Bondo, Kenya
| | - Pengcheng Li
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Xiaoyan Cai
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Zhongli Zhou
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Xingxing Wang
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Yuling Liu
- Anyang Institute of Technology, Anyang, Henan, China
| | - Yanchao Xu
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Yuqing Hou
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Renhai Peng
- Anyang Institute of Technology, Anyang, Henan, China
| | - Kunbo Wang
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China.,Tarium University, Alar, Xinjiang, China
| | - Fang Liu
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| |
Collapse
|
5
|
Lu H, Cui X, Zhao Y, Magwanga RO, Li P, Cai X, Zhou Z, Wang X, Liu Y, Xu Y, Hou Y, Peng R, Wang K, Liu F. Identification of a genome-specific repetitive element in the Gossypium D genome. PeerJ 2020; 8:e8344. [PMID: 31915591 DOI: 10.7287/peerj.preprints.27806v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 12/04/2019] [Indexed: 05/23/2023] Open
Abstract
The activity of genome-specific repetitive sequences is the main cause of genome variation between Gossypium A and D genomes. Through comparative analysis of the two genomes, we retrieved a repetitive element termed ICRd motif, which appears frequently in the diploid Gossypium raimondii (D5) genome but rarely in the diploid Gossypium arboreum (A2) genome. We further explored the existence of the ICRd motif in chromosomes of G. raimondii, G. arboreum, and two tetraploid (AADD) cotton species, Gossypium hirsutum and Gossypium barbadense, by fluorescence in situ hybridization (FISH), and observed that the ICRd motif exists in the D5 and D-subgenomes but not in the A2 and A-subgenomes. The ICRd motif comprises two components, a variable tandem repeat (TR) region and a conservative sequence (CS). The two constituents each have hundreds of repeats that evenly distribute across 13 chromosomes of the D5genome. The ICRd motif (and its repeats) was revealed as the common conservative region harbored by ancient Long Terminal Repeat Retrotransposons. Identification and investigation of the ICRd motif promotes the study of A and D genome differences, facilitates research on Gossypium genome evolution, and provides assistance to subgenome identification and genome assembling.
Collapse
Affiliation(s)
- Hejun Lu
- Gembloux Agro-Bio Tech, University of Liège, Gembloux, Namur, Belgium
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Xinglei Cui
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Yanyan Zhao
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Richard Odongo Magwanga
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
- School of Biological and Physical Sciences (SBPS), Jaramogi Oginga Odinga University of Science and Technology (JOOUST), Bondo-Kenya, Bondo, Kenya
| | - Pengcheng Li
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Xiaoyan Cai
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Zhongli Zhou
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Xingxing Wang
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Yuling Liu
- Anyang Institute of Technology, Anyang, Henan, China
| | - Yanchao Xu
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Yuqing Hou
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| | - Renhai Peng
- Anyang Institute of Technology, Anyang, Henan, China
| | - Kunbo Wang
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
- Tarium University, Alar, Xinjiang, China
| | - Fang Liu
- Research Base of Tarium University, State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Science, Anyang, Henan, China
| |
Collapse
|
6
|
Arroyo M, Bautista R, Larrosa R, Cobo MÁ, Claros MG. Biomarker potential of repetitive-element transcriptome in lung cancer. PeerJ 2019; 7:e8277. [PMID: 31875158 PMCID: PMC6925957 DOI: 10.7717/peerj.8277] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 11/22/2019] [Indexed: 12/11/2022] Open
Abstract
Since repetitive elements (REs) account for nearly 53% of the human genome, profiling its transcription after an oncogenic change might help in the search for new biomarkers. Lung cancer was selected as target since it is the most frequent cause of cancer death. A bioinformatic workflow based on well-established bioinformatic tools (such as RepEnrich, RepBase, SAMTools, edgeR and DESeq2) has been developed to identify differentially expressed RNAs from REs. It was trained and tested with public RNA-seq data from matched sequencing of tumour and healthy lung tissues from the same patient to reveal differential expression within the RE transcriptome. Healthy lung tissues express a specific set of REs whose expression, after an oncogenic process, is strictly and specifically changed. Discrete sets of differentially expressed REs were found for lung adenocarcinoma, for small-cell lung cancer, and for both cancers. Differential expression affects more HERV-than LINE-derived REs and seems biased towards down-regulation in cancer cells. REs behaving consistently in all patients were tested in a different patient cohort to validate the proposed biomarkers. Down-regulation of AluYg6 and LTR18B was confirmed as potential lung cancer biomarkers, while up-regulation of HERVK11D-Int is specific for lung adenocarcinoma and up-regulation of UCON88 is specific for small cell lung cancer. Hence, the study of RE transcriptome might be considered another research target in cancer, making REs a promising source of lung cancer biomarkers.
Collapse
Affiliation(s)
- Macarena Arroyo
- U.G.C. Médico-Quirúrgica de Enfermedades Respiratorias, Hospital Regional Universitario de Málaga, Málaga, Spain.,Department of Molecular Biology and Biochemistry, Universidad de Málaga, Málaga, Spain
| | - Rocío Bautista
- Andalusian Platform for Bioinformatics-SCBI, Universidad de Málaga, Málaga, Spain
| | - Rafael Larrosa
- Andalusian Platform for Bioinformatics-SCBI, Universidad de Málaga, Málaga, Spain.,Department of Computer Architecture, Universidad de Málaga, Málaga, Spain
| | - Manuel Ángel Cobo
- Area of Oncology and Rare Diseases (IBIMA), Hospital Regional Universitario de Málaga, Málaga, Spain
| | - M Gonzalo Claros
- Department of Molecular Biology and Biochemistry, Universidad de Málaga, Málaga, Spain.,Andalusian Platform for Bioinformatics-SCBI, Universidad de Málaga, Málaga, Spain.,Area of Oncology and Rare Diseases (IBIMA), Hospital Regional Universitario de Málaga, Málaga, Spain
| |
Collapse
|
7
|
Zheng Y, Hlady RA, Joyce BT, Robertson KD, He C, Nannini DR, Kibbe WA, Achenbach CJ, Murphy RL, Roberts LR, Hou L. DNA methylation of individual repetitive elements in hepatitis C virus infection-induced hepatocellular carcinoma. Clin Epigenetics 2019; 11:145. [PMID: 31639042 PMCID: PMC6802191 DOI: 10.1186/s13148-019-0733-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 08/29/2019] [Indexed: 02/07/2023] Open
Abstract
Background The two most common repetitive elements (REs) in humans, long interspersed nuclear element-1 (LINE-1) and Alu element (Alu), have been linked to various cancers. Hepatitis C virus (HCV) may cause hepatocellular carcinoma (HCC) by suppressing host defenses, through DNA methylation that controls the mobilization of REs. We aimed to investigate the role of RE methylation in HCV-induced HCC (HCV-HCC). Results We studied methylation of over 30,000 locus-specific REs across the genome in HCC, cirrhotic, and healthy liver tissues obtained by surgical resection. Relative to normal liver tissue, we observed the largest number of differentially methylated REs in HCV-HCC followed by alcohol-induced HCC (EtOH-HCC). After excluding EtOH-HCC-associated RE methylation (FDR < 0.001) and those unable to be validated in The Cancer Genome Atlas (TCGA), we identified 13 hypomethylated REs (11 LINE-1 and 2 Alu) and 2 hypermethylated REs (1 LINE-1 and 1 Alu) in HCV-HCC (FDR < 0.001). A majority of these REs were located in non-coding regions, preferentially enriched with chromatin repressive marks H3K27me3, and positively associated with gene expression (median correlation r = 0.32 across REs). We further constructed an HCV-HCC RE methylation score that distinguished HCV-HCC (lowest score), HCV-cirrhosis, and normal liver (highest score) in a dose-responsive manner (p for trend < 0.001). HCV-cirrhosis had a lower score than EtOH-cirrhosis (p = 0.038) and HCV-HCC had a lower score than EtOH-HCC in TCGA (p = 0.024). Conclusions Our findings indicate that HCV infection is associated with loss of DNA methylation in specific REs, which could implicate molecular mechanisms in liver cancer development. If our findings are validated in larger sample sizes, methylation of these REs may be useful as an early detection biomarker for HCV-HCC and/or a target for prevention of HCC in HCV-positive individuals. Electronic supplementary material The online version of this article (10.1186/s13148-019-0733-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yinan Zheng
- Center for Global Oncology, Institute for Global Health, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N. Lake Shore Drive, Suite 1400, Chicago, IL, 60611-4402, USA.
| | - Ryan A Hlady
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA
| | - Brian T Joyce
- Center for Global Oncology, Institute for Global Health, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N. Lake Shore Drive, Suite 1400, Chicago, IL, 60611-4402, USA
| | - Keith D Robertson
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA.,Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA
| | - Chunyan He
- University of Kentucky Markey Cancer Center, Lexington, KY, USA.,Department of Internal Medicine, Division of Medical Oncology, University of Kentucky, Lexington, KY, USA
| | - Drew R Nannini
- Center for Global Oncology, Institute for Global Health, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N. Lake Shore Drive, Suite 1400, Chicago, IL, 60611-4402, USA
| | - Warren A Kibbe
- Duke Cancer Institute and Duke School of Medicine, Duke University, Durham, NC, USA
| | - Chad J Achenbach
- Center for Global Health, Institute for Public Health and Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.,Division of Infectious Diseases, Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Robert L Murphy
- Center for Global Health, Institute for Public Health and Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.,Robert H Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Lewis R Roberts
- Division of Gastroenterology and Hepatology, Mayo Clinic, Rochester, MN, USA
| | - Lifang Hou
- Center for Global Oncology, Institute for Global Health, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N. Lake Shore Drive, Suite 1400, Chicago, IL, 60611-4402, USA
| |
Collapse
|
8
|
Huang LL, Zhou B, Ai SH, Yang P, Chen YJ, Liu C, Deng YL, Lu Q, Miao XP, Lu WQ, Wang YX, Zeng Q. Prenatal phthalate exposure, birth outcomes and DNA methylation of Alu and LINE-1 repetitive elements: A pilot study in China. Chemosphere 2018; 206:759-765. [PMID: 29793068 DOI: 10.1016/j.chemosphere.2018.05.030] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Revised: 05/03/2018] [Accepted: 05/04/2018] [Indexed: 05/08/2023]
Abstract
BACKGROUND Epigenetic mechanisms, such as altered DNA methylation, may participate in the relationship between prenatal phthalate exposure and adverse birth outcomes. OBJECTIVE To explore the mediation effect of DNA methylation in the associations of phthalate exposure before delivery with birth outcomes in a Chinese cohort. METHODS Eight phthalate metabolites in maternal urine before delivery and DNA methylation of Alu and long interspersed nucleotide elements (LINE-1) in cord blood were determined among 106 mother-infant pairs. General additive models were used to assess the associations of maternal urinary phthalate metabolites with birth outcomes and DNA methylation; the mediating role of DNA methylation in cord blood was evaluated by mediation analysis. RESULTS We found sex-specific associations between prenatal phthalate exposure and birth outcomes and DNA methylation of cord blood. For example, the molar sum of di-2-(ethylhexyl) phthalate (∑DEHPm) metabolites in maternal urine was positively associated with gestational age among male newborns only (P < 0.05); maternal urinary monobenzyl phthalate (MBzP) was negatively associated with Alu methylation among female newborns only (P < 0.05). Mediation analysis did not find that methylation of Alu and LINE-1 to be a direct mediator in the relationships between maternal urinary phthalate metabolites before delivery and birth outcomes. CONCLUSION Prenatal exposure to certain phthalates was associated with altered birth outcomes and decreased repetitive element methylation of newborns. However, the altered birth outcomes exerted by prenatal phthalate exposure does not seem to be directly mediated through repetitive element methylation in cord blood.
Collapse
Affiliation(s)
- Li-Li Huang
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Bin Zhou
- Wuhan Children's Hospital (Wuhan Maternal and Child Healthcare Hospital), Tongji Medical College, Huazhong University of Science and Technology, WuHan, PR China; Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Song-Hua Ai
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Pan Yang
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Ying-Jun Chen
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Chong Liu
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Yan-Ling Deng
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Qing Lu
- Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Xiao-Ping Miao
- Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Wen-Qing Lu
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Yi-Xin Wang
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
| | - Qiang Zeng
- Department of Occupational and Environmental Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China; Key Laboratory of Environment and Health, Ministry of Education & Ministry of Environmental Protection, and State Key Laboratory of Environmental Health (incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China.
| |
Collapse
|
9
|
McKerrow WH, Savva YA, Rezaei A, Reenan RA, Lawrence CE. Predicting RNA hyper-editing with a novel tool when unambiguous alignment is impossible. BMC Genomics 2017; 18:522. [PMID: 28693467 PMCID: PMC5502491 DOI: 10.1186/s12864-017-3898-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 06/22/2017] [Indexed: 12/13/2022] Open
Abstract
Background Repetitive elements are now known to have relevant cellular functions, including self-complementary sequences that form double stranded (ds) RNA. There are numerous pathways that determine the fate of endogenous dsRNA, and misregulation of endogenous dsRNA is a driver of autoimmune disease, particularly in the brain. Unfortunately, the alignment of high-throughput, short-read sequences to repeat elements poses a dilemma: Such sequences may align equally well to multiple genomic locations. In order to differentiate repeat elements, current alignment methods depend on sequence variation in the reference genome. Reads are discarded when no such variations are present. However, RNA hyper-editing, a possible fate for dsRNA, introduces enough variation to distinguish between repeats that are otherwise identical. Results To take advantage of this variation, we developed a new algorithm, RepProfile, that simultaneously aligns reads and predicts novel variations. RepProfile accurately aligns hyper-edited reads that other methods discard. In particular we predict hyper-editing of Drosophila melanogaster repeat elements in vivo at levels previously described only in vitro, and provide validation by Sanger sequencing sixty-two individual cloned sequences. We find that hyper-editing is concentrated in genes involved in cell-cell communication at the synapse, including some that are associated with neurodegeneration. We also find that hyper-editing tends to occur in short runs. Conclusions Previous studies of RNA hyper-editing discarded ambiguously aligned reads, ignoring hyper-editing in long, perfect dsRNA – the perfect substrate for hyper-editing. We provide a method that simulation and Sanger validation show accurately predicts such RNA editing, yielding a superior picture of hyper-editing. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3898-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Wilson H McKerrow
- Division of Applied Mathematics, Brown University, Providence, 02912, RI, USA.
| | - Yiannis A Savva
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, 02912, RI, USA
| | - Ali Rezaei
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, 02912, RI, USA
| | - Robert A Reenan
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, 02912, RI, USA
| | - Charles E Lawrence
- Division of Applied Mathematics, Brown University, Providence, 02912, RI, USA
| |
Collapse
|
10
|
Ramos É, Cardoso AL, Brown J, Marques DF, Fantinatti BEA, Cabral-de-Mello DC, Oliveira RA, O'Neill RJ, Martins C. The repetitive DNA element BncDNA, enriched in the B chromosome of the cichlid fish Astatotilapia latifasciata, transcribes a potentially noncoding RNA. Chromosoma 2016; 126:313-323. [PMID: 27169573 DOI: 10.1007/s00412-016-0601-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2016] [Revised: 04/03/2016] [Accepted: 05/03/2016] [Indexed: 12/27/2022]
Abstract
Supernumerary chromosomes have been studied in many species of eukaryotes, including the cichlid fish, Astatotilapia latifasciata. However, there are many unanswered questions about the maintenance, inheritance, and functional aspects of supernumerary chromosomes. The cichlid family has been highlighted as a model for evolutionary studies, including those that focus on mechanisms of chromosome evolution. Individuals of A. latifasciata are known to carry up to two B heterochromatic isochromosomes that are enriched in repetitive DNA and contain few intact gene sequences. We isolated and characterized a transcriptionally active repeated DNA, called B chromosome noncoding DNA (BncDNA), highly represented across all B chromosomes of A. latifasciata. BncDNA transcripts are differentially processed among six different tissues, including the production of smaller transcripts, indicating transcriptional variation may be linked to B chromosome presence and sexual phenotype. The transcript lengths and lack of similarity with known protein/gene sequences indicate BncRNA might represent a novel long noncoding RNA family (lncRNA). The potential for interaction between BncRNA and known miRNAs were computationally predicted, resulting in the identification of possible binding of this sequence in upregulated miRNAs related to the presence of B chromosomes. In conclusion, Bnc is a transcriptionally active repetitive DNA enriched in B chromosomes with potential action over B chromosome maintenance in somatic cells and meiotic drive in gametic cells.
Collapse
Affiliation(s)
- Érica Ramos
- Department of Morphology, Institute of Biosciences, Sao Paulo State University, 18618-689, Botucatu, SP, Brazil
| | - Adauto L Cardoso
- Department of Morphology, Institute of Biosciences, Sao Paulo State University, 18618-689, Botucatu, SP, Brazil
| | - Judith Brown
- Allied Health Sciences Department and Institute for Systems Genomics, University of Connecticut, 06269, Storrs, CT, USA
| | - Diego F Marques
- Department of Morphology, Institute of Biosciences, Sao Paulo State University, 18618-689, Botucatu, SP, Brazil
| | - Bruno E A Fantinatti
- Department of Morphology, Institute of Biosciences, Sao Paulo State University, 18618-689, Botucatu, SP, Brazil
| | - Diogo C Cabral-de-Mello
- Department of Biology, Institute of Biosciences, Sao Paulo State University, 13506-900, Rio Claro, SP, Brazil
| | - Rogério A Oliveira
- Department of Biostatistics, Institute of Biosciences, Sao Paulo State University, 18618-689, Botucatu, SP, Brazil
| | - Rachel J O'Neill
- Department of Molecular and Cell Biology and Institute for Systems Genomics, University of Connecticut, 06269, Storrs, CT, USA
| | - Cesar Martins
- Department of Morphology, Institute of Biosciences, Sao Paulo State University, 18618-689, Botucatu, SP, Brazil.
| |
Collapse
|
11
|
Mallona I, Jordà M, Peinado MA. A knowledgebase of the human Alu repetitive elements. J Biomed Inform 2016; 60:77-83. [PMID: 26827622 DOI: 10.1016/j.jbi.2016.01.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Revised: 01/20/2016] [Accepted: 01/22/2016] [Indexed: 01/13/2023]
Abstract
Alu elements are the most abundant retrotransposons in the human genome with more than one million copies. Alu repeats have been reported to participate in multiple processes related with genome regulation and compartmentalization. Moreover, they have been involved in the facilitation of pathological mutations in many diseases, including cancer. The contribution of Alus and other repeats in genomic regulation is often overlooked because their study poses technical and analytical challenges hardly attainable with conventional strategies. Here we propose the integration of ontology-based semantic methods to query a knowledgebase for the human Alus. The knowledgebase for the human Alus leverages Sequence (SO) and Gene Ontologies (GO) and is devoted to address functional and genetic information in the genomic context of the Alus. For each Alu element, the closest gene and transcript are stored, as well their functional annotation according to GO, the state of the chromatin and the transcription factors binding sites inside the Alu. The model uses Web Ontology Language (OWL) and Semantic Web Rule Language (SWRL). As a case of use and to illustrate the utility of the tool, we have evaluated the epigenetic states of Alu repeats associated with gene promoters according to their transcriptional activity. The ontology is easily extendable, offering a scaffold for the inclusion of new experimental data. The RDF/XML formalization is freely available at http://aluontology.sourceforge.net/.
Collapse
Affiliation(s)
- Izaskun Mallona
- Institute of Predictive and Personalized Medicine of Cancer (IMPPC) and Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus. Ctra. de Can Ruti, camí de les escoles, s/n, 08916 Badalona, Spain.
| | - Mireia Jordà
- Institute of Predictive and Personalized Medicine of Cancer (IMPPC) and Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus. Ctra. de Can Ruti, camí de les escoles, s/n, 08916 Badalona, Spain
| | - Miguel A Peinado
- Institute of Predictive and Personalized Medicine of Cancer (IMPPC) and Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus. Ctra. de Can Ruti, camí de les escoles, s/n, 08916 Badalona, Spain
| |
Collapse
|
12
|
Lee KH, Shin KS, Lim D, Kim WC, Chung BC, Han GB, Roh J, Cho DH, Cho K. Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes. Genomics 2015; 106:30-42. [PMID: 25918033 DOI: 10.1016/j.ygeno.2015.04.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 03/24/2015] [Accepted: 04/17/2015] [Indexed: 10/23/2022]
Abstract
The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes.
Collapse
Affiliation(s)
- Kang-Hoon Lee
- Department of Surgery, University of California, Davis, Sacramento, CA, United States; Shriners Hospitals for Children Northern California, Sacramento, CA, United States
| | - Kyung-Seop Shin
- Department of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Debora Lim
- Department of Surgery, University of California, Davis, Sacramento, CA, United States; Shriners Hospitals for Children Northern California, Sacramento, CA, United States
| | - Woo-Chan Kim
- Department of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Byung Chang Chung
- Department of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Gyu-Bum Han
- Department of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Jeongkyu Roh
- Department of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Dong-Ho Cho
- Department of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Kiho Cho
- Department of Surgery, University of California, Davis, Sacramento, CA, United States; Shriners Hospitals for Children Northern California, Sacramento, CA, United States.
| |
Collapse
|
13
|
You RN, Kim WC, Lee KH, Lee YK, Shin KS, Cho K, Cho DH. REViewer: a tool for linear visualization of repetitive elements within a sequence query. Genomics 2013; 102:209-14. [PMID: 23891933 DOI: 10.1016/j.ygeno.2013.07.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Revised: 05/28/2013] [Accepted: 07/16/2013] [Indexed: 01/20/2023]
Abstract
A species-specific population of arrangements of repetitive elements (REs), called RE arrays, exists in the human and mouse genomes. We developed an RE analytical tool, named REViewer, for visualizing RE occurrences within RE arrays and other genomic regions as an interactive line map. REViewer utilizes an RE reference library which is established with two RE types: 1) REMiner-generated undefined REs and 2) RepeatMasker-derived defined REs. RE occurrences within queries are visualized as a line map using these two RE types. The REViewer's controller provides analytical options, such as zoom, customization of axis unit, and RE type selection. The functionality of REViewer was evaluated using the human chromosome Y sequence. The REViewer is determined to be an efficient tool that facilitates visualization of up to 6000 REs in RE arrays and other genomic regions. The maximum query size is linked to the RE mining tools (e.g., REMiner, RepeatMasker), not to REViewer.
Collapse
|