1
|
Fick A, Swart V, Bombarely A, van den Berg N. Comparative transcriptional analysis of Persea americana MYB, WRKY and AP2/ERF transcription factors following Phytophthora cinnamomi infection. MOLECULAR PLANT PATHOLOGY 2024; 25:e13453. [PMID: 38590150 PMCID: PMC11002358 DOI: 10.1111/mpp.13453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 03/07/2024] [Accepted: 03/20/2024] [Indexed: 04/10/2024]
Abstract
Plant cells undergo extensive transcriptional reprogramming following pathogen infection, with these reprogramming patterns becoming more complex when pathogens, such as hemibiotrophs, exhibit different lifestyles. These transcriptional changes are often orchestrated by MYB, WRKY and AP2/ERF transcription factors (TFs), which modulate both growth and defence-related gene expression. Transcriptional analysis of defence-related genes in avocado (Persea americana) infected with Phytophthora cinnamomi indicated differential immune response activation when comparing a partially resistant and susceptible rootstock. This study identified 226 MYB, 82 WRKY, and 174 AP2/ERF TF-encoding genes in avocado, using a genome-wide approach. Phylogenetic analysis revealed substantial sequence conservation within TF groups underscoring their functional significance. RNA-sequencing analysis in a partially resistant and susceptible avocado rootstock infected with P. cinnamomi was indicative of an immune response switch occurring in either rootstock after 24 and 6 h post-inoculation, respectively. Different clusters of co-expressed TF genes were observed at these times, suggesting the activation of necrotroph-related immune responses at varying intervals between the two rootstocks. This study aids our understanding of avocado immune response activation following P. cinnamomi infection, and the role of the TFs therein, elucidating the transcriptional reprogramming disparities between partially resistant and susceptible rootstocks.
Collapse
Affiliation(s)
- Alicia Fick
- Department of Biochemistry, Genetics and MicrobiologyUniversity of PretoriaPretoriaGautengSouth Africa
- Hans Merensky Chair in Avocado Research, Forestry and Agricultural Biotechnology InstituteUniversity of PretoriaPretoriaGautengSouth Africa
| | - Velushka Swart
- Department of Biochemistry, Genetics and MicrobiologyUniversity of PretoriaPretoriaGautengSouth Africa
- Hans Merensky Chair in Avocado Research, Forestry and Agricultural Biotechnology InstituteUniversity of PretoriaPretoriaGautengSouth Africa
| | - Aureliano Bombarely
- Instituto de Biología Molecular y Celular de PlantasConsejo Superior de Investigaciones Científicas‐Universitat Politècnica de València (IBMCP‐CSIC‐UPV)ValenciaSpain
| | - Noëlani van den Berg
- Department of Biochemistry, Genetics and MicrobiologyUniversity of PretoriaPretoriaGautengSouth Africa
- Hans Merensky Chair in Avocado Research, Forestry and Agricultural Biotechnology InstituteUniversity of PretoriaPretoriaGautengSouth Africa
| |
Collapse
|
2
|
Liu K, Qi A, Ru W, Jiang X, Cao H, Lan X, Huang Y, Lei C, Sun X, Chen H. Insertion/deletions within the bovine FoxO1 gene and their association analysis with growth traits in three Chinese cattle breeds. Anim Biotechnol 2023; 34:2051-2058. [PMID: 35491893 DOI: 10.1080/10495398.2022.2068024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
FOXO1 (FKHR) gene, as a transcription factor, plays a vital role in animal growth and development, participating in many biological processes. The aim of this study was to ascertain Insertion/deletions (Indels) polymorphism within bovine FoxO1 gene in 679 Chinese adult cows and associate them with stature traits. Two Indels (named as Indel-3 and Indel-4, recorded as rs383545622 and rs525318770 in NCBI, respectively) were successfully genotyped by the Once PCR method, which was reliable, rapid and cost effective for simultaneous detection of two or more Indels. Indel-3 and Indel-4 were located at the second intron. All four different haplotypes (H1: D3D4, H2: I3D4, H3: D3I4, H4: I3I4) could be identified, and the D (del-) allele, DD (del-/del-) genotype and D3D4 haplotype retained the highest frequency. However, individuals with DI (D3I3, D4I4 or H1H4/H2H3 genotype) showed significantly better phenotypic traits than those with the other genotypes in Nanyang cattle, showing a hybrid vigor. The results implied that this DI genotype can be applied to early selective breeding to improve the productivity of Nanyang cattle. Our results suggested that these two Indels within the bovine FoxO1 gene might be used as genetic markers for marker-assisted selection (MAS) in cattle breeding and genetics.
Collapse
Affiliation(s)
- Kunpeng Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Ao Qi
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Wenxiu Ru
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Xiaojun Jiang
- Agriculture and Animal Husbandry Fine Seed Breeding Farm of Shaanxi Province, Fufeng, China
| | - Hui Cao
- Shaanxi Kingbull Livestock Co., LTD, Yangling, China
| | - Xianyong Lan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Yongzhen Huang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Chuzhao Lei
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Xiuzhu Sun
- College of Grassland Agriculture, Northwest A&F University, Yangling, China
| | - Hong Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| |
Collapse
|
3
|
Westrick NM, Park SC, Keller NP, Smith DL, Kabbage M. A broadly conserved fungal alcohol oxidase (AOX) facilitates fungal invasion of plants. MOLECULAR PLANT PATHOLOGY 2023; 24:28-43. [PMID: 36251755 PMCID: PMC9742500 DOI: 10.1111/mpp.13274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 09/13/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
Alcohol oxidases (AOXs) are ecologically important enzymes that facilitate a number of plant-fungal interactions. Within Ascomycota they are primarily associated with methylotrophy, as a peroxisomal AOX catalysing the conversion of methanol to formaldehyde in methylotrophic yeast. In this study we demonstrate that AOX orthologues are phylogenetically conserved proteins that are common in the genomes of nonmethylotrophic, plant-associating fungi. Additionally, AOX orthologues are highly expressed during infection in a range of diverse pathosystems. To study the role of AOX in plant colonization, AOX knockout mutants were generated in the broad host range pathogen Sclerotinia sclerotiorum. Disease assays in soybean showed that these mutants had a significant virulence defect as evidenced by markedly reduced stem lesions and mortality rates. Chemical genomics suggested that SsAOX may function as an aromatic AOX, and growth assays demonstrated that ΔSsAOX is incapable of properly utilizing plant extract as a nutrient source. Profiling of known aromatic alcohols pointed towards the monolignol coniferyl alcohol (CA) as a possible substrate for SsAOX. As CA and other monolignols are ubiquitous among land plants, the presence of highly conserved AOX orthologues throughout Ascomycota implies that this is a broadly conserved protein used by ascomycete fungi during plant colonization.
Collapse
Affiliation(s)
- Nathaniel M. Westrick
- Department of Plant PathologyUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
- United States Department of Agriculture–Agricultural Research ServiceMadisonWisconsinUSA
| | - Sung Chul Park
- Department of Medical Microbiology and ImmunologyUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| | - Nancy P. Keller
- Department of Plant PathologyUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
- Department of Medical Microbiology and ImmunologyUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| | - Damon L. Smith
- Department of Plant PathologyUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| | - Mehdi Kabbage
- Department of Plant PathologyUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| |
Collapse
|
4
|
Pan L, Luo Y, Wang J, Li X, Tang B, Yang H, Hou X, Liu F, Zou X. Evolution and functional diversification of catalase genes in the green lineage. BMC Genomics 2022; 23:411. [PMID: 35650553 PMCID: PMC9158360 DOI: 10.1186/s12864-022-08621-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 05/09/2022] [Indexed: 11/10/2022] Open
Abstract
Background Catalases (CATs) break down hydrogen peroxide into water and oxygen to prevent cellular oxidative damage, and play key roles in the development, biotic and abiotic stresses of plants. However, the evolutionary relationships of the plant CAT gene family have not been systematically reported. Results Here, we conducted genome-wide comparative, phylogenetic, and structural analyses of CAT orthologs from 29 out of 31 representative green lineage species to characterize the evolution and functional diversity of CATs. We found that CAT genes in land plants were derived from core chlorophytes and detected a lineage-specific loss of CAT genes in Fabaceae, suggesting that the CAT genes in this group possess divergent functions. All CAT genes were split into three major groups (group α, β1, and β2) based on the phylogeny. CAT genes were transferred from bacteria to core chlorophytes and charophytes by lateral gene transfer, and this led to the independent evolution of two types of CAT genes: α and β types. Ten common motifs were detected in both α and β groups, and β CAT genes had five unique motifs, respectively. The findings of our study are inconsistent with two previous hypotheses proposing that (i) new CAT genes are acquired through intron loss and that (ii) the Cys-343 residue is highly conserved in plants. We found that new CAT genes in most higher plants were produced through intron acquisition and that the Cys-343 residue was only present in monocots, Brassicaceae and Pp_CatX7 in P. patens, which indicates the functional specificity of the CATs in these three lineages. Finally, our finding that CAT genes show high overall sequence identity but that individual CAT genes showed developmental stage and organ-specific expression patterns suggests that CAT genes have functionally diverged independently. Conclusions Overall, our analyses of the CAT gene family provide new insights into their evolution and functional diversification in green lineage species. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08621-6.
Collapse
|
5
|
Schäfer GG, Grebe LJ, Schinkel R, Lieb B. The Evolution of Hemocyanin Genes in Caenogastropoda: Gene Duplications and Intron Accumulation in Highly Diverse Gastropods. J Mol Evol 2021; 89:639-655. [PMID: 34757470 PMCID: PMC8599328 DOI: 10.1007/s00239-021-10036-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 10/15/2021] [Indexed: 11/30/2022]
Abstract
Hemocyanin is the oxygen transport protein of most molluscs and represents an important physiological factor that has to be well-adapted to their environments because of the strong influences of abiotic factors on its oxygen affinity. Multiple independent gene duplications and intron gains have been reported for hemocyanin genes of Tectipleura (Heterobranchia) and the caenogastropod species Pomacea canaliculata, which contrast with the uniform gene architectures of hemocyanins in Vetigastropoda. The goal of this study was to analyze hemocyanin gene evolution within the diverse group of Caenogastropoda in more detail. Our findings reveal multiple gene duplications and intron gains and imply that these represent general features of Apogastropoda hemocyanins. Whereas hemocyanin exon–intron structures are identical within different Tectipleura lineages, they differ strongly within Caenogastropoda among phylogenetic groups as well as between paralogous hemocyanin genes of the same species. Thus, intron accumulation took place more gradually within Caenogastropoda but finally led to a similar consequence, namely, a multitude of introns. Since both phenomena occurred independently within Heterobranchia and Caenogastropoda, the results support the hypothesis that introns may contribute to adaptive radiation by offering new opportunities for genetic variability (multiple paralogs that may evolve differently) and regulation (multiple introns). Our study indicates that adaptation of hemocyanin genes may be one of several factors that contributed to the evolution of the large diversity of Apogastropoda. While questions remain, this hypothesis is presented as a starting point for the further study of hemocyanin genes and possible correlations between hemocyanin diversity and adaptive radiation.
Collapse
Affiliation(s)
- Gabriela Giannina Schäfer
- Institute of Molecular Physiology, Johannes Gutenberg-University of Mainz, Johann-Joachim-Becher-Weg 7, 55128, Mainz, Germany
| | - Lukas Jörg Grebe
- Institute of Molecular Physiology, Johannes Gutenberg-University of Mainz, Johann-Joachim-Becher-Weg 7, 55128, Mainz, Germany
| | - Robin Schinkel
- Institute of Molecular Physiology, Johannes Gutenberg-University of Mainz, Johann-Joachim-Becher-Weg 7, 55128, Mainz, Germany
| | - Bernhard Lieb
- Institute of Molecular Physiology, Johannes Gutenberg-University of Mainz, Johann-Joachim-Becher-Weg 7, 55128, Mainz, Germany.
| |
Collapse
|
6
|
Meng D, Dong B, Niu L, Song Z, Wang L, Amin R, Cao H, Li H, Yang Q, Fu Y. The pigeon pea CcCIPK14-CcCBL1 pair positively modulates drought tolerance by enhancing flavonoid biosynthesis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 106:1278-1297. [PMID: 33733535 DOI: 10.1111/tpj.15234] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 03/09/2021] [Accepted: 03/10/2021] [Indexed: 05/22/2023]
Abstract
Calcineurin B-like (CBL)-interacting protein kinases (CIPKs) play a central role in Ca2+ signalling and promote drought tolerance in plants. The CIPK gene family in pigeon pea (Cajanus cajan L.), a major food crop affected by drought, has not previously been characterised. Here, we identified 28 CIPK genes in the pigeon pea genome. Five CcCIPK genes were strongly upregulated in roots upon drought treatment and were selected for further characterisation. Overexpression of CcCIPK13 and CcCIPK14 increased survival rates by two- to three-fold relative to controls after 14 days of drought. Furthermore, the three major flavonoids, genistin, genistein and apigenin, were significantly upregulated in the same transgenic plants. Using CcCIPK14 as bait, we performed a yeast two-hybrid screen and identified six interactors, including CcCBL1. CcCIPK14 exhibited autophosphorylation and phosphorylation of CcCBL1 in vitro. CcCBL1-overexpressed plants displayed higher survival rates upon drought stress as well as higher expression of flavonoid biosynthetic genes and flavonoid content. CcCIPK14-overexpressed plants in which CcCBL1 transcript levels were reduced by RNA interference had lower survival rates, which indicated CcCBL1 in the same pathway as CcCIPK14. Together, our results demonstrate a role for the CcCIPK14-CcCBL1 complex in drought stress tolerance through the regulation of flavonoid biosynthesis in pigeon pea.
Collapse
Affiliation(s)
- Dong Meng
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Biying Dong
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Lili Niu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Zhihua Song
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Litao Wang
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Rohul Amin
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Hongyan Cao
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Hanghang Li
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Qing Yang
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China
- College of Forestry, Beijing Forestry University, Bejing, China
| | - Yujie Fu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China
- College of Forestry, Beijing Forestry University, Bejing, China
- Key Laboratory of Forest Plant Ecology, Ministry of Education, Northeast Forestry University, Harbin, China
| |
Collapse
|
7
|
Fanelli E, Troccoli A, Tarasco E, De Luca F. Molecular Characterization and Functional Analysis of the Hb-hsp90-1 Gene in Relation to Temperature Changes in Heterorhabditis bacteriophora. Front Physiol 2021; 12:615653. [PMID: 33732162 PMCID: PMC7959791 DOI: 10.3389/fphys.2021.615653] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 01/26/2021] [Indexed: 12/17/2022] Open
Abstract
Understanding how entomopathogenic nematodes respond to temperature changes and have adapted to the local environment is crucial to improve their potential as biocontrol agents. In order to improve understanding of Heterorhabditis bacteriophora's potential adaptability to future climate changes, full-length cDNA and the corresponding gene of heat shock protein 90 (Hsp90) were isolated and fully characterized. The reproductive potential of the Apulian strain of H. bacteriophora increased when the temperature rose from 23 to 30°C, but no reproduction was found at 12°C. Expression analyses revealed that Hb-hsp90-1 was differentially expressed in Infective Juveniles (IJs) and adults (hermaphrodites, females and males). Up-regulation of Hb-hsp90-1 was higher during the recovery process in Galleria mellonella larvae than adults, thus confirming the protective role of Hb-hsp90-1 in coping with the host environment. Silencing of Hb-hsp90-1 resulted in a significant reduction (76%) in the expression level. Silenced IJs took longer than untreated nematodes to infect G. mellonella, showing that Hb-hsp90-1 could be also involved in chemosensation. Furthermore, the number of adults and IJs recovered from G. mellonella infected with silenced nematodes and incubated at 30°C was higher than that obtained from G. mellonella infected with untreated nematodes. These data confirm the crucial role of Hb-hsp90-1 allowing acclimation to increased temperatures and modulation of the recovery process.
Collapse
Affiliation(s)
- Elena Fanelli
- Institute for Sustainable Plant Protection-CNR, Bari, Italy
| | | | - Eustachio Tarasco
- Institute for Sustainable Plant Protection-CNR, Bari, Italy
- Section of Entomology and Zoology, Department of Soil, Plant and Food Sciences, University of Bari “A. Moro”, Bari, Italy
| | | |
Collapse
|
8
|
Wang B, Du X, Wang H, Jin C, Gao C, Liu J, Zhang Q. Comparative studies on duplicated tdrd7 paralogs in teleosts: Molecular evolution caused neo-functionalization. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2019; 30:347-357. [PMID: 31059868 DOI: 10.1016/j.cbd.2019.04.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 03/03/2019] [Accepted: 04/24/2019] [Indexed: 11/27/2022]
Abstract
The third-round whole genome duplication (3R-WGD) event occurred in the stem lineage of teleost during evolution, and is considered to be responsible for the biological diversification of ray-finned fishes. TUDOR domain containing protein 7 (Tdrd7), which belongs to the Tudor family proteins has been widely discussed in mammals. However, information about this gene in teleost is still lacking. In this study, two teleost tdrd7 genes (tdrd7a and tdrd7b) were identified in the transcriptome of Japanese flounder (Paralichthys olivaceus). Through genomic structure, phylogenetic, synteny analysis and online bioinformatic mining of tdrd7 duplications in other selected species, we confirmed that tdrd7a/7b were originated from the teleost-specific 3R-WGD. The tdrd7a is specific to teleost except for spotted gar. The tdrd7a showed a higher molecular evolution rate than tdrd7b with longer branch-length in the phylogenetic tree and multiple positively selected sites. Interestingly, it showed gonad specific expression pattern in adult tissues and germ cell specific distribution in embryos and gonads. Its 3'-untranslated region (3'UTR) labeled eGFP/DsRED could visualize primordial germ cells (PGCs) in zebrafish embryos. The tdrd7b did not show similar tissue and cell type specificity. These characteristic differences between the duplicated tdrd7 paralogues suggest that tdrd7a and tdrd7b have undergone neofunctionalization in Japanese flounder. Our results provide novel insight into the evolution and functional diversification of teleost tdrd7 genes deserving further investigations.
Collapse
Affiliation(s)
- Bo Wang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China
| | - Xinxin Du
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China
| | - Huizhen Wang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China
| | - Chaofan Jin
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China
| | - Chen Gao
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China
| | - Jinxiang Liu
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China
| | - Quanqi Zhang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China, 266003 Qingdao, Shandong, China; Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, China.
| |
Collapse
|
9
|
Das S, Bansal M. Variation of gene expression in plants is influenced by gene architecture and structural properties of promoters. PLoS One 2019; 14:e0212678. [PMID: 30908494 PMCID: PMC6433290 DOI: 10.1371/journal.pone.0212678] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2018] [Accepted: 02/07/2019] [Indexed: 12/03/2022] Open
Abstract
In higher eukaryotes, gene architecture and structural properties of promoters have emerged as significant factors influencing variation in number of transcripts (expression level) and specificity of gene expression in a tissue (expression breadth), which eventually shape the phenotype. In this study, transcriptome data of different tissue types at various developmental stages of A. thaliana, O. sativa, S. bicolor and Z. mays have been used to understand the relationship between properties of gene components and its expression. Our findings indicate that in plants, among all gene architecture and structural properties of promoters, compactness of genes in terms of intron content is significantly linked to gene expression level and breadth, whereas in human an exactly opposite scenario is seen. In plants, for the first time we have carried out a quantitative estimation of effect of a particular trait on expression level and breadth, by using multiple regression analysis and it confirms that intron content of primary transcript (as %) is a powerful determinant of expression breadth. Similarly, further regression analysis revealed that among structural properties of the promoters, stability is negatively linked to expression breadth, while DNase1 sensitivity strongly governs gene expression breadth in monocots and gene expression level in dicots. In addition, promoter regions of tissue specific genes are found to be enriched with TATA box and Y-patch motifs. Finally, multi copy orthologous genes in plants are found to be longer, highly regulated and tissue specific.
Collapse
Affiliation(s)
- Sanjukta Das
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Manju Bansal
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| |
Collapse
|
10
|
Mukherjee D, Saha D, Acharya D, Mukherjee A, Chakraborty S, Ghosh TC. The role of introns in the conservation of the metabolic genes of Arabidopsis thaliana. Genomics 2018; 110:310-317. [DOI: 10.1016/j.ygeno.2017.12.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Revised: 12/06/2017] [Accepted: 12/08/2017] [Indexed: 10/18/2022]
|
11
|
Jayaswal PK, Dogra V, Shanker A, Sharma TR, Singh NK. A tree of life based on ninety-eight expressed genes conserved across diverse eukaryotic species. PLoS One 2017; 12:e0184276. [PMID: 28922368 PMCID: PMC5603157 DOI: 10.1371/journal.pone.0184276] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2017] [Accepted: 08/21/2017] [Indexed: 01/07/2023] Open
Abstract
Rapid advances in DNA sequencing technologies have resulted in the accumulation of large data sets in the public domain, facilitating comparative studies to provide novel insights into the evolution of life. Phylogenetic studies across the eukaryotic taxa have been reported but on the basis of a limited number of genes. Here we present a genome-wide analysis across different plant, fungal, protist, and animal species, with reference to the 36,002 expressed genes of the rice genome. Our analysis revealed 9831 genes unique to rice and 98 genes conserved across all 49 eukaryotic species analysed. The 98 genes conserved across diverse eukaryotes mostly exhibited binding and catalytic activities and shared common sequence motifs; and hence appeared to have a common origin. The 98 conserved genes belonged to 22 functional gene families including 26S protease, actin, ADP–ribosylation factor, ATP synthase, casein kinase, DEAD-box protein, DnaK, elongation factor 2, glyceraldehyde 3-phosphate, phosphatase 2A, ras-related protein, Ser/Thr protein phosphatase family protein, tubulin, ubiquitin and others. The consensus Bayesian eukaryotic tree of life developed in this study demonstrated widely separated clades of plants, fungi, and animals. Musa acuminata provided an evolutionary link between monocotyledons and dicotyledons, and Salpingoeca rosetta provided an evolutionary link between fungi and animals, which indicating that protozoan species are close relatives of fungi and animals. The divergence times for 1176 species pairs were estimated accurately by integrating fossil information with synonymous substitution rates in the comprehensive set of 98 genes. The present study provides valuable insight into the evolution of eukaryotes.
Collapse
Affiliation(s)
- Pawan Kumar Jayaswal
- National Research Centre on Plant Biotechnology, IARI, Pusa, New Delhi, India
- Banasthali University, Banasthali, Rajasthan, India
| | - Vivek Dogra
- National Research Centre on Plant Biotechnology, IARI, Pusa, New Delhi, India
| | - Asheesh Shanker
- Bioinformatics Programme, Centre for Biological Sciences, Central University of South Bihar, Patna, Bihar, India
| | - Tilak Raj Sharma
- National Research Centre on Plant Biotechnology, IARI, Pusa, New Delhi, India
| | - Nagendra Kumar Singh
- National Research Centre on Plant Biotechnology, IARI, Pusa, New Delhi, India
- * E-mail:
| |
Collapse
|
12
|
Catania F. From intronization to intron loss: How the interplay between mRNA-associated processes can shape the architecture and the expression of eukaryotic genes. Int J Biochem Cell Biol 2017; 91:136-144. [PMID: 28673893 DOI: 10.1016/j.biocel.2017.06.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 06/25/2017] [Accepted: 06/30/2017] [Indexed: 12/29/2022]
Abstract
Transcription-coupled processes such as capping, splicing, and cleavage/polyadenylation participate in the journey from genes to proteins. Although they are traditionally thought to serve only as steps in the generation of mature mRNAs, a synthesis of available data indicates that these processes could also act as a driving force for the evolution of eukaryotic genes. A theoretical framework for how mRNA-associated processes may shape gene structure and expression has recently been proposed. Factors that promote splicing and cleavage/polyadenylation in this framework compete for access to overlapping or neighboring signals throughout the transcription cycle. These antagonistic interactions allow mechanisms for intron gain and splice site recognition as well as common trends in eukaryotic gene structure and expression to be coherently integrated. Here, I extend this framework further. Observations that largely (but not exclusively) revolve around the formation of DNA-RNA hybrid structures, called R loops, and promoter directionality are integrated. Additionally, the interplay between splicing factors and cleavage/polyadenylation factors is theorized to also affect the formation of intragenic DNA double-stranded breaks thereby contributing to intron loss. The most notable prediction in this proposition is that RNA molecules can mediate intron loss by serving as a template to repair DNA double-stranded breaks. The framework presented here leverages a vast body of empirical observations, logically extending previous suggestions, and generating verifiable predictions to further substantiate the view that the intracellular environment plays an active role in shaping the structure and the expression of eukaryotic genes.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany.
| |
Collapse
|
13
|
Nobre T, Oliveira M, Arnholdt-Schmitt B. Wild Carrot Differentiation in Europe and Selection at DcAOX1 Gene? PLoS One 2016; 11:e0164872. [PMID: 27768735 PMCID: PMC5074564 DOI: 10.1371/journal.pone.0164872] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 10/03/2016] [Indexed: 11/19/2022] Open
Abstract
By definition, the domestication process leads to an overall reduction of crop genetic diversity. This lead to the current search of genomic regions in wild crop relatives (CWR), an important task for modern carrot breeding. Nowadays massive sequencing possibilities can allow for discovery of novel genetic resources in wild populations, but this quest could be aided by the use of a surrogate gene (to first identify and prioritize novel wild populations for increased sequencing effort). Alternative oxidase (AOX) gene family seems to be linked to all kinds of abiotic and biotic stress reactions in various organisms and thus have the potential to be used in the identification of CWR hotspots of environment-adapted diversity. High variability of DcAOX1 was found in populations of wild carrot sampled across a West-European environmental gradient. Even though no direct relation was found with the analyzed climatic conditions or with physical distance, population differentiation exists and results mainly from the polymorphisms associated with DcAOX1 exon 1 and intron 1. The relatively high number of amino acid changes and the identification of several unusually variable positions (through a likelihood ratio test), suggests that DcAOX1 gene might be under positive selection. However, if positive selection is considered, it only acts on some specific populations (i.e. is in the form of adaptive differences in different population locations) given the observed high genetic diversity. We were able to identify two populations with higher levels of differentiation which are promising as hot spots of specific functional diversity.
Collapse
Affiliation(s)
- Tânia Nobre
- EU Marie Curie Chair, ICAAM - Instituto de Ciências Agrárias e Ambientais Mediterrânicas, Universidade de Évora, Évora, Portugal
| | - Manuela Oliveira
- Centro de Investigação em Matemática e Aplicações, Instituto de Investigação e Formação Avançada, Universidade de Évora, Évora, Portugal
| | - Birgit Arnholdt-Schmitt
- EU Marie Curie Chair, ICAAM - Instituto de Ciências Agrárias e Ambientais Mediterrânicas, Universidade de Évora, Évora, Portugal
| |
Collapse
|
14
|
Du X, Liu Y, Liu J, Zhang Q, Wang X. Evolution history of duplicated smad3 genes in teleost: insights from Japanese flounder, Paralichthys olivaceus. PeerJ 2016; 4:e2500. [PMID: 27703851 PMCID: PMC5045880 DOI: 10.7717/peerj.2500] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Accepted: 08/29/2016] [Indexed: 01/27/2023] Open
Abstract
Following the two rounds of whole-genome duplication (WGD) during deuterosome evolution, a third genome duplication occurred in the ray-fined fish lineage and is considered to be responsible for the teleost-specific lineage diversification and regulation mechanisms. As a receptor-regulated SMAD (R-SMAD), the function of SMAD3 was widely studied in mammals. However, limited information of its role or putative paralogs is available in ray-finned fishes. In this study, two SMAD3 paralogs were first identified in the transcriptome and genome of Japanese flounder (Paralichthys olivaceus). We also explored SMAD3 duplication in other selected species. Following identification, genomic structure, phylogenetic reconstruction, and synteny analyses performed by MrBayes and online bioinformatic tools confirmed that smad3a/3b most likely originated from the teleost-specific WGD. Additionally, selection pressure analysis and expression pattern of the two genes performed by PAML and quantitative real-time PCR (qRT-PCR) revealed evidence of subfunctionalization of the two SMAD3 paralogs in teleost. Our results indicate that two SMAD3 genes originate from teleost-specific WGD, remain transcriptionally active, and may have likely undergone subfunctionalization. This study provides novel insights to the evolution fates of smad3a/3b and draws attentions to future function analysis of SMAD3 gene family.
Collapse
Affiliation(s)
- Xinxin Du
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, China
| | - Yuezhong Liu
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, China
| | - Jinxiang Liu
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, China
| | - Quanqi Zhang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, China
| | - Xubo Wang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, China
| |
Collapse
|
15
|
França GS, Vibranovski MD, Galante PAF. Host gene constraints and genomic context impact the expression and evolution of human microRNAs. Nat Commun 2016; 7:11438. [PMID: 27109497 PMCID: PMC4848552 DOI: 10.1038/ncomms11438] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 03/25/2016] [Indexed: 12/16/2022] Open
Abstract
Increasing evidence has shown that recent miRNAs tend to emerge within coding genes. Here we conjecture that human miRNA evolution is tightly influenced by the genomic context, especially by host genes. Our findings show a preferential emergence of intragenic miRNAs within old genes. We found that miRNAs within old host genes are significantly more broadly expressed than those within young ones. Young miRNAs within old genes are more broadly expressed than their intergenic counterparts, suggesting that young miRNAs have an initial advantage by residing in old genes, and benefit from their hosts' expression control and from the exposure to diverse cellular contexts and target genes. Our results demonstrate that host genes may provide stronger expression constraints to intragenic miRNAs in the long run. We also report associated functional implications, highlighting the genomic context and host genes as driving factors for the expression and evolution of human miRNAs. Recent miRNAs tend to emerge within coding genes. Here, by analysing miRNA expression data from six species and comparing genomes from 13 species, the authors report that host genes may provide stronger expression constraints to intragenic miRNAs in the long run.
Collapse
Affiliation(s)
- Gustavo S França
- Centro de Oncologia Molecular, Hospital Sírio-Libanês, Rua Daher Cutait 69, 01308-060 São Paulo, Brazil.,Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes 748, 05508-000 São Paulo, Brazil
| | - Maria D Vibranovski
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, Rua do Matao 277, 05508-090 São Paulo, Brazil
| | - Pedro A F Galante
- Centro de Oncologia Molecular, Hospital Sírio-Libanês, Rua Daher Cutait 69, 01308-060 São Paulo, Brazil
| |
Collapse
|
16
|
Liu J, Jiang J, Wang Z, He Y, Zhang Q. Origin and evolution of GATA2a and GATA2b in teleosts: insights from tongue sole, Cynoglossus semilaevis. PeerJ 2016; 4:e1790. [PMID: 27019782 PMCID: PMC4806627 DOI: 10.7717/peerj.1790] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Accepted: 02/20/2016] [Indexed: 12/31/2022] Open
Abstract
Background. Following the two rounds of whole-genome duplication that occurred during deuterostome evolution, a third genome duplication occurred in the lineage of teleost fish and is considered to be responsible for much of the biological diversification within the lineage. GATA2, a member of GATA family of transcription factors, is an important regulator of gene expression in hematopoietic cell in mammals, yet the role of this gene or its putative paralogs in ray-finned fishes remains relatively unknown. Methods. In this study, we attempted to identify GATA2 sequences from the transcriptomes and genomes of multiple teleosts using the bioinformatic tools MrBayes, MEME, and PAML. Following identification, comparative analysis of genome structure, molecular evolution rate, and expression by real-time qPCR were used to predict functional divergence of GATA2 paralogs and their relative transcription in organs of female and male tongue soles (Cynoglossus semilaevis). Results. Two teleost GATA2 genes were identified in the transcriptomes of tongue sole and Japanese flounder (Paralichthysolivaceus). Synteny and phylogenetic analysis confirmed that the two genes likely originated from the teleost-specific genome duplication . Additionally, selection pressure analysis predicted these gene duplicates to have undergone purifying selection and possible divergent new functions. This was supported by differential expression pattern of GATA2a and GATA2b observed in organs of female and male tongue soles. Discussion. Our results indicate that two GATA2 genes originating from the first teleost-specific genome duplication have remained transcriptionally active in some fish species and have likely undergone neofunctionalization. This knowledge provides novel insights into the evolution of the teleost GATA2 genes and constituted important groundwork for further research on the GATA gene family.
Collapse
Affiliation(s)
- Jinxiang Liu
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China , Qingdao , China
| | - Jiajun Jiang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China , Qingdao , China
| | - Zhongkai Wang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China , Qingdao , China
| | - Yan He
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China , Qingdao , China
| | - Quanqi Zhang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, Ocean University of China , Qingdao , China
| |
Collapse
|
17
|
Glastad KM, Goodisman MAD, Yi SV, Hunt BG. Effects of DNA Methylation and Chromatin State on Rates of Molecular Evolution in Insects. G3 (BETHESDA, MD.) 2015; 6:357-63. [PMID: 26637432 PMCID: PMC4751555 DOI: 10.1534/g3.115.023499] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 11/30/2015] [Indexed: 01/03/2023]
Abstract
Epigenetic information is widely appreciated for its role in gene regulation in eukaryotic organisms. However, epigenetic information can also influence genome evolution. Here, we investigate the effects of epigenetic information on gene sequence evolution in two disparate insects: the fly Drosophila melanogaster, which lacks substantial DNA methylation, and the ant Camponotus floridanus, which possesses a functional DNA methylation system. We found that DNA methylation was positively correlated with the synonymous substitution rate in C. floridanus, suggesting a key effect of DNA methylation on patterns of gene evolution. However, our data suggest the link between DNA methylation and elevated rates of synonymous substitution was explained, in large part, by the targeting of DNA methylation to genes with signatures of transcriptionally active chromatin, rather than the mutational effect of DNA methylation itself. This phenomenon may be explained by an elevated mutation rate for genes residing in transcriptionally active chromatin, or by increased structural constraints on genes in inactive chromatin. This result highlights the importance of chromatin structure as the primary epigenetic driver of genome evolution in insects. Overall, our study demonstrates how different epigenetic systems contribute to variation in the rates of coding sequence evolution.
Collapse
Affiliation(s)
- Karl M Glastad
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30332
| | | | - Soojin V Yi
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30332
| | - Brendan G Hunt
- Department of Entomology, University of Georgia, Griffin, Georgia 30223
| |
Collapse
|
18
|
Zhou K, Kuo A, Grigoriev IV. Reverse transcriptase and intron number evolution. Stem Cell Investig 2014; 1:17. [PMID: 27358863 DOI: 10.3978/j.issn.2306-9759.2014.08.01] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2014] [Accepted: 08/04/2014] [Indexed: 11/14/2022]
Abstract
BACKGROUND Introns are universal in eukaryotic genomes and play important roles in transcriptional regulation, mRNA export to the cytoplasm, nonsense-mediated decay as both a regulatory and a splicing quality control mechanism, R-loop avoidance, alternative splicing, chromatin structure, and evolution by exon-shuffling. METHODS Sixteen complete fungal genomes were used 13 of which were sequenced and annotated by JGI. Ustilago maydis, Cryptococcus neoformans, and Coprinus cinereus (also named Coprinopsis cinerea) were from the Broad Institute. Gene models from JGI-annotated genomes were taken from the GeneCatalog track that contained the best representative gene models. Varying fractions of the GeneCatalog were manually curated by external users. For clarity, we used the JGI unique database identifier. RESULTS The last common ancestor of eukaryotes (LECA) has an estimated 6.4 coding exons per gene (EPG) and evolved into the diverse eukaryotic life forms, which is recapitulated by the development of a stem cell. We found a parallel between the simulated reverse transcriptase (RT)-mediated intron loss and the comparative analysis of 16 fungal genomes that spanned a wide range of intron density. Although footprints of RT (RTF) were dynamic, relative intron location (RIL) to the 5'-end of mRNA faithfully traced RT-mediated intron loss and revealed 7.7 EPG for LECA. The mode of exon length distribution was conserved in simulated intron loss, which was exemplified by the shared mode of 75 nt between fungal and Chlamydomonas genomes. The dominant ancient exon length was corroborated by the average exon length of the most intron-rich genes in fungal genomes and consistent with ancient protein modules being ~25 aa. Combined with the conservation of a protein length of 400 aa, the earliest ancestor of eukaryotes could have 16 EPG. During earlier evolution, Ascomycota's ancestor had significantly more 3'-biased RT-mediated intron loss that was followed by dramatic RTF loss. There was a down trend of EPG from more conserved to less conserved genes. Moreover, species-specific genes have higher exon-densities, shorter exons, and longer introns when compared to genes conserved at the phylum level. However, intron length in species-specific genes became shorter than that of genes conserved in all species after genomes experiencing drastic intron loss. The estimated EPG from the most frequent exon length is more than double that from the RIL method. CONCLUSIONS This implies significant intron loss during the very early period of eukaryotic evolution. De novo gene-birth contributes to shorter exons, longer introns, and higher exon-density in species-specific genes relative to conserved genes.
Collapse
Affiliation(s)
- Kemin Zhou
- 1 Computational Genomics, Bristol-Myers Squibb, 311 Pennington Rocky Hill Road, Pennington, NJ 08534, USA ; 2 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Alan Kuo
- 1 Computational Genomics, Bristol-Myers Squibb, 311 Pennington Rocky Hill Road, Pennington, NJ 08534, USA ; 2 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Igor V Grigoriev
- 1 Computational Genomics, Bristol-Myers Squibb, 311 Pennington Rocky Hill Road, Pennington, NJ 08534, USA ; 2 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| |
Collapse
|
19
|
Martínez-Quintana JA, Peregrino-Uriarte AB, Gollas-Galván T, Gómez-Jiménez S, Yepiz-Plascencia G. The glucose transporter 1 -GLUT1- from the white shrimp Litopenaeus vannamei is up-regulated during hypoxia. Mol Biol Rep 2014; 41:7885-98. [PMID: 25167855 DOI: 10.1007/s11033-014-3682-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2014] [Accepted: 08/19/2014] [Indexed: 11/29/2022]
Abstract
During hypoxia the shrimp Litopenaeus vannamei accelerates anaerobic glycolysis to obtain energy; therefore, a correct supply of glucose to the cells is needed. Facilitated glucose transport across the cells is mediated by a group of membrane embedded integral proteins called GLUT; being GLUT1 the most ubiquitous form. In this work, we report the first cDNA nucleotide and deduced amino acid sequences of a glucose transporter 1 from L. vannamei. A 1619 bp sequence was obtained by RT-PCR and RACE approaches. The 5´ UTR is 161 bp and the poly A tail is exactly after the stop codon in the mRNA. The ORF is 1485 bp and codes for 485 amino acids. The deduced protein sequence has high identity to GLUT1 proteins from several species and contains all the main features of glucose transporter proteins, including twelve transmembrane domains, the conserved motives and amino acids involved in transport activity, ligands binding and membrane anchor. Therefore, we decided to name this sequence, glucose transporter 1 of L. vannamei (LvGLUT1). A partial gene sequence of 8.87 Kbp was also obtained; it contains the complete coding sequence divided in 10 exons. LvGlut1 expression was detected in hemocytes, hepatopancreas, intestine gills, muscle and pleopods. The higher relative expression was found in gills and the lower in hemocytes. This indicates that LvGlut1 is ubiquitously expressed but its levels are tissue-specific and upon short-term hypoxia, the GLUT1 transcripts increase 3.7-fold in hepatopancreas and gills. To our knowledge, this is the first evidence of expression of GLUT1 in crustaceans.
Collapse
Affiliation(s)
- José A Martínez-Quintana
- Centro de Investigación en Alimentación y Desarrollo. A.C, P.O. Box 1735, Carretera a Ejido La Victoria Km. 0.6, 83304, Hermosillo, Sonora, Mexico
| | | | | | | | | |
Collapse
|
20
|
Li W, Kuzoff R, Wong CK, Tucker A, Lynch M. Characterization of newly gained introns in Daphnia populations. Genome Biol Evol 2014; 6:2218-34. [PMID: 25123113 PMCID: PMC4202315 DOI: 10.1093/gbe/evu174] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
As one of the few known species in an active phase of intron proliferation, the microcrustacean Daphnia pulex is an especially attractive system for interrogating the gain and loss of introns in natural populations. In this study, we used a comparative population-genomic approach to identify and characterize 90 recently gained introns in this species. Molecular clock analyses indicate that these introns arose between 3.9 × 10(5) and 1.45 × 10(4) years ago, with a spike in intron proliferation approximately 5.2 × 10(4) to 1.22 × 10(5) years ago. Parallel gains at homologous positions contribute to 47.8% (43/90) of discovered new introns. A disproportionally large number of new introns were found in historically isolated populations in Oregon. Nonetheless, derived, intron-bearing alleles were also identified in a wide range of geographic locations, suggesting intron gain and, to a lesser degree, intron loss are important sources of genetic variation in natural populations of Daphnia. A majority (55/90 or 61.1%) of the identified neointrons have associated internal direct repeats with lengths and compositions that are unlikely to occur by chance, suggesting repeated bouts of staggered double-strand breaks (DSBs) during their evolution. Accordingly, internal, staggered DSBs may contribute to a passive trend toward increased length and sequence diversity in nascent introns.
Collapse
Affiliation(s)
- Wenli Li
- Department of Pediatrics, Section of Genomic Pediatrics, Medical College of Wisconsin
| | - Robert Kuzoff
- Department of Biology, University of Wisconsin-Whitewater
| | - Chen Khuan Wong
- Genetics and Genomics Program, Department of Medicine, Boston University
| | | | - Michael Lynch
- Department of Biology, Indiana University, Bloomington
| |
Collapse
|
21
|
Guo Z, Jiang W, Lages N, Borcherds W, Wang D. Relationship between gene duplicability and diversifiability in the topology of biochemical networks. BMC Genomics 2014; 15:577. [PMID: 25005725 PMCID: PMC4129122 DOI: 10.1186/1471-2164-15-577] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2014] [Accepted: 06/26/2014] [Indexed: 01/21/2023] Open
Abstract
Background Selective gene duplicability, the extensive expansion of a small number of gene families, is universal. Quantitatively, the number of genes (P(K)) with K duplicates in a genome decreases precipitously as K increases, and often follows a power law (P(k)∝k-α). Functional diversification, either neo- or sub-functionalization, is a major evolution route for duplicate genes. Results Using three lines of genomic datasets, we studied the relationship between gene duplicability and diversifiability in the topology of biochemical networks. First, we explored scenario where two pathways in the biochemical networks antagonize each other. Synthetic knockout of respective genes for the two pathways rescues the phenotypic defects of each individual knockout. We identified duplicate gene pairs with sufficient divergences that represent this antagonism relationship in the yeast S. cerevisiae. Such pairs overwhelmingly belong to large gene families, thus tend to have high duplicability. Second, we used distances between proteins of duplicate genes in the protein interaction network as a metric of their diversification. The higher a gene’s duplicate count, the further the proteins of this gene and its duplicates drift away from one another in the networks, which is especially true for genetically antagonizing duplicate genes. Third, we computed a sequence-homology-based clustering coefficient to quantify sequence diversifiability among duplicate genes – the lower the coefficient, the more the sequences have diverged. Duplicate count (K) of a gene is negatively correlated to the clustering coefficient of its duplicates, suggesting that gene duplicability is related to the extent of sequence divergence within the duplicate gene family. Conclusion Thus, a positive correlation exists between gene diversifiability and duplicability in the context of biochemical networks – an improvement of our understanding of gene duplicability.
Collapse
Affiliation(s)
| | | | | | | | - Degeng Wang
- Greehey Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio, 8403 Floyd Curl Drive, San Antonio, TX 78229-3900, USA.
| |
Collapse
|
22
|
Gorlova O, Fedorov A, Logothetis C, Amos C, Gorlov I. Genes with a large intronic burden show greater evolutionary conservation on the protein level. BMC Evol Biol 2014; 14:50. [PMID: 24629165 PMCID: PMC3995522 DOI: 10.1186/1471-2148-14-50] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Accepted: 03/11/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The existence of introns in eukaryotic genes is believed to provide an evolutionary advantage by increasing protein diversity through exon shuffling and alternative splicing. However, this eukaryotic feature is associated with the necessity of exclusion of intronic sequences, which requires considerable energy expenditure and can lead to splicing errors. The relationship between intronic burden and evolution is poorly understood. The goal of this study was to analyze the relationship between the intronic burden and the level of evolutionary conservation of the gene. RESULTS We found a positive correlation between the level of evolutionary conservation of a gene and its intronic burden. The level of evolutionary conservation was estimated using the conservation index (CI). The CI value was determined on the basis of the most distant ortholog of the human protein sequence and ranged from 0 (the gene was unique to the human genome) to 9 (an ortholog of the human gene was detected in plants). In multivariable model, both the number of introns and total intron size remained significant predictors of CI. We also found that the number of alternative splice variants was positively correlated with CI.The expression level of a gene was negatively correlated with the number of introns and total size of intronic region. Genes with a greater intronic burden had lower density of missense and nonsense mutations in the coding regions of the gene, which suggests that they are under a stronger pressure from purifying selection. CONCLUSIONS We identified a positive association between intronic burden and CI. One of the possible explanations of this is the idea of a cost-benefits balance. Evolutionarily conserved (functionally important) genes can "afford" the negative consequences of maintaining multiple introns because these consequences are outweighed by the benefit of maintaining the gene. Evolutionarily conserved and functionally important genes may use introns to create novel splice variants to tune the gene function to developmental stage and tissue type.
Collapse
Affiliation(s)
| | | | | | | | - Ivan Gorlov
- Department of Community and Family Medicine, Geisel School of Medicine, Dartmouth College, Lebanon 03766, NH, USA.
| |
Collapse
|
23
|
Frequency of intron loss correlates with processed pseudogene abundance: a novel strategy to test the reverse transcriptase model of intron loss. BMC Biol 2013; 11:23. [PMID: 23497167 PMCID: PMC3652778 DOI: 10.1186/1741-7007-11-23] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2013] [Accepted: 03/05/2013] [Indexed: 11/23/2022] Open
Abstract
Background Although intron loss in evolution has been described, the mechanism involved is still unclear. Three models have been proposed, the reverse transcriptase (RT) model, genomic deletion model and double-strand-break repair model. The RT model, also termed mRNA-mediated intron loss, suggests that cDNA molecules reverse transcribed from spliced mRNA recombine with genomic DNA causing intron loss. Many studies have attempted to test this model based on its predictions, such as simultaneous loss of adjacent introns, 3'-side bias of intron loss, and germline expression of intron-lost genes. Evidence either supporting or opposing the model has been reported. The mechanism of intron loss proposed in the RT model shares the process of reverse transcription with the formation of processed pseudogenes. If the RT model is correct, genes that have produced more processed pseudogenes are more likely to undergo intron loss. Results In the present study, we observed that the frequency of intron loss is correlated with processed pseudogene abundance by analyzing a new dataset of intron loss obtained in mice and rats. Furthermore, we found that mRNA molecules of intron-lost genes are mostly translated on free cytoplasmic ribosomes, a feature shared by mRNA molecules of the parental genes of processed pseudogenes and long interspersed elements. This feature is likely convenient for intron-lost gene mRNA molecules to be reverse transcribed. Analyses of adjacent intron loss, 3'-side bias of intron loss, and germline expression of intron-lost genes also support the RT model. Conclusions Compared with previous evidence, the correlation between the abundance of processed pseudogenes and intron loss frequency more directly supports the RT model of intron loss. Exploring such a correlation is a new strategy to test the RT model in organisms with abundant processed pseudogenes.
Collapse
|
24
|
Hill N, Leow A, Bleidorn C, Groth D, Tiedemann R, Selbig J, Hartmann S. Analysis of phylogenetic signal in protostomial intron patterns using Mutual Information. Theory Biosci 2012; 132:93-104. [DOI: 10.1007/s12064-012-0173-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2012] [Accepted: 11/30/2012] [Indexed: 11/29/2022]
|
25
|
Yenerall P, Zhou L. Identifying the mechanisms of intron gain: progress and trends. Biol Direct 2012; 7:29. [PMID: 22963364 PMCID: PMC3443670 DOI: 10.1186/1745-6150-7-29] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Accepted: 08/22/2012] [Indexed: 12/22/2022] Open
Abstract
Abstract Continued improvements in Next-Generation DNA/RNA sequencing coupled with advances in gene annotation have provided researchers access to a plethora of annotated genomes. Subsequent analyses of orthologous gene structures have identified numerous intron gain and loss events that have occurred both recently and in the very distant past. This research has afforded exceptional insight into the temporal and lineage-specific rates of intron gain and loss among various species throughout evolution. Numerous studies have also attempted to identify the molecular mechanisms of intron gain and loss. However, even after considerable effort, very little is known about these processes. In particular, the mechanism(s) of intron gain have proven exceptionally enigmatic and remain topics of considerable debate. Currently, there exists no definitive consensus as to what mechanism(s) may generate introns. Because many introns are known to affect gene expression, it is necessary to understand the molecular process(es) by which introns may be gained. Here we review the seven most commonly purported mechanisms of intron gain and, when possible, summarize molecular evidence for or against the occurrence of each of these mechanisms. Furthermore, we catalogue indirect evidence that supports the occurrence of each mechanism. Finally, because these proposed mechanisms fail to explain the mechanistic origin of many recently gained introns, we also look at trends that may aid researchers in identifying other potential mechanism(s) of intron gain. Reviewers This article was reviewed by Eugene Koonin, Scott Roy (nominated by W. Ford Doolittle), and John Logsdon.
Collapse
Affiliation(s)
- Paul Yenerall
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | |
Collapse
|
26
|
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 217] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg, 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
27
|
Abstract
The intron–exon architecture of many eukaryotic genes raises the intriguing question of whether this unique organization serves any function, or is it simply a result of the spread of functionless introns in eukaryotic genomes. In this review, we show that introns in contemporary species fulfill a broad spectrum of functions, and are involved in virtually every step of mRNA processing. We propose that this great diversity of intronic functions supports the notion that introns were indeed selfish elements in early eukaryotes, but then independently gained numerous functions in different eukaryotic lineages. We suggest a novel criterion of evolutionary conservation, dubbed intron positional conservation, which can identify functional introns.
Collapse
Affiliation(s)
- Michal Chorev
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem Jerusalem, Israel
| | | |
Collapse
|
28
|
Fawcett JA, Rouzé P, Van de Peer Y. Higher intron loss rate in Arabidopsis thaliana than A. lyrata is consistent with stronger selection for a smaller genome. Mol Biol Evol 2011; 29:849-59. [PMID: 21998273 DOI: 10.1093/molbev/msr254] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The number of introns varies considerably among different organisms. This can be explained by the differences in the rates of intron gain and loss. Two factors that are likely to influence these rates are selection for or against introns and the mutation rate that generates the novel intron or the intronless copy. Although it has been speculated that stronger selection for a compact genome might result in a higher rate of intron loss and a lower rate of intron gain, clear evidence is lacking, and the role of selection in determining these rates has not been established. Here, we studied the gain and loss of introns in the two closely related species Arabidopsis thaliana and A. lyrata as it was recently shown that A. thaliana has been undergoing a faster genome reduction driven by selection. We found that A. thaliana has lost six times more introns than A. lyrata since the divergence of the two species but gained very few introns. We suggest that stronger selection for genome reduction probably resulted in the much higher intron loss rate in A. thaliana, although further analysis is required as we could not find evidence that the loss rate increased in A. thaliana as opposed to having decreased in A. lyrata compared with the rate in the common ancestor. We also examined the pattern of the intron gains and losses to better understand the mechanisms by which they occur. Microsimilarity was detected between the splice sites of several gained and lost introns, suggesting that nonhomologous end joining repair of double-strand breaks might be a common pathway not only for intron gain but also for intron loss.
Collapse
|
29
|
Park J, Xu K, Park T, Yi SV. What are the determinants of gene expression levels and breadths in the human genome? Hum Mol Genet 2011; 21:46-56. [PMID: 21945885 PMCID: PMC3235009 DOI: 10.1093/hmg/ddr436] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
In complex organisms, different tissues express different genes, which ultimately shape the function and phenotype of each tissue. An important goal of modern biology is to understand how some genes are turned on and off in specific tissues and how the numbers of different gene expression products are determined. These aspects are named ‘expression breadth’ (or ‘tissue specificity’) and ‘expression level’, respectively. Here, we show that we can predict substantial amount of variation in levels and breadths of gene expression using genomic information of each gene. Interestingly, many genomic traits are correlated with both aspects of gene expression in similar directions, suggesting shared molecular pathways. However, to elucidate distinctive molecular mechanisms governing gene expression levels and breadths, we need to identify the relative significance of each genomic trait on these two aspects of gene expression. To this end, we developed a novel multivariate multiple regression method. Using this new method, we show that gene compactness (in particular, the mean size of exons), codon usage bias and non-synonymous rates have a stronger influence on expression levels compared with their effects on expression breadths. In contrast, the propensity of promoter DNA methylation is a stronger indicator of expression breadths than of expression levels. Interestingly, intron DNA methylation exhibits an opposite pattern to the promoter DNA methylation in the human genome, suggesting that DNA methylation may play multiple roles depending upon its genomic targets. Furthermore, synonymous rates have stronger associations with expression breadths than with expression levels in the human genome. These findings provide clues toward distinctive molecular mechanisms regulating different aspects of gene expression.
Collapse
Affiliation(s)
- Jungsun Park
- Bioinformatics and Biostatistics Laboratory, Department of Statistics, Seoul National University, Seoul 151-742, Korea
| | | | | | | |
Collapse
|
30
|
Cohen NE, Shen R, Carmel L. The role of reverse transcriptase in intron gain and loss mechanisms. Mol Biol Evol 2011; 29:179-86. [PMID: 21804076 DOI: 10.1093/molbev/msr192] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Intron density is highly variable across eukaryotic species. It seems that different lineages have experienced considerably different levels of intron gain and loss events, but the reasons for this are not well known. A large number of mechanisms for intron loss and gain have been suggested, and most of them have at least some level of indirect support. We therefore figured out that the variability in intron density can be a reflection of the fact that different mechanisms are active in different lineages. Quite a number of these putative mechanisms, both for intron loss and for intron gain, postulate that the enzyme reverse transcriptase (RT) has a key role in the process. In this paper, we lay out three predictions whose approval or falsification gives indication for the involvement of RT in intron gain and loss processes. Testing these predictions requires data on the intron gain and loss rates of individual genes along different branches of the eukaryotic phylogenetic tree. So far, such rates could not be computed, and hence, these predictions could not be rigorously evaluated. Here, we use a maximum likelihood algorithm that we have devised in the past, Evolutionary Reconstruction by Expectation Maximization, which allows the estimation of such rates. Using this algorithm, we computed the intron loss and gain rates of more than 300 genes in each branch of the phylogenetic tree of 19 eukaryotic species. Based on that we found only little support for RT activity in intron gain. In contrast, we suggest that RT-mediated intron loss is a mechanism that is very efficient in removing introns, and thus, its levels of activity may be a major determinant of intron number. Moreover, we found that intron gain and loss rates are negatively correlated in intron-poor species but are positively correlated for intron-rich species. One explanation to this is that intron gain and loss mechanisms in intron-rich species (like metazoans) share a common mechanistic component, albeit not a RT.
Collapse
Affiliation(s)
- Noa E Cohen
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | | | |
Collapse
|
31
|
Khan N, Pande V, Das A. Characterization, comparative genomics, and evolutionary inferences of a human drug metabolizing (NAT2) gene. Interv Med Appl Sci 2011. [DOI: 10.1556/imas.3.2011.2.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Abstract
Aim
The present-day genetic architecture of a species bears much significance to its closely related species. In recent availability of whole genome sequence data for closely related species, it is possible to detect genetic similarities/differences in specific lineages and infer the role of evolutionary forces in bringing such similarities/differences. In this respect, NAT2 gene, responsible for drug metabolism, is conserved across a few taxa and, thus, comparative genomic studies could be useful for better pharmacogenetic realization.
Methods
DNA sequences of human NAT2 gene were retrieved from NCBI and characterized. Comparative and evolutionary analyses were performed with sequences from four mammalian taxa and one avian taxon with different statistical algorithms.
Results
The observed genetic architecture of NAT2 gene was different across the taxa. Phylogenetic inferences revealed that human and chimpanzee are diverged recently and fowl was found to be diverged from rest of the taxa significantly. Also, gene length, microsatellites, Ka/Ks, secondary structure, and distribution of CpG islands were observed across taxa.
Conclusions
The detail architecture of NAT2 gene and its evolutionary history in different taxa show relationships with other taxa. Future population-based study in NAT2 would unravel the correlation between nucleotide changes and differential ability of drug metabolization in humans.
Collapse
Affiliation(s)
- Naazneen Khan
- 1 Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Dwarka, New Delhi, India
| | - Veena Pande
- 2 Department of Biotechnology, Kumaun University, Nainital, Uttarakhand, India
| | - Aparup Das
- 1 Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Dwarka, New Delhi, India
- 3 Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Sector-8, Dwarka, New Delhi, 110077, India
| |
Collapse
|
32
|
Niu DK, Yang YF. Why eukaryotic cells use introns to enhance gene expression: splicing reduces transcription-associated mutagenesis by inhibiting topoisomerase I cutting activity. Biol Direct 2011; 6:24. [PMID: 21592350 PMCID: PMC3118952 DOI: 10.1186/1745-6150-6-24] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2011] [Accepted: 05/18/2011] [Indexed: 11/10/2022] Open
Abstract
Background The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is its known enhancement of gene expression. However, the mechanism regulating such splicing-mediated expression enhancement has not been defined. Previous studies have shown that intron splicing is a time-consuming process, indicating that splicing may not reduce the time required for transcription and processing of spliced pre-mRNA molecules; rather, it might facilitate the later rounds of transcription. Because the densities of active RNA polymerase II on most genes are less than one molecule per gene, direct interactions between the splicing apparatus and transcriptional complexes (from the later rounds of transcription) are infrequent, and thus unlikely to account for splicing-mediated gene expression enhancement. Presentation of the hypothesis The serine/arginine-rich protein SF2/ASF can inhibit the DNA topoisomerase I activity that removes negative supercoiling of DNA generated by transcription. Consequently, splicing could make genes more receptive to RNA polymerase II during the later rounds of transcription, and thus affect the frequency of gene transcription. Compared with the transcriptional enhancement mediated by strong promoters, intron-containing genes experience a lower frequency of cut-and-paste processes. The cleavage and religation activity of DNA strands by DNA topoisomerase I was recently shown to account for transcription-associated mutagenesis. Therefore, intron-mediated enhancement of gene expression could reduce transcription-associated genome instability. Testing the hypothesis Experimentally test whether transcription-associated mutagenesis is lower in intron-containing genes than in intronless genes. Use bioinformatic analysis to check whether exons flanking lost introns have higher frequencies of short deletions. Implications of the hypothesis The mechanism of intron-mediated enhancement proposed here may also explain the positive correlation observed between intron size and gene expression levels in unicellular organisms, and the greater number of intron containing genes in higher organisms. Reviewers This article was reviewed by Dr Arcady Mushegian, Dr Igor B Rogozin (nominated by Dr I King Jordan) and Dr Alexey S Kondrashov. For the full reviews, please go to the Reviewer's Reports section.
Collapse
Affiliation(s)
- Deng-Ke Niu
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing 100875, China.
| | | |
Collapse
|
33
|
DNA double-strand break repair and the evolution of intron density. Trends Genet 2010; 27:1-6. [PMID: 21106271 PMCID: PMC3020277 DOI: 10.1016/j.tig.2010.10.004] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2010] [Revised: 10/18/2010] [Accepted: 10/18/2010] [Indexed: 01/23/2023]
Abstract
The density of introns is both an important feature of genome architecture and a highly variable trait across eukaryotes. This heterogeneity has posed an evolutionary puzzle for the last 30 years. Recent evidence is consistent with novel introns being the outcome of the error-prone repair of DNA double-stranded breaks (DSBs) via non-homologous end joining (NHEJ). Here we suggest that deletion of pre-existing introns could occur via the same pathway. We propose a novel framework in which species-specific differences in the activity of NHEJ and homologous recombination (HR) during the repair of DSBs underlie changes in intron density.
Collapse
|
34
|
Ahrazem O, Trapero A, Gómez MD, Rubio-Moraga A, Gómez-Gómez L. Genomic analysis and gene structure of the plant carotenoid dioxygenase 4 family: A deeper study in Crocus sativus and its allies. Genomics 2010; 96:239-50. [DOI: 10.1016/j.ygeno.2010.07.003] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Revised: 07/04/2010] [Accepted: 07/07/2010] [Indexed: 10/19/2022]
|
35
|
Andreakis N, D'Aniello S, Albalat R, Patti FP, Garcia-Fernàndez J, Procaccini G, Sordino P, Palumbo A. Evolution of the nitric oxide synthase family in metazoans. Mol Biol Evol 2010; 28:163-79. [PMID: 20639231 DOI: 10.1093/molbev/msq179] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Nitric oxide (NO) is essential to many physiological functions and operates in several signaling pathways. It is not understood how and when the different isoforms of nitric oxide synthase (NOS), the enzyme responsible for NO production, evolved in metazoans. This study investigates the number and structure of metazoan NOS enzymes by genome data mining and direct cloning of Nos genes from the lamprey. In total, 181 NOS proteins are analyzed from 33 invertebrate and 63 vertebrate species. Comparisons among protein and gene structures, combined with phylogenetic and syntenic studies, provide novel insights into how NOS isoforms arose and diverged. Protein domains and gene organization--that is, intron positions and phases--of animal NOS are remarkably conserved across all lineages, even in fast-evolving species. Phylogenetic and syntenic analyses support the view that a proto-NOS isoform was recurrently duplicated in different lineages, acquiring new structural configurations through gains and losses of protein motifs. We propose that in vertebrates a first duplication took place after the agnathan-gnathostome split followed by a paralog loss. A second duplication occurred during early tetrapod evolution, giving rise to the three isoforms--I, II, and III--in current mammals. Overall, NOS family evolution was the result of multiple gene and genome duplication events together with changes in protein architecture.
Collapse
Affiliation(s)
- Nikos Andreakis
- Australian Institute of Marine Science, Townsville, Queensland, Australia
| | | | | | | | | | | | | | | |
Collapse
|
36
|
Rao YS, Wang ZF, Chai XW, Wu GZ, Zhou M, Nie QH, Zhang XQ. Selection for the compactness of highly expressed genes in Gallus gallus. Biol Direct 2010; 5:35. [PMID: 20465857 PMCID: PMC2883972 DOI: 10.1186/1745-6150-5-35] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2009] [Accepted: 05/14/2010] [Indexed: 11/10/2022] Open
Abstract
Background Coding sequence (CDS) length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. Melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that there are negative relationships between expression level and gene size, CDS length as well as intron length. Different models such as selection for economy model, genomic design model, and mutational bias hypotheses have been proposed to explain such observation. The debate of which model is a superior one to explain the observation has not been settled down. The chicken (Gallus gallus) is an important model organism that bridges the evolutionary gap between mammals and other vertebrates. As D. Melanogaster, chicken has a larger effective population size, selection for chicken genome is expected to be more effective in increasing protein synthesis efficiency. Therefore, in this study the chicken was used as a model organism to elucidate the interaction between gene features and expression pattern upon selection pressure. Results Based on different technologies, we gathered expression data for nuclear protein coding, single-splicing genes from Gallus gallus genome and compared them with gene parameters. We found that gene size, CDS length, first intron length, average intron length, and total intron length are negatively correlated with expression level and expression breadth significantly. The tissue specificity is positively correlated with the first intron length but negatively correlated with the average intron length, and not correlated with the CDS length and protein domain numbers. Comparison analyses showed that ubiquitously expressed genes and narrowly expressed genes with the similar expression levels do not differ in compactness. Our data provided evidence that the genomic design model can not, at least in part, explain our observations. We grouped all somatic-tissue-specific genes (n = 1105), and compared the first intron length and the average intron length between highly expressed genes (top 5% expressed genes) and weakly expressed genes (bottom 5% expressed genes). We found that the first intron length and the average intron length in highly expressed genes are not different from that in weakly expressed genes. We also made a comparison between ubiquitously expressed genes and narrowly expressed somatic genes with similar expression levels. Our data demonstrated that ubiquitously expressed genes are less compact than narrowly expressed genes with the similar expression levels. Obviously, these observations can not be explained by mutational bias hypotheses either. We also found that the significant trend between genes' compactness and expression level could not be affected by local mutational biases. We argued that the selection of economy model is most likely one to explain the relationship between gene expression and gene characteristics in chicken genome. Conclusion Natural selection appears to favor the compactness of highly expressed genes in chicken genome. This observation can be explained by the selection of economy model. Reviewers This article was reviewed by Dr. Gavin Huttley, Dr. Liran Carmel (nominated by Dr. Eugene V. Koonin) and Dr. Araxi Urrutia (nominated by Dr. Laurence D. Hurst).
Collapse
Affiliation(s)
- You S Rao
- Department of Biological Technology, Jiangxi Educational Institute, Nanchang, Jiangxi, China
| | | | | | | | | | | | | |
Collapse
|
37
|
EREM: Parameter Estimation and Ancestral Reconstruction by Expectation-Maximization Algorithm for a Probabilistic Model of Genomic Binary Characters Evolution. Adv Bioinformatics 2010:167408. [PMID: 20467467 PMCID: PMC2866244 DOI: 10.1155/2010/167408] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2009] [Accepted: 03/02/2010] [Indexed: 11/19/2022] Open
Abstract
Evolutionary binary characters are features of species or genes, indicating the absence (value zero) or presence (value one) of some property. Examples include eukaryotic gene architecture (the presence or absence of an intron in a particular locus), gene content, and morphological characters. In many studies, the acquisition of such binary characters is assumed to represent a rare evolutionary event, and consequently, their evolution is analyzed using various flavors of parsimony. However, when gain and loss of the character are not rare enough, a probabilistic analysis becomes essential. Here, we present a comprehensive probabilistic model to describe the evolution of binary characters on a bifurcating phylogenetic tree. A fast software tool, EREM, is provided, using maximum likelihood to estimate the parameters of the model and to reconstruct ancestral states (presence and absence in internal nodes) and events (gain and loss events along branches).
Collapse
|
38
|
Abstract
Rates and mechanisms of intron gain and loss have traditionally been inferred from alignments of highly conserved genes sampled from phylogenetically distant taxa. We report a population-genomic approach that detected 24 discordant intron/exon boundaries between the whole-genome sequences of two Daphnia pulex isolates. Sequencing of presence/absence loci across a collection of D. pulex isolates and outgroup Daphnia species shows that most polymorphisms are a consequence of recent gains, with parallel gains often occurring at the same locations in independent allelic lineages. More than half of the recent gains are associated with short sequence repeats, suggesting an origin via repair of staggered double-strand breaks. By comparing the allele-frequency spectrum of intron-gain alleles with that for derived single-base substitutions, we also provide evidence that newly arisen introns are intrinsically deleterious and tend to accumulate in population-genetic settings where random genetic drift is a relatively strong force.
Collapse
Affiliation(s)
- Wenli Li
- Biology Department, Indiana University, Bloomington, IN 47405, USA
| | | | - Way Sung
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH 03824, USA
| | - W. Kelley Thomas
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH 03824, USA
| | - Michael Lynch
- Biology Department, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
39
|
An Overview of the Introns-First Theory. J Mol Evol 2009; 69:527-40. [DOI: 10.1007/s00239-009-9279-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2009] [Accepted: 09/08/2009] [Indexed: 10/20/2022]
|
40
|
Carmel L, Koonin EV. A universal nonmonotonic relationship between gene compactness and expression levels in multicellular eukaryotes. Genome Biol Evol 2009; 1:382-90. [PMID: 20333206 PMCID: PMC2817431 DOI: 10.1093/gbe/evp038] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2009] [Indexed: 01/21/2023] Open
Abstract
Analysis of gene architecture and expression levels of four organisms, Homo sapiens, Caenorhabditis elegans, Drosophila melanogaster, and Arabidopsis thaliana, reveals a surprising, nonmonotonic, universal relationship between expression level and gene compactness. With increasing expression level, the genes tend at first to become longer but, from a certain level of expression, they become more and more compact, resulting in an approximate bell-shaped dependence. There are two leading hypotheses to explain the compactness of highly expressed genes. The selection hypothesis predicts that gene compactness is predominantly driven by the level of expression, whereas the genomic design hypothesis predicts that expression breadth across tissues is the driving force. We observed the connection between gene expression breadth in humans and gene compactness to be significantly weaker than the connection between expression level and compactness, a result that is compatible with the selection hypothesis but not the genome design hypothesis. The initial gene elongation with increasing expression level could be explained, at least in part, by accumulation of regulatory elements enhancing expression, in particular, in introns. This explanation is compatible with the observed positive correlation between intron density and expression level of a gene. Conversely, the trend toward increasing compactness for highly expressed genes could be caused by selection for minimization of energy and time expenditure during transcription and splicing and for increased fidelity of transcription, splicing, and/or translation that is likely to be particularly critical for highly expressed genes. Regardless of the exact nature of the forces that shape the gene architecture, we present evidence that, at least, in animals, coding and noncoding parts of genes show similar architectonic trends.
Collapse
Affiliation(s)
- Liran Carmel
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| | | |
Collapse
|
41
|
Patrushev LI, Minkevich IG. The problem of the eukaryotic genome size. BIOCHEMISTRY (MOSCOW) 2009; 73:1519-52. [PMID: 19216716 DOI: 10.1134/s0006297908130117] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The current state of knowledge concerning the unsolved problem of the huge interspecific eukaryotic genome size variations not correlating with the species phenotypic complexity (C-value enigma also known as C-value paradox) is reviewed. Characteristic features of eukaryotic genome structure and molecular mechanisms that are the basis of genome size changes are examined in connection with the C-value enigma. It is emphasized that endogenous mutagens, including reactive oxygen species, create a constant nuclear environment where any genome evolves. An original quantitative model and general conception are proposed to explain the C-value enigma. In accordance with the theory, the noncoding sequences of the eukaryotic genome provide genes with global and differential protection against chemical mutagens and (in addition to the anti-mutagenesis and DNA repair systems) form a new, third system that protects eukaryotic genetic information. The joint action of these systems controls the spontaneous mutation rate in coding sequences of the eukaryotic genome. It is hypothesized that the genome size is inversely proportional to functional efficiency of the anti-mutagenesis and/or DNA repair systems in a particular biological species. In this connection, a model of eukaryotic genome evolution is proposed.
Collapse
Affiliation(s)
- L I Patrushev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia.
| | | |
Collapse
|
42
|
The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages. Proc Natl Acad Sci U S A 2009; 106:7273-80. [PMID: 19351897 DOI: 10.1073/pnas.0901808106] [Citation(s) in RCA: 167] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The evolutionary rates of protein-coding genes in an organism span, approximately, 3 orders of magnitude and show a universal, approximately log-normal distribution in a broad variety of species from prokaryotes to mammals. This universal distribution implies a steady-state process, with identical distributions of evolutionary rates among genes that are gained and genes that are lost. A mathematical model of such process is developed under the single assumption of the constancy of the distributions of the propensities for gene loss (PGL). This model predicts that genes of different ages, that is, genes with homologs detectable at different phylogenetic depths, substantially differ in those variables that correlate with PGL. We computationally partition protein-coding genes from humans, flies, and Aspergillus fungus into age classes, and show that genes of different ages retain the universal log-normal distribution of evolutionary rates, with a shift toward higher rates in "younger" classes but also with a substantial overlap. The only exception involves human primate-specific genes that show a heavy tail of rapidly evolving genes, probably owing to gene annotation artifacts. As predicted, the gene age classes differ in characteristics correlated with PGL. Compared with "young" genes (e.g., mammal-specific human ones), "old" genes (e.g., eukaryote-specific), on average, are longer, are expressed at a higher level, possess a higher intron density, evolve slower on the short time scale, and are subject to stronger purifying selection. Thus, genome evolution fits a simple model with approximately uniform rates of gene gain and loss, without major bursts of genomic innovation.
Collapse
|
43
|
De Luca F, Di Vito M, Fanelli E, Reyes A, Greco N, De Giorgi C. Characterization of the heat shock protein 90 gene in the plant parasitic nematode Meloidogyne artiellia and its expression as related to different developmental stages and temperature. Gene 2009; 440:16-22. [PMID: 19348876 DOI: 10.1016/j.gene.2009.03.020] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2009] [Revised: 03/27/2009] [Accepted: 03/30/2009] [Indexed: 10/20/2022]
Abstract
The full-length cDNA and the corresponding gene of the heat shock protein 90, Mt-Hsp90, were isolated and characterized in the plant parasitic nematode Meloidogyne artiellia. The full-length Mt-Hsp90 cDNA contained a 5' untranslated region (UTR) of 45 bp with the 22 bp trans-spliced leader SL1, an ORF of 2172 bp encoding a polypeptide of 723 amino acids and a 3' UTR of 191 bp. The deduced amino acid sequence of Mt-hsp90 showed high similarity with other known Hsp90s. Five conserved amino acid signatures indicated that Mt-hsp90 is a cytosolic member of the Hsp90 family. The gene consists of 10 exons and 9 introns, a more expanded gene structure compared to the corresponding Caenorhabditis elegans gene, daf-21. Mt-hsp90 gene was constitutively expressed at high levels in all developmental stages of M. artiellia. Egg masses and second stage juveniles (J2s) were exposed at 5 degrees and 30 degrees C for different periods of times in order to explore the impact of adverse temperature on Mt-hsp90 gene expression. Expression levels of Mt-hsp90 were examined by fluorescent real-time PCR. At 30 degrees C a burst of expression for Mt-hsp90 was observed in J2s after 2 h of heat shock treatment, then expression dropped with longer exposing times, although remaining still relatively high after 24 h. This temperature did not affect Mt-hsp90 gene expression in the egg masses. However, egg masses exposed at 5 degrees C showed a little but gradual increase in the mRNA level with time. By contrast, no significant changes in the Mt-hsp90 level were observed in J2s exposed to cold. These data show that egg masses and J2s exposed to cold and heat stresses have different expression profiles suggesting that Mt-Hsp90 may provide a link between environmental conditions and the life cycle of the nematode.
Collapse
Affiliation(s)
- Francesca De Luca
- Istituto per la Protezione delle Piante, CNR, Via Amendola 122/D, 70126 Bari, Italy.
| | | | | | | | | | | |
Collapse
|
44
|
Sm/Lsm genes provide a glimpse into the early evolution of the spliceosome. PLoS Comput Biol 2009; 5:e1000315. [PMID: 19282982 PMCID: PMC2650416 DOI: 10.1371/journal.pcbi.1000315] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2008] [Accepted: 02/03/2009] [Indexed: 11/19/2022] Open
Abstract
The spliceosome, a sophisticated molecular machine involved in the removal of intervening sequences from the coding sections of eukaryotic genes, appeared and subsequently evolved rapidly during the early stages of eukaryotic evolution. The last eukaryotic common ancestor (LECA) had both complex spliceosomal machinery and some spliceosomal introns, yet little is known about the early stages of evolution of the spliceosomal apparatus. The Sm/Lsm family of proteins has been suggested as one of the earliest components of the emerging spliceosome and hence provides a first in-depth glimpse into the evolving spliceosomal apparatus. An analysis of 335 Sm and Sm-like genes from 80 species across all three kingdoms of life reveals two significant observations. First, the eukaryotic Sm/Lsm family underwent two rapid waves of duplication with subsequent divergence resulting in 14 distinct genes. Each wave resulted in a more sophisticated spliceosome, reflecting a possible jump in the complexity of the evolving eukaryotic cell. Second, an unusually high degree of conservation in intron positions is observed within individual orthologous Sm/Lsm genes and between some of the Sm/Lsm paralogs. This suggests that functional spliceosomal introns existed before the emergence of the complete Sm/Lsm family of proteins; hence, spliceosomal machinery with considerably fewer components than today's spliceosome was already functional. The spliceosome is a complex molecular machine that removes intervening sequences (introns) from mRNAs. It is unique to eukaryotes. Although prokaryotes have self-splicing introns, they completely lack spliceosomal introns and the spliceosome itself. Yet even the simplest eukaryotic organisms have introns and a rather complex spliceosomal apparatus. Little is known about how this amazing machine rapidly evolved in early eukaryotes. Here, we attempt to reconstruct a part of this evolutionary process using one of the most fundamental components of the spliceosome—the Sm and Lsm family of proteins. Using sequence and structure analysis as well as the analysis of the intron positions in Sm and Lsm genes in conjunction with a wealth of published data, we propose a plausible scenario for some aspects of spliceosomal evolution. In particular, we suggest that the Lsm family of genes could have been the first and the most essential component that allowed rudimentary splicing of early spliceosomal introns. Extensive duplications of Lsm genes and the later rise of the Sm gene family likely reflect a gradual increase in complexity of the spliceosome.
Collapse
|
45
|
Garcia-España A, Mares R, Sun TT, DeSalle R. Intron evolution: testing hypotheses of intron evolution using the phylogenomics of tetraspanins. PLoS One 2009; 4:e4680. [PMID: 19262691 PMCID: PMC2650405 DOI: 10.1371/journal.pone.0004680] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2008] [Accepted: 12/30/2008] [Indexed: 11/20/2022] Open
Abstract
Background Although large scale informatics studies on introns can be useful in making broad inferences concerning patterns of intron gain and loss, more specific questions about intron evolution at a finer scale can be addressed using a gene family where structure and function are well known. Genome wide surveys of tetraspanins from a broad array of organisms with fully sequenced genomes are an excellent means to understand specifics of intron evolution. Our approach incorporated several new fully sequenced genomes that cover the major lineages of the animal kingdom as well as plants, protists and fungi. The analysis of exon/intron gene structure in such an evolutionary broad set of genomes allowed us to identify ancestral intron structure in tetraspanins throughout the eukaryotic tree of life. Methodology/Principal Findings We performed a phylogenomic analysis of the intron/exon structure of the tetraspanin protein family. In addition, to the already characterized tetraspanin introns numbered 1 through 6 found in animals, three additional ancient, phase 0 introns we call 4a, 4b and 4c were found. These three novel introns in combination with the ancestral introns 1 to 6, define three basic tetraspanin gene structures which have been conserved throughout the animal kingdom. Our phylogenomic approach also allows the estimation of the time at which the introns of the 33 human tetraspanin paralogs appeared, which in many cases coincides with the concomitant acquisition of new introns. On the other hand, we observed that new introns (introns other than 1–6, 4a, b and c) were not randomly inserted into the tetraspanin gene structure. The region of tetraspanin genes corresponding to the small extracellular loop (SEL) accounts for only 10.5% of the total sequence length but had 46% of the new animal intron insertions. Conclusions/Significance Our results indicate that tests of intron evolution are strengthened by the phylogenomic approach with specific gene families like tetraspanins. These tests add to our understanding of genomic innovation coupled to major evolutionary divergence events, functional constraints and the timing of the appearance of evolutionary novelty.
Collapse
Affiliation(s)
- Antonio Garcia-España
- Unitat de Recerca, Hospital Joan XXIII, Institut de Investigacio Sanitaria Rovira I Virgili (IISPV), Universitat Rovira i Virgili, Tarragona, Spain
- CIBER de Diabetes y Enfermedades Metabólicas Asociadas (CIBERDEM), Universitat Rovira i Virgili, Tarragona, Spain
- * E-mail: (AG); (RD)
| | - Roso Mares
- Unitat de Recerca, Hospital Joan XXIII, Institut de Investigacio Sanitaria Rovira I Virgili (IISPV), Universitat Rovira i Virgili, Tarragona, Spain
| | - Tung-Tien Sun
- Department of Cell Biology, New York University School of Medicine, New York, New York, United States of America
- Department of Dermatology, New York University School of Medicine, New York, New York, United States of America
- Department of Pharmacology, New York University School of Medicine, New York, New York, United States of America
- Department of Urology, New York University School of Medicine, New York, New York, United States of America
| | - Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
- * E-mail: (AG); (RD)
| |
Collapse
|
46
|
Weadick CJ, Chang BSW. Molecular evolution of the betagamma lens crystallin superfamily: evidence for a retained ancestral function in gamma N crystallins? Mol Biol Evol 2009; 26:1127-42. [PMID: 19233964 DOI: 10.1093/molbev/msp028] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Within the vertebrate eye, betagamma crystallins are extremely stable lens proteins that are uniquely adapted to increase refractory power while maintaining transparency. Unlike alpha crystallins, which are well-characterized, multifunctional proteins that have important functions both in and out of the lens, betagamma lens crystallins are a diverse group of proteins with no clear ancestral or contemporary nonlens role. We carried out phylogenetic and molecular evolutionary analyses of the betagamma-crystallin superfamily in order to study the evolutionary history of the gamma N crystallins, a recently discovered, biochemically atypical family suggested to possess a divergent or ancestral function. By including nonlens, betagamma-motif-containing sequences in our analysis as outgroups, we confirmed the phylogenetic position of the gamma N family as sister to other gamma crystallins. Using maximum likelihood codon models to estimate lineage-specific nonsynonymous-to-synonymous rate ratios revealed strong positive selection in all of the early lineages within the betagamma family, with the striking exception of the lineage leading to the gamma N crystallins which was characterized by strong purifying selection. Branch-site analysis, used to identify candidate sites involved in functional divergence between gamma N crystallins and its sister clade containing all other gamma crystallins, identified several positively selected changes at sites of known functional importance in the betagamma crystallin protein structure. Further analyses of a fish-specific gamma N crystallin gene duplication revealed a more recent episode of positive selection in only one of the two descendant lineages (gamma N2). Finally, from the guppy, Poecilia reticulata, we isolated complete gamma N1 and gamma N2 coding sequence data from cDNA and partial coding sequence data from genomic DNA in order to confirm the presence of a novel gamma N2 intron, discovered through data mining of two pufferfish genomes. We conclude that the function of the gamma N family likely resembles the ancestral vertebrate betagamma crystallin more than other betagamma families. Furthermore, owing to the presence of an additional intron in some fish gamma N2 crystallins, and the inferred action of positive selection following the fish-specific gamma N duplication, we suggest that further study of fish gamma N crystallins will be critical in further elucidating possible ancestral functions of gamma N crystallins and any nonstructural role they may have.
Collapse
Affiliation(s)
- Cameron J Weadick
- Department of Ecology and Evolution, University of Toronto, Toronto, Ontario, Canada
| | | |
Collapse
|
47
|
Structural Variations in Protein Superfamilies: Actin and Tubulin. Mol Biotechnol 2009; 42:49-60. [DOI: 10.1007/s12033-008-9128-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2008] [Accepted: 11/14/2008] [Indexed: 11/28/2022]
|
48
|
Ivashchenko AT, Khailenko VA, Atambaeva SA. Variations of the length of exons and introns in human genome genes. RUSS J GENET+ 2009. [DOI: 10.1134/s1022795409010025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
49
|
Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event. BMC Evol Biol 2008; 8:305. [PMID: 18980666 PMCID: PMC2633302 DOI: 10.1186/1471-2148-8-305] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2007] [Accepted: 11/03/2008] [Indexed: 11/22/2022] Open
Abstract
Background Endo-1,4-beta-glucanases or cellulases from the glycosyl hydrolase family 5 (GHF5) have been found in numerous bacteria and fungi, and recently also in higher eukaryotes, particularly in plant-parasitic nematodes (PPN). The origin of these genes has been attributed to horizontal gene transfer from bacteria, although there still is a lot of uncertainty about the origin and structure of the ancestral GHF5 PPN endoglucanase. It is not clear whether this ancestral endoglucanase consisted of the whole gene cassette, containing a catalytic domain and a carbohydrate-binding module (CBM, type 2 in PPN and bacteria) or only of the catalytic domain while the CBM2 was retrieved by domain shuffling later in evolution. Previous studies on the evolution of these genes have focused primarily on data of sedentary nematodes, while in this study, extra data from migratory nematodes were included. Results Two new endoglucanases from the migratory nematodes Pratylenchus coffeae and Ditylenchus africanus were included in this study. The latter one is the first gene isolated from a PPN of a different superfamily (Sphaerularioidea); all previously known nematode endoglucanases belong to the superfamily Tylenchoidea (order Rhabditida). Phylogenetic analyses were conducted with the PPN GHF5 endoglucanases and homologous endoglucanases from bacterial and other eukaryotic lineages such as beetles, fungi and plants. No statistical incongruence between the phylogenetic trees deduced from the catalytic domain and the CBM2 was found, which could suggest that both domains have evolved together. Furthermore, based on gene structure data, we inferred a model for the evolution of the GHF5 endoglucanase gene structure in plant-parasitic nematodes. Our data confirm a close relationship between Pratylenchus spp. and the root knot nematodes, while some Radopholus similis endoglucanases are more similar to cyst nematode genes. Conclusion We conclude that the ancestral PPN GHF5 endoglucanase gene most probably consisted of the whole gene cassette, i.e. the GHF5 catalytic domain and the CBM2, rather than that it evolved by domain shuffling. Our evolutionary model for the gene structure in PPN GHF5 endoglucanases implies the occurrence of an early duplication event, and more recent gene duplications at genus or species level.
Collapse
|
50
|
Armisén D, Lecharny A, Aubourg S. Unique genes in plants: specificities and conserved features throughout evolution. BMC Evol Biol 2008; 8:280. [PMID: 18847470 PMCID: PMC2576244 DOI: 10.1186/1471-2148-8-280] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2008] [Accepted: 10/10/2008] [Indexed: 11/10/2022] Open
Abstract
Background Plant genomes contain a high proportion of duplicated genes as a result of numerous whole, segmental and local duplications. These duplications lead up to the formation of gene families, which are the usual material for many evolutionary studies. However, all characterized genomes include single-copy (unique) genes that have not received much attention. Unlike gene duplication, gene loss is not an unspecific mechanism but is rather influenced by a functional selection. In this context, we have established and used stringent criteria in order to identify suitable sets of unique genes present in plant proteomes. Comparisons of unique genes in the green phylum were used to characterize the gene and protein features exhibited by both conserved and species-specific unique genes. Results We identified the unique genes within both A. thaliana and O. sativa genomes and classified them according to the number of homologs in the alternative species: none (U{1:0}), one (U{1:1}) or several (U{1:m}). Regardless of the species, all the genes in these groups present some conserved characteristics, such as small average protein size and abnormal intron number. In order to understand the origin and function of unique genes, we further characterized the U{1:1} gene pairs. The possible involvement of sequence convergence in the creation of U{1:1} pairs was discarded due to the frequent conservation of intron positions. Furthermore, an orthology relationship between the two members of each U{1:1} pair was strongly supported by a high conservation in the protein sizes and transcription levels. Within the promoter of the unique conserved genes, we found a number of TATA and TELO boxes that specifically differed from their mean number in the whole genome. Many unique genes have been conserved as unique through evolution from the green alga Ostreococcus lucimarinus to higher plants. Plant unique genes may also have homologs in bacteria and we showed a link between the targeting towards plastids of proteins encoded by plant nuclear unique genes and their homology with a bacterial protein. Conclusion Many of the A. thaliana and O. sativa unique genes are conserved in plants for which the ancestor diverged at least 725 million years ago (MYA). Half of these genes are also present in other eukaryotic and/or prokaryotic species. Thus, our results indicate that (i) a strong negative selection pressure has conserved a number of genes as unique in genomes throughout evolution, (ii) most unique genes are subjected to a low divergence rate, (iii) they have some features observed in housekeeping genes but for most of them there is no functional annotation and (iv) they may have an ancient origin involving a possible gene transfer from ancestral chloroplasts or bacteria to the plant nucleus.
Collapse
Affiliation(s)
- David Armisén
- Unité de Recherche en Génomique Végetale , UMR INRA 1165 - CNRS 8114 - Université d'Evry Val d'Essonne, 2 rue Gaston Crémieux, CP 5708, F-91057 Evry Cedex, France.
| | | | | |
Collapse
|