1
|
Ye Z, Mao D, Wang Y, Deng H, Liu X, Zhang T, Han Z, Zhang X. Comparative Genome-Wide Identification of the Fatty Acid Desaturase Gene Family in Tea and Oil Tea. PLANTS (BASEL, SWITZERLAND) 2024; 13:1444. [PMID: 38891253 PMCID: PMC11174766 DOI: 10.3390/plants13111444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 05/07/2024] [Accepted: 05/16/2024] [Indexed: 06/21/2024]
Abstract
Camellia oil is valuable as an edible oil and serves as a base material for a range of high-value products. Camellia plants of significant economic importance, such as Camellia sinensis and Camellia oleifera, have been classified into sect. Thea and sect. Oleifera, respectively. Fatty acid desaturases play a crucial role in catalyzing the formation of double bonds at specific positions of fatty acid chains, leading to the production of unsaturated fatty acids and contributing to lipid synthesis. Comparative genomics results have revealed that expanded gene families in oil tea are enriched in functions related to lipid, fatty acid, and seed processes. To explore the function of the FAD gene family, a total of 82 FAD genes were identified in tea and oil tea. Transcriptome data showed the differential expression of the FAD gene family in mature seeds of tea tree and oil tea tree. Furthermore, the structural analysis and clustering of FAD proteins provided insights for the further exploration of the function of the FAD gene family and its role in lipid synthesis. Overall, these findings shed light on the role of the FAD gene family in Camellia plants and their involvement in lipid metabolism, as well as provide a reference for understanding their function in oil synthesis.
Collapse
Affiliation(s)
- Ziqi Ye
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, Changsha 410004, China; (Z.Y.); (H.D.); (X.L.); (T.Z.)
| | - Dan Mao
- National Forest and Seedling Workstation of Hunan Province, The Forestry Department of Hunan Province, Changsha 410004, China; (D.M.); (Y.W.)
| | - Yujian Wang
- National Forest and Seedling Workstation of Hunan Province, The Forestry Department of Hunan Province, Changsha 410004, China; (D.M.); (Y.W.)
| | - Hongda Deng
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, Changsha 410004, China; (Z.Y.); (H.D.); (X.L.); (T.Z.)
| | - Xing Liu
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, Changsha 410004, China; (Z.Y.); (H.D.); (X.L.); (T.Z.)
| | - Tongyue Zhang
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, Changsha 410004, China; (Z.Y.); (H.D.); (X.L.); (T.Z.)
| | - Zhiqiang Han
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, Changsha 410004, China; (Z.Y.); (H.D.); (X.L.); (T.Z.)
| | - Xingtan Zhang
- Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518000, China
| |
Collapse
|
2
|
Chen S, Fan H, Ran C, Hong Y, Feng H, Yue Z, Zhang H, Pontarotti P, Xu A, Huang S. The IL-17 pathway intertwines with neurotrophin and TLR/IL-1R pathways since its domain shuffling origin. Proc Natl Acad Sci U S A 2024; 121:e2400903121. [PMID: 38683992 PMCID: PMC11087794 DOI: 10.1073/pnas.2400903121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Accepted: 03/11/2024] [Indexed: 05/02/2024] Open
Abstract
The IL-17 pathway displays remarkably diverse functional modes between different subphyla, classes, and even orders, yet its driving factors remains elusive. Here, we demonstrate that the IL-17 pathway originated through domain shuffling between a Toll-like receptor (TLR)/IL-1R pathway and a neurotrophin-RTK (receptor-tyrosine-kinase) pathway (a Trunk-Torso pathway). Unlike other new pathways that evolve independently, the IL-17 pathway remains intertwined with its donor pathways throughout later evolution. This intertwining not only influenced the gains and losses of domains and components in the pathway but also drove the diversification of the pathway's functional modes among animal lineages. For instance, we reveal that the crustacean female sex hormone, a neurotrophin inducing sex differentiation, could interact with IL-17Rs and thus be classified as true IL-17s. Additionally, the insect prothoracicotropic hormone, a neurotrophin initiating ecdysis in Drosophila by binding to Torso, could bind to IL-17Rs in other insects. Furthermore, IL-17R and TLR/IL-1R pathways maintain crosstalk in amphioxus and zebrafish. Moreover, the loss of the Death domain in the pathway adaptor connection to IκB kinase and stress-activated protein kinase (CIKSs) dramatically reduced their abilities to activate nuclear factor-kappaB (NF-κB) and activator protein 1 (AP-1) in amphioxus and zebrafish. Reinstating this Death domain not only enhanced NF-κB/AP-1 activation but also strengthened anti-bacterial immunity in zebrafish larvae. This could explain why the mammalian IL-17 pathway, whose CIKS also lacks Death, is considered a weak signaling activator, relying on synergies with other pathways. Our findings provide insights into the functional diversity of the IL-17 pathway and unveil evolutionary principles that could govern the pathway and be used to redesign and manipulate it.
Collapse
Affiliation(s)
- Shenghui Chen
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
- Laboratory for Marine Biology and Biotechnology, Qingdao Marine Science and Technology Center, Qingdao266237, China
| | - Huiping Fan
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
| | - Chenrui Ran
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
| | - Yun Hong
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
| | - Huixiong Feng
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
| | - Zirui Yue
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
| | - Hao Zhang
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
| | - Pierre Pontarotti
- MEPHI (Microbes, Evolution, Phylogénie et Infection), Aix Marseille Université, Marseille, France
| | - Anlong Xu
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
- School of Life Sciences, Beijing University of Chinese Medicine, Beijing100029, China
| | - Shengfeng Huang
- State Key Laboratory of Biocontrol, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-sen University, Guangzhou510275, China
- Laboratory for Marine Biology and Biotechnology, Qingdao Marine Science and Technology Center, Qingdao266237, China
| |
Collapse
|
3
|
Feng X, Zheng J, Irisarri I, Yu H, Zheng B, Ali Z, de Vries S, Keller J, Fürst-Jansen JMR, Dadras A, Zegers JMS, Rieseberg TP, Dhabalia Ashok A, Darienko T, Bierenbroodspot MJ, Gramzow L, Petroll R, Haas FB, Fernandez-Pozo N, Nousias O, Li T, Fitzek E, Grayburn WS, Rittmeier N, Permann C, Rümpler F, Archibald JM, Theißen G, Mower JP, Lorenz M, Buschmann H, von Schwartzenberg K, Boston L, Hayes RD, Daum C, Barry K, Grigoriev IV, Wang X, Li FW, Rensing SA, Ben Ari J, Keren N, Mosquna A, Holzinger A, Delaux PM, Zhang C, Huang J, Mutwil M, de Vries J, Yin Y. Genomes of multicellular algal sisters to land plants illuminate signaling network evolution. Nat Genet 2024; 56:1018-1031. [PMID: 38693345 PMCID: PMC11096116 DOI: 10.1038/s41588-024-01737-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 03/25/2024] [Indexed: 05/03/2024]
Abstract
Zygnematophyceae are the algal sisters of land plants. Here we sequenced four genomes of filamentous Zygnematophyceae, including chromosome-scale assemblies for three strains of Zygnema circumcarinatum. We inferred traits in the ancestor of Zygnematophyceae and land plants that might have ushered in the conquest of land by plants: expanded genes for signaling cascades, environmental response, and multicellular growth. Zygnematophyceae and land plants share all the major enzymes for cell wall synthesis and remodifications, and gene gains shaped this toolkit. Co-expression network analyses uncover gene cohorts that unite environmental signaling with multicellular developmental programs. Our data shed light on a molecular chassis that balances environmental response and growth modulation across more than 600 million years of streptophyte evolution.
Collapse
Affiliation(s)
- Xuehuan Feng
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, NE, USA
| | - Jinfang Zheng
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, NE, USA
- Zhejiang Lab, Hanzhou, China
| | - Iker Irisarri
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
- Campus Institute Data Science, University of Goettingen, Goettingen, Germany
- Section Phylogenomics, Centre for Molecular biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Zoological Museum Hamburg, Hamburg, Germany
| | - Huihui Yu
- University of Nebraska-Lincoln, Center for Plant Science Innovation, Lincoln, NE, USA
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Science, Yunnan, China
| | - Bo Zheng
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, NE, USA
| | - Zahin Ali
- Nanyang Technological University, School of Biological Sciences, Singapore, Singapore
| | - Sophie de Vries
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Jean Keller
- Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, INP Toulouse, Castanet-Tolosan, France
| | - Janine M R Fürst-Jansen
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Armin Dadras
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Jaccoline M S Zegers
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Tim P Rieseberg
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Amra Dhabalia Ashok
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Tatyana Darienko
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Maaike J Bierenbroodspot
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Lydia Gramzow
- University of Jena, Matthias Schleiden Institute/Genetics, Jena, Germany
| | - Romy Petroll
- Plant Cell Biology, Department of Biology, University of Marburg, Marburg, Germany
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Fabian B Haas
- Plant Cell Biology, Department of Biology, University of Marburg, Marburg, Germany
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Noe Fernandez-Pozo
- Plant Cell Biology, Department of Biology, University of Marburg, Marburg, Germany
- Institute for Mediterranean and Subtropical Horticulture 'La Mayora', Málaga, Spain
| | - Orestis Nousias
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, NE, USA
| | - Tang Li
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, NE, USA
| | - Elisabeth Fitzek
- Computational Biology, Department of Biology, Center for Biotechnology, Bielefeld University, Bielefeld, Germany
| | - W Scott Grayburn
- Northern Illinois University, Molecular Core Lab, Department of Biological Sciences, DeKalb, IL, USA
| | - Nina Rittmeier
- University of Innsbruck, Department of Botany, Research Group Plant Cell Biology, Innsbruck, Austria
| | - Charlotte Permann
- University of Innsbruck, Department of Botany, Research Group Plant Cell Biology, Innsbruck, Austria
| | - Florian Rümpler
- University of Jena, Matthias Schleiden Institute/Genetics, Jena, Germany
| | - John M Archibald
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Günter Theißen
- University of Jena, Matthias Schleiden Institute/Genetics, Jena, Germany
| | - Jeffrey P Mower
- University of Nebraska-Lincoln, Center for Plant Science Innovation, Lincoln, NE, USA
| | - Maike Lorenz
- University of Goettingen, Albrecht-von-Haller-Institute for Plant Sciences, Experimental Phycology and Culture Collection of Algae at Goettingen University, Goettingen, Germany
| | - Henrik Buschmann
- University of Applied Sciences Mittweida, Faculty of Applied Computer Sciences and Biosciences, Section Biotechnology and Chemistry, Molecular Biotechnology, Mittweida, Germany
| | - Klaus von Schwartzenberg
- Universität Hamburg, Institute of Plant Science and Microbiology, Microalgae and Zygnematophyceae Collection Hamburg and Aquatic Ecophysiology and Phycology, Hamburg, Germany
| | - Lori Boston
- Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Richard D Hayes
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Chris Daum
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Kerrie Barry
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Igor V Grigoriev
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Xiyin Wang
- North China University of Science and Technology, Tangshan, China
| | - Fay-Wei Li
- Boyce Thompson Institute, Ithaca, NY, USA
- Plant Biology Section, Cornell University, Ithaca, NY, USA
| | - Stefan A Rensing
- Plant Cell Biology, Department of Biology, University of Marburg, Marburg, Germany
- University of Freiburg, Centre for Biological Signalling Studies (BIOSS), Freiburg, Germany
| | - Julius Ben Ari
- The Hebrew University of Jerusalem, The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, Rehovot, Israel
| | - Noa Keren
- The Hebrew University of Jerusalem, The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, Rehovot, Israel
| | - Assaf Mosquna
- The Hebrew University of Jerusalem, The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, Rehovot, Israel
| | - Andreas Holzinger
- University of Innsbruck, Department of Botany, Research Group Plant Cell Biology, Innsbruck, Austria
| | - Pierre-Marc Delaux
- Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, INP Toulouse, Castanet-Tolosan, France
| | - Chi Zhang
- University of Nebraska-Lincoln, Center for Plant Science Innovation, Lincoln, NE, USA
- University of Nebraska-Lincoln, School of Biological Sciences, Lincoln, NE, USA
| | - Jinling Huang
- Department of Biology, East Carolina University, Greenville, NC, USA
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng, China
| | - Marek Mutwil
- Nanyang Technological University, School of Biological Sciences, Singapore, Singapore
| | - Jan de Vries
- Institute of Microbiology and Genetics, Department of Applied Bioinformatics, University of Goettingen, Goettingen, Germany.
- Campus Institute Data Science, University of Goettingen, Goettingen, Germany.
- University of Goettingen, Goettingen Center for Molecular Biosciences, Goettingen, Germany.
| | - Yanbin Yin
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, NE, USA.
| |
Collapse
|
4
|
Mahlich Y, Zhu C, Chung H, Velaga PK, De Paolis Kaluza M, Radivojac P, Friedberg I, Bromberg Y. Learning from the unknown: exploring the range of bacterial functionality. Nucleic Acids Res 2023; 51:10162-10175. [PMID: 37739408 PMCID: PMC10602916 DOI: 10.1093/nar/gkad757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 09/11/2023] [Indexed: 09/24/2023] Open
Abstract
Determining the repertoire of a microbe's molecular functions is a central question in microbial biology. Modern techniques achieve this goal by comparing microbial genetic material against reference databases of functionally annotated genes/proteins or known taxonomic markers such as 16S rRNA. Here, we describe a novel approach to exploring bacterial functional repertoires without reference databases. Our Fusion scheme establishes functional relationships between bacteria and assigns organisms to Fusion-taxa that differ from otherwise defined taxonomic clades. Three key findings of our work stand out. First, bacterial functional comparisons outperform marker genes in assigning taxonomic clades. Fusion profiles are also better for this task than other functional annotation schemes. Second, Fusion-taxa are robust to addition of novel organisms and are, arguably, able to capture the environment-driven bacterial diversity. Finally, our alignment-free nucleic acid-based Siamese Neural Network model, created using Fusion functions, enables finding shared functionality of very distant, possibly structurally different, microbial homologs. Our work can thus help annotate functional repertoires of bacterial organisms and further guide our understanding of microbial communities.
Collapse
Affiliation(s)
- Yannick Mahlich
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Chengsheng Zhu
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
- Xbiome Inc., 1 Broadway, 14th fl, Cambridge, MA 02142, USA
| | - Henri Chung
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA 50011, USA
- Interdepartmental program in Bioinformatics and Computational Biology, Iowa State University, Ames, IA 50011, USA
| | - Pavan K Velaga
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - M Clara De Paolis Kaluza
- Khoury College of Computer Sciences, Northeastern University, 177 Huntington Avenue, Boston, MA 02115, USA
| | - Predrag Radivojac
- Khoury College of Computer Sciences, Northeastern University, 177 Huntington Avenue, Boston, MA 02115, USA
| | - Iddo Friedberg
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA 50011, USA
- Interdepartmental program in Bioinformatics and Computational Biology, Iowa State University, Ames, IA 50011, USA
| | - Yana Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
- Department of Biology, Emory University, 1510 Clifton Road NE, Atlanta, GA 30322, USA
- Department of Computer Science, Emory University, 400 Dowman Drive, Atlanta, GA 30322, USA
| |
Collapse
|
5
|
Yu NN, Veerana M, Ketya W, Sun HN, Park G. RNA-Seq-Based Transcriptome Analysis of Nitric Oxide Scavenging Response in Neurospora crassa. J Fungi (Basel) 2023; 9:985. [PMID: 37888241 PMCID: PMC10607626 DOI: 10.3390/jof9100985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/22/2023] [Accepted: 10/01/2023] [Indexed: 10/28/2023] Open
Abstract
While the biological role of naturally occurring nitric oxide (NO) in filamentous fungi has been uncovered, the underlying molecular regulatory networks remain unclear. In this study, we conducted an analysis of transcriptome profiles to investigate the initial stages of understanding these NO regulatory networks in Neurospora crassa, a well-established model filamentous fungus. Utilizing RNA sequencing, differential gene expression screening, and various functional analyses, our findings revealed that the removal of intracellular NO resulted in the differential transcription of 424 genes. Notably, the majority of these differentially expressed genes were functionally linked to processes associated with carbohydrate and amino acid metabolism. Furthermore, our analysis highlighted the prevalence of four specific protein domains (zinc finger C2H2, PLCYc, PLCXc, and SH3) in the encoded proteins of these differentially expressed genes. Through protein-protein interaction network analysis, we identified eight hub genes with substantial interaction connectivity, with mss-4 and gel-3 emerging as possibly major responsive genes during NO scavenging, particularly influencing vegetative growth. Additionally, our study unveiled that NO scavenging led to the inhibition of gene transcription related to a protein complex associated with ribosome biogenesis. Overall, our investigation suggests that endogenously produced NO in N. crassa likely governs the transcription of genes responsible for protein complexes involved in carbohydrate and amino acid metabolism, as well as ribosomal biogenesis, ultimately impacting the growth and development of hyphae.
Collapse
Affiliation(s)
- Nan-Nan Yu
- Plasma Bioscience Research Center, Department of Plasma-Bio Display, Kwangwoon University, Seoul 01897, Republic of Korea; (N.-N.Y.); (W.K.)
| | - Mayura Veerana
- Department of Applied Radiation and Isotopes, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand;
| | - Wirinthip Ketya
- Plasma Bioscience Research Center, Department of Plasma-Bio Display, Kwangwoon University, Seoul 01897, Republic of Korea; (N.-N.Y.); (W.K.)
| | - Hu-Nan Sun
- College of Life Science and Technology, Heilongjiang Bayi Agricultural University, Daqing 163319, China;
| | - Gyungsoon Park
- Plasma Bioscience Research Center, Department of Plasma-Bio Display, Kwangwoon University, Seoul 01897, Republic of Korea; (N.-N.Y.); (W.K.)
- Department of Electrical and Biological Physics, Kwangwoon University, Seoul 01897, Republic of Korea
| |
Collapse
|
6
|
Gollapalli P, Rudrappa S, Kumar V, Santosh Kumar HS. Domain Architecture Based Methods for Comparative Functional Genomics Toward Therapeutic Drug Target Discovery. J Mol Evol 2023; 91:598-615. [PMID: 37626222 DOI: 10.1007/s00239-023-10129-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 08/06/2023] [Indexed: 08/27/2023]
Abstract
Genes duplicate, mutate, recombine, fuse or fission to produce new genes, or when genes are formed from de novo, novel functions arise during evolution. Researchers have tried to quantify the causes of these molecular diversification processes to know how these genes increase molecular complexity over a period of time, for instance protein domain organization. In contrast to global sequence similarity, protein domain architectures can capture key structural and functional characteristics, making them better proxies for describing functional equivalence. In Prokaryotes and eukaryotes it has proven that, domain designs are retained over significant evolutionary distances. Protein domain architectures are now being utilized to categorize and distinguish evolutionarily related proteins and find homologs among species that are evolutionarily distant from one another. Additionally, structural information stored in domain structures has accelerated homology identification and sequence search methods. Tools for functional protein annotation have been developed to discover, protein domain content, domain order, domain recurrence, and domain position as all these contribute to the prediction of protein functional accuracy. In this review, an attempt is made to summarise facts and speculations regarding the use of protein domain architecture and modularity to identify possible therapeutic targets among cellular activities based on the understanding their linked biological processes.
Collapse
Affiliation(s)
- Pavan Gollapalli
- Center for Bioinformatics and Biostatistics, Nitte (Deemed to be University), Mangalore, Karnataka, 575018, India
| | - Sushmitha Rudrappa
- Department of Biotechnology and Bioinformatics, Jnana Sahyadri Campus, Kuvempu University, Shankaraghatta, Shivamogga, Karnataka, 577451, India
| | - Vadlapudi Kumar
- Department of Biochemistry, Davangere University, Shivagangothri, Davangere, Karnataka, 577007, India
| | - Hulikal Shivashankara Santosh Kumar
- Department of Biotechnology and Bioinformatics, Jnana Sahyadri Campus, Kuvempu University, Shankaraghatta, Shivamogga, Karnataka, 577451, India.
| |
Collapse
|
7
|
Mohri M, Moghadam A, Burketova L, Ryšánek P. Genome-wide identification of the opsin protein in Leptosphaeria maculans and comparison with other fungi (pathogens of Brassica napus). Front Microbiol 2023; 14:1193892. [PMID: 37692395 PMCID: PMC10485269 DOI: 10.3389/fmicb.2023.1193892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2023] [Accepted: 06/28/2023] [Indexed: 09/12/2023] Open
Abstract
The largest family of transmembrane receptors are G-protein-coupled receptors (GPCRs). These receptors respond to perceived environmental signals and infect their host plants. Family A of the GPCR includes opsin. However, there is little known about the roles of GPCRs in phytopathogenic fungi. We studied opsin in Leptosphaeria maculans, an important pathogen of oilseed rape (Brassica napus) that causes blackleg disease, and compared it with six other fungal pathogens of oilseed rape. A phylogenetic tree analysis of 31 isoforms of the opsin protein showed six major groups and six subgroups. All three opsin isoforms of L. maculans are grouped in the same clade in the phylogenetic tree. Physicochemical analysis revealed that all studied opsin proteins are stable and hydrophobic. Subcellular localization revealed that most isoforms were localized in the endoplasmic reticulum membrane except for several isoforms in Verticillium species, which were localized in the mitochondrial membrane. Most isoforms comprise two conserved domains. One conserved motif was observed across all isoforms, consisting of the BACTERIAL_OPSIN_1 domain, which has been hypothesized to have an identical sensory function. Most studied isoforms showed seven transmembrane helices, except for one isoform of V. longisporum and four isoforms of Fusarium oxysporum. Tertiary structure prediction displayed a conformational change in four isoforms of F. oxysporum that presumed differences in binding to other proteins and sensing signals, thereby resulting in various pathogenicity strategies. Protein-protein interactions and binding site analyses demonstrated a variety of numbers of ligands and pockets across all isoforms, ranging between 0 and 13 ligands and 4 and 10 pockets. According to the phylogenetic analysis in this study and considerable physiochemically and structurally differences of opsin proteins among all studied fungi hypothesized that this protein acts in the pathogenicity, growth, sporulation, and mating of these fungi differently.
Collapse
Affiliation(s)
- Marzieh Mohri
- Department of Plant Protection, Faculty of Agrobiology, Food, and Natural Resources, Czech University of Life Sciences, Prague, Czechia
| | - Ali Moghadam
- Institute of Biotechnology, Shiraz University, Shiraz, Iran
| | - Lenka Burketova
- Institute of Experimental Botany, Czech Academy of Sciences, Prague, Czechia
| | - Pavel Ryšánek
- Department of Plant Protection, Faculty of Agrobiology, Food, and Natural Resources, Czech University of Life Sciences, Prague, Czechia
| |
Collapse
|
8
|
Zmasek CM, Lefkowitz EJ, Niewiadomska A, Scheuermann RH. Genomic evolution of the Coronaviridae family. Virology 2022; 570:123-133. [PMID: 35398776 PMCID: PMC8965632 DOI: 10.1016/j.virol.2022.03.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 03/11/2022] [Accepted: 03/18/2022] [Indexed: 01/03/2023]
Abstract
The current outbreak of coronavirus disease-2019 (COVID-19) caused by SARS-CoV-2 poses unparalleled challenges to global public health. SARS-CoV-2 is a Betacoronavirus, one of four genera belonging to the Coronaviridae subfamily Orthocoronavirinae. Coronaviridae, in turn, are members of the order Nidovirales, a group of enveloped, positive-stranded RNA viruses. Here we present a systematic phylogenetic and evolutionary study based on protein domain architecture, encompassing the entire proteomes of all Orthocoronavirinae, as well as other Nidovirales. This analysis has revealed that the genomic evolution of Nidovirales is associated with extensive gains and losses of protein domains. In Orthocoronavirinae, the sections of the genomes that show the largest divergence in protein domains are found in the proteins encoded in the amino-terminal end of the polyprotein (PP1ab), the spike protein (S), and many of the accessory proteins. The diversity among the accessory proteins is particularly striking, as each subgenus possesses a set of accessory proteins that is almost entirely specific to that subgenus. The only notable exception to this is ORF3b, which is present and orthologous over all Alphacoronaviruses. In contrast, the membrane protein (M), envelope small membrane protein (E), nucleoprotein (N), as well as proteins encoded in the central and carboxy-terminal end of PP1ab (such as the 3C-like protease, RNA-dependent RNA polymerase, and Helicase) show stable domain architectures across all Orthocoronavirinae. This comprehensive analysis of the Coronaviridae domain architecture has important implication for efforts to develop broadly cross-protective coronavirus vaccines.
Collapse
Affiliation(s)
- Christian M Zmasek
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA, 92037, USA
| | - Elliot J Lefkowitz
- Department of Microbiology, UAB School of Medicine, Birmingham, AL, 35294, USA
| | - Anna Niewiadomska
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA, 92037, USA
| | - Richard H Scheuermann
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA, 92037, USA; Department of Pathology, University of California, San Diego, CA, 92093, USA; Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, CA, 92037, USA; Global Virus Network, Baltimore MD, 21201, USA.
| |
Collapse
|
9
|
Caetano-Anollés G, Aziz MF, Mughal F, Caetano-Anollés D. Tracing protein and proteome history with chronologies and networks: folding recapitulates evolution. Expert Rev Proteomics 2021; 18:863-880. [PMID: 34628994 DOI: 10.1080/14789450.2021.1992277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
INTRODUCTION While the origin and evolution of proteins remain mysterious, advances in evolutionary genomics and systems biology are facilitating the historical exploration of the structure, function and organization of proteins and proteomes. Molecular chronologies are series of time events describing the history of biological systems and subsystems and the rise of biological innovations. Together with time-varying networks, these chronologies provide a window into the past. AREAS COVERED Here, we review molecular chronologies and networks built with modern methods of phylogeny reconstruction. We discuss how chronologies of structural domain families uncover the explosive emergence of metabolism, the late rise of translation, the co-evolution of ribosomal proteins and rRNA, and the late development of the ribosomal exit tunnel; events that coincided with a tendency to shorten folding time. Evolving networks described the early emergence of domains and a late 'big bang' of domain combinations. EXPERT OPINION Two processes, folding and recruitment appear central to the evolutionary progression. The former increases protein persistence. The later fosters diversity. Chronologically, protein evolution mirrors folding by combining supersecondary structures into domains, developing translation machinery to facilitate folding speed and stability, and enhancing structural complexity by establishing long-distance interactions in novel structural and architectural designs.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA.,C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Derek Caetano-Anollés
- Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| |
Collapse
|
10
|
In Silico Analysis of Fatty Acid Desaturases Structures in Camelina sativa, and Functional Evaluation of Csafad7 and Csafad8 on Seed Oil Formation and Seed Morphology. Int J Mol Sci 2021; 22:ijms221910857. [PMID: 34639198 PMCID: PMC8532002 DOI: 10.3390/ijms221910857] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/01/2021] [Accepted: 10/05/2021] [Indexed: 12/19/2022] Open
Abstract
Fatty acid desaturases add a second bond into a single bond of carbon atoms in fatty acid chains, resulting in an unsaturated bond between the two carbons. They are classified into soluble and membrane-bound desaturases, according to their structure, subcellular location, and function. The orthologous genes in Camelina sativa were identified and analyzed, and a total of 62 desaturase genes were identified. It was revealed that they had the common fatty acid desaturase domain, which has evolved separately, and the proteins of the same family also originated from the same ancestry. A mix of conserved, gained, or lost intron structure was obvious. Besides, conserved histidine motifs were found in each family, and transmembrane domains were exclusively revealed in the membrane-bound desaturases. The expression profile analysis of C. sativa desaturases revealed an increase in young leaves, seeds, and flowers. C. sativa ω3-fatty acid desaturases CsaFAD7 and CsaDAF8 were cloned and the subcellular localization analysis showed their location in the chloroplast. They were transferred into Arabidopsis thaliana to obtain transgenic lines. It was revealed that the ω3-fatty acid desaturase could increase the C18:3 level at the expense of C18:2, but decreases in oil content and seed weight, and wrinkled phenotypes were observed in transgenic CsaFAD7 lines, while no significant change was observed in transgenic CsaFAD8 lines in comparison to the wild-type. These findings gave insights into the characteristics of desaturase genes, which could provide an excellent basis for further investigation for C. sativa improvement, and overexpression of ω3-fatty acid desaturases in seeds could be useful in genetic engineering strategies, which are aimed at modifying the fatty acid composition of seed oil.
Collapse
|
11
|
Lakshmanan Mangalath D, Hassan Mohammed SA. Ligand Binding Domain of Estrogen Receptor Alpha Preserve a Conserved Structural Architecture Similar to Bacterial Taxis Receptors. Front Ecol Evol 2021. [DOI: 10.3389/fevo.2021.681913] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
It remains a mystery why estrogen hormone receptors (ERs), which are highly specific toward its endogenous hormones, are responsive to chemically distinct exogenous agents. Does it indicate that ERs are environmentally regulated? Here, we speculate that ERs would have some common structural features with prokaryotic taxis receptor responsive toward environmental signals. This study addresses the low specificity and high responsiveness of ERs toward chemically distinct exogenous substances, from an evolutionary point of view. Here, we compared the ligand binding domain (LBD) of ER alpha (α) with the LBDs of prokaryotic taxis receptors to check if LBDs share any structural similarity. Interestingly, a high degree of similarity in the domain structural fold architecture of ERα and bacterial taxis receptors was observed. The pharmacophore modeling focused on ligand molecules of both receptors suggest that these ligands share common pharmacophore features. The molecular docking studies suggest that the natural ligands of bacterial chemotaxis receptors exhibit strong interaction with human ER as well. Although phylogenetic analysis proved that these proteins are unrelated, they would have evolved independently, suggesting a possibility of convergent molecular evolution. Nevertheless, a remarkable sequence divergence was seen between these proteins even when they shared common domain structural folds and common ligand-based pharmacophore features, suggesting that the protein architecture remains conserved within the structure for a specific function irrespective of sequence identity.
Collapse
|
12
|
YAMATO K, KATO H, KATSURAGI T, TAKAHASHI Y. The Multiple Representation of Protein Sequence MotifsUsing Sequence Binary Decision Diagrams. JOURNAL OF COMPUTER CHEMISTRY-JAPAN 2020. [DOI: 10.2477/jccj.2019-0028] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Kohei YAMATO
- Department of Computer Science and Engineering, Toyohashi University of Technology, 1-1 Hibarigaoka, Tempaku-cho, Toyohashi, Aichi 441-8580, Japan
| | - Hiroaki KATO
- Department of Distribution and Information Engineering, National Institute of Technology, Hiroshima College,4272-1 Higashino, Osakikamijima-cho, Toyota-gun, Hiroshima 725-0231, Japan
| | - Tetsuo KATSURAGI
- Department of Computer Science and Engineering, Toyohashi University of Technology, 1-1 Hibarigaoka, Tempaku-cho, Toyohashi, Aichi 441-8580, Japan
| | - Yoshimasa TAKAHASHI
- Department of Computer Science and Engineering, Toyohashi University of Technology, 1-1 Hibarigaoka, Tempaku-cho, Toyohashi, Aichi 441-8580, Japan
| |
Collapse
|
13
|
López-Escardó D, Grau-Bové X, Guillaumet-Adkins A, Gut M, Sieracki ME, Ruiz-Trillo I. Reconstruction of protein domain evolution using single-cell amplified genomes of uncultured choanoflagellates sheds light on the origin of animals. Philos Trans R Soc Lond B Biol Sci 2019; 374:20190088. [PMID: 31587642 PMCID: PMC6792448 DOI: 10.1098/rstb.2019.0088] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/15/2019] [Indexed: 12/25/2022] Open
Abstract
Understanding the origins of animal multicellularity is a fundamental biological question. Recent genome data have unravelled the role that co-option of pre-existing genes played in the origin of animals. However, there were also some important genetic novelties at the onset of Metazoa. To have a clear understanding of the specific genetic innovations and how they appeared, we need the broadest taxon sampling possible, especially among early-branching animals and their unicellular relatives. Here, we take advantage of single-cell genomics to expand our understanding of the genomic diversity of choanoflagellates, the sister-group to animals. With these genomes, we have performed an updated and taxon-rich reconstruction of protein evolution from the Last Eukaryotic Common Ancestor (LECA) to animals. Our novel data re-defines the origin of some genes previously thought to be metazoan-specific, like the POU transcription factor, which we show appeared earlier in evolution. Moreover, our data indicate that the acquisition of new genes at the stem of Metazoa was mainly driven by duplications and protein domain rearrangement processes at the stem of Metazoa. Furthermore, our analysis allowed us to reveal protein domains that are essential to the maintenance of animal multicellularity. Our analyses also demonstrate the utility of single-cell genomics from uncultured taxa to address evolutionary questions. This article is part of a discussion meeting issue 'Single cell ecology'.
Collapse
Affiliation(s)
- David López-Escardó
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Catalonia, Spain
- Institut de Ciències del Mar (ICM-CSIC), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Catalonia, Spain
| | - Xavier Grau-Bové
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Catalonia, Spain
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, 08028 Barcelona, Catalonia, Spain
- Department of Vector Biology, Liverpool School of Tropical Medicine, Pembroke Place, Liverpool, L3 5QA, UK
| | - Amy Guillaumet-Adkins
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | | | - Iñaki Ruiz-Trillo
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Catalonia, Spain
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, 08028 Barcelona, Catalonia, Spain
- ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
14
|
Abstract
Members of the fibroblast growth factor (FGF) family play pleiotropic roles in cellular and metabolic homeostasis. During evolution, the ancestor FGF expands into multiple members by acquiring divergent structural elements that enable functional divergence and specification. Heparan sulfate-binding FGFs, which play critical roles in embryonic development and adult tissue remodeling homeostasis, adapt to an autocrine/paracrine mode of action to promote cell proliferation and population growth. By contrast, FGF19, 21, and 23 coevolve through losing binding affinity for extracellular matrix heparan sulfate while acquiring affinity for transmembrane α-Klotho (KL) or β-KL as a coreceptor, thereby adapting to an endocrine mode of action to drive interorgan crosstalk that regulates a broad spectrum of metabolic homeostasis. FGF19 metabolic axis from the ileum to liver negatively controls diurnal bile acid biosynthesis. FGF21 metabolic axes play multifaceted roles in controlling the homeostasis of lipid, glucose, and energy metabolism. FGF23 axes from the bone to kidney and parathyroid regulate metabolic homeostasis of phosphate, calcium, vitamin D, and parathyroid hormone that are important for bone health and systemic mineral balance. The significant divergence in structural elements and multiple functional specifications of FGF19, 21, and 23 in cellular and organismal metabolism instead of cell proliferation and growth sufficiently necessitate a new unified and specific term for these three endocrine FGFs. Thus, the term "FGF Metabolic Axis," which distinguishes the unique pathways and functions of endocrine FGFs from other autocrine/paracrine mitogenic FGFs, is coined.
Collapse
Affiliation(s)
- Xiaokun Li
- School of Pharmaceutical Science, Wenzhou Medical University, Wenzhou, 325035, China.
| |
Collapse
|
15
|
Zmasek CM, Knipe DM, Pellett PE, Scheuermann RH. Classification of human Herpesviridae proteins using Domain-architecture Aware Inference of Orthologs (DAIO). Virology 2019; 529:29-42. [PMID: 30660046 PMCID: PMC6502252 DOI: 10.1016/j.virol.2019.01.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Revised: 01/04/2019] [Accepted: 01/04/2019] [Indexed: 12/13/2022]
Abstract
We developed a computational approach called Domain-architecture Aware Inference of Orthologs (DAIO) for the analysis of protein orthology by combining phylogenetic and protein domain-architecture information. Using DAIO, we performed a systematic study of the proteomes of all human Herpesviridae species to define Strict Ortholog Groups (SOGs). In addition to assessing the taxonomic distribution for each protein based on sequence similarity, we performed a protein domain-architecture analysis for every protein family and computationally inferred gene duplication events. While many herpesvirus proteins have evolved without any detectable gene duplications or domain rearrangements, numerous herpesvirus protein families do exhibit complex evolutionary histories. Some proteins acquired additional domains (e.g., DNA polymerase), whereas others show a combination of domain acquisition and gene duplication (e.g., betaherpesvirus US22 family), with possible functional implications. This novel classification system of SOGs for human Herpesviridae proteins is available through the Virus Pathogen Resource (ViPR, www.viprbrc.org).
Collapse
Affiliation(s)
| | - David M Knipe
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Philip E Pellett
- Department of Biochemistry, Microbiology & Immunology, Wayne State University School of Medicine, Detroit, MI 48201, USA
| | - Richard H Scheuermann
- J. Craig Venter Institute, La Jolla, CA 92037, USA; Department of Pathology, University of California, San Diego, CA 92093, USA; Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA 92037, USA.
| |
Collapse
|
16
|
Abstract
This chapter reviews current research on how protein domain architectures evolve. We begin by summarizing work on the phylogenetic distribution of proteins, as this will directly impact which domain architectures can be formed in different species. Studies relating domain family size to occurrence have shown that they generally follow power law distributions, both within genomes and larger evolutionary groups. These findings were subsequently extended to multi-domain architectures. Genome evolution models that have been suggested to explain the shape of these distributions are reviewed, as well as evidence for selective pressure to expand certain domain families more than others. Each domain has an intrinsic combinatorial propensity, and the effects of this have been studied using measures of domain versatility or promiscuity. Next, we study the principles of protein domain architecture evolution and how these have been inferred from distributions of extant domain arrangements. Following this, we review inferences of ancestral domain architecture and the conclusions concerning domain architecture evolution mechanisms that can be drawn from these. Finally, we examine whether all known cases of a given domain architecture can be assumed to have a single common origin (monophyly) or have evolved convergently (polyphyly). We end by a discussion of some available tools for computational analysis or exploitation of protein domain architectures and their evolution.
Collapse
|
17
|
Zhang QL, Zhang GL, Yuan ML, Dong ZX, Li HW, Guo J, Wang F, Deng XY, Chen JY, Lin LB. A Phylogenomic Framework and Divergence History of Cephalochordata Amphioxus. Front Physiol 2018; 9:1833. [PMID: 30618839 PMCID: PMC6305399 DOI: 10.3389/fphys.2018.01833] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2018] [Accepted: 12/06/2018] [Indexed: 11/21/2022] Open
Abstract
Amphioxus, or cephalochordates, are often used as the living invertebrate proxy of vertebrate ancestors and are widely used as evolutionary biology models of chordates. However, their phylogeny, divergence history, and speciation characteristics remain poorly understood, and phylogenomic studies to explore these problems lacking entirely from the literature. Here, we determined a new transcriptome of Branchiostoma japonicum. Combined with mass sequences of all other 18 species, a 19-way phylogeny was constructed via multiple methods (ML, BI, PhyloBayes, and ASTRAL), consistently supporting a phylogeny of [(B. belcheri + B. japonicum) + (B. lanceolatum + B. floridae) + Asymmetron lucayanum] in amphioxus. Congruent phylogenetic signals were found across mitochondrial genes, 12S RNA, and complete mitochondrial genomes according to previous reports, indicating that 12S RNA may have potential as a molecular marker for phylogenetic analysis in amphioxus. Molecular dating analysis indicated a radiation of the cephalochordates during the Cretaceous (∼104-61 million years ago), supporting an association between the diversification and speciation of cephalochordates with continental drift and associated changes in their respective habitats during this time. The identified functional enrichment analysis for species-specific domains indicated that their function mainly involves immune response, apoptosis, and lipid metabolism and utilization, signaling that pathogens and changes of energy requirements are an important driving force for amphioxus speciation. This study represents the first large-scale phylogenomic analysis of most major amphioxus genera based on phylogenomic data, providing a new perspective on both phylogeny and divergence speciation of cephalochordates.
Collapse
Affiliation(s)
- Qi-Lin Zhang
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China.,Evo-Devo Institute, School of Life Sciences, Nanjing University, Nanjing, China
| | - Guan-Ling Zhang
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| | - Ming-Long Yuan
- State Key Laboratory of Grassland Agro-Ecosystems, College of Pastoral Agricultural Science and Technology, Lanzhou University, Lanzhou, China
| | - Zhi-Xiang Dong
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| | - Hong-Wei Li
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| | - Jun Guo
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| | - Feng Wang
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| | - Xian-Yu Deng
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| | - Jun-Yuan Chen
- Evo-Devo Institute, School of Life Sciences, Nanjing University, Nanjing, China.,State Key Laboratory of Palaeobiology and Stratigraphy (LPS), Nanjing Institute of Geology and Palaeontology, Chinese Academy of Sciences, Nanjing, China
| | - Lian-Bing Lin
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
| |
Collapse
|
18
|
Klasberg S, Bitard-Feildel T, Callebaut I, Bornberg-Bauer E. Origins and structural properties of novel and de novo protein domains during insect evolution. FEBS J 2018; 285:2605-2625. [PMID: 29802682 DOI: 10.1111/febs.14504] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2017] [Revised: 04/12/2018] [Accepted: 05/11/2018] [Indexed: 12/11/2022]
Abstract
Over long time scales, protein evolution is characterized by modular rearrangements of protein domains. Such rearrangements are mainly caused by gene duplication, fusion and terminal losses. To better understand domain emergence mechanisms we investigated 32 insect genomes covering a speciation gradient ranging from ~ 2 to ~ 390 mya. We use established domain models and foldable domains delineated by hydrophobic cluster analysis (HCA), which does not require homologous sequences, to also identify domains which have likely arisen de novo, that is, from previously noncoding DNA. Our results indicate that most novel domains emerge terminally as they originate from ORF extensions while fewer arise in middle arrangements, resulting from exonization of intronic or intergenic regions. Many novel domains rapidly migrate between terminal or middle positions and single- and multidomain arrangements. Young domains, such as most HCA-defined domains, are under strong selection pressure as they show signals of purifying selection. De novo domains, linked to ancient domains or defined by HCA, have higher degrees of intrinsic disorder and disorder-to-order transition upon binding than ancient domains. However, the corresponding DNA sequences of the novel domains of de novo origins could only rarely be found in sister genomes. We conclude that novel domains are often recruited by other proteins and undergo important structural modifications shortly after their emergence, but evolve too fast to be characterized by cross-species comparisons alone.
Collapse
Affiliation(s)
- Steffen Klasberg
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Germany
| | - Tristan Bitard-Feildel
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Germany
| |
Collapse
|
19
|
Raboanatahiry N, Wang B, Yu L, Li M. Functional and Structural Diversity of Acyl-coA Binding Proteins in Oil Crops. Front Genet 2018; 9:182. [PMID: 29872448 PMCID: PMC5972291 DOI: 10.3389/fgene.2018.00182] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2018] [Accepted: 05/01/2018] [Indexed: 12/16/2022] Open
Abstract
Diversities in structure and function of ACBP were discussed in this review. ACBP are important proteins that could transport newly synthesized fatty acid, activated into -coA, from plastid to endoplasmic reticulum, where oil in the form of triacylglycerol occurs. ACBP were detected in various animal and plants species, which indicated their importance in biological function. In fact, involvement of ACBP in important process such as lipid metabolism, regulation of enzyme and gene expression, and in response to plant stresses has been proven in several studies. In this review, findings on ACBP of 11 well-known oil crops were reviewed to comprehend diversity, comparative analyses on ACBP structure were made, and link between structure and function, tissue expression and subcellular location of ACBP were also observed. Incomplete reports in some species were mentioned, which might be encouraging to start or to perform deeper studies. Similar characteristics were found in paralogs ACBP, and orthologs ACBP had different functions, despite the high identity in amino acid sequence. At the end, it is confirmed that ortholog proteins could not necessarily display the same function, even from closely related species.
Collapse
Affiliation(s)
- Nadia Raboanatahiry
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China.,Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang Normal University, Huanggang, China
| | - Baoshan Wang
- College of Life Science, Shandong Normal University, Jinan, China
| | - Longjiang Yu
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Maoteng Li
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China.,Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang Normal University, Huanggang, China
| |
Collapse
|
20
|
Jiang F, Liu Q, Wang Y, Zhang J, Wang H, Song T, Yang M, Wang X, Kang L. Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects. Gigascience 2018; 6:1-16. [PMID: 28444351 PMCID: PMC5459927 DOI: 10.1093/gigascience/gix031] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2016] [Accepted: 04/19/2017] [Indexed: 02/07/2023] Open
Abstract
The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain–containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution.
Collapse
Affiliation(s)
- Feng Jiang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Qing Liu
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China.,State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Yanli Wang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.,Institute of Applied Biology, Shanxi University, Taiyuan, Shanxi, China
| | - Jie Zhang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Huimin Wang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Tianqi Song
- Institute of Applied Biology, Shanxi University, Taiyuan, Shanxi, China
| | - Meiling Yang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Xianhui Wang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Le Kang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China.,State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
21
|
Shapiro JA. Living Organisms Author Their Read-Write Genomes in Evolution. BIOLOGY 2017; 6:E42. [PMID: 29211049 PMCID: PMC5745447 DOI: 10.3390/biology6040042] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 11/17/2017] [Accepted: 11/28/2017] [Indexed: 12/18/2022]
Abstract
Evolutionary variations generating phenotypic adaptations and novel taxa resulted from complex cellular activities altering genome content and expression: (i) Symbiogenetic cell mergers producing the mitochondrion-bearing ancestor of eukaryotes and chloroplast-bearing ancestors of photosynthetic eukaryotes; (ii) interspecific hybridizations and genome doublings generating new species and adaptive radiations of higher plants and animals; and, (iii) interspecific horizontal DNA transfer encoding virtually all of the cellular functions between organisms and their viruses in all domains of life. Consequently, assuming that evolutionary processes occur in isolated genomes of individual species has become an unrealistic abstraction. Adaptive variations also involved natural genetic engineering of mobile DNA elements to rewire regulatory networks. In the most highly evolved organisms, biological complexity scales with "non-coding" DNA content more closely than with protein-coding capacity. Coincidentally, we have learned how so-called "non-coding" RNAs that are rich in repetitive mobile DNA sequences are key regulators of complex phenotypes. Both biotic and abiotic ecological challenges serve as triggers for episodes of elevated genome change. The intersections of cell activities, biosphere interactions, horizontal DNA transfers, and non-random Read-Write genome modifications by natural genetic engineering provide a rich molecular and biological foundation for understanding how ecological disruptions can stimulate productive, often abrupt, evolutionary transformations.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
22
|
A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters. Sci Rep 2017; 7:14210. [PMID: 29079755 PMCID: PMC5660162 DOI: 10.1038/s41598-017-13297-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2016] [Accepted: 09/21/2017] [Indexed: 12/28/2022] Open
Abstract
Proteome-scale bioinformatics research is increasingly conducted as the number of completely sequenced genomes increases, but analysis of protein domains (PDs) usually relies on similarity in their amino acid sequences and/or three-dimensional structures. Here, we present results from a bi-clustering analysis on presence/absence data for 6,580 unique PDs in 2,134 species with a sequenced genome, thus covering a complete set of proteins, for the three superkingdoms of life, Bacteria, Archaea, and Eukarya. Our analysis revealed eight distinctive PD clusters, which, following an analysis of enrichment of Gene Ontology functions and CATH classification of protein structures, were shown to exhibit structural and functional properties that are taxa-characteristic. For examples, the largest cluster is ubiquitous in all three superkingdoms, constituting a set of 1,472 persistent domains created early in evolution and retained in living organisms and characterized by basic cellular functions and ancient structural architectures, while an Archaea and Eukarya bi-superkingdom cluster suggests its PDs may have existed in the ancestor of the two superkingdoms, and others are single superkingdom- or taxa (e.g. Fungi)-specific. These results contribute to increase our appreciation of PD diversity and our knowledge of how PDs are used in species, yielding implications on species evolution.
Collapse
|
23
|
Grau-Bové X, Torruella G, Donachie S, Suga H, Leonard G, Richards TA, Ruiz-Trillo I. Dynamics of genomic innovation in the unicellular ancestry of animals. eLife 2017; 6:26036. [PMID: 28726632 PMCID: PMC5560861 DOI: 10.7554/elife.26036] [Citation(s) in RCA: 78] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 07/11/2017] [Indexed: 12/29/2022] Open
Abstract
Which genomic innovations underpinned the origin of multicellular animals is still an open debate. Here, we investigate this question by reconstructing the genome architecture and gene family diversity of ancestral premetazoans, aiming to date the emergence of animal-like traits. Our comparative analysis involves genomes from animals and their closest unicellular relatives (the Holozoa), including four new genomes: three Ichthyosporea and Corallochytrium limacisporum. Here, we show that the earliest animals were shaped by dynamic changes in genome architecture before the emergence of multicellularity: an early burst of gene diversity in the ancestor of Holozoa, enriched in transcription factors and cell adhesion machinery, was followed by multiple and differently-timed episodes of synteny disruption, intron gain and genome expansions. Thus, the foundations of animal genome architecture were laid before the origin of complex multicellularity – highlighting the necessity of a unicellular perspective to understand early animal evolution. DOI:http://dx.doi.org/10.7554/eLife.26036.001 Hundreds of millions of years ago, some single-celled organisms gained the ability to work together and form multicellular organisms. This transition was a major step in evolution and took place at separate times in several parts of the tree of life, including in animals, plants, fungi and algae. Animals are some of the most complex organisms on Earth. Their single-celled ancestors were also quite genetically complex themselves and their genomes (the complete set of the organism’s DNA) already contained many genes that now coordinate the activity of the cells in a multicellular organism. The genome of an animal typically has certain features: it is large, diverse and contains many segments (called introns) that are not genes. By seeing if the single-celled relatives of animals share these traits, it is possible to learn more about when specific genetic features first evolved, and whether they are linked to the origin of animals. Now, Grau-Bové et al. have studied the genomes of several of the animal kingdom’s closest single-celled relatives using a technique called whole genome sequencing. This revealed that there was a period of rapid genetic change in the single-celled ancestors of animals during which their genes became much more diverse. Another ‘explosion’ of diversity happened after animals had evolved. Furthermore, the overall amount of the genomic content inside cells and the number of introns found in the genome rapidly increased in separate, independent events in both animals and their single-celled ancestors. Future research is needed to investigate whether other multicellular life forms – such as plants, fungi and algae – originated in the same way as animal life. Understanding how the genetic material of animals evolved also helps us to understand the genetic structures that affect our health. For example, genes that coordinate the behavior of cells (and so are important for multicellular organisms) also play a role in cancer, where cells break free of this regulation to divide uncontrollably. DOI:http://dx.doi.org/10.7554/eLife.26036.002
Collapse
Affiliation(s)
- Xavier Grau-Bové
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Catalonia, Spain.,Departament de Genètica, Microbiologia i Estadística, Universitat de Barelona, Barcelona, Catalonia, Spain
| | - Guifré Torruella
- Unité d'Ecologie, Systématique et Evolution, Université Paris-Sud/Paris-Saclay, AgroParisTech, Orsay, France
| | - Stuart Donachie
- Department of Microbiology, University of Hawai'i at Mānoa, Honolulu, United States.,Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawai'i at Mānoa, Honolulu, United States
| | - Hiroshi Suga
- Faculty of Life and Environmental Sciences, Prefectural University of Hiroshima, Hiroshima, Japan
| | - Guy Leonard
- Department of Biosciences, University of Exeter, Exeter, United Kingdom
| | - Thomas A Richards
- Department of Biosciences, University of Exeter, Exeter, United Kingdom
| | - Iñaki Ruiz-Trillo
- Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Catalonia, Spain.,Departament de Genètica, Microbiologia i Estadística, Universitat de Barelona, Barcelona, Catalonia, Spain.,ICREA, Passeig Lluís Companys, Barcelona, Catalonia, Spain
| |
Collapse
|
24
|
Raboanatahiry NH, Yin Y, Chen L, Li M. Genome-wide identification and Phylogenic analysis of kelch motif containing ACBP in Brassica napus. BMC Genomics 2015; 16:512. [PMID: 26156054 PMCID: PMC4497377 DOI: 10.1186/s12864-015-1735-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Accepted: 06/29/2015] [Indexed: 11/18/2022] Open
Abstract
Background Acyl-coA binding proteins (ACBPs) bind long chain acyl-CoA esters with very high affinity. Their possible involvement in fatty acid transportation from the plastid to the endoplasmic reticulum, prior to the formation of triacylglycerol has been suggested. Four classes of ACBPs were identified in Arabidopsis thaliana: the small ACBPs, the large ACBPs, the ankyrin repeats containing ACBPs and the kelch motif containing ACBPs. They differed in structure and in size, and showed multiple important functions. In the present study, Brassica napus ACBPs were identified and characterized. Results Eight copies of kelch motif ACBPs were cloned, it showed that B. napus ACBPs shared high amino acid sequence identity with A. thaliana, Brassica rapa and Brassica oleracea. Furthermore, phylogeny based on domain structure and comparison map showed the relationship and the evolution of ACBPs within Brassicaceae family: ACBPs evolved into four separate classes with different structure. Chromosome locations comparison showed conserved syntenic blocks. Conclusions ACBPs were highly conserved in Brassicaceae. They evolved from a common ancestor, but domain duplication and rearrangement might separate them into four distinct classes, with different structure and functions. Otherwise, B. napus inherited kelch motif ACBPs from ancestor conserving chromosomal location, emphasizing preserved synteny block region. This study provided a first insight for exploring ACBPs in B. napus, which supplies a valuable tool for crop improvement in agriculture. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1735-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nadia Haingotiana Raboanatahiry
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China. .,Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang, 435599, China.
| | - Yongtai Yin
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China. .,Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang, 435599, China.
| | - Li Chen
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China. .,Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang, 435599, China.
| | - Maoteng Li
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China. .,Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang, 435599, China.
| |
Collapse
|
25
|
Chang TC, Stergiopoulos I. Evolutionary analysis of the global landscape of protein domain types and domain architectures associated with family 14 carbohydrate-binding modules. FEBS Lett 2015; 589:1813-8. [PMID: 26067847 DOI: 10.1016/j.febslet.2015.05.048] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Revised: 05/11/2015] [Accepted: 05/20/2015] [Indexed: 10/23/2022]
Abstract
Domain promiscuity is a powerful evolutionary force that promotes functional innovation in proteins, thus increasing proteome and organismal complexity. Carbohydrate-binding modules, in particular, are known to partake in complex modular architectures that play crucial roles in numerous biochemical and molecular processes. However, the extent, functional, and evolutionary significance of promiscuity is shrouded in mystery for most CBM families. Here, we analyzed the global promiscuity of family 14 carbohydrate-binding modules (CBM14s) and show that fusion, fission, and reorganization events with numerous other domain types interplayed incessantly in a lineage-dependent manner to likely facilitate species adaptation and functional innovation in the family.
Collapse
Affiliation(s)
- Ti-Cheng Chang
- Department of Plant Pathology, University of California Davis, Davis, CA, USA
| | | |
Collapse
|
26
|
Linkeviciute V, Rackham OJL, Gough J, Oates ME, Fang H. Function-selective domain architecture plasticity potentials in eukaryotic genome evolution. Biochimie 2015; 119:269-77. [PMID: 25980317 PMCID: PMC4679076 DOI: 10.1016/j.biochi.2015.05.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 05/06/2015] [Indexed: 12/20/2022]
Abstract
To help evaluate how protein function impacts on genome evolution, we introduce a new concept of ‘architecture plasticity potential’ – the capacity to form distinct domain architectures – both for an individual domain, or more generally for a set of domains grouped by shared function. We devise a scoring metric to measure the plasticity potential for these domain sets, and evaluate how function has changed over time for different species. Applying this metric to a phylogenetic tree of eukaryotic genomes, we find that the involvement of each function is not random but highly selective. For certain lineages there is strong bias for evolution to involve domains related to certain functions. In general eukaryotic genomes, particularly animals, expand complex functional activities such as signalling and regulation, but at the cost of reducing metabolic processes. We also observe differential evolution of transcriptional regulation and a unique evolutionary role of channel regulators; crucially this is only observable in terms of the architecture plasticity potential. Our findings provide a new layer of information to understand the significance of function in eukaryotic genome evolution. A web search tool, available at http://supfam.org/Pevo, offers a wide spectrum of options for exploring functional importance in eukaryotic genome evolution. A new concept to measure domain architecture plasticity potential in a genome. We reveal the function-selective role in eukaryotic genome evolution. Eukaryotic genomes expand signalling and regulations but reduce metabolism. We observe differential evolution between trans- and cis-acting regulations. We observe a unique role of channel regulators in separating eukaryotic kingdoms.
Collapse
Affiliation(s)
- Viktorija Linkeviciute
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK; School of Biological Sciences, University of Edinburgh, Darwin Building, The King's Buildings, Edinburgh EH9 3BF, UK
| | - Owen J L Rackham
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK; Centre for Computational Biology, Duke-NUS Graduate Medical School, Singapore 169857, Singapore
| | - Julian Gough
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK
| | - Matt E Oates
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK
| | - Hai Fang
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK; Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK.
| |
Collapse
|
27
|
Gnanavel M, Mehrotra P, Rakshambikai R, Martin J, Srinivasan N, Bhaskara RM. CLAP: a web-server for automatic classification of proteins with special reference to multi-domain proteins. BMC Bioinformatics 2014; 15:343. [PMID: 25282152 PMCID: PMC4287353 DOI: 10.1186/1471-2105-15-343] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Accepted: 09/30/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The function of a protein can be deciphered with higher accuracy from its structure than from its amino acid sequence. Due to the huge gap in the available protein sequence and structural space, tools that can generate functionally homogeneous clusters using only the sequence information, hold great importance. For this, traditional alignment-based tools work well in most cases and clustering is performed on the basis of sequence similarity. But, in the case of multi-domain proteins, the alignment quality might be poor due to varied lengths of the proteins, domain shuffling or circular permutations. Multi-domain proteins are ubiquitous in nature, hence alignment-free tools, which overcome the shortcomings of alignment-based protein comparison methods, are required. Further, existing tools classify proteins using only domain-level information and hence miss out on the information encoded in the tethered regions or accessory domains. Our method, on the other hand, takes into account the full-length sequence of a protein, consolidating the complete sequence information to understand a given protein better. RESULTS Our web-server, CLAP (Classification of Proteins), is one such alignment-free software for automatic classification of protein sequences. It utilizes a pattern-matching algorithm that assigns local matching scores (LMS) to residues that are a part of the matched patterns between two sequences being compared. CLAP works on full-length sequences and does not require prior domain definitions.Pilot studies undertaken previously on protein kinases and immunoglobulins have shown that CLAP yields clusters, which have high functional and domain architectural similarity. Moreover, parsing at a statistically determined cut-off resulted in clusters that corroborated with the sub-family level classification of that particular domain family. CONCLUSIONS CLAP is a useful protein-clustering tool, independent of domain assignment, domain order, sequence length and domain diversity. Our method can be used for any set of protein sequences, yielding functionally relevant clusters with high domain architectural homogeneity. The CLAP web server is freely available for academic use at http://nslab.mbu.iisc.ernet.in/clap/.
Collapse
|
28
|
Analysis of the protein domain and domain architecture content in fungi and its application in the search of new antifungal targets. PLoS Comput Biol 2014; 10:e1003733. [PMID: 25033262 PMCID: PMC4102429 DOI: 10.1371/journal.pcbi.1003733] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Accepted: 06/04/2014] [Indexed: 01/25/2023] Open
Abstract
Over the past several years fungal infections have shown an increasing incidence in the susceptible population, and caused high mortality rates. In parallel, multi-resistant fungi are emerging in human infections. Therefore, the identification of new potential antifungal targets is a priority. The first task of this study was to analyse the protein domain and domain architecture content of the 137 fungal proteomes (corresponding to 111 species) available in UniProtKB (UniProt KnowledgeBase) by January 2013. The resulting list of core and exclusive domain and domain architectures is provided in this paper. It delineates the different levels of fungal taxonomic classification: phylum, subphylum, order, genus and species. The analysis highlighted Aspergillus as the most diverse genus in terms of exclusive domain content. In addition, we also investigated which domains could be considered promiscuous in the different organisms. As an application of this analysis, we explored three different ways to detect potential targets for antifungal drugs. First, we compared the domain and domain architecture content of the human and fungal proteomes, and identified those domains and domain architectures only present in fungi. Secondly, we looked for information regarding fungal pathways in public repositories, where proteins containing promiscuous domains could be involved. Three pathways were identified as a result: lovastatin biosynthesis, xylan degradation and biosynthesis of siroheme. Finally, we classified a subset of the studied fungi in five groups depending on their occurrence in clinical samples. We then looked for exclusive domains in the groups that were more relevant clinically and determined which of them had the potential to bind small molecules. Overall, this study provides a comprehensive analysis of the available fungal proteomes and shows three approaches that can be used as a first step in the detection of new antifungal targets. Some fungi have become pathogenic to plants and in a lesser extent to animals. Under certain conditions their presence in the human body can prove a threat for human health, especially for immunocompromised patients. Yet, some fungi can also infect healthy individuals. The low sensitivity of the antifungal drugs available together with the clinically observed resistance of some fungi raises the demand for new alternative treatments. Proteins are biological molecules which perform essential functions within the living organisms. Many of those functions are attributed to the varying folded structure of each protein. These configurations are composed of functional units -also called domains- each one independently responsible for a fraction of the overall biological function. Understanding how the different block combinations are distributed across members of the same or similar families of organisms is important. For instance, exclusive domain combinations can hold particular acquired functions. Blocks displaying a high mobility can play major roles for the organism's survival. The biological goal of this study was to analyse the functional implications of protein domains and domain combinations in the available fungal proteomes. This information can be used to highlight proteins and pathways that could be potentially used as drug targets.
Collapse
|
29
|
Mohanty S, Purwar M, Srinivasan N, Rekha N. Tethering preferences of domain families co-occurring in multi-domain proteins. MOLECULAR BIOSYSTEMS 2013; 9:1708-25. [PMID: 23571467 DOI: 10.1039/c3mb25481j] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Genomic data of several organisms have revealed the presence of a vast repertoire of multi-domain proteins. The role played by individual domains in a multi-domain protein has a profound influence on the overall function of the protein. In the present analysis an attempt has been made to better understand the tethering preferences of domain families that occur in multi-domain proteins. The analysis has been carried out on an exhaustive dataset of 2 961 898 sequences of proteins from 930 organisms, where 741 274 proteins are comprised of at least two domain families. For every domain family, the number of other domain families with which it co-occurs within a protein in this dataset has been enumerated and is referred to as the tethering number of the domain family. It was found that, in the general dataset, the AAA ATPase family and the family of Ser/Thr kinases have the highest tethering numbers of 450 and 444 respectively. Further analysis reveals significant correlation between the number of members in a family and its tethering number. Positive correlation was also observed for the extent of a sequence and functional diversity within a family and the tethering numbers of domain families. Domain families that are present ubiquitously in diverse organisms tend to have large tethering numbers, while organism/kingdom-specific families have low tethering numbers. Thus, the analysis uncovers how domain families recombine and evolve to give rise to multi-domain proteins.
Collapse
Affiliation(s)
- Smita Mohanty
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | | | | | | |
Collapse
|
30
|
Bornberg-Bauer E, Albà MM. Dynamics and adaptive benefits of modular protein evolution. Curr Opin Struct Biol 2013; 23:459-66. [PMID: 23562500 DOI: 10.1016/j.sbi.2013.02.012] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 11/29/2022]
Abstract
During protein evolution, novel domain arrangements are continuously formed. Rearrangements are important for the creation of molecular biodiversity and for functional molecular changes which underlie developmental shifts in the bauplan of organisms. Here we review the mechanisms by which new arrangements arise and the potential benefits of rearrangements. We concentrate on how new domains emerge and why they rapidly spread across genomes, gaining higher copy numbers than older, more established domains. This spread is most likely a consequence of their high adaptive potential but is unlikely to make up on its own for the drastic loss of domains, which is observed across different taxa. We show that a significant portion of the recently emerged domains, especially those in multidomain families, are highly disordered and speculate about the significance of these findings for the evolvability of novel genetic material.
Collapse
Affiliation(s)
- Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, School of Biological Sciences, University of Münster, Hüfferstrasse 1, D48149 Münster, Germany.
| | | |
Collapse
|
31
|
Zmasek CM, Godzik A. This Déjà vu feeling--analysis of multidomain protein evolution in eukaryotic genomes. PLoS Comput Biol 2012; 8:e1002701. [PMID: 23166479 PMCID: PMC3499355 DOI: 10.1371/journal.pcbi.1002701] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2012] [Accepted: 07/27/2012] [Indexed: 12/31/2022] Open
Abstract
Evolutionary innovation in eukaryotes and especially animals is at least partially driven by genome rearrangements and the resulting emergence of proteins with new domain combinations, and thus potentially novel functionality. Given the random nature of such rearrangements, one could expect that proteins with particularly useful multidomain combinations may have been rediscovered multiple times by parallel evolution. However, existing reports suggest a minimal role of this phenomenon in the overall evolution of eukaryotic proteomes. We assembled a collection of 172 complete eukaryotic genomes that is not only the largest, but also the most phylogenetically complete set of genomes analyzed so far. By employing a maximum parsimony approach to compare repertoires of Pfam domains and their combinations, we show that independent evolution of domain combinations is significantly more prevalent than previously thought. Our results indicate that about 25% of all currently observed domain combinations have evolved multiple times. Interestingly, this percentage is even higher for sets of domain combinations in individual species, with, for instance, 70% of the domain combinations found in the human genome having evolved independently at least once in other species. We also show that previous, much lower estimates of this rate are most likely due to the small number and biased phylogenetic distribution of the genomes analyzed. The process of independent emergence of identical domain combination is widespread, not limited to domains with specific functional categories. Besides data from large-scale analyses, we also present individual examples of independent domain combination evolution. The surprisingly large contribution of parallel evolution to the development of the domain combination repertoire in extant genomes has profound consequences for our understanding of the evolution of pathways and cellular processes in eukaryotes and for comparative functional genomics. Most proteins in eukaryotes are composed of two or more domains, evolutionary independent units with (often) their own individual functions. The specific repertoire of multidomain proteins in a given species defines the topology of pathways and networks that carry out its metabolic and regulatory processes. When proteins with new domain combinations emerge by gene fusion and fission, it directly affects topology of cellular networks in this organism. To better understand the evolution of such networks we analyzed a large set of eukaryotic genomes for the evolutionary history of known domain combinations. Our analysis shows that 70% of all domain combinations present in the human genome independently appeared in at least one other eukaryotic genome. Overall, over 25% of all known multidomain architectures emerged independently several times in the history of life. The difference between a global and species specific picture can be explained by the existence of a core set of domain combinations that keeps reemerging in different species, which are accompanied by a smaller number of unique domain combinations that do not appear anywhere else.
Collapse
Affiliation(s)
- Christian M. Zmasek
- Program in Bioinformatics and Systems Biology, Sanford-Burnham Medical Research Institute, La Jolla, California, United States of America
- * E-mail: (CMZ); (AG)
| | - Adam Godzik
- Program in Bioinformatics and Systems Biology, Sanford-Burnham Medical Research Institute, La Jolla, California, United States of America
- * E-mail: (CMZ); (AG)
| |
Collapse
|
32
|
Yue JX, Meyers BC, Chen JQ, Tian D, Yang S. Tracing the origin and evolutionary history of plant nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes. THE NEW PHYTOLOGIST 2012; 193:1049-1063. [PMID: 22212278 DOI: 10.1111/j.1469-8137.2011.04006.x] [Citation(s) in RCA: 144] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Plant disease resistance genes (R genes) encode proteins that function to monitor signals indicating pathogenic infection, thus playing a critical role in the plant's defense system. Although many studies have been performed to explore the functional details of these important genes, their origin and evolutionary history remain unclear. In this study, focusing on the largest group of R genes, the nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes, we conducted an extensive genome-wide survey of 38 representative model organisms and obtained insights into the evolutionary stage and timing of NBS-LRR genes. Our data show that the two major domains, NBS and LRR, existed before the split of prokaryotes and eukaryotes but their fusion was observed only in land plant lineages. The Toll/interleukin-1 receptor (TIR) class of NBS-LRR genes probably had an earlier origin than its nonTIR counterpart. The similarities of the innate immune systems of plants and animals are likely to have been shaped by convergent evolution after their independent origins. Our findings start to unravel the evolutionary history of these important genes from the perspective of comparative genomics and also highlight the important role of reorganizing pre-existing building blocks in generating evolutionary novelties.
Collapse
Affiliation(s)
- Jia-Xing Yue
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210093, China
- Department of Ecology and Evolutionary Biology, Rice University, Houston, TX 77005, USA
| | - Blake C Meyers
- Department of Plant and Soil Sciences, and Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| | - Jian-Qun Chen
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210093, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210093, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210093, China
| |
Collapse
|
33
|
Zhang XC, Wang Z, Zhang X, Le MH, Sun J, Xu D, Cheng J, Stacey G. Evolutionary dynamics of protein domain architecture in plants. BMC Evol Biol 2012; 12:6. [PMID: 22252370 PMCID: PMC3310802 DOI: 10.1186/1471-2148-12-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2011] [Accepted: 01/17/2012] [Indexed: 12/17/2022] Open
Abstract
Background Protein domains are the structural, functional and evolutionary units of the protein. Protein domain architectures are the linear arrangements of domain(s) in individual proteins. Although the evolutionary history of protein domain architecture has been extensively studied in microorganisms, the evolutionary dynamics of domain architecture in the plant kingdom remains largely undefined. To address this question, we analyzed the lineage-based protein domain architecture content in 14 completed green plant genomes. Results Our analyses show that all 14 plant genomes maintain similar distributions of species-specific, single-domain, and multi-domain architectures. Approximately 65% of plant domain architectures are universally present in all plant lineages, while the remaining architectures are lineage-specific. Clear examples are seen of both the loss and gain of specific protein architectures in higher plants. There has been a dynamic, lineage-wise expansion of domain architectures during plant evolution. The data suggest that this expansion can be largely explained by changes in nuclear ploidy resulting from rounds of whole genome duplications. Indeed, there has been a decrease in the number of unique domain architectures when the genomes were normalized into a presumed ancestral genome that has not undergone whole genome duplications. Conclusions Our data show the conservation of universal domain architectures in all available plant genomes, indicating the presence of an evolutionarily conserved, core set of protein components. However, the occurrence of lineage-specific domain architectures indicates that domain architecture diversity has been maintained beyond these core components in plant genomes. Although several features of genome-wide domain architecture content are conserved in plants, the data clearly demonstrate lineage-wise, progressive changes and expansions of individual protein domain architectures, reinforcing the notion that plant genomes have undergone dynamic evolution.
Collapse
Affiliation(s)
- Xue-Cheng Zhang
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA.
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
This chapter reviews the current research on how protein domain architectures evolve. We begin by summarizing work on the phylogenetic distribution of proteins, as this directly impacts which domain architectures can be formed in different species. Studies relating domain family size to occurrence have shown that they generally follow power law distributions, both within genomes and larger evolutionary groups. These findings were subsequently extended to multidomain architectures. Genome evolution models that have been suggested to explain the shape of these distributions are reviewed, as well as evidence for selective pressure to expand certain domain families more than others. Each domain has an intrinsic combinatorial propensity, and the effects of this have been studied using measures of domain versatility or promiscuity. Next, we study the principles of protein domain architecture evolution and how these have been inferred from distributions of extant domain arrangements. Following this, we review inferences of ancestral domain architecture and the conclusions concerning domain architecture evolution mechanisms that can be drawn from these. Finally, we examine whether all known cases of a given domain architecture can be assumed to have a single common origin (monophyly) or have evolved convergently (polyphyly).
Collapse
|
35
|
Parikesit AA, Stadler PF, Prohaska SJ. Evolution and quantitative comparison of genome-wide protein domain distributions. Genes (Basel) 2011; 2:912-24. [PMID: 24710298 PMCID: PMC3927604 DOI: 10.3390/genes2040912] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2011] [Revised: 10/07/2011] [Accepted: 10/25/2011] [Indexed: 02/01/2023] Open
Abstract
The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey essentially the same information and at the same time are much more readily accessible, although protein domain models trained for one phylogenetic group frequently fail on distantly related sequences. Pooling related domain models based on their GO-annotation in combination with de novo gene prediction methods provides estimates that seem to be less affected by phylogenetic biases. We show here for 18 diverse representatives from all eukaryotic kingdoms that a pooled analysis of the tendencies for co-occurrence or avoidance of protein domains is indeed feasible. This type of analysis can reveal general large-scale patterns in the domain co-occurrence and helps to identify lineage-specific variations in the evolution of protein domains. Somewhat surprisingly, we do not find strong ubiquitous patterns governing the evolutionary behavior of specific functional classes. Instead, there are strong variations between the major groups of Eukaryotes, pointing at systematic differences in their evolutionary constraints.
Collapse
Affiliation(s)
- Arli A Parikesit
- Computational EvoDevo Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany.
| | - Peter F Stadler
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany.
| | - Sonja J Prohaska
- Computational EvoDevo Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany.
| |
Collapse
|
36
|
Moore AD, Bornberg-Bauer E. The dynamics and evolutionary potential of domain loss and emergence. Mol Biol Evol 2011; 29:787-96. [PMID: 22016574 PMCID: PMC3258042 DOI: 10.1093/molbev/msr250] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The wealth of available genomic data presents an unrivaled opportunity to study the molecular basis of evolution. Studies on gene family expansions and site-dependent analyses have already helped establish important insights into how proteins facilitate adaptation. However, efforts to conduct full-scale cross-genomic comparisons between species are challenged by both growing amounts of data and the inherent difficulty in accurately inferring homology between deeply rooted species. Proteins, in comparison, evolve by means of domain rearrangements, a process more amenable to study given the strength of profile-based homology inference and the lower rates with which rearrangements occur. However, adapting to a constantly changing environment can require molecular modulations beyond reach of rearrangement alone. Here, we explore rates and functional implications of novel domain emergence in contrast to domain gain and loss in 20 arthropod species of the pancrustacean clade. Emerging domains are more likely disordered in structure and spread more rapidly within their genomes than established domains. Furthermore, although domain turnover occurs at lower rates than gene family turnover, we find strong evidence that the emergence of novel domains is foremost associated with environmental adaptation such as abiotic stress response. The results presented here illustrate the simplicity with which domain-based analyses can unravel key players of nature's adaptational machinery, complementing the classical site-based analyses of adaptation.
Collapse
Affiliation(s)
- Andrew D Moore
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, University of Muenster, Germany
| | | |
Collapse
|
37
|
Xie X, Jin J, Mao Y. Evolutionary versatility of eukaryotic protein domains revealed by their bigram networks. BMC Evol Biol 2011; 11:242. [PMID: 21849086 PMCID: PMC3167776 DOI: 10.1186/1471-2148-11-242] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2011] [Accepted: 08/18/2011] [Indexed: 11/21/2022] Open
Abstract
Background Protein domains are globular structures of independently folded polypeptides that exert catalytic or binding activities. Their sequences are recognized as evolutionary units that, through genome recombination, constitute protein repertoires of linkage patterns. Via mutations, domains acquire modified functions that contribute to the fitness of cells and organisms. Recent studies have addressed the evolutionary selection that may have shaped the functions of individual domains and the emergence of particular domain combinations, which led to new cellular functions in multi-cellular animals. This study focuses on modeling domain linkage globally and investigates evolutionary implications that may be revealed by novel computational analysis. Results A survey of 77 completely sequenced eukaryotic genomes implies a potential hierarchical and modular organization of biological functions in most living organisms. Domains in a genome or multiple genomes are modeled as a network of hetero-duplex covalent linkages, termed bigrams. A novel computational technique is introduced to decompose such networks, whereby the notion of domain "networking versatility" is derived and measured. The most and least "versatile" domains (termed "core domains" and "peripheral domains" respectively) are examined both computationally via sequence conservation measures and experimentally using selected domains. Our study suggests that such a versatility measure extracted from the bigram networks correlates with the adaptivity of domains during evolution, where the network core domains are highly adaptive, significantly contrasting the network peripheral domains. Conclusions Domain recombination has played a major part in the evolution of eukaryotes attributing to genome complexity. From a system point of view, as the results of selection and constant refinement, networks of domain linkage are structured in a hierarchical modular fashion. Domains with high degree of networking versatility appear to be evolutionary adaptive, potentially through functional innovations. Domain bigram networks are informative as a model of biological functions. The networking versatility indices extracted from such networks for individual domains reflect the strength of evolutionary selection that the domains have experienced.
Collapse
Affiliation(s)
- Xueying Xie
- Research Center for Learning Science, Southeast University, Sipai Lou 2, Nanjing 210096 China.
| | | | | |
Collapse
|
38
|
Cohen-Gihon I, Fong JH, Sharan R, Nussinov R, Przytycka TM, Panchenko AR. Evolution of domain promiscuity in eukaryotic genomes--a perspective from the inferred ancestral domain architectures. MOLECULAR BIOSYSTEMS 2011; 7:784-92. [PMID: 21127809 PMCID: PMC3321261 DOI: 10.1039/c0mb00182a] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Most eukaryotic proteins are composed of two or more domains. These assemble in a modular manner to create new proteins usually by the acquisition of one or more domains to an existing protein. Promiscuous domains which are found embedded in a variety of proteins and co-exist with many other domains are of particular interest and were shown to have roles in signaling pathways and mediating network communication. The evolution of domain promiscuity is still an open problem, mostly due to the lack of sequenced ancestral genomes. Here we use inferred domain architectures of ancestral genomes to trace the evolution of domain promiscuity in eukaryotic genomes. We find an increase in average promiscuity along many branches of the eukaryotic tree. Moreover, domain promiscuity can proceed at almost a steady rate over long evolutionary time or exhibit lineage-specific acceleration. We also observe that many signaling and regulatory domains gained domain promiscuity around the Bilateria divergence. In addition we show that those domains that played a role in the creation of two body axes and existed before the divergence of the bilaterians from fungi/metazoan achieve a boost in their promiscuities during the bilaterian evolution.
Collapse
Affiliation(s)
- Inbar Cohen-Gihon
- Sackler Institute of Molecular Medicine, Department of Human Genetics, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Jessica H. Fong
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Roded Sharan
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel
| | - Ruth Nussinov
- Sackler Institute of Molecular Medicine, Department of Human Genetics, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
- Center for Cancer Research Nanobiology Program, SAIC-Frederick, Inc., NCI-Frederick, Frederick, MD 21702, USA
| | - Teresa M. Przytycka
- Center for Cancer Research Nanobiology Program, SAIC-Frederick, Inc., NCI-Frederick, Frederick, MD 21702, USA
| | - Anna R. Panchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
39
|
Meng W, Su YCF, Saunders RMK, Chye ML. The rice acyl-CoA-binding protein gene family: phylogeny, expression and functional analysis. THE NEW PHYTOLOGIST 2011; 189:1170-1184. [PMID: 21128943 DOI: 10.1111/j.1469-8137.2010.03546.x] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
• Acyl-CoA-binding proteins (ACBPs) show conservation in an acyl-CoA-binding domain (ACB domain) which binds acyl-CoA esters. Previous studies on plant ACBPs focused on eudicots, Arabidopsis and Brassica. Here, we report on the phylogeny and characterization of the ACBP family from the monocot Oryza sativa (rice). • Phylogenetic analyses were conducted using 16 plant genomes. Expression profiles of rice ACBPs under normal growth, as well as biotic and abiotic stress conditions, were examined by quantitative real-time reverse-transcription polymerase chain reactions. In vitro acyl-CoA-binding assays were conducted using recombinant (His)₆-tagged ACBPs. • The ACBP family diversified as land plants evolved. Classes I and IV show lineage-specific gene expansion. Classes II and III are closely related phylogenetically. As in the eudicot Arabidopsis, six genes (designated OsACBP1 to OsACBP6) encode rice ACBPs, but their distribution into various classes differed from Arabidopsis. Rice ACBP mRNAs showed ubiquitous expression and OsACBP4, OsACBP5 and OsACBP6 were stress-responsive. All recombinant rice ACBPs bind [¹⁴C]linolenoyl-CoA besides having specific substrates. • Phylogeny, gene expression and biochemical analyses suggest that paralogues within and across classes are not redundant proteins. In addition to performing conserved basal functions, multidomain rice ACBPs appear to be associated with stress responses.
Collapse
Affiliation(s)
- Wei Meng
- School of Biological Sciences, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Yvonne C F Su
- School of Biological Sciences, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Richard M K Saunders
- School of Biological Sciences, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Mee-Len Chye
- School of Biological Sciences, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| |
Collapse
|
40
|
Buljan M, Frankish A, Bateman A. Quantifying the mechanisms of domain gain in animal proteins. Genome Biol 2010; 11:R74. [PMID: 20633280 PMCID: PMC2926785 DOI: 10.1186/gb-2010-11-7-r74] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2010] [Revised: 06/04/2010] [Accepted: 07/15/2010] [Indexed: 11/21/2022] Open
Abstract
Background Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Results Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. Conclusions The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
Collapse
Affiliation(s)
- Marija Buljan
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | | |
Collapse
|
41
|
The evolutionary history of protein domains viewed by species phylogeny. PLoS One 2009; 4:e8378. [PMID: 20041107 PMCID: PMC2794708 DOI: 10.1371/journal.pone.0008378] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2009] [Accepted: 11/16/2009] [Indexed: 11/20/2022] Open
Abstract
Background Protein structural domains are evolutionary units whose relationships can be detected over long evolutionary distances. The evolutionary history of protein domains, including the origin of protein domains, the identification of domain loss, transfer, duplication and combination with other domains to form new proteins, and the formation of the entire protein domain repertoire, are of great interest. Methodology/Principal Findings A methodology is presented for providing a parsimonious domain history based on gain, loss, vertical and horizontal transfer derived from the complete genomic domain assignments of 1015 organisms across the tree of life. When mapped to species trees the evolutionary history of domains and domain combinations is revealed, and the general evolutionary trend of domain and combination is analyzed. Conclusions/Significance We show that this approach provides a powerful tool to study how new proteins and functions emerged and to study such processes as horizontal gene transfer among more distant species.
Collapse
|
42
|
Valas RE, Yang S, Bourne PE. Nothing about protein structure classification makes sense except in the light of evolution. Curr Opin Struct Biol 2009; 19:329-34. [PMID: 19394812 DOI: 10.1016/j.sbi.2009.03.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2008] [Revised: 02/19/2009] [Accepted: 03/16/2009] [Indexed: 12/27/2022]
Abstract
In this, the 200th anniversary of Charles Darwin's birth and the 150th anniversary of the publication of the Origin of Species, it is fitting to revisit the classification of protein structures from an evolutionary perspective. Existing classifications use homologous sequence relationships, but knowing that structure is much more conserved that sequence creates an iterative loop from which structures can be further classified beyond that of the domain, thereby teasing out distant evolutionary relationships. The desired classification scheme is then one in which a fold is merely semantics and structure can be classified as either ancestral or derived.
Collapse
Affiliation(s)
- Ruben E Valas
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093-0743, USA
| | | | | |
Collapse
|
43
|
Weiner J, Moore AD, Bornberg-Bauer E. Just how versatile are domains? BMC Evol Biol 2008; 8:285. [PMID: 18854028 PMCID: PMC2588589 DOI: 10.1186/1471-2148-8-285] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2008] [Accepted: 10/14/2008] [Indexed: 11/17/2022] Open
Abstract
Background Creating new protein domain arrangements is a frequent mechanism of evolutionary innovation. While some domains always form the same combinations, others form many different arrangements. This ability, which is often referred to as versatility or promiscuity of domains, its a random evolutionary model in which a domain's promiscuity is based on its relative frequency of domains. Results We show that there is a clear relationship across genomes between the promiscuity of a given domain and its frequency. However, the strength of this relationship differs for different domains. We thus redefine domain promiscuity by defining a new index, DV I ("domain versatility index"), which eliminates the effect of domain frequency. We explore links between a domain's versatility, when unlinked from abundance, and its biological properties. Conclusion Our results indicate that domains occurring as single domain proteins and domains appearing frequently at protein termini have a higher DV I. This is consistent with previous observations that the evolution of domain re-arrangements is primarily driven by fusion of pre-existing arrangements and single domains as well as loss of domains at protein termini. Furthermore, we studied the link between domain age, defined as the first appearance of a domain in the species tree, and the DV I. Contrary to previous studies based on domain promiscuity, it seems as if the DV I is age independent. Finally, we find that contrary to previously reported findings, versatility is lower in Eukaryotes. In summary, our measure of domain versatility indicates that a random attachment process is sufficient to explain the observed distribution of domain arrangements and that several views on domain promiscuity need to be revised.
Collapse
Affiliation(s)
- January Weiner
- Institute for Evolution and Biodiversity, Evolutionary Bioinformatics Group, Westphalian Wilhelms-University, Münster, Germany.
| | | | | |
Collapse
|
44
|
Castro MAA, Dalmolin RJS, Moreira JCF, Mombach JCM, de Almeida RMC. Evolutionary origins of human apoptosis and genome-stability gene networks. Nucleic Acids Res 2008; 36:6269-83. [PMID: 18832373 PMCID: PMC2577361 DOI: 10.1093/nar/gkn636] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Apoptosis is essential for complex multicellular organisms and its failure is associated with genome instability and cancer. Interactions between apoptosis and genome-maintenance mechanisms have been extensively documented and include transactivation-independent and -dependent functions, in which the tumor-suppressor protein p53 works as a 'molecular node' in the DNA-damage response. Although apoptosis and genome stability have been identified as ancient pathways in eukaryote phylogeny, the biological evolution underlying the emergence of an integrated system remains largely unknown. Here, using computational methods, we reconstruct the evolutionary scenario that linked apoptosis with genome stability pathways in a functional human gene/protein association network. We found that the entanglement of DNA repair, chromosome stability and apoptosis gene networks appears with the caspase gene family and the antiapoptotic gene BCL2. Also, several critical nodes that entangle apoptosis and genome stability are cancer genes (e.g. ATM, BRCA1, BRCA2, MLH1, MSH2, MSH6 and TP53), although their orthologs have arisen in different points of evolution. Our results demonstrate how genome stability and apoptosis were co-opted during evolution recruiting genes that merge both systems. We also provide several examples to exploit this evolutionary platform, where we have judiciously extended information on gene essentiality inferred from model organisms to human.
Collapse
Affiliation(s)
- Mauro A A Castro
- Bioinformatics Unit, Department of Biochemistry, Federal University of Rio Grande do Sul (UFRGS), Rua Ramiro Barcelos 2600-anexo, Porto Alegre 90035-003, Brazil.
| | | | | | | | | |
Collapse
|
45
|
Grunt M, Žárský V, Cvrčková F. Roots of angiosperm formins: the evolutionary history of plant FH2 domain-containing proteins. BMC Evol Biol 2008; 8:115. [PMID: 18430232 PMCID: PMC2386819 DOI: 10.1186/1471-2148-8-115] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2007] [Accepted: 04/22/2008] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Shuffling of modular protein domains is an important source of evolutionary innovation. Formins are a family of actin-organizing proteins that share a conserved FH2 domain but their overall domain architecture differs dramatically between opisthokonts (metazoans and fungi) and plants. We performed a phylogenomic analysis of formins in most eukaryotic kingdoms, aiming to reconstruct an evolutionary scenario that may have produced the current diversity of domain combinations with focus on the origin of the angiosperm formin architectures. RESULTS The Rho GTPase-binding domain (GBD/FH3) reported from opisthokont and Dictyostelium formins was found in all lineages except plants, suggesting its ancestral character. Instead, mosses and vascular plants possess the two formin classes known from angiosperms: membrane-anchored Class I formins and Class II formins carrying a PTEN-like domain. PTEN-related domains were found also in stramenopile formins, where they have been probably acquired independently rather than by horizontal transfer, following a burst of domain rearrangements in the chromalveolate lineage. A novel RhoGAP-related domain was identified in some algal, moss and lycophyte (but not angiosperm) formins that define a specific branch (Class III) of the formin family. CONCLUSION We propose a scenario where formins underwent multiple domain rearrangements in several eukaryotic lineages, especially plants and chromalveolates. In plants this replaced GBD/FH3 by a probably inactive RhoGAP-like domain, preserving a formin-mediated association between (membrane-anchored) Rho GTPases and the actin cytoskeleton. Subsequent amplification of formin genes, possibly coincident with the expansion of plants to dry land, was followed by acquisition of alternative membrane attachment mechanisms present in extant Class I and Class II formins, allowing later loss of the RhoGAP-like domain-containing formins in angiosperms.
Collapse
Affiliation(s)
- Michal Grunt
- Department of Plant Physiology, Faculty of Sciences, Charles University, Vinièná 5, CZ 128 43 Praha 2, Czech Republic
| | - Viktor Žárský
- Department of Plant Physiology, Faculty of Sciences, Charles University, Vinièná 5, CZ 128 43 Praha 2, Czech Republic
- Institute of Experimental Botany, Academy of Sciences of the Czech Republic, Rozvojová 135, CZ 165 02 Praha 6, Czech Republic
| | - Fatima Cvrčková
- Department of Plant Physiology, Faculty of Sciences, Charles University, Vinièná 5, CZ 128 43 Praha 2, Czech Republic
| |
Collapse
|
46
|
Basu MK, Carmel L, Rogozin IB, Koonin EV. Evolution of protein domain promiscuity in eukaryotes. Genome Res 2008; 18:449-61. [PMID: 18230802 DOI: 10.1101/gr.6943508] [Citation(s) in RCA: 134] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Numerous eukaryotic proteins contain multiple domains. Certain domains show a tendency to occur in diverse domain architectures and can be considered "promiscuous." These promiscuous domains are, typically, involved in protein-protein interactions and play crucial roles in interaction networks, particularly those that contribute to signal transduction. A systematic comparative-genomic analysis of promiscuous domains in eukaryotes is described. Two quantitative measures of domain promiscuity are introduced and applied to the analysis of 28 genomes of diverse eukaryotes. Altogether, 215 domains are identified as strongly promiscuous. The fraction of promiscuous domains in animals is shown to be significantly greater than that in fungi or plants. Evolutionary reconstructions indicate that domain promiscuity is a volatile, relatively fast-changing feature of eukaryotic proteins, with few domains remaining promiscuous throughout the evolution of eukaryotes. Some domains appear to have attained promiscuity independently in different lineages, for example, animals and plants. It is proposed that promiscuous domains persist within a relatively small pool of evolutionarily stable domain combinations from which numerous rare architectures emerge during evolution. Domain promiscuity positively correlates with the number of experimentally detected domain interactions and with the strength of purifying selection affecting a domain. Thus, evolution of promiscuous domains seems to be constrained by the diversity of their interaction partners. The set of promiscuous domains is enriched for domains mediating protein-protein interactions that are involved in various forms of signal transduction, especially in the ubiquitin system and in chromatin. Thus, a limited repertoire of promiscuous domains makes a major contribution to the diversity and evolvability of eukaryotic proteomes and signaling networks.
Collapse
Affiliation(s)
- Malay Kumar Basu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | | | | | | |
Collapse
|