Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kristensen DM, Wolf YI, Mushegian AR, Koonin EV. Computational methods for Gene Orthology inference. Brief Bioinform 2011;12:379-91. [PMID: 21690100 DOI: 10.1093/bib/bbr030] [Citation(s) in RCA: 150] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

For:	Kristensen DM, Wolf YI, Mushegian AR, Koonin EV. Computational methods for Gene Orthology inference. Brief Bioinform 2011;12:379-91. [PMID: 21690100 DOI: 10.1093/bib/bbr030] [Citation(s) in RCA: 150] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Number

Cited by Other Article(s)

Cosentino S, Sriswasdi S, Iwasaki W. SonicParanoid2: fast, accurate, and comprehensive orthology inference with machine learning and language models. Genome Biol 2024;25:195. [PMID: 39054525 PMCID: PMC11270883 DOI: 10.1186/s13059-024-03298-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 06/04/2024] [Indexed: 07/27/2024] Open

Aufiero G, Fruggiero C, D’Angelo D, D’Agostino N. Homoeologs in Allopolyploids: Navigating Redundancy as Both an Evolutionary Opportunity and a Technical Challenge-A Transcriptomics Perspective. Genes (Basel) 2024;15:977. [PMID: 39202338 PMCID: PMC11353593 DOI: 10.3390/genes15080977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 07/22/2024] [Accepted: 07/23/2024] [Indexed: 09/03/2024] Open

Abstract

Allopolyploidy in plants involves the merging of two or more distinct parental genomes into a single nucleus, a significant evolutionary process in the plant kingdom. Transcriptomic analysis provides invaluable insights into allopolyploid plants by elucidating the fate of duplicated genes, revealing evolutionary novelties and uncovering their environmental adaptations. By examining gene expression profiles, scientists can discern how duplicated genes have evolved to acquire new functions or regulatory roles. This process often leads to the development of novel traits and adaptive strategies that allopolyploid plants leverage to thrive in diverse ecological niches. Understanding these molecular mechanisms not only enhances our appreciation of the genetic complexity underlying allopolyploidy but also underscores their importance in agriculture and ecosystem resilience. However, transcriptome profiling is challenging due to genomic redundancy, which is further complicated by the presence of multiple chromosomes sets and the variations among homoeologs and allelic genes. Prior to transcriptome analysis, sub-genome phasing and homoeology inference are essential for obtaining a comprehensive view of gene expression. This review aims to clarify the terminology in this field, identify the most challenging aspects of transcriptome analysis, explain their inherent difficulties, and suggest reliable analytic strategies. Furthermore, bulk RNA-seq is highlighted as a primary method for studying allopolyploid gene expression, focusing on critical steps like read mapping and normalization in differential gene expression analysis. This approach effectively captures gene expression from both parental genomes, facilitating a comprehensive analysis of their combined profiles. Its sensitivity in detecting low-abundance transcripts allows for subtle differences between parental genomes to be identified, crucial for understanding regulatory dynamics and gene expression balance in allopolyploids.

Collapse

Guo L, Wang S, Jiao X, Ye X, Deng D, Liu H, Li Y, Van de Peer Y, Wu W. Convergent and/or parallel evolution of RNA-binding proteins in angiosperms after polyploidization. THE NEW PHYTOLOGIST 2024;242:1377-1393. [PMID: 38436132 DOI: 10.1111/nph.19656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Accepted: 02/20/2024] [Indexed: 03/05/2024]

Steinbinder J, Sachslehner AP, Holthaus KB, Eckhart L. Comparative genomics of monotremes provides insights into the early evolution of mammalian epidermal differentiation genes. Sci Rep 2024;14:1437. [PMID: 38228724 PMCID: PMC10791643 DOI: 10.1038/s41598-024-51926-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 01/11/2024] [Indexed: 01/18/2024] Open

Singh V, Singh V. Inferring Interaction Networks from Transcriptomic Data: Methods and Applications. Methods Mol Biol 2024;2812:11-37. [PMID: 39068355 DOI: 10.1007/978-1-0716-3886-6_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]

Ebu SM, Ray L, Panda AN, Gouda SK. De novo assembly and comparative genome analysis for polyhydroxyalkanoates-producing Bacillus sp. BNPI-92 strain. J Genet Eng Biotechnol 2023;21:132. [PMID: 37991636 PMCID: PMC10665291 DOI: 10.1186/s43141-023-00578-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 10/26/2023] [Indexed: 11/23/2023]

Abstract

BACKGROUND

Certain Bacillus species play a vital role in polyhydroxyalkanoate (PHA) production. However, most of these isolates did not properly identify to species level when scientifically had been reported.

RESULTS

From NGS analysis, 5719 genes were predicted in the de novo genome assembly. Based on genome annotation using RAST server, 5,527,513 bp sequences were predicted with 5679 bp number of protein-coding sequence. Its genome sequence contains 35.1% and 156 GC content and contigs, respectively. In RAST server analysis, subsystem (43%) and non-subsystem coverage (57%) were generated. Ortho Venn comparative genome analysis indicated that Bacillus sp. BNPI-92 shared 2930 gene cluster (core gene) with B. cereus ATCC 14579 T (AE016877), B. paranthracis Mn5T (MACE01000012), B. thuringiensis ATCC 10792 T (ACNF01000156), and B. antrics Amen T (AE016879) strains. For our strain, the maximum gene cluster (190) was shared with B. cereus ATCC 14579 T (AE016877). For Ortho Venn pair wise analysis, the maximum overlapping gene clusters thresholds have been detected between Bacillus s p.BNPI-92 and Ba. cereus ATCC 14579 T (5414). Average nucleotide identity (ANI) such as OriginalANI and OrthoANI, in silicon digital DND-DNA hybridization (isDDH), Type (Strain) Genome Server (TYGS), and Genome-Genome Distance Calculator (GGDC) were more essentially related Bacillus sp. BNPI-92 with B. cereus ATCC 14579 T strain. Therefore, based on the combination of RAST annotation, OrthoVenn server, ANI and isDDH result Bacillus sp.BNPI-92 strain was strongly confirmed to be a B. cereus type strain. It was designated as B. cereus BNPI-92 strain. In B. cereus BNPI-92 strain whole genome sequence, PHA biosynthesis encoding genes such as phaP, phaQ, phaR (PHA synthesis repressor phaR gene sequence), phaB/phbB, and phaC were predicted on the same operon. These gene clusters were designated as phaPQRBC. However, phaA was located on other operons.

CONCLUSIONS

This newly obtained isolate was found to be new a strain based on comparative genomic analysis and it was also observed as a potential candidate for PHA biosynthesis.

Collapse

Manzano-Morales S, Liu Y, González-Bodí S, Huerta-Cepas J, Iranzo J. Comparison of gene clustering criteria reveals intrinsic uncertainty in pangenome analyses. Genome Biol 2023;24:250. [PMID: 37904249 PMCID: PMC10614367 DOI: 10.1186/s13059-023-03089-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 10/16/2023] [Indexed: 11/01/2023] Open

Abstract

BACKGROUND

A key step for comparative genomics is to group open reading frames into functionally and evolutionarily meaningful gene clusters. Gene clustering is complicated by intraspecific duplications and horizontal gene transfers that are frequent in prokaryotes. In consequence, gene clustering methods must deal with a trade-off between identifying vertically transmitted representatives of multicopy gene families, which are recognizable by synteny conservation, and retrieving complete sets of species-level orthologs. We studied the implications of adopting homology, orthology, or synteny conservation as formal criteria for gene clustering by performing comparative analyses of 125 prokaryotic pangenomes.

RESULTS

Clustering criteria affect pangenome functional characterization, core genome inference, and reconstruction of ancestral gene content to different extents. Species-wise estimates of pangenome and core genome sizes change by the same factor when using different clustering criteria, allowing robust cross-species comparisons regardless of the clustering criterion. However, cross-species comparisons of genome plasticity and functional profiles are substantially affected by inconsistencies among clustering criteria. Such inconsistencies are driven not only by mobile genetic elements, but also by genes involved in defense, secondary metabolism, and other accessory functions. In some pangenome features, the variability attributed to methodological inconsistencies can even exceed the effect sizes of ecological and phylogenetic variables.

CONCLUSIONS

Choosing an appropriate criterion for gene clustering is critical to conduct unbiased pangenome analyses. We provide practical guidelines to choose the right method depending on the research goals and the quality of genome assemblies, and a benchmarking dataset to assess the robustness and reproducibility of future comparative studies.

Collapse

Dobbelaere J, Su TY, Erdi B, Schleiffer A, Dammermann A. A phylogenetic profiling approach identifies novel ciliogenesis genes in Drosophila and C. elegans. EMBO J 2023;42:e113616. [PMID: 37317646 PMCID: PMC10425847 DOI: 10.15252/embj.2023113616] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 05/22/2023] [Accepted: 06/01/2023] [Indexed: 06/16/2023] Open

Lyubetsky VA, Rubanov LI, Tereshina MB, Ivanova AS, Araslanova KR, Uroshlev LA, Goremykina GI, Yang JR, Kanovei VG, Zverkov OA, Shitikov AD, Korotkova DD, Zaraisky AG. Wide-scale identification of novel/eliminated genes responsible for evolutionary transformations. Biol Direct 2023;18:45. [PMID: 37568147 PMCID: PMC10416458 DOI: 10.1186/s13062-023-00405-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 08/07/2023] [Indexed: 08/13/2023] Open

Abstract

BACKGROUND

It is generally accepted that most evolutionary transformations at the phenotype level are associated either with rearrangements of genomic regulatory elements, which control the activity of gene networks, or with changes in the amino acid contents of proteins. Recently, evidence has accumulated that significant evolutionary transformations could also be associated with the loss/emergence of whole genes. The targeted identification of such genes is a challenging problem for both bioinformatics and evo-devo research.

RESULTS

To solve this problem we propose the WINEGRET method, named after the first letters of the title. Its main idea is to search for genes that satisfy two requirements: first, the desired genes were lost/emerged at the same evolutionary stage at which the phenotypic trait of interest was lost/emerged, and second, the expression of these genes changes significantly during the development of the trait of interest in the model organism. To verify the first requirement, we do not use existing databases of orthologs, but rely purely on gene homology and local synteny by using some novel quickly computable conditions. Genes satisfying the second requirement are found by deep RNA sequencing. As a proof of principle, we used our method to find genes absent in extant amniotes (reptiles, birds, mammals) but present in anamniotes (fish and amphibians), in which these genes are involved in the regeneration of large body appendages. As a result, 57 genes were identified. For three of them, c-c motif chemokine 4, eotaxin-like, and a previously unknown gene called here sod4, essential roles for tail regeneration were demonstrated. Noteworthy, we established that the latter gene belongs to a novel family of Cu/Zn-superoxide dismutases lost by amniotes, SOD4.

CONCLUSIONS

We present a method for targeted identification of genes whose loss/emergence in evolution could be associated with the loss/emergence of a phenotypic trait of interest. In a proof-of-principle study, we identified genes absent in amniotes that participate in body appendage regeneration in anamniotes. Our method provides a wide range of opportunities for studying the relationship between the loss/emergence of phenotypic traits and the loss/emergence of specific genes in evolution.

Collapse

Affiliation(s)

Vassily A Lyubetsky Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), 19 Build. 1, Bolshoy Karetny per., Moscow, Russia, 127051 Department of Mechanics and Mathematics, Lomonosov Moscow State University, Kolmogorova Str., 1, Moscow, Russia, 119234
Lev I Rubanov Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), 19 Build. 1, Bolshoy Karetny per., Moscow, Russia, 127051
Maria B Tereshina Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya Str., Moscow, Russia, 117997 Pirogov Russian National Research Medical University, Moscow, Russia
Anastasiya S Ivanova Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya Str., Moscow, Russia, 117997 Department of Molecular Medicine, The Scripps Research Institute, La Jolla, USA
Karina R Araslanova Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya Str., Moscow, Russia, 117997
Leonid A Uroshlev Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32, Vavilova Str., Moscow, Russia, 119991
Galina I Goremykina Plekhanov Russian University of Economics, Stremyanny Lane 36, Moscow, Russia
Jian-Rong Yang Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
Vladimir G Kanovei Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), 19 Build. 1, Bolshoy Karetny per., Moscow, Russia, 127051
Oleg A Zverkov Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), 19 Build. 1, Bolshoy Karetny per., Moscow, Russia, 127051
Alexander D Shitikov Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya Str., Moscow, Russia, 117997
Daria D Korotkova Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya Str., Moscow, Russia, 117997 Global Health Institute, School of Life Sciences, EPFL, Lausanne, Switzerland
Andrey G Zaraisky Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya Str., Moscow, Russia, 117997. Pirogov Russian National Research Medical University, Moscow, Russia.

Collapse

Xiong W, Risse J, Berke L, Zhao T, van de Geest H, Oplaat C, Busscher M, Ferreira de Carvalho J, van der Meer IM, Verhoeven KJF, Schranz ME, Vijverberg K. Phylogenomic analysis provides insights into MADS-box and TCP gene diversification and floral development of the Asteraceae, supported by de novo genome and transcriptome sequences from dandelion (Taraxacum officinale). FRONTIERS IN PLANT SCIENCE 2023;14:1198909. [PMCID: PMC10338227 DOI: 10.3389/fpls.2023.1198909] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 05/26/2023] [Indexed: 07/15/2023]

Watanabe T, Kure A, Horiike T. OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information. Genome Biol Evol 2023;15:7044703. [PMID: 36799928 PMCID: PMC9991595 DOI: 10.1093/gbe/evad026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 01/30/2023] [Accepted: 02/13/2023] [Indexed: 02/18/2023] Open

Abstract

Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based on sequence information of genes, which is then used to infer the species phylogenetic tree. However, although methods for constructing an ortholog data set for species phylogenetic analysis have been developed, they cannot remove some paralogs, which is necessary for reliable species phylogenetic inference. To address the limitations of current methods, we developed OrthoPhy, a program that excludes paralogs and constructs highly accurate ortholog data sets using taxonomic information dividing analyzed species into monophyletic groups. OrthoPhy can remove paralogs, detecting inconsistencies between taxonomic information and phylogenetic trees of candidate ortholog groups clustered by sequence similarity. Performance tests using evolutionary simulated sequences and real sequences of 40 bacteria revealed that the precision of ortholog inference by OrthoPhy is higher than that of existing programs. Additionally, the phylogenetic analysis of species was more accurate when performed using ortholog data sets constructed by OrthoPhy than that performed using data sets constructed by existing programs. Furthermore, we performed a benchmark test of the Quest for Orthologs using real sequence data and found that the concordance rate between the phylogenetic trees of orthologs inferred by OrthoPhy and those of species was higher than the rates obtained by other ortholog inference programs. Therefore, ortholog data sets constructed using OrthoPhy enabled a more accurate phylogenetic analysis of species than those constructed using the existing programs, and OrthoPhy can be used for the phylogenetic analysis of species even for distantly related species that have experienced many evolutionary events.

Collapse

Yu C, Chen H, zhu L, Song Y, Jiang Q, Zhang Y, Ali Q, Gu Q, Gao X, Borriss R, Dong S, Wu H. Profiling of Antimicrobial Metabolites Synthesized by the Endophytic and Genetically Amenable Biocontrol Strain Bacillus velezensis DMW1. Microbiol Spectr 2023;11:e0003823. [PMID: 36809029 PMCID: PMC10100683 DOI: 10.1128/spectrum.00038-23] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 01/26/2023] [Indexed: 02/23/2023] Open

Abstract

The genus Bacillus is one of the most important genera for the biological control of plant diseases that are caused by various phytopathogens. The endophytic Bacillus strain DMW1 was isolated from the inner tissues of potato tubers and exhibited strong biocontrol activity. Based on its whole-genome sequence, DMW1 belongs to the Bacillus velezensis species, and it is similar to the model strain B. velezensis FZB42. 12 secondary metabolite biosynthetic gene clusters (BGCs), including two unknown function BGCs, were detected in the DMW1 genome. The strain was shown to be genetically amenable, and seven secondary metabolites acting antagonistically against plant pathogens were identified by a combined genetic and chemical approach. Strain DMW1 did significantly improve the growth of tomato and soybean seedlings, and it was able to control the Phytophthora sojae and Ralstonia solanacearum that were present in the plant seedlings. Due to these properties, the endophytic strain DMW1 appears to be a promising candidate for comparative investigations performed together with the Gram-positive model rhizobacterium FZB42, which is only able to colonize the rhizoplane. IMPORTANCE Phytopathogens are responsible for the wide spread of plant diseases as well as for great losses of crop yields. At present, the strategies used to control plant disease, including the development of resistant cultivars and chemical control, may become ineffective due to the adaptive evolution of pathogens. Therefore, the use of beneficial microorganisms to deal with plant diseases attracts great attention. In the present study, a new strain DMW1, belonging to the species B. velezensis, was discovered with outstanding biocontrol properties. It showed plant growth promotion and disease control abilities that are comparable with those of B. velezensis FZB42 under greenhouse conditions. According to a genomic analysis and a bioactive metabolites analysis, genes that are responsible for promoting plant growth were detected, and metabolites with different antagonistic activities were identified. Our data provide a basis for DMW1 to be further developed and applied as a biopesticide, which is similar to the closely related model strain FZB42.

Collapse

Affiliation(s)

Chenjie Yu Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Han Chen Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Linli zhu Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Yan Song Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Qifan Jiang Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Yaming Zhang Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Qurban Ali Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Qin Gu Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Xuewen Gao Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Rainer Borriss Humboldt University Berlin, Institut für Biologie, Berlin, Germany
Suomeng Dong Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China
Huijun Wu Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Key Laboratory of Integrated Management of Crop Diseases and Pests, Ministry of Education, Nanjing, China

Collapse

Liu K, Chen Q, Huang GH. An Efficient Feature Selection Algorithm for Gene Families Using NMF and ReliefF. Genes (Basel) 2023;14:421. [PMID: 36833348 PMCID: PMC9957060 DOI: 10.3390/genes14020421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/24/2023] [Accepted: 01/25/2023] [Indexed: 02/10/2023] Open

Iqbal S, Qasim M, Rahman H, Khan N, Paracha RZ, Bhatti MF, Javed A, Janjua HA. Genome mining, antimicrobial and plant growth-promoting potentials of halotolerant Bacillus paralicheniformis ES-1 isolated from salt mine. Mol Genet Genomics 2023;298:79-93. [PMID: 36301366 DOI: 10.1007/s00438-022-01964-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 10/11/2022] [Indexed: 01/10/2023]

Abstract

Salinity severely affects crop yield by hindering nitrogen uptake and reducing plant growth. Plant growth-promoting bacteria (PGPB) are capable of providing cross-protection against biotic/abiotic stresses and facilitating plant growth. Genome-level knowledge of PGPB is necessary to translate the knowledge into a product as efficient biofertilizers and biocontrol agents. The current study aimed to isolate and characterize indigenous plant growth-promoting strains with the potential to promote plant growth under various stress conditions. In this regard, 72 bacterial strains were isolated from various saline-sodic soil/lakes; 19 exhibited multiple in vitro plant growth-promoting traits, including indole 3 acetic acid production, phosphate solubilization, siderophore synthesis, lytic enzymes production, biofilm formation, and antibacterial activities. To get an in-depth insight into genome composition and diversity, whole-genome sequence and genome mining of one promising Bacillus paralicheniformis strain ES-1 were performed. The strain ES-1 genome carries 12 biosynthetic gene clusters, at least six genomic islands, and four prophage regions. Genome mining identified plant growth-promoting conferring genes such as phosphate solubilization, nitrogen fixation, tryptophan production, siderophore, acetoin, butanediol, chitinase, hydrogen sulfate synthesis, chemotaxis, and motility. Comparative genome analysis indicates the region of genome plasticity which shapes the structure and function of B. paralicheniformis and plays a crucial role in habitat adaptation. The strain ES-1 has a relatively large accessory genome of 649 genes (~ 19%) and 180 unique genes. Overall, these results provide valuable insight into the bioactivity and genomic insight into B. paralicheniformis strain ES-1 with its potential use in sustainable agriculture.

Collapse

Conant GC. POInT: Modeling Polyploidy in the Era of Ubiquitous Genomics. Methods Mol Biol 2023;2545:77-90. [PMID: 36720808 DOI: 10.1007/978-1-0716-2561-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Duan G, Wu G, Chen X, Tian D, Li Z, Sun Y, Du Z, Hao L, Song S, Gao Y, Xiao J, Zhang Z, Bao Y, Tang B, Zhao W. HGD: an integrated homologous gene database across multiple species. Nucleic Acids Res 2022;51:D994-D1002. [PMID: 36318261 PMCID: PMC9825607 DOI: 10.1093/nar/gkac970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/28/2022] [Accepted: 10/17/2022] [Indexed: 11/06/2022] Open

Affiliation(s)

Guangya Duan
Gangao Wu
Xiaoning Chen National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Dongmei Tian National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
Zhaohua Li National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Yanling Sun National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
Zhenglin Du National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
Lili Hao National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
Shuhui Song National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Yuan Gao National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Jingfa Xiao National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Zhang Zhang National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Yiming Bao National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
Bixia Tang Correspondence may also be addressed to Bixia Tang.
Wenming Zhao To whom correspondence should be addressed. Tel: +86 1084097636; Fax: +86 1084097720;

Collapse

Magid M, Wold JR, Moraga R, Cubrinovska I, Houston DM, Gartrell BD, Steeves TE. Leveraging an existing whole-genome resequencing population data set to characterize toll-like receptor gene diversity in a threatened bird. Mol Ecol Resour 2022;22:2810-2825. [PMID: 35635119 PMCID: PMC9543821 DOI: 10.1111/1755-0998.13656] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 04/29/2022] [Accepted: 05/26/2022] [Indexed: 11/27/2022]

Ncube D, Tallafuss A, Serafin J, Bruckner J, Farnsworth DR, Miller AC, Eisen JS, Washbourne P. A conserved transcriptional fingerprint of multi-neurotransmitter neurons necessary for social behavior. BMC Genomics 2022;23:675. [PMID: 36175871 PMCID: PMC9523972 DOI: 10.1186/s12864-022-08879-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 09/02/2022] [Indexed: 11/11/2022] Open

Benndorf R, Velazquez R, Zehr JD, Pond SLK, Martin JL, Lucaci AG. Human HspB1, HspB3, HspB5 and HspB8: Shaping these disease factors during vertebrate evolution. Cell Stress Chaperones 2022;27:309-323. [PMID: 35678958 PMCID: PMC9346038 DOI: 10.1007/s12192-022-01268-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 03/21/2022] [Accepted: 03/22/2022] [Indexed: 12/05/2022] Open

Peng W, Yang Y, Xu J, Peng E, Dai S, Dai L, Wang Y, Yi T, Wang B, Li D, Song N. TALE Transcription Factors in Sweet Orange (Citrus sinensis): Genome-Wide Identification, Characterization, and Expression in Response to Biotic and Abiotic Stresses. FRONTIERS IN PLANT SCIENCE 2022;12:814252. [PMID: 35126435 PMCID: PMC8811264 DOI: 10.3389/fpls.2021.814252] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Accepted: 12/13/2021] [Indexed: 06/14/2023]

Abstract

Three-amino-acid-loop-extension (TALE) transcription factors comprise one of the largest gene families in plants, in which they contribute to regulation of a wide variety of biological processes, including plant growth and development, as well as governing stress responses. Although sweet orange (Citrus sinensis) is among the most commercially important fruit crops cultivated worldwide, there have been relatively few functional studies on TALE genes in this species. In this study, we investigated 18 CsTALE gene family members with respect to their phylogeny, physicochemical properties, conserved motif/domain sequences, gene structures, chromosomal location, cis-acting regulatory elements, and protein-protein interactions (PPIs). These CsTALE genes were classified into two subfamilies based on sequence homology and phylogenetic analyses, and the classification was equally strongly supported by the highly conserved gene structures and motif/domain compositions. CsTALEs were found to be unevenly distributed on the chromosomes, and duplication analysis revealed that segmental duplication and purifying selection have been major driving force in the evolution of these genes. Expression profile analysis indicated that CsTALE genes exhibit a discernible spatial expression pattern in different tissues and differing expression patterns in response to different biotic/abiotic stresses. Of the 18 CsTALE genes examined, 10 were found to be responsive to high temperature, four to low temperature, eight to salt, and four to wounding. Moreover, the expression of CsTALE3/8/12/16 was induced in response to infection with the fungal pathogen Diaporthe citri and bacterial pathogen Candidatus Liberibacter asiaticus, whereas the expression of CsTALE15/17 was strongly suppressed. The transcriptional activity of CsTALE proteins was also verified in yeast, with yeast two-hybrid assays indicating that CsTALE3/CsTALE8, CsTALE3/CsTALE11, CsTALE10/CsTALE12, CsTALE14/CsTALE8, CsTALE14/CsTALE11 can form respective heterodimers. The findings of this study could lay the foundations for elucidating the biological functions of the TALE family genes in sweet orange and contribute to the breeding of stress-tolerant plants.

Collapse

Affiliation(s)

Weiye Peng College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Yang Yang College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Jing Xu College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Erping Peng College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Suming Dai Horticulture College, Hunan Agricultural University, Changsha, China National Center for Citrus Improvement Changsha, Changsha, China
Liangying Dai College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Yunsheng Wang College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Tuyong Yi College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Bing Wang College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China
Dazhi Li Horticulture College, Hunan Agricultural University, Changsha, China National Center for Citrus Improvement Changsha, Changsha, China
Na Song College of Plant Protection, Hunan Agricultural University, Changsha, China Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha, China

Collapse

Huang LC, Taujale R, Gravel N, Venkat A, Yeung W, Byrne DP, Eyers PA, Kannan N. KinOrtho: a method for mapping human kinase orthologs across the tree of life and illuminating understudied kinases. BMC Bioinformatics 2021;22:446. [PMID: 34537014 PMCID: PMC8449880 DOI: 10.1186/s12859-021-04358-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 09/06/2021] [Indexed: 12/11/2022] Open

Abstract

BACKGROUND

Protein kinases are among the largest druggable family of signaling proteins, involved in various human diseases, including cancers and neurodegenerative disorders. Despite their clinical relevance, nearly 30% of the 545 human protein kinases remain highly understudied. Comparative genomics is a powerful approach for predicting and investigating the functions of understudied kinases. However, an incomplete knowledge of kinase orthologs across fully sequenced kinomes severely limits the application of comparative genomics approaches for illuminating understudied kinases. Here, we introduce KinOrtho, a query- and graph-based orthology inference method that combines full-length and domain-based approaches to map one-to-one kinase orthologs across 17 thousand species.

RESULTS

Using multiple metrics, we show that KinOrtho performed better than existing methods in identifying kinase orthologs across evolutionarily divergent species and eliminated potential false positives by flagging sequences without a proper kinase domain for further evaluation. We demonstrate the advantage of using domain-based approaches for identifying domain fusion events, highlighting a case between an understudied serine/threonine kinase TAOK1 and a metabolic kinase PIK3C2A with high co-expression in human cells. We also identify evolutionary fission events involving the understudied OBSCN kinase domains, further highlighting the value of domain-based orthology inference approaches. Using KinOrtho-defined orthologs, Gene Ontology annotations, and machine learning, we propose putative biological functions of several understudied kinases, including the role of TP53RK in cell cycle checkpoint(s), the involvement of TSSK3 and TSSK6 in acrosomal vesicle localization, and potential functions for the ULK4 pseudokinase in neuronal development.

CONCLUSIONS

In sum, KinOrtho presents a novel query-based tool to identify one-to-one orthologous relationships across thousands of proteomes that can be applied to any protein family of interest. We exploit KinOrtho here to identify kinase orthologs and show that its well-curated kinome ortholog set can serve as a valuable resource for illuminating understudied kinases, and the KinOrtho framework can be extended to any protein-family of interest.

Collapse

Hybrid Deep Learning Based on a Heterogeneous Network Profile for Functional Annotations of Plasmodium falciparum Genes. Int J Mol Sci 2021;22:ijms221810019. [PMID: 34576183 PMCID: PMC8468833 DOI: 10.3390/ijms221810019] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 09/13/2021] [Accepted: 09/14/2021] [Indexed: 12/15/2022] Open

Molecular underpinnings of the early brain developmental response to differential feeding in the honey bee Apis mellifera. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2021;1864:194732. [PMID: 34242825 DOI: 10.1016/j.bbagrm.2021.194732] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 06/25/2021] [Accepted: 06/29/2021] [Indexed: 12/14/2022]

Harris CD, Torrance EL, Raymann K, Bobay LM. CoreCruncher: Fast and Robust Construction of Core Genomes in Large Prokaryotic Data Sets. Mol Biol Evol 2021;38:727-734. [PMID: 32886787 PMCID: PMC7826169 DOI: 10.1093/molbev/msaa224] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open

Hao Y, Lee HJ, Baraboo M, Burch K, Maurer T, Somarelli JA, Conant GC. Baby Genomics: Tracing the Evolutionary Changes That Gave Rise to Placentation. Genome Biol Evol 2021;12:35-47. [PMID: 32053193 PMCID: PMC7144826 DOI: 10.1093/gbe/evaa026] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/02/2020] [Indexed: 12/12/2022] Open

Boyle JH, Rastas PMA, Huang X, Garner AG, Vythilingam I, Armbruster PA. A Linkage-Based Genome Assembly for the Mosquito Aedes albopictus and Identification of Chromosomal Regions Affecting Diapause. INSECTS 2021;12:167. [PMID: 33669192 PMCID: PMC7919801 DOI: 10.3390/insects12020167] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 02/08/2021] [Accepted: 02/10/2021] [Indexed: 12/16/2022]

Koonin EV, Makarova KS, Wolf YI. Evolution of Microbial Genomics: Conceptual Shifts over a Quarter Century. Trends Microbiol 2021;29:582-592. [PMID: 33541841 DOI: 10.1016/j.tim.2021.01.005] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/07/2021] [Accepted: 01/08/2021] [Indexed: 12/20/2022]

Hernández-Salmerón JE, Moreno-Hagelsieb G. Progress in quickly finding orthologs as reciprocal best hits: comparing blast, last, diamond and MMseqs2. BMC Genomics 2020;21:741. [PMID: 33099302 PMCID: PMC7585182 DOI: 10.1186/s12864-020-07132-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 10/09/2020] [Indexed: 02/07/2023] Open

Nucleotide substitution rates of diatom plastid encoded protein genes are positively correlated with genome architecture. Sci Rep 2020;10:14358. [PMID: 32873883 PMCID: PMC7462845 DOI: 10.1038/s41598-020-71473-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 08/17/2020] [Indexed: 01/02/2023] Open

Owen CL, Stern DB, Hilton SK, Crandall KA. Hemiptera phylogenomic resources: Tree‐based orthology prediction and conserved exon identification. Mol Ecol Resour 2020;20:1346-1360. [DOI: 10.1111/1755-0998.13180] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Revised: 04/02/2020] [Accepted: 04/27/2020] [Indexed: 12/21/2022]

Phylogenetic tree building in the genomic age. Nat Rev Genet 2020;21:428-444. [PMID: 32424311 DOI: 10.1038/s41576-020-0233-0] [Citation(s) in RCA: 165] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/20/2020] [Indexed: 01/22/2023]

Galperin MY, Kristensen DM, Makarova KS, Wolf YI, Koonin EV. Microbial genome analysis: the COG approach. Brief Bioinform 2020;20:1063-1070. [PMID: 28968633 DOI: 10.1093/bib/bbx117] [Citation(s) in RCA: 152] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 08/01/2017] [Indexed: 11/15/2022] Open

Prasanna AN, Gerber D, Kijpornyongpan T, Aime MC, Doyle VP, Nagy LG. Model Choice, Missing Data, and Taxon Sampling Impact Phylogenomic Inference of Deep Basidiomycota Relationships. Syst Biol 2020;69:17-37. [PMID: 31062852 DOI: 10.1093/sysbio/syz029] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Revised: 04/21/2019] [Accepted: 04/26/2019] [Indexed: 11/12/2022] Open

Abstract

Resolving deep divergences in the tree of life is challenging even for analyses of genome-scale phylogenetic data sets. Relationships between Basidiomycota subphyla, the rusts and allies (Pucciniomycotina), smuts and allies (Ustilaginomycotina), and mushroom-forming fungi and allies (Agaricomycotina) were found particularly recalcitrant both to traditional multigene and genome-scale phylogenetics. Here, we address basal Basidiomycota relationships using concatenated and gene tree-based analyses of various phylogenomic data sets to examine the contribution of several potential sources of bias. We evaluate the contribution of biological causes (hard polytomy, incomplete lineage sorting) versus unmodeled evolutionary processes and factors that exacerbate their effects (e.g., fast-evolving sites and long-branch taxa) to inferences of basal Basidiomycota relationships. Bayesian Markov Chain Monte Carlo and likelihood mapping analyses reject the hard polytomy with confidence. In concatenated analyses, fast-evolving sites and oversimplified models of amino acid substitution favored the grouping of smuts with mushroom-forming fungi, often leading to maximal bootstrap support in both concatenation and coalescent analyses. On the contrary, the most conserved data subsets grouped rusts and allies with mushroom-forming fungi, although this relationship proved labile, sensitive to model choice, to different data subsets and to missing data. Excluding putative long-branch taxa, genes with high proportions of missing data and/or with strong signal failed to reveal a consistent trend toward one or the other topology, suggesting that additional sources of conflict are at play. While concatenated analyses yielded strong but conflicting support, individual gene trees mostly provided poor support for any resolution of rusts, smuts, and mushroom-forming fungi, suggesting that the true Basidiomycota tree might be in a part of tree space that is difficult to access using both concatenation and gene tree-based approaches. Inference-based assessments of absolute model fit strongly reject best-fit models for the vast majority of genes, indicating a poor fit of even the most commonly used models. While this is consistent with previous assessments of site-homogenous models of amino acid evolution, this does not appear to be the sole source of confounding signal. Our analyses suggest that topologies uniting smuts with mushroom-forming fungi can arise as a result of inappropriate modeling of amino acid sites that might be prone to systematic bias. We speculate that improved models of sequence evolution could shed more light on basal splits in the Basidiomycota, which, for now, remain unresolved despite the use of whole genome data.

Collapse

Comparative Genomics and CAZyme Genome Repertoires of Marine Zobellia amurskyensis KMM 3526^T and Zobellia laminariae KMM 3676^T. Mar Drugs 2019;17:md17120661. [PMID: 31771309 PMCID: PMC6950322 DOI: 10.3390/md17120661] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Revised: 11/21/2019] [Accepted: 11/22/2019] [Indexed: 01/01/2023] Open

Rubanov LI, Zaraisky AG, Shilovsky GA, Seliverstov AV, Zverkov OA, Lyubetsky VA. Screening for mouse genes lost in mammals with long lifespans. BioData Min 2019;12:20. [PMID: 31728160 PMCID: PMC6842137 DOI: 10.1186/s13040-019-0208-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 10/25/2019] [Indexed: 12/23/2022] Open

Abstract

Background

Gerontogenes include those that modulate life expectancy in various species and may be the actual longevity genes. We believe that a long (relative to body weight) lifespan in individual rodent and primate species can be due, among other things, to the loss of particular genes that are present in short-lived species of the same orders. These genes can also explain the widely different rates of aging among diverse species as well as why similarly sized rodents or primates sometimes have anomalous life expectancies (e.g., naked mole-rats and humans). Here, we consider the gene loss in the context of the prediction of Williams’ theory that concerns the reallocation of physiological resources of an organism between active reproduction (r-strategy) and self-maintenance (K-strategy). We have identified such lost genes using an original computer-aided approach; the software considers the loss of a gene as disruptions in gene orthology, local gene synteny or both.

Results

A method and software identifying the genes that are absent from a predefined set of species but present in another predefined set of species are suggested. Examples of such pairs of sets include long-lived vs short-lived, homeothermic vs poikilothermic, amniotic vs anamniotic, aquatic vs terrestrial, and neotenic vs nonneotenic species, among others. Species are included in one of two sets according to the property of interest, such as longevity or homeothermy. The program is universal towards these pairs, i.e., towards the underlying property, although the sets should include species with quality genome assemblies. Here, the proposed method was applied to study the longevity of Euarchontoglires species. It largely predicted genes that are highly expressed in the testis, epididymis, uterus, mammary glands, and the vomeronasal and other reproduction-related organs. This agrees with Williams’ theory that hypothesizes a species transition from r-strategy to K-strategy. For instance, the method predicts the mouse gene Smpd5, which has an expression level 20 times greater in the testis than in organs unrelated to reproduction as experimentally demonstrated elsewhere. At the same time, its paralog Smpd3 is not predicted by the program and is widely expressed in many organs not specifically related to reproduction.

Conclusions

The method and program, which were applied here to screen for gene losses that can accompany increased lifespan, were also applied to study reduced regenerative capacity and development of the telencephalon, neoteny, etc. Some of these results have been carefully tested experimentally. Therefore, we assume that the method is widely applicable.

Collapse

Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 2019;20:238. [PMID: 31727128 PMCID: PMC6857279 DOI: 10.1186/s13059-019-1832-y] [Citation(s) in RCA: 2815] [Impact Index Per Article: 563.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 09/23/2019] [Indexed: 12/22/2022] Open

Andrade CH, Neves BJ, Melo-Filho CC, Rodrigues J, Silva DC, Braga RC, Cravo PVL. In Silico Chemogenomics Drug Repositioning Strategies for Neglected Tropical Diseases. Curr Med Chem 2019. [DOI: 10.2174/0929867325666180309114824] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Testis-specific Arf promoter expression in a transposase-aided BAC transgenic mouse model. Mol Biol Rep 2019;46:6243-6252. [PMID: 31583563 DOI: 10.1007/s11033-019-05063-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 09/04/2019] [Indexed: 10/25/2022]

Hu X, Friedberg I. SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier. Gigascience 2019;8:giz118. [PMID: 31648300 PMCID: PMC6812468 DOI: 10.1093/gigascience/giz118] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 06/07/2019] [Accepted: 09/05/2019] [Indexed: 11/13/2022] Open

Lafond M, Meghdari Miardan M, Sankoff D. Accurate prediction of orthologs in the presence of divergence after duplication. Bioinformatics 2019;34:i366-i375. [PMID: 29950018 PMCID: PMC6022570 DOI: 10.1093/bioinformatics/bty242] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Verbruggen B, Gunnarsson L, Kristiansson E, Österlund T, Owen SF, Snape JR, Tyler CR. ECOdrug: a database connecting drugs and conservation of their targets across species. Nucleic Acids Res 2019;46:D930-D936. [PMID: 29140522 PMCID: PMC5753218 DOI: 10.1093/nar/gkx1024] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Accepted: 10/23/2017] [Indexed: 12/12/2022] Open

Xu L, Dong Z, Fang L, Luo Y, Wei Z, Guo H, Zhang G, Gu YQ, Coleman-Derr D, Xia Q, Wang Y. OrthoVenn2: a web server for whole-genome comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res 2019;47:W52-W58. [PMID: 31053848 PMCID: PMC6602458 DOI: 10.1093/nar/gkz333] [Citation(s) in RCA: 569] [Impact Index Per Article: 113.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Revised: 04/16/2019] [Accepted: 04/25/2019] [Indexed: 12/28/2022] Open

Rey C, Veber P, Boussau B, Sémon M. CAARS: comparative assembly and annotation of RNA-Seq data. Bioinformatics 2019;35:2199-2207. [PMID: 30452539 PMCID: PMC6596894 DOI: 10.1093/bioinformatics/bty903] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Revised: 09/13/2018] [Accepted: 11/16/2018] [Indexed: 02/05/2023] Open

Nigam K, Sanyal S, Gupta S, Gupta OP, Mahdi AA, Bhatt MLB. Alteration of the Risk of Oral Pre-Cancer and Cancer in North India Population by CYP1A1 Polymorphism Genotypes and Haplotype. Asian Pac J Cancer Prev 2019;20:345-354. [PMID: 30803192 PMCID: PMC6897020 DOI: 10.31557/apjcp.2019.20.2.345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

Torres Manno MA, Pizarro MD, Prunello M, Magni C, Daurelio LD, Espariz M. GeM-Pro: a tool for genome functional mining and microbial profiling. Appl Microbiol Biotechnol 2019;103:3123-3134. [DOI: 10.1007/s00253-019-09648-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 01/11/2019] [Accepted: 01/14/2019] [Indexed: 11/30/2022]

Guillén Y, Casillas S, Ruiz A. Genome-Wide Patterns of Sequence Divergence of Protein-Coding Genes Between Drosophila buzzatii and D. mojavensis. J Hered 2019;110:92-101. [PMID: 30124907 DOI: 10.1093/jhered/esy041] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 08/14/2018] [Indexed: 12/15/2022] Open

Medeiros Filho F, do Nascimento APB, dos Santos MT, Carvalho-Assef APD, da Silva FAB. Gene regulatory network inference and analysis of multidrug-resistant Pseudomonas aeruginosa. Mem Inst Oswaldo Cruz 2019;114:e190105. [PMID: 31389522 PMCID: PMC6684008 DOI: 10.1590/0074-02760190105] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 06/26/2019] [Indexed: 12/04/2022] Open

Oti M, Pane A, Sammeth M. Comparative Genomics in Drosophila. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2018;1704:433-450. [PMID: 29277877 DOI: 10.1007/978-1-4939-7463-4_17] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Abstract

Since the pioneering studies of Thomas Hunt Morgan and coworkers at the dawn of the twentieth century, Drosophila melanogaster and its sister species have tremendously contributed to unveil the rules underlying animal genetics, development, behavior, evolution, and human disease. Recent advances in DNA sequencing technologies launched Drosophila into the post-genomic era and paved the way for unprecedented comparative genomics investigations. The complete sequencing and systematic comparison of the genomes from 12 Drosophila species represents a milestone achievement in modern biology, which allowed a plethora of different studies ranging from the annotation of known and novel genomic features to the evolution of chromosomes and, ultimately, of entire genomes. Despite the efforts of countless laboratories worldwide, the vast amount of data that were produced over the past 15 years is far from being fully explored.In this chapter, we will review some of the bioinformatic approaches that were developed to interrogate the genomes of the 12 Drosophila species. Setting off from alignments of the entire genomic sequences, the degree of conservation can be separately evaluated for every region of the genome, providing already first hints about elements that are under purifying selection and therefore likely functional. Furthermore, the careful analysis of repeated sequences sheds light on the evolutionary dynamics of transposons, an enigmatic and fascinating class of mobile elements housed in the genomes of animals and plants. Comparative genomics also aids in the computational identification of the transcriptionally active part of the genome, first and foremost of protein-coding loci, but also of transcribed nevertheless apparently noncoding regions, which were once considered "junk" DNA. Eventually, the synergy between functional and comparative genomics also facilitates in silico and in vivo studies on cis-acting regulatory elements, like transcription factor binding sites, that due to the high degree of sequence variability usually impose increased challenges for bioinformatics approaches.

Collapse

Ambrosino L, Ruggieri V, Bostan H, Miralto M, Vitulo N, Zouine M, Barone A, Bouzayen M, Frusciante L, Pezzotti M, Valle G, Chiusano ML. Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations: an example for tomato and grapevine. BMC Bioinformatics 2018;19:435. [PMID: 30497367 PMCID: PMC6266932 DOI: 10.1186/s12859-018-2420-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open

Abstract

Background

“Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework.

Results

We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at http://biosrv.cab.unina.it/comparalogs/gene/search.

Conclusions

The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.

Electronic supplementary material

The online version of this article (10.1186/s12859-018-2420-y) contains supplementary material, which is available to authorized users.

Collapse

Lassalle F, Planel R, Penel S, Chapulliot D, Barbe V, Dubost A, Calteau A, Vallenet D, Mornico D, Bigot T, Guéguen L, Vial L, Muller D, Daubin V, Nesme X. Ancestral Genome Estimation Reveals the History of Ecological Diversification in Agrobacterium. Genome Biol Evol 2018;9:3413-3431. [PMID: 29220487 PMCID: PMC5739047 DOI: 10.1093/gbe/evx255] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/01/2017] [Indexed: 12/12/2022] Open

Affiliation(s)

Florent Lassalle Ecologie Microbienne, CNRS, INRA, VetAgro Sup, UCBL, Université de Lyon, Villeurbanne, France.,Biométrie et Biologie Evolutive, CNRS, UCBL, Université de Lyon, Villeurbanne, France.,Ecole Normale Supérieure de Lyon, Lyon, France
Rémi Planel Biométrie et Biologie Evolutive, CNRS, UCBL, Université de Lyon, Villeurbanne, France
Simon Penel Biométrie et Biologie Evolutive, CNRS, UCBL, Université de Lyon, Villeurbanne, France
David Chapulliot Ecologie Microbienne, CNRS, INRA, VetAgro Sup, UCBL, Université de Lyon, Villeurbanne, France
Valérie Barbe Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA) Direction de la Recherche Fondamentale, Institut de Biologie Francois-Jacob (IBFJ), Genoscope, Evry, France
Audrey Dubost Ecologie Microbienne, CNRS, INRA, VetAgro Sup, UCBL, Université de Lyon, Villeurbanne, France
Alexandra Calteau Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA) Direction de la Recherche Fondamentale, Institut de Biologie Francois-Jacob (IBFJ), Genoscope, Evry, France.,Laboratoire d'Analyse Bioinformatiques pour la Génomique et le Métabolisme, CNRS, UMR 8030, Evry, France.,UEVE, Université d'Evry Val d'Essonne, France
David Vallenet Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA) Direction de la Recherche Fondamentale, Institut de Biologie Francois-Jacob (IBFJ), Genoscope, Evry, France.,Laboratoire d'Analyse Bioinformatiques pour la Génomique et le Métabolisme, CNRS, UMR 8030, Evry, France.,UEVE, Université d'Evry Val d'Essonne, France
Damien Mornico Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA) Direction de la Recherche Fondamentale, Institut de Biologie Francois-Jacob (IBFJ), Genoscope, Evry, France.,Laboratoire d'Analyse Bioinformatiques pour la Génomique et le Métabolisme, CNRS, UMR 8030, Evry, France.,UEVE, Université d'Evry Val d'Essonne, France
Thomas Bigot Biométrie et Biologie Evolutive, CNRS, UCBL, Université de Lyon, Villeurbanne, France
Laurent Guéguen Biométrie et Biologie Evolutive, CNRS, UCBL, Université de Lyon, Villeurbanne, France
Ludovic Vial Ecologie Microbienne, CNRS, INRA, VetAgro Sup, UCBL, Université de Lyon, Villeurbanne, France
Daniel Muller Ecologie Microbienne, CNRS, INRA, VetAgro Sup, UCBL, Université de Lyon, Villeurbanne, France
Vincent Daubin Biométrie et Biologie Evolutive, CNRS, UCBL, Université de Lyon, Villeurbanne, France
Xavier Nesme Ecologie Microbienne, CNRS, INRA, VetAgro Sup, UCBL, Université de Lyon, Villeurbanne, France

Collapse