Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Blanchette M, Green ED, Miller W, Haussler D. Reconstructing large regions of an ancestral mammalian genome in silico. Genome Res 2004;14:2412-23. [PMID: 15574820 PMCID: PMC534665 DOI: 10.1101/gr.2800104] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2004] [Accepted: 10/05/2004] [Indexed: 11/25/2022]

For:	Blanchette M, Green ED, Miller W, Haussler D. Reconstructing large regions of an ancestral mammalian genome in silico. Genome Res 2004;14:2412-23. [PMID: 15574820 PMCID: PMC534665 DOI: 10.1101/gr.2800104] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2004] [Accepted: 10/05/2004] [Indexed: 11/25/2022]

Number

Cited by Other Article(s)

Lim D, Baek C, Blanchette M. Graphylo: A deep learning approach for predicting regulatory DNA and RNA sites from whole-genome multiple alignments. iScience 2024;27:109002. [PMID: 38362268 PMCID: PMC10867641 DOI: 10.1016/j.isci.2024.109002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 12/17/2023] [Accepted: 01/19/2024] [Indexed: 02/17/2024] Open

Reconstruction of hundreds of reference ancestral genomes across the eukaryotic kingdom. Nat Ecol Evol 2023;7:355-366. [PMID: 36646945 PMCID: PMC9998269 DOI: 10.1038/s41559-022-01956-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 11/22/2022] [Indexed: 01/18/2023]

Gupta MK, Vadde R. Next-generation development and application of codon model in evolution. Front Genet 2023;14:1091575. [PMID: 36777719 PMCID: PMC9911445 DOI: 10.3389/fgene.2023.1091575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/17/2023] [Indexed: 01/28/2023] Open

Campitelli LF, Yellan I, Albu M, Barazandeh M, Patel ZM, Blanchette M, Hughes TR. Reconstruction of full-length LINE-1 progenitors from ancestral genomes. Genetics 2022;221:6584822. [PMID: 35552404 PMCID: PMC9252281 DOI: 10.1093/genetics/iyac074] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 04/27/2022] [Indexed: 11/24/2022] Open

Heydeck D, Reisch F, Schäfer M, Kakularam KR, Roigas SA, Stehling S, Püschel GP, Kuhn H. The Reaction Specificity of Mammalian ALOX15 Orthologs is Changed During Late Primate Evolution and These Alterations Might Offer Evolutionary Advantages for Hominidae. Front Cell Dev Biol 2022;10:871585. [PMID: 35531094 PMCID: PMC9068934 DOI: 10.3389/fcell.2022.871585] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 04/01/2022] [Indexed: 01/03/2023] Open

Lichman BR. Ancestral Sequence Reconstruction for Exploring Alkaloid Evolution. Methods Mol Biol 2022;2505:165-179. [PMID: 35732944 DOI: 10.1007/978-1-0716-2349-7_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Schull JK, Turakhia Y, Hemker JA, Dally WJ, Bejerano G. OUP accepted manuscript. Genome Biol Evol 2022;14:6529394. [PMID: 35171243 PMCID: PMC8920512 DOI: 10.1093/gbe/evac013] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/10/2022] [Indexed: 11/14/2022] Open

Lim D, Blanchette M. EvoLSTM: context-dependent models of sequence evolution using a sequence-to-sequence LSTM. Bioinformatics 2021;36:i353-i361. [PMID: 32657367 PMCID: PMC7355264 DOI: 10.1093/bioinformatics/btaa447] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Buckley RM, Kortschak RD, Adelson DL. Divergent genome evolution caused by regional variation in DNA gain and loss between human and mouse. PLoS Comput Biol 2018;14:e1006091. [PMID: 29677183 PMCID: PMC5931693 DOI: 10.1371/journal.pcbi.1006091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Revised: 05/02/2018] [Accepted: 03/15/2018] [Indexed: 12/31/2022] Open

Sharma V, Hiller M. Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation. Nucleic Acids Res 2017. [PMID: 28645144 PMCID: PMC5737078 DOI: 10.1093/nar/gkx554] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open

Feng B, Zhou L, Tang J. Ancestral Genome Reconstruction on Whole Genome Level. Curr Genomics 2017;18:306-315. [PMID: 29081686 PMCID: PMC5635614 DOI: 10.2174/1389202918666170307120943] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2016] [Revised: 10/08/2016] [Accepted: 11/03/2016] [Indexed: 11/22/2022] Open

Holmes IH. Solving the master equation for Indels. BMC Bioinformatics 2017;18:255. [PMID: 28494756 PMCID: PMC5427538 DOI: 10.1186/s12859-017-1665-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Accepted: 04/30/2017] [Indexed: 01/09/2023] Open

Hague MT, Feldman CR, Brodie ED, Brodie ED. Convergent adaptation to dangerous prey proceeds through the same first‐step mutation in the garter snake Thamnophis sirtalis. Evolution 2017;71:1504-1518. [DOI: 10.1111/evo.13244] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 03/24/2017] [Indexed: 12/28/2022]

Sundaram V, Choudhary MNK, Pehrsson E, Xing X, Fiore C, Pandey M, Maricque B, Udawatta M, Ngo D, Chen Y, Paguntalan A, Ray T, Hughes A, Cohen BA, Wang T. Functional cis-regulatory modules encoded by mouse-specific endogenous retrovirus. Nat Commun 2017;8:14550. [PMID: 28348391 PMCID: PMC5379053 DOI: 10.1038/ncomms14550] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Accepted: 01/11/2017] [Indexed: 01/30/2023] Open

Affiliation(s)

Vasavi Sundaram Division of Biological and Biomedical Sciences, Washington University School of Medicine, 660 S. Euclid Avenue, St. Louis, Missouri 63110, USA Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Mayank N. K. Choudhary Division of Biological and Biomedical Sciences, Washington University School of Medicine, 660 S. Euclid Avenue, St. Louis, Missouri 63110, USA Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Erica Pehrsson Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Xiaoyun Xing Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Christopher Fiore Division of Biological and Biomedical Sciences, Washington University School of Medicine, 660 S. Euclid Avenue, St. Louis, Missouri 63110, USA Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Manishi Pandey Division of Biological and Biomedical Sciences, Washington University School of Medicine, 660 S. Euclid Avenue, St. Louis, Missouri 63110, USA Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Brett Maricque Division of Biological and Biomedical Sciences, Washington University School of Medicine, 660 S. Euclid Avenue, St. Louis, Missouri 63110, USA Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Methma Udawatta Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Duc Ngo Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Yujie Chen Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Asia Paguntalan Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Tammy Ray Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Ava Hughes Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Barak A. Cohen Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA
Ting Wang Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, Missouri 63110, USA

Collapse

Dynamics of genome size evolution in birds and mammals. Proc Natl Acad Sci U S A 2017;114:E1460-E1469. [PMID: 28179571 DOI: 10.1073/pnas.1616702114] [Citation(s) in RCA: 232] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open

Jarvis ED. Perspectives from the Avian Phylogenomics Project: Questions that Can Be Answered with Sequencing All Genomes of a Vertebrate Class. Annu Rev Anim Biosci 2016;4:45-59. [PMID: 26884102 DOI: 10.1146/annurev-animal-021815-111216] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Tremblay-Savard O, Reinharz V, Waldispühl J. Reconstruction of ancestral RNA sequences under multiple structural constraints. BMC Genomics 2016;17:862. [PMID: 28185557 PMCID: PMC5123390 DOI: 10.1186/s12864-016-3105-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Perdomo-Sabogal A, Nowick K, Piccini I, Sudbrak R, Lehrach H, Yaspo ML, Warnatz HJ, Querfurth R. Human Lineage-Specific Transcriptional Regulation through GA-Binding Protein Transcription Factor Alpha (GABPa). Mol Biol Evol 2016;33:1231-44. [PMID: 26814189 PMCID: PMC4839217 DOI: 10.1093/molbev/msw007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open

Berthelot C, Muffato M, Abecassis J, Roest Crollius H. The 3D organization of chromatin explains evolutionary fragile genomic regions. Cell Rep 2015;10:1913-24. [PMID: 25801028 DOI: 10.1016/j.celrep.2015.02.046] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Revised: 12/17/2014] [Accepted: 02/18/2015] [Indexed: 10/23/2022] Open

Papamichos SI, Margaritis D, Kotsianidis I. Adaptive Evolution Coupled with Retrotransposon Exaptation Allowed for the Generation of a Human-Protein-Specific Coding Gene That Promotes Cancer Cell Proliferation and Metastasis in Both Haematological Malignancies and Solid Tumours: The Extraordinary Case of MYEOV Gene. SCIENTIFICA 2015;2015:984706. [PMID: 26568894 PMCID: PMC4629056 DOI: 10.1155/2015/984706] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 09/27/2015] [Indexed: 06/05/2023]

Duchemin W, Daubin V, Tannier E. Reconstruction of an ancestral Yersinia pestis genome and comparison with an ancient sequence. BMC Genomics 2015;16 Suppl 10:S9. [PMID: 26450112 PMCID: PMC4603589 DOI: 10.1186/1471-2164-16-s10-s9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open

Green RE, Braun EL, Armstrong J, Earl D, Nguyen N, Hickey G, Vandewege MW, St John JA, Capella-Gutiérrez S, Castoe TA, Kern C, Fujita MK, Opazo JC, Jurka J, Kojima KK, Caballero J, Hubley RM, Smit AF, Platt RN, Lavoie CA, Ramakodi MP, Finger JW, Suh A, Isberg SR, Miles L, Chong AY, Jaratlerdsiri W, Gongora J, Moran C, Iriarte A, McCormack J, Burgess SC, Edwards SV, Lyons E, Williams C, Breen M, Howard JT, Gresham CR, Peterson DG, Schmitz J, Pollock DD, Haussler D, Triplett EW, Zhang G, Irie N, Jarvis ED, Brochu CA, Schmidt CJ, McCarthy FM, Faircloth BC, Hoffmann FG, Glenn TC, Gabaldón T, Paten B, Ray DA. Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs. Science 2014;346:1254449. [PMID: 25504731 PMCID: PMC4386873 DOI: 10.1126/science.1254449] [Citation(s) in RCA: 230] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Affiliation(s)

Richard E Green Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA.
Edward L Braun Department of Biology and Genetics Institute, University of Florida, Gainesville, FL 32611, USA
Joel Armstrong Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA. Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
Dent Earl Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA. Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
Ngan Nguyen Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA. Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
Glenn Hickey Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA. Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
Michael W Vandewege Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA
John A St John Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA
Salvador Capella-Gutiérrez Bioinformatics and Genomics Programme, Centre for Genomic Regulation, 08003 Barcelona, Spain. Universitat Pompeu Fabra, 08003 Barcelona, Spain
Todd A Castoe Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA. Department of Biology, University of Texas, Arlington, TX 76019, USA
Colin Kern Department of Computer and Information Sciences, University of Delaware, Newark, DE 19717, USA
Matthew K Fujita Department of Biology, University of Texas, Arlington, TX 76019, USA
Juan C Opazo Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
Jerzy Jurka Genetic Information Research Institute, Mountain View, CA 94043, USA
Kenji K Kojima Genetic Information Research Institute, Mountain View, CA 94043, USA
Juan Caballero Institute for Systems Biology, Seattle, WA 98109, USA
Robert M Hubley Institute for Systems Biology, Seattle, WA 98109, USA
Arian F Smit Institute for Systems Biology, Seattle, WA 98109, USA
Roy N Platt Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA. Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA
Christine A Lavoie Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA
Meganathan P Ramakodi Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA. Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA
John W Finger Department of Environmental Health Science, University of Georgia, Athens, GA 30602, USA
Alexander Suh Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany. Department of Evolutionary Biology (EBC), Uppsala University, SE-752 36 Uppsala, Sweden
Sally R Isberg Porosus Pty. Ltd., Palmerston, NT 0831, Australia. Faculty of Veterinary Science, University of Sydney, Sydney, NSW 2006, Australia. Centre for Crocodile Research, Noonamah, NT 0837, Australia
Lee Miles Faculty of Veterinary Science, University of Sydney, Sydney, NSW 2006, Australia
Amanda Y Chong Faculty of Veterinary Science, University of Sydney, Sydney, NSW 2006, Australia
Weerachai Jaratlerdsiri Faculty of Veterinary Science, University of Sydney, Sydney, NSW 2006, Australia
Jaime Gongora Faculty of Veterinary Science, University of Sydney, Sydney, NSW 2006, Australia
Christopher Moran Faculty of Veterinary Science, University of Sydney, Sydney, NSW 2006, Australia
Andrés Iriarte Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay
John McCormack Moore Laboratory of Zoology, Occidental College, Los Angeles, CA 90041, USA
Shane C Burgess College of Agriculture and Life Sciences, University of Arizona, Tucson, AZ 85721, USA
Scott V Edwards Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
Eric Lyons School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
Christina Williams Department of Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC 27607, USA
Matthew Breen Department of Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC 27607, USA
Jason T Howard Howard Hughes Medical Institute, Department of Neurobiology, Duke University Medical Center, Durham, NC 27710, USA
Cathy R Gresham Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA
Daniel G Peterson Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA. Department of Plant and Soil Sciences, Mississippi State University, Mississippi State, MS 39762, USA
Jürgen Schmitz Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
David D Pollock Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
David Haussler Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA. Howard Hughes Medical Institute, Bethesda, MD 20814, USA
Eric W Triplett Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
Guojie Zhang China National GeneBank, BGI-Shenzhen, Shenzhen, China. Center for Social Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
Naoki Irie Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
Erich D Jarvis Howard Hughes Medical Institute, Department of Neurobiology, Duke University Medical Center, Durham, NC 27710, USA
Christopher A Brochu Department of Earth and Environmental Sciences, University of Iowa, Iowa City, IA 52242, USA
Carl J Schmidt Department of Animal and Food Sciences, University of Delaware, Newark, DE 19717, USA
Fiona M McCarthy School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ 85721, USA
Brant C Faircloth Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90019, USA. Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
Federico G Hoffmann Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA. Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA
Travis C Glenn Department of Environmental Health Science, University of Georgia, Athens, GA 30602, USA
Toni Gabaldón Bioinformatics and Genomics Programme, Centre for Genomic Regulation, 08003 Barcelona, Spain. Universitat Pompeu Fabra, 08003 Barcelona, Spain. Institució Catalana de Recerca i Estudis Avançats, 08010 Barcelona, Spain
Benedict Paten Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
David A Ray Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA. Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA. Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA.

Collapse

Paten B, Zerbino DR, Hickey G, Haussler D. A unifying model of genome evolution under parsimony. BMC Bioinformatics 2014;15:206. [PMID: 24946830 PMCID: PMC4082375 DOI: 10.1186/1471-2105-15-206] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2013] [Accepted: 05/08/2014] [Indexed: 11/23/2022] Open

Rajaraman A, Tannier E, Chauve C. FPSAC: fast phylogenetic scaffolding of ancient contigs. ACTA ACUST UNITED AC 2013;29:2987-94. [PMID: 24068034 DOI: 10.1093/bioinformatics/btt527] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Hiller M, Agarwal S, Notwell JH, Parikh R, Guturu H, Wenger AM, Bejerano G. Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish. Nucleic Acids Res 2013;41:e151. [PMID: 23814184 PMCID: PMC3753653 DOI: 10.1093/nar/gkt557] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

A scalable and flexible approach for investigating the genomic landscapes of phylogenetic incongruence. Mol Phylogenet Evol 2013;66:1067-74. [DOI: 10.1016/j.ympev.2012.11.023] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Revised: 11/16/2012] [Accepted: 11/25/2012] [Indexed: 11/19/2022]

Blanchette M. Exploiting ancestral mammalian genomes for the prediction of human transcription factor binding sites. BMC Bioinformatics 2012;13 Suppl 19:S2. [PMID: 23281809 PMCID: PMC3526440 DOI: 10.1186/1471-2105-13-s19-s2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

A "forward genomics" approach links genotype to phenotype using independent phenotypic losses among related species. Cell Rep 2012;2:817-23. [PMID: 23022484 DOI: 10.1016/j.celrep.2012.08.032] [Citation(s) in RCA: 90] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2012] [Revised: 07/31/2012] [Accepted: 08/30/2012] [Indexed: 12/27/2022] Open

Romiguier J, Ranwez V, Douzery EJP, Galtier N. Genomic evidence for large, long-lived ancestors to placental mammals. Mol Biol Evol 2012;30:5-13. [PMID: 22949523 DOI: 10.1093/molbev/mss211] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Ashkenazy H, Penn O, Doron-Faigenboim A, Cohen O, Cannarozzi G, Zomer O, Pupko T. FastML: a web server for probabilistic reconstruction of ancestral sequences. Nucleic Acids Res 2012;40:W580-4. [PMID: 22661579 PMCID: PMC3394241 DOI: 10.1093/nar/gks498] [Citation(s) in RCA: 229] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Sadri J, Diallo AB, Blanchette M. Predicting site-specific human selective pressure using evolutionary signatures. Bioinformatics 2011;27:i266-74. [PMID: 21685080 PMCID: PMC3117352 DOI: 10.1093/bioinformatics/btr241] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Birin H, Tuller T. Efficient algorithms for reconstructing gene content by co-evolution. BMC Bioinformatics 2011;12 Suppl 9:S12. [PMID: 22151715 PMCID: PMC3283311 DOI: 10.1186/1471-2105-12-s9-s12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

In a previous study we demonstrated that co-evolutionary information can be utilized for improving the accuracy of ancestral gene content reconstruction. To this end, we defined a new computational problem, the Ancestral Co-Evolutionary (ACE) problem, and developed algorithms for solving it.

RESULTS

In the current paper we generalize our previous study in various ways. First, we describe new efficient computational approaches for solving the ACE problem. The new approaches are based on reductions to classical methods such as linear programming relaxation, quadratic programming, and min-cut. Second, we report new computational hardness results related to the ACE, including practical cases where it can be solved in polynomial time.Third, we generalize the ACE problem and demonstrate how our approach can be used for inferring parts of the genomes of non-ancestral organisms. To this end, we describe a heuristic for finding the portion of the genome ('dominant set') that can be used to reconstruct the rest of the genome with the lowest error rate. This heuristic utilizes both evolutionary information and co-evolutionary information.We implemented these algorithms on a large input of the ACE problem (95 unicellular organisms, 4,873 protein families, and 10, 576 of co-evolutionary relations), demonstrating that some of these algorithms can outperform the algorithm used in our previous study. In addition, we show that based on our approach a 'dominant set' cab be used reconstruct a major fraction of a genome (up to 79%) with relatively low error-rate (e.g. 0.11). We find that the 'dominant set' tends to include metabolic and regulatory genes, with high evolutionary rate, and low protein abundance and number of protein-protein interactions.

CONCLUSIONS

The ACE problem can be efficiently extended for inferring the genomes of organisms that exist today. In addition, it may be solved in polynomial time in many practical cases. Metabolic and regulatory genes were found to be the most important groups of genes necessary for reconstructing gene content of an organism based on other related genomes.

Collapse

Wray GA. CNCing Is Believing. Science 2011;333:946-7. [DOI: 10.1126/science.1210771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Snir S, Pachter L. Tracing the most parsimonious indel history. J Comput Biol 2011;18:967-86. [PMID: 21728862 DOI: 10.1089/cmb.2010.0325] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Paten B, Earl D, Nguyen N, Diekhans M, Zerbino D, Haussler D. Cactus: Algorithms for genome multiple sequence alignment. Genome Res 2011;21:1512-28. [PMID: 21665927 DOI: 10.1101/gr.123356.111] [Citation(s) in RCA: 157] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Mancheron A, Uricaru R, Rivals E. An alternative approach to multiple genome comparison. Nucleic Acids Res 2011;39:e101. [PMID: 21646341 PMCID: PMC3159434 DOI: 10.1093/nar/gkr177] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open

Ma J. Reconstructing the history of large-scale genomic changes: biological questions and computational challenges. J Comput Biol 2011;18:879-93. [PMID: 21563973 DOI: 10.1089/cmb.2010.0189] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Horvath JE, Sheedy CB, Merrett SL, Diallo AB, Swofford DL, NISC Comparative Sequencing Program, Green ED, Willard HF. Comparative analysis of the primate X-inactivation center region and reconstruction of the ancestral primate XIST locus. Genome Res 2011;21:850-62. [PMID: 21518738 DOI: 10.1101/gr.111849.110] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Error and error mitigation in low-coverage genome assemblies. PLoS One 2011;6:e17034. [PMID: 21340033 PMCID: PMC3038916 DOI: 10.1371/journal.pone.0017034] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Accepted: 01/10/2011] [Indexed: 11/19/2022] Open

Tuller T, Birin H, Kupiec M, Ruppin E. Reconstructing ancestral genomic sequences by co-evolution: formal definitions, computational issues, and biological examples. J Comput Biol 2010;17:1327-44. [PMID: 20874411 DOI: 10.1089/cmb.2010.0112] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Helaers R, Milinkovitch MC. MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics. BMC Bioinformatics 2010;11:379. [PMID: 20633263 PMCID: PMC2912891 DOI: 10.1186/1471-2105-11-379] [Citation(s) in RCA: 84] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Accepted: 07/15/2010] [Indexed: 11/11/2022] Open

Abstract

Background

The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities.

Results

Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers.

Conclusions

The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.

Collapse

Hanson-Smith V, Kolaczkowski B, Thornton JW. Robustness of ancestral sequence reconstruction to phylogenetic uncertainty. Mol Biol Evol 2010;27:1988-99. [PMID: 20368266 PMCID: PMC2922618 DOI: 10.1093/molbev/msq081] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Li G, Ma J, Zhang L. Greedy selection of species for ancestral state reconstruction on phylogenies: elimination is better than insertion. PLoS One 2010;5:e8985. [PMID: 20140213 PMCID: PMC2816206 DOI: 10.1371/journal.pone.0008985] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2009] [Accepted: 01/05/2010] [Indexed: 12/26/2022] Open

Kim J, Sinha S. Towards realistic benchmarks for multiple alignments of non-coding sequences. BMC Bioinformatics 2010;11:54. [PMID: 20102627 PMCID: PMC2823711 DOI: 10.1186/1471-2105-11-54] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 01/26/2010] [Indexed: 02/02/2023] Open

Abstract

BACKGROUND

With the continued development of new computational tools for multiple sequence alignment, it is necessary today to develop benchmarks that aid the selection of the most effective tools. Simulation-based benchmarks have been proposed to meet this necessity, especially for non-coding sequences. However, it is not clear if such benchmarks truly represent real sequence data from any given group of species, in terms of the difficulty of alignment tasks.

RESULTS

We find that the conventional simulation approach, which relies on empirically estimated values for various parameters such as substitution rate or insertion/deletion rates, is unable to generate synthetic sequences reflecting the broad genomic variation in conservation levels. We tackle this problem with a new method for simulating non-coding sequence evolution, by relying on genome-wide distributions of evolutionary parameters rather than their averages. We then generate synthetic data sets to mimic orthologous sequences from the Drosophila group of species, and show that these data sets truly represent the variability observed in genomic data in terms of the difficulty of the alignment task. This allows us to make significant progress towards estimating the alignment accuracy of current tools in an absolute sense, going beyond only a relative assessment of different tools. We evaluate six widely used multiple alignment tools in the context of Drosophila non-coding sequences, and find the accuracy to be significantly different from previously reported values. Interestingly, the performance of most tools degrades more rapidly when there are more insertions than deletions in the data set, suggesting an asymmetric handling of insertions and deletions, even though none of the evaluated tools explicitly distinguishes these two types of events. We also examine the accuracy of two existing tools for annotating insertions versus deletions, and find their performance to be close to optimal in Drosophila non-coding sequences if provided with the true alignments.

CONCLUSION

We have developed a method to generate benchmarks for multiple alignments of Drosophila non-coding sequences, and shown it to be more realistic than traditional benchmarks. Apart from helping to select the most effective tools, these benchmarks will help practitioners of comparative genomics deal with the effects of alignment errors, by providing accurate estimates of the extent of these errors.

Collapse

Tuller T, Birin H, Gophna U, Kupiec M, Ruppin E. Reconstructing ancestral gene content by coevolution. Genome Res 2009;20:122-32. [PMID: 19948819 DOI: 10.1101/gr.096115.109] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Abstract

Inferring the gene content of ancestral genomes is a fundamental challenge in molecular evolution. Due to the statistical nature of this problem, ancestral genomes inferred by the maximum likelihood (ML) or the maximum-parsimony (MP) methods are prone to considerable error rates. In general, these errors are difficult to abolish by using longer genomic sequences or by analyzing more taxa. This study describes a new approach for improving ancestral genome reconstruction, the ancestral coevolver (ACE), which utilizes coevolutionary information to improve the accuracy of such reconstructions over previous approaches. The principal idea is to reduce the potentially large solution space by choosing a single optimal (or near optimal) solution that is in accord with the coevolutionary relationships between protein families. Simulation experiments, both on artificial and real biological data, show that ACE yields a marked decrease in error rate compared with ML or MP. Applied to a large data set (95 organisms, 4873 protein families, and 10,000 coevolutionary relationships), some of the ancestral genomes reconstructed by ACE were remarkably different in their gene content from those reconstructed by ML or MP alone (more than 10% in some nodes). These reconstructions, while having almost similar likelihood/parsimony scores as those obtained with ML/MP, had markedly higher concordance with the coevolutionary information. Specifically, when ACE was implemented to improve the results of ML, it added a large number of proteins to those encoded by LUCA (last universal common ancestor), most of them ribosomal proteins and components of the F(0)F(1)-type ATP synthase/ATPases, complexes that are vital in most living organisms. Our analysis suggests that LUCA appears to have been bacterial-like and had a genome size similar to the genome sizes of many extant organisms.

Collapse

Diallo AB, Makarenkov V, Blanchette M. Ancestors 1.0: a web server for ancestral sequence reconstruction. Bioinformatics 2009;26:130-1. [PMID: 19850756 DOI: 10.1093/bioinformatics/btp600] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Bradley RK, Holmes I. Evolutionary triplet models of structured RNA. PLoS Comput Biol 2009;5:e1000483. [PMID: 19714212 PMCID: PMC2725318 DOI: 10.1371/journal.pcbi.1000483] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2008] [Accepted: 07/23/2009] [Indexed: 12/31/2022] Open

Abstract

The reconstruction and synthesis of ancestral RNAs is a feasible goal for paleogenetics. This will require new bioinformatics methods, including a robust statistical framework for reconstructing histories of substitutions, indels and structural changes. We describe a "transducer composition" algorithm for extending pairwise probabilistic models of RNA structural evolution to models of multiple sequences related by a phylogenetic tree. This algorithm draws on formal models of computational linguistics as well as the 1985 protosequence algorithm of David Sankoff. The output of the composition algorithm is a multiple-sequence stochastic context-free grammar. We describe dynamic programming algorithms, which are robust to null cycles and empty bifurcations, for parsing this grammar. Example applications include structural alignment of non-coding RNAs, propagation of structural information from an experimentally-characterized sequence to its homologs, and inference of the ancestral structure of a set of diverged RNAs. We implemented the above algorithms for a simple model of pairwise RNA structural evolution; in particular, the algorithms for maximum likelihood (ML) alignment of three known RNA structures and a known phylogeny and inference of the common ancestral structure. We compared this ML algorithm to a variety of related, but simpler, techniques, including ML alignment algorithms for simpler models that omitted various aspects of the full model and also a posterior-decoding alignment algorithm for one of the simpler models. In our tests, incorporation of basepair structure was the most important factor for accurate alignment inference; appropriate use of posterior-decoding was next; and fine details of the model were least important. Posterior-decoding heuristics can be substantially faster than exact phylogenetic inference, so this motivates the use of sum-over-pairs heuristics where possible (and approximate sum-over-pairs). For more exact probabilistic inference, we discuss the use of transducer composition for ML (or MCMC) inference on phylogenies, including possible ways to make the core operations tractable.

Collapse

Wilson MA, Makova KD. Evolution and survival on eutherian sex chromosomes. PLoS Genet 2009;5:e1000568. [PMID: 19609352 PMCID: PMC2704370 DOI: 10.1371/journal.pgen.1000568] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2008] [Accepted: 06/18/2009] [Indexed: 11/19/2022] Open

Abstract

Since the two eutherian sex chromosomes diverged from an ancestral autosomal pair, the X has remained relatively gene-rich, while the Y has lost most of its genes through the accumulation of deleterious mutations in nonrecombining regions. Presently, it is unclear what is distinctive about genes that remain on the Y chromosome, when the sex chromosomes acquired their unique evolutionary rates, and whether X-Y gene divergence paralleled that of paralogs located on autosomes. To tackle these questions, here we juxtaposed the evolution of X and Y homologous genes (gametologs) in eutherian mammals with their autosomal orthologs in marsupial and monotreme mammals. We discovered that genes on the X and Y acquired distinct evolutionary rates immediately following the suppression of recombination between the two sex chromosomes. The Y-linked genes evolved at higher rates, while the X-linked genes maintained the lower evolutionary rates of the ancestral autosomal genes. These distinct rates have been maintained throughout the evolution of X and Y. Specifically, in humans, most X gametologs and, curiously, also most Y gametologs evolved under stronger purifying selection than similarly aged autosomal paralogs. Finally, after evaluating the current experimental data from the literature, we concluded that unique mRNA/protein expression patterns and functions acquired by Y (versus X) gametologs likely contributed to their retention. Our results also suggest that either the boundary between sex chromosome strata 3 and 4 should be shifted or that stratum 3 should be divided into two strata.

Using recently available marsupial and monotreme genomes, we investigated nascent sex chromosome evolution in mammals. We show that, in eutherian mammals, X and Y genes acquired distinct evolutionary rates and functional constraints immediately after recombination suppression; X-linked genes maintained lower, ancestral (autosomal), rates, whereas the evolutionary rates of Y-linked genes increased. Most X and, unexpectedly, Y genes evolved under stronger purifying selection than similarly aged autosomal paralogs. However, we also observed that the divergence of gametologs and paralogs shared similar features. In addition, many Y-linked copies evolved unique functions and expression patterns compared to their counterparts on the X chromosome. Therefore, our results suggest that to be retained on the Y chromosome, genes need to acquire separately valuable expression and/or functions to be safeguarded by purifying selection.

Collapse

Fletcher W, Yang Z. INDELible: a flexible simulator of biological sequence evolution. Mol Biol Evol 2009;26:1879-88. [PMID: 19423664 PMCID: PMC2712615 DOI: 10.1093/molbev/msp098] [Citation(s) in RCA: 319] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

Liberles DA. Reading the Story in DNA: A Beginner's Guide to Molecular Evolution. Syst Biol 2009. [DOI: 10.1093/sysbio/syp003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open