1
|
Fourie G, Van der Merwe NA, Wingfield BD, Bogale M, Wingfield MJ, Steenkamp ET. Mitochondrial introgression and interspecies recombination in the Fusarium fujikuroi species complex. IMA Fungus 2018; 9:37-48. [PMID: 30018871 PMCID: PMC6048563 DOI: 10.5598/imafungus.2018.09.01.04] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Accepted: 02/18/2018] [Indexed: 01/31/2023] Open
Abstract
The Fusarium fujikuroi species complex (FFSC) is an economically important monophyletic lineage in the genus Fusarium. Incongruence observed among mitochondrial gene trees, as well as the multiple non-orthologous copies of the internal transcribed spacer region of the ribosomal RNA genes, suggests that the origin and history of this complex likely involved interspecies gene flow. Based on this hypothesis, the mitochondrial genomes of non-conspecific species should harbour signatures of introgression or introgressive hybridization. The aim of this study was therefore to search for recombination between the mitochondrial genomes of different species in the FFSC. Using methods based on mt genome sequence similarity, five significant recombinant regions in both gene and intergenic regions were detected. Using coalescent-based methods and the sequences for individual mt genes, various ancestral recombination events between different lineages of the FFSC were also detected. These findings suggest that interspecies gene flow and introgression are likely to have played key roles in the evolution of the FFSC at both ancient and more recent time scales.
Collapse
Affiliation(s)
- Gerda Fourie
- Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Nicolaas A Van der Merwe
- Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Brenda D Wingfield
- Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Mesfin Bogale
- Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Michael J Wingfield
- Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Emma T Steenkamp
- Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| |
Collapse
|
2
|
Polanski A, Szczesna A, Garbulowski M, Kimmel M. Coalescence computations for large samples drawn from populations of time-varying sizes. PLoS One 2017; 12:e0170701. [PMID: 28170404 PMCID: PMC5295683 DOI: 10.1371/journal.pone.0170701] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Accepted: 01/09/2017] [Indexed: 11/19/2022] Open
Abstract
We present new results concerning probability distributions of times in the coalescence tree and expected allele frequencies for coalescent with large sample size. The obtained results are based on computational methodologies, which involve combining coalescence time scale changes with techniques of integral transformations and using analytical formulae for infinite products. We show applications of the proposed methodologies for computing probability distributions of times in the coalescence tree and their limits, for evaluation of accuracy of approximate expressions for times in the coalescence tree and expected allele frequencies, and for analysis of large human mitochondrial DNA dataset.
Collapse
Affiliation(s)
- Andrzej Polanski
- Institute of Informatics, Silesian University of Technology, ul. Akademicka 16, 44-100 Gliwice, Poland
- * E-mail:
| | - Agnieszka Szczesna
- Institute of Informatics, Silesian University of Technology, ul. Akademicka 16, 44-100 Gliwice, Poland
| | - Mateusz Garbulowski
- Institute of Informatics, Silesian University of Technology, ul. Akademicka 16, 44-100 Gliwice, Poland
- The Linnaeus Centre for Bioinformatics, Uppsala University, BMC, Uppsala, Sweden
| | - Marek Kimmel
- Systems Engineering Group, Silesian University of Technology, ul. Akademicka 16, 44-100 Gliwice, Poland
- Department of Statistics, Rice University, M.S. 138, 6100 Main Street, Houston, TX 77005, United States of America
| |
Collapse
|
3
|
Parida L. Ancestral recombinations graph: a reconstructability perspective using random-graphs framework. J Comput Biol 2011; 17:1345-70. [PMID: 20976875 DOI: 10.1089/cmb.2009.0243] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We present a random graphs framework to study pedigree history in an ideal (Wright Fisher) population. This framework correlates the underlying mathematical objects in, for example, pedigree graph, mtDNA or NRY Chr tree, ARG (Ancestral Recombinations Graph), and HUD used in literature, into a single unified random graph framework. It also gives a natural definition, based solely on the topology, of an ARG, one of the most interesting as well as useful mathematical objects in this area. The random graphs framework gives an alternative parametrization of the ARG that does not use the recombination rate q and instead uses a parameter M based on the (estimate of ) the number of non-mixing segments in the extant units. This seems more natural in a setting that attempts to tease apart the population dynamics from the biology of the units. This framework also gives a purely topological definition of GMRCA, analogous to MRCA on trees (which has a purely topological description i.e., it is a root, graph-theoretically speaking, of a tree). Secondly, with a natural extension of the ideas from random-graphs we present a sampling (simulation) algorithm to construct random instances of ARG/unilinear transmission graph. This is the first (to the best of the author's knowledge) algorithm that guarantees uniform sampling of the space of ARG instances, reflecting the ideal population model. Finally, using a measure of reconstructability of the past historical events given a collection of extant sequences, we conclude for a given set of extant sequences, the joint history of local segments along a chromosome is reconstructible.
Collapse
Affiliation(s)
- Laxmi Parida
- Computational Biology Center, IBM T J Watson Research, Yorktown, New York, USA.
| |
Collapse
|
4
|
McVean GAT, Cardin NJ. Approximating the coalescent with recombination. Philos Trans R Soc Lond B Biol Sci 2005; 360:1387-93. [PMID: 16048782 PMCID: PMC1569517 DOI: 10.1098/rstb.2005.1673] [Citation(s) in RCA: 207] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The coalescent with recombination describes the distribution of genealogical histories and resulting patterns of genetic variation in samples of DNA sequences from natural populations. However, using the model as the basis for inference is currently severely restricted by the computational challenge of estimating the likelihood. We discuss why the coalescent with recombination is so challenging to work with and explore whether simpler models, under which inference is more tractable, may prove useful for genealogy-based inference. We introduce a simplification of the coalescent process in which coalescence between lineages with no overlapping ancestral material is banned. The resulting process has a simple Markovian structure when generating genealogies sequentially along a sequence, yet has very similar properties to the full model, both in terms of describing patterns of genetic variation and as the basis for statistical inference.
Collapse
Affiliation(s)
- Gilean A T McVean
- Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, OX1 3TG, UK.
| | | |
Collapse
|
5
|
Takebayashi N, Newbigin E, Uyenoyama MK. Maximum-likelihood estimation of rates of recombination within mating-type regions. Genetics 2005; 167:2097-109. [PMID: 15342543 PMCID: PMC1471000 DOI: 10.1534/genetics.103.021535] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Features common to many mating-type regions include recombination suppression over large genomic tracts and cosegregation of genes of various functions, not necessarily related to reproduction. Model systems for homomorphic self-incompatibility (SI) in flowering plants share these characteristics. We introduce a method for the exact computation of the joint probability of numbers of neutral mutations segregating at the determinant of mating type and at a linked marker locus. The underlying Markov model incorporates strong balancing selection into a two-locus coalescent. We apply the method to obtain a maximum-likelihood estimate of the rate of recombination between a marker locus, 48A, and S-RNase, the determinant of SI specificity in pistils of Nicotiana alata. Even though the sampled haplotypes show complete allelic linkage disequilibrium and recombinants have never been detected, a highly significant deficiency of synonymous substitutions at 48A compared to S-RNase suggests a history of recombination. Our maximum-likelihood estimate indicates a rate of recombination of perhaps 3 orders of magnitude greater than the rate of synonymous mutation. This approach may facilitate the construction of genetic maps of regions tightly linked to targets of strong balancing selection.
Collapse
Affiliation(s)
- Naoki Takebayashi
- Department of Biology, Duke University, Durham, North Carolina 27708-0338, USA
| | | | | |
Collapse
|
6
|
Abstract
What is the chance that some part of a stretch of genome will survive? In a population of constant size, and with no selection, the probability of survival of some part of a stretch of map length y < 1 approaches y/log(yt/2) for log(yt) > or = 1. Thus, the whole genome is certain to be lost, but the rate of loss is extremely slow. This solution extends to give the whole distribution of surviving block sizes as a function of time. We show that the expected number of blocks at time t is 1+yt and give expressions for the moments of the number of blocks and the total amount of genome that survives for a given time. The solution is based on a branching process and assumes complete interference between crossovers, so that each descendant carries only a single block of ancestral material. We consider cases where most individuals carry multiple blocks, either because there are multiple crossovers in a long genetic map, or because enough time has passed that most individuals in the population are related to each other. For species such as ours, which have a long genetic map, the genome of any individual which leaves descendants (approximately 80% of the population for a Poisson offspring number with mean two) is likely to persist for an extremely long time, in the form of a few short blocks of genome.
Collapse
Affiliation(s)
- S J E Baird
- Museum of Vertebrate Zoology, UC Berkeley, 3101 Valley Life Sciences Building, Berkeley, CA 94720-3160, USA
| | | | | |
Collapse
|
7
|
Polanski A, Bobrowski A, Kimmel M. A note on distributions of times to coalescence, under time-dependent population size. Theor Popul Biol 2003; 63:33-40. [PMID: 12464493 DOI: 10.1016/s0040-5809(02)00010-2] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Expressions for marginal distributions of times in the time-varying coalescence process are derived. The proposed method allows also for computation of joint probability distribution for pairs, triples, etc. of coalescence times. The expressions derived are useful for (1) extending several statistics from time constant to time-varying case, (2) increasing efficiency and accuracy of simulations in time-varying evolution, and (3) debugging coalescence simulation software.
Collapse
Affiliation(s)
- A Polanski
- Department of Statistics, Rice University, 6100 Main Street, Mail Stop 138, P.O. Box 1892, Houston, TX, USA
| | | | | |
Collapse
|
8
|
Abstract
We investigate the shape of a phylogenetic tree reconstructed from sequences evolving under the coalescent with recombination. The motivation is that evolutionary inferences are often made from phylogenetic trees reconstructed from population data even though recombination may well occur (mtDNA or viral sequences) or does occur (nuclear sequences). We investigate the size and direction of biases when a single tree is reconstructed ignoring recombination. Standard software (PHYLIP) was used to construct the best phylogenetic tree from sequences simulated under the coalescent with recombination. With recombination present, the length of terminal branches and the total branch length are larger, and the time to the most recent common ancestor smaller, than for a tree reconstructed from sequences evolving with no recombination. The effects are pronounced even for small levels of recombination that may not be immediately detectable in a data set. The phylogenies when recombination is present superficially resemble phylogenies for sequences from an exponentially growing population. However, exponential growth has a different effect on statistics such as Tajima's D. Furthermore, ignoring recombination leads to a large overestimation of the substitution rate heterogeneity and the loss of the molecular clock. These results are discussed in relation to viral and mtDNA data sets.
Collapse
Affiliation(s)
- M H Schierup
- Department of Ecology and Genetics, University of Aarhus, DK-8000 Aarhus C., Denmark.
| | | |
Collapse
|
9
|
|