Reference Citation Analysis

For: Chen GK, Marjoram P, Wall JD. Fast and flexible simulation of DNA sequence data. Genome Res 2009;19:136-42. [PMID: 19029539 DOI: 10.1101/gr.083634.108] [Citation(s) in RCA: 254] [Impact Index Per Article: 15.9] [Indexed: 11/25/2022]

Cited by Other Article(s)

Atanda SA, Bandillo N. Genomic-inferred cross-selection methods for multi-trait improvement in a recurrent selection breeding program. PLANT METHODS 2024;20:133. [PMID: 39218896 PMCID: PMC11367796 DOI: 10.1186/s13007-024-01258-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/11/2023] [Accepted: 08/05/2024] [Indexed: 09/04/2024]

Shang J, Xu A, Bi M, Zhang Y, Li F, Liu JX. A review: simulation tools for genome-wide interaction studies. Brief Funct Genomics 2024:elae034. [PMID: 39173096 DOI: 10.1093/bfgp/elae034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/22/2024] [Revised: 07/25/2024] [Accepted: 08/10/2024] [Indexed: 08/24/2024]

Peixoto MA, Coelho IF, Leach KA, Lübberstedt T, Bhering LL, Resende MFR. Use of simulation to optimize a sweet corn breeding program: implementing genomic selection and doubled haploid technology. G3 (BETHESDA, MD.) 2024;14:jkae128. [PMID: 38869242 PMCID: PMC11304600 DOI: 10.1093/g3journal/jkae128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/06/2023] [Revised: 04/06/2024] [Accepted: 05/21/2024] [Indexed: 06/14/2024]

Pocrnic I, Lourenco D, Misztal I. Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions. Genetics 2024;227:iyae103. [PMID: 38913695 PMCID: PMC11304960 DOI: 10.1093/genetics/iyae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2024] [Revised: 06/07/2024] [Accepted: 06/16/2024] [Indexed: 06/26/2024]

Dubey R, Zustovi R, Landschoot S, Dewitte K, Verlinden G, Haesaert G, Maenhout S. Harnessing monocrop breeding strategies for intercrops. FRONTIERS IN PLANT SCIENCE 2024;15:1394413. [PMID: 38799097 PMCID: PMC11119317 DOI: 10.3389/fpls.2024.1394413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/01/2024] [Accepted: 04/22/2024] [Indexed: 05/29/2024]

Legarra A, Bermann M, Mei Q, Christensen OF. Estimating genomic relationships of metafounders across and within breeds using maximum likelihood, pseudo-expectation-maximization maximum likelihood and increase of relationships. Genet Sel Evol 2024;56:35. [PMID: 38698347 DOI: 10.1186/s12711-024-00892-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/08/2023] [Accepted: 03/18/2024] [Indexed: 05/05/2024]

Abstract

BACKGROUND

The theory of "metafounders" proposes a unified framework for relationships across base populations within breeds (e.g. unknown parent groups), and base populations across breeds (crosses) together with a sensible compatibility with genomic relationships. Considering metafounders might be advantageous in pedigree best linear unbiased prediction (BLUP) or single-step genomic BLUP. Existing methods to estimate relationships across metafounders Γ are not well adapted to highly unbalanced data, genotyped individuals far from base populations, or many unknown parent groups (within breed per year of birth).

METHODS

We derive likelihood methods to estimate Γ. For a single metafounder, summary statistics of pedigree and genomic relationships allow deriving a cubic equation with the real root being the maximum likelihood (ML) estimate of Γ. This equation is tested with Lacaune sheep data. For several metafounders, we split the first derivative of the complete likelihood in a term related to Γ, and a second term related to Mendelian sampling variances. Approximating the first derivative by its first term results in a pseudo-EM algorithm that iteratively updates the estimate of Γ by the corresponding block of the H-matrix. The method extends to complex situations with groups defined by year of birth, modelling the increase of Γ using estimates of the rate of increase of inbreeding (ΔF), resulting in an expanded Γ and in a pseudo-EM+ΔF algorithm. We compare these methods with the generalized least squares (GLS) method using simulated data: complex crosses of two breeds in equal or unsymmetrical proportions; and in two breeds, with 10 groups per year of birth within breed. We simulate genotyping in all generations or in the last ones.

RESULTS

For a single metafounder, the ML estimates of the Lacaune data corresponded to the maximum. For simulated data, when genotypes were spread across all generations, both GLS and pseudo-EM(+ΔF) methods were accurate. With genotypes only available in the most recent generations, the GLS method was biased, whereas the pseudo-EM(+ΔF) approach yielded more accurate and unbiased estimates.

CONCLUSIONS

We derived ML, pseudo-EM and pseudo-EM+ΔF methods to estimate Γ in many realistic settings. Estimates are accurate in real and simulated data and have a low computational cost.

Azevedo CF, Ferrão LFV, Benevenuto J, de Resende MDV, Nascimento M, Nascimento ACC, Munoz PR. Using visual scores for genomic prediction of complex traits in breeding programs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023;137:9. [PMID: 38102495 DOI: 10.1007/s00122-023-04512-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/09/2023] [Accepted: 11/21/2023] [Indexed: 12/17/2023]

Abstract

KEY MESSAGE

An approach for handling visual scores with potential errors and subjectivity in scores was evaluated in simulated and blueberry recurrent selection breeding schemes to assist breeders in their decision-making. Most genomic prediction methods are based on assumptions of normality due to their simplicity and ease of implementation. However, in plant and animal breeding, continuous traits are often visually scored as categorical traits and analyzed as a Gaussian variable, thus violating the normality assumption, which could affect the prediction of breeding values and the estimation of genetic parameters. In this study, we examined the main challenges of visual scores for genomic prediction and genetic parameter estimation using mixed models, Bayesian, and machine learning methods. We evaluated these approaches using simulated and real breeding data sets. Our contribution in this study is a five-fold demonstration: (i) collecting data using an intermediate number of categories (1-3 and 1-5) is the best strategy, even considering errors associated with visual scores; (ii) Linear Mixed Models and Bayesian Linear Regression are robust to the normality violation, but marginal gains can be achieved when using Bayesian Ordinal Regression Models (BORM) and Random Forest Classification; (iii) genetic parameters are better estimated using BORM; (iv) our conclusions using simulated data are also applicable to real data in autotetraploid blueberry; and (v) a comparison of continuous and categorical phenotypes found that investing in the evaluation of 600-1000 categorical data points with low error, when it is not feasible to collect continuous phenotypes, is a strategy for improving predictive abilities. Our findings suggest the best approaches for effectively using visual scores traits to explore genetic information in breeding programs and highlight the importance of investing in the training of evaluator teams and in high-quality phenotyping.
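As a rough illustration of what scoring a continuous trait into a small number of categories means, the sketch below bins continuous phenotypes into ordinal scores and optionally perturbs them to mimic evaluator error. The equal-width cut points, the one-category error model, and the name `visual_score` are assumptions made for this example; the abstract does not state how the study generated its categorical scores.

```python
import random

def visual_score(values, n_categories=5, error_rate=0.0, seed=None):
    """Convert continuous phenotypes to ordinal scores 1..n_categories.

    Cut points are equally spaced between the observed minimum and maximum
    (an assumption for this sketch). With probability error_rate a score is
    shifted by one category up or down, clamped to the valid range, to mimic
    evaluator error.
    """
    rng = random.Random(seed)
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_categories or 1.0  # fall back to 1.0 if all values are equal
    scores = []
    for v in values:
        score = min(int((v - lo) / width) + 1, n_categories)
        if rng.random() < error_rate:
            score = min(max(score + rng.choice((-1, 1)), 1), n_categories)
        scores.append(score)
    return scores

phenotypes = [0.0, 2.5, 5.0, 7.5, 10.0]  # hypothetical continuous trait values
print(visual_score(phenotypes, n_categories=5))
print(visual_score(phenotypes, n_categories=3, error_rate=0.2, seed=1))
```

Comparing predictions from scores produced with different `n_categories` and `error_rate` values against predictions from the underlying continuous values is one simple way to explore the trade-offs the abstract describes.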

Fritsche-Neto R, Ali J, De Asis EJ, Allahgholipour M, Labroo MR. Improving hybrid rice breeding programs via stochastic simulations: number of parents, number of hybrids, tester update, and genomic prediction of hybrid performance. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023;137:3. [PMID: 38085288 PMCID: PMC10716074 DOI: 10.1007/s00122-023-04508-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/25/2023] [Accepted: 11/18/2023] [Indexed: 12/18/2023]

Ayala NM, Genetti M, Corbett-Detig R. Inferring multi-locus selection in admixed populations. PLoS Genet 2023;19:e1011062. [PMID: 38015992 PMCID: PMC10707604 DOI: 10.1371/journal.pgen.1011062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/23/2023] [Revised: 12/08/2023] [Accepted: 11/13/2023] [Indexed: 11/30/2023]

Aoki S, Ishihama F, Fukasawa K. Robustness of genetic diversity measures under spatial sampling and a new frequency-independent measure. PeerJ 2023;11:e16027. [PMID: 37744217 PMCID: PMC10512937 DOI: 10.7717/peerj.16027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/16/2022] [Accepted: 08/13/2023] [Indexed: 09/26/2023]

Grohmann CJ, Shull CM, Crum TE, Schwab C, Safranski TJ, Decker JE. Analysis of polygenic selection in purebred and crossbred pig genomes using generation proxy selection mapping. Genet Sel Evol 2023;55:62. [PMID: 37710159 PMCID: PMC10500877 DOI: 10.1186/s12711-023-00836-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2022] [Accepted: 08/25/2023] [Indexed: 09/16/2023]

Abstract

BACKGROUND

Artificial selection on quantitative traits using breeding values and selection indices in commercial livestock breeding populations causes changes in allele frequency over time at hundreds or thousands of causal loci and the surrounding genomic regions. In population genetics, this type of selection is called polygenic selection. Researchers and managers of pig breeding programs are motivated to understand the genetic basis of phenotypic diversity across genetic lines, breeds, and populations using selection mapping analyses. Here, we applied generation proxy selection mapping (GPSM), a genome-wide association analysis of single nucleotide polymorphism (SNP) genotypes (38,294-46,458 markers) of birth date, in four pig populations (15,457, 15,772, 16,595 and 8447 pigs per population) to identify loci responding to artificial selection over a period of five to ten years. Gene-drop simulation analyses were conducted to provide context for the GPSM results. Selected loci within and across each population of pigs were compared in the context of swine breeding objectives.
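The gene-drop simulations mentioned above can be pictured with a minimal sketch: founder alleles at one biallelic locus are sampled at an assumed base frequency and then passed down a known pedigree by Mendelian sampling alone, with no selection. The pedigree layout, the function names, and the founder frequency are hypothetical and are not taken from the study.

```python
import random

def gene_drop(pedigree, founder_freq=0.5, seed=None):
    """Drop alleles at one biallelic locus through a pedigree.

    pedigree: list of (individual, sire, dam) tuples with parents listed
    before their offspring; founders have both sire and dam set to None.
    Returns a dict mapping individual -> (paternal allele, maternal allele),
    with alleles coded 0/1.
    """
    rng = random.Random(seed)
    genotypes = {}
    for ind, sire, dam in pedigree:
        if sire is None and dam is None:
            # Founder: sample both alleles from the assumed base frequency.
            genotypes[ind] = tuple(int(rng.random() < founder_freq) for _ in range(2))
        else:
            # Non-founder: inherit one randomly chosen allele from each parent.
            genotypes[ind] = (rng.choice(genotypes[sire]), rng.choice(genotypes[dam]))
    return genotypes

def allele_frequency(genotypes, individuals):
    """Frequency of allele 1 among the listed individuals."""
    return sum(sum(genotypes[i]) for i in individuals) / (2 * len(individuals))

# Hypothetical three-generation pedigree.
pedigree = [
    ("A", None, None), ("B", None, None), ("C", None, None), ("D", None, None),
    ("E", "A", "B"), ("F", "C", "D"),
    ("G", "E", "F"),
]
genos = gene_drop(pedigree, founder_freq=0.3, seed=1)
print(allele_frequency(genos, ["A", "B", "C", "D"]), allele_frequency(genos, ["G"]))
```

Repeating the drop many times gives a null distribution of allele-frequency change under drift alone, which is the kind of context such simulations provide for selection-mapping results.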

RESULTS

The GPSM identified 49 to 854 loci as under selection (Q-values less than 0.10) across 15 subsets of pigs based on combinations of populations. The number of significant associations increased when data were pooled across populations. In addition, several significant associations were identified in more than one population. These results indicate concurrent selection objectives, similar genetic architectures, and shared causal variants responding to selection across these pig populations. Negligible error rates (less than or equal to 0.02%) of false-positive associations were found when testing GPSM on gene-drop simulated genotypes, suggesting that GPSM distinguishes selection from random genetic drift in actual pig populations.
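To make the 0.10 threshold concrete, the sketch below adjusts a list of p-values for multiple testing and keeps those below the cut-off. It uses Benjamini-Hochberg adjusted p-values as a stand-in; the abstract does not say which procedure produced the Q-values, so this choice, the function name `bh_adjust`, and the example p-values are all assumptions.

```python
def bh_adjust(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

p_values = [0.01, 0.04, 0.03, 0.5]  # hypothetical per-SNP p-values
adjusted = bh_adjust(p_values)
significant = [i for i, q in enumerate(adjusted) if q < 0.10]
print(adjusted, significant)
```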

CONCLUSIONS

This work confirms the efficacy and the negligible error rates of the GPSM method in detecting selected loci in commercial pig populations. Our results suggest shared selection objectives and genetic architectures across swine populations. The identified polygenic selection highlights loci that are important to swine production.

Bonizzoni P, Boucher C, Cozzi D, Gagie T, Köppl D, Rossi M. Data Structures for SMEM-Finding in the PBWT. INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL : SPIRE ... : PROCEEDINGS. SPIRE (SYMPOSIUM) 2023;14240:89-101. [PMID: 39149146 PMCID: PMC11325217 DOI: 10.1007/978-3-031-43980-3_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/17/2024]

Hu W, Hao Z, Du P, Di Vincenzo F, Manzi G, Cui J, Fu YX, Pan YH, Li H. Genomic inference of a severe human bottleneck during the Early to Middle Pleistocene transition. Science 2023;381:979-984. [PMID: 37651513 DOI: 10.1126/science.abq7487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2022] [Accepted: 07/11/2023] [Indexed: 09/02/2023]

Affiliation(s)

Wangjie Hu CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China; Key Laboratory of Brain Functional Genomics of Ministry of Education, School of Life Science, East China Normal University, Shanghai, China
Ziqian Hao College of Artificial Intelligence and Big Data for Medical Sciences, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, China
Pengyuan Du CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China; College of Artificial Intelligence and Big Data for Medical Sciences, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, China
Fabio Di Vincenzo Natural History Museum, University of Florence, Florence, Italy
Giorgio Manzi Department of Environmental Biology, Sapienza University of Rome, Rome, Italy
Jialong Cui Key Laboratory of Brain Functional Genomics of Ministry of Education, School of Life Science, East China Normal University, Shanghai, China
Yun-Xin Fu Department of Biostatistics and Data Science, School of Public Health, University of Texas Health Science Center at Houston, Houston, TX, USA; Key Laboratory for Conservation and Utilization of Bioresources, Yunnan University, Kunming, China
Yi-Hsuan Pan Key Laboratory of Brain Functional Genomics of Ministry of Education, School of Life Science, East China Normal University, Shanghai, China
Haipeng Li CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China

Labroo MR, Endelman JB, Gemenet DC, Werner CR, Gaynor RC, Covarrubias-Pazaran GE. Clonal diploid and autopolyploid breeding strategies to harness heterosis: insights from stochastic simulation. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023;136:147. [PMID: 37291402 DOI: 10.1007/s00122-023-04377-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2022] [Accepted: 05/05/2023] [Indexed: 06/10/2023]

Abstract

KEY MESSAGE

Reciprocal recurrent selection sometimes increases genetic gain per unit cost in clonal diploids with heterosis due to dominance, but it typically does not benefit autopolyploids. Breeding can change the dominance as well as additive genetic value of populations, thus utilizing heterosis. A common hybrid breeding strategy is reciprocal recurrent selection (RRS), in which parents of hybrids are typically recycled within pools based on general combining ability. However, the relative performances of RRS and other breeding strategies have not been thoroughly compared. RRS can have relatively increased costs and longer cycle lengths, but these are sometimes outweighed by its ability to harness heterosis due to dominance. Here, we used stochastic simulation to compare genetic gain per unit cost of RRS, terminal crossing, recurrent selection on breeding value, and recurrent selection on cross performance considering different amounts of population heterosis due to dominance, relative cycle lengths, time horizons, estimation methods, selection intensities, and ploidy levels. In diploids with phenotypic selection at high intensity, whether RRS was the optimal breeding strategy depended on the initial population heterosis. However, in diploids with rapid-cycling genomic selection at high intensity, RRS was the optimal breeding strategy after 50 years over almost all amounts of initial population heterosis under the study assumptions. Diploid RRS required more population heterosis to outperform other strategies as its relative cycle length increased and as selection intensity and time horizon decreased. The optimal strategy depended on selection intensity, a proxy for inbreeding rate. Use of diploid fully inbred parents vs. outbred parents with RRS typically did not affect genetic gain. In autopolyploids, RRS typically did not outperform one-pool strategies regardless of the initial population heterosis.

Pocrnic I, Obšteter J, Gaynor RC, Wolc A, Gorjanc G. Assessment of long-term trends in genetic mean and variance after the introduction of genomic selection in layers: a simulation study. Front Genet 2023;14:1168212. [PMID: 37234871 PMCID: PMC10206274 DOI: 10.3389/fgene.2023.1168212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Accepted: 05/02/2023] [Indexed: 05/28/2023]

Abstract

Nucleus-based breeding programs are characterized by intense selection that results in high genetic gain, which inevitably means reduction of genetic variation in the breeding population. Therefore, genetic variation in such breeding systems is typically managed systematically, for example, by avoiding mating the closest relatives to limit progeny inbreeding. However, intense selection requires maximum effort to make such breeding programs sustainable in the long-term. The objective of this study was to use simulation to evaluate the long-term impact of genomic selection on genetic mean and variance in an intense layer chicken breeding program. We developed a large-scale stochastic simulation of an intense layer chicken breeding program to compare conventional truncation selection to genomic truncation selection optimized with either minimization of progeny inbreeding or full-scale optimal contribution selection. We compared the programs in terms of genetic mean, genic variance, conversion efficiency, rate of inbreeding, effective population size, and accuracy of selection. Our results confirmed that genomic truncation selection has immediate benefits compared to conventional truncation selection in all specified metrics. A simple minimization of progeny inbreeding after genomic truncation selection did not provide any significant improvements. Optimal contribution selection was successful in having better conversion efficiency and effective population size compared to genomic truncation selection, but it must be fine-tuned for balance between loss of genetic variance and genetic gain. In our simulation, we measured this balance using trigonometric penalty degrees between truncation selection and a balanced solution and concluded that the best results were between 45° and 65°. This balance is specific to the breeding program and depends on how much immediate genetic gain a breeding program may risk vs. save for the future. 
Furthermore, our results show that the persistence of accuracy is better with optimal contribution selection compared to truncation selection. In general, our results show that optimal contribution selection can ensure long-term success in intensive breeding programs using genomic selection.
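Two of the metrics listed in this abstract, rate of inbreeding and effective population size, are linked by standard textbook relations: ΔF = (F_t − F_{t−1}) / (1 − F_{t−1}) and Ne = 1 / (2ΔF). The sketch below applies them to a series of mean inbreeding coefficients. The abstract does not say how the study computed these quantities, and the function names and input values are hypothetical.

```python
def rate_of_inbreeding(f_prev, f_curr):
    """Per-generation rate of inbreeding: dF = (F_t - F_{t-1}) / (1 - F_{t-1})."""
    if not 0.0 <= f_prev < 1.0:
        raise ValueError("f_prev must be in [0, 1)")
    return (f_curr - f_prev) / (1.0 - f_prev)

def effective_population_size(delta_f):
    """Effective population size from the rate of inbreeding: Ne = 1 / (2 * dF)."""
    if delta_f <= 0.0:
        raise ValueError("delta_f must be positive")
    return 1.0 / (2.0 * delta_f)

mean_f = [0.00, 0.01, 0.0199]  # hypothetical mean inbreeding per generation
for prev, curr in zip(mean_f, mean_f[1:]):
    d_f = rate_of_inbreeding(prev, curr)
    print(round(d_f, 4), round(effective_population_size(d_f), 1))
```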

Obšteter J, Strachan LK, Bubnič J, Prešern J, Gorjanc G. SIMplyBee: an R package to simulate honeybee populations and breeding programs. Genet Sel Evol 2023;55:31. [PMID: 37161307 PMCID: PMC10169377 DOI: 10.1186/s12711-023-00798-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/15/2022] [Accepted: 03/31/2023] [Indexed: 05/11/2023]

Abstract

BACKGROUND

The Western honeybee is an economically important species globally, but has been experiencing colony losses that lead to economic damage and decreased genetic variability. This situation is spurring additional interest in honeybee breeding and conservation programs. Stochastic simulators are essential tools for rapid and low-cost testing of breeding programs and methods, yet no existing simulator allows for a detailed simulation of honeybee populations. Here we describe SIMplyBee, a holistic simulator of honeybee populations and breeding programs. SIMplyBee is an R package and hence freely available for installation from CRAN http://cran.r-project.org/package=SIMplyBee.

IMPLEMENTATION

SIMplyBee builds upon the stochastic simulator AlphaSimR that simulates individuals with their corresponding genomes and quantitative genetic values. To enable honeybee-specific simulations, we extended AlphaSimR by developing classes for global simulation parameters, SimParamBee, for a honeybee colony, Colony, and multiple colonies, MultiColony. We also developed functions to address major honeybee specificities: honeybee genome, haplodiploid inheritance, social organisation, complementary sex determination, polyandry, colony events, and quantitative genetics at the individual- and colony-levels.

RESULTS

We describe its implementation for simulating a honeybee genome, creating a honeybee colony and its members, addressing haplodiploid inheritance and complementary sex determination, simulating colony events, creating and managing multiple colonies at the same time, and obtaining genomic data and honeybee quantitative genetics. Further documentation, available at http://www.SIMplyBee.info, provides details on these operations and describes additional operations related to genomics, quantitative genetics, and other functionalities.

DISCUSSION

SIMplyBee is a holistic simulator of honeybee populations and breeding programs. It simulates individual honeybees with their genomes, colonies with colony events, and individual- and colony-level genetic and breeding values. Regarding the latter, SIMplyBee takes a user-defined function to combine individual- into colony-level values and hence allows for modeling any type of interaction within a colony. SIMplyBee provides a research platform for testing breeding and conservation strategies and their effect on future genetic gain and genetic variability. Future developments of SIMplyBee will focus on improving the simulation of honeybee genomes, optimizing the simulator's performance, and including spatial awareness in mating functions and phenotype simulation. We invite the honeybee genetics and breeding community to join us in the future development of SIMplyBee.

Werner CR, Gaynor RC, Sargent DJ, Lillo A, Gorjanc G, Hickey JM. Genomic selection strategies for clonally propagated crops. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023;136:74. [PMID: 36952013 PMCID: PMC10036424 DOI: 10.1007/s00122-023-04300-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 08/25/2022] [Accepted: 01/14/2023] [Indexed: 05/27/2023]

Lubanga N, Massawe F, Mayes S, Gorjanc G, Bančič J. Genomic selection strategies to increase genetic gain in tea breeding programs. THE PLANT GENOME 2023;16:e20282. [PMID: 36349831 DOI: 10.1002/tpg2.20282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/22/2022] [Accepted: 10/01/2022] [Indexed: 05/10/2023]

Silva JM, Qi W, Pinho AJ, Pratas D. AlcoR: alignment-free simulation, mapping, and visualization of low-complexity regions in biological data. Gigascience 2022;12:giad101. [PMID: 38091509 PMCID: PMC10716826 DOI: 10.1093/gigascience/giad101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/28/2023] [Revised: 09/29/2023] [Accepted: 11/07/2023] [Indexed: 12/18/2023]

Abstract

BACKGROUND

Low-complexity data analysis is the area that addresses the search and quantification of regions in sequences of elements that contain low-complexity or repetitive elements. For example, these can be tandem repeats, inverted repeats, homopolymer tails, GC-biased regions, similar genes, and hairpins, among many others. Identifying these regions is crucial because of their association with regulatory and structural characteristics. Moreover, their identification provides positional and quantity information where standard assembly methodologies face significant difficulties because of substantially higher depth coverage (mountains), ambiguous read mapping, or where sequencing or reconstruction defects may occur. However, the capability to distinguish low-complexity regions (LCRs) in genomic and proteomic sequences is a challenge that depends on the model's ability to find them automatically. Low-complexity patterns can be implicit through specific or combined sources, such as algorithmic or probabilistic, and recurring to different spatial distances, namely local, medium, or distant associations.

FINDINGS

This article addresses the challenge of automatically modeling and distinguishing LCRs, providing a new method and tool (AlcoR) for efficient and accurate segmentation and visualization of these regions in genomic and proteomic sequences. The method enables the use of models with different memories, providing the ability to distinguish local from distant low-complexity patterns. The method is reference and alignment free, providing additional methodologies for testing, including a highly flexible simulation method for generating biological sequences (DNA or protein) with different complexity levels, sequence masking, and a visualization tool for automatic computation of the LCR maps into an ideogram style. We provide illustrative demonstrations using synthetic, nearly synthetic, and natural sequences showing the high efficiency and accuracy of AlcoR. As large-scale results, we use AlcoR to unprecedentedly provide a whole-chromosome low-complexity map of a recent complete human genome and the haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar.

CONCLUSIONS

The AlcoR method enables fast sequence characterization through data complexity analysis, ideal for scenarios involving new or unknown sequences. AlcoR is implemented in C language using multithreading to increase the computational speed, is flexible for multiple applications, and does not contain external dependencies. The tool accepts any sequence in FASTA format. The source code is freely provided at https://github.com/cobilab/alcor.

DoVale JC, Carvalho HF, Sabadin F, Fritsche-Neto R. Genotyping marker density and prediction models effects in long-term breeding schemes of cross-pollinated crops. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022;135:4523-4539. [PMID: 36261658 DOI: 10.1007/s00122-022-04236-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/05/2022] [Accepted: 10/09/2022] [Indexed: 06/16/2023]

Abstract

In genomic recurrent selection, the more markers, the better because they buffer the linkage disequilibrium losses caused by recombination over cycles, and consequently, provide higher responses to selection. Reductions of genotyping marker density have been extensively evaluated as potential strategies to reduce the genotyping costs of genomic selection (GS). Low-density marker panels are appealing in GS because they entail lower multicollinearity and computing time and allow more individuals to be genotyped for the same cost. However, statistical models used in GS are usually evaluated with empirical data, using "static" training sets and populations. This may be adequate for making predictions during a breeding program's initial cycles but not for the long-term. Moreover, studies that focus on long selective breeding cycles generally do not consider GS models with the effect of dominance, which is particularly important for breeding outcomes in cross-pollinated crops. Hence, dominance effects are important and unexplored in GS for long-term programs involving allogamous species. To address it, we employed two approaches: analysis of empirical maize datasets and simulations of long-term breeding applying phenotypic and genomic recurrent selection (intrapopulation and reciprocal schemes). In both schemes, we simulated twenty breeding cycles and assessed the effect of marker density reduction on the population mean, the best crosses, additive variance, selective accuracy, and response to selection with models [additive, additive-dominant, general (GCA), and this plus specific combining ability (GCA + SCA)]. Our results indicate that marker reduction based on linkage disequilibrium levels provides useful predictions only within a cycle, as accuracy significantly decreases over cycles. In the long-term, without training set updating, high-marker density provides the best responses to selection. 
The model to be used depends on the breeding scheme: additive for intrapopulation and additive-dominant or GCA + SCA for reciprocal.

Estimating the genome-wide mutation rate from thousands of unrelated individuals. Am J Hum Genet 2022;109:2178-2184. [PMID: 36370709 PMCID: PMC9748258 DOI: 10.1016/j.ajhg.2022.10.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2022] [Accepted: 10/15/2022] [Indexed: 11/13/2022]

Guo Y, Betzen B, Salcedo A, He F, Bowden RL, Fellers JP, Jordan KW, Akhunova A, Rouse MN, Szabo LJ, Akhunov E. Population genomics of Puccinia graminis f.sp. tritici highlights the role of admixture in the origin of virulent wheat rust races. Nat Commun 2022;13:6287. [PMID: 36271077 PMCID: PMC9587050 DOI: 10.1038/s41467-022-34050-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/31/2022] [Accepted: 10/12/2022] [Indexed: 12/25/2022]

Affiliation(s)

Yuanwen Guo Department of Plant Pathology, Kansas State University, Manhattan, KS USA
Bliss Betzen Department of Plant Pathology, Kansas State University, Manhattan, KS USA; Present Address: USDA-APHIS-PPQ Field Operations, Kansas State University, Manhattan, KS USA
Andres Salcedo Department of Plant Pathology, Kansas State University, Manhattan, KS USA; Present Address: Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC USA
Fei He Department of Plant Pathology, Kansas State University, Manhattan, KS USA; Present Address: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
Robert L. Bowden USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS USA
John P. Fellers USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS USA
Katherine W. Jordan Department of Plant Pathology, Kansas State University, Manhattan, KS USA; USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS USA
Alina Akhunova Department of Plant Pathology, Kansas State University, Manhattan, KS USA; Integrated Genomics Facility, Kansas State University, Manhattan, KS USA
Mathew N. Rouse Department of Plant Pathology, University of Minnesota & USDA-ARS, Cereal Disease Lab, St. Paul, MN USA
Les J. Szabo Department of Plant Pathology, University of Minnesota & USDA-ARS, Cereal Disease Lab, St. Paul, MN USA
Eduard Akhunov Department of Plant Pathology, Kansas State University, Manhattan, KS USA; Wheat Genetics Resource Center, Kansas State University, Manhattan, KS USA

Sabadin F, DoVale JC, Platten JD, Fritsche-Neto R. Optimizing self-pollinated crop breeding employing genomic selection: From schemes to updating training sets. FRONTIERS IN PLANT SCIENCE 2022;13:935885. [PMID: 36275547 PMCID: PMC9583387 DOI: 10.3389/fpls.2022.935885] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 09/12/2022] [Indexed: 06/16/2023]

Wangkumhang P, Greenfield M, Hellenthal G. An efficient method to identify, date, and describe admixture events using haplotype information. Genome Res 2022;32:1553-1564. [PMID: 35794007 PMCID: PMC9435750 DOI: 10.1101/gr.275994.121] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 06/28/2022] [Indexed: 11/24/2022]

Joint inference of ancestry and genotypes of parents from children. iScience 2022;25:104768. [PMID: 35942102 PMCID: PMC9356179 DOI: 10.1016/j.isci.2022.104768] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 05/18/2022] [Accepted: 07/11/2022] [Indexed: 12/02/2022] Open

Perera M, Montserrat DM, Barrabes M, Geleta M, Giro-I-Nieto X, Ioannidis AG. Generative Moment Matching Networks for Genotype Simulation. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022;2022:1379-1383. [PMID: 36086656 DOI: 10.1109/embc48229.2022.9871045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Baller JL, Kachman SD, Kuehn LA, Spangler ML. Using pooled data for genomic prediction in a bivariate framework with missing data. J Anim Breed Genet 2022;139:489-501. [PMID: 35698863 PMCID: PMC9544112 DOI: 10.1111/jbg.12727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 05/21/2022] [Indexed: 11/29/2022]

Chintalapati M, Patterson N, Moorjani P. The spatiotemporal patterns of major human admixture events during the European Holocene. eLife 2022;11:77625. [PMID: 35635751 PMCID: PMC9293011 DOI: 10.7554/elife.77625] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Accepted: 05/29/2022] [Indexed: 11/16/2022] Open

See GM, Fix JS, Schwab CR, Spangler ML. Imputation of non-genotyped F1 dams to improve genetic gain in swine crossbreeding programs. J Anim Sci 2022;100:6572187. [PMID: 35451025 PMCID: PMC9126202 DOI: 10.1093/jas/skac148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 04/20/2022] [Indexed: 11/12/2022] Open

Abstract

This study investigated using imputed genotypes from non-genotyped animals which were not in the pedigree for the purpose of genetic selection and improving genetic gain for economically relevant traits. Simulations were used to mimic a 3-breed crossbreeding system that resembled a modern swine breeding scheme. The simulation consisted of three purebred (PB) breeds A, B, and C each with 25 and 425 mating males and females, respectively. Males from A and females from B were crossed to produce AB females (n = 1,000), which were crossed with males from C to produce crossbreds (CB; n = 10,000). The genome consisted of three chromosomes with 300 quantitative trait loci and ~9,000 markers. Lowly heritable reproductive traits were simulated for A, B, and AB (h2 = 0.2, 0.2, and 0.15, respectively), whereas a moderately heritable carcass trait was simulated for C (h2 = 0.4). Genetic correlations between reproductive traits in A, B, and AB were moderate (rg = 0.65). The goal trait of the breeding program was AB performance. Selection was practiced for four generations where AB and CB animals were first produced in generations 1 and 2, respectively. Non-genotyped AB dams were imputed using FImpute beginning in generation 2. Genotypes of PB and CB were used for imputation. Imputation strategies differed by three factors: 1) AB progeny genotyped per generation (2, 3, 4, or 6), 2) known or unknown mates of AB dams, and 3) genotyping rate of females from breeds A and B (0% or 100%). PB selection candidates from A and B were selected using estimated breeding values for AB performance, whereas candidates from C were selected by phenotype. Response to selection using imputed genotypes of non-genotyped animals was then compared to the scenarios where true AB genotypes (trueGeno) or no AB genotypes/phenotypes (noGeno) were used in genetic evaluations. The simulation was replicated 20 times. 
The average increase in genotype concordance between unknown and known sire imputation strategies was 0.22. Genotype concordance increased as the number of genotyped CB increased with little additional gain beyond 9 progeny. When mates of AB were known and more than 4 progeny were genotyped per generation, the phenotypic response in AB did not differ (P > 0.05) from trueGeno yet was greater (P < 0.05) than noGeno. Imputed genotypes of non-genotyped animals can be used to increase performance when 4 or more progeny are genotyped and sire pedigrees of CB animals are known.
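
The abstract above reports genotype concordance without defining it. As a rough illustration only, the sketch below assumes the common definition (the fraction of marker calls, coded as 0/1/2 allele dosages, on which the imputed genotype equals the true genotype). The function name and the toy genotype vectors are hypothetical and are not taken from the study or from FImpute.

```python
def genotype_concordance(true_genotypes, imputed_genotypes):
    """Return the fraction of markers where the imputed call equals the true call.

    Both inputs are sequences of allele dosages (0, 1, or 2), one per marker.
    This is an assumed definition, not necessarily the one used in the study.
    """
    if len(true_genotypes) != len(imputed_genotypes):
        raise ValueError("genotype vectors must have the same length")
    if len(true_genotypes) == 0:
        raise ValueError("genotype vectors must not be empty")
    matches = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return matches / len(true_genotypes)


# Toy data (hypothetical): one AB dam typed at eight markers.
true_dam = [0, 1, 2, 1, 0, 2, 1, 1]
imputed_known_sire = [0, 1, 2, 1, 0, 2, 1, 0]    # one mismatch
imputed_unknown_sire = [0, 2, 2, 0, 0, 1, 1, 0]  # four mismatches

concordance_known = genotype_concordance(true_dam, imputed_known_sire)
concordance_unknown = genotype_concordance(true_dam, imputed_unknown_sire)
print(concordance_known, concordance_unknown)
```

Comparing the two values for the same animal mirrors the kind of known-versus-unknown mate contrast described in the abstract, although the toy numbers carry no empirical meaning.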

Zhao R, Pei S, Yau SST. New Genome Sequence Detection via Natural Vector Convex Hull Method. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:1782-1793. [PMID: 33237867 DOI: 10.1109/tcbb.2020.3040706] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Baumdicker F, Bisschop G, Goldstein D, Gower G, Ragsdale AP, Tsambos G, Zhu S, Eldon B, Ellerman EC, Galloway JG, Gladstein AL, Gorjanc G, Guo B, Jeffery B, Kretzschumar WW, Lohse K, Matschiner M, Nelson D, Pope NS, Quinto-Cortés CD, Rodrigues MF, Saunack K, Sellinger T, Thornton K, van Kemenade H, Wohns AW, Wong Y, Gravel S, Kern AD, Koskela J, Ralph PL, Kelleher J. Efficient ancestry and mutation simulation with msprime 1.0. Genetics 2022;220:iyab229. [PMID: 34897427 PMCID: PMC9176297 DOI: 10.1093/genetics/iyab229] [Citation(s) in RCA: 104] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 12/03/2021] [Indexed: 11/13/2022] Open

Affiliation(s)

Franz Baumdicker Cluster of Excellence “Controlling Microbes to Fight Infections”, Mathematical and Computational Population Genetics, University of Tübingen, 72076 Tübingen, Germany
Gertjan Bisschop Institute of Evolutionary Biology, The University of Edinburgh, Edinburgh EH9 3FL, UK
Daniel Goldstein Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Graham Gower Lundbeck GeoGenetics Centre, Globe Institute, University of Copenhagen, 1350 Copenhagen K, Denmark
Aaron P Ragsdale Department of Integrative Biology, University of Wisconsin–Madison, Madison, WI 53706, USA
Georgia Tsambos Melbourne Integrative Genomics, School of Mathematics and Statistics, University of Melbourne, Parkville, VIC 3010, Australia
Sha Zhu Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
Bjarki Eldon Leibniz Institute for Evolution and Biodiversity Science, Museum für Naturkunde, Berlin 10115, Germany
E Castedo Ellerman Fresh Pond Research Institute, Cambridge, MA 02140, USA
Jared G Galloway Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA; Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
Ariella L Gladstein Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7264, USA; Embark Veterinary, Inc., Boston, MA 02111, USA
Gregor Gorjanc The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh EH25 9RG, UK
Bing Guo Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Ben Jeffery Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
Warren W Kretzschumar Center for Hematology and Regenerative Medicine, Karolinska Institute, 141 83 Huddinge, Sweden
Konrad Lohse Institute of Evolutionary Biology, The University of Edinburgh, Edinburgh EH9 3FL, UK
Michael Matschiner Natural History Museum, University of Oslo, 0318 Oslo, Norway
Dominic Nelson Department of Human Genetics, McGill University, Montréal, QC H3A 0C7, Canada
Nathaniel S Pope Department of Entomology, Pennsylvania State University, State College, PA 16802, USA
Consuelo D Quinto-Cortés National Laboratory of Genomics for Biodiversity (LANGEBIO), Unit of Advanced Genomics, CINVESTAV, Irapuato, Mexico
Murillo F Rodrigues Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA
Kumar Saunack IIT Bombay, Powai, Mumbai 400 076, India
Thibaut Sellinger Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, 85354 Freising, Germany
Kevin Thornton Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA
Hugo van Kemenade
Anthony W Wohns Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Yan Wong Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
Simon Gravel Department of Human Genetics, McGill University, Montréal, QC H3A 0C7, Canada
Andrew D Kern Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA
Jere Koskela Department of Statistics, University of Warwick, Coventry CV4 7AL, UK
Peter L Ralph Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA; Department of Mathematics, University of Oregon, Eugene, OR 97403-5289, USA
Jerome Kelleher Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK

Omer EA, Hinrichs D, Addo S, Roessler R. Development of a breeding program for improving the milk yield performance of Butana cattle under smallholder production conditions using a stochastic simulation approach. J Dairy Sci 2022;105:5261-5270. [DOI: 10.3168/jds.2021-21307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 01/20/2022] [Indexed: 11/19/2022]

Batista LG, Mello VH, Souza AP, Margarido GRA. Genomic prediction with allele dosage information in highly polyploid species. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022;135:723-739. [PMID: 34800132 DOI: 10.1007/s00122-021-03994-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 11/06/2021] [Indexed: 06/13/2023]

Covarrubias-Pazaran G, Gebeyehu Z, Gemenet D, Werner C, Labroo M, Sirak S, Coaldrake P, Rabbi I, Kayondo SI, Parkes E, Kanju E, Mbanjo EGN, Agbona A, Kulakow P, Quinn M, Debaene J. Breeding Schemes: What Are They, How to Formalize Them, and How to Improve Them? FRONTIERS IN PLANT SCIENCE 2022;12:791859. [PMID: 35126417 PMCID: PMC8813775 DOI: 10.3389/fpls.2021.791859] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Accepted: 12/10/2021] [Indexed: 05/26/2023]

Affiliation(s)

Giovanny Covarrubias-Pazaran Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico; Independent Researcher, Addis Ababa, Ethiopia
Zelalem Gebeyehu Independent Researcher, Addis Ababa, Ethiopia
Dorcus Gemenet Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico; International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
Christian Werner Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico; International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
Marlee Labroo Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico; International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
Solomon Sirak Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico
Peter Coaldrake Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico
Ismail Rabbi International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Siraj Ismail Kayondo International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Elizabeth Parkes International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Edward Kanju International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Edwige Gaby Nkouaya Mbanjo International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Afolabi Agbona International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Peter Kulakow International Institute for Tropical Agriculture (IITA), Ibadan, Nigeria
Michael Quinn Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico; International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
Jan Debaene Excellence in Breeding Platform, Consultative Group on International Agricultural Research, Texcoco, Mexico; International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico

Legarra A, Garcia-Baccino CA, Wientjes YCJ, Vitezica ZG. The correlation of substitution effects across populations and generations in the presence of nonadditive functional gene action. Genetics 2021;219:iyab138. [PMID: 34718531 PMCID: PMC8664574 DOI: 10.1093/genetics/iyab138] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 08/19/2021] [Indexed: 11/14/2022] Open

Lu CW, Yao CT, Hung CM. Domestication obscures genomic estimates of population history. Mol Ecol 2021;31:752-766. [PMID: 34779057 DOI: 10.1111/mec.16277] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 11/05/2021] [Accepted: 11/08/2021] [Indexed: 11/28/2022]

Powell O, Mrode R, Gaynor RC, Johnsson M, Gorjanc G, Hickey JM. Genomic evaluations using data recorded on smallholder dairy farms in low- to middle-income countries. JDS COMMUNICATIONS 2021;2:366-370. [PMID: 36337118 PMCID: PMC9623656 DOI: 10.3168/jdsc.2021-0092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 07/14/2021] [Indexed: 12/02/2022]

Abstract

• Genomic evaluations outperformed pedigree-based genetic evaluations.
• Shared haplotypes captured "hidden" genetic relationships to strengthen connectedness in genomic evaluations.
• Genomic evaluations were possible using LMIC smallholder records from herds with ≤4 cows.
• Modelling herd as a random effect produced EBVs with the highest accuracies.

Breeding has increased genetic gain for dairy cattle in advanced economies but has had limited success in improving dairy cattle in low- to middle-income countries (LMIC). Genetic evaluations are a central component of delivering genetic gain, because they separate the genetic and environmental effects of animals' phenotypes. Genetic evaluations have been successful in advanced economies because of large data sets and strong genetic connectedness, provided by the widespread use of artificial insemination (AI) and accurate recording of pedigree information. In smallholder dairy production systems of many LMICs, the limited use of AI and small herd sizes result in a data structure with insufficient genetic connectedness between herds to facilitate genetic evaluations based on pedigree. Genomic information keeps track of shared haplotypes rather than shared relatives captured by pedigree records. Therefore, genomic information could capture “hidden” genetic relationships that are not captured by pedigree information, strengthening genetic connectedness in LMIC smallholder dairy data sets. This study's objective was to use simulation to quantify the power of genomic information to enable genetic evaluation using LMIC smallholder dairy data sets. The results from this study show that (1) genetic evaluations using genomic information were more accurate than those using pedigree information in populations with a high effective population size and weak genetic connectedness; and (2) genetic evaluations modeling herd as a random effect had accuracy equal to or higher than those modeling herd as a fixed effect. This demonstrates the potential of genomic information to be an enabling technology in LMIC smallholder dairy production systems by facilitating genetic evaluations with in situ records collected from herds of ≤4 cows.
The establishment of routine genomic evaluations could allow the development of LMIC breeding programs comprising an informal set of nucleus animals distributed across many small herds within the target environment. These nucleus animals could be used for genetic evaluation, and the best animals could be disseminated to participating smallholder dairy farms. Together, this could increase the productivity, profitability, and sustainability of LMIC smallholder dairy production systems.

Rios EF, Andrade MHML, Resende MFR, Kirst M, de Resende MDV, de Almeida Filho JE, Gezan SA, Munoz P. Genomic prediction in family bulks using different traits and cross-validations in pine. G3-GENES GENOMES GENETICS 2021;11:6321952. [PMID: 34544139 PMCID: PMC8496210 DOI: 10.1093/g3journal/jkab249] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 07/02/2021] [Indexed: 11/13/2022]

da Silva ÉDB, Xavier A, Faria MV. Impact of Genomic Prediction Model, Selection Intensity, and Breeding Strategy on the Long-Term Genetic Gain and Genetic Erosion in Soybean Breeding. Front Genet 2021;12:637133. [PMID: 34539725 PMCID: PMC8440908 DOI: 10.3389/fgene.2021.637133] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Accepted: 08/05/2021] [Indexed: 11/21/2022] Open

Abstract

Genomic-assisted breeding has become an important tool in soybean breeding. However, the impact of different genomic selection (GS) approaches on short- and long-term gains is not well understood. Such gains are conditional on the breeding design and may vary with a combination of the prediction model, family size, selection strategies, and selection intensity. To address these open questions, we evaluated various scenarios through a simulated closed soybean breeding program over 200 breeding cycles. Genomic prediction was performed using genomic best linear unbiased prediction (GBLUP), Bayesian methods, and random forest, benchmarked against selection on phenotypic values, true breeding values (TBV), and random selection. Breeding strategies included selections within family (WF), across family (AF), and within pre-selected families (WPSF), with selection intensities of 2.5, 5.0, 7.5, and 10.0%. Selections were performed at the F4 generation, where individuals were phenotyped and genotyped with a 6K single nucleotide polymorphism (SNP) array. Initial genetic parameters for the simulation were estimated from the SoyNAM population. WF selections provided the most significant long-term genetic gains. GBLUP and Bayesian methods outperformed random forest and provided most of the genetic gains within the first 100 generations, being outperformed by phenotypic selection after generation 100. All methods provided similar performances under WPSF selections. A faster decay in genetic variance was observed when individuals were selected AF and WPSF, as 80% of the genetic variance was depleted within 28-58 cycles, whereas WF selections preserved the variance up to cycle 184. Surprisingly, the selection intensity had less impact on long-term gains than did the breeding strategies. The study supports that genetic gains can be optimized in the long term with specific combinations of prediction models, family size, selection strategies, and selection intensity. 
A combination of strategies may be necessary for balancing the short-, medium-, and long-term genetic gains in breeding programs while preserving the genetic variance.
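
The contrast between across-family (AF) and within-family (WF) selection in the abstract above can be illustrated with a minimal truncation-selection sketch. This is not the authors' simulation: the function names, the tuple layout `(family, individual, value)`, and the toy values are all assumptions made for illustration, and selection intensity is treated simply as the fraction of candidates retained.

```python
import math


def select_across_family(candidates, intensity):
    """Keep the top `intensity` fraction of all candidates, ignoring family.

    `candidates` is a list of (family, individual, value) tuples.
    At least one candidate is always kept.
    """
    n_keep = max(1, math.ceil(intensity * len(candidates)))
    ranked = sorted(candidates, key=lambda c: c[2], reverse=True)
    return ranked[:n_keep]


def select_within_family(candidates, intensity):
    """Apply the same truncation separately inside each family."""
    families = {}
    for candidate in candidates:
        families.setdefault(candidate[0], []).append(candidate)
    selected = []
    for members in families.values():
        selected.extend(select_across_family(members, intensity))
    return selected


# Toy data (hypothetical): family A outperforms family B on every individual.
candidates = [
    ("A", "a1", 10.0), ("A", "a2", 9.0), ("A", "a3", 8.0), ("A", "a4", 7.0),
    ("B", "b1", 6.0), ("B", "b2", 5.0), ("B", "b3", 4.0), ("B", "b4", 3.0),
]

af_selected = select_across_family(candidates, 0.25)
wf_selected = select_within_family(candidates, 0.25)
print(sorted({c[0] for c in af_selected}), sorted({c[0] for c in wf_selected}))
```

In this toy case AF selection retains only family A, whereas WF selection retains one individual from each family. That is one intuitive way to see how WF selection could preserve more genetic variance over cycles, as the abstract reports, though the sketch itself demonstrates nothing about long-term gain.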

Evidence for opposing selective forces operating on human-specific duplicated TCAF genes in Neanderthals and humans. Nat Commun 2021;12:5118. [PMID: 34433829 PMCID: PMC8387397 DOI: 10.1038/s41467-021-25435-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 08/04/2021] [Indexed: 11/30/2022] Open

Vargas Jurado N, Kuehn LA, Keele JW, Lewis RM. Accuracy of GEBV of sires based on pooled allele frequency of their progeny. G3-GENES GENOMES GENETICS 2021;11:6321233. [PMID: 34510188 DOI: 10.1093/g3journal/jkab231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 06/17/2021] [Indexed: 11/12/2022]

Hasan AR, Ness RW. Recombination Rate Variation and Infrequent Sex Influence Genetic Diversity in Chlamydomonas reinhardtii. Genome Biol Evol 2020;12:370-380. [PMID: 32181819 PMCID: PMC7186780 DOI: 10.1093/gbe/evaa057] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2020] [Indexed: 12/12/2022] Open

Gaynor RC, Gorjanc G, Hickey JM. AlphaSimR: an R package for breeding program simulations. G3-GENES GENOMES GENETICS 2021;11:6025179. [PMID: 33704430 PMCID: PMC8022926 DOI: 10.1093/g3journal/jkaa017] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 11/05/2020] [Indexed: 01/03/2023]

Rowan TN, Durbin HJ, Seabury CM, Schnabel RD, Decker JE. Powerful detection of polygenic selection and evidence of environmental adaptation in US beef cattle. PLoS Genet 2021;17:e1009652. [PMID: 34292938 PMCID: PMC8297814 DOI: 10.1371/journal.pgen.1009652] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 06/09/2021] [Indexed: 12/19/2022] Open

Gonen S, Wimmer V, Gaynor RC, Byrne E, Gorjanc G, Hickey JM. Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations. CROP SCIENCE 2021;61:2243-2253. [PMID: 34413534 PMCID: PMC8362159 DOI: 10.1002/csc2.20409] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 11/07/2020] [Indexed: 06/13/2023]

Johnsson M, Whalen A, Ros-Freixedes R, Gorjanc G, Chen CY, Herring WO, de Koning DJ, Hickey JM. Genetic variation in recombination rate in the pig. Genet Sel Evol 2021;53:54. [PMID: 34171988 PMCID: PMC8235837 DOI: 10.1186/s12711-021-00643-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 06/02/2021] [Indexed: 11/10/2022] Open

Korgaonkar A, Han C, Lemire AL, Siwanowicz I, Bennouna D, Kopec RE, Andolfatto P, Shigenobu S, Stern DL. A novel family of secreted insect proteins linked to plant gall development. Curr Biol 2021;31:1836-1849.e12. [PMID: 33657407 PMCID: PMC8119383 DOI: 10.1016/j.cub.2021.01.104] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 12/23/2020] [Accepted: 01/28/2021] [Indexed: 12/17/2022]

Korgaonkar A, Han C, Lemire AL, Siwanowicz I, Bennouna D, Kopec RE, Andolfatto P, Shigenobu S, Stern DL. A novel family of secreted insect proteins linked to plant gall development. Curr Biol 2021. [PMID: 33974861 DOI: 10.1101/2020.10.28.359562] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]

Long-term comparison between index selection and optimal independent culling in plant breeding programs with genomic prediction. PLoS One 2021;16:e0235554. [PMID: 33970915 PMCID: PMC8109766 DOI: 10.1371/journal.pone.0235554] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 01/20/2021] [Indexed: 11/19/2022] Open

Svedberg J, Shchur V, Reinman S, Nielsen R, Corbett-Detig R. Inferring Adaptive Introgression Using Hidden Markov Models. Mol Biol Evol 2021;38:2152-2165. [PMID: 33502512 PMCID: PMC8097282 DOI: 10.1093/molbev/msab014] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open