1
|
Genetic diversity and genome-wide association study of 13 agronomic traits in 977 Beta vulgaris L. germplasms. BMC Genomics 2023; 24:413. [PMID: 37488485 PMCID: PMC10364417 DOI: 10.1186/s12864-023-09522-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 07/17/2023] [Indexed: 07/26/2023] Open
Abstract
BACKGROUND Sugar beet (Beta vulgaris L.) is an economically essential sugar crop worldwide. Its agronomic traits are highly diverse and phenotypically plastic, influencing taproot yield and quality. The National Beet Medium-term Gene Bank in China maintains more than 1700 beet germplasms with diverse countries of origin. However, it lacks detailed genetic background associated with morphological variability and diversity. RESULTS Here, a comprehensive genome-wide association study (GWAS) of 13 agronomic traits was conducted in a panel of 977 sugar beet accessions. Almost all phenotypic traits exhibited wide genetic diversity and high coefficient of variation (CV). A total of 170,750 high-quality single-nucleotide polymorphisms (SNPs) were obtained using the genotyping-by-sequencing (GBS). Neighbour-joining phylogenetic analysis, principal component analysis, population structure and kinship showed no obvious relationships among these genotypes based on subgroups or regional sources. GWAS was carried out using a mixed linear model, and 159 significant associations were detected for these traits. Within the 25 kb linkage disequilibrium decay of the associated markers, NRT1/PTR FAMILY 6.3 (BVRB_5g097760); nudix hydrolase 15 (BVRB_8g182070) and TRANSPORT INHIBITOR RESPONSE 1 (BVRB_8g181550); transcription factor MYB77 (BVRB_2g023500); and ethylene-responsive transcription factor ERF014 (BVRB_1g000090) were predicted to be strongly associated with the taproot traits of root groove depth (RGD); root shape (RS); crown size (CS); and flesh colour (FC), respectively. For the aboveground traits, UDP-glycosyltransferase 79B6 (BVRB_9g223780) and NAC domain-containing protein 7 (BVRB_5g097990); F-box protein At1g10780 (BVRB_6g140760); phosphate transporter PHO1 (BVRB_3g048660); F-box protein CPR1 (BVRB_8g181140); and transcription factor MYB77 (BVRB_2g023500) and alcohol acyltransferase 9 (BVRB_2g023460) might be associated with the hypocotyl colour (HC); plant type (PT); petiole length (PL); cotyledon size (C); and fascicled leaf type (FLT) of sugar beet, respectively. AP-2 complex subunit mu (BVRB_5g106130), trihelix transcription factor ASIL2 (BVRB_2g041790) and late embryogenesis abundant protein 18 (BVRB_5g106150) might be involved in pollen quantity (PQ) variation. The candidate genes extensively participated in hormone response, nitrogen and phosphorus transportation, secondary metabolism, fertilization and embryo maturation. CONCLUSIONS The genetic basis of agronomical traits is complicated in heterozygous diploid sugar beet. The putative valuable genes found in this study will help further elucidate the molecular mechanism of each phenotypic trait for beet breeding.
Collapse
|
2
|
Identification of qBK2.1, a novel QTL controlling rice resistance against Fusarium fujikuroi. BOTANICAL STUDIES 2023; 64:11. [PMID: 37079162 PMCID: PMC10119339 DOI: 10.1186/s40529-023-00375-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 04/09/2023] [Indexed: 05/03/2023]
Abstract
BACKGROUND Bakanae disease caused by Fusarium fujikuroi is an increasing threat to rice production. The infected plants show symptoms such as elongation, slenderness, chlorosis, a large leaf angle, and even death. Bakanae disease is traditionally managed by seed treatment. However, fungicide-resistant F. fujikuroi isolates have emerged in several Asian areas, including Taiwan. This study aimed to identify new bakanae resistance quantitative trait loci (QTLs) and provide molecular markers to assist future breeding. RESULTS A population of F2:9 recombinant inbred lines (RILs) was derived from the cross between an elite japonica Taiwanese cultivar 'Taikeng 16 (TK16)' and an indica variety 'Budda'. 'Budda' was found highly resistant to all 24 representative isolates of the F. fujikuroi population in Taiwan. For the RIL population, 6,492 polymorphic single nucleotide polymorphisms (SNPs) spanning the rice genome were obtained by genotyping-by-sequencing (GBS) technique, and the disease severity index (DSI) was evaluated by inoculation with a highly virulent F. fujikuroi isolate Ff266. Trait-marker association analysis of 166 RILs identified two QTLs in 'Budda'. qBK2.1 (21.97-30.15 Mb) is a novel and first bakanae resistance QTL identified on chromosome 2. qBK1.8 (5.24-8.66 Mb) partially overlaps with the previously reported qBK1.3 (4.65-8.41 Mb) on chromosome 1. The log of odds (LOD) scores of qBK1.8 and qBK2.1 were 4.75 and 6.13, accounting for 4.9% and 8.1% of the total phenotypic variation, respectively. 64 RILs carrying both qBK1.8 and qBK2.1 showed lower DSI (7%) than the lines carrying only qBK1.8 (15%), only qBK2.1 (13%), or none of the two QTLs (21%). For the future application of identified QTLs, 11 KBioscience competitive allele-specific PCR (KASP) markers and 3 insertion-deletion (InDel) markers were developed. CONCLUSIONS Compared to other important rice diseases, knowledge of bakanae resistance has been insufficient, which limited the development and deployment of resistant cultivars. The discovery of qBK2.1 has provided a new source of bakanae resistance. The resistant RILs inheriting good plant type, good taste, and high yield characteristics from 'TK16' can be used as good resistance donors. Our newly developed markers targeting qBK2.1 and qBK1.8 can also serve as an important basis for future fine-mapping and resistance breeding.
Collapse
|
3
|
Genome-wide association study of leaf-related traits in tea plant in Guizhou based on genotyping-by-sequencing. BMC PLANT BIOLOGY 2023; 23:196. [PMID: 37046207 PMCID: PMC10091845 DOI: 10.1186/s12870-023-04192-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Accepted: 03/24/2023] [Indexed: 06/19/2023]
Abstract
BACKGROUND Studying the genetic characteristics of tea plant (Camellia spp.) leaf traits is essential for improving yield and quality through breeding and selection. Guizhou Plateau, an important part of the original center of tea plants, has rich genetic resources. However, few studies have explored the associations between tea plant leaf traits and single nucleotide polymorphism (SNP) markers in Guizhou. RESULTS In this study, we used the genotyping-by-sequencing (GBS) method to identify 100,829 SNP markers from 338 accessions of tea germplasm in Guizhou Plateau, a region with rich genetic resources. We assessed population structure based on high-quality SNPs, constructed phylogenetic relationships, and performed genome-wide association studies (GWASs). Four inferred pure groups (G-I, G-II, G-III, and G-IV) and one inferred admixture group (G-V), were identified by a population structure analysis, and verified by principal component analyses and phylogenetic analyses. Through GWAS, we identified six candidate genes associated with four leaf traits, including mature leaf size, texture, color and shape. Specifically, two candidate genes, located on chromosomes 1 and 9, were significantly associated with mature leaf size, while two genes, located on chromosomes 8 and 11, were significantly associated with mature leaf texture. Additionally, two candidate genes, located on chromosomes 1 and 2 were identified as being associated with mature leaf color and mature leaf shape, respectively. We verified the expression level of two candidate genes was verified using reverse transcription quantitative polymerase chain reaction (RT-qPCR) and designed a derived cleaved amplified polymorphism (dCAPS) marker that co-segregated with mature leaf size, which could be used for marker-assisted selection (MAS) breeding in Camellia sinensis. CONCLUSIONS In the present study, by using GWAS approaches with the 338 tea accessions population in Guizhou, we revealed a list of SNPs markers and candidate genes that were significantly associated with four leaf traits. This work provides theoretical and practical basis for the genetic breeding of related traits in tea plant leaves.
Collapse
|
4
|
Genome-based high-resolution mapping of fusarium wilt resistance in sweet basil. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2022; 321:111316. [PMID: 35696916 DOI: 10.1016/j.plantsci.2022.111316] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 04/05/2022] [Accepted: 05/07/2022] [Indexed: 06/15/2023]
Abstract
Fusarium wilt of basil is a disease of sweet basil (Ocimum basilicum L.) plants caused by the fungus Fusarium oxysporum f. sp. basilici (FOB). Although resistant cultivars were released > 20 years ago, the underlying mechanism and the genes controlling the resistance remain unknown. We used genetic mapping to elucidate FOB resistance in an F2 population derived from a cross between resistant and susceptible cultivars. We performed genotyping by sequencing of 173 offspring and aligning the data to the sweet basil reference genome. In total, 23,411 polymorphic sites were detected, and a single quantitative trait locus (QTL) for FOB resistance was found. The confidence interval was < 600 kbp, harboring only 60 genes, including a cluster of putative disease-resistance genes. Based on homology to a fusarium resistance protein from wild tomato, we also investigated a candidate resistance gene that encodes a transmembrane leucine-rich repeat - receptor-like kinase - ubiquitin-like protease (LRR-RLK-ULP). Sequence analysis of that gene in the susceptible parent vs. the resistant parent revealed multiple indels, including an insertion of 20 amino acids next to the transmembrane domain, which might alter its functionality. Our findings suggest that this LRR-RLK-ULP might be responsible for FOB resistance in sweet basil and demonstrate the usefulness of the recently sequenced basil genome for QTL mapping and gene mining.
Collapse
|
5
|
Perspectives and recent progress of genome-wide association studies (GWAS) in fruits. Mol Biol Rep 2022; 49:5341-5352. [PMID: 35064403 DOI: 10.1007/s11033-021-07055-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 12/06/2021] [Indexed: 11/25/2022]
Abstract
BACKGROUND Earlier next-generation sequencing technologies are being vastly used to explore, administer, and investigate the gene space with accurate profiling of nucleotide variations in the germplasm. OVERVIEW AND PROGRESS: Recently, novel advancements in high-throughput sequencing technologies allow a genotyping-by-sequencing approach that has opened up new horizons for extensive genotyping exploiting single-nucleotide-polymorphisms (SNPs). This method acts as a bridge to support and minimize a genotype to phenotype gap allowing genetic selection at the genome-wide level, named genomic selection that could facilitate the selection of traits also in the pomology sector. In addition to this, genome-wide genotyping is a prerequisite for genome-wide association studies that have been used successfully to discover the genes, which control polygenic traits including the genetic loci, associated with the trait of interest in fruit crops. AIMS AND PROSPECTS This review article emphasizes the role of genome-wide approaches to unlock and explore the genetic potential along with the detection of SNPs affecting the phenotype of fruit crops and highlights the prospects of genome-wide association studies in fruits.
Collapse
|
6
|
Elucidating SNP-based genetic diversity and population structure of advanced breeding lines of bread wheat ( Triticum aestivum L .). PeerJ 2021; 9:e11593. [PMID: 34221720 PMCID: PMC8231316 DOI: 10.7717/peerj.11593] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 05/20/2021] [Indexed: 11/20/2022] Open
Abstract
Genetic diversity and population structure information are crucial for enhancing traits of interest and the development of superlative varieties for commercialization. The present study elucidated the population structure and genetic diversity of 141 advanced wheat breeding lines using single nucleotide polymorphism markers. A total of 14,563 high-quality identified genotyping-by-sequencing (GBS) markers were distributed covering 13.9 GB wheat genome, with a minimum of 1,026 SNPs on the homoeologous group four and a maximum of 2,838 SNPs on group seven. The average minor allele frequency was found 0.233, although the average polymorphism information content (PIC) and heterozygosity were 0.201 and 0.015, respectively. Principal component analyses (PCA) and population structure identified two major groups (sub-populations) based on SNPs information. The results indicated a substantial gene flow/exchange with many migrants (Nm = 86.428) and a considerable genetic diversity (number of different alleles, Na = 1.977; the number of effective alleles, Ne = 1.519; and Shannon's information index, I = 0.477) within the population, illustrating a good source for wheat improvement. The average PIC of 0.201 demonstrates moderate genetic diversity of the present evaluated advanced breeding panel. Analysis of molecular variance (AMOVA) detected 1% and 99% variance between and within subgroups. It is indicative of excessive gene traffic (less genetic differentiation) among the populations. These conclusions deliver important information with the potential to contribute new beneficial alleles using genome-wide association studies (GWAS) and marker-assisted selection to enhance genetic gain in South Asian wheat breeding programs.
Collapse
|
7
|
Natural variation of the Dt2 promoter controls plant height and node number in semi-determinant soybean. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2021; 41:40. [PMID: 37309444 PMCID: PMC10236065 DOI: 10.1007/s11032-021-01235-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 05/23/2021] [Indexed: 06/14/2023]
Abstract
Soybean (Glycine max (L.) Merrill) is an important legume crop worldwide. Plant height (PH) is a quantitative trait that is closely related to node number (NN) and internode length (IL) on the main stem, which together affect soybean yield. To identify candidate genes controlling these three traits in soybean, we examined a recombinant inbred line (RIL) population derived from a cross between two soybean varieties with semi-determinate stems (Dt1Dt1Dt2Dt2), JKK378 and HXW. A quantitative trait locus (QTL) named qPH18 was identified that simultaneously controls PH, NN, and IL; this region harbors the semi-determinant gene Dt2. Sequencing of the Dt2 promoter from JKK378 identified three polymorphisms relative to HXW, including two single nucleotide polymorphism (SNPs) and an 18-bp insertion/deletion polymorphism (Indel). Dt2 expression was lower in the qPH18JKK378 group than in the qPH18HXW group, whereas the expression level of the downstream gene Dt1 showed the opposite tendency. A transient transfection assay confirmed that Dt2 promoter activity is lower in JKK378 compared to HXW. We propose that the polymorphisms in the dominant Dt2 promoter underlie the differences in Dt2 expression and its downstream gene Dt1 in the two parents, thereby affecting PH, NN, IL, and grain weight per plant without altering stem growth habit. Compared to the PH18HXW allele, the qPH18JKK378 allele suppresses Dt2 expression, which releases the inhibition of Dt1 expression, thus enhancing NN and grain yield. Our findings shed light on the mechanism underlying NN and PH in soybean and provide a molecular marker to facilitate breeding. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-021-01235-y.
Collapse
|
8
|
Genetic mapping for agronomic traits in a MAGIC population of common bean (Phaseolus vulgaris L.) under drought conditions. BMC Genomics 2020; 21:799. [PMID: 33198642 PMCID: PMC7670608 DOI: 10.1186/s12864-020-07213-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 11/05/2020] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Common bean is an important staple crop in the tropics of Africa, Asia and the Americas. Particularly smallholder farmers rely on bean as a source for calories, protein and micronutrients. Drought is a major production constraint for common bean, a situation that will be aggravated with current climate change scenarios. In this context, new tools designed to understand the genetic basis governing the phenotypic responses to abiotic stress are required to improve transfer of desirable traits into cultivated beans. RESULTS A multiparent advanced generation intercross (MAGIC) population of common bean was generated from eight Mesoamerican breeding lines representing the phenotypic and genotypic diversity of the CIAT Mesoamerican breeding program. This population was assessed under drought conditions in two field trials for yield, 100 seed weight, iron and zinc accumulation, phenology and pod harvest index. Transgressive segregation was observed for most of these traits. Yield was positively correlated with yield components and pod harvest index (PHI), and negative correlations were found with phenology traits and micromineral contents. Founder haplotypes in the population were identified using Genotyping by Sequencing (GBS). No major population structure was observed in the population. Whole Genome Sequencing (WGS) data from the founder lines was used to impute genotyping data for GWAS. Genetic mapping was carried out with two methods, using association mapping with GWAS, and linkage mapping with haplotype-based interval screening. Thirteen high confidence QTL were identified using both methods and several QTL hotspots were found controlling multiple traits. A major QTL hotspot located on chromosome Pv01 for phenology traits and yield was identified. Further hotspots affecting several traits were observed on chromosomes Pv03 and Pv08. A major QTL for seed Fe content was contributed by MIB778, the founder line with highest micromineral accumulation. Based on imputed WGS data, candidate genes are reported for the identified major QTL, and sequence changes were identified that could cause the phenotypic variation. CONCLUSIONS This work demonstrates the importance of this common bean MAGIC population for genetic mapping of agronomic traits, to identify trait associations for molecular breeding tool design and as a new genetic resource for the bean research community.
Collapse
|
9
|
Identification of resistance loci in Chinese and Canadian canola/rapeseed varieties against Leptosphaeria maculans based on genome-wide association studies. BMC Genomics 2020; 21:501. [PMID: 32693834 PMCID: PMC7372758 DOI: 10.1186/s12864-020-06893-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 07/07/2020] [Indexed: 01/08/2023] Open
Abstract
Background The fungal pathogen Leptosphaeria maculans (Lm). causes blackleg disease on canola/rapeseed in many parts of the world. It is important to use resistant cultivars to manage the disease and minimize yield losses. In this study, twenty-two Lm isolates were used to identify resistance genes in a collection of 243 canola/rapeseed (Brassica napus L.) accessions from Canada and China. These Lm isolates carry different compliments of avirulence genes, and the investigation was based on a genome-wide association study (GWAS) and genotype-by-sequencing (GBS). Results Using the CROP-SNP pipeline, a total of 81,471 variants, including 78,632 SNPs and 2839 InDels, were identified. The GWAS was performed using TASSEL 5.0 with GLM + Q model. Thirty-two and 13 SNPs were identified from the Canadian and Chinese accessions, respectively, tightly associated with blackleg resistance with P values < 1 × 10− 4. These SNP loci were distributed on chromosomes A03, A05, A08, A09, C01, C04, C05, and C07, with the majority of them on A08 followed by A09 and A03. The significant SNPs identified on A08 were all located in a 2010-kb region and associated with resistance to 12 of the 22 Lm isolates. Furthermore, 25 resistance gene analogues (RGAs) were identified in these regions, including two nucleotide binding site (NBS) domain proteins, fourteen RLKs, three RLPs and six TM-CCs. These RGAs can be the potential candidate genes for blackleg resistance. Conclusion This study provides insights into potentially new genomic regions for discovery of additional blackleg resistance genes. The identified regions associated with blackleg resistance in the germplasm collection may also contribute directly to the development of canola varieties with novel resistance genes against blackleg of canola.
Collapse
|
10
|
Identification of a polymorphism within the Rosa multiflora muRdr1A gene linked to resistance to multiple races of Diplocarpon rosae W. in tetraploid garden roses (Rosa × hybrida). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:103-117. [PMID: 31563968 DOI: 10.1007/s00122-019-03443-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Accepted: 09/18/2019] [Indexed: 06/10/2023]
Abstract
A QTL for resistance to several races of black spot co-located with the known Rrd1 locus in Rosa. A polymorphism in muRdr1A linked to black spot resistance was identified and molecular markers were designed. Black spot, caused by Diplocarpon rosae, is one of the most serious foliar diseases of landscape roses that reduces the marketability and weakens the plants against winter survival. Genetic resistance to black spot (BS) exists and race-specific resistance is a good target to implement marker-assisted selection. High-density single nucleotide polymorphism-based genetic maps were created for the female parent of a tetraploid cross between 'CA60' and 'Singing in the Rain' using genotyping-by-sequencing following a two-way pseudo-testcross strategy. The female linkage map was generated based on 227 individuals and included 31 linkage groups, 1055 markers, with a length of 1980 cM. Race-specific resistance to four D. rosae races (5, 7, 10, 14) was evaluated using a detached leaf assay. BS resistance was also evaluated under natural infection in the field. Resistance to races 5, 10 and 14 of D. rosae and field resistance co-located on chromosome 1. A unique sequence of 32 bp in exon 4 of the muRdr1A gene was identified in 'CA60' that co-segregates with D. rosae resistance. Two diagnostic markers, a presence/absence marker and an INDEL marker, specific to this sequence were designed and validated in the mapping population and a backcross population derived from 'CA60.' Resistance to D. rosae race 7 mapped to a different location on chromosome 1.
Collapse
|
11
|
Genomic Selection with Allele Dosage in Panicum maximum Jacq. G3 (BETHESDA, MD.) 2019; 9:2463-2475. [PMID: 31171567 PMCID: PMC6686918 DOI: 10.1534/g3.118.200986] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 05/23/2019] [Indexed: 12/21/2022]
Abstract
Genomic selection is an efficient approach to get shorter breeding cycles in recurrent selection programs and greater genetic gains with selection of superior individuals. Despite advances in genotyping techniques, genetic studies for polyploid species have been limited to a rough approximation of studies in diploid species. The major challenge is to distinguish the different types of heterozygotes present in polyploid populations. In this work, we evaluated different genomic prediction models applied to a recurrent selection population of 530 genotypes of Panicum maximum, an autotetraploid forage grass. We also investigated the effect of the allele dosage in the prediction, i.e., considering tetraploid (GS-TD) or diploid (GS-DD) allele dosage. A longitudinal linear mixed model was fitted for each one of the six phenotypic traits, considering different covariance matrices for genetic and residual effects. A total of 41,424 genotyping-by-sequencing markers were obtained using 96-plex and Pst1 restriction enzyme, and quantitative genotype calling was performed. Six predictive models were generalized to tetraploid species and predictive ability was estimated by a replicated fivefold cross-validation process. GS-TD and GS-DD models were performed considering 1,223 informative markers. Overall, GS-TD data yielded higher predictive abilities than with GS-DD data. However, different predictive models had similar predictive ability performance. In this work, we provide bioinformatic and modeling guidelines to consider tetraploid dosage and observed that genomic selection may lead to additional gains in recurrent selection program of P. maximum.
Collapse
|
12
|
The first genetic linkage map for Fraxinus pennsylvanica and syntenic relationships with four related species. PLANT MOLECULAR BIOLOGY 2019; 99:251-264. [PMID: 30604323 DOI: 10.1007/s11103-018-0815-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Accepted: 12/15/2018] [Indexed: 06/09/2023]
Abstract
The genetic linkage map for green ash (Fraxinus pennsylvanica) contains 1201 DNA markers in 23 linkage groups spanning 2008.87cM. The green ash map shows stronger synteny with coffee than tomato. Green ash (Fraxinus pennsylvanica) is an outcrossing, diploid (2n = 46) hardwood tree species, native to North America. Native ash species in North America are being threatened by the rapid spread of the emerald ash borer (EAB, Agrilus planipennis), an invasive pest from Asia. Green ash, the most widely distributed ash species, is severely affected by EAB infestation, yet few genomic resources for genetic studies and improvement of green ash are available. In this study, a total of 5712 high quality single nucleotide polymorphisms (SNPs) were discovered using a minimum allele frequency of 1% across the entire genome through genotyping-by-sequencing. We also screened hundreds of genomic- and EST-based microsatellite markers (SSRs) from previous de novo assemblies (Staton et al., PLoS ONE 10:e0145031, 2015; Lane et al., BMC Genom 17:702, 2016). A first genetic linkage map of green ash was constructed from 90 individuals in a full-sib family, combining 2719 SNP and 84 SSR segregating markers among the parental maps. The consensus SNP and SSR map contains a total of 1201 markers in 23 linkage groups spanning 2008.87 cM, at an average inter-marker distance of 1.67 cM with a minimum logarithm of odds of 6 and maximum recombination fraction of 0.40. Comparisons of the organization the green ash map with the genomes of asterid species coffee and tomato, and genomes of the rosid species poplar and peach, showed areas of conserved gene order, with overall synteny strongest with coffee.
Collapse
|
13
|
A high-density genetic map of extra-long staple cotton (Gossypium barbadense) constructed using genotyping-by-sequencing based single nucleotide polymorphic markers and identification of fiber traits-related QTL in a recombinant inbred line population. BMC Genomics 2018; 19:489. [PMID: 29940861 PMCID: PMC6019718 DOI: 10.1186/s12864-018-4890-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 06/19/2018] [Indexed: 01/08/2023] Open
Abstract
Background Gossypium barbadense (Sea Island, Egyptian or Pima cotton) cotton has high fiber quality, however, few studies have investigated the genetic basis of its traits using molecular markers. Genome complexity reduction approaches such as genotyping-by-sequencing have been utilized to develop abundant markers for the construction of high-density genetic maps to locate quantitative trait loci (QTLs). Results The Chinese G. barbadense cultivar 5917 and American Pima S-7 were used to develop a recombinant inbred line (RIL) population with 143 lines. The 143 RILs together with their parents were tested in three replicated field tests for lint yield traits (boll weight and lint percentage) and fiber quality traits (fiber length, fiber elongation, fiber strength, fiber uniformity and micronaire) and then genotyped using GBS to develop single-nucleotide polymorphism (SNP) markers. A high-density genetic map with 26 linkage groups (LGs) was constructed using 3557 GBS SNPs spanning a total genetic distance of 3076.23 cM at an average density of 1.09 cM between adjacent markers. A total of 42 QTLs were identified, including 24 QTLs on 12 LGs for fiber quality and 18 QTLs on 7 LGs for lint yield traits, with LG1 (9 QTLs), LG10 (7 QTLs) and LG14 (6 QTLs) carrying more QTLs. Common QTLs for the same traits and overlapping QTLs for different traits were detected. Each individual QTLs explained 0.97 to 20.7% of the phenotypic variation. Conclusions This study represents one of the first genetic mapping studies on the fiber quality and lint yield traits in a RIL population of G. barbadense using GBS-SNPs. The results provide important information for the subsequent fine mapping of QTLs and the prediction of candidate genes towards map-based cloning and marker-assisted selection in cotton.
Collapse
|
14
|
UGbS-Flex, a novel bioinformatics pipeline for imputation-free SNP discovery in polyploids without a reference genome: finger millet as a case study. BMC PLANT BIOLOGY 2018; 18:117. [PMID: 29902967 PMCID: PMC6003085 DOI: 10.1186/s12870-018-1316-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Accepted: 05/23/2018] [Indexed: 05/28/2023]
Abstract
BACKGROUND Research on orphan crops is often hindered by a lack of genomic resources. With the advent of affordable sequencing technologies, genotyping an entire genome or, for large-genome species, a representative fraction of the genome has become feasible for any crop. Nevertheless, most genotyping-by-sequencing (GBS) methods are geared towards obtaining large numbers of markers at low sequence depth, which excludes their application in heterozygous individuals. Furthermore, bioinformatics pipelines often lack the flexibility to deal with paired-end reads or to be applied in polyploid species. RESULTS UGbS-Flex combines publicly available software with in-house python and perl scripts to efficiently call SNPs from genotyping-by-sequencing reads irrespective of the species' ploidy level, breeding system and availability of a reference genome. Noteworthy features of the UGbS-Flex pipeline are an ability to use paired-end reads as input, an effective approach to cluster reads across samples with enhanced outputs, and maximization of SNP calling. We demonstrate use of the pipeline for the identification of several thousand high-confidence SNPs with high representation across samples in an F3-derived F2 population in the allotetraploid finger millet. Robust high-density genetic maps were constructed using the time-tested mapping program MAPMAKER which we upgraded to run efficiently and in a semi-automated manner in a Windows Command Prompt Environment. We exploited comparative GBS with one of the diploid ancestors of finger millet to assign linkage groups to subgenomes and demonstrate the presence of chromosomal rearrangements. CONCLUSIONS The paper combines GBS protocol modifications, a novel flexible GBS analysis pipeline, UGbS-Flex, recommendations to maximize SNP identification, updated genetic mapping software, and the first high-density maps of finger millet. The modules used in the UGbS-Flex pipeline and for genetic mapping were applied to finger millet, an allotetraploid selfing species without a reference genome, as a case study. The UGbS-Flex modules, which can be run independently, are easily transferable to species with other breeding systems or ploidy levels.
Collapse
|
15
|
Genome-wide association mapping of virulence gene in rice blast fungus Magnaporthe oryzae using a genotyping by sequencing approach. Genomics 2018; 111:661-668. [PMID: 29775784 DOI: 10.1016/j.ygeno.2018.05.011] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Revised: 05/04/2018] [Accepted: 05/11/2018] [Indexed: 01/22/2023]
Abstract
Magnaporthe oryzae is a fungal pathogen causing blast disease in many plant species. In this study, seventy three isolates of M. oryzae collected from rice (Oryza sativa) in 1996-2014 were genotyped using a genotyping-by-sequencing approach to detect genetic variation. An association study was performed to identify single nucleotide polymorphisms (SNPs) associated with virulence genes using 831 selected SNP and infection phenotypes on local and improved rice varieties. Population structure analysis revealed eight subpopulations. The division into eight groups was not related to the degree of virulence. Association mapping showed five SNPs associated with fungal virulence on chromosome 1, 2, 3, 4 and 7. The SNP on chromosome 1 was associated with virulence against RD6-Pi7 and IRBL7-M which might be linked to the previously reported AvrPi7.
Collapse
|
16
|
Crop Breeding Chips and Genotyping Platforms: Progress, Challenges, and Perspectives. MOLECULAR PLANT 2017; 10:1047-1064. [PMID: 28669791 DOI: 10.1016/j.molp.2017.06.008] [Citation(s) in RCA: 219] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Revised: 05/29/2017] [Accepted: 06/19/2017] [Indexed: 05/18/2023]
Abstract
There is a rapidly rising trend in the development and application of molecular marker assays for gene mapping and discovery in field crops and trees. Thus far, more than 50 SNP arrays and 15 different types of genotyping-by-sequencing (GBS) platforms have been developed in over 25 crop species and perennial trees. However, much less effort has been made on developing ultra-high-throughput and cost-effective genotyping platforms for applied breeding programs. In this review, we discuss the scientific bottlenecks in existing SNP arrays and GBS technologies and the strategies to develop targeted platforms for crop molecular breeding. We propose that future practical breeding platforms should adopt automated genotyping technologies, either array or sequencing based, target functional polymorphisms underpinning economic traits, and provide desirable prediction accuracy for quantitative traits, with universal applications under wide genetic backgrounds in crops. The development of such platforms faces serious challenges at both the technological level due to cost ineffectiveness, and the knowledge level due to large genotype-phenotype gaps in crop plants. It is expected that such genotyping platforms will be achieved in the next ten years in major crops in consideration of (a) rapid development in gene discovery of important traits, (b) deepened understanding of quantitative traits through new analytical models and population designs, (c) integration of multi-layer -omics data leading to identification of genes and pathways responsible for important breeding traits, and (d) improvement in cost effectiveness of large-scale genotyping. Crop breeding chips and genotyping platforms will provide unprecedented opportunities to accelerate the development of cultivars with desired yield potential, quality, and enhanced adaptation to mitigate the effects of climate change.
Collapse
|
17
|
Development of a high-density linkage map and mapping of the three-pistil gene (Pis1) in wheat using GBS markers. BMC Genomics 2017; 18:567. [PMID: 28760136 PMCID: PMC5537994 DOI: 10.1186/s12864-017-3960-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2016] [Accepted: 07/25/2017] [Indexed: 12/21/2022] Open
Abstract
Background The wheat mutant line three-pistil (TP) exhibits three pistils per floret. As TP normally has two or three seeds in each of the florets on the same spike, there is the possibility of increasing the number of grains per spike. Therefore, TP is a highly valuable mutant for breeding and for the study of floral development in wheat. To map the three-pistil gene (Pis1), genotyping-by-sequencing single-nucleotide polymorphism (GBS-SNP) data from an F2 mapping population (CM28 × CM28TP) was used to construct a genetic map that is of significant value. Results In the present study, a high-density genetic map of wheat containing 2917 GBS-SNP markers was constructed. Twenty-one linkage groups were resolved, with a total length of 2371.40 cM. The individual chromosomes range from 2.64 cM to 454.55 cM with an average marker density of 0.81 cM. The Pis1 gene was mapped using this high-resolution map, and two flanking SNP markers tightly linked to the gene, M70 and M71, were identified. The Pis1 is 3.00 cM from M70 and 1.10 cM from M71. In bread wheat genome, M70 and M71 were found to delimit a physical distance of 3.40 Mb, which encompasses 127 protein-coding genes. To validate the GBS-generated genotypic data and to eliminate missing marker data in the Pis1 region, five Kompetitive Allele-Specific PCR (KASP) assays were designed from corresponding GBS sequences, which harbor SNPs that surround Pis1. Three KASP-SNP markers, KM70, KM71, and KM75, were remapped to the Pis1 gene region. Conclusions This work not only lays the foundation for the map-based cloning of Pis1 but can also serve as a valuable tool for studying marker-trait association of important traits and marker-assisted breeding in wheat. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3960-7) contains supplementary material, which is available to authorized users.
Collapse
|
18
|
Validation of QTL mapping and transcriptome profiling for identification of candidate genes associated with nitrogen stress tolerance in sorghum. BMC PLANT BIOLOGY 2017; 17:123. [PMID: 28697783 PMCID: PMC5505042 DOI: 10.1186/s12870-017-1064-9] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 06/25/2017] [Indexed: 05/10/2023]
Abstract
BACKGROUND Quantitative trait loci (QTLs) detected in one mapping population may not be detected in other mapping populations at all the time. Therefore, before being used for marker assisted breeding, QTLs need to be validated in different environments and/or genetic backgrounds to rule out statistical anomalies. In this regard, we mapped the QTLs controlling various agronomic traits in a recombinant inbred line (RIL) population in response to Nitrogen (N) stress and validated these with the reported QTLs in our earlier study to find the stable and consistent QTLs across populations. Also, with Illumina RNA-sequencing we checked the differential expression of gene (DEG) transcripts between parents and pools of RILs with high and low nitrogen use efficiency (NUE) and overlaid these DEGs on to the common validated QTLs to find candidate genes associated with N-stress tolerance in sorghum. RESULTS An F7 RIL population derived from a cross between CK60 (N-stress sensitive) and San Chi San (N-stress tolerant) inbred sorghum lines was used to map QTLs for 11 agronomic traits tested under different N-levels. Composite interval mapping analysis detected a total of 32 QTLs for 11 agronomic traits. Validation of these QTLs revealed that of the detected, nine QTLs from this population were consistent with the reported QTLs in earlier study using CK60/China17 RIL population. The validated QTLs were located on chromosomes 1, 6, 7, 8, and 9. In addition, root transcriptomic profiling detected 55 and 20 differentially expressed gene (DEG) transcripts between parents and pools of RILs with high and low NUE respectively. Also, overlay of these DEG transcripts on to the validated QTLs found candidate genes transcripts for NUE and also showed the expected differential expression. For example, DEG transcripts encoding Lysine histidine transporter 1 (LHT1) had abundant expression in San Chi San and the tolerant RIL pool, whereas DEG transcripts encoding seed storage albumin, transcription factor IIIC (TFIIIC) and dwarfing gene (DW2) encoding multidrug resistance-associated protein-9 homolog showed abundant expression in CK60 parent, similar to earlier study. CONCLUSIONS The validated QTLs among different mapping populations would be the most reliable and stable QTLs across germplasm. The DEG transcripts found in the validated QTL regions will serve as future candidate genes for enhancing NUE in sorghum using molecular approaches.
Collapse
|
19
|
Genome complexity of harmful microalgae. HARMFUL ALGAE 2017; 63:7-12. [PMID: 28366402 DOI: 10.1016/j.hal.2017.01.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Revised: 01/09/2017] [Accepted: 01/09/2017] [Indexed: 06/07/2023]
Abstract
During the past decade, next generation sequencing (NGS) technologies have provided new insights into the diversity, dynamics, and metabolic pathways of natural microbial communities. But, these new techniques face challenges related to the genome size and level of genome complexity of the species under investigation. Moreover, the coverage depth and the short-read length achieved by NGS based approaches also represent a major challenge for assembly. These factors could limit the use of these high-throughput sequencing methods for species lacking a reference genome and characterized by a high level of complexity. In the present work, the evolutionary history, mainly consisting of gene transfer events from bacteria and unicellular eukaryotes to microalgae, including harmful species, is discussed and reviewed as it relates to NGS application in microbial communities, with a particular focus on harmful algal bloom species and dinoflagellates. In the context of genetic population studies, genotyping-by-sequencing (GBS), an NGS based approach, could be used for the discovery and analysis of single nucleotide polymorphisms (SNPs). The NGS technologies are still relatively new and require further improvement. Specifically, there is a need to develop and standardize tools and approaches to handle large data sets, which have to be used for the majority of HAB species characterized by evolutionary highly dynamic genomes.
Collapse
|
20
|
Genotyping-by-sequencing provides the discriminating power to investigate the subspecies of Daucus carota (Apiaceae). BMC Evol Biol 2016; 16:234. [PMID: 27793080 PMCID: PMC5084430 DOI: 10.1186/s12862-016-0806-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 10/14/2016] [Indexed: 12/05/2022] Open
Abstract
BACKGROUND The majority of the subspecies of Daucus carota have not yet been discriminated clearly by various molecular or morphological methods and hence their phylogeny and classification remains unresolved. Recent studies using 94 nuclear orthologs and morphological characters, and studies employing other molecular approaches were unable to distinguish clearly many of the subspecies. Fertile intercrosses among traditionally recognized subspecies are well documented. We here explore the utility of single nucleotide polymorphisms (SNPs) generated by genotyping-by-sequencing (GBS) to serve as an effective molecular method to discriminate the subspecies of the D. carota complex. RESULTS We used GBS to obtain SNPs covering all nine Daucus carota chromosomes from 162 accessions of Daucus and two related genera. To study Daucus phylogeny, we scored a total of 10,814 or 38,920 SNPs with a maximum of 10 or 30 % missing data, respectively. To investigate the subspecies of D. carota, we employed two data sets including 150 accessions: (i) rate of missing data 10 % with a total of 18,565 SNPs, and (ii) rate of missing data 30 %, totaling 43,713 SNPs. Consistent with prior results, the topology of both data sets separated species with 2n = 18 chromosome from all other species. Our results place all cultivated carrots (D. carota subsp. sativus) in a single clade. The wild members of D. carota from central Asia were on a clade with eastern members of subsp. sativus. The other subspecies of D. carota were in four clades associated with geographic groups: (1) the Balkan Peninsula and the Middle East, (2) North America and Europe, (3) North Africa exclusive of Morocco, and (4) the Iberian Peninsula and Morocco. Daucus carota subsp. maximus was discriminated, but neither it, nor subsp. gummifer (defined in a broad sense) are monophyletic. CONCLUSIONS Our study suggests that (1) the morphotypes identified as D. carota subspecies gummifer (as currently broadly circumscribed), all confined to areas near the Atlantic Ocean and the western Mediterranean Sea, have separate origins from sympatric members of other subspecies of D. carota, (2) D. carota subsp. maximus, on two clades with some accessions of subsp. carota, can be distinguished from each other but only with poor morphological support, (3) D. carota subsp. capillifolius, well distinguished morphologically, is an apospecies relative to North African populations of D. carota subsp. carota, (4) the eastern cultivated carrots have origins closer to wild carrots from central Asia than to western cultivated carrots, and (5) large SNP data sets are suitable for species-level phylogenetic studies in Daucus.
Collapse
|
21
|
Effects of methylation-sensitive enzymes on the enrichment of genic SNPs and the degree of genome complexity reduction in a two-enzyme genotyping-by-sequencing (GBS) approach: a case study in oil palm ( Elaeis guineensis). MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2016; 36:154. [PMID: 27942246 PMCID: PMC5104780 DOI: 10.1007/s11032-016-0572-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Accepted: 10/20/2016] [Indexed: 05/08/2023]
Abstract
Advances in next generation sequencing have facilitated a large-scale single nucleotide polymorphism (SNP) discovery in many crop species. Genotyping-by-sequencing (GBS) approach couples next generation sequencing with genome complexity reduction techniques to simultaneously identify and genotype SNPs. Choice of enzymes used in GBS library preparation depends on several factors including the number of markers required, the desired level of multiplexing, and whether the enrichment of genic SNP is preferred. We evaluated various combinations of methylation-sensitive (AatII, PstI, MspI) and methylation-insensitive (SphI, MseI) enzymes for their effectiveness in genome complexity reduction and enrichment of genic SNPs. We discovered that the use of two methylation-sensitive enzymes effectively reduced genome complexity and did not require a size selection step. On the contrary, the genome coverage of libraries constructed with methylation-insensitive enzymes was quite high, and the additional size selection step may be required to increase the overall read depth. We also demonstrated the effectiveness of methylation-sensitive enzymes in enriching for SNPs located in genic regions. When two methylation-insensitive enzymes were used, only 16% of SNPs identified were located in genes and 18% in the vicinity (± 5 kb) of the genic regions, while most SNPs resided in the intergenic regions. In contrast, a remarkable degree of enrichment was observed when two methylation-sensitive enzymes were employed. Almost two thirds of the SNPs were located either inside (32-36%) or in the vicinity (28-31%) of the genic regions. These results provide useful information to help researchers choose appropriate GBS enzymes in oil palm and other crop species.
Collapse
|
22
|
Joint assembly and genetic mapping of the Atlantic horseshoe crab genome reveals ancient whole genome duplication. Gigascience 2014; 3:9. [PMID: 24987520 PMCID: PMC4066314 DOI: 10.1186/2047-217x-3-9] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2013] [Accepted: 04/23/2014] [Indexed: 11/11/2022] Open
Abstract
Background Horseshoe crabs are marine arthropods with a fossil record extending back approximately 450 million years. They exhibit remarkable morphological stability over their long evolutionary history, retaining a number of ancestral arthropod traits, and are often cited as examples of “living fossils.” As arthropods, they belong to the Ecdysozoa, an ancient super-phylum whose sequenced genomes (including insects and nematodes) have thus far shown more divergence from the ancestral pattern of eumetazoan genome organization than cnidarians, deuterostomes and lophotrochozoans. However, much of ecdysozoan diversity remains unrepresented in comparative genomic analyses. Results Here we apply a new strategy of combined de novo assembly and genetic mapping to examine the chromosome-scale genome organization of the Atlantic horseshoe crab, Limulus polyphemus. We constructed a genetic linkage map of this 2.7 Gbp genome by sequencing the nuclear DNA of 34 wild-collected, full-sibling embryos and their parents at a mean redundancy of 1.1x per sample. The map includes 84,307 sequence markers grouped into 1,876 distinct genetic intervals and 5,775 candidate conserved protein coding genes. Conclusions Comparison with other metazoan genomes shows that the L. polyphemus genome preserves ancestral bilaterian linkage groups, and that a common ancestor of modern horseshoe crabs underwent one or more ancient whole genome duplications 300 million years ago, followed by extensive chromosome fusion. These results provide a counter-example to the often noted correlation between whole genome duplication and evolutionary radiations. The new, low-cost genetic mapping method for obtaining a chromosome-scale view of non-model organism genomes that we demonstrate here does not require laboratory culture, and is potentially applicable to a broad range of other species.
Collapse
|