1
|
Batista LG, Mello VH, Souza AP, Margarido GRA. Genomic prediction with allele dosage information in highly polyploid species. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:723-739. [PMID: 34800132 DOI: 10.1007/s00122-021-03994-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 11/06/2021] [Indexed: 06/13/2023]
Abstract
Including allele, dosage can improve genomic selection in highly polyploid species under higher frequency of different heterozygous genotypic classes and high dominance degree levels. Several studies have shown how to leverage allele dosage information to improve the accuracy of genomic selection models in autotetraploid. In this study, we expanded the methodology used for genomic selection in autotetraploid to higher (and mixed) ploidy levels. We adapted the models to build covariance matrices of both additive and digenic dominance effects that are subsequently used in genomic selection models. We applied these models using estimates of ploidy and allele dosage to sugarcane and sweet potato datasets and validated our results by also applying the models in simulated data. For the simulated datasets, including allele dosage information led up to 140% higher mean predictive abilities in comparison to using diploidized markers. Including dominance effects were highly advantageous when using diploidized markers, leading to mean predictive abilities which were up to 115% higher in comparison to only including additive effects. When the frequency of heterozygous genotypes in the population was low, such as in the sugarcane and sweet potato datasets, there was little advantage in including allele dosage information in the models. Overall, we show that including allele dosage can improve genomic selection in highly polyploid species under higher frequency of different heterozygous genotypic classes and high dominance degree levels.
Collapse
Affiliation(s)
- Lorena G Batista
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | - Victor H Mello
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | - Anete P Souza
- Center of Molecular Biology and Genetic Engineering, University of Campinas, Campinas, SP, 13083-970, Brazil
| | - Gabriel R A Margarido
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil.
| |
Collapse
|
2
|
Bansal M, Adamski NM, Toor PI, Kaur S, Sharma A, Srivastava P, Bansal U, Uauy C, Chhuneja P. A robust KASP marker for selection of four pairs of linked leaf rust and stripe rust resistance genes introgressed on chromosome arm 5DS from different wheat genomes. Mol Biol Rep 2021; 48:5209-5216. [PMID: 34213711 DOI: 10.1007/s11033-021-06525-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 06/25/2021] [Indexed: 11/24/2022]
Abstract
Stripe rust and leaf rust are among the most devastating diseases of wheat, limiting its production globally. Wheat wild relatives harbour genetic diversity for new genes and alleles for all major wheat diseases. However, the use of this genetic variation from wild progenitor and non-progenitor species has been limited in the breeding programs. Reasons include limited recombination of donor and recipient genomes and the lack of tertiary gene pool markers. Here, we describe the development of a SNP based marker from the flow-sorted and sequenced Aegilops umbellulata chromosome 5U which can be used for marker assisted selection of four pair of alien leaf rust and stripe rust resistance genes. Lr57-Yr40_CAPS16 marker was reported earlier to be linked with alien leaf and stripe rust resistance genes introgressed on wheat chromosome 5DS. Due to its dominant nature and laborious to work with, a new SNP-based KASP marker, XTa5DS-2754099_kasp23, was developed from the same CAPS marker contig. XTa5DS-2754099_kasp23 was tested in Aegilops umbellulata, Ae. geniculata, Ae. peregrina and Ae. caudata derived alien introgression lines, which harbour four pairs of linked leaf and stripe rust genes; Lr76-Yr70, Lr57-Yr40, LrP- YrP, LrAc-YrAc, respectively. This KASP marker was found to be effective for the selection of the aforesaid four pairs of leaf rust and stripe rust resistance genes. Further, we tested and validated XTa5DS-2754099_kasp23 on commercial varieties and advanced breeding lines from four countries (India, Egypt, Australia and UK) including hexaploid and durum wheat. Our results provide evidence that KASP marker, XTa5DS-2754099_kasp23 can be used in marker-assisted selection of the four pairs of rust resistance alien genes in wheat breeding programmes.
Collapse
Affiliation(s)
- Mitaly Bansal
- School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, Punjab, 141 004, India
| | | | - Puneet Inder Toor
- School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, Punjab, 141 004, India
| | - Satinder Kaur
- School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, Punjab, 141 004, India
| | - Achla Sharma
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab, 141 004, India
| | - Puja Srivastava
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab, 141 004, India
| | - Urmil Bansal
- University of Sydney Plant Breeding Institute-Cobbitty, PMB 4011, Narellan, NSW, 2567, Australia
| | - Cristobal Uauy
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK
| | - Parveen Chhuneja
- School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, Punjab, 141 004, India.
| |
Collapse
|
3
|
High-Throughput Genotyping Technologies in Plant Taxonomy. Methods Mol Biol 2021; 2222:149-166. [PMID: 33301093 DOI: 10.1007/978-1-0716-0997-2_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Molecular markers provide researchers with a powerful tool for variation analysis between plant genomes. They are heritable and widely distributed across the genome and for this reason have many applications in plant taxonomy and genotyping. Over the last decade, molecular marker technology has developed rapidly and is now a crucial component for genetic linkage analysis, trait mapping, diversity analysis, and association studies. This chapter focuses on molecular marker discovery, its application, and future perspectives for plant genotyping through pangenome assemblies. Included are descriptions of automated methods for genome and sequence distance estimation, genome contaminant analysis in sequence reads, genome structural variation, and SNP discovery methods.
Collapse
|
4
|
Modern Approaches for Transcriptome Analyses in Plants. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021; 1346:11-50. [DOI: 10.1007/978-3-030-80352-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
5
|
Manimekalai R, Suresh G, Govinda Kurup H, Athiappan S, Kandalam M. Role of NGS and SNP genotyping methods in sugarcane improvement programs. Crit Rev Biotechnol 2020; 40:865-880. [PMID: 32508157 DOI: 10.1080/07388551.2020.1765730] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Sugarcane (Saccharum spp.) is one of the most economically significant crops because of its high sucrose content and it is a promising biomass feedstock for biofuel production. Sugarcane genome sequencing and analysis is a difficult task due to its heterozygosity and polyploidy. Long sequence read technologies, PacBio Single-Molecule Real-Time (SMRT) sequencing, the Illumina TruSeq, and the Oxford Nanopore sequencing could solve the problem of genome assembly. On the applications side, next generation sequencing (NGS) technologies played a major role in the discovery of single nucleotide polymorphism (SNP) and the development of low to high throughput genotyping platforms. The two mainstream high throughput genotyping platforms are the SNP microarray and genotyping by sequencing (GBS). This paper reviews the NGS in sugarcane genomics, genotyping methodologies, and the choice of these methods. Array-based SNP genotyping is robust, provides consistent SNPs, and relatively easier downstream data analysis. The GBS method identifies large scale SNPs across the germplasm. A combination of targeted GBS and array-based genotyping methods should be used to increase the accuracy of genomic selection and marker-assisted breeding.
Collapse
Affiliation(s)
- Ramaswamy Manimekalai
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Gayathri Suresh
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Hemaprabha Govinda Kurup
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Selvi Athiappan
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Mallikarjuna Kandalam
- Business Development, Asia Pacific Japan region, Thermo Fisher Scientific, Waltham, MA, USA
| |
Collapse
|
6
|
Association between polymorphism in the promoter region of lncRNA GAS5 and the risk of colorectal cancer. Biosci Rep 2019; 39:BSR20190091. [PMID: 30902880 PMCID: PMC6465203 DOI: 10.1042/bsr20190091] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 03/17/2019] [Accepted: 03/22/2019] [Indexed: 02/08/2023] Open
Abstract
The growth arrest special 5 (GAS5), as a research hotspot of long noncoding RNAs (lncRNAs), has been reported to be associated with colorectal cancer (CRC). However, the association between polymorphisms in GAS5 and the risk of CRC was not clear. In the present study, a case–control study in 1078 CRC patients and 1175 matched healthy controls was performed to evaluate the association between the potential functional genetic variants in GAS5 and the risk of CRC. PCR-TaqMan, qPCR, dual-luciferase assay, electrophoretic mobility shift assay (EMSA), flow cytometry, migration and invasion assays were performed to evaluate the function of polymorphism. Results showed that subjects carrying the rs55829688 CT/TT genotypes had a significantly higher risk of CRC when compared with the CC genotype. Further qPCR assay confirmed that the CRC tissues with rs55829688 CT/TT genotypes had a higher GAS5 mRNA expression level. The dual-luciferase assay, qPCR and EMSA assay revealed that rs55829688 T>C polymorphism could decrease the expression level of GAS5 by impacting the binding ability of the transcription factor Yin Yang-1 (YY1) to the GAS5 promoter region. The expression of apoptosis-related proteins were detected by Western blot. Further, flow cytometry, migration, and invasion experiments showed that GAS5 repressed apoptosis and increased invasion and migration capability of CRC cells. Taken together, our findings provided evidence that the rs55829688 variant in the GAS5 promoter was associated with the risk of CRC and decreased expression of GAS5 by affecting the binding affinity of the transcription factors YY1 to GAS5.
Collapse
|
7
|
Zhang J, Zhang Q, Li L, Tang H, Zhang Q, Chen Y, Arrow J, Zhang X, Wang A, Miao C, Ming R. Recent polyploidization events in three Saccharum founding species. PLANT BIOTECHNOLOGY JOURNAL 2019; 17:264-274. [PMID: 29878497 PMCID: PMC6330536 DOI: 10.1111/pbi.12962] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Revised: 05/21/2018] [Accepted: 06/04/2018] [Indexed: 05/07/2023]
Abstract
The complexity of polyploid Saccharum genomes hindered progress of genome research and crop improvement in sugarcane. To understand their genome structure, transcriptomes of 59 F1 individuals derived from S. officinarumLA Purple and S. robustum Molokai 5829 (2n = 80, x = 10 for both) were sequenced, yielding 11 157 and 8998 SNPs and 83 and 105 linkage groups, respectively. Most markers in each linkage group aligned to single sorghum chromosome. However, 71 interchromosomal rearrangements were detected between sorghum and S. officinarum or S. robustum, and 24 (33.8%) of them were shared between S. officinarum and S. robustum, indicating their occurrence before the speciation event that separated these two species. More than 2000 gene pairs from S. spontaneum, S. officinarum and S. robustum were analysed to estimate their divergence time. Saccharum officinarum and S. robustum diverged about 385 thousand years ago, and the whole-genome duplication events occurred after the speciation event because of shared interchromosomal rearrangements. The ancestor of these two species diverged from S. spontaneum about 769 thousand years ago, and the reduction in basic chromosome number from 10 to 8 in S. spontaneum occurred after the speciation event but before the two rounds of whole-genome duplication. Our results proved that S. officinarum is a legitimate species in its own right and not a selection from S. robustum during the domestication process in the past 10 000 years. Our findings rejected a long-standing hypothesis and clarified the timing of speciation and whole-genome duplication events in Saccharum.
Collapse
Affiliation(s)
- Jisen Zhang
- FAFU and UIUC Joint Center for Genomics and BiotechnologyKey Laboratory of Sugarcane Biology and Genetic Breeding Ministry of AgricultureFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, and Key Laboratory of GeneticsBreeding and Multiple Utilization of CorpsMinistry of EducationFujian Agriculture and Forestry UniversityFuzhouFujianChina
- College of Life SciencesFujian Normal UniversityFuzhouChina
| | - Qing Zhang
- FAFU and UIUC Joint Center for Genomics and BiotechnologyKey Laboratory of Sugarcane Biology and Genetic Breeding Ministry of AgricultureFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, and Key Laboratory of GeneticsBreeding and Multiple Utilization of CorpsMinistry of EducationFujian Agriculture and Forestry UniversityFuzhouFujianChina
- College of Life SciencesFujian Agriculture and Forestry UniversityFuzhouFujianChina
| | - Leiting Li
- College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Haibao Tang
- FAFU and UIUC Joint Center for Genomics and BiotechnologyKey Laboratory of Sugarcane Biology and Genetic Breeding Ministry of AgricultureFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, and Key Laboratory of GeneticsBreeding and Multiple Utilization of CorpsMinistry of EducationFujian Agriculture and Forestry UniversityFuzhouFujianChina
| | - Qiong Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specialty AgricultureWuhan Botanical GardenChinese Academy of SciencesWuhanChina
| | - Yang Chen
- College of Life SciencesFujian Normal UniversityFuzhouChina
| | - Jie Arrow
- Department of Plant BiologyUniversity of Illinois at Urbana‐ChampaignUrbanaILUSA
| | - Xingtan Zhang
- FAFU and UIUC Joint Center for Genomics and BiotechnologyKey Laboratory of Sugarcane Biology and Genetic Breeding Ministry of AgricultureFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, and Key Laboratory of GeneticsBreeding and Multiple Utilization of CorpsMinistry of EducationFujian Agriculture and Forestry UniversityFuzhouFujianChina
| | - Aiqin Wang
- State Key Lab for Conservation and Utilization of Subtropical Agro‐biological ResourcesGuangxi UniversityNanningChina
| | - Chenyong Miao
- FAFU and UIUC Joint Center for Genomics and BiotechnologyKey Laboratory of Sugarcane Biology and Genetic Breeding Ministry of AgricultureFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, and Key Laboratory of GeneticsBreeding and Multiple Utilization of CorpsMinistry of EducationFujian Agriculture and Forestry UniversityFuzhouFujianChina
| | - Ray Ming
- FAFU and UIUC Joint Center for Genomics and BiotechnologyKey Laboratory of Sugarcane Biology and Genetic Breeding Ministry of AgricultureFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, and Key Laboratory of GeneticsBreeding and Multiple Utilization of CorpsMinistry of EducationFujian Agriculture and Forestry UniversityFuzhouFujianChina
- Department of Plant BiologyUniversity of Illinois at Urbana‐ChampaignUrbanaILUSA
| |
Collapse
|
8
|
Association of ACE2 genetic polymorphisms with hypertension-related target organ damages in south Xinjiang. Hypertens Res 2018; 42:681-689. [PMID: 30542083 PMCID: PMC6477792 DOI: 10.1038/s41440-018-0166-6] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Revised: 10/14/2018] [Accepted: 10/14/2018] [Indexed: 12/12/2022]
Abstract
Essential hypertension (EH) is a principal contributing factor in
worldwide cardiovascular disease mortality. Although interventions that minimize
environmental risk factors for EH are associated with reduced cardiovascular
disease, such approaches are limited for individuals with high genetic EH risk. In
this study, we investigated possible associations between ACE2 polymorphisms and
hypertension-related target organ damages in south Xinjiang, China. Four hundred and
two hypertensive patients were enrolled as study participants in an EH group, and
233 normotensive individuals were enrolled as control subjects. Participants were
recruited from the south Xinjiang region. Fourteen ACE2 polymorphisms were genotyped
by matrix-assisted laser desorption ionization time-of-flight mass spectrometry.
Risk genotypes of rs2074192 (TT+CT, OR = 1.72, 95% CI: 1.17–2.53), rs2106809 (TT,
OR = 1.71, 95% CI: 1.13–2.58), rs4240157 (CC+CT, OR = 1.99, 95% CI: 1.17–3.41),
rs4646155 (TT+CT, OR = 1.94, 95% CI: 1.06–3.54), rs4646188 (TT+CT, OR = 3.25, 95%
CI: 1.95–5.41), rs4830542 (CC+CT, OR = 1.88, 95% CI: 1.10–3.23), and rs879922
(CC+CG, OR = 4.86, 95% CI: 2.74–8.64) were associated with EH. Hypertensive patients
carrying the control genotype of rs2074192 (CC, OR = 2.37, 95% CI: 1.28–4.39) were
associated with CAS ≥50%, while those carrying a high-EH-risk genotype of rs4240157
(OR = 2.62, 95% CI: 1.24–5.54), rs4646155 (OR = 2.44, 95% CI: 1.16–5.10), or
rs4830542 (CC+CT, OR = 2.20, 95% CI: 1.03–4.69) were associated with atrial
fibrillation (AF), larger left atrial diameter, and higher levels of
renin–angiotensin–aldosterone system (RAAS) activation (renin and angiotensin I/II).
In conclusion, the ACE2 variant rs2074192 was associated with EH and EH with CAS
≥50%, while 3 ACE2 variants (rs4240157, rs4646155, and rs4830542) were associated
with EH- and hypertension-related AF and left atrial remodeling in south Xinjiang,
China.
Collapse
|
9
|
Association of ACE2 polymorphisms with susceptibility to essential hypertension and dyslipidemia in Xinjiang, China. Lipids Health Dis 2018; 17:241. [PMID: 30342552 PMCID: PMC6195726 DOI: 10.1186/s12944-018-0890-6] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 10/07/2018] [Indexed: 12/21/2022] Open
Abstract
Background Cardiovascular benefits by reversing environmental risks factors for essential hypertension (EH) and dyslipidemia could be weaken by high genetic risk. We investigated possible associations between ACE2 polymorphisms and dyslipidemia in patients with EH. Methods Four hundred and two hypertensive patients were enrolled in an EH group and 233 normotensive individuals were enrolled as control group from the Xinjiang region of China. Fourteen ACE2 polymorphisms were genotyped by Matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Results Participants carrying T allele (TT + CT) of rs2074192 (P = 0.006), rs4646155 (P = 0.030) and rs4646188 (P < 0.001), C allele (CT + CT or CC + CG) of rs4240157 (P = 0.012), rs4830542 (P = 0.020) and rs879922 (P < 0.001) and TT genotype of rs2106809 (P = 0.012) were associated with EH. Meanwhile,ACE2 SNPs also exhibited association with dyslipidemia but exhibited obvious heterogeneity. rs1978124 (TT + CT, P = 0.009), rs2106809 (TT, P = 0.045), rs233575 (CC + CT, P = 0.018), rs4646188 (CC, P = 0.011) and rs879922 (CC + CG, P = 0.003) were association with increased LDL-C (≥1.8 mmol/L). rs2106809 (CC + CT, P < 0.001), rs2285666(TT + CT, P = 0.017), rs4646142(CC + CG, P = 0.044), rs4646155(TT + CT, P < 0.001) and rs4646188(TT + CT, P = 0.033) were association with decreased HDL-C (< 1.0 mmol/L). rs2074192 (TT + CT, P = 0.012), rs4240157 (CC + CT, P = 0.027), rs4646156 (AA+AT, P = 0.007), rs4646188 (TT + CT, P = 0.005), rs4830542 (CC + CT, P = 0.047) and rs879922 (CC + CG, P = 0.001) were association with increased TC (≥5.2 mmol/L). rs2106809 (P = 0.034) and rs4646188 (P = 0.013) were associated with hypertriglyceridemia. Further, ischemic stroke was more prevalent with rs4240157 (CC + CT, P = 0.043), rs4646188 (CC + CT, P = 0.013) and rs4830542 (CC + CT, P = 0.037). In addition, rs2048683 and rs6632677 were not association with EH, dyslipidemia and ischemic stroke. Conclusion The ACE2 rs4646188 variant may be a potential and optimal genetic susceptibility marker for EH, dyslipidemia and its related ischemic stroke. Electronic supplementary material The online version of this article (10.1186/s12944-018-0890-6) contains supplementary material, which is available to authorized users.
Collapse
|
10
|
Liu C, Li Y, Guan T, Lai Y, Shen Y, Zeyaweiding A, Zhao H, Li F, Maimaiti T. ACE2 polymorphisms associated with cardiovascular risk in Uygurs with type 2 diabetes mellitus. Cardiovasc Diabetol 2018; 17:127. [PMID: 30227878 PMCID: PMC6142339 DOI: 10.1186/s12933-018-0771-3] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 09/14/2018] [Indexed: 12/16/2022] Open
Abstract
Background Type 2 diabetes mellitus (T2D), rapidly increasing to epidemic proportions, globally escalates cardiovascular disease risk. Although intensive interventions and comprehensive management of environmental risks factors for T2D are associated with reduced cardiovascular disease, such approaches are limited for individuals with high genetic T2D risk. In this study we investigated possible associations of ACE2 polymorphisms and cardiovascular risks in Uygur patients with T2D. Methods 275 Uygur T2D patients and 272 non-diabetic Uygur individuals were enrolled as study participants. 14 ACE2 polymorphisms were genotyped by Matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Results ACE2 SNP rs1978124, rs2048683, rs2074192, rs233575, rs4240157, rs4646156, rs4646188 and rs879922 were associated with T2D (all P < 0.05). The 8 diabetic risk related ACE2 SNPs were further associated with diabetic related cardiovascular complications or events but exhibited heterogeneity as fellows: firstly, almost all diabetic risk related ACE2 SNPs (all P < 0.05) were associated with increased SBP except rs1978124 and rs2074192, while rs2074192, rs4646188 and rs879922 were associated elevated DBP (all P < 0.05). Secondly, SNP rs4646188 was not correlated with any type of dyslipidemia (TRIG, HDL-C, LDL-C or CHOL), and the other 7 diabetic risk related loci were at least correlated with one type of dyslipidemia (all P < 0.05). In particular, rs879922 were simultaneously correlated with four type of dyslipidemia (all P < 0.05). Thirdly, ACE2 SNP rs2074192 and rs879922 were associated with carotid arteriosclerosis stenosis (CAS) ≥ 50% (both P < 0.05). Fourthly, ACE2 SNP rs2074192, rs4240157, rs4646188 and 879922 were associated with increased MAU (all P < 0.05). In addition, ACE2 SNP rs2048683, rs4240157, rs4646156, rs4646188 and rs879922 were linked to heavier LVMI (all P < 0.05), but only rs4240157, rs4646156 and rs4646188 were associated with lower LVEF (all P < 0.05). Conclusion ACE2 SNP rs879922 may be a common genetic loci and optimal genetic susceptibility marker for T2D and T2D related cardiovascular risks in Uygurs. Electronic supplementary material The online version of this article (10.1186/s12933-018-0771-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Cheng Liu
- Department of Cardiology, Guangzhou First People's Hospital, Medical School, South China University of Technology, #1 Panfu Road, Guangzhou, 510180, China. .,Department of Cardiology, Guangzhou First People's Hospital, Guangzhou Medical University, Guangzhou, 510180, China. .,Department of Cardiology, Shufu People's Hospital, Kashgar Region, Xinjiang Uygur Autonomous Region (XUAR), 844100, China.
| | - Yanfang Li
- Department of Cardiology, Guangzhou First People's Hospital, Guangzhou Medical University, Guangzhou, 510180, China
| | - Tianwang Guan
- Department of Cardiology, Guangzhou First People's Hospital, Guangzhou Medical University, Guangzhou, 510180, China
| | - Yanxian Lai
- Department of Cardiology, Guangzhou First People's Hospital, Medical School, South China University of Technology, #1 Panfu Road, Guangzhou, 510180, China
| | - Yan Shen
- Department of Cardiology, Guangzhou First People's Hospital, Guangzhou Medical University, Guangzhou, 510180, China
| | - Abudurexiti Zeyaweiding
- Department of Cardiology, Shufu People's Hospital, Kashgar Region, Xinjiang Uygur Autonomous Region (XUAR), 844100, China
| | - Haiyan Zhao
- Department of Cardiology, Shufu People's Hospital, Kashgar Region, Xinjiang Uygur Autonomous Region (XUAR), 844100, China
| | - Fang Li
- Department of Cardiology, Shufu People's Hospital, Kashgar Region, Xinjiang Uygur Autonomous Region (XUAR), 844100, China
| | - Tutiguli Maimaiti
- Department of Cardiology, Shufu People's Hospital, Kashgar Region, Xinjiang Uygur Autonomous Region (XUAR), 844100, China
| |
Collapse
|
11
|
Sun Y, Wang J, Qiu Y, Liu T, Song J, Li X. Identification of 'Xinlimei' radish candidate genes associated with anthocyanin biosynthesis based on a transcriptome analysis. Gene 2018. [PMID: 29518548 DOI: 10.1016/j.gene.2018.03.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Radish is an economically important vegetable crop belonging to the family Brassicaceae. The high anthocyanin content of the 'Xinlimei' radish roots has been associated with diverse health benefits. However, there is a lack of transcript-level information regarding anthocyanin biosynthesis. In the present study, the 'Xinlimei' radish root transcriptome was analyzed by RNA sequencing at five developmental stages. A total of 222,384,034 clean reads were obtained and 32,253 unigenes were annotated. Expression profiles revealed 10,890 differentially expressed genes (DEGs) among the five analyzed libraries. The DEGs were predominantly involved in KEGG pathways related to the biosynthesis of phenylpropanoids, flavonoids, flavone, and flavonol. The transcriptome data revealed 44 structural and 182 transcription factor genes (TFs) associated with the anthocyanin biosynthetic pathway. Ten structural genes (i.e., 4CL3, CHSB2, CHS1, CHS3, F3H1, F3'H, DFR, DFR1, ANS, and UFGT) and two MYB genes, which were highly and differentially expressed during root development, may be critical for anthocyanin biosynthesis. Additionally, the co-expression of TFs and structural genes was analyzed. Three structural genes (i.e., DFR, ANS, and UFGT) were validated by molecular cloning. The qRT-PCR results indicated that the expression profiles of DEGs were generally consistent with the high-throughput sequencing results. These findings helped identify candidate genes involved in anthocyanin biosynthesis and may be useful for clarifying the molecular mechanism underlying the accumulation of anthocyanins in radish roots.
Collapse
Affiliation(s)
- Yuyan Sun
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China; Institute of Vegetables, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Jinglei Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yang Qiu
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Tongjin Liu
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jiangping Song
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xixiang Li
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China.
| |
Collapse
|
12
|
Manivannan A, Kim JH, Yang EY, Ahn YK, Lee ES, Choi S, Kim DS. Next-Generation Sequencing Approaches in Genome-Wide Discovery of Single Nucleotide Polymorphism Markers Associated with Pungency and Disease Resistance in Pepper. BIOMED RESEARCH INTERNATIONAL 2018; 2018:5646213. [PMID: 29546063 PMCID: PMC5818978 DOI: 10.1155/2018/5646213] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Revised: 10/23/2017] [Accepted: 10/26/2017] [Indexed: 12/04/2022]
Abstract
Pepper is an economically important horticultural plant that has been widely used for its pungency and spicy taste in worldwide cuisines. Therefore, the domestication of pepper has been carried out since antiquity. Owing to meet the growing demand for pepper with high quality, organoleptic property, nutraceutical contents, and disease tolerance, genomics assisted breeding techniques can be incorporated to develop novel pepper varieties with desired traits. The application of next-generation sequencing (NGS) approaches has reformed the plant breeding technology especially in the area of molecular marker assisted breeding. The availability of genomic information aids in the deeper understanding of several molecular mechanisms behind the vital physiological processes. In addition, the NGS methods facilitate the genome-wide discovery of DNA based markers linked to key genes involved in important biological phenomenon. Among the molecular markers, single nucleotide polymorphism (SNP) indulges various benefits in comparison with other existing DNA based markers. The present review concentrates on the impact of NGS approaches in the discovery of useful SNP markers associated with pungency and disease resistance in pepper. The information provided in the current endeavor can be utilized for the betterment of pepper breeding in future.
Collapse
Affiliation(s)
- Abinaya Manivannan
- Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju 55365, Republic of Korea
| | - Jin-Hee Kim
- Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju 55365, Republic of Korea
| | - Eun-Young Yang
- Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju 55365, Republic of Korea
| | - Yul-Kyun Ahn
- Department of Vegetable Crops, Korea National College of Agriculture and Fisheries, Jeonju 54874, Republic of Korea
| | - Eun-Su Lee
- Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju 55365, Republic of Korea
| | - Sena Choi
- Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju 55365, Republic of Korea
| | - Do-Sun Kim
- Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju 55365, Republic of Korea
| |
Collapse
|
13
|
Thirugnanasambandam PP, Hoang NV, Henry RJ. The Challenge of Analyzing the Sugarcane Genome. FRONTIERS IN PLANT SCIENCE 2018; 9:616. [PMID: 29868072 PMCID: PMC5961476 DOI: 10.3389/fpls.2018.00616] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 04/18/2018] [Indexed: 05/04/2023]
Abstract
Reference genome sequences have become key platforms for genetics and breeding of the major crop species. Sugarcane is probably the largest crop produced in the world (in weight of crop harvested) but lacks a reference genome sequence. Sugarcane has one of the most complex genomes in crop plants due to the extreme level of polyploidy. The genome of modern sugarcane hybrids includes sub-genomes from two progenitors Saccharum officinarum and S. spontaneum with some chromosomes resulting from recombination between these sub-genomes. Advancing DNA sequencing technologies and strategies for genome assembly are making the sugarcane genome more tractable. Advances in long read sequencing have allowed the generation of a more complete set of sugarcane gene transcripts. This is supporting transcript profiling in genetic research. The progenitor genomes are being sequenced. A monoploid coverage of the hybrid genome has been obtained by sequencing BAC clones that cover the gene space of the closely related sorghum genome. The complete polyploid genome is now being sequenced and assembled. The emerging genome will allow comparison of related genomes and increase understanding of the functioning of this polyploidy system. Sugarcane breeding for traditional sugar and new energy and biomaterial uses will be enhanced by the availability of these genomic resources.
Collapse
Affiliation(s)
- Prathima P. Thirugnanasambandam
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
- ICAR - Sugarcane Breeding Institute, Coimbatore, India
- *Correspondence: Prathima P. Thirugnanasambandam,
| | - Nam V. Hoang
- College of Agriculture and Forestry, Hue University, Hue, Vietnam
| | - Robert J. Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
| |
Collapse
|
14
|
Efficient identification of SNPs in pooled DNA samples using a dual mononucleotide addition-based sequencing method. Mol Genet Genomics 2017; 292:1069-1081. [PMID: 28612167 PMCID: PMC5594057 DOI: 10.1007/s00438-017-1332-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 06/02/2017] [Indexed: 11/18/2022]
Abstract
Identifying single nucleotide polymorphism (SNPs) from pooled samples is critical for many studies and applications. SNPs determined by next-generation sequencing results may suffer from errors in both base calling and read mapping. Taking advantage of dual mononucleotide addition-based pyrosequencing, we present Epds, a method to efficiently identify SNPs from pooled DNA samples. On the basis of only five patterns of non-synchronistic extensions between the wild and mutant sequences using dual mononucleotide addition-based pyrosequencing, we employed an enumerative algorithm to infer the mutant locus and estimate the proportion of mutant sequence. According to the profiles resulting from three runs with distinct dual mononucleotide additions, Epds could recover the mutant bases. Results showed that our method had a false-positive rate of less than 3%. Series of simulations revealed that Epds outperformed the current method (PSM) in many situations. Finally, experiments based on profiles produced by real sequencing proved that our method could be successfully applied for the identification of mutants from pooled samples. The software for implementing this method and the experimental data are available at http://bioinfo.seu.edu.cn/Epds.
Collapse
|
15
|
Parida SK, Kalia S, Pandit A, Nayak P, Singh RK, Gaikwad K, Srivastava PS, Singh NK, Mohapatra T. Single nucleotide polymorphism in sugar pathway and disease resistance genes in sugarcane. PLANT CELL REPORTS 2016; 35:1629-1653. [PMID: 27289592 DOI: 10.1007/s00299-016-1978-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Accepted: 03/21/2016] [Indexed: 06/06/2023]
Abstract
Single nucleotide polymorphism in sugar pathway and disease resistance genes showing genetic association with sugar content and red rot resistance would be useful in marker-assisted genetic improvement of sugarcane. Validation and genotyping of potential sequence variants in candidate genes are necessary to understand their functional significance and trait association potential. We discovered, characterized, validated and genotyped SNPs and InDels in sugar pathway and disease resistance genes of Saccharum complex and sugarcane varieties using amplicon sequencing and CAPS assays. The SNPs were abundant in the non-coding 3'UTRs than 5'UTRs and coding sequences depicting a strong bias toward C to T transition substitutions than transversions. Sequencing of cloned amplicons validated 61.6 and 45.2 % SNPs detected in silico in 21 sugar pathway and 16 disease resistance genes, respectively. Sixteen SNPs in four sugar pathway genes and 10 SNPs in nine disease resistance genes were validated through cost-effective CAPS assay. Functional and adaptive significance of SNP and protein haplotypes identified in sugar pathway and disease resistance genes was assessed by correlating their allelic variation with missense amino acid substitutions in the functional domains, alteration in protein structure models and possible modulation of catalytic enzyme activity in contrasting high and low sugar and moderately red rot resistant and highly susceptible sugarcane genotypes. A strong genetic association of five SNPs in the sugar pathway and disease resistance genes, and an InDel marker in the promoter sequence of sucrose synthase-2 gene, with sugar content and red rot resistance, was evident. The functionally relevant SNPs and InDels, detected and validated in sugar pathway and disease resistance genes, and genic CAPS markers designed, would be of immense use in marker-assisted genetic improvement of sugarcane for sugar content and disease resistance.
Collapse
Affiliation(s)
- Swarup K Parida
- National Research Centre on Plant Biotechnology, New Delhi, 110012, India
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Sanjay Kalia
- National Research Centre on Plant Biotechnology, New Delhi, 110012, India
- Department of Biotechnology, CGO Complex, Lodhi Road, New Delhi, 110003, India
| | - Awadhesh Pandit
- National Research Centre on Plant Biotechnology, New Delhi, 110012, India
- National Centre for Biological Sciences, Bengaluru, 560065, Karnataka , India
| | - Preetam Nayak
- Utkal University, Vanivihar, Bhubaneswar, Odisha, 751004, India
| | - Ram Kushal Singh
- U.P. Council of Sugarcane Research, Shahjahanpur, Uttar Pradesh, 242001, India
| | - Kishor Gaikwad
- National Research Centre on Plant Biotechnology, New Delhi, 110012, India
| | | | - Nagendra K Singh
- National Research Centre on Plant Biotechnology, New Delhi, 110012, India
| | - Trilochan Mohapatra
- National Research Centre on Plant Biotechnology, New Delhi, 110012, India.
- Indian Council of Agricultural Research, Krishi Bhavan, New Delhi, 110001, India.
| |
Collapse
|
16
|
Vos PG, Uitdewilligen JGAML, Voorrips RE, Visser RGF, van Eck HJ. Development and analysis of a 20K SNP array for potato (Solanum tuberosum): an insight into the breeding history. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2015; 128:2387-401. [PMID: 26263902 PMCID: PMC4636999 DOI: 10.1007/s00122-015-2593-y] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 07/23/2015] [Indexed: 05/21/2023]
Abstract
A 20K SNP array was developed and a comprehensive set of tetraploid cultivar was genotyped. This allowed us to identify footprints of the breeding history in contemporary breeding material such as identification of introgression segments, selection and founder signatures. A non-redundant subset of 15,138 previously identified SNPs and 4454 SNPs originating from the SolCAP project were combined into a 20k Infinium SNP array for genotyping a total of 569 potato genotypes. In this study we describe how this SNP array (encoded SolSTW array) was designed and analysed with fitTetra, software designed for autotetraploids. Genotypes from different countries and market segments, complemented with historic cultivars and important progenitors, were genotyped. This comprehensive set of genotypes combined with the deliberate inclusion of a large proportion of SNPs with a low minor allele frequency allowed us to distinguish genetic variation contributed by introgression breeding. This "new" (post 1945) genetic variation is located on specific chromosomal regions and enables the identification of SNP markers linked to R-genes. In addition, when the genetic composition of modern cultivars was compared with cultivars released before 1945, it appears that 96% of the genetic variants present in those ancestral cultivars remains polymorphic in modern cultivars. Hence, genetic erosion is almost absent in potato. Finally, we studied population genetic processes shaping the genetic composition of the modern European potato including drift, selection and founder effects. This resulted in the identification of major founders contributing to contemporary germplasm.
Collapse
Affiliation(s)
- Peter G Vos
- Wageningen UR, Plant Breeding, P.O. Box 386, 6700 AJ, Wageningen, The Netherlands
| | | | - Roeland E Voorrips
- Wageningen UR, Plant Breeding, P.O. Box 386, 6700 AJ, Wageningen, The Netherlands
| | - Richard G F Visser
- Wageningen UR, Plant Breeding, P.O. Box 386, 6700 AJ, Wageningen, The Netherlands
- Centre for BioSystems Genomics, P.O. Box 98, 6700 AB, Wageningen, The Netherlands
| | - Herman J van Eck
- Wageningen UR, Plant Breeding, P.O. Box 386, 6700 AJ, Wageningen, The Netherlands.
- Centre for BioSystems Genomics, P.O. Box 98, 6700 AB, Wageningen, The Netherlands.
| |
Collapse
|
17
|
Cheng J, Salentijn EMJ, Huang B, Denneboom C, Qi W, Dechesne AC, Krens FA, Visser RGF, van Loo EN. Detection of induced mutations in CaFAD2 genes by next-generation sequencing leading to the production of improved oil composition in Crambe abyssinica. PLANT BIOTECHNOLOGY JOURNAL 2015; 13:471-81. [PMID: 25393152 DOI: 10.1111/pbi.12269] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Revised: 08/10/2014] [Accepted: 08/20/2014] [Indexed: 05/09/2023]
Abstract
Crambe abyssinica is a hexaploid oil crop for industrial applications. An increase of erucic acid (C22:1) and reduction of polyunsaturated fatty acid (PUFA) contents in crambe oil is a valuable improvement. An increase in oleic acid (C18:1), a reduction in PUFA and possibly an increase in C22:1 can be obtained by down-regulating the expression of fatty acid desaturase2 genes (CaFAD2), which code for the enzyme that converts C18:1 into C18:2. We conducted EMS-mutagenesis in crambe, followed by Illumina sequencing, to screen mutations in three expressed CaFAD2 genes. Two novel analysis strategies were used to detect mutation sites. In the first strategy, mutation detection targeted specific sequence motifs. In the second strategy, every nucleotide position in a CaFAD2 fragment was tested for the presence of mutations. Seventeen novel mutations were detected in 1100 one-dimensional pools (11 000 individuals) in three expressed CaFAD2 genes, including non-sense mutations and mis-sense mutations in CaFAD2-C1, -C2 and -C3. The homozygous non-sense mutants for CaFAD2-C3 resulted in a 25% higher content of C18:1 and 25% lower content of PUFA compared to the wild type. The mis-sense mutations only led to small changes in oil composition. Concluding, targeted mutation detection using NGS in a polyploid was successfully applied and it was found that a non-sense mutation in even a single CaFAD2 gene can lead to changes in crambe oil composition. Stacking the mutations in different CaFAD2 may gain additional changes in C18:1 and PUFA contents.
Collapse
Affiliation(s)
- Jihua Cheng
- Wageningen UR Plant Breeding, Wageningen, The Netherlands; College of Life Science, Hubei University, Hubei, China
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Margarido GRA, Heckerman D. ConPADE: genome assembly ploidy estimation from next-generation sequencing data. PLoS Comput Biol 2015; 11:e1004229. [PMID: 25880203 PMCID: PMC4400156 DOI: 10.1371/journal.pcbi.1004229] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 03/09/2015] [Indexed: 01/08/2023] Open
Abstract
As a result of improvements in genome assembly algorithms and the ever decreasing costs of high-throughput sequencing technologies, new high quality draft genome sequences are published at a striking pace. With well-established methodologies, larger and more complex genomes are being tackled, including polyploid plant genomes. Given the similarity between multiple copies of a basic genome in polyploid individuals, assembly of such data usually results in collapsed contigs that represent a variable number of homoeologous genomic regions. Unfortunately, such collapse is often not ideal, as keeping contigs separate can lead both to improved assembly and also insights about how haplotypes influence phenotype. Here, we describe a first step in avoiding inappropriate collapse during assembly. In particular, we describe ConPADE (Contig Ploidy and Allele Dosage Estimation), a probabilistic method that estimates the ploidy of any given contig/scaffold based on its allele proportions. In the process, we report findings regarding errors in sequencing. The method can be used for whole genome shotgun (WGS) sequencing data. We also show applicability of the method for variant calling and allele dosage estimation. Results for simulated and real datasets are discussed and provide evidence that ConPADE performs well as long as enough sequencing coverage is available, or the true contig ploidy is low. We show that ConPADE may also be used for related applications, such as the identification of duplicated genes in fragmented assemblies, although refinements are needed. Diploid organisms, such as human beings, have two “copies” of each chromosome, whereas polyploid organisms have multiple “copies” (we use quotes to stress that the “copies” are not identical). A key difference between diploid and polyploid organisms is that the “copies” tend to be less similar in polyploid organisms. This difference leads to important differences in the process of de novo genome assembly from short fragments of DNA. In particular, when assembling polyploid organisms, contigs corresponding to different copies of the chromosomes can be quite different, and merging them leads to loss of information. Thus, it is important to maintain distinct contigs, even though they correspond to copies of the same chromosomal region. An important step in doing so is to determine how many truly distinct copies of a chromosomal region are found in a single contig. For example, if there are 12 copies of a particular chromosome, the possible number of distinct copies could be anywhere from 1 to 12. We call this task “contig ploidy estimation”, and present a method for accomplishing it. This set of methods is useful for the de novo assembly of complex, polyploid genomes such as sugarcane, switchgrass, and wheat.
Collapse
Affiliation(s)
- Gabriel R. A. Margarido
- Microsoft Research, Los Angeles, California, United States of America
- Departamento de Genética, Escola Superior de Agricultura ‘‘Luiz de Queiroz”, Universidade de São Paulo, Piracicaba, Brazil
- * E-mail: (GRAM); (DH)
| | - David Heckerman
- Microsoft Research, Los Angeles, California, United States of America
- * E-mail: (GRAM); (DH)
| |
Collapse
|
19
|
Hayward AC, Tollenaere R, Dalton-Morgan J, Batley J. Molecular marker applications in plants. Methods Mol Biol 2015; 1245:13-27. [PMID: 25373746 DOI: 10.1007/978-1-4939-1966-6_2] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Individuals within a population of a sexually reproducing species will have some degree of heritable genomic variation caused by mutations, insertion/deletions (INDELS), inversions, duplications, and translocations. Such variation can be detected and screened using molecular, or genetic, markers. By definition, molecular markers are genetic loci that can be easily tracked and quantified in a population and may be associated with a particular gene or trait of interest. This chapter will review the current major applications of molecular markers in plants.
Collapse
Affiliation(s)
- Alice C Hayward
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, QLD, Australia
| | | | | | | |
Collapse
|
20
|
Abstract
The detection and analysis of genetic variation plays an important role in plant breeding and this role is increasing with the continued development of genome sequencing technologies. Molecular genetic markers are important tools to characterize genetic variation and assist with genomic breeding. Processing and storing the growing abundance of molecular marker data being produced requires the development of specific bioinformatics tools and advanced databases. Molecular marker databases range from species specific through to organism wide and often host a variety of additional related genetic, genomic, or phenotypic information. In this chapter, we will present some of the features of plant molecular genetic marker databases, highlight the various types of marker resources, and predict the potential future direction of crop marker databases.
Collapse
|
21
|
Liu JJ, Sniezko RA, Sturrock RN, Chen H. Western white pine SNP discovery and high-throughput genotyping for breeding and conservation applications. BMC PLANT BIOLOGY 2014; 14:380. [PMID: 25547170 PMCID: PMC4302426 DOI: 10.1186/s12870-014-0380-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 12/11/2014] [Indexed: 05/10/2023]
Abstract
BACKGROUND Western white pine (WWP, Pinus monticola Douglas ex D. Don) is of high interest in forest breeding and conservation because of its high susceptibility to the invasive disease white pine blister rust (WPBR, caused by the fungus Cronartium ribicola J. C. Fisch). However, WWP lacks genomic resource development and is evolutionarily far away from plants with available draft genome sequences. Here we report a single nucleotide polymorphism (SNP) study by bulked segregation-based RNA-Seq analysis. RESULTS A collection of resistance germplasm was used for construction of cDNA libraries and SNP genotyping. Approximately 36-89 million 2 × 100-bp reads were obtained per library and de-novo assembly generated the first shoot-tip reference transcriptome containing a total of 54,661 unique transcripts. Bioinformatic SNP detection identified >100,000 high quality SNPs in three expressed candidate gene groups: Pinus highly conserved genes (HCGs), differential expressed genes (DEGs) in plant defense response, and resistance gene analogs (RGAs). To estimate efficiency of in-silico SNP discovery, genotyping assay was developed by using Sequenom iPlex and it unveiled SNP success rates from 40.1% to 61.1%. SNP clustering analyses consistently revealed distinct populations, each composed of multiple full-sib seed families by parentage assignment in the WWP germplasm collection. Linkage disequilibrium (LD) analysis identified six genes in significant association with major gene (Cr2) resistance, including three RGAs (two NBS-LRR genes and one receptor-like protein kinase -RLK gene), two HCGs, and one DEG. At least one SNP locus provided an excellent marker for Cr2 selection across P. monticola populations. CONCLUSIONS The WWP shoot tip transcriptome and those validated SNP markers provide novel genomic resources for genetic, evolutionary and ecological studies. SNP loci of those candidate genes associated with resistant phenotypes can be used as positional and functional variation sites for further characterization of WWP major gene resistance against C. ribicola. Our results demonstrate that integration of RNA-seq-based transcriptome analysis and high-throughput genotyping is an effective approach for discovery of a large number of nucleotide variations and for identification of functional gene variants associated with adaptive traits in a non-model species.
Collapse
Affiliation(s)
- Jun-Jun Liu
- />Pacific Forestry Centre, Canadian Forest Service, Natural Resources Canada, 506 West Burnside Road, Victoria, BC V8Z 1M5 Canada
| | - Richard A Sniezko
- />USDA Forest Service, Dorena Genetic Resource Center, 34963 Shoreview Road, Cottage Grove, OR 97424 USA
| | - Rona N Sturrock
- />Pacific Forestry Centre, Canadian Forest Service, Natural Resources Canada, 506 West Burnside Road, Victoria, BC V8Z 1M5 Canada
| | - Hao Chen
- />Pacific Forestry Centre, Canadian Forest Service, Natural Resources Canada, 506 West Burnside Road, Victoria, BC V8Z 1M5 Canada
| |
Collapse
|
22
|
De novo transcriptome assembly and the identification of gene-associated single-nucleotide polymorphism markers in Asian and American ginseng roots. Mol Genet Genomics 2014; 290:1055-65. [PMID: 25527477 DOI: 10.1007/s00438-014-0974-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Accepted: 12/05/2014] [Indexed: 10/24/2022]
Abstract
We performed de novo transcriptome sequencing for Panax ginseng and Panax quinquefolius accessions using the 454 GS FLX Titanium System and discovered annotation-based genome-wide single-nucleotide polymorphism (SNPs) using next-generation ginseng transcriptome data without reference genome sequence. The comprehensive transcriptome characterization with the mature roots of four ginseng accessions generated 297,170 reads for 'Cheonryang' cultivar, 305,673 reads for 'Yunpoong' cultivar, 311,861 reads for the G03080 breeding line, and 308,313 reads for P. quinquefolius. In transcriptome assembly, the lengths of the sample read were 156.42 Mb for 'Cheonryang', 161.95 Mb for 'Yunpoong', 165.07 Mb for G03080 breeding line, and 166.48 Mb for P. quinquefolius. A total of 97 primer pairs were designed with the homozygous SNP presented in all four accessions. SNP genotyping using high-resolution melting (HRM) analysis was performed to validate the putative SNP markers of 97 primer pairs. Out of the 73 primer pairs, 73 primer pairs amplified the target sequence and 34 primer pairs showed polymorphic melting curves in samples from 11 P. ginseng cultivars and one P. quinquefolius accession. Among the 34 polymorphic HRM-SNP primers, four primers were useful to distinguish ginseng cultivars. In the present study, we demonstrated that de novo transcriptome assembly and mapping analyses are useful in providing four HRM-SNP primer pairs that reliably show a high degree of polymorphism among ginseng cultivars.
Collapse
|
23
|
Molecular markers and cotton genetic improvement: current status and future prospects. ScientificWorldJournal 2014; 2014:607091. [PMID: 25401149 PMCID: PMC4226190 DOI: 10.1155/2014/607091] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Accepted: 09/17/2014] [Indexed: 11/17/2022] Open
Abstract
Narrow genetic base and complex allotetraploid genome of cotton (Gossypium hirsutum L.) is stimulating efforts to avail required polymorphism for marker based breeding. The availability of draft genome sequence of G. raimondii and G. arboreum and next generation sequencing (NGS) technologies facilitated the development of high-throughput marker technologies in cotton. The concepts of genetic diversity, QTL mapping, and marker assisted selection (MAS) are evolving into more efficient concepts of linkage disequilibrium, association mapping, and genomic selection, respectively. The objective of the current review is to analyze the pace of evolution in the molecular marker technologies in cotton during the last ten years into the following four areas: (i) comparative analysis of low- and high-throughput marker technologies available in cotton, (ii) genetic diversity in the available wild and improved gene pools of cotton, (iii) identification of the genomic regions within cotton genome underlying economic traits, and (iv) marker based selection methodologies. Moreover, the applications of marker technologies to enhance the breeding efficiency in cotton are also summarized. Aforementioned genomic technologies and the integration of several other omics resources are expected to enhance the cotton productivity and meet the global fiber quantity and quality demands.
Collapse
|
24
|
Zhang MF, Jiang LM, Zhang DM, Jia GX. De novo transcriptome characterization of Lilium 'Sorbonne' and key enzymes related to the flavonoid biosynthesis. Mol Genet Genomics 2014; 290:399-412. [PMID: 25307066 DOI: 10.1007/s00438-014-0919-0] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2014] [Accepted: 09/09/2014] [Indexed: 11/25/2022]
Abstract
Lily is an important cut-flower and bulb crop in the commercial market. Here, transcriptome profiling of Lilium 'Sorbonne' was conducted through de novo sequencing based on Illumina platform. This research aims at revealing basic information and data that can be used for applied purposes especially the molecular regulatory information on flower color formation in lily. In total, 36,920,680 short reads which corresponded to 3.32 GB of total nucleotides, were produced through transcriptome sequencing. These reads were assembled into 39,636 Unigenes, of which 30,986 were annotated in Nr, Nt, Swiss-Prot, KEGG, COG, GO databases. Based on the three public protein databases, a total of 32,601 coding sequences were obtained. Meanwhile, 19,242 Unigenes were assigned to 128 KEGG pathways. Those with the greatest representation by unique sequences were for ''metabolic pathways'' (5,406 counts, 28.09 %). Our transcriptome revealed 156 Unigenes that encode key enzymes in the flavonoid biosynthesis pathway including CHS, CHI, F3H, FLS, DFR, etc. MISA software identified 2,762 simple sequence repeats, from which 1,975 primers pairs were designed. Over 2,762 motifs were identified, of which the most frequent was AG/CT (659, 23.86 %), followed by A/T (615, 22.27 %) and CCG/CGG (416, 15.06 %). Based on the results, we believe that the color formation of the Lilium 'Sorbonne' flower was mainly controlled by the flavonoid biosynthesis pathway. Additionally, this research provides initial genetic resources that will be valuable to the lily community for other molecular biology research, and the SSRs will facilitate marker-assisted selection in lily breeding.
Collapse
Affiliation(s)
- Ming-fang Zhang
- Beijing Key Laboratory of Ornamental Plants Germplasm Innovation and Molecular Breeding, National Engineering Research Center for Floriculture and College of Landscape Architecture, Beijing Forestry University, Beijing, 100083, China,
| | | | | | | |
Collapse
|
25
|
Aitken KS, McNeil MD, Berkman PJ, Hermann S, Kilian A, Bundock PC, Li J. Comparative mapping in the Poaceae family reveals translocations in the complex polyploid genome of sugarcane. BMC PLANT BIOLOGY 2014; 14:190. [PMID: 25059596 PMCID: PMC4222257 DOI: 10.1186/s12870-014-0190-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Accepted: 07/14/2014] [Indexed: 05/18/2023]
Abstract
BACKGROUND The understanding of sugarcane genetics has lagged behind that of other members of the Poaceae family such as wheat, rice, barley and sorghum mainly due to the complexity, size and polyploidization of the genome. We have used the genetic map of a sugarcane cultivar to generate a consensus genetic map to increase genome coverage for comparison to the sorghum genome. We have utilized the recently developed sugarcane DArT array to increase the marker density within the genetic map. The sequence of these DArT markers plus SNP and EST-SSR markers was then used to form a bridge to the sorghum genomic sequence by BLAST alignment to start to unravel the complex genomic architecture of sugarcane. RESULTS Comparative mapping revealed that certain sugarcane chromosomes show greater levels of synteny to sorghum than others. On a macrosyntenic level a good collinearity was observed between sugarcane and sorghum for 4 of the 8 homology groups (HGs). These 4 HGs were syntenic to four sorghum chromosomes with from 98% to 100% of these chromosomes covered by these linked markers. Four major chromosome rearrangements were identified between the other four sugarcane HGs and sorghum, two of which were condensations of chromosomes reducing the basic chromosome number of sugarcane from x = 10 to x = 8. This macro level of synteny was transferred to other members within the Poaceae family such as maize to uncover the important evolutionary relationships that exist between sugarcane and these species. CONCLUSIONS Comparative mapping of sugarcane to the sorghum genome has revealed new information on the genome structure of sugarcane which will help guide identification of important genes for use in sugarcane breeding. Furthermore of the four major chromosome rearrangements identified in this study, three were common to maize providing some evidence that chromosome reduction from a common paleo-ancestor of both maize and sugarcane was driven by the same translocation events seen in both species.
Collapse
Affiliation(s)
- Karen S Aitken
- CSIRO Plant Industry, Queensland Bioscience Precinct, 306 Carmody Rd, St Lucia, Brisbane 4067, QLD, Australia
| | - Meredith D McNeil
- CSIRO Plant Industry, Queensland Bioscience Precinct, 306 Carmody Rd, St Lucia, Brisbane 4067, QLD, Australia
| | - Paul J Berkman
- CSIRO Plant Industry, Queensland Bioscience Precinct, 306 Carmody Rd, St Lucia, Brisbane 4067, QLD, Australia
| | - Scott Hermann
- BSES Limited, 50 Meiers Road, Indooroopilly, Brisbane 4068, QLD, Australia
| | - Andrzej Kilian
- Diversity Arrays P/L, 1 Wilf Crane Crescent, Yarralumla, Canberra 2600, ACT, Australia
| | - Peter C Bundock
- Southern Cross University, Ctr Plant Conservation Genetics, Lismore 2480, NSW, Australia
| | - Jingchuan Li
- CSIRO Plant Industry, Queensland Bioscience Precinct, 306 Carmody Rd, St Lucia, Brisbane 4067, QLD, Australia
| |
Collapse
|
26
|
Aitken KS, McNeil MD, Hermann S, Bundock PC, Kilian A, Heller-Uszynska K, Henry RJ, Li J. A comprehensive genetic map of sugarcane that provides enhanced map coverage and integrates high-throughput Diversity Array Technology (DArT) markers. BMC Genomics 2014; 15:152. [PMID: 24564784 PMCID: PMC4007999 DOI: 10.1186/1471-2164-15-152] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 02/06/2014] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Sugarcane genetic mapping has lagged behind other crops due to its complex autopolyploid genome structure. Modern sugarcane cultivars have from 110-120 chromosomes and are in general interspecific hybrids between two species with different basic chromosome numbers: Saccharum officinarum (2n = 80) with a basic chromosome number of 10 and S. spontaneum (2n = 40-128) with a basic chromosome number of 8. The first maps that were constructed utilised the single dose (SD) markers generated using RFLP, more recent maps generated using AFLP and SSRs provided at most 60% genome coverage. Diversity Array Technology (DArT) markers are high throughput allowing greater numbers of markers to be generated. RESULTS Progeny from a cross between a sugarcane variety Q165 and a S. officinarum accession IJ76-514 were used to generate 2467 SD markers. A genetic map of Q165 was generated containing 2267 markers, These markers formed 160 linkage groups (LGs) of which 147 could be placed using allelic information into the eight basic homology groups (HGs) of sugarcane. The HGs contained from 13 to 23 LGs and from 204 to 475 markers with a total map length of 9774.4 cM and an average density of one marker every 4.3 cM. Each homology group contained on average 280 markers of which 43% were DArT markers 31% AFLP, 16% SSRs and 6% SNP markers. The multi-allelic SSR and SNP markers were used to place the LGs into HGs. CONCLUSIONS The DArT array has allowed us to generate and map a larger number of markers than ever before and consequently to map a larger portion of the sugarcane genome. This larger number of markers has enabled 92% of the LGs to be placed into the 8 HGs that represent the basic chromosome number of the ancestral species, S. spontaneum. There were two HGs (HG2 and 8) that contained larger numbers of LGs verifying the alignment of two sets of S. officinarum chromosomes with one set of S. spontaneum chromosomes and explaining the difference in basic chromosome number between the two ancestral species. There was also evidence of more complex structural differences between the two ancestral species.
Collapse
Affiliation(s)
- Karen S Aitken
- CSIRO Plant Industry, Queensland Bioscience Precinct, 306 Carmody Rd, St Lucia, QLD 4067, Australia.
| | | | | | | | | | | | | | | |
Collapse
|
27
|
Kim JE, Oh SK, Lee JH, Lee BM, Jo SH. Genome-wide SNP calling using next generation sequencing data in tomato. Mol Cells 2014; 37:36-42. [PMID: 24552708 PMCID: PMC3907006 DOI: 10.14348/molcells.2014.2241] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Revised: 11/25/2013] [Accepted: 11/26/2013] [Indexed: 01/03/2023] Open
Abstract
The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, to do discovery of genome-wide SNPs, most methods require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools that improved its sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the workflow of SNP calling was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which are 60 times more than 71,637 of the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicated that the sufficient genome-wide SNP markers and very sensitive SNP calling method allow for application of marker assisted breeding and genome-wide association studies.
Collapse
Affiliation(s)
| | - Sang-Keun Oh
- Plant Genomics and Breeding Institutes, Seoul National University, Seoul 151-921,
Korea
| | | | - Bo-Mi Lee
- SEEDERS Inc., Daejeon 305-509,
Korea
| | | |
Collapse
|
28
|
Evaluation of assembly strategies using RNA-seq data associated with grain development of wheat (Triticum aestivum L.). PLoS One 2013; 8:e83530. [PMID: 24349528 PMCID: PMC3861526 DOI: 10.1371/journal.pone.0083530] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 11/05/2013] [Indexed: 11/19/2022] Open
Abstract
Wheat (Triticum aestivum L.) is one of the most important crops cultivated worldwide. Identifying the complete transcriptome of wheat grain could serve as foundation for further study of wheat seed development. However, the relatively large size and the polyploid complexity of the genome have been substantial barriers to molecular genetics and transcriptome analysis of wheat. Alternatively, RNA sequencing has provided some useful information about wheat genes. However, because of the large number of short reads generated by RNA sequencing, factors that are crucial to transcriptome assembly, including software, candidate parameters and assembly strategies, need to be optimized and evaluated for wheat data. In the present study, four cDNA libraries associated with wheat grain development were constructed and sequenced. A total of 14.17 Gb of high-quality reads were obtained and used to assess different assembly strategies. The most successful approach was to filter the reads with Q30 prior to de novo assembly using Trinity, merge the assembled contigs with genes available in wheat cDNA reference data sets, and combine the resulting assembly with an assembly from a reference-based strategy. Using this approach, a relatively accurate and nearly complete transcriptome associated with wheat grain development was obtained, suggesting that this is an effective strategy for generation of a high-quality transcriptome from RNA sequencing data.
Collapse
|
29
|
Garcia AAF, Mollinari M, Marconi TG, Serang OR, Silva RR, Vieira MLC, Vicentini R, Costa EA, Mancini MC, Garcia MOS, Pastina MM, Gazaffi R, Martins ERF, Dahmer N, Sforça DA, Silva CBC, Bundock P, Henry RJ, Souza GM, van Sluys MA, Landell MGA, Carneiro MS, Vincentz MAG, Pinto LR, Vencovsky R, Souza AP. SNP genotyping allows an in-depth characterisation of the genome of sugarcane and other complex autopolyploids. Sci Rep 2013; 3:3399. [PMID: 24292365 PMCID: PMC3844970 DOI: 10.1038/srep03399] [Citation(s) in RCA: 94] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Accepted: 11/15/2013] [Indexed: 12/12/2022] Open
Abstract
Many plant species of great economic value (e.g., potato, wheat, cotton, and sugarcane) are polyploids. Despite the essential roles of autopolyploid plants in human activities, our genetic understanding of these species is still poor. Recent progress in instrumentation and biochemical manipulation has led to the accumulation of an incredible amount of genomic data. In this study, we demonstrate for the first time a successful genetic analysis in a highly polyploid genome (sugarcane) by the quantitative analysis of single-nucleotide polymorphism (SNP) allelic dosage and the application of a new data analysis framework. This study provides a better understanding of autopolyploid genomic structure and is a sound basis for genetic studies. The proposed methods can be employed to analyse the genome of any autopolyploid and will permit the future development of high-quality genetic maps to assist in the assembly of reference genome sequences for polyploid species.
Collapse
Affiliation(s)
- Antonio A F Garcia
- 1] Departamento de Genética, Escola Superior de Agricultura "Luiz de Queiroz" Universidade de São Paulo, Brazil [2]
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Ojeda DI, Dhillon B, Tsui CKM, Hamelin RC. Single-nucleotide polymorphism discovery in Leptographium longiclavatum, a mountain pine beetle-associated symbiotic fungus, using whole-genome resequencing. Mol Ecol Resour 2013; 14:401-10. [PMID: 24152017 DOI: 10.1111/1755-0998.12191] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2013] [Revised: 10/05/2013] [Accepted: 10/07/2013] [Indexed: 11/29/2022]
Abstract
Single-nucleotide polymorphisms (SNPs) are rapidly becoming the standard markers in population genomics studies; however, their use in nonmodel organisms is limited due to the lack of cost-effective approaches to uncover genome-wide variation, and the large number of individuals needed in the screening process to reduce ascertainment bias. To discover SNPs for population genomics studies in the fungal symbionts of the mountain pine beetle (MPB), we developed a road map to discover SNPs and to produce a genotyping platform. We undertook a whole-genome sequencing approach of Leptographium longiclavatum in combination with available genomics resources of another MPB symbiont, Grosmannia clavigera. We sequenced 71 individuals pooled into four groups using the Illumina sequencing technology. We generated between 27 and 30 million reads of 75 bp that resulted in a total of 1, 181 contigs longer than 2 kb and an assembled genome size of 28.9 Mb (N50 = 48 kb, average depth = 125x). A total of 9052 proteins were annotated, and between 9531 and 17,266 SNPs were identified in the four pools. A subset of 206 genes (containing 574 SNPs, 11% false positives) was used to develop a genotyping platform for this species. Using this roadmap, we developed a genotyping assay with a total of 147 SNPs located in 121 genes using the Illumina(®) Sequenom iPLEX Gold. Our preliminary genotyping (success rate = 85%) of 304 individuals from 36 populations supports the utility of this approach for population genomics studies in other MPB fungal symbionts and other fungal nonmodel species.
Collapse
Affiliation(s)
- Dario I Ojeda
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, British Columbia, Canada, V6T 1Z4
| | | | | | | |
Collapse
|
31
|
Dufresne F, Stift M, Vergilino R, Mable BK. Recent progress and challenges in population genetics of polyploid organisms: an overview of current state-of-the-art molecular and statistical tools. Mol Ecol 2013; 23:40-69. [DOI: 10.1111/mec.12581] [Citation(s) in RCA: 248] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2013] [Revised: 10/29/2013] [Accepted: 10/30/2013] [Indexed: 12/19/2022]
Affiliation(s)
- France Dufresne
- Département de Biologie; Université du Québec à Rimouski; Québec QC Canada G5L 3A1
| | - Marc Stift
- Department of Biology; University of Konstanz; Konstanz D 78457 Germany
| | - Roland Vergilino
- Department of Integrative Biology; University of Guelph; Guelph ON Canada N1G 2W1
| | - Barbara K. Mable
- Institute of Biodiversity; Animal Health and Comparative Medicine; College of Medical, Veterinary and Life Sciences; University of Glasgow; Glasgow UK
| |
Collapse
|
32
|
Next generation characterisation of cereal genomes for marker discovery. BIOLOGY 2013; 2:1357-77. [PMID: 24833229 PMCID: PMC4009793 DOI: 10.3390/biology2041357] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2013] [Revised: 10/29/2013] [Accepted: 11/08/2013] [Indexed: 12/30/2022]
Abstract
Cereal crops form the bulk of the world’s food sources, and thus their importance cannot be understated. Crop breeding programs increasingly rely on high-resolution molecular genetic markers to accelerate the breeding process. The development of these markers is hampered by the complexity of some of the major cereal crop genomes, as well as the time and cost required. In this review, we address current and future methods available for the characterisation of cereal genomes, with an emphasis on faster and more cost effective approaches for genome sequencing and the development of markers for trait association and marker assisted selection (MAS) in crop breeding programs.
Collapse
|
33
|
Moyle RL, Birch RG. Diversity of sequences and expression patterns among alleles of a sugarcane loading stem gene. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2013; 126:1775-82. [PMID: 23546592 DOI: 10.1007/s00122-013-2091-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2012] [Accepted: 03/21/2013] [Indexed: 05/13/2023]
Abstract
Modern sugarcane cultivars are highly polyploid and aneuploid hybrids, which are propagated as clones. Their complex genome structure comprises 100-130 chromosomes and 10-13 hom(e)ologous copies of most loci. There is preliminary evidence of very high heterozygosity, with implications for genetic improvement approaches ranging from marker-assisted selection to transgenics. Here, we report that sugarcane cultivar Q200 has at least nine alleles at the Loading Stem Gene (ScLSG) locus. Exon-intron structure is identical and the predicted protein products show at least 92 % identity, across sugarcane alleles and the Sorghum homologue Sb07g027880. There is substantial variation in the 5' UTR and promoter regions including numerous allele-specific nucleotide polymorphisms, insertions and deletions. We developed an allele-specific qRT-PCR method to undertake the first compelling test of allele-specific expression in polyploid sugarcane. Seven alleles distinguished by this method all showed peak expression in the sucrose-loading zone of the stem, but there was apparent variability in expression patterns across other tissues. The ScLSG2 and ScLSG5 alleles appear promising for specificity of expression in stems, relative to leaf, meristem, emerging shoot and root tissues. Within the stem, there was activity in parenchyma, vascular and rind tissues. This expression pattern is of interest in basic research and biotechnology aimed at enhanced sucrose content, engineering value-added products, and manipulation of stem biomass composition.
Collapse
Affiliation(s)
- Richard L Moyle
- Hines Plant Science Building, University of Queensland, Brisbane 4072, Australia
| | | |
Collapse
|
34
|
de Siqueira Ferreira S, Nishiyama MY, Paterson AH, Souza GM. Biofuel and energy crops: high-yield Saccharinae take center stage in the post-genomics era. Genome Biol 2013; 14:210. [PMID: 23805917 PMCID: PMC3707038 DOI: 10.1186/gb-2013-14-6-210] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
The Saccharinae, especially sugarcane, Miscanthus and sorghum, present remarkable characteristics for bioenergy production. Biotechnology of these plants will be important for a sustainable feedstock supply. Herein, we review knowledge useful for their improvement and synergies gained by their parallel study.
Collapse
Affiliation(s)
- Savio de Siqueira Ferreira
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes 748, 05508-000 São Paulo, SP, Brazil
| | - Milton Yutaka Nishiyama
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes 748, 05508-000 São Paulo, SP, Brazil
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Glaucia Mendes Souza
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes 748, 05508-000 São Paulo, SP, Brazil
| |
Collapse
|
35
|
Bogart JP, Bi K. Genetic and genomic interactions of animals with different ploidy levels. Cytogenet Genome Res 2013; 140:117-36. [PMID: 23751376 DOI: 10.1159/000351593] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Polyploid animals have independently evolved from diploids in diverse taxa across the tree of life. We review a few polyploid animal species or biotypes where recently developed molecular and cytogenetic methods have significantly improved our understanding of their genetics, reproduction and evolution. Mitochondrial sequences that target the maternal ancestor of a polyploid show that polyploids may have single (e.g. unisexual salamanders in the genus Ambystoma) or multiple (e.g. parthenogenetic polyploid lizards in the genus Aspidoscelis) origins. Microsatellites are nuclear markers that can be used to analyze genetic recombinations, reproductive modes (e.g. Ambystoma) and recombination events (e.g. polyploid frogs such as Pelophylax esculentus). Hom(e)ologous chromosomes and rare intergenomic exchanges in allopolyploids have been distinguished by applying genome-specific fluorescent probes to chromosome spreads. Polyploids arise, and are maintained, through perturbations of the 'normal' meiotic program that would include pre-meiotic chromosome replication and genomic integrity of homologs. When possible, asexual, unisexual and bisexual polyploid species or biotypes interact with diploid relatives, and genes are passed from diploid to polyploid gene pools, which increase genetic diversity and ultimately evolutionary flexibility in the polyploid. When diploid relatives do not exist, polyploids can interact with another polyploid (e.g. species of African Clawed Frogs in the genus Xenopus). Some polyploid fish (e.g. salmonids) and frogs (Xenopus) represent independent lineages whose ancestors experienced whole genome duplication events. Some tetraploid frogs (P. esculentus) and fish (Squaliusalburnoides) may be in the process of becoming independent species, but diploid and triploid forms of these 'species' continue to genetically interact with the comparatively few tetraploid populations. Genetic and genomic interaction between polyploids and diploids is a complex and dynamic process that likely plays a crucial role for the evolution and persistence of polyploid animals. See also other articles in this themed issue.
Collapse
Affiliation(s)
- J P Bogart
- Department of Integrative Biology, University of Guelph, Guelph, Ont., Canada. jbogart @ uoguelph.ca
| | | |
Collapse
|
36
|
Zhang J, Arro J, Chen Y, Ming R. Haplotype analysis of sucrose synthase gene family in three Saccharum species. BMC Genomics 2013; 14:314. [PMID: 23663250 PMCID: PMC3668173 DOI: 10.1186/1471-2164-14-314] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Accepted: 04/29/2013] [Indexed: 11/10/2022] Open
Abstract
Background Sugarcane is an economically important crop contributing about 80% and 40% to the world sugar and ethanol production, respectively. The complicated genetics consequential to its complex polyploid genome, however, have impeded efforts to improve sugar yield and related important agronomic traits. Modern sugarcane cultivars are complex hybrids derived mainly from crosses among its progenitor species, S. officinarum and S. spontanuem, and to a lesser degree, S. robustom. Atypical of higher plants, sugarcane stores its photoassimilates as sucrose rather than as starch in its parenchymous stalk cells. In the sugar biosynthesis pathway, sucrose synthase (SuSy, UDP-glucose: D-fructose 2-a-D-glucosyltransferase, EC 2.4.1.13) is a key enzyme in the regulation of sucrose accumulation and partitioning by catalyzing the reversible conversion of sucrose and UDP into UDP-glucose and fructose. However, little is known about the sugarcane SuSy gene family members and hence no definitive studies have been reported regarding allelic diversity of SuSy gene families in Saccharum species. Results We identified and characterized a total of five sucrose synthase genes in the three sugarcane progenitor species through gene annotation and PCR haplotype analysis by analyzing 70 to 119 PCR fragments amplified from intron-containing target regions. We detected all but one (i.e. ScSuSy5) of ScSuSy transcripts in five tissue types of three Saccharum species. The average SNP frequency was one SNP per 108 bp, 81 bp, and 72 bp in S. officinarum, S. robustom, and S. spontanuem respectively. The average shared SNP is 15 between S. officinarum and S. robustom, 7 between S. officinarum and S. spontanuem , and 11 between S. robustom and S. spontanuem. We identified 27, 35, and 32 haplotypes from the five ScSuSy genes in S. officinarum, S. robustom, and S. spontanuem respectively. Also, 12, 11, and 9 protein sequences were translated from the haplotypes in S. officinarum, S. robustom, S. spontanuem, respectively. Phylogenetic analysis showed three separate clusters composed of SbSuSy1 and SbSuSy2, SbSuSy3 and SbSuSy5, and SbSuSy4. Conclusions The five members of the SuSy gene family evolved before the divergence of the genera in the tribe Andropogoneae at least 12 MYA. Each ScSuSy gene showed at least one non-synonymous substitution in SNP haplotypes. The SNP frequency is the lowest in S. officinarum, intermediate in S. robustum, and the highest in S. spontaneum, which may reflect the timing of the two rounds of whole genome duplication in these octoploids. The higher rate of shared SNP frequency between S. officinarum and S. robustum than between S. officinarum and in S. spontaneum confirmed that the speciation event separating S. officinarum and S. robustum occurred after their common ancestor diverged from S. spontaneum. The SNP and haplotype frequencies in three Saccharum species provide fundamental information for designing strategies to sequence these autopolyploid genomes.
Collapse
Affiliation(s)
- Jisen Zhang
- College of Life Sciences, Fujian Normal University, Fuzhou 350108, China
| | | | | | | |
Collapse
|
37
|
Next-generation sequencing of the Chrysanthemum nankingense (Asteraceae) transcriptome permits large-scale unigene assembly and SSR marker discovery. PLoS One 2013; 8:e62293. [PMID: 23626799 PMCID: PMC3633874 DOI: 10.1371/journal.pone.0062293] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2012] [Accepted: 03/19/2013] [Indexed: 11/19/2022] Open
Abstract
Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera.
Collapse
|
38
|
Mercati F, Riccardi P, Leebens-Mack J, Abenavoli MR, Falavigna A, Sunseri F. Single nucleotide polymorphism isolated from a novel EST dataset in garden asparagus (Asparagus officinalis L.). PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2013; 203-204:115-23. [PMID: 23415335 DOI: 10.1016/j.plantsci.2013.01.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Revised: 01/07/2013] [Accepted: 01/09/2013] [Indexed: 05/03/2023]
Abstract
Single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSR) are abundant and evenly distributed co-dominant molecular markers in plant genomes. SSRs are valuable for marker assisted breeding and positional cloning of genes associated traits of interest. Although several high throughput platforms have been developed to identify SNP and SSR markers for analysis of segregant plant populations, breeding in garden asparagus (Asparagus officinalis L.) has been limited by a low content of such markers. In this study massively parallel GS-FLX pyro-sequencing technology (454 Life Sciences) has been used to sequence and compare transcriptome from two genotypes: a rust tolerant male (1770) and a susceptible female (G190). A total of 122,963 and 99,368 sequence reads, with an average length of 245.7bp, have been recovered from accessions 1770 and 190 respectively. A computational pipeline has been used to predict and visually inspect putative SNPs and SSR sequences. Analysis of Gene Ontology (GO) slim annotation assignments for all assembled uniscripts indicated that the 24,403 assemblies represent genes from a broad array of functions. Further, over 1800 putative SNPs and 1000 SSRs were detected. One hundred forty-four SNPs together with 60 selected SSRs were validated and used to develop a preliminary genetic map by using a large BC(1) population, derived from 1770 and G190. The abundance of SNPs and SSRs provides a foundation for the development of saturated genetic maps and their utilization in assisted asparagus breeding programs.
Collapse
Affiliation(s)
- Francesco Mercati
- Dipartimento di AGRARIA, Università Mediterranea di Reggio Calabria, Salita Melissari, 89124 Reggio Calabria (RC), Italy
| | | | | | | | | | | |
Collapse
|
39
|
Abstract
Advances in DNA sequencing provide tools for efficient large-scale discovery of markers for use in plants. Discovery options include large-scale amplicon sequencing, transcriptome sequencing, gene-enriched genome sequencing and whole genome sequencing. Examples of each of these approaches and their potential to generate molecular markers for specific applications have been described. Sequencing the whole genome of parents identifies all the polymorphisms available for analysis in their progeny. Sequencing PCR amplicons of sets of candidate genes from DNA bulks can be used to define the available variation in these genes that might be exploited in a population or germplasm collection. Sequencing of the transcriptomes of genotypes varying for the trait of interest may identify genes with patterns of expression that could explain the phenotypic variation. Sequencing genomic DNA enriched for genes by hybridization with probes for all or some of the known genes simplifies sequencing and analysis of differences in gene sequences between large numbers of genotypes and genes especially when working with complex genomes. Examples of application of the above-mentioned techniques have been described.
Collapse
|
40
|
Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae). Heredity (Edinb) 2012; 110:181-93. [PMID: 23149455 DOI: 10.1038/hdy.2012.76] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations.
Collapse
|
41
|
Pérez-de-Castro AM, Vilanova S, Cañizares J, Pascual L, Blanca JM, Díez MJ, Prohens J, Picó B. Application of genomic tools in plant breeding. Curr Genomics 2012; 13:179-95. [PMID: 23115520 PMCID: PMC3382273 DOI: 10.2174/138920212800543084] [Citation(s) in RCA: 90] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2011] [Revised: 09/16/2011] [Accepted: 10/11/2011] [Indexed: 02/08/2023] Open
Abstract
Plant breeding has been very successful in developing improved varieties using conventional tools and methodologies. Nowadays, the availability of genomic tools and resources is leading to a new revolution of plant breeding, as they facilitate the study of the genotype and its relationship with the phenotype, in particular for complex traits. Next Generation Sequencing (NGS) technologies are allowing the mass sequencing of genomes and transcriptomes, which is producing a vast array of genomic information. The analysis of NGS data by means of bioinformatics developments allows discovering new genes and regulatory sequences and their positions, and makes available large collections of molecular markers. Genome-wide expression studies provide breeders with an understanding of the molecular basis of complex traits. Genomic approaches include TILLING and EcoTILLING, which make possible to screen mutant and germplasm collections for allelic variants in target genes. Re-sequencing of genomes is very useful for the genome-wide discovery of markers amenable for high-throughput genotyping platforms, like SSRs and SNPs, or the construction of high density genetic maps. All these tools and resources facilitate studying the genetic diversity, which is important for germplasm management, enhancement and use. Also, they allow the identification of markers linked to genes and QTLs, using a diversity of techniques like bulked segregant analysis (BSA), fine genetic mapping, or association mapping. These new markers are used for marker assisted selection, including marker assisted backcross selection, ‘breeding by design’, or new strategies, like genomic selection. In conclusion, advances in genomics are providing breeders with new tools and methodologies that allow a great leap forward in plant breeding, including the ‘superdomestication’ of crops and the genetic dissection and breeding for complex traits.
Collapse
Affiliation(s)
- A M Pérez-de-Castro
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | | | | | | | | | | | | | | |
Collapse
|
42
|
Scaglione D, Lanteri S, Acquadro A, Lai Z, Knapp SJ, Rieseberg L, Portis E. Large-scale transcriptome characterization and mass discovery of SNPs in globe artichoke and its related taxa. PLANT BIOTECHNOLOGY JOURNAL 2012; 10:956-69. [PMID: 22849342 DOI: 10.1111/j.1467-7652.2012.00725.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Cynara cardunculus (2n = 2× = 34) is a member of the Asteraceae family that contributes significantly to the agricultural economy of the Mediterranean basin. The species includes two cultivated varieties, globe artichoke and cardoon, which are grown mainly for food. Cynara cardunculus is an orphan crop species whose genome/transcriptome has been relatively unexplored, especially in comparison to other Asteraceae crops. Hence, there is a significant need to improve its genomic resources through the identification of novel genes and sequence-based markers, to design new breeding schemes aimed at increasing quality and crop productivity. We report the outcome of cDNA sequencing and assembly for eleven accessions of C. cardunculus. Sequencing of three mapping parental genotypes using Roche 454-Titanium technology generated 1.7 × 10⁶ reads, which were assembled into 38,726 reference transcripts covering 32 Mbp. Putative enzyme-encoding genes were annotated using the KEGG-database. Transcription factors and candidate resistance genes were surveyed as well. Paired-end sequencing was done for cDNA libraries of eight other representative C. cardunculus accessions on an Illumina Genome Analyzer IIx, generating 46 × 10⁶ reads. Alignment of the IGA and 454 reads to reference transcripts led to the identification of 195,400 SNPs with a Bayesian probability exceeding 95%; a validation rate of 90% was obtained by Sanger-sequencing of a subset of contigs. These results demonstrate that the integration of data from different NGS platforms enables large-scale transcriptome characterization, along with massive SNP discovery. This information will contribute to the dissection of key agricultural traits in C. cardunculus and facilitate the implementation of marker-assisted selection programs.
Collapse
|
43
|
High-throughput discovery of mutations in tef semi-dwarfing genes by next-generation sequencing analysis. Genetics 2012; 192:819-29. [PMID: 22904035 DOI: 10.1534/genetics.112.144436] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Tef (Eragrostis tef) is a major cereal crop in Ethiopia. Lodging is the primary constraint to increasing productivity in this allotetraploid species, accounting for losses of ∼15-45% in yield each year. As a first step toward identifying semi-dwarf varieties that might have improved lodging resistance, an ∼6× fosmid library was constructed and used to identify both homeologues of the dw3 semi-dwarfing gene of Sorghum bicolor. An EMS mutagenized population, consisting of ∼21,210 tef plants, was planted and leaf materials were collected into 23 superpools. Two dwarfing candidate genes, homeologues of dw3 of sorghum and rht1 of wheat, were sequenced directly from each superpool with 454 technology, and 120 candidate mutations were identified. Out of 10 candidates tested, six independent mutations were validated by Sanger sequencing, including two predicted detrimental mutations in both dw3 homeologues with a potential to improve lodging resistance in tef through further breeding. This study demonstrates that high-throughput sequencing can identify potentially valuable mutations in under-studied plant species like tef and has provided mutant lines that can now be combined and tested in breeding programs for improved lodging resistance.
Collapse
|
44
|
Aversano R, Ercolano MR, Caruso I, Fasano C, Rosellini D, Carputo D. Molecular tools for exploring polyploid genomes in plants. Int J Mol Sci 2012; 13:10316-10335. [PMID: 22949863 PMCID: PMC3431861 DOI: 10.3390/ijms130810316] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 08/03/2012] [Accepted: 08/06/2012] [Indexed: 11/16/2022] Open
Abstract
Polyploidy is a very common phenomenon in the plant kingdom, where even diploid species are often described as paleopolyploids. The polyploid condition may bring about several advantages compared to the diploid state. Polyploids often show phenotypes that are not present in their diploid progenitors or exceed the range of the contributing species. Some of these traits may play a role in heterosis or could favor adaptation to new ecological niches. Advances in genomics and sequencing technology may create unprecedented opportunities for discovering and monitoring the molecular effects of polyploidization. Through this review, we provide an overview of technologies and strategies that may allow an in-depth analysis of polyploid genomes. After introducing some basic aspects on the origin and genetics of polyploids, we highlight the main tools available for genome and gene expression analysis and summarize major findings. In the last part of this review, the implications of next generation sequencing are briefly discussed. The accumulation of knowledge on polyploid formation, maintenance, and divergence at whole-genome and subgenome levels will not only help plant biologists to understand how plants have evolved and diversified, but also assist plant breeders in designing new strategies for crop improvement.
Collapse
Affiliation(s)
- Riccardo Aversano
- Department of Soil, Plant, Environmental and Animal Production Sciences, University of Naples Federico II, Via Università 100, Portici 80055, Italy; E-Mails: (R.A.); (M.R.E.); (I.C.); (C.F.)
| | - Maria Raffaella Ercolano
- Department of Soil, Plant, Environmental and Animal Production Sciences, University of Naples Federico II, Via Università 100, Portici 80055, Italy; E-Mails: (R.A.); (M.R.E.); (I.C.); (C.F.)
| | - Immacolata Caruso
- Department of Soil, Plant, Environmental and Animal Production Sciences, University of Naples Federico II, Via Università 100, Portici 80055, Italy; E-Mails: (R.A.); (M.R.E.); (I.C.); (C.F.)
| | - Carlo Fasano
- Department of Soil, Plant, Environmental and Animal Production Sciences, University of Naples Federico II, Via Università 100, Portici 80055, Italy; E-Mails: (R.A.); (M.R.E.); (I.C.); (C.F.)
| | - Daniele Rosellini
- Department of Applied Biology, University of Perugia, Borgo XX Giugno 74, Perugia 06121, Italy; E-Mail:
| | - Domenico Carputo
- Department of Soil, Plant, Environmental and Animal Production Sciences, University of Naples Federico II, Via Università 100, Portici 80055, Italy; E-Mails: (R.A.); (M.R.E.); (I.C.); (C.F.)
| |
Collapse
|
45
|
Duan J, Xia C, Zhao G, Jia J, Kong X. Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data. BMC Genomics 2012; 13:392. [PMID: 22891638 PMCID: PMC3485621 DOI: 10.1186/1471-2164-13-392] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Accepted: 08/03/2012] [Indexed: 11/23/2022] Open
Abstract
Background Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. Results In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community. Conclusions It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.
Collapse
Affiliation(s)
- Jialei Duan
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Zhongguancun, Beijing, People's Republic of China
| | | | | | | | | |
Collapse
|
46
|
Duan J, Xia C, Zhao G, Jia J, Kong X. Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data. BMC Genomics 2012. [PMID: 22891638 DOI: 10.1186/1471‐2164‐13‐392] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. RESULTS In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community. CONCLUSIONS It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.
Collapse
Affiliation(s)
- Jialei Duan
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Zhongguancun, Beijing, People's Republic of China
| | | | | | | | | |
Collapse
|
47
|
Jhanwar S, Priya P, Garg R, Parida SK, Tyagi AK, Jain M. Transcriptome sequencing of wild chickpea as a rich resource for marker development. PLANT BIOTECHNOLOGY JOURNAL 2012; 10:690-702. [PMID: 22672127 DOI: 10.1111/j.1467-7652.2012.00712.x] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
The transcriptome of cultivated chickpea (Cicer arietinum L.), an important crop legume, has recently been sequenced. Here, we report sequencing of the transcriptome of wild chickpea, C. reticulatum (PI489777), the progenitor of cultivated chickpea, by GS-FLX 454 technology. The optimized assembly of C. reticulatum transcriptome generated 37 265 transcripts in total with an average length of 946 bp. A total of 4072 simple sequence repeats (SSRs) could be identified in these transcript sequences, of which at least 561 SSRs were polymorphic between C. arietinum and C. reticulatum. In addition, a total of 36 446 single-nucleotide polymorphisms (SNPs) were identified after optimization of probability score, quality score, read depth and consensus base ratio. Several of these SSRs and SNPs could be associated with tissue-specific and transcription factor encoding transcripts. A high proportion (92-94%) of polymorphic SSRs and SNPs identified between the two chickpea species were validated successfully. Further, the estimation of synonymous substitution rates of orthologous transcript pairs suggested that the speciation event for divergence of C. arietinum and C. reticulatum may have happened approximately 0.53 million years ago. The results of our study provide a rich resource for exploiting genetic variations in chickpea for breeding programmes.
Collapse
Affiliation(s)
- Shalu Jhanwar
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, India
| | | | | | | | | | | |
Collapse
|
48
|
Lai K, Duran C, Berkman PJ, Lorenc MT, Stiller J, Manoli S, Hayden MJ, Forrest KL, Fleury D, Baumann U, Zander M, Mason AS, Batley J, Edwards D. Single nucleotide polymorphism discovery from wheat next-generation sequence data. PLANT BIOTECHNOLOGY JOURNAL 2012; 10:743-9. [PMID: 22748104 DOI: 10.1111/j.1467-7652.2012.00718.x] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Single nucleotide polymorphisms (SNPs) are the most abundant type of molecular genetic marker and can be used for producing high-resolution genetic maps, marker-trait association studies and marker-assisted breeding. Large polyploid genomes such as wheat present a challenge for SNP discovery because of the potential presence of multiple homoeologs for each gene. AutoSNPdb has been successfully applied to identify SNPs from Sanger sequence data for several species, including barley, rice and Brassica, but the volume of data required to accurately call SNPs in the complex genome of wheat has prevented its application to this important crop. DNA sequencing technology has been revolutionized by the introduction of next-generation sequencing, and it is now possible to generate several million sequence reads in a timely and cost-effective manner. We have produced wheat transcriptome sequence data using 454 sequencing technology and applied this for SNP discovery using a modified autoSNPdb method, which integrates SNP and gene annotation information with a graphical viewer. A total of 4,694,141 sequence reads from three bread wheat varieties were assembled to identify a total of 38 928 candidate SNPs. Each SNP is within an assembly complete with annotation, enabling the selection of polymorphism within genes of interest.
Collapse
Affiliation(s)
- Kaitao Lai
- School of Agriculture and Food Science, University of Queensland, Brisbane, QLD, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Hendre PS, Kamalakannan R, Varghese M. High-throughput and parallel SNP discovery in selected candidate genes in Eucalyptus camaldulensis using Illumina NGS platform. PLANT BIOTECHNOLOGY JOURNAL 2012; 10:646-56. [PMID: 22607345 DOI: 10.1111/j.1467-7652.2012.00699.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Next generation sequencing (NGS) technologies have revolutionized the pace and scale of genomics- and transcriptomics-based SNP discovery across different plant and animal species. Herein, 72-base paired-end Illumina sequencing was employed for high-throughput, parallel and large-scale SNP discovery in 41 growth-related candidate genes in Eucalyptus camaldulensis. Approximately 100 kb of genome from 96 individuals was amplified and sequenced using a hierarchical DNA/PCR pooling strategy and assembled over corresponding E. grandis reference. A total of 1191 SNPs (minimum 5% other allele frequency) were identified with an average frequency of 1 SNP/83.9 bp, whereas in exons and introns, it was 1 SNP/108.4 bp and 1 SNP/65.6 bp, respectively. A total of 75 insertions and 89 deletions were detected of which approximately 15% were exonic. Transitions (Tr) were in excess than transversions (Tv) (Tr/Tv: 1.89), but exceeded in exons (Tr/Tv: 2.73). In exons, synonymous SNPs (Ka) prevailed over the non-synonymous SNPs (Ks; average Ka/Ks ratio: 0.72, range: 0-3.00 across genes). Many of the exonic SNPs/indels had potential to change amino acid sequence of respective genes. Transcription factors appeared more conserved, whereas enzyme coding genes appeared under relaxed control. Further, 541 SNPs were classified into 196 'equal frequency' (EF) blocks with almost similar minor allele frequencies to facilitate selection of one tag-SNP/EF-block. There were 241 (approximately 20%) 'zero-SNP' blocks with absence of SNPs in surrounding ±60 bp windows. The data thus indicated enormous extant and unexplored diversity in E. camaldulensis in the studied genes with potential applications for marker-trait associations.
Collapse
Affiliation(s)
- Prasad S Hendre
- ITC R&D Centre, Peenya Industrial Area, Bangalore, Karnataka, India.
| | | | | |
Collapse
|
50
|
Gholami M, Bekele WA, Schondelmaier J, Snowdon RJ. A tailed PCR procedure for cost-effective, two-order multiplex sequencing of candidate genes in polyploid plants. PLANT BIOTECHNOLOGY JOURNAL 2012; 10:635-45. [PMID: 22489678 DOI: 10.1111/j.1467-7652.2012.00696.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Complex polyploid crop genomes can be recalcitrant towards conventional DNA sequencing approaches for allele mining in candidate genes for valuable traits. In the past, this has greatly complicated the transfer of knowledge on promising candidate genes from model plants to even closely related polyploid crops. Next-generation sequencing offers diverse solutions to overcome such difficulties. Here, we present a method for multiplexed 454 sequencing in gene-specific PCR amplicons that can simultaneously address multiple homologues of given target genes. We devised a simple two-step PCR procedure employing a set of barcoded M13/T7 universal fusion primers that enable a cost-effective and efficient amplification of large numbers of target gene amplicons. Sequencing-ready amplicons are generated that can be simultaneously sequenced in pools comprising multiple amplicons from multiple genotypes. High-depth sequencing allows resolution of the resulting sequence reads into contigs representing multiple homologous loci, with only insignificant off-target capture of paralogues or PCR artefacts. In a case study, the procedure was tested in the complex polyploid genome of Brassica napus for a set of nine genes identified in Arabidopsis as candidates for regulation of seed development and oil content. Up to six copies of these genes were expected in B. napus. SNP discovery was performed by pooled multiplex sequencing of 30 amplicons in 20 diverse B. napus accessions with interesting trait variation for oil content, providing a basis for comparative mapping to relevant quantitative trait loci and for subsequent marker-assisted breeding.
Collapse
Affiliation(s)
- Mahmood Gholami
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany Saaten-Union Biotec GmbH, Leopoldshoehe, Germany
| | | | | | | |
Collapse
|