1
|
Oueslati A, Ollitrault F, Baraket G, Salhi-Hannachi A, Navarro L, Ollitrault P. Towards a molecular taxonomic key of the Aurantioideae subfamily using chloroplastic SNP diagnostic markers of the main clades genotyped by competitive allele-specific PCR. BMC Genet 2016; 17:118. [PMID: 27539067 PMCID: PMC4991024 DOI: 10.1186/s12863-016-0426-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Accepted: 08/11/2016] [Indexed: 11/28/2022] Open
Abstract
Background Chloroplast DNA is a primary source of molecular variations for phylogenetic analysis of photosynthetic eukaryotes. However, the sequencing and analysis of multiple chloroplastic regions is difficult to apply to large collections or large samples of natural populations. The objective of our work was to demonstrate that a molecular taxonomic key based on easy, scalable and low-cost genotyping method should be developed from a set of Single Nucleotide Polymorphisms (SNPs) diagnostic of well-established clades. It was applied to the Aurantioideae subfamily, the largest group of the Rutaceae family that includes the cultivated citrus species. Results The publicly available nucleotide sequences of eight plastid genomic regions were compared for 79 accessions of the Aurantioideae subfamily to search for SNPs revealing taxonomic differentiation at the inter-tribe, inter-subtribe, inter-genus and interspecific levels. Diagnostic SNPs (DSNPs) were found for 46 of the 54 clade levels analysed. Forty DSNPs were selected to develop KASPar markers and their taxonomic value was tested by genotyping 108 accessions of the Aurantioideae subfamily. Twenty-seven markers diagnostic of 24 clades were validated and they displayed a very high rate of transferability in the Aurantioideae subfamily (only 1.2 % of missing data on average). The UPGMA from the validated markers produced a cladistic organisation that was highly coherent with the previous phylogenetic analysis based on the sequence data of the eight plasmid regions. In particular, the monophyletic origin of the “true citrus” genera plus Oxanthera was validated. However, some clarification remains necessary regarding the organisation of the other wild species of the Citreae tribe. Conclusions We validated the concept that with well-established clades, DSNPs can be selected and efficiently transformed into competitive allele-specific PCR markers (KASPar method) allowing cost-effective highly efficient cladistic analysis in large collections at subfamily level. The robustness of this genotyping method is an additional decisive advantage for network collaborative research. The availability of WGS data for the main “true citrus” species should soon make it possible to develop a set of DSNP markers allowing very fine resolution of this very important horticultural group. Electronic supplementary material The online version of this article (doi:10.1186/s12863-016-0426-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Amel Oueslati
- Laboratoire de Génétique Moléculaire, Immunologie et Biotechnologie LR99ES12, Faculté des Sciences de Tunis (FST), Université de Tunis El Manar, Campus Universitaire, El Manar-Tunis, 2092, Tunisia.,UMR Agap, CIRAD, Petit-Bourg, F-97170, Guadeloupe, France
| | - Frederique Ollitrault
- Centro de Protección Vegetal y Biotecnología, Instituto Valenciano de Investigaciones Agrarias (IVIA), Moncada, 46113, Valencia, Spain
| | - Ghada Baraket
- Laboratoire de Génétique Moléculaire, Immunologie et Biotechnologie LR99ES12, Faculté des Sciences de Tunis (FST), Université de Tunis El Manar, Campus Universitaire, El Manar-Tunis, 2092, Tunisia
| | - Amel Salhi-Hannachi
- Laboratoire de Génétique Moléculaire, Immunologie et Biotechnologie LR99ES12, Faculté des Sciences de Tunis (FST), Université de Tunis El Manar, Campus Universitaire, El Manar-Tunis, 2092, Tunisia
| | - Luis Navarro
- Centro de Protección Vegetal y Biotecnología, Instituto Valenciano de Investigaciones Agrarias (IVIA), Moncada, 46113, Valencia, Spain
| | | |
Collapse
|
2
|
Park HJ, Kim SK, Kang WS, Park JK, Kim YJ, Nam M, Kim JW, Chung JH. Association between IRS1 Gene Polymorphism and Autism Spectrum Disorder: A Pilot Case-Control Study in Korean Males. Int J Mol Sci 2016; 17:ijms17081227. [PMID: 27483248 PMCID: PMC5000625 DOI: 10.3390/ijms17081227] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Revised: 07/13/2016] [Accepted: 07/25/2016] [Indexed: 11/16/2022] Open
Abstract
The insulin-like growth factor (IGF) pathway is thought to play an important role in brain development. Altered levels of IGFs and their signaling regulators have been shown in autism spectrum disorder (ASD) patients. In this study, we investigated whether coding region single-nucleotide polymorphisms (cSNPs) of the insulin receptor substrates (IRS1 and IRS2), key mediators of the IGF pathway, were associated with ASD in Korean males. Two cSNPs (rs1801123 of IRS1, and rs4773092 of IRS2) were genotyped using direct sequencing in 180 male ASD patients and 147 male control subjects. A significant association between rs1801123 of IRS1 and ASD was shown in additive (p = 0.022, odds ratio (OR) = 0.66, 95% confidence interval (CI) = 0.46–0.95) and dominant models (p = 0.013, OR = 0.57, 95% CI = 0.37–0.89). Allele frequency analysis also showed an association between rs1801123 and ASD (p = 0.022, OR = 0.66, 95% CI = 0.46–0.94). These results suggest that IRS1 may contribute to the susceptibility of ASD in Korean males.
Collapse
Affiliation(s)
- Hae Jeong Park
- Kohwang Medical Research Institute, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| | - Su Kang Kim
- Kohwang Medical Research Institute, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| | - Won Sub Kang
- Department of Neuropsychiatry, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| | - Jin Kyung Park
- Department of Neuropsychiatry, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| | - Young Jong Kim
- Department of Neuropsychiatry, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| | - Min Nam
- Seoul Metropolitan Eunpyeong Hospital, Seoul 06801, Korea.
| | - Jong Woo Kim
- Department of Neuropsychiatry, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| | - Joo-Ho Chung
- Kohwang Medical Research Institute, School of Medicine, Kyung Hee University, Seoul 02447, Korea.
| |
Collapse
|
3
|
Gaur M, Das A, Subudhi E. High quality SNPs/Indels mining and characterization in ginger from ESTs data base. Bioinformation 2015; 11:85-9. [PMID: 25848168 PMCID: PMC4369683 DOI: 10.6026/97320630011085] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 01/29/2015] [Indexed: 11/23/2022] Open
Abstract
Ginger (Zingiber officinale Rosc.) is an important herb of the family Zingiberaceae. It is accepted as a universal cure for a multitude of diseases in Indian systems of medicine and its rhizomes are equally popular as a spice ingredient throughout Asia. SNPs, the definitive genetic markers, representing the finest resolution of a DNA sequence, are abundantly found in populations having a lower rate of mutation and are used for genomic analysis. The public ESTs sequences mostly lack quality files, making high quality SNPs detection more difficult since it is exclusively based on sequence comparisons. In the present study, current dbESTs of NCBI was mined and 38115 ginger ESTs sequences were obtained and assembled into contigs using CAP3 program. In this analysis, recent software tool QualitySNP was used to detect 11523 potential SNPs sites, 8810 high quality SNPs and 1008 indels polymorphisms with a frequency of 1.61 SNPs / 10 kbp. Of ESTs libraries generated from three ginger tissues together, rhizomes had a frequency of 0.32 SNPs and 0.03 indels per 10 kbp whereas the leaves had a frequency of 2.51 SNPs and 0.23 indels per 10 kbp and root is showing relative frequency of 0.76/10 kbp SNPs and 0.02/10 kbp indels. The present analysis provides additional information about the tissue wise presence of haplotypes (222), distribution of high quality exonic (2355) and intronic (6455) SNPs and information about singletons (7538) in addition to contigs transitions and transversions ratio (0.57). Among all tissue detected SNPs, transversions number is higher in comparison to the number of transitions. Quality SNPs detected in this work can be used as markers for further ginger genetic experiments.
Collapse
Affiliation(s)
- Mahendra Gaur
- Centre of Biotechnology, Siksha ‘O’ Anushandhan University, Kalinga Nagar, Ghatikia, Bhubaneswar-751003, Odisha
| | - Aradhana Das
- Centre of Biotechnology, Siksha ‘O’ Anushandhan University, Kalinga Nagar, Ghatikia, Bhubaneswar-751003, Odisha
| | - Enketeswara Subudhi
- Centre of Biotechnology, Siksha ‘O’ Anushandhan University, Kalinga Nagar, Ghatikia, Bhubaneswar-751003, Odisha
| |
Collapse
|
4
|
Genomic reduction assisted single nucleotide polymorphism discovery using 454-pyrosequencing. Methods Mol Biol 2014; 1245:169-82. [PMID: 25373757 DOI: 10.1007/978-1-4939-1966-6_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
We report the development of a simple genomic reduction protocol based on 454-pyrosequencing technology that discovers large numbers of single nucleotide polymorphisms (SNP) from pooled DNA samples. The method is based on the conservation of restriction endonuclease sites across samples and biotin separation for genomic reduction and the addition of multiplex identifier (MID) barcodes to each of the pooled samples to allow for postsequencing deconvolution of the pooled DNA fragments and SNP discovery.
Collapse
|
5
|
Molecular markers for genetics and plant breeding: the MFLP marker system and its application in narrow-leafed lupin (Lupinus angustifolius). Methods Mol Biol 2014; 1069:179-201. [PMID: 23996316 DOI: 10.1007/978-1-62703-613-9_13] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Since the development of molecular markers to tag genes of agronomic traits of interests, molecular markers have played an increasingly significant role in breeding programs. Molecular markers have been implemented for large-scale marker-assisted selection in the breeding program of many important crops including lupin. So far, more than a dozen molecular markers for disease resistance genes and for other agronomic traits of interest have been developed in lupin. The DNA fingerprinting method, "MFLP" has played a pivotal role in the success of lupin breeding program in Australia. Here, we describe the MFLP technique used in lupin breeding which could be easily transferable to other crop species.
Collapse
|
6
|
Chen C, Gmitter FG. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus. BMC Genomics 2013; 14:746. [PMID: 24175923 PMCID: PMC3827885 DOI: 10.1186/1471-2164-14-746] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Accepted: 10/22/2013] [Indexed: 11/18/2022] Open
Abstract
Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a
genome, have been widely used in various studies. Detection and
characterization of citrus haplotype-based expressed sequence tag (EST) SNPs
will greatly facilitate further utilization of these gene-based
resources. Results In this paper, haplotype-based SNPs were mined out of publicly available
citrus expressed sequence tags (ESTs) from different citrus cultivars
(genotypes) individually and collectively for comparison. There were a total
of 567,297 ESTs belonging to 27 cultivars in varying numbers and
consequentially yielding different numbers of haplotype-based quality SNPs.
Sweet orange (SO) had the most (213,830) ESTs, generating 11,182 quality
SNPs in 3,327 out of 4,228 usable contigs. Summed from all the individually
mining results, a total of 25,417 quality SNPs were discovered –
15,010 (59.1%) were transitions (AG and CT), 9,114 (35.9%) were
transversions (AC, GT, CG, and AT), and 1,293 (5.0%) were
insertion/deletions (indels). A vast majority of SNP-containing contigs
consisted of only 2 haplotypes, as expected, but the percentages of 2
haplotype contigs varied widely in these citrus cultivars. BLAST of the
25,417 25-mer SNP oligos to the Clementine reference genome scaffolds
revealed 2,947 SNPs had “no hits found”, 19,943 had 1 unique hit
/ alignment, 1,571 had one hit and 2+ alignments per hit, and 956 had 2+
hits and 1+ alignment per hit. Of the total 24,293 scaffold hits, 23,955
(98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor
scaffolds. Most alignments had 100% (25/25) or 96% (24/25) nucleotide
identities, accounting for 93% of all the alignments. Considering almost all
the nucleotide discrepancies in the 24/25 alignments were at the SNP sites,
it served well as in silico validation of these SNPs, in addition
to and consistent with the rate (81%) validated by sequencing and SNaPshot
assay. Conclusions High-quality EST-SNPs from different citrus genotypes were detected, and
compared to estimate the heterozygosity of each genome. All the SNP oligo
sequences were aligned with the Clementine citrus genome to determine their
distribution and uniqueness and for in silico validation, in
addition to SNaPshot and sequencing validation of selected SNPs.
Collapse
Affiliation(s)
- Chunxian Chen
- University of Florida, IFAS, Citrus Research and Education Center, 700 Experiment Station Road, Lake Alfred, FL 33850, USA.
| | | |
Collapse
|
7
|
Rai KM, Singh SK, Bhardwaj A, Kumar V, Lakhwani D, Srivastava A, Jena SN, Yadav HK, Bag SK, Sawant SV. Large-scale resource development in Gossypium hirsutum L. by 454 sequencing of genic-enriched libraries from six diverse genotypes. PLANT BIOTECHNOLOGY JOURNAL 2013; 11:953-963. [PMID: 23782852 DOI: 10.1111/pbi.12088] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Revised: 04/19/2013] [Accepted: 05/04/2013] [Indexed: 06/02/2023]
Abstract
The sequence information has been proved to be an essential genomic resource in case of crop plants for their genetic improvement and better utilization by humans. To dissect the Gossypium hirsutum genome for large-scale development of genomic resources, we adopted hypomethylated restriction-based genomic enrichment strategy to sequence six diverse genotypes. Approximately 5.2-Gb data (more than 18.36 million reads) was generated which, after assembly, represents nearly 1.27-Gb genomic sequences. We predicted a total of 93,363 gene models (21,399 full length) and identified 35,923 gene models which were validated against already sequenced plant genomes. A total of 1,093 transcription factor-encoding genes, 3,135 promoter sequences and 78 miRNA (including 17 newly identified in Gossypium) were predicted. We identified significant no. of molecular markers including 47,093 novel simple sequence repeats and 66,364 novel single nucleotide polymorphisms. In addition, we developed NBRI-Comprehensive Cotton Genomics database, a web resource to provide access of cotton-related genomic resources developed at NBRI. This study contributes considerable amount of genomic resources and suggests a potential role of genic-enriched sequencing in genomic resource development for orphan crop plants.
Collapse
Affiliation(s)
- Krishan Mohan Rai
- Plant Molecular Biology Laboratory, CSIR-National Botanical Research Institute, Lucknow, India
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Boussaha M, Guyomard R, Cabau C, Esquerré D, Quillet E. Development and characterisation of an expressed sequence tags (EST)-derived single nucleotide polymorphisms (SNPs) resource in rainbow trout. BMC Genomics 2012; 13:238. [PMID: 22694767 PMCID: PMC3536561 DOI: 10.1186/1471-2164-13-238] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2012] [Accepted: 06/13/2012] [Indexed: 12/29/2022] Open
Abstract
Background There is considerable interest in developing high-throughput genotyping with single nucleotide polymorphisms (SNPs) for the identification of genes affecting important ecological or economical traits. SNPs are evenly distributed throughout the genome and are likely to be functionally relevant. In rainbow trout, in silico screening of EST databases represents an attractive approach for de novo SNP identification. Nevertheless, EST sequencing errors and assembly of EST paralogous sequences can lead to the identification of false positive SNPs which renders the reliability of EST-derived SNPs relatively low. Further validation of EST-derived SNPs is therefore required. The objective of this work was to assess the quality of and to validate a large number of rainbow trout EST-derived SNPs. Results A panel of 1,152 EST-derived SNPs was selected from the INRA Sigenae SNP database and was genotyped in standard and double haploid individuals from several populations using the Illumina GoldenGate BeadXpress assay. High-quality genotyping data were obtained for 958 SNPs representing a genotyping success rate of 83.2 %, out of which, 350 SNPs (36.5 %) were polymorphic in at least one population and were designated as true SNPs. They also proved to be a potential tool to investigate genetic diversity of the species, as the set of SNP successfully sorted individuals into three main groups using STRUCTURE software. Functional annotations revealed 28 non-synonymous SNPs, out of which four substitutions were predicted to affect protein functions. A subset of 223 true SNPs were polymorphic in the two INRA mapping reference families and were integrated into the INRA microsatellite-based linkage map. Conclusions Our results represent the first study of EST-derived SNPs validation in rainbow trout, a species whose genome sequences is not yet available. We designed several specific filters in order to improve the genotyping yield. Nevertheless, our selection criteria should be further improved in order to reduce the observed high rate of false positive SNPs which results from the occurrence of whole genome duplications.
Collapse
Affiliation(s)
- Mekki Boussaha
- INRA, UMR 1313 Génétique Animale et Biologie Intégrative, 78350, Jouy-en-Josas, France.
| | | | | | | | | |
Collapse
|
9
|
Ferguson ME, Hearne SJ, Close TJ, Wanamaker S, Moskal WA, Town CD, de Young J, Marri PR, Rabbi IY, de Villiers EP. Identification, validation and high-throughput genotyping of transcribed gene SNPs in cassava. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:685-95. [PMID: 22069119 DOI: 10.1007/s00122-011-1739-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Accepted: 10/18/2011] [Indexed: 05/05/2023]
Abstract
The availability of genomic resources can facilitate progress in plant breeding through the application of advanced molecular technologies for crop improvement. This is particularly important in the case of less researched crops such as cassava, a staple and food security crop for more than 800 million people. Here, expressed sequence tags (ESTs) were generated from five drought stressed and well-watered cassava varieties. Two cDNA libraries were developed: one from root tissue (CASR), the other from leaf, stem and stem meristem tissue (CASL). Sequencing generated 706 contigs and 3,430 singletons. These sequences were combined with those from two other EST sequencing initiatives and filtered based on the sequence quality. Quality sequences were aligned using CAP3 and embedded in a Windows browser called HarvEST:Cassava which is made available. HarvEST:Cassava consists of a Unigene set of 22,903 quality sequences. A total of 2,954 putative SNPs were identified. Of these 1,536 SNPs from 1,170 contigs and 53 cassava genotypes were selected for SNP validation using Illumina's GoldenGate assay. As a result 1,190 SNPs were validated technically and biologically. The location of validated SNPs on scaffolds of the cassava genome sequence (v.4.1) is provided. A diversity assessment of 53 cassava varieties reveals some sub-structure based on the geographical origin, greater diversity in the Americas as opposed to Africa, and similar levels of diversity in West Africa and southern, eastern and central Africa. The resources presented allow for improved genetic dissection of economically important traits and the application of modern genomics-based approaches to cassava breeding and conservation.
Collapse
Affiliation(s)
- Morag E Ferguson
- International Institute of Tropical Agriculture (IITA), c/o ILRI, P.O. Box 30709, Nairobi, Kenya.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Cronn R, Knaus BJ, Liston A, Maughan PJ, Parks M, Syring JV, Udall J. Targeted enrichment strategies for next-generation plant biology. AMERICAN JOURNAL OF BOTANY 2012; 99:291-311. [PMID: 22312117 DOI: 10.3732/ajb.1100356] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
PREMISE OF THE STUDY The dramatic advances offered by modern DNA sequencers continue to redefine the limits of what can be accomplished in comparative plant biology. Even with recent achievements, however, plant genomes present obstacles that can make it difficult to execute large-scale population and phylogenetic studies on next-generation sequencing platforms. Factors like large genome size, extensive variation in the proportion of organellar DNA in total DNA, polyploidy, and gene number/redundancy contribute to these challenges, and they demand flexible targeted enrichment strategies to achieve the desired goals. METHODS In this article, we summarize the many available targeted enrichment strategies that can be used to target partial-to-complete organellar genomes, as well as known and anonymous nuclear targets. These methods fall under four categories: PCR-based enrichment, hybridization-based enrichment, restriction enzyme-based enrichment, and enrichment of expressed gene sequences. KEY RESULTS Examples of plant-specific applications exist for nearly all methods described. While some methods are well established (e.g., transcriptome sequencing), other promising methods are in their infancy (hybridization enrichment). A direct comparison of methods shows that PCR-based enrichment may be a reasonable strategy for accessing small genomic targets (e.g., ≤50 kbp), but that hybridization and transcriptome sequencing scale more efficiently if larger targets are desired. CONCLUSIONS While the benefits of targeted sequencing are greatest in plants with large genomes, nearly all comparative projects can benefit from the improved throughput offered by targeted multiplex DNA sequencing, particularly as the amount of data produced from a single instrument approaches a trillion bases per run.
Collapse
Affiliation(s)
- Richard Cronn
- Pacific Northwest Research Station, USDA Forest Service, Corvallis, Oregon 97331, USA.
| | | | | | | | | | | | | |
Collapse
|
11
|
Mammadov J, Aggarwal R, Buyyarapu R, Kumpatla S. SNP markers and their impact on plant breeding. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2012; 2012:728398. [PMID: 23316221 PMCID: PMC3536327 DOI: 10.1155/2012/728398] [Citation(s) in RCA: 161] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Accepted: 11/14/2012] [Indexed: 05/19/2023]
Abstract
The use of molecular markers has revolutionized the pace and precision of plant genetic analysis which in turn facilitated the implementation of molecular breeding of crops. The last three decades have seen tremendous advances in the evolution of marker systems and the respective detection platforms. Markers based on single nucleotide polymorphisms (SNPs) have rapidly gained the center stage of molecular genetics during the recent years due to their abundance in the genomes and their amenability for high-throughput detection formats and platforms. Computational approaches dominate SNP discovery methods due to the ever-increasing sequence information in public databases; however, complex genomes pose special challenges in the identification of informative SNPs warranting alternative strategies in those crops. Many genotyping platforms and chemistries have become available making the use of SNPs even more attractive and efficient. This paper provides a review of historical and current efforts in the development, validation, and application of SNP markers in QTL/gene discovery and plant breeding by discussing key experimental strategies and cases exemplifying their impact.
Collapse
Affiliation(s)
- Jafar Mammadov
- Department of Trait Genetics and Technologies, Dow AgroSciences LLC, 9330 Zionsville Road, Indianapolis, IN 46268, USA
| | | | | | | |
Collapse
|
12
|
Louro B, Passos ALS, Souche EL, Tsigenopoulos C, Beck A, Lagnel J, Bonhomme F, Cancela L, Cerdà J, Clark MS, Lubzens E, Magoulas A, Planas JV, Volckaert FA, Reinhardt R, Canario AV. Gilthead sea bream (Sparus auratus) and European sea bass (Dicentrarchus labrax) expressed sequence tags: Characterization, tissue-specific expression and gene markers. Mar Genomics 2010; 3:179-91. [DOI: 10.1016/j.margen.2010.09.005] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2010] [Revised: 09/17/2010] [Accepted: 09/21/2010] [Indexed: 12/22/2022]
|
13
|
The impact of single nucleotide polymorphism in monomeric alpha-amylase inhibitor genes from wild emmer wheat, primarily from Israel and Golan. BMC Evol Biol 2010; 10:170. [PMID: 20534122 PMCID: PMC2898687 DOI: 10.1186/1471-2148-10-170] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2009] [Accepted: 06/09/2010] [Indexed: 11/14/2022] Open
Abstract
Background Various enzyme inhibitors act on key insect gut digestive hydrolases, including alpha-amylases and proteinases. Alpha-amylase inhibitors have been widely investigated for their possible use in strengthening a plant's defense against insects that are highly dependent on starch as an energy source. We attempted to unravel the diversity of monomeric alpha-amylase inhibitor genes of Israeli and Golan Heights' wild emmer wheat with different ecological factors (e.g., geography, water, and temperature). Population methods that analyze the nature and frequency of allele diversity within a species and the codon analysis method (comparing patterns of synonymous and non-synonymous changes in protein coding sequences) were used to detect natural selection. Results Three hundred and forty-eight sequences encoding monomeric alpha-amylase inhibitors (WMAI) were obtained from 14 populations of wild emmer wheat. The frequency of SNPs in WMAI genes was 1 out of 16.3 bases, where 28 SNPs were detected in the coding sequence. The results of purifying and the positive selection hypothesis (p < 0.05) showed that the sequences of WMAI were contributed by both natural selection and co-evolution, which ensured conservation of protein function and inhibition against diverse insect amylases. The majority of amino acid substitutions occurred at the C-terminal (positive selection domain), which ensured the stability of WMAI. SNPs in this gene could be classified into several categories associated with water, temperature, and geographic factors, respectively. Conclusions Great diversity at the WMAI locus, both between and within populations, was detected in the populations of wild emmer wheat. It was revealed that WMAI were naturally selected for across populations by a ratio of dN/dS as expected. Ecological factors, singly or in combination, explained a significant proportion of the variations in the SNPs. A sharp genetic divergence over very short geographic distances compared to a small genetic divergence between large geographic distances also suggested that the SNPs were subjected to natural selection, and ecological factors had an important evolutionary role in polymorphisms at this locus. According to population and codon analysis, these results suggested that monomeric alpha-amylase inhibitors are adaptively selected under different environmental conditions.
Collapse
|
14
|
Guerra Cardoso H, Doroteia Campos M, Rita Costa A, Catarina Campos M, Nothnagel T, Arnholdt-Schmitt B. Carrot alternative oxidase gene AOX2a demonstrates allelic and genotypic polymorphisms in intron 3. PHYSIOLOGIA PLANTARUM 2009; 137:592-608. [PMID: 19941625 DOI: 10.1111/j.1399-3054.2009.01299.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Single nucleotide polymorphisms (SNPs) and insertion-deletions (InDels) are becoming important genetic markers for major crop species. In this study, we focus on variations at genomic level of the Daucus carota L. AOX2a gene. The use of gene-specific primers designed in exon regions on the boundaries of introns permitted to recognize intron length polymorphism (ILP) in intron 3 AOX2a by simple polymerase chain reaction (PCR) assays. The length of intron 3 can vary in individual carrot plants. Thus, allelic variation can be used as a tool to discriminate between single plant genotypes. Using this approach, individual plants from cv. Rotin and from diverse breeding lines and cultivars were identified that showed genetic variability by AOX2a ILPs. Repetitive patterns of intron length variation have been observed which allows grouping of genotypes. Polymorphic and identical PCR fragments revealed underlying high levels of sequence polymorphism. Variability was due to InDel events and intron single nucleotide polymorphisms (ISNPs), with a repetitive deletion in intron 3 affecting a putative pre-miRNA site. The results suggest that high AOX2a gene diversity in D. carota can be explored for the development of functional markers related to agronomic traits.
Collapse
Affiliation(s)
- Hélia Guerra Cardoso
- EU Marie Curie Chair, ICAAM, University of Evora, Apartado 94, 7002-554 Evora, Portugal
| | | | | | | | | | | |
Collapse
|
15
|
Oh YS, Nguyen DT, Park KH, Dirisala VR, Choi HJ, Park CK. Identification of 1,531 cSNPs from Full-length Enriched cDNA Libraries of the Korean Native Pig Using in Silico Analysis. Genomics Inform 2009. [DOI: 10.5808/gi.2009.7.2.065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
16
|
Duran C, Appleby N, Vardy M, Imelfort M, Edwards D, Batley J. Single nucleotide polymorphism discovery in barley using autoSNPdb. PLANT BIOTECHNOLOGY JOURNAL 2009; 7:326-33. [PMID: 19386041 DOI: 10.1111/j.1467-7652.2009.00407.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Molecular markers are used to provide the link between genotype and phenotype, for the production of molecular genetic maps and to assess genetic diversity within and between related species. Single nucleotide polymorphisms (SNPs) are the most abundant molecular genetic marker. SNPs can be identified in silico, but care must be taken to ensure that the identified SNPs reflect true genetic variation and are not a result of errors associated with DNA sequencing. The SNP detection method autoSNP has been developed to identify SNPs from sequence data for any species. Confidence in the predicted SNPs is based on sequence redundancy, and haplotype co-segregation scores are calculated for a further independent measure of confidence. We have extended the autoSNP method to produce autoSNPdb, which integrates SNP and gene annotation information with a graphical viewer. We have applied this software to public barley expressed sequences, and the resulting database is available over the Internet. SNPs can be viewed and searched by sequence, functional annotation or predicted synteny with a reference genome, in this case rice. The correlation between SNPs and barley cultivar, expressed tissue type and development stage has been collated for ease of exploration. An average of one SNP per 240 bp was identified, with SNPs more prevalent in the 5' regions and simple sequence repeat (SSR) flanking sequences. Overall, autoSNPdb can provide a wealth of genetic polymorphism information for any species for which sequence data are available.
Collapse
Affiliation(s)
- Chris Duran
- Australian Centre for Plant Functional Genomics, School of Land, Crop and Food Sciences, Institute for Molecular Bioscience, University of Queensland, Brisbane, Qld 4072, Australia
| | | | | | | | | | | |
Collapse
|
17
|
Masouleh AK, Waters DLE, Reinke RF, Henry RJ. A high-throughput assay for rapid and simultaneous analysis of perfect markers for important quality and agronomic traits in rice using multiplexed MALDI-TOF mass spectrometry. PLANT BIOTECHNOLOGY JOURNAL 2009; 7:355-363. [PMID: 19379284 DOI: 10.1111/j.1467-7652.2009.00411.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
The application of single nucleotide polymorphisms (SNPs) in plant breeding involves the analysis of a large number of samples, and therefore requires rapid, inexpensive and highly automated multiplex methods to genotype the sequence variants. We have optimized a high-throughput multiplexed SNP assay for eight polymorphisms which explain two agronomic and three grain quality traits in rice. Gene fragments coding for the agronomic traits plant height (semi-dwarf, sd-1) and blast disease resistance (Pi-ta) and the quality traits amylose content (waxy), gelatinization temperature (alk) and fragrance (fgr) were amplified in a multiplex polymerase chain reaction. A single base extension reaction carried out at the polymorphism responsible for each of these phenotypes within these genes generated extension products which were quantified by a matrix-assisted laser desorption ionization-time of flight system. The assay detects both SNPs and indels and is co-dominant, simultaneously detecting both homozygous and heterozygous samples in a multiplex system. This assay analyses eight functional polymorphisms in one 5 microL reaction, demonstrating the high-throughput and cost-effective capability of this system. At this conservative level of multiplexing, 3072 assays can be performed in a single 384-well microtitre plate, allowing the rapid production of valuable information for selection in rice breeding.
Collapse
Affiliation(s)
- Ardashir K Masouleh
- Centre for Plant Conservation Genetics, Southern Cross University, Lismore, NSW 2480, Australia
| | | | | | | |
Collapse
|
18
|
Chen Q, Qi PF, Wei YM, Wang JR, Zheng YL. Molecular characterization of the pina gene in einkorn wheat. Biochem Genet 2009; 47:384-96. [PMID: 19390960 DOI: 10.1007/s10528-009-9239-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2008] [Accepted: 04/05/2009] [Indexed: 11/25/2022]
Abstract
Fifty-six sequences encoding the pina protein were characterized from three species or subspecies of einkorn wheat. These sequences contained 1,595 nucleotides, including 1,270 conserved sites, 21 single nucleotide polymorphisms (SNPs), and 16 indels. The average frequency of SNPs and indels was one out of 76.1 and 99.9 bases, respectively. Five SNPs and no indels were found in the translated sequences. Fourteen haplotypes were defined, and the accessions in each haplotype ranged from 1 to 18. There were nine haplotypes in Triticum monococcum ssp. aegilopoides, eight in T. monococcum ssp. monococcum, and two in T. urartu. Phylogenetic analysis showed that pina genes from different species or subspecies could be clearly differentiated based on the open reading frame. Genes from T. urartu grouped together, whereas genes from T. monococcum ssp. aegilopoides and T. monococcum ssp. monococcum were shared by three and two clusters, respectively. Both the haplotype and phylogenetic analyses indicated that T. monococcum ssp. aegilopoides was more diverse. These results would contribute to the understanding of functional aspects and efficient utilization of pina genes.
Collapse
Affiliation(s)
- Qing Chen
- Triticeae Research Institute, Sichuan Agricultural University, Yaan, Sichuan, China
| | | | | | | | | |
Collapse
|
19
|
Kerstens HH, Kollers S, Kommadath A, Del Rosario M, Dibbits B, Kinders SM, Crooijmans RP, Groenen MA. Mining for single nucleotide polymorphisms in pig genome sequence data. BMC Genomics 2009; 10:4. [PMID: 19126189 PMCID: PMC2637895 DOI: 10.1186/1471-2164-10-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2008] [Accepted: 01/06/2009] [Indexed: 11/30/2022] Open
Abstract
Background Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is still limited. Results A total of 4.8 million whole genome shotgun sequences obtained from the NCBI trace-repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project" were analysed for the presence of SNPs. Available BAC and BAC-end sequences and their naming and mapping information, all obtained from SangerInstitute FTP site, served as a rough assembly of a reference genome. In 1.2 Gb of pig genome sequence, we identified 98,151 SNPs in which one of the sequences in the alignment represented the polymorphism and 6,374 SNPs in which two sequences represent an identical polymorphism. To benchmark the SNP identification method, 163 SNPs, in which the polymorphism was represented twice in the sequence alignment, were selected and tested on a panel of three purebred boar lines and wild boar. Of these 163 in silico identified SNPs, 134 were shown to be polymorphic in our animal panel. Conclusion This SNP identification method, which mines for SNPs in publicly available porcine shotgun sequences repositories, provides thousands of high quality SNPs. Benchmarking in an animal panel showed that more than 80% of the predicted SNPs represented true genetic variation.
Collapse
Affiliation(s)
- Hindrik Hd Kerstens
- Animal Breeding and Genetics Group, Wageningen University, Wageningen, the Netherlands.
| | | | | | | | | | | | | | | |
Collapse
|
20
|
|
21
|
Vezzulli S, Micheletti D, Riaz S, Pindo M, Viola R, This P, Walker MA, Troggio M, Velasco R. A SNP transferability survey within the genus Vitis. BMC PLANT BIOLOGY 2008; 8:128. [PMID: 19087337 PMCID: PMC2631481 DOI: 10.1186/1471-2229-8-128] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2008] [Accepted: 12/16/2008] [Indexed: 05/18/2023]
Abstract
BACKGROUND Efforts to sequence the genomes of different organisms continue to increase. The DNA sequence is usually decoded for one individual and its application is for the whole species. The recent sequencing of the highly heterozygous Vitis vinifera L. cultivar Pinot Noir (clone ENTAV 115) genome gave rise to several thousand polymorphisms and offers a good model to study the transferability of its degree of polymorphism to other individuals of the same species and within the genus. RESULTS This study was performed by genotyping 137 SNPs through the SNPlex Genotyping System (Applied Biosystems Inc.) and by comparing the SNPlex sequencing results across 35 (of the 137) regions from 69 grape accessions. A heterozygous state transferability of 31.5% across the unrelated cultivars of V. vinifera, of 18.8% across the wild forms of V. vinifera, of 2.3% among non-vinifera Vitis species, and of 0% with Muscadinia rotundifolia was found. In addition, mean allele frequencies were used to evaluate SNP informativeness and develop useful subsets of markers. CONCLUSION Using SNPlex application and corroboration from the sequencing analysis, the informativeness of SNP markers from the heterozygous grape cultivar Pinot Noir was validated in V. vinifera (including cultivars and wild forms), but had a limited application for non-vinifera Vitis species where a resequencing strategy may be preferred, knowing that homology at priming sites is sufficient. This work will allow future applications such as mapping and diversity studies, accession identification and genomic-research assisted breeding within V. vinifera.
Collapse
Affiliation(s)
- Silvia Vezzulli
- IASMA Research Centre, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Diego Micheletti
- IASMA Research Centre, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Summaira Riaz
- Department of Viticulture and Enology, University of California, One Shields Avenue, CA-95616 Davis, USA
| | - Massimo Pindo
- IASMA Research Centre, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Roberto Viola
- IASMA Research Centre, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Patrice This
- UMR 1097, DIA-PC Equipe Génétique Vigne, INRA-Supagro, 2 place Viala, F-34060 Montpellier, France
| | - M Andrew Walker
- Department of Viticulture and Enology, University of California, One Shields Avenue, CA-95616 Davis, USA
| | - Michela Troggio
- IASMA Research Centre, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Riccardo Velasco
- IASMA Research Centre, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| |
Collapse
|
22
|
Scheibye-Alsing K, Hoffmann S, Frankel A, Jensen P, Stadler PF, Mang Y, Tommerup N, Gilchrist MJ, Nygård AB, Cirera S, Jørgensen CB, Fredholm M, Gorodkin J. Sequence assembly. Comput Biol Chem 2008; 33:121-36. [PMID: 19152793 DOI: 10.1016/j.compbiolchem.2008.11.003] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 11/28/2008] [Accepted: 11/28/2008] [Indexed: 01/20/2023]
Abstract
Despite the rapidly increasing number of sequenced and re-sequenced genomes, many issues regarding the computational assembly of large-scale sequencing data have remain unresolved. Computational assembly is crucial in large genome projects as well for the evolving high-throughput technologies and plays an important role in processing the information generated by these methods. Here, we provide a comprehensive overview of the current publicly available sequence assembly programs. We describe the basic principles of computational assembly along with the main concerns, such as repetitive sequences in genomic DNA, highly expressed genes and alternative transcripts in EST sequences. We summarize existing comparisons of different assemblers and provide a detailed descriptions and directions for download of assembly programs at: http://genome.ku.dk/resources/assembly/methods.html.
Collapse
Affiliation(s)
- K Scheibye-Alsing
- Division of Genetics and Bioinformatics, IBHV, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Vezzulli S, Troggio M, Coppola G, Jermakow A, Cartwright D, Zharkikh A, Stefanini M, Grando MS, Viola R, Adam-Blondon AF, Thomas M, This P, Velasco R. A reference integrated map for cultivated grapevine (Vitis vinifera L.) from three crosses, based on 283 SSR and 501 SNP-based markers. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008. [PMID: 18504538 DOI: 10.1007/s00122-008-07943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
We have developed an integrated map from five elite cultivars of Vitis vinifera L.; Syrah, Pinot Noir, Grenache, Cabernet Sauvignon and Riesling which are parents of three segregating populations. A new source of markers, SNPs, identified in ESTs and unique BAC-end sequences was added to the available IGGP reference set of SSRs. The complete integrated map comprises 1,134 markers (350 AFLP, 332 BESs, 169 ESTs, 283 SSRs) spanning 1,443 cM over 19 linkage groups and shows a mean distance between neighbouring loci of 1.27 cM. Marker order was mainly conserved between the integrated map and the highly dense SyrahxPinot Noir consensus map except for few inversions. Moreover, the marker order has been validated through the assembled genome sequence of Pinot Noir. We have also assessed the transferability of SNP-based markers among five V. vinifera varieties, enabling marker validation across different genotypes. This integrated map can serve as a fundamental tool for molecular breeding in V. vinifera and related species and provide a basis for studies of genome organization and evolution in grapevines.
Collapse
|
24
|
Vezzulli S, Troggio M, Coppola G, Jermakow A, Cartwright D, Zharkikh A, Stefanini M, Grando MS, Viola R, Adam-Blondon AF, Thomas M, This P, Velasco R. A reference integrated map for cultivated grapevine (Vitis vinifera L.) from three crosses, based on 283 SSR and 501 SNP-based markers. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008; 117:499-511. [PMID: 18504538 DOI: 10.1007/s00122-008-0794-3] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2008] [Accepted: 05/02/2008] [Indexed: 05/17/2023]
Abstract
We have developed an integrated map from five elite cultivars of Vitis vinifera L.; Syrah, Pinot Noir, Grenache, Cabernet Sauvignon and Riesling which are parents of three segregating populations. A new source of markers, SNPs, identified in ESTs and unique BAC-end sequences was added to the available IGGP reference set of SSRs. The complete integrated map comprises 1,134 markers (350 AFLP, 332 BESs, 169 ESTs, 283 SSRs) spanning 1,443 cM over 19 linkage groups and shows a mean distance between neighbouring loci of 1.27 cM. Marker order was mainly conserved between the integrated map and the highly dense SyrahxPinot Noir consensus map except for few inversions. Moreover, the marker order has been validated through the assembled genome sequence of Pinot Noir. We have also assessed the transferability of SNP-based markers among five V. vinifera varieties, enabling marker validation across different genotypes. This integrated map can serve as a fundamental tool for molecular breeding in V. vinifera and related species and provide a basis for studies of genome organization and evolution in grapevines.
Collapse
|
25
|
Helftenbein G, Koslowski M, Dhaene K, Seitz G, Sahin U, Türeci O. In silico strategy for detection of target candidates for antibody therapy of solid tumors. Gene 2008; 414:76-84. [PMID: 18358640 DOI: 10.1016/j.gene.2008.02.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2007] [Revised: 02/05/2008] [Accepted: 02/13/2008] [Indexed: 10/22/2022]
Abstract
In contrast to earlier attempts for the identification of target candidates suitable for monoclonal antibody (mAb) based cancer therapies we concentrated on highly selective lineage-specific genes additionally preserved or even overexpressed in orthotopic cancers. In a script aided workflow we reduced all human entries of the RefSeq mRNA database to those encoding transmembrane domain bearing gene products and subjected them to BLAST analysis against the human EST database. All BLAST results were validated in a gene centric way allowing two types of data curation prior to expression profiling of matching ESTs in selected healthy tissues: (i) exclusion of questionable ESTs arising e.g. from genomic contamination and (ii) elimination of erroneously predicted mRNAs as well as transcripts with only weak EST coverage. The impact of such stringent input control on accuracy of prediction is underlined by RT-PCR confirmation of predicted tissue distribution patterns for a number of selected candidates.
Collapse
|
26
|
Cervigni GDL, Paniego N, Díaz M, Selva JP, Zappacosta D, Zanazzi D, Landerreche I, Martelotto L, Felitti S, Pessino S, Spangenberg G, Echenique V. Expressed sequence tag analysis and development of gene associated markers in a near-isogenic plant system of Eragrostis curvula. PLANT MOLECULAR BIOLOGY 2008; 67:1-10. [PMID: 18196464 DOI: 10.1007/s11103-007-9282-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2007] [Accepted: 12/22/2007] [Indexed: 05/05/2023]
Abstract
Eragrostis curvula (Schrad.) Nees is a forage grass native to the semiarid regions of Southern Africa, which reproduces mainly by pseudogamous diplosporous apomixis. A collection of ESTs was generated from four cDNA libraries, three of them obtained from panicles of near-isogenic lines with different ploidy levels and reproductive modes, and one obtained from 12 days-old plant leaves. A total of 12,295 high-quality ESTs were clustered and assembled, rendering 8,864 unigenes, including 1,490 contigs and 7,394 singletons, with a genome coverage of 22%. A total of 7,029 (79.11%) unigenes were functionally categorized by BLASTX analysis against sequences deposited in public databases, but only 37.80% could be classified according to Gene Ontology. Sequence comparison against the cereals genes indexes (GI) revealed 50% significant hits. A total of 254 EST-SSRs were detected from 219 singletons and 35 from contigs. Di- and tri- motifs were similarly represented with percentages of 38.95 and 40.16%, respectively. In addition, 190 SNPs and Indels were detected in 18 contigs generated from 3 to 4 libraries. The ESTs and the molecular markers obtained in this study will provide valuable resources for a wide range of applications including gene identification, genetic mapping, cultivar identification, analysis of genetic diversity, phenotype mapping and marker assisted selection.
Collapse
Affiliation(s)
- Gerardo D L Cervigni
- Centro de Recursos Naturales Renovables de la Zona Semiárida-CONICET, Camino de La Carrindanga Km 7.0, Bahia Blanca, Argentina
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Wang JR, Wei YM, Long XY, Yan ZH, Nevo E, Baum BR, Zheng YL. Molecular evolution of dimeric alpha-amylase inhibitor genes in wild emmer wheat and its ecological association. BMC Evol Biol 2008; 8:91. [PMID: 18366725 PMCID: PMC2324104 DOI: 10.1186/1471-2148-8-91] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2007] [Accepted: 03/24/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND alpha-Amylase inhibitors are attractive candidates for the control of seed weevils, as these insects are highly dependent on starch as an energy source. In this study, we aimed to reveal the structure and diversity of dimeric alpha-amylase inhibitor genes in wild emmer wheat from Israel and to elucidate the relationship between the emmer wheat genes and ecological factors using single nucleotide polymorphism (SNP) markers. Another objective of this study was to find out whether there were any correlations between SNPs in functional protein-coding genes and the environment. RESULTS The influence of ecological factors on the genetic structure of dimeric alpha-amylase inhibitor genes was evaluated by specific SNP markers. A total of 244 dimeric alpha-amylase inhibitor genes were obtained from 13 accessions in 10 populations. Seventy-five polymorphic positions and 74 haplotypes were defined by sequence analysis. Sixteen out of the 75 SNP markers were designed to detect SNP variations in wild emmer wheat accessions from different populations in Israel. The proportion of polymorphic loci P (5%), the expected heterozygosity He, and Shannon's information index in the 16 populations were 0.887, 0.404, and 0.589, respectively. The populations of wild emmer wheat showed great diversity in gene loci both between and within populations. Based on the SNP marker data, the genetic distance of pair-wise comparisons of the 16 populations displayed a sharp genetic differentiation over long geographic distances. The values of P, He, and Shannon's information index were negatively correlated with three climatic moisture factors, whereas the same values were positively correlated by Spearman rank correlation coefficients' analysis with some of the other ecological factors. CONCLUSION The populations of wild emmer wheat showed a wide range of diversity in dimeric alpha-amylase inhibitors, both between and within populations. We suggested that SNP markers are useful for the estimation of genetic diversity of functional genes in wild emmer wheat. These results show significant correlations between SNPs in the alpha-amylase inhibitor genes and ecological factors affecting diversity. Ecological factors, singly or in combination, explained a significant proportion of the variations in the SNPs, and the SNPs could be classified into several categories as ecogeographical predictors. It was suggested that the SNPs in the alpha-amylase inhibitor genes have been subjected to natural selection, and ecological factors had an important evolutionary influence on gene differentiation at specific loci.
Collapse
Affiliation(s)
- Ji-Rui Wang
- Triticeae Research Institute, Sichuan Agricultural University, Yaan, Sichuan 625014, China.
| | | | | | | | | | | | | |
Collapse
|
28
|
Discovery of cSNPs in pig using full-length enriched cDNA libraries of the Korean native pig as a source of genetic diversity. BIOTECHNOL BIOPROC E 2007. [DOI: 10.1007/bf02931066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
29
|
Tacconi G, Baldassarre V, Collins NC, Bulgarelli D, Stanca AM, Valè G. Haplotype characterization and markers at the barley Mlo powdery mildew resistance locus as tools for marker-assisted selection. Genome 2007; 49:864-72. [PMID: 17036060 DOI: 10.1139/g06-044] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Recessive mlo alleles of the barley Mlo gene confer resistance to almost all known isolates of the powdery mildew fungal pathogen targeting barley (Hordeum vulgare). To characterize haplotypes present in the Mlo chromosomal region of cultivated Mlo and mlo barley genotypes, we conducted a polymorphism search in 3 predicted low-copy sequence regions adjacent to the Mlo gene by examining a sample of 4 Mlo and 3 mlo cultivars. Eight single-nucleotide polymorphisms (SNPs) and 1 insertion-deletion (indel) were detected, and easy to use PCR-based markers were developed for typing the SNPs. The PCR markers were used to characterize a collection of 46 Mlo and 25 mlo barley cultivars, identifying 3 distinct mlo-11 haplotypes, 1 mlo-9 haplotype, and 4 Mlo haplotypes. We summarized the haplotype and marker information obtained here and in a previous study to help breeders identify strategies for mlo marker-assisted selection. The ability of the markers to identify mlo-resistant genotypes in segregating populations was demonstrated using 2 resistance-characterized F2 populations derived by 3-way crosses.
Collapse
Affiliation(s)
- G Tacconi
- Istituto Sperimentale per la Cerealicoltura, Consiglio per Ricerca e Sperimentazione in Agricoltura, Fiorenzuola d'Arda, PC, Italy
| | | | | | | | | | | |
Collapse
|
30
|
Alarcon-Chaidez FJ, Sun J, Wikel SK. Transcriptome analysis of the salivary glands of Dermacentor andersoni Stiles (Acari: Ixodidae). INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2007; 37:48-71. [PMID: 17175446 DOI: 10.1016/j.ibmb.2006.10.002] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2006] [Revised: 10/10/2006] [Accepted: 10/10/2006] [Indexed: 05/13/2023]
Abstract
Amongst blood-feeding arthropods, ticks of the family Ixodidae (hard ticks) are vectors and reservoirs of a greater variety of infectious agents than any other ectoparasite. Salivary glands of ixodid ticks secrete a large number of pharmacologically active molecules that not only facilitate feeding but also promote establishment of infectious agents. Genomic, proteomic and immunologic characterization of bioactive salivary gland molecules are, therefore, important as they offer new insights into molecular events occurring at the tick-host interface and they have implications for development of novel control strategies. The present work uses complementary DNA (cDNA) sequence analysis to identify salivary gland transcripts expressed by the Rocky Mountain wood tick, Dermacentor andersoni, a vector of the human pathogens causing Rocky Mountain spotted fever, Colorado tick fever, tularemia, and Powassan encephalitis as well as the veterinary pathogen Anaplasma marginale. Dermacentor andersoni is also capable of inducing tick paralysis. Automated single-pass DNA sequencing was conducted on 1440 randomly selected cDNA clones from the salivary glands of adult female D. andersoni collected during the early stages of feeding (18-24h). Analysis of the expressed sequence tags (ESTs) resulted in 544 singletons and 218 clusters with more than one quality read and attempts were made to assign putative functions to tick genes based on amino acid identity to published protein databases. Approximately 25.6% (195) of the sequences showed limited or no homology to previously identified gene products. A number of novel sequences were identified which presented significant sequence similarity to mammalian genes normally associated with extracellular matrix (ECM), regulation of immune responses, tumor suppression, and wound healing. Several coding sequences possessed various degrees of homology to previously described proteins from other tick species. Preliminary nucleotide variation analysis of these and other tick sequences suggests extensive nucleotide diversity, which has implications for evolution of tick feeding. Intra-species diversity studies can be a promising tool for identifying sequence variations potentially associated with phenotypic traits affecting vector-host-pathogen interactions.
Collapse
Affiliation(s)
- Francisco J Alarcon-Chaidez
- School of Medicine, Department of Immunology, University of Connecticut Health Center, 263 Farmington Avenue, MC3710, Farmington, CT 06030, USA.
| | | | | |
Collapse
|
31
|
Establishment of a pipeline to analyse non-synonymous SNPs in Bos taurus. BMC Genomics 2006; 7:298. [PMID: 17125523 PMCID: PMC1684264 DOI: 10.1186/1471-2164-7-298] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2006] [Accepted: 11/26/2006] [Indexed: 11/10/2022] Open
Abstract
Background Single nucleotide polymorphisms (SNPs) are an abundant form of genetic variation in the genome of every species and are useful for gene mapping and association studies. Of particular interest are non-synonymous SNPs, which may alter protein function and phenotype. We therefore examined bovine expressed sequences for non-synonymous SNPs and validated and tested selected SNPs for their association with measured traits. Results Over 500,000 public bovine expressed sequence tagged (EST) sequences were used to search for coding SNPs (cSNPs). A total of 15,353 SNPs were detected in the transcribed sequences studied, of which 6,325 were predicted to be coding SNPs with the remaining 9,028 SNPs presumed to be in untranslated regions. Of the cSNPs detected, 2,868 were predicted to result in a change in the amino acid encoded. In order to determine the actual number of non-synonymous polymorphic SNPs we designed assays for 920 of the putative SNPs. These SNPs were then genotyped through a panel of cattle DNA pools using chip-based MALDI-TOF mass spectrometry. Of the SNPs tested, 29% were found to be polymorphic with a minor allele frequency >10%. A subset of the SNPs was genotyped through animal resources in order to look for association with age of puberty, facial eczema resistance or meat yield. Three SNPs were nominally associated with resistance to the disease facial eczema (P < 0.01). Conclusion We have identified 15,353 putative SNPs in or close to bovine genes and 2,868 of these SNPs were predicted to be non-synonymous. Approximately 29% of the non-synonymous SNPs were polymorphic and common with a minor allele frequency >10%. Of the SNPs detected in this study, 99% have not been previously reported. These novel SNPs will be useful for association studies or gene mapping.
Collapse
|
32
|
Tang J, Vosman B, Voorrips RE, van der Linden CG, Leunissen JAM. QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species. BMC Bioinformatics 2006; 7:438. [PMID: 17029635 PMCID: PMC1618865 DOI: 10.1186/1471-2105-7-438] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2005] [Accepted: 10/09/2006] [Indexed: 11/21/2022] Open
Abstract
Background Single nucleotide polymorphisms (SNPs) are important tools in studying complex genetic traits and genome evolution. Computational strategies for SNP discovery make use of the large number of sequences present in public databases (in most cases as expressed sequence tags (ESTs)) and are considered to be faster and more cost-effective than experimental procedures. A major challenge in computational SNP discovery is distinguishing allelic variation from sequence variation between paralogous sequences, in addition to recognizing sequencing errors. For the majority of the public EST sequences, trace or quality files are lacking which makes detection of reliable SNPs even more difficult because it has to rely on sequence comparisons only. Results We have developed a new algorithm to detect reliable SNPs and insertions/deletions (indels) in EST data, both with and without quality files. Implemented in a pipeline called QualitySNP, it uses three filters for the identification of reliable SNPs. Filter 1 screens for all potential SNPs and identifies variation between or within genotypes. Filter 2 is the core filter that uses a haplotype-based strategy to detect reliable SNPs. Clusters with potential paralogs as well as false SNPs caused by sequencing errors are identified. Filter 3 screens SNPs by calculating a confidence score, based upon sequence redundancy and quality. Non-synonymous SNPs are subsequently identified by detecting open reading frames of consensus sequences (contigs) with SNPs. The pipeline includes a data storage and retrieval system for haplotypes, SNPs and alignments. QualitySNP's versatility is demonstrated by the identification of SNPs in EST datasets from potato, chicken and humans. Conclusion QualitySNP is an efficient tool for SNP detection, storage and retrieval in diploid as well as polyploid species. It is available for running on Linux or UNIX systems. The program, test data, and user manual are available at and as Additional files.
Collapse
Affiliation(s)
- Jifeng Tang
- Plant Research International, PO Box 16, 6700 AA Wageningen, The Netherlands.
| | | | | | | | | |
Collapse
|
33
|
Cordeiro GM, Eliott F, McIntyre CL, Casu RE, Henry RJ. Characterisation of single nucleotide polymorphisms in sugarcane ESTs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2006; 113:331-43. [PMID: 16791699 DOI: 10.1007/s00122-006-0300-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2005] [Accepted: 04/21/2006] [Indexed: 05/04/2023]
Abstract
Commercial sugarcane cultivars (Saccharum spp. hybrids) are both polyploid and aneuploid with chromosome numbers in excess of 100; these chromosomes can be assigned to 8 homology groups. To determine the utility of single nucleotide polymorphisms (SNPs) as a means of improving our understanding of the complex sugarcane genome, we developed markers to a suite of SNPs identified in a list of sugarcane ESTs. Analysis of 69 EST contigs showed a median of 9 SNPs per EST and an average of 1 SNP per 50 bp of coding sequence. The quantitative presence of each base at 58 SNP loci within 19 contiguous sequence sets was accurately and reliably determined for 9 sugarcane genotypes, including both commercial cultivars and ancestral species, through the use of quantitative light emission technology in pyrophosphate sequencing. Across the 9 genotypes tested, 47 SNP loci were polymorphic and 11 monomorphic. Base frequency at individual SNP loci was found to vary approximately twofold between Australian sugarcane cultivars and more widely between cultivars and wild species. Base quantity was shown to segregate as expected in the IJ76-514 x Q165 sugarcane mapping population, indicating that SNPs that occur on one or two sugarcane chromosomes have the potential to be mapped. The use of SNP base frequencies from five of the developed markers was able to clearly distinguish all genotypes in the population. The use of SNP base frequencies from a further six markers within an EST contig was able to help establish the likely copy number of the locus in two genotypes tested. This is the first instance of a technology that has been able to provide an insight into the copy number of a specific gene locus in hybrid sugarcane. The identification of specific and numerous haplotypes/alleles present in a genotype by pyrophosphate sequencing or alternative techniques ultimately will provide the basis for identifying associations between specific alleles and phenotype and between allele dosage and phenotype in sugarcane.
Collapse
Affiliation(s)
- Giovanni M Cordeiro
- Centre for Plant Conservation Genetics, Southern Cross University, PO Box 157, Lismore 2480, Australia.
| | | | | | | | | |
Collapse
|
34
|
Yamamoto N, Tsugane T, Watanabe M, Yano K, Maeda F, Kuwata C, Torki M, Ban Y, Nishimura S, Shibata D. Expressed sequence tags from the laboratory-grown miniature tomato (Lycopersicon esculentum) cultivar Micro-Tom and mining for single nucleotide polymorphisms and insertions/deletions in tomato cultivars. Gene 2005; 356:127-34. [PMID: 15975739 DOI: 10.1016/j.gene.2005.04.026] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2004] [Revised: 03/08/2005] [Accepted: 04/07/2005] [Indexed: 11/25/2022]
Abstract
Laboratory-grown miniature tomato (Lycopersicon esculentum) cultivar Micro-Tom has attracted attention as a host for functional genomics research. In this study, we generated 35,824 expressed sequence tags (ESTs) from leaves and fruits of Micro-Tom. The ESTs comprised 10,287 unigenes (5007 contigs and 5280 singletons), including 1858 novel tomato unigenes. Of the 18 unigenes that shared strong homology with tobacco chloroplast genome sequences, one unigene was likely derived from polyadenylated transcripts of the atpH gene. Interestingly, ESTs for vacuolar invertase, pectate lyase and alcohol acyl transferase were underrepresented in the Micro-Tom data set. From all of the ESTs, we mined 2039 candidate single nucleotide polymorphisms (SNPs) and 121 candidate insertions and deletions (indels) based on homology with four tomato inbred lines, E6203, R11-13, Rio Grande PtoR and R11-12, and a wild relative, L. pennellii TA56, for which sequence data was publicly available with more than 5000 entries. Direct genome sequencing of several SNP or indel sites in Micro-Tom and L. esculentum E6203 suggested that more than 69% of the candidate sites were truly polymorphic, making them useful for the preparation of DNA markers.
Collapse
Affiliation(s)
- Naoki Yamamoto
- The Graduate School of Life Sciences, Tohoku University, Kazusa-Kamatari 2-6-7, Kisarazu, Chiba 292-0818, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Single nucleotide polymorphisms (SNPs) discovery and linkage disequilibrium (LD) in forest trees. ACTA ACUST UNITED AC 2005. [DOI: 10.1007/s11632-005-0024-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
36
|
Savage D, Batley J, Erwin T, Logan E, Love CG, Lim GAC, Mongin E, Barker G, Spangenberg GC, Edwards D. SNPServer: a real-time SNP discovery tool. Nucleic Acids Res 2005; 33:W493-5. [PMID: 15980519 PMCID: PMC1160223 DOI: 10.1093/nar/gki462] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
SNPServer is a real-time flexible tool for the discovery of SNPs (single nucleotide polymorphisms) within DNA sequence data. The program uses BLAST, to identify related sequences, and CAP3, to cluster and align these sequences. The alignments are parsed to the SNP discovery software autoSNP, a program that detects SNPs and insertion/deletion polymorphisms (indels). Alternatively, lists of related sequences or pre-assembled sequences may be entered for SNP discovery. SNPServer and autoSNP use redundancy to differentiate between candidate SNPs and sequence errors. For each candidate SNP, two measures of confidence are calculated, the redundancy of the polymorphism at a SNP locus and the co-segregation of the candidate SNP with other SNPs in the alignment. SNPServer is available at .
Collapse
Affiliation(s)
- David Savage
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Jacqueline Batley
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Tim Erwin
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Erica Logan
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Christopher G. Love
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Geraldine A. C. Lim
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Emmanuel Mongin
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - Gary Barker
- School of Biological Sciences, University of BristolBristol BS8 1UG, UK
| | - German C. Spangenberg
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
| | - David Edwards
- Plant Biotechnology Centre, La Trobe UniversityBundoora 3086, Victoria, Australia
- Victorian Bioinformatics Consortium, Plant Biotechnology Centre, Primary Industries Research Victoria, La Trobe UniversityBundoora 3086, Victoria, Australia
- To whom correspondence should be addressed. Tel: +61 3 9479 5633; Fax: +61 3 9479 3618;
| |
Collapse
|
37
|
Abstract
Expressed sequence tag (EST) data are a major contributor to the known plant sequence space. Organization of the data into non-redundant clusters representing tentative unique genes provides snapshots of the gene repertoires of a species. This chapter reviews availability of sequences and sequence analysis results and describes several resources and tools that should facilitate broad-based utilization of EST data for gene structure annotation, gene discovery, and comparative genomics.
Collapse
Affiliation(s)
- Qunfeng Dong
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, Iowa 50011-3260, USA
| | | | | | | | | |
Collapse
|
38
|
Abstract
Advances in genotyping and sequencing technologies, coupled with the development of sophisticated statistical methods, have afforded investigators novel opportunities to define the role of sequence variation in the development of common human diseases. At the forefront of these investigations is the use of dense maps of single-nucleotide polymorphisms (SNPs) and the haplotypes derived from these polymorphisms. Here we review basic concepts of high-density genetic maps of SNPs and haplotypes and how they are typically generated and used in human genetic research. We also provide useful examples and tools available for researchers interested in incorporating haplotypes into their studies. Finally, we discuss the latest concepts for the analysis of haplotypes related to human disease, including haplotype blocks, the International HapMap Project, and the future directions of these resources.
Collapse
Affiliation(s)
- Dana C Crawford
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | |
Collapse
|
39
|
Wang JR, Wei YM, Yan ZH, Zheng YL. Detection of single nucleotide polymorphisms in 24 kDa dimeric α-amylase inhibitors from cultivated wheat and its diploid putative progenitors. Biochim Biophys Acta Gen Subj 2005; 1723:309-20. [PMID: 15837429 DOI: 10.1016/j.bbagen.2005.03.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2005] [Revised: 02/27/2005] [Accepted: 03/01/2005] [Indexed: 11/30/2022]
Abstract
Seventeen new genes encoding 24 kDa family dimeric alpha-amylase inhibitors had been characterized from cultivated wheat and its diploid putative progenitors. And the different alpha-amylase inhibitors in this family, which were determined by coding regions single nucleotide polymorphisms (cSNPs) of their genes, were investigated. The amino acid sequences of 24 kDa alpha-amylase inhibitors shared very high coherence (91.2%). It indicated that the dimeric alpha-amylase inhibitors in the 24 kDa family were derived from common ancestral genes by phylogenetic analysis. Eight alpha-amylase inhibitor genes were characterized from one hexaploid wheat variety, and clustered into four subgroups, indicating that the 24 kDa dimeric alpha-amylase inhibitors in cultivated wheat were encoded by multi-gene. Forty-five cSNPs, including 35 transitions and 10 transversions, were found, and resulted in a total of ten amino acid changes. The cSNPs at the first site of a codon cause much more nonsynonymous (92.9%) than synonymous mutations, while nonsynonymous and synonymous mutations were almost equal when the cSNPs were at the third site. It was observed that there was Ile105 instead of Val105 at the active region Val104-Val105-Asp106-Ala107 of the alpha-amylase inhibitor by cSNPs in some inhibitors from Aegilops speltoides, diploid and hexaploid wheats.
Collapse
Affiliation(s)
- Ji-Rui Wang
- Triticeae Research Institute, Sichuan Agricultural University, Dujiangyan 611830, China
| | | | | | | |
Collapse
|
40
|
Lopez C, Piégu B, Cooke R, Delseny M, Tohme J, Verdier V. Using cDNA and genomic sequences as tools to develop SNP strategies in cassava (Manihot esculenta Crantz). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2005; 110:425-31. [PMID: 15650816 DOI: 10.1007/s00122-004-1833-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2004] [Accepted: 10/01/2004] [Indexed: 05/18/2023]
Abstract
Single nucleotide polymorphisms (SNP) are the most abundant type of DNA polymorphism found in animal and plant genomes. They provide an important new source of molecular markers that are useful in genetic mapping, map-based positional cloning, quantitative trait locus mapping and the assessment of genetic distances between individuals. Very little is known on the frequency of SNPs in cassava. We have exploited the recently-developed collection of cassava expressed sequence tags (ESTs) to detect SNPs in the five cultivars of cassava used to generate the sequences. The frequency of intra-cultivar and inter-cultivar SNPs after analysis of 111 contigs was one polymorphism per 905 and one per 1,032 bp, respectively; totaling 1 each 509 bp. We have obtained further information on the frequency of SNPs in six cassava cultivars by analysis of 33 amplicons obtained from 3' EST and BAC end sequences. Overall, about 11 kb of DNA sequence was obtained for each cultivar. A total of 186 SNPs (136 and 50 from ESTs and BAC ends, respectively) were identified. Among these, 146 were intra-cultivar polymorphisms, while 80 were inter-cultivar polymorphisms. Thus the total frequency of SNPs was one per 62 bp. This information will help to develop new strategies for EST mapping as well as their association with phenotypic characteristics.
Collapse
Affiliation(s)
- C Lopez
- Laboratoire Génome et Développement des Plantes, UMR5096, CNRS-Université de Perpignan-Institut de Recherche pour le Développement, 52 Av Paul Alduy, 66860, Perpignan Cedex, France
| | | | | | | | | | | |
Collapse
|
41
|
Lopez C, Jorge V, Piégu B, Mba C, Cortes D, Restrepo S, Soto M, Laudié M, Berger C, Cooke R, Delseny M, Tohme J, Verdier V. A unigene catalogue of 5700 expressed genes in cassava. PLANT MOLECULAR BIOLOGY 2004; 56:541-54. [PMID: 15630618 DOI: 10.1007/s11103-004-0123-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2004] [Accepted: 04/07/2004] [Indexed: 05/23/2023]
Abstract
Two economically important characters, starch content and cassava bacterial blight resistance, were targeted to generate a large collection of cassava ESTs. Two libraries were constructed from cassava root tissues of varieties with high and low starch contents. Other libraries were constructed from plant tissues challenged by the pathogen Xanthomonas axonopodis pv.manihotis. We report here the single pass sequencing of 11,954 cDNA clones from the 5' ends, including 111 from the 3' ends. Cluster analysis permitted the identification of a unigene set of 5,700 sequences. Sequence analyses permitted the assignment of a putative functional category for 37% of sequences whereas approximately 16% sequences did not show any significant similarity with other proteins present in the database and therefore can be considered as cassava specific genes. A group of genes belonging to a large multigene family was identified. We characterize a set of genes detected only in infected libraries putatively involved in the defense response to pathogen infection. By comparing two libraries obtained from cultivars contrasting in their starch content a group of genes associated to starch biosynthesis and differentially expressed was identified. This is the first large cassava EST resource developed today and publicly available thus making a significant contribution to genomic knowledge of cassava.
Collapse
Affiliation(s)
- Camilo Lopez
- Laboratoire Génome et Développement des Plantes, UMR5096, CNRS-Université de Perpignan--Institut de Recherche pour le Développement, 52 Av Paul Alduy, 66860, Perpignan Cedex, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Clegg N, Abbott D, Ferguson C, Coleman R, Nelson PS. Characterization and comparative analyses of transcriptomes from the normal and neoplastic human prostate. Prostate 2004; 60:227-39. [PMID: 15176052 DOI: 10.1002/pros.20055] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
BACKGROUND The prostate gland is a highly specialized organ with functional attributes that serve to enhance the fertility of mammalian species. Pathological processes affecting the prostate include benign prostate hypertrophy and prostate carcinoma; diseases that account for major morbidity and mortality in middle-aged and elderly men. To facilitate studies of biological processes uniquely represented in the prostate and assess molecular alterations associated with prostate carcinoma, we sought to establish the diversity of gene expression in the normal and neoplastic prostate through the compilation and analysis of a prostate transcriptome. METHODS We assembled and annotated ESTs derived from prostate cDNA libraries that were either produced in our laboratory or available from public sequence repositories such as CGAP, dbEST, and Unigene. Determinations of differential gene expression between the normal prostate, other normal tissues, and neoplastic prostate tissues was performed using statistical algorithms. Confirmation of differential expression was performed by quantitative PCR and Northern analysis. RESULTS A total of 99,448 high-quality ESTs were assembled and annotated to produce a prostate transcriptome comprised of 24,580 distinct TUs. Comparative analyses of gene expression levels identified 61 TUs with exclusive expression in the prostate and 45 TUs with high levels of expression in the prostate relative to at least 25 other normal tissues (P > 0.99). Comparative analyses of ESTs derived from neoplastic prostate tissues identified 75 genes with dysregulated expression in cancer (P > 0.99). CONCLUSIONS The human prostate expresses a diverse repertoire of genes that reflect a functionally complex organ. The identification of genes with prostate-restricted or enhanced expression may provide additional insights into the biochemical processes that interact to form the developmental, signaling, and metabolic pathways of the normal and neoplastic gland.
Collapse
Affiliation(s)
- Nigel Clegg
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | | | | | | | | |
Collapse
|
43
|
Sogayar MC, Camargo AA, Bettoni F, Carraro DM, Pires LC, Parmigiani RB, Ferreira EN, de Sá Moreira E, do Rosário D de O Latorre M, Simpson AJG, Cruz LO, Degaki TL, Festa F, Massirer KB, Sogayar MC, Filho FC, Camargo LP, Cunha MAV, De Souza SJ, Faria M, Giuliatti S, Kopp L, de Oliveira PSL, Paiva PB, Pereira AA, Pinheiro DG, Puga RD, S de Souza JE, Albuquerque DM, Andrade LEC, Baia GS, Briones MRS, Cavaleiro-Luna AMS, Cerutti JM, Costa FF, Costanzi-Strauss E, Espreafico EM, Ferrasi AC, Ferro ES, Fortes MAHZ, Furchi JRF, Giannella-Neto D, Goldman GH, Goldman MHS, Gruber A, Guimarães GS, Hackel C, Henrique-Silva F, Kimura ET, Leoni SG, Macedo C, Malnic B, Manzini B CV, Marie SKN, Martinez-Rossi NM, Menossi M, Miracca EC, Nagai MA, Nobrega FG, Nobrega MP, Oba-Shinjo SM, Oliveira MK, Orabona GM, Otsuka AY, Paço-Larson ML, Paixão BMC, Pandolfi JRC, Pardini MIMC, Passos Bueno MR, Passos GAS, Pesquero JB, Pessoa JG, Rahal P, Rainho CA, Reis CP, Ricca TI, Rodrigues V, Rogatto SR, Romano CM, Romeiro JG, Rossi A, Sá RG, Sales MM, Sant'Anna SC, Santarosa PL, Segato F, Silva WA, Silva IDCG, Silva NP, Soares-Costa A, Sonati MF, Strauss BE, Tajara EH, Valentini SR, Villanova FE, Ward LS, Zanette DL. A transcript finishing initiative for closing gaps in the human transcriptome. Genome Res 2004; 14:1413-23. [PMID: 15197164 PMCID: PMC442158 DOI: 10.1101/gr.2111304] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2003] [Accepted: 03/12/2004] [Indexed: 11/24/2022]
Abstract
We report the results of a transcript finishing initiative, undertaken for the purpose of identifying and characterizing novel human transcripts, in which RT-PCR was used to bridge gaps between paired EST clusters, mapped against the genomic sequence. Each pair of EST clusters selected for experimental validation was designated a transcript finishing unit (TFU). A total of 489 TFUs were selected for validation, and an overall efficiency of 43.1% was achieved. We generated a total of 59,975 bp of transcribed sequences organized into 432 exons, contributing to the definition of the structure of 211 human transcripts. The structure of several transcripts reported here was confirmed during the course of this project, through the generation of their corresponding full-length cDNA sequences. Nevertheless, for 21% of the validated TFUs, a full-length cDNA sequence is not yet available in public databases, and the structure of 69.2% of these TFUs was not correctly predicted by computer programs. The TF strategy provides a significant contribution to the definition of the complete catalog of human genes and transcripts, because it appears to be particularly useful for identification of low abundance transcripts expressed in a restricted set of tissues as well as for the delineation of gene boundaries and alternatively spliced isoforms.
Collapse
|
44
|
Cheng TC, Xia QY, Qian JF, Liu C, Lin Y, Zha XF, Xiang ZH. Mining single nucleotide polymorphisms from EST data of silkworm, Bombyx mori, inbred strain Dazao. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2004; 34:523-530. [PMID: 15147754 DOI: 10.1016/j.ibmb.2004.02.004] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2004] [Accepted: 02/19/2004] [Indexed: 05/24/2023]
Abstract
We made use of 81,635 expressed sequence tags (ESTs) derived from 12 different cDNA libraries of the silkworm, Bombyx mori, inbred strain Dazao (P50), to identify high-quality candidate single nucleotide polymorphisms (SNPs). By PHRAP assembling, 12,980 contigs containing 11,537 contigs assembled by more than one read were obtained, and 101 candidate SNPs and 27 single base insertions/deletions were identified from 117 contigs assembled from 1576 high-quality reads base-called with PHRED and screened on the basis of the neighborhood quality standard (NQS). Simultaneously, we also predicted 40 SNPs in coding regions (cSNPs), of which 26 were predicted to lead to amino acid non-synonymous variations and 14 synonymous substitutions. Also, the 1.66:1 ratio of transition/transversion is different from that of other insects. As the first SNP analysis of a Lepidoptera, B. mori, the single nucleotide polymorphic density is estimated to be 1.3 x 10(-3) by sequence diversity. This analysis shows that expressed sequences from multiple libraries may provide an abundant source of comparative reads to mine for cSNPs from the silkworm genome.
Collapse
Affiliation(s)
- Ting-Cai Cheng
- The Key Sericultural Laboratory of Agricultural Ministry, Southwest Agricultural University, Chongqing 400716, China
| | | | | | | | | | | | | |
Collapse
|
45
|
Hayashi K, Hashimoto N, Daigen M, Ashikawa I. Development of PCR-based SNP markers for rice blast resistance genes at the Piz locus. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 108:1212-20. [PMID: 14740086 DOI: 10.1007/s00122-003-1553-0] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2003] [Accepted: 11/21/2003] [Indexed: 05/07/2023]
Abstract
We assessed the utility of single-nucleotide polymorphisms (SNPs) and small insertion/deletion polymorphisms (InDels) as DNA markers in genetic analysis and breeding of rice. Toward this end, we surveyed SNPs and InDels in the chromosomal region containing the Piz and Piz-t rice blast resistance genes and developed PCR-based markers for typing the SNPs. Analysis of sequences from a blast-susceptible Japanese cultivar and two cultivars each containing one of these genes revealed that SNPs are abundant in the Piz and Piz-t regions (on average, one SNP every 248 bp), but the number of InDels was much lower. The dense distribution of SNPs facilitated the generation of SNP markers in the vicinity of the genes. For typing these SNPs, we used a modified allele-specific PCR method. Of the 49 candidate allele-specific markers, 33 unambiguously and reproducibly discriminated between the two alleles. We used the markers for mapping the Piz and Piz-t genes and evaluating the size of DNA segments introgressed from the Piz donor cultivar in Japanese near-isogenic lines containing Piz. Our findings suggest that, because of its ability to generate numerous markers within a target region and its simplicity in assaying genotypes, SNP genotyping with allele-specific PCR is a valuable tool for gene mapping, map-based cloning, and marker-assisted selection in crops, especially rice.
Collapse
Affiliation(s)
- K Hayashi
- National Agricultural Research Center, 1-2-1 Inada, Joetsu, 943-0193 Niigata, Japan
| | | | | | | |
Collapse
|
46
|
Tian AG, Wang J, Cui P, Han YJ, Xu H, Cong LJ, Huang XG, Wang XL, Jiao YZ, Wang BJ, Wang YJ, Zhang JS, Chen SY. Characterization of soybean genomic features by analysis of its expressed sequence tags. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 108:903-13. [PMID: 14624337 DOI: 10.1007/s00122-003-1499-2] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2003] [Accepted: 10/09/2003] [Indexed: 05/23/2023]
Abstract
We analyzed 314,254 soybean expressed sequence tags (ESTs), including 29,540 from our laboratory and 284,714 from GenBank. These ESTs were assembled into 56,147 unigenes. About 76.92% of the unigenes were homologous to genes from Arabidopsis thaliana ( Arabidopsis). The putative products of these unigenes were annotated according to their homology with the categorized proteins of Arabidopsis. Genes corresponding to cell growth and/or maintenance, enzymes and cell communication belonged to the slow-evolving class, whereas genes related to transcription regulation, cell, binding and death appeared to be fast-evolving. Soybean unigenes with no match to genes within the Arabidopsis genome were identified as soybean-specific genes. These genes were mainly involved in nodule development and the synthesis of seed storage proteins. In addition, we also identified 61 genes regulated by salicylic acid, 1,322 transcription factor genes and 326 disease resistance-like genes from soybean unigenes. SSR analysis showed that the soybean genome was more complex than the Arabidopsis and the Medicago truncatula genomes. GC content in soybean unigene sequences is similar to that in Arabidopsis and M. truncatula. Furthermore, the combined analysis of the EST database and the BAC-contig sequences revealed that the total gene number in the soybean genome is about 63,501.
Collapse
Affiliation(s)
- Ai-Guo Tian
- Plant Biotechnology Laboratory, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Datun Road, 100101, Beijing, China
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Vettore AL, da Silva FR, Kemper EL, Souza GM, da Silva AM, Ferro MIT, Henrique-Silva F, Giglioti EA, Lemos MVF, Coutinho LL, Nobrega MP, Carrer H, França SC, Bacci Júnior M, Goldman MHS, Gomes SL, Nunes LR, Camargo LEA, Siqueira WJ, Van Sluys MA, Thiemann OH, Kuramae EE, Santelli RV, Marino CL, Targon MLPN, Ferro JA, Silveira HCS, Marini DC, Lemos EGM, Monteiro-Vitorello CB, Tambor JHM, Carraro DM, Roberto PG, Martins VG, Goldman GH, de Oliveira RC, Truffi D, Colombo CA, Rossi M, de Araujo PG, Sculaccio SA, Angella A, Lima MMA, de Rosa Júnior VE, Siviero F, Coscrato VE, Machado MA, Grivet L, Di Mauro SMZ, Nobrega FG, Menck CFM, Braga MDV, Telles GP, Cara FAA, Pedrosa G, Meidanis J, Arruda P. Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane. Genome Res 2003; 13:2725-35. [PMID: 14613979 PMCID: PMC403815 DOI: 10.1101/gr.1532103] [Citation(s) in RCA: 216] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged.
Collapse
MESH Headings
- Computational Biology/methods
- Computational Biology/statistics & numerical data
- DNA, Complementary/analysis
- DNA, Complementary/classification
- DNA, Complementary/physiology
- DNA, Plant/analysis
- DNA, Plant/classification
- DNA, Plant/physiology
- Expressed Sequence Tags
- Gene Expression Regulation, Plant
- Gene Library
- Molecular Sequence Data
- Organ Specificity/genetics
- Peptides/classification
- Peptides/genetics
- Peptides/physiology
- Plant Proteins/classification
- Plant Proteins/genetics
- Plant Proteins/physiology
- Polymorphism, Genetic/genetics
- Protein Structure, Tertiary/genetics
- Saccharum/genetics
- Saccharum/growth & development
- Saccharum/physiology
- Sequence Analysis, DNA/methods
- Signal Transduction/genetics
Collapse
Affiliation(s)
- André L Vettore
- Centro de Biologia Molecular e Engenharia Genética, Instituto da Computação, Universidade Estadual de Campinas, 13083-970 Campinas-SP, Brazil
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Kim H, Schmidt CJ, Decker KS, Emara MG. A double-screening method to identify reliable candidate non-synonymous SNPs from chicken EST data. Anim Genet 2003; 34:249-54. [PMID: 12873212 DOI: 10.1046/j.1365-2052.2003.01003.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Discovery of non-synonymous single nucleotide polymorphisms (nsSNP), which cause amino acid substitutions, is important because they are more likely to alter protein function than synonymous SNPs (sSNP) or those SNPs that do not result in amino acid changes. By changing the coding sequences, nsSNP may play a role in heritable differences between individual organisms. In the chicken and many other vertebrates, the main obstacle for identifying nsSNP is that there is insufficient protein and mRNA sequence information for self-species referencing and thus, determination of the correct reading frame for expressed sequence tags (ESTs) is difficult. Therefore, in order to estimate the correct reading frame at nsSNP in chicken ESTs, a double-screening approach was designed using self- or cross-species protein referencing, in addition to the ESTScan coding region estimation programme. Starting with 23 427 chicken ESTs, 1210 potential SNPs were discovered using a phred/phrap/polyphred/consed pipeline process and among these, 108 candidate nsSNP were identified with the double screening method. A searchable SNP database (chicksnps) for the candidate chicken SNPs, including both nsSNPs and sSNPs is available at http://chicksnps.afs.udel.edu. The chicken SNP data described in this paper have been submitted to the data base SNP under National Center for Biotechnology Information assay ID ss4387050-ss4388259.
Collapse
Affiliation(s)
- H Kim
- Department of Animal and Food Sciences, University of Delaware, Newark, DE 19716, USA
| | | | | | | |
Collapse
|
49
|
Abstract
The current chicken genetic map contains at least 1,965 loci within 50 linkage groups, and it covers about 4,000 cM. About 235 of these loci have homology with known human or mammalian genes. The remaining loci are anonymous molecular DNA markers, including microsatellites, amplified fragment length polymorphism (AFLP), randomly amplified polymorphic DNA (RAPD), CR1 elements, and others. A third generation genetic map for human uses single nucleotide polymorphisms (SNP), which have allowed the mapping of complex traits by linkage disequilibrium. One advantage of SNP is that they are usually linked to the gene of interest, and association of the SNP with traits of economic importance can be analyzed using candidate gene approaches. With the tremendous advancements in characterizing chicken expressed sequence tags (EST), the identification of genetic polymorphisms such as SNP in chicken genes has become a reality. Our laboratory has undertaken an in silico analysis of the chicken EST at the University of Delaware by using a Phred/Phrap/Polyphred/Consed pipeline to identify candidate chicken SNP. Initial scanning of 23,427 chicken EST identified a total of 1,209 candidate SNP, with at least 182 non-synonymous SNP that result in an amino acid change observed. Validation of these candidate chicken SNP is ongoing. Placement of the SNP on the chicken genetic map will enhance marker density, thus allowing for mapping of complex traits through linkage analysis and linkage disequilibrium. Application of SNP to identify disease resistance genes in chickens is of special interest to our laboratory, especially in regards to Marek's disease and coccidiosis.
Collapse
Affiliation(s)
- M G Emara
- Department of Animal and Food Sciences, University of Delaware, Newark, Delaware 19717, USA.
| | | |
Collapse
|
50
|
Ye Z, Parry JM. Identification of polymorphisms in the human Reprimo gene using public EST data. TERATOGENESIS, CARCINOGENESIS, AND MUTAGENESIS 2003; 22:485-93. [PMID: 12395409 DOI: 10.1002/tcm.10044] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The human Reprimo gene is a recently identified cytoplasmic protein, which plays an important role in the regulation of p53-dependent G2 arrest of the cell cycle. Genetic variations in the Reprimo gene that may influence enzyme activity can be of both biological and epidemiological significance. The human expressed sequence tag (EST) database is a wealth of resources, which can be used to rapidly screen for potential polymorphisms in proteins of physiological interest. On the basis of the alignment of human EST sequences, we identified two candidate polymorphisms at nucleotides 824 and 839 in the 3'-untranslated region of the Reprimo gene. The presence of these polymorphisms was confirmed in a Caucasian population (n=82) by the use of the allele specific polymerase chain reaction (PCR). The rare allele frequency at position 824 (38.4%) is much higher than rare allele frequency at position 839 (3.7%). Our results suggest that the human EST data may serve as a valuable source for the rapid identification of genetic variation.
Collapse
Affiliation(s)
- Zheng Ye
- Center for Molecular Genetics and Toxicology, School of Biological Sciences, University of Wales Swansea, Singleton Park, Swansea, United Kingdom.
| | | |
Collapse
|