1
|
Improved protocols for BAC insert DNA isolation, BAC end sequencing and FISH for construction of BAC based physical map of genes on the chromosomes. Mol Biol Rep 2020; 47:2405-2413. [PMID: 32020430 DOI: 10.1007/s11033-020-05283-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 01/27/2020] [Indexed: 10/25/2022]
Abstract
Bacterial artificial chromosome (BAC) library is an important genomic resource useful in targeted marker development, positional cloning, physical mapping and a substrate for genome sequencing for better understanding the genome organization of a species. The present manuscript elucidates the improvement in protocols for economical and efficient BAC insert DNA isolation, BAC end sequencing and FISH for physical localization on the metaphase chromosome complements. BAC clones of Clarias magur, maintained in 384-well plate format in our laboratory, were used in this study. The protocols gave consistent and efficient results. We use routinely these protocols for BAC insert DNA extraction, generating end sequence data of the clone and constructing DNA probes to hybridize on the metaphase spreads of C. magur using FISH for physical their localization.
Collapse
|
2
|
Said R, Zheng L, Saunders T, Zeidler M, Papagerakis S, Papagerakis P. Generation of Amelx-iCre Mice Supports Ameloblast-Specific Role for Stim1. J Dent Res 2019; 98:1002-1010. [PMID: 31329049 DOI: 10.1177/0022034519858976] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The identification and targeting of the molecular pathways regulating amelogenesis is an ongoing challenge in dental research, and progress has been restricted by the limited number of genetic tools available to study gene function in ameloblasts. Here, we generated 4 transgenic Cre-driver mouse lines that express improved Cre (iCre)-recombinase from the locus of the mouse ameloblast-specific gene amelogenin X (Amelx-iCre) with a large (250-kb) bacterial artificial chromosome DNA vector. All 4 Amelx-iCre transgenic lines were bred with ROSA26 reporter mice to characterize the iCre developmental pattern with the LacZ gene encoding β-galactosidase enzyme activity assay and Cre protein immunohistochemistry. From the 4 generated transgenic lines, 2 were selected for further analysis because they expressed a high amount of Cre recombinase exclusively in ameloblasts and showed developmental stage- and cell-specific β-galactosidase activity mimicking the endogenous amelogenin expression. To test the functionality of the selected transgenic models, we bred the 2 Amelx-iCre mice lines with stromal interaction molecule 1 (Stim1) floxed mice to generate ameloblast-specific Stim1 conditional knockout mice (Stim1 cKO). STIM1 protein serves as one of the main calcium sensors in ameloblasts and plays a major role in enamel mineralization and ameloblast differentiation. Amelx-iCre mice displayed exclusive CRE-mediated recombination in incisor and molar ameloblasts. Stim1 cKO mice showed a severely defected enamel phenotype, including reduced structural integrity concomitant with increased attrition and smaller teeth. The phenotype and genotype of the Amelx-iCre/Stim1 cKO showed significant differences with the previously reported Ker14-Cre/Stim1 cKO, highlighting the need for cell- and stage-specific Cre lines for an accurate phenotype-genotype comparison. Furthermore, our model has the advantage of carrying the entire Amelx gene locus rather than being limited to an Amelx partial promoter construct, which greatly enhances the stability and the specificity of our Cre expression. As such, the Amelx-iCre transgenic lines that we developed may serve as a powerful tool for targeting ameloblast-specific gene expression in future investigations.
Collapse
Affiliation(s)
- R Said
- 1 Department of Anatomy and Cell Biology, College of Medicine, University of Saskatchewan, Saskatoon, SK, Canada.,2 College of Dentistry, University of Saskatchewan, Saskatoon, SK, Canada
| | - L Zheng
- 3 Department of Orthodontics, School of Dentistry, Ohio State University, Columbus, OH, USA
| | - T Saunders
- 4 Transgenic Animal Model Core, University of Michigan, Ann Arbor, MI, USA
| | - M Zeidler
- 4 Transgenic Animal Model Core, University of Michigan, Ann Arbor, MI, USA
| | - S Papagerakis
- 5 Department of Otolaryngology-Head and Neck Surgery, School of Medicine, University of Michigan, Ann Arbor, MI, USA.,6 Department of Surgery, College of Medicine, University of Saskatchewan, Saskatoon, SK, Canada
| | - P Papagerakis
- 1 Department of Anatomy and Cell Biology, College of Medicine, University of Saskatchewan, Saskatoon, SK, Canada.,2 College of Dentistry, University of Saskatchewan, Saskatoon, SK, Canada.,7 Department of Cariology, Restorative Sciences, and Endodontics, School of Dentistry, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
3
|
Wei X, Xu Z, Wang G, Hou J, Ma X, Liu H, Liu J, Chen B, Luo M, Xie B, Li R, Ruan J, Liu X. pBACode: a random-barcode-based high-throughput approach for BAC paired-end sequencing and physical clone mapping. Nucleic Acids Res 2017; 45:e52. [PMID: 27980066 PMCID: PMC5397170 DOI: 10.1093/nar/gkw1261] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 12/09/2016] [Indexed: 12/14/2022] Open
Abstract
Applications that use Bacterial Artificial Chromosome (BAC) libraries often require paired-end sequences and knowledge of the physical location of each clone in plates. To facilitate obtaining this information in high-throughput, we generated pBACode vectors: a pool of BAC cloning vectors, each with a pair of random barcodes flanking its cloning site. In a pBACode BAC library, the BAC ends and their linked barcodes can be sequenced in bulk. Barcode pairs are determined by sequencing the empty pBACode vectors, which allows BAC ends to be paired according to their barcodes. For physical clone mapping, the barcodes are used as unique markers for their linked genomic sequence. After multi-dimensional pooling of BAC clones, the barcodes are sequenced and deconvoluted to locate each clone. We generated a pBACode library of 94,464 clones for the flounder Paralichthys olivaceus and obtained paired-end sequence from 95.4% of the clones. Incorporating BAC paired-ends into the genome preassembly improved its continuity by over 10-fold. Furthermore, we were able to use the barcodes to map the physical locations of each clone in just 50 pools, with up to 11 808 clones per pool. Our physical clone mapping located 90.2% of BAC clones, enabling targeted characterization of chromosomal rearrangements.
Collapse
Affiliation(s)
- Xiaolin Wei
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China.,PTN (Peking University-Tsinghua University-National Institute of Biological Sciences) Joint Graduate Program, Beijing 100084, China.,School of Life Sciences, Peking University, Beijing 100084, China
| | - Zhichao Xu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China.,PTN (Peking University-Tsinghua University-National Institute of Biological Sciences) Joint Graduate Program, Beijing 100084, China
| | - Guixing Wang
- Beidaihe Central Experiment Station, Chinese Academy of Fishery Sciences, Qinhuangdao 066100, China
| | - Jilun Hou
- Beidaihe Central Experiment Station, Chinese Academy of Fishery Sciences, Qinhuangdao 066100, China
| | - Xiaopeng Ma
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China.,PTN (Peking University-Tsinghua University-National Institute of Biological Sciences) Joint Graduate Program, Beijing 100084, China
| | - Haijin Liu
- Beidaihe Central Experiment Station, Chinese Academy of Fishery Sciences, Qinhuangdao 066100, China
| | - Jiadong Liu
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Bo Chen
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Meizhong Luo
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Bingyan Xie
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Ruiqiang Li
- Novogene Bioinformatics Institute, Beijing 100083, China
| | - Jue Ruan
- Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Xiao Liu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
4
|
Jiang Y, Xu P, Liu Z. Generation of physical map contig-specific sequences. Front Genet 2014; 5:243. [PMID: 25101119 PMCID: PMC4105628 DOI: 10.3389/fgene.2014.00243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2014] [Accepted: 07/07/2014] [Indexed: 12/13/2022] Open
Abstract
Rapid advances of the next-generation sequencing technologies have allowed whole genome sequencing of many species. However, with the current sequencing technologies, the whole genome sequence assemblies often fall in short in one of the four quality measurements: accuracy, contiguity, connectivity, and completeness. In particular, small-sized contigs and scaffolds limit the applicability of whole genome sequences for genetic analysis. To enhance the quality of whole genome sequence assemblies, particularly the scaffolding capabilities, additional genomic resources are required. Among these, sequences derived from known physical locations offer great powers for scaffolding. In this mini-review, we will describe the principles, procedures and applications of physical-map-derived sequences, with the focus on physical map contig-specific sequences.
Collapse
Affiliation(s)
- Yanliang Jiang
- Centre for Applied Aquatic Genomics, Chinese Academy of Fishery Sciences Beijing, China
| | - Peng Xu
- Centre for Applied Aquatic Genomics, Chinese Academy of Fishery Sciences Beijing, China
| | - Zhanjiang Liu
- Aquatic Genomics Unit, The Fish Molecular Genetics and Biotechnology Laboratory, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University AL, USA
| |
Collapse
|
5
|
HTS-PEG: a method for high throughput sequencing of the paired-ends of genomic libraries. PLoS One 2013; 7:e52257. [PMID: 23284958 PMCID: PMC3527410 DOI: 10.1371/journal.pone.0052257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2012] [Accepted: 11/09/2012] [Indexed: 11/19/2022] Open
Abstract
Second generation sequencing has been widely used to sequence whole genomes. Though various paired-end sequencing methods have been developed to construct the long scaffold from contigs derived from shotgun sequencing, the classical paired-end sequencing of the Bacteria Artificial Chromosome (BAC) or fosmid libraries by the Sanger method still plays an important role in genome assembly. However, sequencing libraries with the Sanger method is expensive and time-consuming. Here we report a new strategy to sequence the paired-ends of genomic libraries with parallel pyrosequencing, using a Chinese amphioxus (Branchiostoma belcheri) BAC library as an example. In total, approximately 12,670 non-redundant paired-end sequences were generated. Mapping them to the primary scaffolds of Chinese amphioxus, we obtained 413 ultra-scaffolds from 1,182 primary scaffolds, and the N50 scaffold length was increased approximately 55 kb, which is about a 10% improvement. We provide a universal and cost-effective method for sequencing the ultra-long paired-ends of genomic libraries. This method can be very easily implemented in other second generation sequencing platforms.
Collapse
|
6
|
SplinkBES: a splinkerette-based method for generating long end sequences from large insert DNA libraries. Biotechniques 2009; 47:681-2, 684, 686, passim. [PMID: 19737131 DOI: 10.2144/000113122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We report on the development of a novel splinkerette-based method for generating long end sequences from large insert library clones, using a carrot (Daucus carota L.) BAC library as a model. The procedure involves digestion of the BAC DNA with a 6-bp restriction enzyme, followed by ligation of splinkerette adaptors that serve as primer-annealing sites for PCR amplification of the BAC ends. The resulting amplicons are sequenced from both directions, and when overlapping, the pairs of sequences are assembled, originating two types of BAC end sequences (BESs): ungapped and gapped. The average sequence length for ungapped and gapped BESs was 698 and 1055 nucleotides, respectively, with an overall average length of 838 nucleotides. This is considerably higher than the average length typically obtained by direct end sequencing. Through the analysis of actual and in silico-generated BES of different lengths from carrot and five model organisms, we demonstrated that longer BESs are more informative, since they had more matches to the GenBank database and contained more simple sequence repeats (SSRs). A pilot high-throughput procedure is proposed for splinkerette-based end sequencing (SplinkBES). This method may contribute to generating more robust BES analysis and provide a richer source of BES-derived markers for genomics, mapping, and breeding.
Collapse
|
7
|
Liu GE, Alkan C, Jiang L, Zhao S, Eichler EE. Comparative analysis of Alu repeats in primate genomes. Genome Res 2009; 19:876-85. [PMID: 19411604 DOI: 10.1101/gr.083972.108] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Using bacteria artificial chromosome (BAC) end sequences (16.9 Mb) and high-quality alignments of genomic sequences (17.4 Mb), we performed a global assessment of the divergence distributions, phylogenies, and consensus sequences for Alu elements in primates including lemur, marmoset, macaque, baboon, and chimpanzee as compared to human. We found that in lemurs, Alu elements show a broader and more symmetric sequence divergence distribution, suggesting a steady rate of Alu retrotransposition activity among prosimians. In contrast, Alu elements in anthropoids show a skewed distribution shifted toward more ancient elements with continual declining rates in recent Alu activity along the hominoid lineage of evolution. Using an integrated approach combining mutation profile and insertion/deletion analyses, we identified nine novel lineage-specific Alu subfamilies in lemur (seven), marmoset (one), and baboon/macaque (one) containing multiple diagnostic mutations distinct from their human counterparts-Alu J, S, and Y subfamilies, respectively. Among these primates, we show that that the lemur has the lowest density of Alu repeats (55 repeats/Mb), while marmoset has the greatest abundance (188 repeats/Mb). We estimate that approximately 70% of lemur and 16% of marmoset Alu elements belong to lineage-specific subfamilies. Our analysis has provided an evolutionary framework for further classification and refinement of the Alu repeat phylogeny. The differences in the distribution and rates of Alu activity have played an important role in subtly reshaping the structure of primate genomes. The functional consequences of these changes among the diverse primate lineages over such short periods of evolutionary time are an important area of future investigation.
Collapse
Affiliation(s)
- George E Liu
- USDA, ARS, ANRI, Bovine Functional Genomics Laboratory, Beltsville, MD 20705, USA.
| | | | | | | | | |
Collapse
|
8
|
Chapus C, Edwards SV. Genome evolution in Reptilia: in silico chicken mapping of 12,000 BAC-end sequences from two reptiles and a basal bird. BMC Genomics 2009; 10 Suppl 2:S8. [PMID: 19607659 PMCID: PMC2966332 DOI: 10.1186/1471-2164-10-s2-s8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND With the publication of the draft chicken genome and the recent production of several BAC clone libraries from non-avian reptiles and birds, it is now possible to undertake more detailed comparative genomic studies in Reptilia. Of interest in particular are the genomic events that transformed the large, repeat-rich genomes of mammals and non-avian reptiles into the minimalist chicken genome. We have used paired BAC end sequences (BESs) from the American alligator (Alligator mississippiensis), painted turtle (Chrysemys picta) and emu (Dromaius novaehollandiae) to investigate patterns of sequence divergence, gene and retroelement content, and microsynteny between these species and chicken. RESULTS From a total of 11,967 curated BESs, we successfully mapped 725, 773 and 2597 sequences in alligator, turtle, and emu, respectively, to sites in the draft chicken genome using a stringent BLAST protocol. Most commonly, sequences mapped to a single site in the chicken genome. Of 1675, 1828 and 2936 paired BESs obtained for alligator, turtle, and emu, respectively, a total of 34 (alligator, 2%), 24 (turtle, 1.3%) and 479 (emu, 16.3%) pairs were found to map with high confidence and in the correct orientation and with BAC-sized intermarker distances to single chicken chromosomes, including 25 such paired hits in emu mapping to the chicken Z chromosome. By determining the insert sizes of a subset of BAC clones from these three species, we also found a significant correlation between the intermarker distance in alligator and turtle and in chicken, with slopes as expected on the basis of the ratio of the genome sizes. CONCLUSION Our results suggest that a large number of small-scale chromosomal rearrangements and deletions in the lineage leading to chicken have drastically reduced the number of detected syntenies observed between the chicken and alligator, turtle, and emu genomes and imply that small deletions occurring widely throughout the genomes of reptilian and avian ancestors led to the ~50% reduction in genome size observed in birds compared to reptiles. We have also mapped and identified likely gene regions in hundreds of new BAC clones from these species.
Collapse
Affiliation(s)
- Charles Chapus
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
9
|
Wu C, Proestou D, Carter D, Nicholson E, Santos F, Zhao S, Zhang HB, Goldsmith MR. Construction and sequence sampling of deep-coverage, large-insert BAC libraries for three model lepidopteran species. BMC Genomics 2009; 10:283. [PMID: 19558662 PMCID: PMC2718931 DOI: 10.1186/1471-2164-10-283] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2008] [Accepted: 06/26/2009] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Manduca sexta, Heliothis virescens, and Heliconius erato represent three widely-used insect model species for genomic and fundamental studies in Lepidoptera. Large-insert BAC libraries of these insects are critical resources for many molecular studies, including physical mapping and genome sequencing, but not available to date. RESULTS We report the construction and characterization of six large-insert BAC libraries for the three species and sampling sequence analysis of the genomes. The six BAC libraries were constructed with two restriction enzymes, two libraries for each species, and each has an average clone insert size ranging from 152-175 kb. We estimated that the genome coverage of each library ranged from 6-9 x, with the two combined libraries of each species being equivalent to 13.0-16.3 x haploid genomes. The genome coverage, quality and utility of the libraries were further confirmed by library screening using 6 approximately 8 putative single-copy probes. To provide a first glimpse into these genomes, we sequenced and analyzed the BAC ends of approximately 200 clones randomly selected from the libraries of each species. The data revealed that the genomes are AT-rich, contain relatively small fractions of repeat elements with a majority belonging to the category of low complexity repeats, and are more abundant in retro-elements than DNA transposons. Among the species, the H. erato genome is somewhat more abundant in repeat elements and simple repeats than those of M. sexta and H. virescens. The BLAST analysis of the BAC end sequences suggested that the evolution of the three genomes is widely varied, with the genome of H. virescens being the most conserved as a typical lepidopteran, whereas both genomes of H. erato and M. sexta appear to have evolved significantly, resulting in a higher level of species- or evolutionary lineage-specific sequences. CONCLUSION The high-quality and large-insert BAC libraries of the insects, together with the identified BACs containing genes of interest, provide valuable information, resources and tools for comprehensive understanding and studies of the insect genomes and for addressing many fundamental questions in Lepidoptera. The sample of the genomic sequences provides the first insight into the constitution and evolution of the insect genomes.
Collapse
Affiliation(s)
- Chengcang Wu
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
- Current address: Lucigen Corporation, 2120 West Greenview Dr, Middleton, WI 53562, USA
| | - Dina Proestou
- Department of Biological Sciences, University of Rhode Island, Kingston, RI 02881-0816, USA
| | - Dorothy Carter
- Department of Biological Sciences, University of Rhode Island, Kingston, RI 02881-0816, USA
| | - Erica Nicholson
- Department of Biological Sciences, University of Rhode Island, Kingston, RI 02881-0816, USA
| | - Filippe Santos
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | - Shaying Zhao
- The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA
| | - Hong-Bin Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | - Marian R Goldsmith
- Department of Biological Sciences, University of Rhode Island, Kingston, RI 02881-0816, USA
| |
Collapse
|
10
|
Li W, Kotoshiba S, Kaldis P. Genetic mouse models to investigate cell cycle regulation. Transgenic Res 2009; 18:491-8. [PMID: 19418238 DOI: 10.1007/s11248-009-9276-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2009] [Accepted: 04/20/2009] [Indexed: 01/21/2023]
Abstract
Early studies on cell cycle regulation were based on experiments in model systems (Yeast, Xenopus, Starfish, Drosophila) and have shaped the way we understand many events that control the cell cycle. Although these model systems are of great value, the last decade was highlighted by studies done in human cells and using in vivo mouse models. Mouse models are irreplaceable tools for understanding the genetics, development, and survival strategies of mammals. New developments in generating targeting vectors and mutant mice have improved our approaches to study cell cycle regulation and cancer. Here we summarize the most recent advances of mouse model approaches in dissecting the mechanisms of cell cycle regulation and the relevance to human disease.
Collapse
Affiliation(s)
- Weimin Li
- Department of Pharmacology, University of Wisconsin-Madison, 3725 MSC, 1300 University Avenue, Madison, WI 53706, USA
| | | | | |
Collapse
|
11
|
Huo N, Vogel JP, Lazo GR, You FM, Ma Y, McMahon S, Dvorak J, Anderson OD, Luo MC, Gu YQ. Structural characterization of Brachypodium genome and its syntenic relationship with rice and wheat. PLANT MOLECULAR BIOLOGY 2009; 70:47-61. [PMID: 19184460 DOI: 10.1007/s11103-009-9456-3] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2008] [Accepted: 01/07/2009] [Indexed: 05/22/2023]
Abstract
Brachypodium distachyon (Brachypodium) has been recently recognized as an emerging model system for both comparative and functional genomics in grass species. In this study, 55,221 repeat masked Brachypodium BAC end sequences (BES) were used for comparative analysis against the 12 rice pseudomolecules. The analysis revealed that approximately 26.4% of BES have significant matches with the rice genome and 82.4% of the matches were homologous to known genes. Further analysis of paired-end BES and approximately 1.0 Mb sequences from nine selected BACs proved to be useful in revealing conserved regions and regions that have undergone considerable genomic changes. Differential gene amplification, insertions/deletions and inversions appeared to be the common evolutionary events that caused variations of microcolinearity at different orthologous genomic regions. It was found that approximately 17% of genes in the two genomes are not colinear in the orthologous regions. Analysis of BAC sequences also revealed higher gene density (approximately 9 kb/gene) and lower repeat DNA content (approximately 13.1%) in Brachypodium when compared to the orthologous rice regions, consistent with the smaller size of the Brachypodium genome. The 119 annotated Brachypodium genes were BLASTN compared against the wheat EST database and deletion bin mapped wheat ESTs. About 77% of the genes retrieved significant matches in the EST database, while 9.2% matched to the bin mapped ESTs. In some cases, genes in single Brachypodium BACs matched to multiple ESTs that were mapped to the same deletion bins, suggesting that the Brachypodium genome will be useful for ordering wheat ESTs within the deletion bins and developing specific markers at targeted regions in the wheat genome.
Collapse
Affiliation(s)
- Naxin Huo
- Genomics and Gene Discovery Research Unit, USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA 94710, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Wang X, Zhang Q, Sun X, Chen Y, Zhai T, Zhuang W, Qi J, Wang Z. Fosmid library construction and initial analysis of end sequences in female half-smooth tongue sole (Cynoglossus semilaevis). MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2009; 11:236-242. [PMID: 18763017 DOI: 10.1007/s10126-008-9137-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2008] [Accepted: 07/27/2008] [Indexed: 05/26/2023]
Abstract
Half-smooth tongue sole (Cynoglossus semilaevis: Pleuronectiformes) is a commercially important cultured marine flatfish in China and forms an important fishery resource, but the research of its genome is underdeveloped. In this study, we constructed a female C. semilaevis fosmid library and analyzed the fosmid end sequences to provide a preliminary assessment of the genome. The library consists of 49,920 clones with an average insert size of about 39 kb, amounting to 3.23 genome equivalents. Fosmid stability assays indicate that female C. semilaevis DNA was stable during propagation in the fosmid system. Library screening with eight microsatellite markers yielded between two and five positive clones, and none of those tested was absent from the library. End-sequencing of both 5' and 3' ends of 1,152 individual clones generated 2,247 sequences after trimming, with an average sequence length of 855 bp. BLASTN searches of the nr and EST databases of GenBank and BLASTX searches of the nr database resulted in 259 (11.53%) and 287 (12.77%) significant hits (E < e (-5)), respectively. Repetitive sequences analysis resulted in 5.23% of base pairs masked using both the Fugu and Danio databases, repetitive elements were composed of retroelements, DNA transposons, satellites, simple repeats, and low-complexity sequences. The fosmid library, in conjunction with the fosmid end sequences, will serve as a useful resource for large-scale genome sequencing, physical mapping, and positional cloning, and provide a better understanding of female C. semilaevis genome.
Collapse
Affiliation(s)
- Xubo Wang
- Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, 5 Yushan Road, Qingdao, 266003, People's Republic of China
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Ratnakumar A, Barris W, McWilliam S, Brauning R, McEwan JC, Snelling WM, Dalrymple BP. A multiway analysis for identifying high integrity bovine BACs. BMC Genomics 2009; 10:46. [PMID: 19166603 PMCID: PMC2660975 DOI: 10.1186/1471-2164-10-46] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2008] [Accepted: 01/23/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined. RESULTS The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects. CONCLUSION Our analysis shows that approximately 95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (approximately 27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.
Collapse
Affiliation(s)
- Abhirami Ratnakumar
- CSIRO Livestock Industries, 306 Carmody Road, St. Lucia, QLD 4067, Australia.
| | | | | | | | | | | | | |
Collapse
|
14
|
Murakami K, Toyoda A, Hattori M, Kuroki Y, Fujiyama A, Kojima T, Matsuda M, Sakaki Y, Yamamoto MT. BAC library construction and BAC end sequencing of five Drosophila species: the comparative map with the D. melanogaster genome. Genes Genet Syst 2008; 83:245-56. [PMID: 18670136 DOI: 10.1266/ggs.83.245] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We constructed and characterized arrayed bacterial artificial chromosome (BAC) libraries of five Drosophila species (D. melanogaster, D. simulans, D. sechellia, D. auraria, and D. ananassae), which are genetically well characterized in the studies of meiosis, evolution, population genetics, and developmental biology. The BAC libraries comprise 8,000 to 12,500 clones for each species, estimated to cover the most of the genomes. We sequenced both ends of most of these BAC clones with a success rate of 91%. Of these, 53,701 clones consisting of non-repetitive BAC end sequences (BESs) were mapped with reference of the public D. melanogaster genome sequences. The BES mapping estimated that the BAC libraries of D. auraria and D. ananassae covered 47% and 57% of the D. melanogaster genome, respectively, and those of D. melanogaster, D. sechellia, and D. simulans covered 94-97%. The low coverage by BESs of D. auraria and D. ananassae may be due to the high sequence divergence with D. melanogaster. From the comparative BES mapping, 111 possible breakpoints of chromosomal rearrangements were identified in these four species. The breakpoints of the major chromosome rearrangement between D. simulans and D. melanogaster on the third chromosome were determined within 20 kb in 84E and 30 kb in 93E/F. Corresponding breakpoints were also identified in D. sechellia. The BAC clones described here will be an important addition to the Drosophila genomic resources.
Collapse
|
15
|
Amniote phylogenomics: testing evolutionary hypotheses with BAC library scanning and targeted clone analysis of large-scale DNA sequences from reptiles. Methods Mol Biol 2008; 422:91-117. [PMID: 18629663 DOI: 10.1007/978-1-59745-581-7_7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/07/2023]
Abstract
Phylogenomics research integrating established principles of systematic biology and taking advantage of the wealth of DNA sequences being generated by genome science holds promise for answering long-standing evolutionary questions with orders of magnitude more primary data than in the past. Although it is unrealistic to expect whole-genome initiatives to proceed rapidly for commercially unimportant species such as reptiles, practical approaches utilizing genomic libraries of large-insert clones pave the way for a phylogenomics of species that are nevertheless essential for testing evolutionary hypotheses within a phylogenetic framework. This chapter reviews the case for adopting genome-enabled approaches to evolutionary studies and outlines a program for using bacterial artificial chromosome (BAC) libraries or plasmid libraries as a basis for completing "genome scans" of reptiles. We have used BACs to close a critical gap in the genome database for Reptilia, the sister group of mammals, and present the methodological approaches taken to achieve this as a guideline for designing similar comparative studies. In addition, we provide a detailed step-by-step protocol for BAC-library screening and shotgun sequencing of specific clones containing target genes of evolutionary interest. Taken together, the genome scanning and shotgun sequencing techniques offer complementary diagnostic potential and can substantially increase the scale and power of analyses aimed at testing evolutionary hypotheses for nonmodel species.
Collapse
|
16
|
Menke DB, Guenther C, Kingsley DM. Dual hindlimb control elements in the Tbx4 gene and region-specific control of bone size in vertebrate limbs. Development 2008; 135:2543-53. [DOI: 10.1242/dev.017384] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The Tbx4 transcription factor is crucial for normal hindlimb and vascular development, yet little is known about how its highly conserved expression patterns are generated. We have used comparative genomics and functional scanning in transgenic mice to identify a dispersed group of enhancers controlling Tbx4 expression in different tissues. Two independent enhancers control hindlimb expression, one located upstream and one downstream of the Tbx4 coding exons. These two enhancers, hindlimb enhancer A and hindlimb enhancer B (HLEA and HLEB), differ in their primary sequence, in their precise patterns of activity within the hindlimb, and in their degree of sequence conservation across animals. HLEB is highly conserved from fish to mammals. Although Tbx4 expression and hindlimb development occur at different axial levels in fish and mammals, HLEB cloned from either fish or mouse is capable of driving expression at the appropriate position of hindlimb development in mouse embryos. HLEA is highly conserved only in mammals. Deletion of HLEA from the endogenous mouse locus reduces expression of Tbx4 in the hindlimb during embryogenesis, bypasses the embryonic lethality of Tbx4-null mutations, and produces viable, fertile mice with characteristic changes in the size of bones in the hindlimb but not the forelimb. We speculate that dual hindlimb enhancers provide a flexible genomic mechanism for altering the strength and location of Tbx4 expression during normal development, making it possible to separately modify the size of forelimb and hindlimb bones during vertebrate evolution.
Collapse
Affiliation(s)
- Douglas B. Menke
- Howard Hughes Medical Institute and Department of Developmental Biology,Stanford University, Stanford, CA 94305-5329, USA
| | - Catherine Guenther
- Howard Hughes Medical Institute and Department of Developmental Biology,Stanford University, Stanford, CA 94305-5329, USA
| | - David M. Kingsley
- Howard Hughes Medical Institute and Department of Developmental Biology,Stanford University, Stanford, CA 94305-5329, USA
| |
Collapse
|
17
|
Rogatcheva MB, Chen K, Larkin DM, Meyers SN, Marron BM, He W, Schook LB, Beever JE. Piggy-BACing the Human Genome I: Constructing a Porcine BAC Physical Map Through Comparative Genomics. Anim Biotechnol 2008; 19:28-42. [DOI: 10.1080/10495390701807634] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
18
|
Li Y, Uhm T, Ren C, Wu C, Santos TS, Lee MK, Yan B, Santos F, Zhang A, Scheuring C, Sanchez A, Millena AC, Nguyen HT, Kou H, Liu D, Zhang HB. A plant-transformation-competent BIBAC/BAC-based map of rice for functional analysis and genetic engineering of its genomic sequence. Genome 2007; 50:278-88. [PMID: 17502901 DOI: 10.1139/g07-006] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Sequencing of the rice genome has provided a platform for functional genomics research of rice and other cereal species. However, multiple approaches are needed to determine the functions of its genes and sequences and to use the genome sequencing results for genetic improvement of cereal crops. Here, we report a plant-transformation-competent, binary bacterial artificial chromosome (BIBAC) and bacterial artificial chromosome (BAC) based map of rice to facilitate these studies. The map was constructed from 20 835 BIBAC and BAC clones, and consisted of 579 overlapping BIBAC/BAC contigs. To facilitate functional analysis of chromosome 8 genomic sequence and cloning of the genes and QTLs mapped to the chromosome, we anchored the chromosomal contigs to the existing rice genetic maps. The chromosomal map consists of 11 contigs, 59 genetic markers, and 36 sequence tagged sites, spanning a total of ca. 38 Mb in physical length. Comparative analysis between the genetic and physical maps of chromosome 8 showed that there are 3 "hot" and 2 "cold" spots of genetic recombination along the chromosomal arms in addition to the "cold spot" in the centromeric region, suggesting that the sequence component contents of a chromosome may affect its local genetic recombination frequencies. Because of its plant transformability, the BIBAC/BAC map could provide a platform for functional analysis of the rice genome sequence and effective use of the sequencing results for gene and QTL cloning and molecular breeding.
Collapse
Affiliation(s)
- Yaning Li
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843-2474, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Wang Z, Miyake T, Edwards SV, Amemiya CT. Tuatara (Sphenodon) Genomics: BAC Library Construction, Sequence Survey, and Application to the DMRT Gene Family. J Hered 2006; 97:541-8. [PMID: 17135461 DOI: 10.1093/jhered/esl040] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The tuatara (Sphenodon punctatus) is of "extraordinary biological interest" as the most distinctive surviving reptilian lineage (Rhyncocephalia) in the world. To provide a genomic resource for an understanding of genome evolution in reptiles, and as part of a larger project to produce genomic resources for various reptiles (evogen.jgi.doe.gov/second_levels/BACs/our_libraries.html), a large-insert bacterial artificial chromosome (BAC) library from a male tuatara was constructed. The library consists of 215 424 individual clones whose average insert size was empirically determined to be 145 kb, yielding a genomic coverage of approximately 6.3x. A BAC-end sequencing analysis of 121 420 bp of sequence revealed a genomic GC content of 46.8%, among the highest observed thus far for vertebrates, and identified several short interspersed repetitive elements (mammalian interspersed repeat-type repeats) and long interspersed repetitive elements, including chicken repeat 1 element. Finally, as a quality control measure the arrayed library was screened with probes corresponding to 2 conserved noncoding regions of the candidate sex-determining gene DMRT1 and the DM domain of the related DMRT2 gene. A deep coverage contig spanning nearly 300 kb was generated, supporting the deep coverage and utility of the library for exploring tuatara genomics.
Collapse
Affiliation(s)
- Zhenshan Wang
- Department of Biology, University of Washington, Seattle, WA 98195, USA.
| | | | | | | |
Collapse
|
20
|
Xu P, Wang S, Liu L, Peatman E, Somridhivej B, Thimmapuram J, Gong G, Liu Z. Channel catfish BAC-end sequences for marker development and assessment of syntenic conservation with other fish species. Anim Genet 2006; 37:321-6. [PMID: 16879340 DOI: 10.1111/j.1365-2052.2006.01453.x] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In the present study, 25 195 BAC ends for channel catfish (Ictalurus punctatus) were sequenced, generating 20 366 clean BAC-end sequences (BES), with an average read length of 557 bp after trimming. A total of 11 414 601 bp were generated, representing approximately 1.2% of the catfish genome. Based on this survey, the catfish genome was found to be highly AT-rich, with 60.7% A+T and 39.3% G+C. Approximately 12% of the catfish genome consisted of dispersed repetitive elements, with the Tc1/mariner transposons making up the largest percentage by base pair (4.57%). Microsatellites were detected in 17.5% of BES. Catfish BACs were anchored to the zebrafish and Tetraodon genome sequences by BLASTN, generating 16% and 8.2% significant hits (E < e(-5)) respectively. A total of 1074 and 773 significant hits were unique to the zebrafish and Tetraodon genomes, respectively, of which 417 and 406, respectively, were identified as known genes in other species, providing a major genome resource for comparative genomic mapping.
Collapse
Affiliation(s)
- P Xu
- The Fish Molecular Genetics and Biotechnology Laboratory, Department of Fisheries and Allied Aquacultures and Program of Cell and Molecular Biosciences, Aquatic Genomics Unit, Auburn University, Auburn, AL 36849, USA
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Scholz C, Döderlein G, Simon HH. One step cloning of defined DNA fragments from large genomic clones. BMB Rep 2006; 39:464-7. [PMID: 16889693 DOI: 10.5483/bmbrep.2006.39.4.464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Recently, the nucleotide sequences of entire genomes became available. This information combined with older sequencing data discloses the exact chromosomal location of millions of nucleotide markers stored in the databases at NCBI, EMBO or DDBJ. Despite having resolved the intron/exon structures of all described genes within these genomes with a stroke of a pen, the sequencing data opens up other interesting possibilities. For example, the genomic mapping of the end sequences of the human, murine and rat BAC libraries generated at The Institute for Genomic Research (TIGR), reveals now the entire encompassed sequence of the inserts for more than a million of these clones. Since these clones are individually stored, they are now an invaluable source for experiments which depend on genomic DNA. Isolation of smaller fragments from such clones with standard methods is a time consuming process. We describe here a reliable one-step cloning technique to obtain a DNA fragment with a defined size and sequence from larger genomic clones in less than 48 hours using a standard vector with a multiple cloning site, and common restriction enzymes and equipment. The only prerequisites are the sequences of ends of the insert and of the underlying genome.
Collapse
Affiliation(s)
- Christian Scholz
- Interdisciplinary Center for Neuroscience, Department of Neuroanatomy, University of Heidelberg, Im Neuenheimer Feld 307, 69120 Heidelberg, Germany
| | | | | |
Collapse
|
22
|
Lai CWJ, Yu Q, Hou S, Skelton RL, Jones MR, Lewis KLT, Murray J, Eustice M, Guan P, Agbayani R, Moore PH, Ming R, Presting GG. Analysis of papaya BAC end sequences reveals first insights into the organization of a fruit tree genome. Mol Genet Genomics 2006; 276:1-12. [PMID: 16703363 DOI: 10.1007/s00438-006-0122-z] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2006] [Accepted: 03/22/2006] [Indexed: 02/03/2023]
Abstract
Papaya (Carica papaya L.) is a major tree fruit crop of tropical and subtropical regions with an estimated genome size of 372 Mbp. We present the analysis of 4.7% of the papaya genome based on BAC end sequences (BESs) representing 17 million high-quality bases. Microsatellites discovered in 5,452 BESs and flanking primer sequences are available to papaya breeding programs at http://www.genomics.hawaii.edu/papaya/BES . Sixteen percent of BESs contain plant repeat elements, the vast majority (83.3%) of which are class I retrotransposons. Several novel papaya-specific repeats were identified. Approximately 19.1% of the BESs have homology to Arabidopsis cDNA. Increasing numbers of completely sequenced plant genomes and BES projects enable novel approaches to comparative plant genomics. Paired BESs of Carica, Arabidopsis, Populus, Brassica and Lycopersicon were mapped onto the completed genomes of Arabidopsis and Populus. In general the level of microsynteny was highest between closely related organisms. However, papaya revealed a higher degree of apparent synteny with the more distantly related poplar than with the more closely related Arabidopsis. This, as well as significant colinearity observed between peach and poplar genome sequences, support recent observations of frequent genome rearrangements in the Arabidopsis lineage and suggest that the poplar genome sequence may be more useful for elucidating the papaya and other rosid genomes. These insights will play a critical role in selecting species and sequencing strategies that will optimally represent crop genomes in sequence databases.
Collapse
Affiliation(s)
- Chun Wan J Lai
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, 1955 East-West Road, Agricultural Sciences Building Room 218, Honolulu, HI, 96822, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Luo M, Kim H, Kudrna D, Sisneros NB, Lee SJ, Mueller C, Collura K, Zuccolo A, Buckingham EB, Grim SM, Yanagiya K, Inoko H, Shiina T, Flajnik MF, Wing RA, Ohta Y. Construction of a nurse shark (Ginglymostoma cirratum) bacterial artificial chromosome (BAC) library and a preliminary genome survey. BMC Genomics 2006; 7:106. [PMID: 16672057 PMCID: PMC1513397 DOI: 10.1186/1471-2164-7-106] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2005] [Accepted: 05/03/2006] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. AIMS In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC) library for the nurse shark, Ginglymostoma cirratum. RESULTS The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 x 1010 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2 that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6-28 primary positive clones per probe of which 50-90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. CONCLUSION We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.
Collapse
Affiliation(s)
- Meizhong Luo
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
- College of Life Sciences and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - HyeRan Kim
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Dave Kudrna
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Nicholas B Sisneros
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - So-Jeong Lee
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Christopher Mueller
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Kristi Collura
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Andrea Zuccolo
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - E Bryan Buckingham
- University of Maryland, Department of Microbiology and Immunology, 655 West Baltimore Street, BRB3-052, Baltimore, MD 21201, USA
| | - Suzanne M Grim
- University of Maryland, Department of Microbiology and Immunology, 655 West Baltimore Street, BRB3-052, Baltimore, MD 21201, USA
| | - Kazuyo Yanagiya
- Department of Molecular Life Science, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, 143 Shimokasuya, Isehara, Kanagawa 259-1143, Japan
| | - Hidetoshi Inoko
- Department of Molecular Life Science, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, 143 Shimokasuya, Isehara, Kanagawa 259-1143, Japan
| | - Takashi Shiina
- Department of Molecular Life Science, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, 143 Shimokasuya, Isehara, Kanagawa 259-1143, Japan
| | - Martin F Flajnik
- University of Maryland, Department of Microbiology and Immunology, 655 West Baltimore Street, BRB3-052, Baltimore, MD 21201, USA
| | - Rod A Wing
- Arizona Genomics Institute, Department of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Yuko Ohta
- University of Maryland, Department of Microbiology and Immunology, 655 West Baltimore Street, BRB3-052, Baltimore, MD 21201, USA
| |
Collapse
|
24
|
Leeb T, Vogl C, Zhu B, de Jong PJ, Binns MM, Chowdhary BP, Scharfe M, Jarek M, Nordsiek G, Schrader F, Blöcker H. A human-horse comparative map based on equine BAC end sequences. Genomics 2006; 87:772-6. [PMID: 16603334 DOI: 10.1016/j.ygeno.2006.03.002] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2005] [Revised: 12/15/2005] [Accepted: 03/04/2006] [Indexed: 11/18/2022]
Abstract
In an effort to increase the density of sequence-based markers for the horse genome we generated 9473 BAC end sequences (BESs) from the CHORI-241 BAC library with an average read length of 677 bp. BLASTN searches with the BESs revealed 4036 meaningful hits (E <or= 10(-5)) in the human genome that provide useful markers for the human-horse comparative map. The 4036 BLASTN hits allowed the anchoring of 3079 BAC clones to the human genome, on average one corresponding equine BAC clone per megabase of human DNA. We used the BLASTN anchored BESs for an in silico prediction of the gene content and chromosome assignment of comparatively mapped equine BAC clones. As a first verification of our in silico mapping strategy we placed 19 equine BESs with matches to HSA6 onto the RH map. All markers were assigned to the predicted localizations on ECA10, ECA20, and ECA31, respectively.
Collapse
Affiliation(s)
- Tosso Leeb
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Bünteweg 17p, 30559 Hannover, Germany.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Hong JM, Chae SH, Oriero N, Larkin DM, Choi CB, Lee JY, Lewin HA, Bae JH, Choi I, Yeo JS. Identification and chromosomal localization of repeat sequences through BAC end sequence analysis in Korean cattle. J Genet 2005; 84:329-35. [PMID: 16385167 DOI: 10.1007/bf02715805] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- J M Hong
- Institute of Biotechnology and Department of Biotechnology, Gyeongsan 712-749, Korea
| | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Milosavljevic A, Harris RA, Sodergren EJ, Jackson AR, Kalafus KJ, Hodgson A, Cree A, Dai W, Csuros M, Zhu B, de Jong PJ, Weinstock GM, Gibbs RA. Pooled genomic indexing of rhesus macaque. Genome Res 2005; 15:292-301. [PMID: 15687293 PMCID: PMC546531 DOI: 10.1101/gr.3162505] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Pooled genomic indexing (PGI) is a method for mapping collections of bacterial artificial chromosome (BAC) clones between species by using a combination of clone pooling and DNA sequencing. PGI has been used to map a total of 3858 BAC clones covering approximately 24% of the rhesus macaque (Macaca mulatta) genome onto 4178 homologous loci in the human genome. A number of intrachromosomal rearrangements were detected by mapping multiple segments within the individual rhesus BACs onto multiple disjoined loci in the human genome. Transversal pooling designs involving shuffled BAC arrays were employed for robust mapping even with modest DNA sequence read coverage. A further innovation, short-tag pooled genomic indexing (ST-PGI), was also introduced to further improve the economy of mapping by sequencing multiple, short, mapable tags within a single sequencing reaction.
Collapse
Affiliation(s)
- Aleksandar Milosavljevic
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Abe K, Noguchi H, Tagawa K, Yuzuriha M, Toyoda A, Kojima T, Ezawa K, Saitou N, Hattori M, Sakaki Y, Moriwaki K, Shiroishi T. Contribution of Asian mouse subspecies Mus musculus molossinus to genomic constitution of strain C57BL/6J, as defined by BAC-end sequence-SNP analysis. Genome Res 2004; 14:2439-47. [PMID: 15574823 PMCID: PMC534668 DOI: 10.1101/gr.2899304] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2004] [Accepted: 09/27/2004] [Indexed: 11/24/2022]
Abstract
MSM/Ms is an inbred strain derived from the Japanese wild mouse, Mus musculus molossinus. It is believed that subspecies molossinus has contributed substantially to the genome constitution of common laboratory strains of mice, although the majority of their genome is derived from the west European M. m. domesticus. Information on the molossinus genome is thus essential not only for genetic studies involving molossinus but also for characterization of common laboratory strains. Here, we report the construction of an arrayed bacterial artificial chromosome (BAC) library from male MSM/Ms genomic DNA, covering approximately 1x genome equivalent. Both ends of 176,256 BAC clone inserts were sequenced, and 62,988 BAC-end sequence (BES) pairs were mapped onto the C57BL/6J genome (NCBI mouse Build 30), covering 2,228,164 kbp or 89% of the total genome. Taking advantage of the BES map data, we established a computer-based clone screening system. Comparison of the MSM/Ms and C57BL/6J sequences revealed 489,200 candidate single nucleotide polymorphisms (SNPs) in 51,137,941 bp sequenced. The overall nucleotide substitution rate was as high as 0.0096. The distribution of SNPs along the C57BL/6J genome was not uniform: The majority of the genome showed a high SNP rate, and only 5.2% of the genome showed an extremely low SNP rate (percentage identity = 0.9997); these sequences are likely derived from the molossinus genome.
Collapse
Affiliation(s)
- Kuniya Abe
- Technology and Development Team for Mammalian Cellular Dynamics, BioResource Center, RIKEN Tsukuba Institute, Tsukuba, Ibaraki 305-0074, Japan.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Zhao S, Shetty J, Hou L, Delcher A, Zhu B, Osoegawa K, de Jong P, Nierman WC, Strausberg RL, Fraser CM. Human, mouse, and rat genome large-scale rearrangements: stability versus speciation. Genome Res 2004; 14:1851-60. [PMID: 15364903 PMCID: PMC524408 DOI: 10.1101/gr.2663304] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Using paired-end sequences from bacterial artificial chromosomes, we have constructed high-resolution synteny and rearrangement breakpoint maps among human, mouse, and rat genomes. Among the >300 syntenic blocks identified are segments of over 40 Mb without any detected interspecies rearrangements, as well as regions with frequently broken synteny and extensive rearrangements. As closely related species, mouse and rat share the majority of the breakpoints and often have the same types of rearrangements when compared with the human genome. However, the breakpoints not shared between them indicate that mouse rearrangements are more often interchromosomal, whereas intrachromosomal rearrangements are more prominent in rat. Centromeres may have played a significant role in reorganizing a number of chromosomes in all three species. The comparison of the three species indicates that genome rearrangements follow a path that accommodates a delicate balance between maintaining a basic structure underlying all mammalian species and permitting variations that are necessary for speciation.
Collapse
Affiliation(s)
- Shaying Zhao
- Institute for Genomic Research, Rockville, Maryland 20850, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Hong CP, Lee SJ, Park JY, Plaha P, Park YS, Lee YK, Choi JE, Kim KY, Lee JH, Lee J, Jin H, Choi SR, Lim YP. Construction of a BAC library of Korean ginseng and initial analysis of BAC-end sequences. Mol Genet Genomics 2004; 271:709-16. [PMID: 15197578 DOI: 10.1007/s00438-004-1021-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2003] [Accepted: 04/30/2004] [Indexed: 10/26/2022]
Abstract
We estimated the genome size of Korean ginseng (Panax ginseng C.A. Meyer), a medicinal herb, constructed a HindIII BAC library, and analyzed BAC-end sequences to provide an initial characterization of the library. The 1C nuclear DNA content of Korean ginseng was estimated to be 3.33 pg (3.12 x 10(3) Mb). The BAC library consists of 106,368 clones with an average size of 98.61 kb, amounting to 3.34 genome equivalents. Sequencing of 2167 BAC clones generated 2492 BAC-end sequences with an average length of 400 bp. Analysis using BLAST and motif searches revealed that 10.2%, 20.9% and 3.8% of the BAC-end sequences contained protein-coding regions, transposable elements and microsatellites, respectively. A comparison of the functional categories represented by the protein-coding regions found in BAC-end sequences with those of Arabidopsis revealed that proteins pertaining to energy metabolism, subcellular localization, cofactor requirement and transport facilitation were more highly represented in the P. ginseng sample. In addition, a sequence encoding a glucosyltransferase-like protein implicated in the ginsenoside biosynthesis pathway was also found. The majority of the transposable element sequences found belonged to the gypsy type (67.6%), followed by copia (11.7%) and LINE (8.0%) retrotransposons, whereas DNA transposons accounted for only 2.1% of the total in our sequence sample. Higher levels of transposable elements than protein-coding regions suggest that mobile elements have played an important role in the evolution of the genome of Korean ginseng, and contributed significantly to its complexity. We also identified 103 microsatellites with 3-38 repeats in their motifs. The BAC library and BAC-end sequences will serve as a useful resource for physical mapping, positional cloning and genome sequencing of P. ginseng.
Collapse
Affiliation(s)
- C P Hong
- Department of Horticulture, and Genome Research Center, Chungnam National University, 305-764, Daejeon, Korea
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Chung YJ, Jonkers J, Kitson H, Fiegler H, Humphray S, Scott C, Hunt S, Yu Y, Nishijima I, Velds A, Holstege H, Carter N, Bradley A. A whole-genome mouse BAC microarray with 1-Mb resolution for analysis of DNA copy number changes by array comparative genomic hybridization. Genome Res 2004; 14:188-96. [PMID: 14707179 PMCID: PMC314296 DOI: 10.1101/gr.1878804] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2003] [Accepted: 10/29/2003] [Indexed: 11/24/2022]
Abstract
Microarray-based comparative genomic hybridization (CGH) has become a powerful method for the genome-wide detection of chromosomal imbalances. Although BAC microarrays have been used for mouse CGH studies, the resolving power of these analyses was limited because high-density whole-genome mouse BAC microarrays were not available. We therefore developed a mouse BAC microarray containing 2803 unique BAC clones from mouse genomic libraries at 1-Mb intervals. For the general amplification of BAC clone DNA prior to spotting, we designed a set of three novel degenerate oligonucleotide-primed (DOP) PCR primers that preferentially amplify mouse genomic sequences while minimizing unwanted amplification of contaminating Escherichia coli DNA. The resulting 3K mouse BAC microarrays reproducibly identified DNA copy number alterations in cell lines and primary tumors, such as single-copy deletions, regional amplifications, and aneuploidy.
Collapse
Affiliation(s)
- Yeun-Jun Chung
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Larkin DM, Everts-van der Wind A, Rebeiz M, Schweitzer PA, Bachman S, Green C, Wright CL, Campos EJ, Benson LD, Edwards J, Liu L, Osoegawa K, Womack JE, de Jong PJ, Lewin HA. A cattle-human comparative map built with cattle BAC-ends and human genome sequence. Genome Res 2003; 13:1966-72. [PMID: 12902387 PMCID: PMC403790 DOI: 10.1101/gr.1560203] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
As a step toward the goal of adding the cattle genome to those available for multispecies comparative genome analysis, 40,224 cattle BAC clones were end-sequenced, yielding 60,547 sequences (BAC end sequences, BESs) after trimming with an average read length of 515 bp. Cattle BACs were anchored to the human and mouse genome sequences by BLASTN search, revealing 29.4% and 10.1% significant hits (E < e-5), respectively. More than 60% of all cattle BES hits in both the human and mouse genomes are located within known genes. In order to confirm in silico predictions of orthologyand their relative position on cattle chromosomes, 84 cattle BESs with similarity to sequences on HSA11 were mapped using a cattle-hamster radiation hybrid (RH) panel. Resulting RH maps of BTA15 and BTA29 cover approximately 85% of HSA11 sequence, revealing a complex patchwork shuffling of segments not explained by a simple translocation followed by internal rearrangements. Overlay of the mouse conserved syntenies onto HSA11 revealed that segmental boundaries appear to be conserved in all three species. The BAC clone-based comparative map provides a foundation for the evolutionary analysis of mammalian karyotypes and for sequencing of the cattle genome.
Collapse
Affiliation(s)
- Denis M Larkin
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801 USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Bradeen JM, Naess SK, Song J, Haberlach GT, Wielgus SM, Buell CR, Jiang J, Helgeson JP. Concomitant reiterative BAC walking and fine genetic mapping enable physical map development for the broad-spectrum late blight resistance region, RB. Mol Genet Genomics 2003; 269:603-11. [PMID: 12827499 DOI: 10.1007/s00438-003-0865-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2002] [Accepted: 05/06/2003] [Indexed: 12/17/2022]
Abstract
The wild potato species Solanum bulbocastanum is a source of genes for potent late blight resistance. We previously mapped resistance to a single region of the S. bulbocastanum chromosome 8 and named the region RB (for "Resistance from S. Bulbocastanum"). We now report physical mapping and contig construction for the RB region via a novel reiterative method of BAC walking and concomitant fine genetic mapping. BAC walking was initiated using RFLP markers previously shown to be associated with late blight resistance. Subcontig extension was accomplished using new probes developed from BAC ends. Significantly, BAC end and partial BAC sequences were also used to develop PCR-based markers to enhance map resolution in the RB region. As they were developed from BAC clones of known position relative to RB, our PCR-based markers are known a priori to be physically closer to the resistance region. These markers allowed the efficient screening of large numbers of segregating progeny at the cotyledon stage, and permitted us to assign the resistance phenotype to a region of approximately 55 kb. Our markers also directed BAC walking efforts away from regions distantly related to RB in favor of the 55-kb region. Because the S. bulbocastanum genotype used in BAC library construction is heterozygous for RB (RB/rb), codominant PCR-based markers, originally developed for fine-scale mapping, were also used to determine homolog origins for individual BAC clones. Ultimately, BAC contigs were constructed for the RB region from both resistant (RB) and susceptible (rb) homologs.
Collapse
Affiliation(s)
- J M Bradeen
- Department of Plant Pathology, University of Wisconsin, 1630 Linden Drive, Madison, WI 53706, USA
| | | | | | | | | | | | | | | |
Collapse
|
33
|
Hong YS, Hogan JR, Wang X, Sarkar A, Sim C, Loftus BJ, Ren C, Huff ER, Carlile JL, Black K, Zhang HB, Gardner MJ, Collins FH. Construction of a BAC library and generation of BAC end sequence-tagged connectors for genome sequencing of the African malaria mosquito Anopheles gambiae. Mol Genet Genomics 2003; 268:720-8. [PMID: 12655398 DOI: 10.1007/s00438-003-0813-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2002] [Accepted: 01/06/2003] [Indexed: 11/28/2022]
Abstract
A Bacterial Artificial Chromosome (BAC) genomic DNA library of Anopheles gambiae, the major human malaria vector in sub-Saharan Africa, was constructed and characterized. This library (ND-TAM) is composed of 30,720 BAC clones in eighty 384-well plates. The estimated average insert size of the library is 133 kb, with an overall genome coverage of approximately 14-fold. The ends of approximately two-thirds of the clones in the library were sequenced, yielding 32,340 pair-mate ends. A statistical analysis (G-test) of the results of PCR screening of the library indicated a random distribution of BACs in the genome, although one gap encompassing the white locus on the X-chromosome was identified. Furthermore, combined with another previously constructed BAC library (ND-1), ~2,000 BACs have been physically mapped by polytene chromosomal in situ hybridization. These BAC end pair mates and physically mapped BACs have been useful for both the assembly of a fully sequenced A. gambiae genome and for linking the assembled sequence to the three polytene chromosomes. This ND-TAM library is now publicly available at both http://www.malaria.mr4.org/mr4pages/index.html/ and http://hbz.tamu.edu/, providing a valuable resource to the mosquito research community.
Collapse
Affiliation(s)
- Y S Hong
- Center for Tropical Disease Research and Training, Department of Biological Sciences, University of Notre Dame, IN 46556, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
Autoimmune diseases are, in general, under complex genetic control and subject to strong interactions between genetics and the environment. Greater knowledge of the underlying genetics will provide immunologists with a framework for study of the immune dysregulation that occurs in such diseases. Ascertaining the number of genes that are involved and their characterization have, however, proven to be difficult. Improved methods of genetic analysis and the availability of a draft sequence of the complete mouse genome have markedly improved the outlook for such research, and they have emphasized the advantages of mice as a model system. In this review, we provide an overview of the genetic analysis of autoimmune diseases and of the crucial role of congenic and consomic mouse strains in such research.
Collapse
Affiliation(s)
- Ute C Rogner
- Institut Pasteur, Unité Génétique Moléculaire Murine, 25 rue du Docteur Roux, 75015 Paris, France
| | | |
Collapse
|
35
|
Jaffe DB, Butler J, Gnerre S, Mauceli E, Lindblad-Toh K, Mesirov JP, Zody MC, Lander ES. Whole-genome sequence assembly for mammalian genomes: Arachne 2. Genome Res 2003; 13:91-6. [PMID: 12529310 PMCID: PMC430950 DOI: 10.1101/gr.828403] [Citation(s) in RCA: 244] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2002] [Accepted: 10/30/2002] [Indexed: 10/27/2022]
Abstract
We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal changes were simultaneously made and applied to the assembly of the mouse genome, during a six-month period of development: (1) Supercontigs (scaffolds) were iteratively broken and rejoined using several criteria, yielding a 64-fold increase in length (N50), and apparent elimination of all global misjoins; (2) gaps between contigs in supercontigs were filled (partially or completely) by insertion of reads, as suggested by pairing within the supercontig, increasing the N50 contig length by 50%; (3) memory usage was reduced fourfold. The outcome of this mouse assembly and its analysis are described in (Mouse Genome Sequencing Consortium 2002).
Collapse
Affiliation(s)
- David B Jaffe
- Whitehead Institute/MIT Center for Genome Research, Cambridge, Massachusetts 02141, USA.
| | | | | | | | | | | | | | | |
Collapse
|
36
|
Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, Carninci P, Cawley S, Chiaromonte F, Chinwalla AT, Church DM, Clamp M, Clee C, Collins FS, Cook LL, Copley RR, Coulson A, Couronne O, Cuff J, Curwen V, Cutts T, Daly M, David R, Davies J, Delehaunty KD, Deri J, Dermitzakis ET, Dewey C, Dickens NJ, Diekhans M, Dodge S, Dubchak I, Dunn DM, Eddy SR, Elnitski L, Emes RD, Eswara P, Eyras E, Felsenfeld A, Fewell GA, Flicek P, Foley K, Frankel WN, Fulton LA, Fulton RS, Furey TS, Gage D, Gibbs RA, Glusman G, Gnerre S, Goldman N, Goodstadt L, Grafham D, Graves TA, Green ED, Gregory S, Guigó R, Guyer M, Hardison RC, Haussler D, Hayashizaki Y, Hillier LW, Hinrichs A, Hlavina W, Holzer T, Hsu F, Hua A, Hubbard T, Hunt A, Jackson I, Jaffe DB, Johnson LS, Jones M, Jones TA, Joy A, Kamal M, Karlsson EK, Karolchik D, Kasprzyk A, Kawai J, Keibler E, Kells C, Kent WJ, Kirby A, Kolbe DL, Korf I, Kucherlapati RS, Kulbokas EJ, Kulp D, Landers T, Leger JP, Leonard S, Letunic I, Levine R, Li J, Li M, Lloyd C, Lucas S, Ma B, Maglott DR, Mardis ER, Matthews L, Mauceli E, Mayer JH, McCarthy M, McCombie WR, McLaren S, McLay K, McPherson JD, Meldrim J, Meredith B, Mesirov JP, Miller W, Miner TL, Mongin E, Montgomery KT, Morgan M, Mott R, Mullikin JC, Muzny DM, Nash WE, Nelson JO, Nhan MN, Nicol R, Ning Z, Nusbaum C, O'Connor MJ, Okazaki Y, Oliver K, Overton-Larty E, Pachter L, Parra G, Pepin KH, Peterson J, Pevzner P, Plumb R, Pohl CS, Poliakov A, Ponce TC, Ponting CP, Potter S, Quail M, Reymond A, Roe BA, Roskin KM, Rubin EM, Rust AG, Santos R, Sapojnikov V, Schultz B, Schultz J, Schwartz MS, Schwartz S, Scott C, Seaman S, Searle S, Sharpe T, Sheridan A, Shownkeen R, Sims S, Singer JB, Slater G, Smit A, Smith DR, Spencer B, Stabenau A, Stange-Thomann N, Sugnet C, Suyama M, Tesler G, Thompson J, Torrents D, Trevaskis E, Tromp J, Ucla C, Ureta-Vidal A, Vinson JP, Von Niederhausern AC, Wade CM, Wall M, Weber RJ, Weiss RB, Wendl MC, West AP, Wetterstrand K, Wheeler R, Whelan S, Wierzbowski J, Willey D, Williams S, Wilson RK, Winter E, Worley KC, Wyman D, Yang S, Yang SP, Zdobnov EM, Zody MC, Lander ES. Initial sequencing and comparative analysis of the mouse genome. Nature 2002; 420:520-62. [PMID: 12466850 DOI: 10.1038/nature01262] [Citation(s) in RCA: 4849] [Impact Index Per Article: 220.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2002] [Accepted: 10/31/2002] [Indexed: 12/18/2022]
Abstract
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
Collapse
MESH Headings
- Animals
- Base Composition
- Chromosomes, Mammalian/genetics
- Conserved Sequence/genetics
- CpG Islands/genetics
- Evolution, Molecular
- Gene Expression Regulation
- Genes/genetics
- Genetic Variation/genetics
- Genome
- Genome, Human
- Genomics
- Humans
- Mice/classification
- Mice/genetics
- Mice, Knockout
- Mice, Transgenic
- Models, Animal
- Multigene Family/genetics
- Mutagenesis
- Neoplasms/genetics
- Physical Chromosome Mapping
- Proteome/genetics
- Pseudogenes/genetics
- Quantitative Trait Loci/genetics
- RNA, Untranslated/genetics
- Repetitive Sequences, Nucleic Acid/genetics
- Selection, Genetic
- Sequence Analysis, DNA
- Sex Chromosomes/genetics
- Species Specificity
- Synteny
Collapse
|
37
|
Abstract
In the postgenomic era the mouse will be central to the challenge of ascribing a function to the 40,000 or so genes that constitute our genome. In this review, we summarize some of the classic and modern approaches that have fueled the recent dramatic explosion in mouse genetics. Together with the sequencing of the mouse genome, these tools will have a profound effect on our ability to generate new and more accurate mouse models and thus provide a powerful insight into the function of human genes during the processes of both normal development and disease.
Collapse
|
38
|
Ma L, Liu Y, Ky B, Shughrue PJ, Austin CP, Morris JA. Cloning and characterization of Disc1, the mouse ortholog of DISC1 (Disrupted-in-Schizophrenia 1). Genomics 2002; 80:662-72. [PMID: 12504857 DOI: 10.1006/geno.2002.7012] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We cloned the mouse ortholog of DISC1 (Disrupted-in-Schizophrenia 1), a candidate gene for schizophrenia. Disc1 is 3163 nucleotides long and has 60% identity with the human DISC1. Disc1 encodes 851 amino acids and has 56% identity with the human protein. Disc1 maps to the DISC1 syntenic region in the mouse, and genomic structure is conserved. A Disc1 splice variant deletes a portion of Disc1 beginning at amino acids orthologous to the human truncation. Bioinformatic analysis and cross-species comparisons revealed sequence conservation distributed across the genes and conservation of leucine zipper and coiled-coil domains in both orthologs. In situ hybridization in adult mouse brain revealed a restricted expression pattern, with highest levels in the dentate gyrus of the hippocampus and lower expression in CA1-CA3 of the hippocampus, cerebellum, cerebral cortex, and olfactory bulbs. Identification of Disc1 will facilitate the study of DISC1's function and creation of mouse models of DISC1 disruption.
Collapse
MESH Headings
- Alternative Splicing
- Amino Acid Sequence
- Animals
- Base Sequence
- Blotting, Northern
- Brain/metabolism
- Chromosome Mapping
- Chromosomes, Human, Pair 1/genetics
- Cloning, Molecular
- DNA, Complementary/chemistry
- DNA, Complementary/genetics
- Exons
- Gene Expression
- Genes/genetics
- Humans
- In Situ Hybridization
- Introns
- Male
- Mice
- Molecular Sequence Data
- Nerve Tissue Proteins/genetics
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Rats
- Sequence Alignment
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Synteny
Collapse
Affiliation(s)
- Lei Ma
- Department of Neuroscience, West Point, Pennsylvania 19486, USA
| | | | | | | | | | | |
Collapse
|
39
|
Holt RA, Subramanian GM, Halpern A, Sutton GG, Charlab R, Nusskern DR, Wincker P, Clark AG, Ribeiro JMC, Wides R, Salzberg SL, Loftus B, Yandell M, Majoros WH, Rusch DB, Lai Z, Kraft CL, Abril JF, Anthouard V, Arensburger P, Atkinson PW, Baden H, de Berardinis V, Baldwin D, Benes V, Biedler J, Blass C, Bolanos R, Boscus D, Barnstead M, Cai S, Center A, Chaturverdi K, Christophides GK, Chrystal MA, Clamp M, Cravchik A, Curwen V, Dana A, Delcher A, Dew I, Evans CA, Flanigan M, Grundschober-Freimoser A, Friedli L, Gu Z, Guan P, Guigo R, Hillenmeyer ME, Hladun SL, Hogan JR, Hong YS, Hoover J, Jaillon O, Ke Z, Kodira C, Kokoza E, Koutsos A, Letunic I, Levitsky A, Liang Y, Lin JJ, Lobo NF, Lopez JR, Malek JA, McIntosh TC, Meister S, Miller J, Mobarry C, Mongin E, Murphy SD, O'Brochta DA, Pfannkoch C, Qi R, Regier MA, Remington K, Shao H, Sharakhova MV, Sitter CD, Shetty J, Smith TJ, Strong R, Sun J, Thomasova D, Ton LQ, Topalis P, Tu Z, Unger MF, Walenz B, Wang A, Wang J, Wang M, Wang X, Woodford KJ, Wortman JR, Wu M, Yao A, Zdobnov EM, Zhang H, Zhao Q, Zhao S, Zhu SC, Zhimulev I, Coluzzi M, della Torre A, Roth CW, Louis C, Kalush F, Mural RJ, Myers EW, Adams MD, Smith HO, Broder S, Gardner MJ, Fraser CM, Birney E, Bork P, Brey PT, Venter JC, Weissenbach J, Kafatos FC, Collins FH, Hoffman SL. The genome sequence of the malaria mosquito Anopheles gambiae. Science 2002; 298:129-49. [PMID: 12364791 DOI: 10.1126/science.1076181] [Citation(s) in RCA: 1399] [Impact Index Per Article: 63.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Anopheles gambiae is the principal vector of malaria, a disease that afflicts more than 500 million people and causes more than 1 million deaths each year. Tenfold shotgun sequence coverage was obtained from the PEST strain of A. gambiae and assembled into scaffolds that span 278 million base pairs. A total of 91% of the genome was organized in 303 scaffolds; the largest scaffold was 23.1 million base pairs. There was substantial genetic variation within this strain, and the apparent existence of two haplotypes of approximately equal frequency ("dual haplotypes") in a substantial fraction of the genome likely reflects the outbred nature of the PEST strain. The sequence produced a conservative inference of more than 400,000 single-nucleotide polymorphisms that showed a markedly bimodal density distribution. Analysis of the genome sequence revealed strong evidence for about 14,000 protein-encoding transcripts. Prominent expansions in specific families of proteins likely involved in cell adhesion and immunity were noted. An expressed sequence tag analysis of genes regulated by blood feeding provided insights into the physiological adaptations of a hematophagous insect.
Collapse
Affiliation(s)
- Robert A Holt
- Celera Genomics, 45 West Gude Drive, Rockville, MD 20850, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Gregory SG, Sekhon M, Schein J, Zhao S, Osoegawa K, Scott CE, Evans RS, Burridge PW, Cox TV, Fox CA, Hutton RD, Mullenger IR, Phillips KJ, Smith J, Stalker J, Threadgold GJ, Birney E, Wylie K, Chinwalla A, Wallis J, Hillier L, Carter J, Gaige T, Jaeger S, Kremitzki C, Layman D, Maas J, McGrane R, Mead K, Walker R, Jones S, Smith M, Asano J, Bosdet I, Chan S, Chittaranjan S, Chiu R, Fjell C, Fuhrmann D, Girn N, Gray C, Guin R, Hsiao L, Krzywinski M, Kutsche R, Lee SS, Mathewson C, McLeavy C, Messervier S, Ness S, Pandoh P, Prabhu AL, Saeedi P, Smailus D, Spence L, Stott J, Taylor S, Terpstra W, Tsai M, Vardy J, Wye N, Yang G, Shatsman S, Ayodeji B, Geer K, Tsegaye G, Shvartsbeyn A, Gebregeorgis E, Krol M, Russell D, Overton L, Malek JA, Holmes M, Heaney M, Shetty J, Feldblyum T, Nierman WC, Catanese JJ, Hubbard T, Waterston RH, Rogers J, de Jong PJ, Fraser CM, Marra M, McPherson JD, Bentley DR. A physical map of the mouse genome. Nature 2002; 418:743-50. [PMID: 12181558 DOI: 10.1038/nature00957] [Citation(s) in RCA: 251] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy.
Collapse
Affiliation(s)
- Simon G Gregory
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Abstract
The emergence of proteomics, the large-scale analysis of proteins, has been inspired by the realization that the final product of a gene is inherently more complex and closer to function than the gene itself. Shortfalls in the ability of bioinformatics to predict both the existence and function of genes have also illustrated the need for protein analysis. Moreover, only through the study of proteins can posttranslational modifications be determined, which can profoundly affect protein function. Proteomics has been enabled by the accumulation of both DNA and protein sequence databases, improvements in mass spectrometry, and the development of computer algorithms for database searching. In this review, we describe why proteomics is important, how it is conducted, and how it can be applied to complement other existing technologies. We conclude that currently, the most practical application of proteomics is the analysis of target proteins as opposed to entire proteomes. This type of proteomics, referred to as functional proteomics, is always driven by a specific biological question. In this way, protein identification and characterization has a meaningful outcome. We discuss some of the advantages of a functional proteomics approach and provide examples of how different methodologies can be utilized to address a wide variety of biological problems.
Collapse
Affiliation(s)
- Paul R Graves
- Department of Pharmacology and Cancer Biology, Duke University, Durham, North Carolina 27710, USA
| | | |
Collapse
|
42
|
Lindblad-Toh K, Lander ES, McPherson JD, Waterston RH, Rodgers J, Birney E. Progress in sequencing the mouse genome. Genesis 2001; 31:137-41. [PMID: 11783003 DOI: 10.1002/gene.10016] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|