1
|
How small is the center of science? Short cross-disciplinary cycles in co-authorship graphs. Scientometrics 2014. [DOI: 10.1007/s11192-014-1468-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
2
|
Venter JC, Levy S, Stockwell T, Remington K, Halpern A. Massive parallelism, randomness and genomic advances. Nat Genet 2003; 33 Suppl:219-27. [PMID: 12610531 DOI: 10.1038/ng1114] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
In reviewing the past decade, it is clear that genomics was, and still is, driven by innovative technologies, perhaps more so than any other scientific area in recent memory. From the outset, computing, mathematics and new automated laboratory techniques have been key components in allowing the field to move forward rapidly. We highlight some key innovations that have come together to nurture the explosive growth that makes a new era of genomics a reality. We also document how these new approaches have fueled further innovations and discoveries.
Collapse
Affiliation(s)
- J Craig Venter
- The Center for the Advancement of Genomics, 1901 Research Blvd., Rockville, Maryland 20850, USA.
| | | | | | | | | |
Collapse
|
3
|
Mardis E, McPherson J, Martienssen R, Wilson RK, McCombie WR. What is finished, and why does it matter. Genome Res 2002; 12:669-71. [PMID: 11997333 DOI: 10.1101/gr.032102] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- Elaine Mardis
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | | | | | | | | |
Collapse
|
4
|
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang J, Gabor Miklos GL, Nelson C, Broder S, Clark AG, Nadeau J, McKusick VA, Zinder N, Levine AJ, Roberts RJ, Simon M, Slayman C, Hunkapiller M, Bolanos R, Delcher A, Dew I, Fasulo D, Flanigan M, Florea L, Halpern A, Hannenhalli S, Kravitz S, Levy S, Mobarry C, Reinert K, Remington K, Abu-Threideh J, Beasley E, Biddick K, Bonazzi V, Brandon R, Cargill M, Chandramouliswaran I, Charlab R, Chaturvedi K, Deng Z, Di Francesco V, Dunn P, Eilbeck K, Evangelista C, Gabrielian AE, Gan W, Ge W, Gong F, Gu Z, Guan P, Heiman TJ, Higgins ME, Ji RR, Ke Z, Ketchum KA, Lai Z, Lei Y, Li Z, Li J, Liang Y, Lin X, Lu F, Merkulov GV, Milshina N, Moore HM, Naik AK, Narayan VA, Neelam B, Nusskern D, Rusch DB, Salzberg S, Shao W, Shue B, Sun J, Wang Z, Wang A, Wang X, Wang J, Wei M, Wides R, Xiao C, Yan C, Yao A, Ye J, Zhan M, Zhang W, Zhang H, Zhao Q, Zheng L, Zhong F, Zhong W, Zhu S, Zhao S, Gilbert D, Baumhueter S, Spier G, Carter C, Cravchik A, Woodage T, Ali F, An H, Awe A, Baldwin D, Baden H, Barnstead M, Barrow I, Beeson K, Busam D, Carver A, Center A, Cheng ML, Curry L, Danaher S, Davenport L, Desilets R, Dietz S, Dodson K, Doup L, Ferriera S, Garg N, Gluecksmann A, Hart B, Haynes J, Haynes C, Heiner C, Hladun S, Hostin D, Houck J, Howland T, Ibegwam C, Johnson J, Kalush F, Kline L, Koduru S, Love A, Mann F, May D, McCawley S, McIntosh T, McMullen I, Moy M, Moy L, Murphy B, Nelson K, Pfannkoch C, Pratts E, Puri V, Qureshi H, Reardon M, Rodriguez R, Rogers YH, Romblad D, Ruhfel B, Scott R, Sitter C, Smallwood M, Stewart E, Strong R, Suh E, Thomas R, Tint NN, Tse S, Vech C, Wang G, Wetter J, Williams S, Williams M, Windsor S, Winn-Deen E, Wolfe K, Zaveri J, Zaveri K, Abril JF, Guigó R, Campbell MJ, Sjolander KV, Karlak B, Kejariwal A, Mi H, Lazareva B, Hatton T, Narechania A, Diemer K, Muruganujan A, Guo N, Sato S, Bafna V, Istrail S, Lippert R, Schwartz R, Walenz B, Yooseph S, Allen D, Basu A, Baxendale J, Blick L, Caminha M, Carnes-Stine J, Caulk P, Chiang YH, Coyne M, Dahlke C, Deslattes Mays A, Dombroski M, Donnelly M, Ely D, Esparham S, Fosler C, Gire H, Glanowski S, Glasser K, Glodek A, Gorokhov M, Graham K, Gropman B, Harris M, Heil J, Henderson S, Hoover J, Jennings D, Jordan C, Jordan J, Kasha J, Kagan L, Kraft C, Levitsky A, Lewis M, Liu X, Lopez J, Ma D, Majoros W, McDaniel J, Murphy S, Newman M, Nguyen T, Nguyen N, Nodell M, Pan S, Peck J, Peterson M, Rowe W, Sanders R, Scott J, Simpson M, Smith T, Sprague A, Stockwell T, Turner R, Venter E, Wang M, Wen M, Wu D, Wu M, Xia A, Zandieh A, Zhu X. The sequence of the human genome. Science 2001; 291:1304-51. [PMID: 11181995 DOI: 10.1126/science.1058040] [Citation(s) in RCA: 7690] [Impact Index Per Article: 334.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was generated by the whole-genome shotgun sequencing method. The 14.8-billion bp DNA sequence was generated over 9 months from 27,271,853 high-quality sequence reads (5.11-fold coverage of the genome) from both ends of plasmid clones made from the DNA of five individuals. Two assembly strategies-a whole-genome assembly and a regional chromosome assembly-were used, each combining sequence data from Celera and the publicly funded genome effort. The public data were shredded into 550-bp segments to create a 2.9-fold coverage of those genome regions that had been sequenced, without including biases inherent in the cloning and assembly procedure used by the publicly funded group. This brought the effective coverage in the assemblies to eightfold, reducing the number and size of gaps in the final assembly over what would be obtained with 5.11-fold coverage. The two assembly strategies yielded very similar results that largely agree with independent mapping data. The assemblies effectively cover the euchromatic regions of the human chromosomes. More than 90% of the genome is in scaffold assemblies of 100,000 bp or more, and 25% of the genome is in scaffolds of 10 million bp or larger. Analysis of the genome sequence revealed 26,588 protein-encoding transcripts for which there was strong corroborating evidence and an additional approximately 12,000 computationally derived genes with mouse matches or other weak supporting evidence. Although gene-dense clusters are obvious, almost half the genes are dispersed in low G+C sequence separated by large tracts of apparently noncoding sequence. Only 1.1% of the genome is spanned by exons, whereas 24% is in introns, with 75% of the genome being intergenic DNA. Duplications of segmental blocks, ranging in size up to chromosomal lengths, are abundant throughout the genome and reveal a complex evolutionary history. Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems. DNA sequence comparisons between the consensus sequence and publicly funded genome data provided locations of 2.1 million single-nucleotide polymorphisms (SNPs). A random pair of human haploid genomes differed at a rate of 1 bp per 1250 on average, but there was marked heterogeneity in the level of polymorphism across the genome. Less than 1% of all SNPs resulted in variation in proteins, but the task of determining which SNPs have functional consequences remains an open challenge.
Collapse
Affiliation(s)
- J C Venter
- Celera Genomics, 45 West Gude Drive, Rockville, MD 20850, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Citation(s) in RCA: 14567] [Impact Index Per Article: 633.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Sibley K, Cuthbert-Heavens D, Knowles MA. Loss of heterozygosity at 4p16.3 and mutation of FGFR3 in transitional cell carcinoma. Oncogene 2001; 20:686-91. [PMID: 11314002 DOI: 10.1038/sj.onc.1204110] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2000] [Revised: 10/31/2000] [Accepted: 11/13/2000] [Indexed: 01/06/2023]
Abstract
4p16.3 has previously been identified as a region of non-random LOH in transitional cell carcinoma, suggesting the presence of a tumour suppressor gene. One candidate within this region is fibroblast growth factor receptor 3 (FGFR3). Germline mutations in FGFR3 are known to cause several autosomal dominant skeletal dysplasias, the severity of which depends on the position and nature of the mutation in the protein. We investigated the frequency and nature of FGFR3 mutations in a panel of transitional cell carcinomas and cell lines and studied the possible link between mutation and loss of heterozygosity (LOH) on 4p16.3. FGFR3 coding sequence from 63 transitional cell carcinomas (TCC) of various stages and grades, and 18 cell lines was analysed by fluorescent SSCP. Samples with abnormal migration patterns were sequenced to identify the mutation or polymorphism. Thirty-one of the 63 tumours had previously been assessed to have LOH at 4p16.3. Twenty-six of the 63 tumours (41%) and 4/18 (22%) of the cell lines had missense mutations in FGFR3. All mutations detected in our panel have been reported in the germline where all apart from one cause lethal conditions. One tumour contained K652Q which has recently been identified in less severe cases of skeletal dysplasia. Tumours with and without LOH at 4p16.3 had mutations in FGFR3 suggesting that these two events are not causally linked. The frequency of FGFR3 mutation indicates that this protein plays an important role in TCC.
Collapse
MESH Headings
- Carcinoma, Transitional Cell/classification
- Carcinoma, Transitional Cell/genetics
- Chromosomes, Human, Pair 4/genetics
- DNA, Neoplasm
- Humans
- Loss of Heterozygosity
- Polymerase Chain Reaction
- Polymorphism, Single-Stranded Conformational
- Protein-Tyrosine Kinases
- Receptor, Fibroblast Growth Factor, Type 3
- Receptors, Fibroblast Growth Factor/genetics
- Sequence Analysis, DNA
- Urinary Bladder Neoplasms/classification
- Urinary Bladder Neoplasms/genetics
Collapse
Affiliation(s)
- K Sibley
- ICRF Clinical Centre, St James's University Hospital, Beckett Street, Leeds LS9 7TF, UK
| | | | | |
Collapse
|
7
|
Vuoristo JT, Berrettini WH, Overhauser J, Prockop DJ, Ferraro TN, Ala-Kokko L. Sequence and genomic organization of the human G-protein Golfalpha gene (GNAL) on chromosome 18p11, a susceptibility region for bipolar disorder and schizophrenia. Mol Psychiatry 2000; 5:495-501. [PMID: 11032382 DOI: 10.1038/sj.mp.4000758] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The sequence and genomic organization of the human Golfalpha (GNAL) gene were determined. The human GNAL gene was found to contain 12 coding exons, and it spans over 80 kb on chromosome 18p11. 5' RACE analysis suggested an additional transcription initiation start site. Sequence analysis of the putative promoter region revealed conserved binding sites for several transcription factors. Sequence analysis of the 3'-untranslated region revealed the presence of two Alu sequences and two polyadenylation signals. 3' RACE analysis confirmed the functionality of the most downstream poly-a signal. The human GNAL was found to be expressed as a single transcript of about 5.9 kb in the brain. One highly informative dinucleotide repeat was found in intron 5. Additionally, a processed pseudogene for asparagine synthetase was found about 6 kb upstream of the GNAL gene. Knowledge of the sequence and structure of the human GNAL gene provides essential information for further analysis of the GNAL locus at chromosome 18p11 which has been linked to bipolar disorder and schizophrenia.
Collapse
Affiliation(s)
- J T Vuoristo
- Biocenter Oulu and Department of Medical Biochemistry, University of Oulu, Finland
| | | | | | | | | | | |
Collapse
|
8
|
Still IH, Vince P, Cowell JK. The third member of the transforming acidic coiled coil-containing gene family, TACC3, maps in 4p16, close to translocation breakpoints in multiple myeloma, and is upregulated in various cancer cell lines. Genomics 1999; 58:165-70. [PMID: 10366448 DOI: 10.1006/geno.1999.5829] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We have recently identified a novel gene, TACC1 (transforming acidic coiled coil-containing gene 1), which is located close to FGFR1 within a region amplified in breast cancer on human chromosome 8p11. The coiled coil domain of this gene identified a series of cDNAs in the expressed sequence tag database, which suggested the existence of a family of TACC genes comprising at least three family members. We have now characterized the human and mouse TACC3 cDNAs, and demonstrate that this gene is upregulated in various cancer cell lines, and at Embryonic Day 15 in mice, suggesting that the TACC3 protein is involved in the control of cell growth and differentiation. The TACC3 gene maps telomeric to the FGFR3 gene in 4p16.3, close to a region disrupted by translocation breakpoints associated with multiple myeloma. Thus, TACC1, TACC2, and TACC3 map close to the corresponding FGFR1, FGFR2, and FGFR3 genes. The phylogenetic relationship among the three TACC genes is similar to that of the three FGFR family members. These relationships suggest that the FGFR and TACC genes arose from a physically linked ancestral gene pair. Subsequently, this gene pair has undergone two successive rounds of gene duplication to give rise to the three FGFR/TACC gene pairs on chromosomes 4, 8, and 10.
Collapse
Affiliation(s)
- I H Still
- Center for Molecular Genetics, NB20, The Lerner Research Institute, Cleveland Clinic Foundation, 9500 Euclid Avenue, Cleveland, Ohio, 44195, USA.
| | | | | |
Collapse
|
9
|
Fang YY, Bain S, Haan EA, Eyre HJ, MacDonald M, Wright TJ, Altherr MR, Riess O, Sutherland G, Callen DF. High resolution characterization of an interstitial deletion of less than 1.9 Mb at 4p16.3 associated with Wolf-Hirschhorn syndrome. ACTA ACUST UNITED AC 1997. [DOI: 10.1002/(sici)1096-8628(19970905)71:4<453::aid-ajmg15>3.0.co;2-f] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
10
|
Abstract
AbstractThe human gene for γ-glutamyl carboxylase is 13 kb in length and contains 15 exons. Transcription starts at a cytosine 217 base pair upstream of the first codon. There are two major transcripts in all tissues examined. They are distinguished by the presence of an Alu sequence in the 3′ nontranslated end of the longer species. Relative mRNA levels for 12 bovine tissues are presented.
Collapse
|
11
|
Martin F, Schaller A, Eglite S, Schümperli D, Müller B. The gene for histone RNA hairpin binding protein is located on human chromosome 4 and encodes a novel type of RNA binding protein. EMBO J 1997; 16:769-78. [PMID: 9049306 PMCID: PMC1169678 DOI: 10.1093/emboj/16.4.769] [Citation(s) in RCA: 132] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
The hairpin structure at the 3' end of animal histone mRNAs controls histone RNA 3' processing, nucleocytoplasmic transport, translation and stability of histone mRNA. Functionally overlapping, if not identical, proteins binding to the histone RNA hairpin have been identified in nuclear and polysomal extracts. Our own results indicated that these hairpin binding proteins (HBPs) bind their target RNA as monomers and that the resulting ribonucleoprotein complexes are extremely stable. These features prompted us to select for HBP-encoding human cDNAs by RNA-mediated three-hybrid selection in Saccharomyces cerevesiae. Whole cell extract from one selected clone contained a Gal4 fusion protein that interacted with histone hairpin RNA in a sequence- and structure-specific manner similar to a fraction enriched for bovine HBP, indicating that the cDNA encoded HBP. DNA sequence analysis revealed that the coding sequence did not contain any known RNA binding motifs. The HBP gene is composed of eight exons covering 19.5 kb on the short arm of chromosome 4. Translation of the HBP open reading frame in vitro produced a 43 kDa protein with RNA binding specificity identical to murine or bovine HBP. In addition, recombinant HBP expressed in S. cerevisiae was functional in histone pre-mRNA processing, confirming that we have indeed identified the human HBP gene.
Collapse
Affiliation(s)
- F Martin
- Abteilung für Entwicklungsbiologie, Zoologisches Institut der Universität Bern, Switzerland
| | | | | | | | | |
Collapse
|
12
|
Wang ZF, Whitfield ML, Ingledue TC, Dominski Z, Marzluff WF. The protein that binds the 3' end of histone mRNA: a novel RNA-binding protein required for histone pre-mRNA processing. Genes Dev 1996; 10:3028-40. [PMID: 8957003 DOI: 10.1101/gad.10.23.3028] [Citation(s) in RCA: 220] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Replication-dependent histone mRNAs are not polyadenylated but end in a conserved 26-nucleotide structure that contains a stem-loop. Much of the cell cycle regulation of histone mRNA is post-transcriptional and is mediated by the 3' end of histone mRNA. The stem-loop binding protein (SLBP) that binds the 3' end of histone mRNA is a candidate for the factor that participates in most, if not all, of the post-transcriptional regulatory events. We have cloned the cDNA for the SLBP from humans, mice, and frogs, using the recently developed yeast three-hybrid system. The human SLBP is a 31-kD protein and contains a novel RNA-binding domain, which has been mapped to a 73-amino-acid region of the protein. The cloned SLBP is the protein bound to the 3' end of histone mRNA as antibodies specific for the SLBP remove all specific binding activity from nuclear and polyribosomal extracts. These depleted extracts do not cleave histone pre-mRNA efficiently, demonstrating that the SLBP is required for efficient histone pre-mRNA processing.
Collapse
Affiliation(s)
- Z F Wang
- Program in Molecular Biology and Biotechnology, University of North Carolina at Chapel Hill 27599, USA
| | | | | | | | | |
Collapse
|
13
|
Lin Z, Cui X, Li H. Multiplex genotype determination at a large number of gene loci. Proc Natl Acad Sci U S A 1996; 93:2582-7. [PMID: 8637917 PMCID: PMC39840 DOI: 10.1073/pnas.93.6.2582] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
To facilitate large-scale genotype analysis, an efficient PCR-based multiplex approach has been developed. For simultaneously amplifying the target sequences at a large number of genetic loci, locus-specific primers containing 5' universal tails are used. Attaching the universal tails to the target sequences in the initial PCR steps allows replacement of all specific primers with a pair of primers identical to the universal tails and converts the multiplex amplification into "uniplex." Simultaneous amplification of 26 genetic loci with this approach is described. The multiplex amplification can be coupled with genotype determination. By incorporating a single-base mismatch between a primer and the template into the target sequences, a polymorphic site can be converted into a desirable restriction fragment length polymorphism when it is necessary. In this way, the allelic PCR products for the polymorphic loci can be discriminated by gel electrophoresis after restriction enzyme digestion. In this study, 32 loci were typed in such a multiplex way.
Collapse
Affiliation(s)
- Z Lin
- Coriell Institute for Medical Research, Camden, NJ 08103, USA
| | | | | |
Collapse
|
14
|
Abstract
DNA polymorphism in the Y chromosome, examined at a 729-base pair intron located immediately upstream of the ZFY zinc-finger exon, revealed no sequence variation in a worldwide sample of 38 human males. This finding cannot be explained by global constraint on the intron sequence, because interspecific comparisons with other nonhuman primates revealed phylogenetically informative sequence changes. The invariance likely results from either a recent selective sweep, a recent origin for modern Homo sapiens, recurrent male population bottlenecks, or historically small effective male population sizes. A coalescence model predicts an expected time to a most recent common ancestral male lineage of 270,000 years (95 percent confidence limits: 0 to 800,000 years).
Collapse
Affiliation(s)
- R L Dorit
- Department of Biology, Yale University, New Haven, CT 06511, USA
| | | | | |
Collapse
|
15
|
Abstract
Alu sequences are interspersed throughout the genomes of primate cells, occurring singly and in clusters around RNA polymerase II-transcribed genes. Because these repeat elements are capable of positioning nucleosomes in in vitro reconstitutes (Englander, E. W., Wolffe, A. P., and Howard, B. H. (1993) J. Biol. Chem. 268, 19565-19573), we investigated whether they also influence in vivo chromatin structure. When assayed collectively using consensus sequence probes and native chromatin as template, Alu family members were found to confer rotational positioning on nucleosomes or nucleosome-like particles. In particular, a 10-base pair pattern of DNase I nicking that spanned the RNA polymerase III box A promoter motif extended upstream to cover diverse 5'-flanking sequences, suggesting that Alu repeats may influence patterns of nucleosome formation over neighboring regions. Computational analysis of a set of naturally occurring Alu sequences indicated that nucleosome positioning information is intrinsic to these elements. Inasmuch as local chromatin organization influences gene expression, the capacity of Alu sequences to affect chromatin structure as demonstrated here may help to clarify some features of these elements.
Collapse
Affiliation(s)
- E W Englander
- Laboratory of Molecular Growth Regulation, NICHD, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | |
Collapse
|
16
|
SUTTON GRANGERG, WHITE OWEN, ADAMS MARKD, KERLAVAGE ANTHONYR. TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects. ACTA ACUST UNITED AC 1995. [DOI: 10.1089/gst.1995.1.9] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
17
|
Dangel AW, Mendoza AR, Baker BJ, Daniel CM, Carroll MC, Wu LC, Yu CY. The dichotomous size variation of human complement C4 genes is mediated by a novel family of endogenous retroviruses, which also establishes species-specific genomic patterns among Old World primates. Immunogenetics 1994; 40:425-36. [PMID: 7545960 DOI: 10.1007/bf00177825] [Citation(s) in RCA: 92] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The human complement C4 genes in the HLA exhibit an unusual, dichotomous size polymorphism and a four-gene, modular variation involving novel gene RP, complement C4, steroid 21-hydroxylase (CYP21), and tenascin-like Gene X (RCCX). The C4 gene size dichotomy is mediated by an endogenous retrovirus, HERV-K(C4). Nearly identical sequences for this retrotransposon are present precisely at the same location in the long C4 genes from the tandem RCCX Module I and Module II. Specific nucleotide substitutions between the long and short C4 genes have been identified and used for diagnosis. Southern blot analyses revealed that HERV-K(C4) is present at more than 30 locations in the human genome, exhibits variations in the population, and its analogs exist in the genomes of Old World primates with species-specific patterns. Evidence of intrachromosomal recombination between the two long terminal repeats of HERV-K(C4) is found near the huntingtin locus on chromosome 4. It is possible that members of HERV-K(C4) are involved in genetic instabilities including the RCCX modules, and in protecting the host genome from retroviral attack through an antisense strategy.
Collapse
Affiliation(s)
- A W Dangel
- Children's Hospital Research Foundation, Columbus, OH 43205
| | | | | | | | | | | | | |
Collapse
|
18
|
Riess O, Winkelmann B, Epplen JT. Toward the complete genomic map and molecular pathology of human chromosome 4. Hum Genet 1994; 94:1-18. [PMID: 8034286 DOI: 10.1007/bf02272834] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The identification of disease genes via molecular DNA cloning has revolutionized human genetics and medicine. Both the candidate gene approach and positional cloning have been used successfully. The defects causing Huntington's disease, facioscapulohumeral muscular dystrophy, piebaldism, Hurler/Scheie syndrome, one form of autosomal recessive retinitis pigmentosa, and a second locus for autosomal dominant polycystic kidney disease have recently been localized to chromosome 4. In addition to the rapid progress in the cloning of the 203-megabase chromosome, the presence of more than 60 closely spaced microsatellites on this chromosome will undoubtedly lead to the localization of additional disease genes. In order to consider cloned genes as potential candidates for disorders assigned to chromosome 4, it is important to collect and order all genes with respect to their chromosomal localization. Analysis of cytogenetically visible interstitial and terminal deletions should also be helpful in defining new disease gene loci and in mapping novel genes. These data represent the status quo of the integrated molecular map for chromosome 4.
Collapse
Affiliation(s)
- O Riess
- Molekulare Humangenetik Ruhr-Universität, Bochum, Germany
| | | | | |
Collapse
|
19
|
Hubert R, MacDonald M, Gusella J, Arnheim N. High resolution localization of recombination hot spots using sperm typing. Nat Genet 1994; 7:420-4. [PMID: 7920662 DOI: 10.1038/ng0794-420] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
We have applied sperm DNA typing to determine the distribution of crossover events within a one megabase region of the short arm of human chromosome 4 near the locus for Huntington disease. A total of 29 recombinants were detected among 602 sperm typed after whole genome amplification. These recombinants were typed for seven polymorphic markers. The 280 kilobase D4S10-D4S126 interval was found to undergo recombination at a 6-9-fold greater rate per unit of physical distance than the adjacent 720 kb D4S126-D4S127 interval. Sperm typing has the potential to dissect mammalian recombination hot spots to the point where DNA sequence analysis may reveal the molecular basis for hyperrecombination.
Collapse
Affiliation(s)
- R Hubert
- Molecular Biology Program, University of Southern California, Los Angeles 90089-1340
| | | | | | | |
Collapse
|
20
|
|
21
|
|
22
|
Valdes JM, Tagle DA, Collins FS. Island rescue PCR: a rapid and efficient method for isolating transcribed sequences from yeast artificial chromosomes and cosmids. Proc Natl Acad Sci U S A 1994; 91:5377-81. [PMID: 8202494 PMCID: PMC43998 DOI: 10.1073/pnas.91.12.5377] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The identification of transcripts from large genomic regions cloned in yeast artificial chromosomes (YACs) or cosmids continues to be a critical and often rate-limiting step in positional cloning of human disease genes. We have developed a PCR-based method for rapid and efficient generation of probes from YACs or cosmids that can be used for cDNA library screening. The method, which we call island rescue PCR (IRP), is based upon the observation that the 5' ends of many genes are associated with (G+C)-rich regions called CpG islands. In IRP, the YAC of interest is digested with a restriction enzyme that recognizes sequences of high CpG content, and vectorette linkers are ligated to the cleaved ends. The PCR is used to amplify the region extending from the cleaved restriction enzyme site to the nearest SINE (Alu) repeat. In many cases this product contains sequences from the 5' end of the associated gene. cDNA clones isolated with these products are then verified by mapping them back to the original YAC. The method allows rapid screening of > 500 kb of human genomic insert in one experiment, is tolerant of contaminating yeast sequences, and can also be applied to cosmid pools. In a control experiment, the method was able to identify cDNA clones for the neurofibromatosis type 1 (NF1) gene using a probe generated from a YAC in the region. Application of IRP has yielded nine other genes from YACs isolated from chromosome locations 4p16.3 and 17q21.
Collapse
Affiliation(s)
- J M Valdes
- Program in Cellular and Molecular Biology, University of Michigan, Ann Arbor 48109
| | | | | |
Collapse
|
23
|
Abstract
During the past year, improvements in the physical and genetic maps of the human genome, in combination with more efficient methods to isolate genes from cloned DNA, have made an increasing impact on the identification of disease genes. Sequence analysis of genomic DNA and the random sequencing and mapping of cDNA clones is helping to integrate the transcript map with the developing physical and genetic maps.
Collapse
Affiliation(s)
- A P Monaco
- Imperial Cancer Research Fund Laboratories, John Radcliffe Hospital, Headington, Oxford, UK
| |
Collapse
|
24
|
Koop BF, Hood L. Striking sequence similarity over almost 100 kilobases of human and mouse T-cell receptor DNA. Nat Genet 1994; 7:48-53. [PMID: 8075639 DOI: 10.1038/ng0594-48] [Citation(s) in RCA: 122] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
We report here the comparative DNA sequence analysis of nearly 100 kilobases of contiguous DNA in the C delta to C alpha region of the alpha/delta T cell receptor loci (TCRAC/TCRDC) of mouse and man. This analysis--the largest genomic sequence comparison so far--provides new insights into the functions of the T cell receptor genes as well as the surrounding chromosome structure through the identification of actively conserved DNA sequences. In this comparison we have identified a very high level of organizational and noncoding sequence similarity (approximately 71%) in contrast to previous findings in the beta-globin gene cluster. This observation begins to question the notion that much of the chromosomal non-coding sequence is junk.
Collapse
Affiliation(s)
- B F Koop
- Department of Biology, University of Victoria, British Columbia, Canada
| | | |
Collapse
|
25
|
Ferrin LJ, Camerini-Otero RD. Long-range mapping of gaps and telomeres with RecA-assisted restriction endonuclease (RARE) cleavage. Nat Genet 1994; 6:379-83. [PMID: 8054978 DOI: 10.1038/ng0494-379] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
RecA-assisted restriction endonuclease (RARE) cleavage is a method to perform sequence-specific cleavage of genomic DNA, and is useful in physical mapping studies. After making two modifications, we have applied this method to mapping large regions of DNA in several cell types, including a notorious gap near the Huntington disease (HD) locus on chromosome 4. RARE cleavage fragments were analysed by pulsed field gel electrophoresis and Southern blotting and the distances between cleavage sites determined with accuracy. Using RARE cleavage, the gap measured was less than 60 kilobases in length. RARE cleavage is also a straightforward technique to map the distance from a marker to a telomere. The terminal 1.7 megabases of several HD and control cell lines were mapped with no large differences between cell lines in this region.
Collapse
Affiliation(s)
- L J Ferrin
- Genetics and Biochemistry Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892
| | | |
Collapse
|
26
|
Adams MD, Kerlavage AR, Kelley JM, Gocayne JD, Fields C, Fraser CM, Venter JC. A model for high-throughput automated DNA sequencing and analysis core facilities. Nature 1994; 368:474-5. [PMID: 8133896 DOI: 10.1038/368474a0] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- M D Adams
- Institute for Genomic Research, Gaithersburg, Maryland 20878
| | | | | | | | | | | | | |
Collapse
|
27
|
Cross SH, Charlton JA, Nan X, Bird AP. Purification of CpG islands using a methylated DNA binding column. Nat Genet 1994; 6:236-44. [PMID: 8012384 DOI: 10.1038/ng0394-236] [Citation(s) in RCA: 358] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
CpG islands are short stretches of DNA containing a high density of non-methylated CpG dinucleotides, predominantly associated with coding regions. We have constructed an affinity matrix that contains the methyl-CpG binding domain from the rat chromosomal protein MeCP2, attached to a solid support. A column containing the matrix fractionates DNA according to its degree of CpG methylation, strongly retaining those sequences that are highly methylated. Using this column, we have developed a procedure for bulk isolation of CpG islands from human genomic DNA. As CpG islands overlap with approximately 60% of human genes, the resulting CpG island library can be used to isolate full-length cDNAs and to place genes on genomic maps.
Collapse
Affiliation(s)
- S H Cross
- Institute of Cell and Molecular Biology, University of Edinburgh, UK
| | | | | | | |
Collapse
|
28
|
Abstract
Estimation of gene number in mammals is difficult due to the high proportion of noncoding DNA within the nucleus. In this study, we provide a direct measurement of the number of genes in human and mouse. We have taken advantage of the fact that many mammalian genes are associated with CpG islands whose distinctive properties allow their physical separation from bulk DNA. Our results suggest that there are approximately 45,000 CpG islands per haploid genome in humans and 37,000 in the mouse. Sequence comparison confirms that about 20% of the human CpG islands are absent from the homologous mouse genes. Analysis of a selection of genes suggests that both human and mouse are losing CpG islands over evolutionary time due to de novo methylation in the germ line followed by CpG loss through mutation. This process appears to be more rapid in rodents. Combining the number of CpG islands with the proportion of island-associated genes, we estimate that the total number of genes per haploid genome is approximately 80,000 in both organisms.
Collapse
Affiliation(s)
- F Antequera
- Institute of Cell and Molecular Biology, University of Edinburgh, Scotland
| | | |
Collapse
|
29
|
Winterbourne DJ, Thomas S, Hermon-Taylor J. Suppression of anchorage-independent growth after gene transfection. Br J Cancer 1993; 68:251-8. [PMID: 7688547 PMCID: PMC1968545 DOI: 10.1038/bjc.1993.323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
A novel procedure for isolating anchorage-dependent cells has been developed. It involves negative selection of cells growing in suspension followed by clonal replica screening for anchorage-dependent growth. Cells which have regained anchorage-dependent growth have been isolated from a library of the Chinese hamster ovary cell line, CHO-K1, transfected with pSV2neo and human genomic DNA. One anchorage-dependent clone, 1042AC, has been studied in detail. Anchorage-dependent growth of 1042AC is stable when cultured as adherent monolayers, but revertants appear rapidly when cultured in suspension. Suppression is unlikely to be due to loss or mutation of hamster genes conferring anchorage-independent growth as hybrids between 1042AC and CHO-K1 have the suppressed phenotype of 1042AC. Furthermore, a population of cells obtained from the hybrid by selecting for revertants to anchorage-independent growth showed selective loss of the transgenome derived from 1042AC. The growth suppression was not due to transfection of the human Krev-1 gene, which has previously been shown to restore anchorage-dependent growth, nor was there any evidence of alteration in the endogenous hamster Krev-1 gene. However, evidence for a human gene being responsible for the suppressed phenotype has not been obtained yet.
Collapse
Affiliation(s)
- D J Winterbourne
- Department of Surgery, St George's Hospital Medical School, London, UK
| | | | | |
Collapse
|
30
|
Ozawa N, Kano T, Taga C, Hattori M, Sakaki Y, Suzuki H. An exon-trapping system with a newly constructed trapping vector pEXT2; its application to the proximal region of the human chromosome 21 long arm. FEBS Lett 1993; 325:303-8. [PMID: 7686513 DOI: 10.1016/0014-5793(93)81094-g] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
We have developed an exon-trapping system with a newly constructed trapping vector containing multiple cloning sites (designated pEXT2). The system revealed high sensitivity for trapping a control exon from several hundred kbp of DNA. We have applied the system to the cosmid clones located on human chromosome 21p11-q21, and identified two fragments highly homologous to neurofibromatosis 1 (NF1) gene and a clearly transcribed fragment hybridized with approximately 1.6 kb RNA from human brain and human glioblastoma A172 cell.
Collapse
Affiliation(s)
- N Ozawa
- Shionogi Institute for Medical Science, Osaka, Japan
| | | | | | | | | | | |
Collapse
|
31
|
Lucassen AM, Julier C, Beressi JP, Boitard C, Froguel P, Lathrop M, Bell JI. Susceptibility to insulin dependent diabetes mellitus maps to a 4.1 kb segment of DNA spanning the insulin gene and associated VNTR. Nat Genet 1993; 4:305-10. [PMID: 8358440 DOI: 10.1038/ng0793-305] [Citation(s) in RCA: 187] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Recent studies have demonstrated that a locus at 11p15.5 confers susceptibility to insulin dependent diabetes mellitus (IDDM). This locus has been shown to lie within a 19 kb region. We present a detailed sequence comparison of the predominant haplotypes found in this region in a population of French Caucasian IDDM patients and controls. Identification of polymorphisms both associated and unassociated with IDDM has allowed us to define further the region of association to 4.1 kb. Ten polymorphisms within this region are in strong linkage disequilibrium with each other and extend across the insulin gene locus and the variable number tandem repeat (VNTR) situated immediately 5' to the insulin gene. These represent a set of candidate disease polymorphisms one or more of which may account for the susceptibility to IDDM.
Collapse
Affiliation(s)
- A M Lucassen
- Molecular Immunology group, Institute of Molecular Medicine, John Radcliffe Hospital, Oxford, UK
| | | | | | | | | | | | | |
Collapse
|
32
|
Adams MD, Kerlavage AR, Fields C, Venter JC. 3,400 new expressed sequence tags identify diversity of transcripts in human brain. Nat Genet 1993; 4:256-67. [PMID: 8358434 DOI: 10.1038/ng0793-256] [Citation(s) in RCA: 235] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
We present the results of the partial sequencing of over 3,400 expressed sequence tags (ESTs) from human brain cDNA clones, which increases the number of distinct genes expressed in the brain, that are represented by ESTs, to about 6,000. By choosing clones in an unbiased manner, it is possible to construct a profile of the transcriptional activity of the brain at different stages. Proteins that comprise the cytoskeleton are the most abundant; however, a large variety of regulatory proteins are also seen. About half of the ESTs predicted to contain a protein-coding region have no matches in the public peptide databases and may represent new gene families.
Collapse
Affiliation(s)
- M D Adams
- Receptor Biochemistry and Molecular Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland 20892
| | | | | | | |
Collapse
|