1
|
Cristiano L. The pseudogenes of eukaryotic translation elongation factors (EEFs): Role in cancer and other human diseases. Genes Dis 2022; 9:941-958. [PMID: 35685457 PMCID: PMC9170609 DOI: 10.1016/j.gendis.2021.03.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 03/29/2021] [Indexed: 02/06/2023] Open
Abstract
The eukaryotic translation elongation factors (EEFs), i.e. EEF1A1, EEF1A2, EEF1B2, EEF1D, EEF1G, EEF1E1 and EEF2, are coding-genes that play a central role in the elongation step of translation but are often altered in cancer. Less investigated are their pseudogenes. Recently, it was demonstrated that pseudogenes have a key regulatory role in the cell, especially via non-coding RNAs, and that the aberrant expression of ncRNAs has an important role in cancer development and progression. The present review paper, for the first time, collects all that published about the EEFs pseudogenes to create a base for future investigations. For most of them, the studies are in their infancy, while for others the studies suggest their involvement in normal cell physiology but also in various human diseases. However, more investigations are needed to understand their functions in both normal and cancer cells and to define which can be useful biomarkers or therapeutic targets.
Collapse
|
2
|
C7ORF41 Regulates Inflammation by Inhibiting NF- κB Signaling Pathway. BIOMED RESEARCH INTERNATIONAL 2021; 2021:7413605. [PMID: 33506033 PMCID: PMC7806384 DOI: 10.1155/2021/7413605] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 12/02/2020] [Accepted: 12/18/2020] [Indexed: 11/18/2022]
Abstract
Inflammation is an important biological process for eliciting immune responses against physiological and pathological stimuli. Inflammation must be efficiently regulated to ensure homeostasis in the body. Nuclear factor-kappa B (NF-κB) signaling is crucial for inflammatory and immune responses. Aberrant activation of NF-κB signaling leads to development of numerous human diseases. In this study, we investigated the function of chromosome 7 open reading frame 41 (C7ORF41) in NF-κB signaling during inflammation. C7ORF41 was upregulated in cells stimulated with tumor necrosis factor-alpha or lipopolysaccharide. Moreover, overexpression of C7ORF41 inhibited the activation of NF-κB and decreased the expression of its downstream target genes. Notably, small hairpin RNA-mediated depletion of C7ORF41 increased the levels of downstream genes and enabled the activation of NF-κB. In conclusion, C7ORF41 negatively regulated inflammation via NF-κB signaling and p65 phosphorylation in vitro. These findings may help to diagnose and prognosticate inflammatory conditions and may help develop new strategies for the management of inflammation-related diseases.
Collapse
|
3
|
Hausner L, Tschäpe JA, Schmitt HP, Hentschel F, Hartmann T, Frölich L. Clinical characterization of a presenilin 1 mutation (F177S) in a family with very early‐onset Alzheimer's disease in the third decade of life. Alzheimers Dement 2013; 10:e27-39. [DOI: 10.1016/j.jalz.2013.02.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2012] [Revised: 01/09/2013] [Accepted: 02/06/2013] [Indexed: 10/26/2022]
Affiliation(s)
- Lucrezia Hausner
- Department of Geriatric Psychiatry, Central Institute of Mental Health, Medical Faculty MannheimUniversity of HeidelbergHeidelbergGermany
- Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty MannheimUniversity of HeidelbergHeidelbergGermany
| | - Jakob A. Tschäpe
- Department of Neurodegeneration and NeurobiologySaarland UniversitySaarbrueckkenGermany
- Department of NeurologySaarland UniversitySaarbrueckkenGermany
| | | | - Frank Hentschel
- Department of Neuroradiology, Central Institute of Mental Health, Medical Faculty MannheimUniversity of HeidelbergHeidelbergGermany
| | - Tobias Hartmann
- Department of Neurodegeneration and NeurobiologySaarland UniversitySaarbrueckkenGermany
- Department of NeurologySaarland UniversitySaarbrueckkenGermany
| | - Lutz Frölich
- Department of Geriatric Psychiatry, Central Institute of Mental Health, Medical Faculty MannheimUniversity of HeidelbergHeidelbergGermany
| |
Collapse
|
4
|
Tsai IJ, Otto TD, Berriman M. Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps. Genome Biol 2010; 11:R41. [PMID: 20388197 PMCID: PMC2884544 DOI: 10.1186/gb-2010-11-4-r41] [Citation(s) in RCA: 228] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2010] [Revised: 03/10/2010] [Accepted: 04/13/2010] [Indexed: 12/17/2022] Open
Abstract
IMAGE generates local assemblies, closing gaps in genomes assembled from paired-end next generation sequencing data, often without the need for new data Advances in sequencing technology allow genomes to be sequenced at vastly decreased costs. However, the assembled data frequently are highly fragmented with many gaps. We present a practical approach that uses Illumina sequences to improve draft genome assemblies by aligning sequences against contig ends and performing local assemblies to produce gap-spanning contigs. The continuity of a draft genome can thus be substantially improved, often without the need to generate new data.
Collapse
Affiliation(s)
- Isheng J Tsai
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | | |
Collapse
|
5
|
AZFc region of the Y chromosome shows singular structural organization. Chromosome Res 2010; 18:419-30. [PMID: 20373140 DOI: 10.1007/s10577-010-9123-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Revised: 03/01/2010] [Accepted: 03/02/2010] [Indexed: 10/19/2022]
Abstract
Owing to clonal inheritance, haploid status and lack of recombination, structural polymorphism in the human Y chromosome is more prevalent than that in the remaining parts of the genome. We studied structural organization of the AZFc region, assessed microdeletions therein and studied copy number variation (CNV) of several candidate genes in 750 Indian males. FISH mapping of 13 Y-specific BAC/cosmid clones uncovered a hitherto unreported AZFc configuration showing inter-DAZ gene sequence onto the Yp instead of Yq region. Such inter-DAZ gene arrangements were also detected in five German males (European Y). In 40-50% males, partial u3 and one of the green amplicons, g1, g2 or g3 was present on the Yp in addition to Yq, suggesting an alteration in the IR3 region. Among other AZFc candidates, complete TTY3 and partial CDY1 BAC sequences were detected on the proximal 5p and distal 15q regions, respectively, in both the sexes. However, primers deduced from these clones showed male specific amplification of TTY3 and CDY1 exons suggesting (re)organization of their flanking sequences between Y and autosomes. Importantly, approximately 5% males showed CNV of various Y-linked genes, and approximately 3%, random microdeletions across the AZF region. Present study demonstrates hitherto unreported singular structural organization with respect to DAZ, TTY3 and CDY1 genes highlighting organizational complexities of the human Y chromosome in the global context.
Collapse
|
6
|
Cadenas AM, Regueiro M, Gayden T, Singh N, Zhivotovsky LA, Underhill PA, Herrera RJ. Male amelogenin dropouts: phylogenetic context, origins and implications. Forensic Sci Int 2007; 166:155-63. [PMID: 16781100 DOI: 10.1016/j.forsciint.2006.05.002] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2006] [Revised: 04/03/2006] [Accepted: 05/07/2006] [Indexed: 11/21/2022]
Abstract
Several commercial PCR multiplex kits incorporate the amelogenin locus for the purpose of human gender identification. Consequently, erroneous results in the electropherogram profile of this locus can carry important forensic implications. In this study, dropout of the amelogenin Y allele was detected in 5 out of 77 phenotypically normal Kathmandu males using the AmpFlSTR Identifiler kit. A battery of male-specific markers including SNPs, STRs, STSs, and a minisatellite were amplified for the five amelogenin null samples in order to delineate the breakpoints of the deletions as well as assess the overall integrity of the Y-chromosome. This study represents the first to examine the haplogroup affiliation of the AMGY deletions. The analyses performed suggest a single origin for the five deletions as indicated by their allocation to a specific Y-haplogroup (J2b2-M241), related Y-STR haplotypes and identical regional localization of breakpoints. The age estimated from the microsatellite variation for the amelogenin deletions (if they are associated by descent) is approximately 6.5+/-3.3 ky, younger than the previously reported related age of the M241 haplogroup representatives (13-14 ky). Our data in combination with previous publications suggest a concentration of afflicted individuals in the Indian subcontinent, possibly as a result of common ancestry. The elevated incidence of the amelogenin dropout in these populations accentuates the need to utilize other loci for gender determination in order to obtain an accurate set of inclusion criteria in forensic casework.
Collapse
Affiliation(s)
- A M Cadenas
- Department of Biological Sciences, Florida International University, University Park, OE 304, Miami, FL 33199, USA
| | | | | | | | | | | | | |
Collapse
|
7
|
Tapias A, Monasterio P, Ciudad CJ, Noé V. Characterization of the 5'-flanking region of the human transcription factor Sp3 gene. ACTA ACUST UNITED AC 2005; 1730:126-36. [PMID: 16024108 DOI: 10.1016/j.bbaexp.2005.06.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2004] [Revised: 05/23/2005] [Accepted: 06/06/2005] [Indexed: 12/11/2022]
Abstract
A fragment of 1079 bp from the 5'-flanking region of the human Sp3 gene was isolated and characterized. The Sp3 promoter is a GC-rich region that contains putative binding sites for Elk-1, c-Myb, NF-1, Ap1, Sp1, NF-Y, Ap2 and USF. Several transcriptional start sites located between 70 and 132 bp upstream of the translational start site were identified. The proximal promoter was contained in the first 281 bp 5' of the translational start, whereas the region including up to -225 relative to the translational start was referred as the minimal promoter. Transient transfections and luciferase assays revealed activation of the Sp3 proximal promoter upon overexpression of either Sp1 or Sp3, alone or in combination. Gel-shift and supershift assays demonstrated specific binding of Sp1 and Sp3 proteins to the GC box located in the proximal promoter of Sp3. Overexpression of NF-YA had a synergistic effect on Sp1 overexpression and an additive effect on Sp3 overexpression. Additionally, overexpression of NF-YA, Sp1 and Sp3 altogether had a synergistic effect on Sp3 promoter activity. Furthermore, binding of the NF-Y complex to the CCAAT box located in the proximal promoter of Sp3 was observed in gel-shift assays.
Collapse
Affiliation(s)
- Alicia Tapias
- Department of Biochemistry and Molecular Biology, School of Pharmacy, University of Barcelona, Avenue Diagonal 643, Barcelona E-08028, Spain
| | | | | | | |
Collapse
|
8
|
Abstract
RhoB is a small GTP-binding protein that is involved in apoptotic signal transduction. We have cloned the mouse RhoB mRNA including a 1377 nucleotide 3'-untranslated region (UTR) that contains six AU-rich elements (AREs) as well as several uridine-rich stretches. There is 94% homology overall between the mouse and rat RhoB genes and 92% homology between the mouse and a putative human clone. Ultraviolet light (UVL) induces RhoB production through regulated changes in gene transcription and mRNA stabilization although the latter mechanism is unknown. We observed that UVL increased the half-life of RhoB mRNA from 63 min to 3.3 h in NIH/3T3 cells and from 87 min to 2.7 h in normal human keratinocyte cells. In vitro mobility shift assays demonstrated that HuR bound the 3'-UTR of RhoB at three distinct locations (nucleotides 1342-1696, 1765-1920 and 1897-1977) suggesting a regulatory role for this RNA-binding protein. HuR immunoprecipitations were positive for RhoB mRNA indicating an in vivo association, and Western blot analysis and immunofluorescence demonstrated that HuR rapidly partitions from the nucleus to the cytoplasm after UVL. Therefore, we propose a model in which UVL induces stress-activated signal transduction leading to nuclear/cytoplasmic shuttling of HuR and subsequent stabilization of RhoB mRNA.
Collapse
Affiliation(s)
- Cara J Westmark
- Department of Pathology and Laboratory Medicine, Waisman Center for Developmental Disabilities, University of Wisconsin, Madison, WI 53705, USA
| | | | | |
Collapse
|
9
|
Hammer SE, Strehl S, Hagemann S. Homologs of Drosophila P transposons were mobile in zebrafish but have been domesticated in a common ancestor of chicken and human. Mol Biol Evol 2004; 22:833-44. [PMID: 15616143 DOI: 10.1093/molbev/msi068] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A substantial fraction of vertebrate and invertebrate genomes is composed of mobile elements and their derivatives. One of the most intensively studied transposon families, the P elements of Drosophila, was thought to exist exclusively in the genomes of dipteran insects. Based on the data provided by the human genome project, in 2001 our group has identified a P element-homologous sequence in the human genome. This P element-homologous human gene, named Phsa, is 19,533 nucleotides long, comprises six exons and five introns, and encodes a protein of still unknown function with a length of 903 amino acid residues. The N-terminal THAP domain of the putative Phsa protein shows similarities to the site-specific DNA-binding domain of the Drosophila P element transposase. In the present study, FISH analysis and the screening of a human lambda genomic library revealed a single copy of Phsa located on the long arm of chromosome 4, upstream of a gene coding for the hypothetical protein DKFZp686L1814. The same gene arrangement was found for the homologous gene Pgga in the genome of chicken, thus, displaying Pgga at orthologous position on the long arm of chromosome 4. The single-copy gene status and the absence of terminal inverted repeats and target-site duplications indicate that Phsa and Pgga constitute domesticated stationary sequences. In contrast, a considerable number of P-homologous sequences with terminal inverted repeats and intact target-site duplications could be identified in zebrafish, strongly indicating that Pdre elements were mobile within the zebrafish genome. Pdre elements are the first P-like transposons identified in a vertebrate species. With respect to Phsa, gene expression studies showed that Phsa is expressed in a broad range of human tissues, suggesting that the putative Phsa protein plays a not yet understood but essential role in a specific metabolic pathway. We demonstrate that P-homologous DNA sequences occur in the genomes of 21 analyzed vertebrates but only as rudiments in the rodents. Finally, the evolutionary history of P element-homologous vertebrate sequences is discussed in the context of the "molecular domestication" hypothesis versus the "source gene hypothesis."
Collapse
Affiliation(s)
- Sabine E Hammer
- Laboratories of Genome Dynamics, Center of Anatomy and Cell Biology, Medical University of Vienna, Vienna, Austria
| | | | | |
Collapse
|
10
|
Nguyen N, Zhang X, Olashaw N, Seto E. Molecular Cloning and Functional Characterization of the Transcription Factor YY2. J Biol Chem 2004; 279:25927-34. [PMID: 15087442 DOI: 10.1074/jbc.m402525200] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
YY1 is a ubiquitous zinc finger transcription factor that binds to and regulates promoters and enhancers of many cellular and viral genes. Here we report the isolation of a human cDNA encoding a DNA sequence-specific binding protein with significant homology to the transcription factor YY1. A sequence analysis of this novel protein, YY2, revealed an overall 65% identity in the DNA sequence and a 56% identity in protein sequence compared with human YY1. The most pronounced similarity between YY1 and YY2 exists within the zinc finger regions of the two proteins, and consistent with this observation, YY2 can bind to and regulate some promoters known to be controlled by YY1. Similar to YY1, YY2 contains both transcriptional activation and repression functions. The finding of a protein with structure and function similar to YY1 provides a new opportunity to explore additional mechanisms by which YY1-responsive genes can be regulated and suggests that gene regulation by YY1 is far more complicated than previously assumed.
Collapse
Affiliation(s)
- Nang Nguyen
- Department of Medical Microbiology and Immunology, College of Medicine, University of South Florida, Tampa, Florida 33612, USA
| | | | | | | |
Collapse
|
11
|
Kherrouche Z, De Launoit Y, Monté D. Human E2F6 is alternatively spliced to generate multiple protein isoforms. Biochem Biophys Res Commun 2004; 317:749-60. [PMID: 15081404 DOI: 10.1016/j.bbrc.2004.03.099] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2004] [Indexed: 01/29/2023]
Abstract
E2F6 protein belongs to the family of the E2F transcription factors. Here, we showed that the human E2F6 gene contains nine exons distributed along 20.4kbp of genomic DNA on chromosome 2 leading to the transcription of six alternatively spliced E2F6 mRNAs that encode four different E2F6 proteins. Moreover, we identified an E2F6 pseudogene localized on chromosome 22 completely spliced and devoid of exons 2, 3, and 4, and part of exons 1 and 5. Definition of the transcriptional initiation site and sequence analysis show that the gene contains a TATA less, CAAT less, GC-rich promoter with multiple transcription start sites. Regulatory elements necessary for basal transcription reside within a 134bp fragment as determined by transient transfection experiments.
Collapse
Affiliation(s)
- Zoulika Kherrouche
- CNRS UMR 8117, Institut de Biologie de Lille, 1 rue Calmette, BP 447, 59021 Lille, France
| | | | | |
Collapse
|
12
|
Carroll EA, Gerrelli D, Gasca S, Berg E, Beier DR, Copp AJ, Klingensmith J. Cordon-bleu is a conserved gene involved in neural tube formation. Dev Biol 2003; 262:16-31. [PMID: 14512015 DOI: 10.1016/s0012-1606(03)00323-3] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The axial midline is an important source of patterning and morphogenesis cues in the vertebrate embryo. The midline derives from a small group of cells in the gastrulating embryo, known as "the organizer" in recognition of its ability to organize an entire body plan. The mammalian organizer, the node, gives rise to axial midline structures: the notochord, dorsal foregut, and part of the floor plate of the neural tube. Only some of the genes that direct midline development are known. In this study, we present the complete coding sequence for a novel gene, cordon-bleu (cobl), expressed specifically in the node and its derivatives until organogenesis stages. The deduced sequence does not resemble any gene of known function. However, cobl is widely conserved: apparent orthologs and paralogs are found in many vertebrate species, with several sequence domains of high conservation but unknown function. We find that chicken cordon-bleu is similarly expressed in the node and its derivatives, suggesting functional conservation. We also report the sequence and nonoverlapping expression of a related mouse gene, Coblr1. Finally, we show that cobl interacts with the neurulation gene Vangl2 to facilitate midbrain neural tube closure, demonstrating roles for both cobl and Vangl2 in midbrain neurulation.
Collapse
Affiliation(s)
- Elizabeth A Carroll
- Department of Cell Biology, Box 3709, Duke University Medical Center, Durham, NC 27710, USA
| | | | | | | | | | | | | |
Collapse
|
13
|
Philibert RA, Nelson JJ, Bedell B, Goedken R, Sandhu HK, Noyes R, Crowe RR. Role of elastin polymorphisms in panic disorder. Am J Med Genet B Neuropsychiatr Genet 2003; 117B:7-10. [PMID: 12555228 DOI: 10.1002/ajmg.b.10014] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Panic disorder (PD) is a complex neuropsychiatric disorder that has been associated with an increased frequency of mitral valve prolapse. Elastin is a prominent component of mitral valves and, in a genome screen of 23 pedigrees with PD, we found evidence of linkage to the region of chromosome 7 that contains the elastin gene (ELN). Therefore, we examined the minimal essential promoter and coding regions of ELN in 23 independent probands from the families in our linkage studies using single strand conformational polymorphism analysis. We found three polymorphisms including one that coded for a non-conservative amino acid change. However, none of these polymorphisms were associated with panic disorder in a case-control analysis or linked to it in multiplex pedigrees. In our pedigrees, exonic polymorphisms in ELN do not play a major role in the genetic vulnerability to PD.
Collapse
Affiliation(s)
- Robert A Philibert
- Department of Psychiatry, The University of Iowa, Rm 2-205 Psychiatry Research/MEB, Iowa City, Iowa, USA.
| | | | | | | | | | | | | |
Collapse
|
14
|
Igaz P, Fitzimons CP, Szalai C, Falus A. Histamine genomics in silico: polymorphisms of the human genes involved in the synthesis, action and degradation of histamine. AMERICAN JOURNAL OF PHARMACOGENOMICS : GENOMICS-RELATED RESEARCH IN DRUG DEVELOPMENT AND CLINICAL PRACTICE 2002; 2:67-72. [PMID: 12083955 DOI: 10.2165/00129785-200202010-00006] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
BACKGROUND Histamine is a ubiquitous biogenic amine involved in the regulation of numerous basic physiological and pathophysiological processes. The DNA sequences of the genes encoding proteins (enzymes and receptors) that participate in the synthesis, degradation and cellular binding of histamine are already identified. OBJECTIVE We analyzed the in silico available human sequences to find genetic polymorphisms in histamine-related genes (L-histidine decarboxylase, histamine receptors, histamine N-methyl transferase and diamine-oxidase), and compared these data with findings concerning structure-function relationships in order to get information about the possible pathophysiological relevance of these polymorphisms. METHODS Sequence analysis was performed at the National Center for Biotechnology Information Database. The search tool BLAST was applied. RESULTS Several sequence variations were found, and it is conceivable that some of these genetic polymorphisms may be related to various pathological conditions. Among sequence variations, variants with no amino acid change, variants resulting in amino acid alterations, and many nucleotide changes involving non-coding sequences were revealed. CONCLUSIONS Histamine genomics may provide a new tool for medical prediction and drug design in the future.
Collapse
Affiliation(s)
- Peter Igaz
- 2nd Department of Medicine, Faculty of Medicine, Semmelweis University Medical School, 1089 Budapest, Nagyvárad tér 4, Hungary
| | | | | | | |
Collapse
|
15
|
Abstract
Recent spectacular advances in the technologies and strategies for DNA sequencing have profoundly accelerated the detailed analysis of genomes from myriad organisms. The past few years alone have seen the publication of near-complete or draft versions of the genome sequence of several well-studied, multicellular organisms - most notably, the human. As well as providing data of fundamental biological significance, these landmark accomplishments have yielded important strategic insights that are guiding current and future genome-sequencing projects.
Collapse
Affiliation(s)
- E D Green
- Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.
| |
Collapse
|
16
|
Lu K, Lee MH, Carpten JD, Sekhon M, Patel SB. High-resolution physical and transcript map of human chromosome 2p21 containing the sitosterolaemia locus. Eur J Hum Genet 2001; 9:364-74. [PMID: 11378825 PMCID: PMC1350990 DOI: 10.1038/sj.ejhg.5200627] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2000] [Revised: 12/11/2000] [Accepted: 12/19/2000] [Indexed: 01/22/2023] Open
Abstract
Sitosterolaemia (phytosterolaemia) is an autosomal recessive disorder characterised by the presence of tendon xanthomas in the face of normal or mildly elevated plasma cholesterol levels, premature atherosclerotic disease and has diagnostically elevated plasma and tissue plant sterol concentrations. Affected individuals show an increased absorption of both cholesterol and sitosterol from the diet, decreased bile clearance of these sterols and their metabolites resulting in markedly expanded whole body cholesterol and sitosterol pools. The defective gene is therefore hypothesised to play a crucial role in regulating dietary cholesterol absorption, and its elucidation may shed light on these molecular processes. We have previously localised the defective gene to human chromosome 2p21, between microsatellite markers D2S1788 and D2S1352, a distance of approximately 15 cM. Recently, the disease locus interval has been narrowed to lie between D2S2294 and D2S2291/D2S2174. We have constructed a high-resolution YAC and BAC contigs by using known STSs and generating novel STSs from the minimal interval. Eight previously identified genes and 60 ESTs were mapped to these contigs. The BAC contig contains 60 BAC clones and 108 STSs and encompasses a physical distance of approximately 2.0 cM between microsatellite markers D2S2294 and D2S2291. These results will not only facilitate cloning of the sitosterolaemia gene, but also other disease genes located in this region, and accelerate sequencing of the corresponding genomic clones.
Collapse
Affiliation(s)
- Kangmo Lu
- Division of Endocrinology, Diabetes and Medical Genetics, Department of Medicine, Medical University of South Carolina, Charleston, SC 29403, USA
| | - Mi-Hye Lee
- Division of Endocrinology, Diabetes and Medical Genetics, Department of Medicine, Medical University of South Carolina, Charleston, SC 29403, USA
| | - John D Carpten
- Prostate Cancer Investigation Group, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mundeep Sekhon
- Genome Sequencing Center, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Shailendra B Patel
- Division of Endocrinology, Diabetes and Medical Genetics, Department of Medicine, Medical University of South Carolina, Charleston, SC 29403, USA
- *Correspondence: SB Patel, Division of Endocrinology, Diabetes and Medical Genetics, STR 541, 114 Doughty Street, Charleston, SC 29403, USA. Tel: +1 843 876 5227; Fax: +1 843 876 5133; E-mail:
| |
Collapse
|
17
|
Younger RM, Amadou C, Bethel G, Ehlers A, Lindahl KF, Forbes S, Horton R, Milne S, Mungall AJ, Trowsdale J, Volz A, Ziegler A, Beck S. Characterization of clustered MHC-linked olfactory receptor genes in human and mouse. Genome Res 2001; 11:519-30. [PMID: 11282967 PMCID: PMC311051 DOI: 10.1101/gr.gr-1603r] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Olfactory receptor (OR) loci frequently cluster and are present on most human chromosomes. They are members of the seven transmembrane receptor (7-TM) superfamily and, as such, are part of one of the largest mammalian multigene families, with an estimated copy number of up to 1000 ORs per haploid genome. As their name implies, ORs are known to be involved in the perception of odors and possibly also in other, nonolfaction-related, functions. Here, we report the characterization of ORs that are part of the MHC-linked OR clusters in human and mouse (partial sequence only). These clusters are of particular interest because of their possible involvement in olfaction-driven mate selection. In total, we describe 50 novel OR loci (36 human, 14 murine), making the human MHC-linked cluster the largest sequenced OR cluster in any organism so far. Comparative and phylogenetic analyses confirm the cluster to be MHC-linked but divergent in both species and allow the identification of at least one ortholog that will be useful for future regulatory and functional studies. Quantitative feature analysis shows clear evidence of duplications of blocks of OR genes and reveals the entire cluster to have a genomic environment that is very different from its neighboring regions. Based on in silico transcript analysis, we also present evidence of extensive long-distance splicing in the 5'-untranslated regions and, for the first time, of alternative splicing within the single coding exon of ORs. Taken together with our previous finding that ORs are also polymorphic, the presented data indicate that the expression, function, and evolution of these interesting genes might be more complex than previously thought.
Collapse
Affiliation(s)
- R M Younger
- The Sanger Centre, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Younger RM, Amadou C, Bethel G, Ehlers A, Lindahl KF, Forbes S, Horton R, Milne S, Mungall AJ, Trowsdale J, Volz A, Ziegler A, Beck S. Characterization of Clustered MHC-Linked Olfactory Receptor Genes in Human and Mouse. Genome Res 2001. [DOI: 10.1101/gr.160301] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Olfactory receptor (OR) loci frequently cluster and are present on most human chromosomes. They are members of the seven transmembrane receptor (7-TM) superfamily and, as such, are part of one of the largest mammalian multigene families, with an estimated copy number of up to 1000 ORs per haploid genome. As their name implies, ORs are known to be involved in the perception of odors and possibly also in other, nonolfaction-related, functions. Here, we report the characterization of ORs that are part of the MHC-linked OR clusters in human and mouse (partial sequence only). These clusters are of particular interest because of their possible involvement in olfaction-driven mate selection. In total, we describe 50 novel OR loci (36 human, 14 murine), making the human MHC-linked cluster the largest sequenced OR cluster in any organism so far. Comparative and phylogenetic analyses confirm the cluster to be MHC-linked but divergent in both species and allow the identification of at least one ortholog that will be useful for future regulatory and functional studies. Quantitative feature analysis shows clear evidence of duplications of blocks of OR genes and reveals the entire cluster to have a genomic environment that is very different from its neighboring regions. Based on in silico transcript analysis, we also present evidence of extensive long-distance splicing in the 5′-untranslated regions and, for the first time, of alternative splicing within the single coding exon of ORs. Taken together with our previous finding that ORs are also polymorphic, the presented data indicate that the expression, function, and evolution of these interesting genes might be more complex than previously thought.[The sequence data described in this paper have been submitted to the EMBL nucleotide data library under accession nos.Z84475, Z98744, Z98745, AL021807, AL021808, AL022723, AL022727,AL031893, AL035402, AL035542, AL050328, AL050339, AL078630, AL096770,AL121944, AL133160, and AL133267.]
Collapse
|
19
|
Braybrook C, Warry G, Howell G, Arnason A, Bjornsson A, Moore GE, Ross MT, Stanier P. Identification and characterization of KLHL4, a novel human homologue of the Drosophila Kelch gene that maps within the X-linked cleft palate and Ankyloglossia (CPX) critical region. Genomics 2001; 72:128-36. [PMID: 11401425 DOI: 10.1006/geno.2000.6478] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
X-linked cleft palate (CPX) is a rare nonsyndromic form of orofacial clefting that is, unlike more common forms, inherited as a highly penetrant Mendelian trait. Linkage studies using a large Icelandic kindred localized the gene to Xq21.3, and a physical map defining a 2.0-Mb candidate region was subsequently constructed. Genomic sequence is now available for much of the critical region and has been surveyed for potential transcriptional units. Through this analysis, we have identified a novel human homologue of Kelch, KLHL4. The transcript represents a mRNA of approximately 3.6 kb and encodes a protein of 718 amino acids. Protein domain analysis reveals six tandem repeats (Kelch repeats) at the C-terminus and a POZ/BTB protein-binding domain toward the N-terminus, characteristic of Drosophila Kelch and other family members. KLHL4 consists of 11 exons spanning a genomic interval of approximately 150 kb. From EST sequences and RT-PCR analysis, there is evidence for the use of alternative 3' UTRs. The mRNA is expressed in a range of fetal tissues including tongue, palate, and mandible. Mutational analysis in affected CPX patients revealed one sequence alteration that was most likely to be a silent polymorphism.
Collapse
Affiliation(s)
- C Braybrook
- Institute of Reproductive and Developmental Biology, Imperial College, Hammersmith Campus, Du Cane Road, London, W12 ONN, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Doudney K, Murdoch JN, Paternotte C, Bentley L, Gregory S, Copp AJ, Stanier P. Comparative physical and transcript maps of approximately 1 Mb around loop-tail, a gene for severe neural tube defects on distal mouse chromosome 1 and human chromosome 1q22-q23. Genomics 2001; 72:180-92. [PMID: 11401431 DOI: 10.1006/geno.2000.6463] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The homozygous loop-tail (Lp) mouse has a severe neural tube closure defect, analogous to the craniorachischisis phenotype seen in humans. Linkage analysis and physical mapping have previously localized the Lp locus to a region on mouse chromosome 1 defined by the markers D1Mit113-Tagln2. Here we report the construction of sequence-ready bacterial clone contigs encompassing the Lp critical region in both mouse and the orthologous human region (1q22-q23). Twenty-two genes, one EST, and one pseudogene have been identified using a combination of EST database screening, exon amplification, and genomic sequence analysis. The preliminary gene map is Cen-Estm33-AA693056-Ly9-Cd48-Slam-Cd84-Kiaa1215-Nhlh1-Kiaa0253-Copa-Pxf-H326-Pea15-Casq1-Atp1a4-Atp1a2-Estm34-Kcnj9-Kcnj10-Kiaa1355-Tagln2-Nesg1-Crp-Tel. The genes between Slam and Kiaa1355 are positional candidates for Lp. The comparative gene content and order are identical between mouse and human, indicating a high degree of conservation between the two species in this region. Together, the physical and transcript maps described here serve as resources for the identification of the Lp mutation and further define the conservation of this genomic region between mouse and human.
Collapse
Affiliation(s)
- K Doudney
- Institute of Reproductive and Developmental Biology, Imperial College, Hammersmith Campus, Du Cane Road, London, W12 ONN, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
21
|
Wendl MC, Marra MA, Hillier LW, Chinwalla AT, Wilson RK, Waterston RH. Theories and Applications for Sequencing Randomly Selected Clones. Genome Res 2001. [DOI: 10.1101/gr.133901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Theory is developed for the process of sequencing randomly selected large-insert clones. Genome size, library depth, clone size, and clone distribution are considered relevant properties and perfect overlap detection for contig assembly is assumed. Genome-specific and nonrandom effects are neglected. Order of magnitude analysis indicates library depth is of secondary importance compared to the other variables, especially as clone size diminishes. In such cases, the well-known Poisson coverage law is a good approximation. Parameters derived from these models are used to examine performance for the specific case of sequencing random human BAC clones. We compare coverage and redundancy rates for libraries possessing uniform and nonuniform clone distributions. Results are measured against data from map-based human-chromosome-2 sequencing. We conclude that the map-based approach outperforms random clone sequencing, except early in a project. However, simultaneous use of both strategies can be beneficial if a performance-based estimate for halting random clone sequencing is made. Results further show that the random approach yields maximum effectiveness using nonbiased rather than biased libraries.
Collapse
|
22
|
Miller-Podraza H. Polyglycosylceramides, Poly-N-acetyllactosamine-Containing Glycosphingolipids: Methods of Analysis, Structure, and Presumable Biological Functions. Chem Rev 2000; 100:4663-82. [PMID: 11749361 DOI: 10.1021/cr990347o] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- H Miller-Podraza
- Institute of Medical Biochemistry, Göteborg University, P.O. Box 440, SE 405 30 Göteborg, Sweden
| |
Collapse
|
23
|
Okada S, Fujisawa M, Sone T, Nakayama S, Nishiyama R, Takenaka M, Yamaoka S, Sakaida M, Kono K, Takahama M, Yamato KT, Fukuzawa H, Brennicke A, Ohyama K. Construction of male and female PAC genomic libraries suitable for identification of Y-chromosome-specific clones from the liverwort, Marchantia polymorpha. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2000; 24:421-8. [PMID: 11069714 DOI: 10.1046/j.1365-313x.2000.00882.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Unlike higher plants, the dioecious liverwort, Marchantia polymorpha, has uniquely small sex chromosomes, with X chromosomes present only in female gametophytes and Y chromosomes only in male gametophytes. We have constructed respective genomic libraries for male and female plantlets using a P1-derived artificial chromosome (pCYPAC2). With an average insert size of approximately 90 kb, each PAC library is estimated to cover the entire genome with a probability of more than 99.9%. Male-specific PAC clones were screened for by differential hybridization using male and female genomic DNAs as separate probes. Seventy male-specific PAC clones were identified. The male specificity of one of the clones, pMM4G7, was verified by Southern hybridization and PCR analysis. This clone was indeed located on the Y chromosome as verified by fluorescence in situ hybridization (FISH). This result shows that the Y chromosome contains unique sequences that are not present either on the X chromosome or any of the autosomes. Thus, the respective male and female libraries for M. polymorpha offer an opportunity to identify key genes involved in the process of sex differentiation and this unique system of sex determination.
Collapse
Affiliation(s)
- S Okada
- Laboratory of Plant Molecular Biology, Division of Integrated Life Science, Graduate School of Biostudies, Kyoto University, Kyoto 606-8502, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Mullikin JC, Hunt SE, Cole CG, Mortimore BJ, Rice CM, Burton J, Matthews LH, Pavitt R, Plumb RW, Sims SK, Ainscough RM, Attwood J, Bailey JM, Barlow K, Bruskiewich RM, Butcher PN, Carter NP, Chen Y, Clee CM, Coggill PC, Davies J, Davies RM, Dawson E, Francis MD, Joy AA, Lamble RG, Langford CF, Macarthy J, Mall V, Moreland A, Overton-Larty EK, Ross MT, Smith LC, Steward CA, Sulston JE, Tinsley EJ, Turney KJ, Willey DL, Wilson GD, McMurray AA, Dunham I, Rogers J, Bentley DR. An SNP map of human chromosome 22. Nature 2000; 407:516-20. [PMID: 11029003 DOI: 10.1038/35035089] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The human genome sequence will provide a reference for measuring DNA sequence variation in human populations. Sequence variants are responsible for the genetic component of individuality, including complex characteristics such as disease susceptibility and drug response. Most sequence variants are single nucleotide polymorphisms (SNPs), where two alternate bases occur at one position. Comparison of any two genomes reveals around 1 SNP per kilobase. A sufficiently dense map of SNPs would allow the detection of sequence variants responsible for particular characteristics on the basis that they are associated with a specific SNP allele. Here we have evaluated large-scale sequencing approaches to obtaining SNPs, and have constructed a map of 2,730 SNPs on human chromosome 22. Most of the SNPs are within 25 kilobases of a transcribed exon, and are valuable for association studies. We have scaled up the process, detecting over 65,000 SNPs in the genome as part of The SNP Consortium programme, which is on target to build a map of 1 SNP every 5 kilobases that is integrated with the human genome sequence and that is freely available in the public domain.
Collapse
|
25
|
Giacalone J, Delobette S, Gibaja V, Ni L, Skiadas Y, Qi R, Edington J, Lai Z, Gebauer D, Zhao H, Anantharaman T, Mishra B, Brown LG, Saxena R, Page DC, Schwartz DC. Optical mapping of BAC clones from the human Y chromosome DAZ locus. Genome Res 2000; 10:1421-9. [PMID: 10984460 PMCID: PMC310922 DOI: 10.1101/gr.112100] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/1999] [Accepted: 07/12/2000] [Indexed: 11/25/2022]
Abstract
The accurate mapping of clones derived from genomic regions containing complex arrangements of repeated elements presents special problems for DNA sequencers. Recent advances in the automation of optical mapping have enabled us to map a set of 16 BAC clones derived from the DAZ locus of the human Y chromosome long arm, a locus in which the entire DAZ gene as well as subsections within the gene copies have been duplicated. High-resolution optical mapping employing seven enzymes places these clones into two contigs representing four distinct copies of the DAZ gene and highlights a number of differences between individual copies of DAZ.
Collapse
Affiliation(s)
- J Giacalone
- W.M. Keck Laboratory for Biomolecular Imaging, Department of Chemistry, New York University, New York, New York 10003, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Abstract
Here we show that Lgl1 protein, cub-1-related proteins, coch-5b2-related proteins, coagulation factor C of horse-shoe crab and a predicted protein of Plasmodium falciparum share a homologous domain. Since this domain-type was first identified in Limulus factor C, Coch-5b2 and Lgl1 we propose the name LCCL for this domain-family. The LCCL module of coch-5b2 is of special biological interest because it has been shown recently that mutations affecting this module cause the deafness disorder DFNA9 in humans. With a view to defining the structure and function of the LCCL domain of human coch-5b2 protein, we have expressed it in Escherichia coli and subjected it to preliminary structural characterization. Structure prediction and circular dichroism studies on the recombinant protein indicate that the domain possesses both alpha helices and beta strands. It is shown that the mutations which cause hearing loss in humans affect residues that are critical for the integrity of the LCCL module of the coch-5b2 protein.
Collapse
Affiliation(s)
- M Trexler
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, Budapest, Hungary
| | | | | |
Collapse
|
27
|
Abstract
The human genome sequence will underpin human biology and medicine in the next century, providing a single, essential reference to all genetic information. The international program to determine the complete DNA sequence (3,000 million bases) is well underway. As of January 2000, 50% of the sequence is available in the public domain. A comprehensive working draft is expected this year, and the entire sequence is projected to be finished in 2003. DNA sequencing is carried out on mapped, overlapping bacterial clones of 150-200 kb. The working draft comprises assembled unfinished sequence and is released immediately in the public domain. The draft sequence of each clone is then completed, by closing any remaining gaps and resolving any ambiguities, before the entire sequence is checked, annotated, and submitted to the public databases. The sequence of each clone is finished to an accuracy of >99.99%. The availability of a reference sequence of the genome provides the basis for studying the nature of sequence variation, particularly single nucleotide polymorphisms (SNPs), in human populations. SNP typing is a powerful tool for genetic analysis, and will enable us to uncover the association of loci at specific sites in the genome with many disease traits. SNPs occur at a frequency of approximately 1 SNP/kb throughout the genome when the sequence of any two individuals is compared. Programs to detect and map SNPs in the human genome are underway with the aim of establishing a SNP map of the genome during the next two years. The human genome sequence will provide a complete description of all the genes. Annotation of the sequence with the gene structures is achieved by a combination of computational analysis (predictive and homology-based) and experimental confirmation by cDNA sequencing. Detecting homologies between newly defined gene products and proteins of known function helps to postulate biochemical functions for them, which can then be tested. Establishing the association of specific genes with disease phenotypes by mutation screening, particularly for monogenic disorders, provides further assistance in defining the functions of some gene products, as well as helping to establish the cause of the disease. As our knowledge of gene sequences and sequence variation in populations increases, we will pinpoint more and more of the genes and proteins that are important in common, complex diseases. A more detailed understanding of the function of the human genome will be achieved as we identify sequences that control gene expression. Given the availability of gene sequences, the expression status of genes in particular tissues can be monitored in parallel. By comparing corresponding genomic sequences in different species (for example: man, mouse, chicken, and zebrafish), regions that have been highly conserved during evolution can be identified, many of which reflect conserved functions such as gene regulation. These approaches promise to greatly accelerate our interpretation of the human genome sequence.
Collapse
Affiliation(s)
- D R Bentley
- The Sanger Centre, The Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| |
Collapse
|
28
|
Thomas JW, Summers TJ, Lee-Lin SQ, Maduro VV, Idol JR, Mastrian SD, Ryan JF, Jamison DC, Green ED. Comparative genome mapping in the sequence-based era: early experience with human chromosome 7. Genome Res 2000; 10:624-33. [PMID: 10810084 PMCID: PMC310865 DOI: 10.1101/gr.10.5.624] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The success of the ongoing Human Genome Project has resulted in accelerated plans for completing the human genome sequence and the earlier-than-anticipated initiation of efforts to sequence the mouse genome. As a complement to these efforts, we are utilizing the available human sequence to refine human-mouse comparative maps and to assemble sequence-ready mouse physical maps. Here we describe how the first glimpses of genomic sequence from human chromosome 7 are directly facilitating these activities. Specifically, we are actively enhancing the available human-mouse comparative map by analyzing human chromosome 7 sequence for the presence of orthologs of mapped mouse genes. Such orthologs can then be precisely positioned relative to mapped human STSs and other genes. The chromosome 7 sequence generated to date has allowed us to more than double the number of genes that can be placed on the comparative map. The latter effort reveals that human chromosome 7 is represented by at least 20 orthologous segments of DNA in the mouse genome. A second component of our program involves systematically analyzing the evolving human chromosome 7 sequence for the presence of matching mouse genes and expressed-sequence tags (ESTs). Mouse-specific hybridization probes are designed from such sequences and used to screen a mouse bacterial artificial chromosome (BAC) library, with the resulting data used to assemble BAC contigs based on probe-content data. Nascent contigs are then expanded using probes derived from newly generated BAC-end sequences. This approach produces BAC-based sequence-ready maps that are known to contain a gene(s) and are homologous to segments of the human genome for which sequence is already available. Our ongoing efforts have thus far resulted in the isolation and mapping of >3,800 mouse BACs, which have been assembled into >100 contigs. These contigs include >250 genes and represent approximately 40% of the mouse genome that is homologous to human chromosome 7. Together, these approaches illustrate how the availability of genomic sequence directly facilitates studies in comparative genomics and genome evolution.
Collapse
Affiliation(s)
- J W Thomas
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892 USA
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Wilson MJ, Torkar M, Haude A, Milne S, Jones T, Sheer D, Beck S, Trowsdale J. Plasticity in the organization and sequences of human KIR/ILT gene families. Proc Natl Acad Sci U S A 2000; 97:4778-83. [PMID: 10781084 PMCID: PMC18309 DOI: 10.1073/pnas.080588597] [Citation(s) in RCA: 500] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The approximately 1-Mb leukocyte receptor complex at 19q13.4 is a key polymorphic immunoregion containing all of the natural killer-receptor KIR and related ILT genes. When the organization of the leukocyte receptor complex was compared from two haplotypes, the gene content in the KIR region varied dramatically, with framework loci flanking regions of widely variable gene content. The ILT genes were more stable in number except for ILT6, which was present only in one haplotype. Analysis of Alu repeats and comparison of KIR gene sequences, which are over 90% identical, are consistent with a recent origin. KIR genesis was followed by extensive duplication/deletion as well as intergenic sequence exchange, reminiscent of MHC class I genes, which provide KIR ligands.
Collapse
Affiliation(s)
- M J Wilson
- Immunology Division, Department of Pathology, Tennis Court Road, Cambridge CB2 1QP, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Abstract
It is now widely accepted that cancer is attributed to the accumulation of genetic alterations in cells. Thus, to understand the molecular mechanisms of cancer metastasis, it is indispensable to identify the genes whose alterations accumulate during cancer progression as well as the genes whose expression is responsible for the acquisition of metastatic potential in cancer cells. Molecular analyses of cancer cells in various stages of progression have revealed that alterations in tumor suppressor genes and oncogenes accumulate during tumor progression and correlate with the clinical aggressiveness of cancer. Comparative analyses of gene expression profiles between metastatic and non-metastatic cells have revealed that various genes are differentially expressed in association with the metastatic potential of cancer cells. A number of genes have been also identified as having functions in inducing or suppressing metastasis in experimental models. However, the association between causative genetic alterations and resulting phenotypic alterations with respect to the metastatic potential of cancer cells is not fully understood. Therefore, elucidation of genotype-phenotype correlation will be required to further understand a complex process of metastasis. Here, I review the progress on molecular studies of tumor progression and metastasis of the past 20 years and discuss the future direction in this field of science.
Collapse
Affiliation(s)
- J Yokota
- Biology Division, National Cancer Center Research Institute, 1-1 Tsukiji 5-chome, Chuo-ku, Tokyo 104-0045, Japan.
| |
Collapse
|
31
|
Ellsworth RE, Jamison DC, Touchman JW, Chissoe SL, Braden Maduro VV, Bouffard GG, Dietrich NL, Beckstrom-Sternberg SM, Iyer LM, Weintraub LA, Cotton M, Courtney L, Edwards J, Maupin R, Ozersky P, Rohlfing T, Wohldmann P, Miner T, Kemp K, Kramer J, Korf I, Pepin K, Antonacci-Fulton L, Fulton RS, Minx P, Hillier LW, Wilson RK, Waterston RH, Miller W, Green ED. Comparative genomic sequence analysis of the human and mouse cystic fibrosis transmembrane conductance regulator genes. Proc Natl Acad Sci U S A 2000; 97:1172-7. [PMID: 10655503 PMCID: PMC15558 DOI: 10.1073/pnas.97.3.1172] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The identification of the cystic fibrosis transmembrane conductance regulator gene (CFTR) in 1989 represents a landmark accomplishment in human genetics. Since that time, there have been numerous advances in elucidating the function of the encoded protein and the physiological basis of cystic fibrosis. However, numerous areas of cystic fibrosis biology require additional investigation, some of which would be facilitated by information about the long-range sequence context of the CFTR gene. For example, the latter might provide clues about the sequence elements responsible for the temporal and spatial regulation of CFTR expression. We thus sought to establish the sequence of the chromosomal segments encompassing the human CFTR and mouse Cftr genes, with the hope of identifying conserved regions of biologic interest by sequence comparison. Bacterial clone-based physical maps of the relevant human and mouse genomic regions were constructed, and minimally overlapping sets of clones were selected and sequenced, eventually yielding approximately 1.6 Mb and approximately 358 kb of contiguous human and mouse sequence, respectively. These efforts have produced the complete sequence of the approximately 189-kb and approximately 152-kb segments containing the human CFTR and mouse Cftr genes, respectively, as well as significant amounts of flanking DNA. Analyses of the resulting data provide insights about the organization of the CFTR/Cftr genes and potential sequence elements regulating their expression. Furthermore, the generated sequence reveals the precise architecture of genes residing near CFTR/Cftr, including one known gene (WNT2/Wnt2) and two previously unknown genes that immediately flank CFTR/Cftr.
Collapse
Affiliation(s)
- R E Ellsworth
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Peoples R, Franke Y, Wang YK, Pérez-Jurado L, Paperna T, Cisco M, Francke U. A physical map, including a BAC/PAC clone contig, of the Williams-Beuren syndrome--deletion region at 7q11.23. Am J Hum Genet 2000; 66:47-68. [PMID: 10631136 PMCID: PMC1288354 DOI: 10.1086/302722] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Williams-Beuren syndrome (WBS) is a developmental disorder caused by haploinsufficiency for genes in a 2-cM region of chromosome band 7q11.23. With the exception of vascular stenoses due to deletion of the elastin gene, the various features of WBS have not yet been attributed to specific genes. Although >/=16 genes have been identified within the WBS deletion, completion of a physical map of the region has been difficult because of the large duplicated regions flanking the deletion. We present a physical map of the WBS deletion and flanking regions, based on assembly of a bacterial artificial chromosome/P1-derived artificial chromosome contig, analysis of high-throughput genome-sequence data, and long-range restriction mapping of genomic and cloned DNA by pulsed-field gel electrophoresis. Our map encompasses 3 Mb, including 1.6 Mb within the deletion. Two large duplicons, flanking the deletion, of >/=320 kb contain unique sequence elements from the internal border regions of the deletion, such as sequences from GTF2I (telomeric) and FKBP6 (centromeric). A third copy of this duplicon exists in inverted orientation distal to the telomeric flanking one. These duplicons show stronger sequence conservation with regard to each other than to the presumptive ancestral loci within the common deletion region. Sequence elements originating from beyond 7q11.23 are also present in these duplicons. Although the duplicons are not present in mice, the order of the single-copy genes in the conserved syntenic region of mouse chromosome 5 is inverted relative to the human map. A model is presented for a mechanism of WBS-deletion formation, based on the orientation of duplicons' components relative to each other and to the ancestral elements within the deletion region.
Collapse
Affiliation(s)
- Risa Peoples
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| | - Yvonne Franke
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| | - Yu-Ker Wang
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| | - Luis Pérez-Jurado
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| | - Tamar Paperna
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| | - Michael Cisco
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| | - Uta Francke
- Department of Genetics, Stanford University School of Medicine, and Howard Hughes Medical Institute, Stanford; and Servicio de Genética, Hospital Universitario La Paz, and Departamento de Bioquimica, Facultad de Medicina, Universidad Autonoma de Madrid, Madrid
| |
Collapse
|
33
|
Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE, Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough R, Almeida JP, Babbage A, Bagguley C, Bailey J, Barlow K, Bates KN, Beasley O, Bird CP, Blakey S, Bridgeman AM, Buck D, Burgess J, Burrill WD, O'Brien KP. The DNA sequence of human chromosome 22. Nature 1999; 402:489-95. [PMID: 10591208 DOI: 10.1038/990031] [Citation(s) in RCA: 813] [Impact Index Per Article: 32.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome.
Collapse
Affiliation(s)
- I Dunham
- Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Marth GT, Korf I, Yandell MD, Yeh RT, Gu Z, Zakeri H, Stitziel NO, Hillier L, Kwok PY, Gish WR. A general approach to single-nucleotide polymorphism discovery. Nat Genet 1999; 23:452-6. [PMID: 10581034 DOI: 10.1038/70570] [Citation(s) in RCA: 440] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Single-nucleotide polymorphisms (SNPs) are the most abundant form of human genetic variation and a resource for mapping complex genetic traits. The large volume of data produced by high-throughput sequencing projects is a rich and largely untapped source of SNPs (refs 2, 3, 4, 5). We present here a unified approach to the discovery of variations in genetic sequence data of arbitrary DNA sources. We propose to use the rapidly emerging genomic sequence as a template on which to layer often unmapped, fragmentary sequence data and to use base quality values to discern true allelic variations from sequencing errors. By taking advantage of the genomic sequence we are able to use simpler yet more accurate methods for sequence organization: fragment clustering, paralogue identification and multiple alignment. We analyse these sequences with a novel, Bayesian inference engine, POLYBAYES, to calculate the probability that a given site is polymorphic. Rigorous treatment of base quality permits completely automated evaluation of the full length of all sequences, without limitations on alignment depth. We demonstrate this approach by accurate SNP predictions in human ESTs aligned to finished and working-draft quality genomic sequences, a data set representative of the typical challenges of sequence-based SNP discovery.
Collapse
Affiliation(s)
- G T Marth
- Washington University Department of Genetics and Genome Sequencing Center, St. Louis, Missouri, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
White PS, Sulman EP, Porter CJ, Matise TC. A comprehensive view of human chromosome 1. Genome Res 1999; 9:978-88. [PMID: 10523526 PMCID: PMC310818 DOI: 10.1101/gr.9.10.978] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/1999] [Accepted: 08/18/1999] [Indexed: 12/25/2022]
Abstract
Comprehensive representations of human chromosomes combining diverse genomic data sets, localizing expressed sequences, and reflecting physical distance are essential for disease gene identification and sequencing efforts. We have developed a method (CompView) for integrating genomic information derived from available cytogenetic, genetic linkage, radiation hybrid, physical, and transcript-based mapping approaches. CompView generates chromosome representations with substantially higher resolution, coverage, and integration than current maps of the human genome. The CompView process was used to build a representation of human chromosome 1, yielding a map with >13,000 unique elements, an effective resolution of 910 kb, and a marker density of 50 kb. CompView creates comprehensive and fully integrated depictions of a chromosome's clinical, biological, and structural information.
Collapse
Affiliation(s)
- P S White
- Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104 USA.
| | | | | | | |
Collapse
|
36
|
Stephens R, Horton R, Humphray S, Rowen L, Trowsdale J, Beck S. Gene organisation, sequence variation and isochore structure at the centromeric boundary of the human MHC. J Mol Biol 1999; 291:789-99. [PMID: 10452889 DOI: 10.1006/jmbi.1999.3004] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We have mapped and sequenced the region immediately centromeric of the human major histocompatibility complex (MHC). A cluster of 13 genes/pseudogenes was identified in a 175 kb PAC linking the TAPASIN locus with the class II region. It includes two novel human genes (BING4 and SACM2L) and a thus far unnoticed human leucocyte antigen (HLA) class II pseudogene, termed HLA-DPA3. Analysis of the G+C content revealed an isochore boundary which, together with the previously reported telomeric boundary, defines the MHC class II region as one of the first completely sequenced isochores in the human genome. Comparison of the sequence with limited sequence from other cell lines shows that the high sequence variation found within the classical class II region extends beyond the identified isochore boundary leading us to propose the concept of an "extended MHC". By comparative analysis, we have precisely identified the mouse/human synteny breakpoint at the centromeric end of the extended MHC class II region between the genes HSET and PHF1.
Collapse
Affiliation(s)
- R Stephens
- Immunology, Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP, UK
| | | | | | | | | | | |
Collapse
|