1
|
Besnard J, Sonawala U, Maharjan B, Collakova E, Finlayson SA, Pilot G, McDowell J, Okumoto S. Increased Expression of UMAMIT Amino Acid Transporters Results in Activation of Salicylic Acid Dependent Stress Response. FRONTIERS IN PLANT SCIENCE 2021; 11:606386. [PMID: 33574824 PMCID: PMC7870477 DOI: 10.3389/fpls.2020.606386] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 12/17/2020] [Indexed: 05/29/2023]
Abstract
In addition to their role in the biosynthesis of important molecules such as proteins and specialized metabolites, amino acids are known to function as signaling molecules through various pathways to report nitrogen status and trigger appropriate metabolic and cellular responses. Moreover, changes in amino acid levels through altered amino acid transporter activities trigger plant immune responses. Specifically, loss of function of major amino acid transporter, over-expression of cationic amino acid transporter, or over-expression of the positive regulators of membrane amino acid export all lead to dwarfed phenotypes and upregulated salicylic acid (SA)-induced stress marker genes. However, whether increasing amino acid exporter protein levels lead to similar stress phenotypes has not been investigated so far. Recently, a family of transporters, namely USUALLY MULTIPLE ACIDS MOVE IN AND OUT TRANSPORTERS (UMAMITs), were identified as amino acid exporters. The goal of this study was to investigate the effects of increased amino acid export on plant development, growth, and reproduction to further examine the link between amino acid transport and stress responses. The results presented here show strong evidence that an increased expression of UMAMIT transporters induces stress phenotypes and pathogen resistance, likely due to the establishment of a constitutive stress response via a SA-dependent pathway.
Collapse
Affiliation(s)
- Julien Besnard
- Department of Plant Pathology, Physiology and Weed Science, Virginia Tech, Blacksburg, VA, United States
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Texas A&M University, College Station, TX, United States
| | - Unnati Sonawala
- Department of Plant Pathology, Physiology and Weed Science, Virginia Tech, Blacksburg, VA, United States
| | - Bal Maharjan
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Texas A&M University, College Station, TX, United States
| | - Eva Collakova
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| | - Scott A. Finlayson
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Texas A&M University, College Station, TX, United States
- Faculty of Molecular and Environmental Plant Sciences, Texas A&M University, College Station, TX, United States
| | - Guillaume Pilot
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| | - John McDowell
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| | - Sakiko Okumoto
- Department of Plant Pathology, Physiology and Weed Science, Virginia Tech, Blacksburg, VA, United States
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Texas A&M University, College Station, TX, United States
| |
Collapse
|
2
|
Watanabe Y, Shibata K, Maekawa M. Cell line differences in replication timing of human glutamate receptor genes and other large genes associated with neural disease. Epigenetics 2014; 9:1350-9. [PMID: 25437050 PMCID: PMC4622467 DOI: 10.4161/15592294.2014.967585] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Revised: 09/04/2014] [Accepted: 09/11/2014] [Indexed: 01/30/2023] Open
Abstract
There is considerable current interest in the function of epigenetic mechanisms in neuroplasticity with regard to learning and memory formation and to a range of neural diseases. Previously, we described replication timing on human chromosome 21q in the THP-1 human cell line (2n = 46, XY) and showed that several genes associated with neural diseases, such as the neuronal glutamate receptor subunit GluR-5 (GRIK1) and amyloid precursor protein (APP), were located in regions where replication timing transitioned from early to late S phase. Here, we compared replication timing of all known human glutamate receptor genes (26 genes in total) and APP in 6 different human cell lines including human neuron-related cell lines. Replication timings were obtained by integrating our previously reported data with new data generated here and information from the online database ReplicationDomain. We found that many of the glutamate receptor genes were clearly located in replication timing transition zones in neural precursor cells, but this relationship was less clear in embryonic stem cells before neural differentiation; in the latter, the genes were often located in later replication timing zones that displayed DNA hypermethylation. Analysis of selected large glutamate receptor genes (> 200 kb), and of APP, showed that their precise replication timing patterns differed among the cell lines. We propose that the transition zones of DNA replication timing are altered by epigenetic mechanisms, and that these changes may affect the neuroplasticity that is important to memory and learning, and may also have a role in the development of neural diseases.
Collapse
Affiliation(s)
- Yoshihisa Watanabe
- Department of Laboratory Medicine; Hamamatsu University School of Medicine; Hamamatsu, Japan
| | - Kiyoshi Shibata
- Research Equipment Center; Hamamatsu University School of Medicine; Hamamatsu, Japan
| | - Masato Maekawa
- Department of Laboratory Medicine; Hamamatsu University School of Medicine; Hamamatsu, Japan
| |
Collapse
|
3
|
R/G-band boundaries: genomic instability and human disease. Clin Chim Acta 2013; 419:108-12. [PMID: 23434413 DOI: 10.1016/j.cca.2013.02.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Revised: 01/29/2013] [Accepted: 02/03/2013] [Indexed: 01/19/2023]
Abstract
The human genome is composed of large-scale compartmentalized structures resulting from variations in the amount of guanine and cytosine residues (GC%) and in the timing of DNA replication. These compartmentalized structures are related to the light- and dark-staining bands along chromosomes after the appropriate staining. Here we describe our current understanding of the biological importance of the boundaries between these light and dark bands (the so-called R/G boundaries). These R/G boundaries were identified following integration of information obtained from analyses of chromosome bands and genome sequences. This review also discusses the potential medical significance of these chromosomal regions for conditions related to genomic instability, such as cancer and neural disease. We propose that R/G-chromosomal boundaries, which correspond to regions showing a switch in replication timing from early to late S phase (early/late-switch regions) and of transition in GC%, have an extremely low number of replication origins and more non-B-form DNA structures than other genomic regions. Further, we suggest that genes located at R/G boundaries and which contain such DNA sequences have an increased risk of genetic instability and of being associated with human diseases. Finally, we propose strategies for genome and epigenome analyses based on R/G boundaries.
Collapse
|
4
|
Watanabe Y, Abe T, Ikemura T, Maekawa M. Relationships between replication timing and GC content of cancer-related genes on human chromosomes 11q and 21q. Gene 2009; 433:26-31. [DOI: 10.1016/j.gene.2008.12.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2008] [Revised: 11/28/2008] [Accepted: 12/05/2008] [Indexed: 10/21/2022]
|
5
|
Watanabe Y, Shibata K, Ikemura T, Maekawa M. Replication timing of extremely large genes on human chromosomes 11q and 21q. Gene 2008; 421:74-80. [DOI: 10.1016/j.gene.2008.06.016] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2008] [Revised: 06/13/2008] [Accepted: 06/16/2008] [Indexed: 01/10/2023]
|
6
|
Hrabcová I, Kypr J. The Longest (A+T) and (G+C) Blocks in the Human and Other Genomes. J Biomol Struct Dyn 2008; 25:337-45. [DOI: 10.1080/07391102.2008.10507182] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
7
|
Watanabe Y, Shibata K, Sugimura H, Maekawa M. p53-Dependent change in replication timing of the human genome. Biochem Biophys Res Commun 2007; 364:289-93. [DOI: 10.1016/j.bbrc.2007.09.136] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2007] [Accepted: 09/30/2007] [Indexed: 10/22/2022]
|
8
|
Watanabe Y, Ikemura T, Sugimura H. Amplicons on human chromosome 11q are located in the early/late-switch regions of replication timing. Genomics 2004; 84:796-805. [PMID: 15475258 DOI: 10.1016/j.ygeno.2004.08.001] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2004] [Accepted: 08/04/2004] [Indexed: 11/16/2022]
Abstract
Amplicons are frequently found in human tumor genomes, but the mechanism of their generation is still poorly understood. We previously measured the replication timing of the genes along the entire length of human chromosomes 11q and 21q and found that many "disease-related" genes are located in timing-transition regions. In this study, further scrutiny of the updated replication-timing map of human chromosome 11q revealed that both amplicons on human chromosomal bands 11q13 and 11q22 are located in the early/late-switch regions of replication timing in two human cell lines (THP-1 and Jurkat). Moreover, examination of synteny in the human and mouse genomes revealed that synteny breakage in both genomes occurred primarily at the early/late-switch regions of replication timing that we had identified. In conclusion, we found that the early/late-switch regions of replication timing coincided with "unstable" regions of the genome.
Collapse
Affiliation(s)
- Yoshihisa Watanabe
- First Department of Pathology, Hamamatsu University School of Medicine, 1-20-1 Handayama, Hamamatsu, Shizuoka 431-3192, Japan
| | | | | |
Collapse
|
9
|
Iwase M, Satta Y, Hirai Y, Hirai H, Imai H, Takahata N. The amelogenin loci span an ancient pseudoautosomal boundary in diverse mammalian species. Proc Natl Acad Sci U S A 2003; 100:5258-63. [PMID: 12672962 PMCID: PMC154332 DOI: 10.1073/pnas.0635848100] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The mammalian amelogenin (AMEL) genes are found on both the X and Y chromosomes (gametologous). Comparison of the genomic AMEL sequences in five primates and three other mammals reveals that the 5' portion of the gametologous AMEL loci began to differentiate in the common ancestor of extant mammals, whereas the 3' portion differentiated independently within species of different mammals. The boundary is marked by a transposon insertion in intron 2 and is shared by all species examined. In addition, 540-kb DNA sequences from the short arm of the human X chromosome are aligned with their Y gametologous sequences. The pattern and extent of sequence differences in the 5' portion of the AMEL loci extend to a proximal region that contains the ZFX locus, and those in the 3' portion extend all the way down to the pseudoautosomal boundary (PAB)1. We concluded that the AMEL locus spans an ancient PAB, and that both the ancient and present PABs were determined by chance events during the evolution of mammals and primates. Sex chromosome differentiation likely took place in a region that contains the male-determining loci by suppressing homologous recombination.
Collapse
Affiliation(s)
- Mineyo Iwase
- Department of Biosystems Science, Graduate University for Advanced Studies (Sokendai), Hayama, Kanagawa 240-0193, Japan
| | | | | | | | | | | |
Collapse
|
10
|
Zeeberg B. Shannon information theoretic computation of synonymous codon usage biases in coding regions of human and mouse genomes. Genome Res 2002; 12:944-55. [PMID: 12045147 PMCID: PMC1383734 DOI: 10.1101/gr.213402] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2001] [Accepted: 03/06/2002] [Indexed: 11/24/2022]
Abstract
Exonic GC of human mRNA reference sequences (RefSeqs), as well as A, C, G, and T in codon position 3 are linearly correlated with genomic GC. These observations utilize information from the completed human genome sequence and a large, high-quality set of human and mouse coding sequences, and are in accord with similar determinations published by others. A Shannon Information Theoretic measure of bias in synonymous codon usage was developed. When applied to either human or mouse RefSeqs, this measure is nonlinearly correlated with genomic, exonic, and third codon position A, C, G, and T. Information values between orthologous mouse and human RefSeqs are linearly correlated: mouse = 0.092 + 0.55 human. Mouse genes were consistently placed in genomic regions whose GC content was closer to 50% than was the GC content of the human ortholog. Since the (nonlinear) information versus percent GC curve has a minimum at 50% GC and monotonically increases with increasing distance from 50% GC, this phenomenon directly results in the low slope of 0.55. This appears to be a manifestation of an evolutionary strategy for placement of genes in regions of the genome with a GC content that relates synonymous codon bias and protein folding.
Collapse
Affiliation(s)
- Barry Zeeberg
- Laboratory of Molecular Pharmacology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| |
Collapse
|
11
|
Abstract
The human genome is described in the literature as being composed of the isochores, i.e., long (hundreds of kilobases) segments with a homogeneous (G + C) content. We calculated the (G + C) content variations along the DNA molecules of the human chromosomes 21 and 22 and found the variations to be higher everywhere compared to the randomized sequences. Hence the (G + C) content is certainly not homogeneous on the isochore scale in the two human chromosomes. In addition, we found no significant difference between the two human molecules and the genome of E. coli regarding the (G + C) content variations. Hence no isochores are either present in the DNA molecules of the human chromosomes 21 and 22, or the isochores are also present in the genome of Escherichia coli. In any case, the present communication demonstrates that the isochores should be defined in unambiguous molecular terms if they are to be used for an up-to-date genome structure characterization.
Collapse
Affiliation(s)
- D Häring
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Královopolská 135, Brno, CZ-61265, Czech Republic
| | | |
Collapse
|
12
|
Watanabe Y, Tenzen T, Nagasaka Y, Inoko H, Ikemura T. Replication timing of the human X-inactivation center (XIC) region: correlation with chromosome bands. Gene 2000; 252:163-72. [PMID: 10903448 DOI: 10.1016/s0378-1119(00)00208-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The human genome is composed of long-range G+C% mosaic structures, which are thought to be related to chromosome bands. Replication timing during S phase is associated with chromosomal band zones; thus, band boundaries are thought to correspond to regions where replication timing switches. The proximal limit of the human X-inactivation center (XIC) has been localized cytologically to the junction zone between Xq13.1 and Xq13.2. Using PCR-based quantification of the newly replicated DNA from cell-cycle fractionated THP-1 cells, the replication timing in and around the XIC was determined at the genome sequence level. We found two regions where replication timing changes from the early to late period during S phase. One is located near a large inverted duplication proximal to the XIC, and the other is near the XIST locus. We propose that the 1Mb late-replicated zone (from the large inverted duplication to XIST) corresponds to a G-band Xq13.2. Several common characteristics were observed in the XIST region and the MHC class II-III junction which was previously defined as a band boundary. These characteristics included differential high-density clustering of Alu and LINE repeats, and the presence of polypurine/polypyrimidine tracts, MER41A, MER57 and MER58B.
Collapse
Affiliation(s)
- Y Watanabe
- Division of Evolutionary Genetics, Department of Population Genetics, National Institute of Genetics, Yata 1111, Mishima, 411-8540, Shizuoka-ken, Japan
| | | | | | | | | |
Collapse
|
13
|
Abstract
The nuclear genomes of vertebrates are mosaics of isochores, very long stretches (>>300kb) of DNA that are homogeneous in base composition and are compositionally correlated with the coding sequences that they embed. Isochores can be partitioned in a small number of families that cover a range of GC levels (GC is the molar ratio of guanine+cytosine in DNA), which is narrow in cold-blooded vertebrates, but broad in warm-blooded vertebrates. This difference is essentially due to the fact that the GC-richest 10-15% of the genomes of the ancestors of mammals and birds underwent two independent compositional transitions characterized by strong increases in GC levels. The similarity of isochore patterns across mammalian orders, on the one hand, and across avian orders, on the other, indicates that these higher GC levels were then maintained, at least since the appearance of ancestors of warm-blooded vertebrates. After a brief review of our current knowledge on the organization of the vertebrate genome, evidence will be presented here in favor of the idea that the generation and maintenance of the GC-richest isochores in the genomes of warm-blooded vertebrates were due to natural selection.
Collapse
Affiliation(s)
- G Bernardi
- Laboratorio di Evoluzione Molecolare, Stazione Zoologica Anton Dohrn, Napoli, Italy.
| |
Collapse
|
14
|
Häring D, Kypr J. Variations of the mononucleotide and short oligonucleotide distributions in the genomes of various organisms. J Theor Biol 1999; 201:141-56. [PMID: 10556022 DOI: 10.1006/jtbi.1999.1019] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
We calculated the variation coefficients of the mononucleotide and short oligonucleotide distributions in over 1700 long genomic sequences originating from six organisms to demonstrate that the human and Escherichia coli genomic sequences were the least and the most uniform, respectively. The most non-random genomic distributions were exhibited by the four canonical nucleotides, followed by the strong and weak nucleotides, while the distributions of purine or pyrimidine nucleotides and especially the distributions of (A+C) and (G+T) were significantly more uniform even in the human genome. In the human and mouse genomes, the highest coefficients of variation were further observed with the oligonucleotides where CG was combined with the strong nucleotides while its combination with the weak nucleotides significantly decreased the variation which, however, was still very high. High variation was also exhibited by the remaining oligonucleotides composed exclusively of the strong nucleotides or those containing only weak nucleotides. On the other hand, the distributions of oligonucleotides containing similar and especially the same numbers of the strong and weak nucleotides, but no CG or TA dinucleotide, were the most uniform. The information following from the present analysis will be useful not only in the identification of important genomic regions but also in computer simulations of the genomic nucleotide sequences in order to trace and reproduce the pathways of genome evolution.
Collapse
Affiliation(s)
- D Häring
- Academy of Sciences of the Czech Republic, Královopolská 135, Brno, CZ-61265, Czech Republic
| | | |
Collapse
|
15
|
Abstract
BACKGROUND Nucleotide substitution rates and G + C content vary considerably among mammalian genes. It has been proposed that the mammalian genome comprises a mosaic of regions - termed isochores - with differing G + C content. The regional variation in gene G + C content might therefore be a reflection of the isochore structure of chromosomes, but the factors influencing the variation of nucleotide substitution rate are still open to question. RESULTS To examine whether nucleotide substitution rates and gene G + C content are influenced by the chromosomal location of genes, we compared human and murid (mouse or rat) orthologues known to belong to one of the chromosomal (autosomal) segments conserved between these species. Multiple members of gene families were excluded from the dataset. Sets of neighbouring genes were defined as those lying within 1 centiMorgan (cM) of each other on the mouse genetic map. For both synonymous substitution rates and G + C content at silent sites, neighbouring genes were found to be significantly more similar to each other than sets of genes randomly drawn from the dataset. Moreover, we demonstrated that the regional similarities in G + C content (isochores) and synonymous substitution rate were independent of each other. CONCLUSIONS Our results provide the first substantial statistical evidence for the existence of a regional variation in the synonymous substitution rate within the mammalian genome, indicating that different chromosomal regions evolve at different rates. This regional phenomenon which shapes gene evolution could reflect the existence of 'evolutionary rate units' along the chromosome.
Collapse
Affiliation(s)
- G Matassi
- Institute of Genetics, University of Nottingham, Queens Medical Centre, Nottingham, NG7 2UH, UK.
| | | | | |
Collapse
|
16
|
|
17
|
Tenzen T, Yamagata T, Fukagawa T, Sugaya K, Ando A, Inoko H, Gojobori T, Fujiyama A, Okumura K, Ikemura T. Precise switching of DNA replication timing in the GC content transition area in the human major histocompatibility complex. Mol Cell Biol 1997; 17:4043-50. [PMID: 9199339 PMCID: PMC232257 DOI: 10.1128/mcb.17.7.4043] [Citation(s) in RCA: 74] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The human genome is composed of long-range G+C% (GC%) mosaic structures thought to be related to chromosome bands. We previously reported a boundary of megabase-sized GC% mosaic domains at the junction area between major histocompatibility complex (MHC) classes II and III, proposing it as a possible chromosome band boundary. DNA replication timing during the S phase is known to be correlated cytogenetically with chromosome band zones, and thus the band boundaries have been predicted to contain a switch point for DNA replication timing. In this study, to identify to the nucleotide sequence level the replication switch point during the S phase, we determined the precise DNA replication timing for MHC classes II and III, focusing on the junction area. To do this, we used PCR-based quantitation of nascent DNA obtained from synchronized human myeloid leukemia HL60 cells. The replication timing changed precisely in the boundary region with a 2-h difference between the two sides, supporting the prediction that this region may be a chromosome band boundary. We supposed that replication fork movement terminates (pauses) or significantly slows in the switch region, which contains dense Alu clusters; polypurine/polypyrimidine tracts; di-, tri-, or tetranucleotide repeats; and medium-reiteration-frequency sequences. Because the nascent DNA in the switch region was recovered at low efficiency, we investigated whether this region is associated with the nuclear scaffold and found three scaffold-associated regions in and around the switch region.
Collapse
Affiliation(s)
- T Tenzen
- Department of Evolutionary Genetics, National Institute of Genetics, and The Graduate University for Advanced Studies, Shizuoka-ken, Japan.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Platzer M, Rotman G, Bauer D, Uziel T, Savitsky K, Bar-Shira A, Gilad S, Shiloh Y, Rosenthal A. Ataxia-telangiectasia locus: sequence analysis of 184 kb of human genomic DNA containing the entire ATM gene. Genome Res 1997; 7:592-605. [PMID: 9199932 DOI: 10.1101/gr.7.6.592] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Ataxia-telangiectasia (A-T) is an autosomal recessive disorder involving cerebellar degeneration, immunodeficiency, chromosomal instability, radiosensitivity, and cancer predisposition. The genomic organization of the A-T gene, designated ATM, was established recently. To date, more than 100 A-T-associated mutations have been reported in the ATM gene that do not support the existence of one or several mutational hotspots. To allow genotype/phenotype correlations it will be important to find additional ATM mutations. The nature and location of the mutations will also provide insights into the molecular processes that underly the disease. To facilitate the search for ATM mutations and to establish the basis for the identification of transcriptional regulatory elements, we have sequenced and report here 184,490 bp of genomic sequence from the human 11q22-23 chromosomal region containing the entire ATM gene, spanning 146 kb, and 10 kb of the 5'-region of an adjacent gene named E14/NPAT. The latter shares a bidirectional promoter with ATM and is transcribed in the opposite direction. The entire region is transcribed to approximately 85% and translated to 5%. Genome-wide repeats were found to constitute 37.2%, with LINE (17.1%) and Alu (14.6%) being the main repetitive elements. The high representation of LINE repeats is attributable to the presence of three full-length LINE-1s, inserted in the same orientation in introns 18 and 63 as well as downstream of the ATM gene. Homology searches suggest that ATM exon 2 could have derived from a mammalian interspersed repeat (MIR). Promoter recognition algorithms identified divergent promoter elements within the CpG island, which lies between the ATM and E14/NPAT genes, and provide evidence for a putative second ATM promoter located within intron 3, immediately upstream of the first coding exon. The low G+C level (38.1%) of the ATM locus is reflected in a strongly biased codon and amino acid usage of the gene.
Collapse
Affiliation(s)
- M Platzer
- Department of Genome Analysis, Institute of Molecular Biotechnology, Jena, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Sugaya K, Sasanuma S, Nohata J, Kimura T, Fukagawa T, Nakamura Y, Ando A, Inoko H, Ikemura T, Mita K. Gene organization of human NOTCH4 and (CTG)n polymorphism in this human counterpart gene of mouse proto-oncogene Int3. Gene 1997; 189:235-44. [PMID: 9168133 DOI: 10.1016/s0378-1119(96)00857-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The cDNA and genomic clones for the human counterpart of the mouse mammary tumor gene Int3 were isolated and sequenced. We designated this human major histocompatibility complex (MHC) class III gene as NOTCH4, since very recently, by sequencing cDNA clones, the complete form of the mouse proto-oncogene Int3 has been clarified and named Notch4. The present human NOTCH4 sequence is the first example of the genomic sequence for the extracellular portion of the mammalian Notch4, and by comparing it with the mouse Notch4 cDNA sequence, the exon/intron organization was clarified. The comparison of the predicted amino acid sequence of human NOTCH4 with those of other Notch homologues of a wide range of species revealed four subfamilies for mammalian Notch. In the protein coding region of human NOTCH4, we found (CTG)n repeats showing a variable number tandem repeat (VNTR) polymorphism for different human leukocyte antigen (HLA) haplotypes. Ten genes mapped on 6p21.3, including NOTCH4, were found to have counterparts structurally and functionally similar to those mostly mapped on 9q33-q34, indicating segmental chromosome duplication during the course of evolution. Similarity of genes on chromosomes 1, 6, 9 and 19 was also discussed.
Collapse
Affiliation(s)
- K Sugaya
- Genome Research Group, National Institute of Radiological Sciences, Anagawa, Chiba-ken, Japan.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
De Sario A, Geigl EM, Palmieri G, D'Urso M, Bernardi G. A compositional map of human chromosome band Xq28. Proc Natl Acad Sci U S A 1996; 93:1298-302. [PMID: 8577758 PMCID: PMC40074 DOI: 10.1073/pnas.93.3.1298] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The molar fractions of guanine plus cytosine (GC) in DNA were determined for 36 yeast artificial chromosomes (YACs) which almost completely cover human chromosome band Xq28, a terminal reverse band, corresponding to about 8 Mb of DNA. This allowed the construction of the most complete compositional map to date of a chromosomal band; three regions were observed: (i) a proximal 3.5-Mb region formed by GC-poor L and GC-rich H1 isochores; (ii) a middle 2,2-Mb region essentially formed by a GC-rich H2 isochore and a very GC-rich H3 isochore separated by a GC-poor L isochore, YACs from this region being characterized by a striking compositional heterogeneity and instability; and (iii) a distal 1.3-Mb region exclusively formed by GC-poor L isochores. Gene and CpG island concentrations increased with the GC levels of the isochores, as expected. Xq28 exemplifies a subset of reverse bands which are different from the two other subsets, namely from telomeric bands, which are characterized by specific cytogenetic properties and by the predominance of H2 and H3 isochores, and from the majority of reverse bands, which do not contain H2 and H3 isochores.
Collapse
Affiliation(s)
- A De Sario
- Laboratoire de Génétique Moléculaire, Institut Jacques Monod, Paris, France
| | | | | | | | | |
Collapse
|
21
|
Fukagawa T, Nakamura Y, Okumura K, Nogami M, Ando A, Inoko H, Saitou N, Ikemura T. Human pseudoautosomal boundary-like sequences: expression and involvement in evolutionary formation of the present-day pseudoautosomal boundary of human sex chromosomes. Hum Mol Genet 1996; 5:23-32. [PMID: 8789436 DOI: 10.1093/hmg/5.1.23] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
The human genome is composed of long-range mosaic structures of G+C% (GC%), which are thought to be related to chromosome bands. We previously identified a boundary of Mb-level domains of GC% mosaic structures in the human major histocompatibility complex (MHC) and found in the domain boundary a sequence very similar to pseudoautosomal boundary (PAB) sequences of human sex chromosomes. We designated it 'PABL' and found many PABLs in the human genome. By analysis of six genomic and six transcribed PABLs, a core and consensus sequence of about 650 nt was defined; the 3'- and 5'-edges of the PABLs were strictly conserved. Northern blot analysis showed sizes of PABL transcripts to be 5-10 kb in length. Divergence time of PABLs was estimated to be 60-120 million years ago by analysis of human PABLs and PABXY1 of seven primates, and the evolutionary rates deduced showed PABLs to have been under selective constraints. A model for evolutionary formation of the present pseudoautosomal boundary was proposed by postulation of illegitimate recombination between two PABLs.
Collapse
Affiliation(s)
- T Fukagawa
- Department of Evolutionary Genetics, National Institute of Genetics, Mishima, Japan
| | | | | | | | | | | | | | | |
Collapse
|
22
|
Sharp PM, Averof M, Lloyd AT, Matassi G, Peden JF. DNA sequence evolution: the sounds of silence. Philos Trans R Soc Lond B Biol Sci 1995; 349:241-7. [PMID: 8577834 DOI: 10.1098/rstb.1995.0108] [Citation(s) in RCA: 167] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Silent sites (positions that can undergo synonymous substitutions) in protein-coding genes can illuminate two evolutionary processes. First, despite being silent, they may be subject to natural selection. Among eukaryotes this is exemplified by yeast, where synonymous codon usage patterns are shaped by selection for particular codons that are more efficiently and/or accurately translated by the most abundant tRNAs; codon usage across the genome, and the abundance of different tRNA species, are highly co-adapted. Second, in the absence of selection, silent sites reveal underlying mutational patterns. Codon usage varies enormously among human genes, and yet silent sites do not appear to be influenced by natural selection, suggesting that mutation patterns vary among regions of the genome. At first, the yeast and human genomes were thought to reflect a dichotomy between unicellular and multicellular organisms. However, it now appears that natural selection shapes codon usage in some multicellular species (e.g. Drosophila and Caenorhabditis), and that regional variations in mutation biases occur in yeast. Silent sites (in serine codons) also provide evidence for mutational events changing adjacent nucleotides simultaneously.
Collapse
Affiliation(s)
- P M Sharp
- Department of Genetics, University of Nottingham, Queens Medical Centre, U.K
| | | | | | | | | |
Collapse
|
23
|
Fukagawa T, Sugaya K, Matsumoto K, Okumura K, Ando A, Inoko H, Ikemura T. A boundary of long-range G + C% mosaic domains in the human MHC locus: pseudoautosomal boundary-like sequence exists near the boundary. Genomics 1995; 25:184-91. [PMID: 7774916 DOI: 10.1016/0888-7543(95)80124-5] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The human genome is composed of long-range G+C% (GC%) mosaic structures related to chromosome bands. We found the human MHC locus to be an example of megabase-level GC% mosaic structures and predicted a possible boundary of the megabase-level domains within an undercharacterized 450-kb region harboring the junction of MHC classes II and III. Chromosome walking of the 450-kb region and base-compositional analysis precisely located the boundary of the mosaic domains, disclosing a sharp GC% transition. Near the transition point there was a 20-kb dense Alu cluster, a 30-kb dense LINE-1 cluster, and a sequence highly homologous with the pseudoautosomal boundary of the short arms of human sex chromosomes (PAB1X and PAB1Y); PAB1X and PAB1Y are the interface between sex-specific and pseudoautosomal regions. Many PAB1XY-like sequences (PABLs) were detected by hybridization against genomic DNA, and the new sequences defined the complete form of PABLs to be about 650 nt.
Collapse
Affiliation(s)
- T Fukagawa
- Department of Evolutionary Genetics, National Institute of Genetics, Shizuoka-ken, Japan
| | | | | | | | | | | | | |
Collapse
|
24
|
Affiliation(s)
- J Trowsdale
- Human Immunogenetics Laboratory, Imperial Cancer Research Fund, London, UK
| |
Collapse
|
25
|
Stenico M, Lloyd AT, Sharp PM. Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases. Nucleic Acids Res 1994; 22:2437-46. [PMID: 8041603 PMCID: PMC308193 DOI: 10.1093/nar/22.13.2437] [Citation(s) in RCA: 211] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Synonymous codon usage varies considerably among Caenorhabditis elegans genes. Multivariate statistical analyses reveal a single major trend among genes. At one end of the trend lie genes with relatively unbiased codon usage. These genes appear to be lowly expressed, and their patterns of codon usage are consistent with mutational biases influenced by the neighbouring nucleotide. At the other extreme lie genes with extremely biased codon usage. These genes appear to be highly expressed, and their codon usage seems to have been shaped by selection favouring a limited number of translationally optimal codons. Thus, the frequency of these optimal codons in a gene appears to be correlated with the level of gene expression, and may be a useful indicator in the case of genes (or open reading frames) whose expression levels (or even function) are unknown. A second, relatively minor trend among genes is correlated with the frequency of G at synonymously variable sites. It is not yet clear whether this trend reflects variation in base composition (or mutational biases) among regions of the C.elegans genome, or some other factor. Sequence divergence between C.elegans and C.briggsae has also been studied.
Collapse
Affiliation(s)
- M Stenico
- Department of Genetics, Trinity College, Dublin, Ireland
| | | | | |
Collapse
|
26
|
Berkhout B, van Hemert FJ. The unusual nucleotide content of the HIV RNA genome results in a biased amino acid composition of HIV proteins. Nucleic Acids Res 1994; 22:1705-11. [PMID: 8202375 PMCID: PMC308053 DOI: 10.1093/nar/22.9.1705] [Citation(s) in RCA: 71] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Extremely high frequencies of the A nucleotide are found in the RNA genomes of the lentivirus group of retroviruses. It is presently unknown what molecular force is responsible for this A-pressure. In this manuscript, we demonstrate a correlation between this 'A-pressure' and the amino acid-usage of the lentivirus family. We compared the amino acid composition of the Gag and Pol proteins of the human immunodeficiency viruses type 1 and 2 (HIV-1 and HIV-2) with that of the second group of human retroviruses; the human T-cell leukemia viruses type I and II (HTLV-I and HTLV-II). Differences in total amino acid content correlate with the preference for A-rich codons in the HIV genome. A pair-wise comparison of homologous amino acid positions in the Pol proteins indicates that both conservative and non-conservative changes can be accounted for by this A-bias. The putative molecular mechanism underlying this A-pressure and the evolutionary consequences are discussed.
Collapse
Affiliation(s)
- B Berkhout
- Department of Virology, University of Amsterdam, The Netherlands
| | | |
Collapse
|
27
|
Ochman H, Groisman EA. The origin and evolution of species differences in Escherichia coli and Salmonella typhimurium. EXS 1994; 69:479-93. [PMID: 7994120 DOI: 10.1007/978-3-0348-7527-1_27] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Since diverging from a common ancestor some 120 million years, Escherichia coli and Salmonella typhimurium have accumulated numerous phenotypic characteristics which have traditionally been used to distinguish these enteric species. While most of the genetic differences between these species are due to the accumulation of point mutations, the majority of the observed variation in phenotypic characters is attributable to segments of the genome confined to only one of the species. We have analyzed the map positions, G+C contents, nucleotide sequences and functions of regions unique to the Salmonella chromosome in an attempt to determine the ancestry of species-specific sequences. Some of the Salmonella-specific regions had uncharacteristically low base compositions and contained open reading frames of atypical codon usage patterns suggesting that portions of the genome were acquired by horizontal transfer from distantly-related bacterial species. The role of these species-specific sequences was assayed by constructing mutant strains harboring deletions in the corresponding regions of the genome. Several functions were ascribed to these unique portions of the Salmonella chromosome, including one encoding proteins involved in virulence and invasion of host epithelial cells.
Collapse
Affiliation(s)
- H Ochman
- Department of Biology, University of Rochester, NY 14627
| | | |
Collapse
|
28
|
Abstract
This review will first present some properties (including compositional pattern, correlations between isochores and chromosomal bands, and gene distribution) of the human genome, the most extensively studied among vertebrate genomes. It will then explain how these properties came about during the evolution of the vertebrates.
Collapse
Affiliation(s)
- G Bernardi
- Laboratoire de Génétique Moléculaire, Institut Jacques Monod, Paris, France
| |
Collapse
|
29
|
Engelkamp D, Schäfer BW, Mattei MG, Erne P, Heizmann CW. Six S100 genes are clustered on human chromosome 1q21: identification of two genes coding for the two previously unreported calcium-binding proteins S100D and S100E. Proc Natl Acad Sci U S A 1993; 90:6547-51. [PMID: 8341667 PMCID: PMC46969 DOI: 10.1073/pnas.90.14.6547] [Citation(s) in RCA: 139] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
The human genome contains large regions that are highly structured. Sequence-related members of multigene families are often found in a clustered organization. Here we describe a previously unrecognized gene cluster composed of genes coding for calcium-binding proteins of the S100 family. The linkage of six S100 genes was established by pulsed-field gel electrophoresis, and a contiguous DNA sequence of 15 kilobases containing the full coding region of four different S100 genes was characterized. This is the tightest mammalian gene cluster discovered so far to our knowledge. Two additional S100 genes are located within the cluster, both of which exhibit unique structural features when compared with other S100 genes. The product of S100E is cysteine-rich, whereas that of S100D contains a long hydrophobic N-terminal tail. The gene cluster was assigned to chromosome 1q21, one of the bands showing rearrangements in neoplasms at high frequency. The deregulated expression of some S100 genes in the cluster during tumor progression suggests that chromosomal abnormalities may influence the expression of S100 genes in late stages of cancer, particularly in association with the formation of metastases.
Collapse
Affiliation(s)
- D Engelkamp
- Abteilung für Klinische Chemie, Kinderspital, Universität Zürich, Switzerland
| | | | | | | | | |
Collapse
|
30
|
Abstract
The mammalian chromosome is longitudinally heterogeneous in structure and function and this is the basis for the specific banding patterns produced by various chromosome staining techniques. The two most frequently used techniques are G, or Giemsa banding and R, or reverse banding. Each type of stained band is characterised by variations in gene density, time of replication, base composition, density of repeat sequences, and chromatin packaging. It is increasingly apparent that R and G bands, which are complementary to each other, represent separate compartments of the euchromatic human genome, with R bands containing the vast majority of genes. R bands are also more GC-rich, contain a higher density of Alu repeats, and replicate earlier in S phase, than G bands. These properties may be interdependent and may have coevolved.
Collapse
Affiliation(s)
- J M Craig
- MRC Human Genetics Unit, Western General Hospital, Edinburgh
| | | |
Collapse
|
31
|
Collins DW, Jukes TH. Relationship between G + C in silent sites of codons and amino acid composition of human proteins. J Mol Evol 1993; 36:201-13. [PMID: 8483158 DOI: 10.1007/bf00160475] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
We have investigated the relationship between the G + C content of silent (synonymous) sites in codons and the amino acid composition of encoded proteins for approximately 1,600 human genes. There are positive correlations between silent site G + C and the proportions of codons for Arg, Pro, Ala, Trp, His, Gln, and Leu and negative ones for Tyr, Phe, Asn, Ile, Lys, Asp, Thr, and Glu. The median proteins coded by groups of genes that differ in silent-site G + C content also differ in amino acid composition, as do some proteins coded by homologous genes. The pattern of compositional change can be largely explained by directional mutation pressure, the genetic code, and differences in the frequencies of accepted amino acid substitutions; the shifts in protein composition are likely to be selectively neutral.
Collapse
Affiliation(s)
- D W Collins
- Space Sciences Laboratory, University of California, Berkeley
| | | |
Collapse
|
32
|
Abstract
We model the base compositional structure of the human and Escherichia coli genomes. Three particular properties are first quantified: (1) There is a significant tendency for any region of either genome to have a strand-symmetric base composition. (2) The variation in base composition from region to region, within each genome, is very much larger than expected from common homogeneous stochastic models. (3) A given local base composition tends to persist over a scale of at least kilobases (E. coli) or tens of kilobases (human). Multidomain stochastic models from the literature are reviewed and sharpened. In particular, quantitative measurements of the third property lead us to suggest a significant shift in the style of domain models, in which the variation of A+T content with position is modeled by a random walk with frequent small steps rather than with large quantum jumps. As an application, we suggest a way to reduce the amount of computation in the assembly of large sequences from sequences of randomly chosen fragments.
Collapse
Affiliation(s)
- J W Fickett
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, New Mexico 87545
| | | | | |
Collapse
|
33
|
Abstract
Statistical approaches help in the determination of significant configurations in protein and nucleic acid sequence data. Three recent statistical methods are discussed: (i) score-based sequence analysis that provides a means for characterizing anomalies in local sequence text and for evaluating sequence comparisons; (ii) quantile distributions of amino acid usage that reveal general compositional biases in proteins and evolutionary relations; and (iii) r-scan statistics that can be applied to the analysis of spacings of sequence markers.
Collapse
Affiliation(s)
- S Karlin
- Department of Mathematics, Stanford University, CA 94305
| | | |
Collapse
|
34
|
Matsumoto K, Arai M, Ishihara N, Ando A, Inoko H, Ikemura T. Cluster of fibronectin type III repeats found in the human major histocompatibility complex class III region shows the highest homology with the repeats in an extracellular matrix protein, tenascin. Genomics 1992; 12:485-91. [PMID: 1373119 DOI: 10.1016/0888-7543(92)90438-x] [Citation(s) in RCA: 68] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Walking and sequencing a genome portion centromeric of CYP21B in the human MHC class III region disclosed a cluster of fibronectin type III repeats in an approximately 50-kb DNA segment. Fibronectin type III repeats are known to consist of ca. 90 amino acid residues and exist in a wide range of protein species. Homology searches in protein databases showed that the repeats found had the highest homology with the repeats of human tenascin, an extracellular matrix protein. One cDNA sequence located immediately centromeric of CYP21B, the 3' portion of which is transcribed by the opposite strand of CYP21B, was found also to have six type III repeats followed by a fibrinogen domain. Pairwise homology comparison of these repeats in the MHC locus with those of human tenascin showed a general parallelism in their gene organization, indicating that the newly found repeats are elements of certain tenascin-like genea.
Collapse
Affiliation(s)
- K Matsumoto
- DNA Research Center, National Institute of Genetics, Shizuoka-ken, Japan
| | | | | | | | | | | |
Collapse
|
35
|
|
36
|
Krane DE, Hartl DL, Ochman H. Rapid determination of nucleotide content and its application to the study of genome structure. Nucleic Acids Res 1991; 19:5181-5. [PMID: 1833723 PMCID: PMC328873 DOI: 10.1093/nar/19.19.5181] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
We have developed a sensitive, reliable and accurate procedure for estimating the base composition of small samples of DNAs. This method has been applied to the analysis of genomic DNAs from several sources including large regions of human DNA cloned as yeast artificial chromosomes. To determine whether the human genome is compartmentalized into large segments of homogeneous base composition, we examined the GC content of a 1.2 megabase contig spanning the cystic fibrosis gene.
Collapse
Affiliation(s)
- D E Krane
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110
| | | | | |
Collapse
|
37
|
Ikemura T, Wada K. Evident diversity of codon usage patterns of human genes with respect to chromosome banding patterns and chromosome numbers; relation between nucleotide sequence data and cytogenetic data. Nucleic Acids Res 1991; 19:4333-9. [PMID: 1886761 PMCID: PMC328617 DOI: 10.1093/nar/19.16.4333] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The sequences of the human genome compiled in DNA databases are now about 10 megabase pairs (Mb), and thus the size of the sequences is several times the average size of chromosome bands at high resolution. By surveying this large quantity of data, it may be possible to clarify the global characteristics of the human genome, that is, correlation of gene sequence data (kb-level) to cytogenetic data (Mb-level). By extensively searching the GenBank database, we calculated codon usages in about 2000 human sequences. The highest G + C percentage at the third codon position was 97%, and that of about 250 sequences was 80% or more. The lowest G + C% was 27%, and that in about 150 sequences was 40% or less. A major portion of the GC-rich genes was found to be on special subsets of R-bands (T-bands and/or terminal R-bands). AT-rich genes, however, were mainly on G-bands or non-T-type internal R-bands. Average G + C% at the third position for individual chromosomes differed among chromosomes, and were related to T-band density, quinacrine dullness, and mitotic chiasmata density in the respective chromosomes.
Collapse
Affiliation(s)
- T Ikemura
- DNA Research Center, National Institute of Genetics, Shizuoka-ken, Japan
| | | |
Collapse
|
38
|
Ragoussis J, Monaco A, Mockridge I, Kendall E, Campbell RD, Trowsdale J. Cloning of the HLA class II region in yeast artificial chromosomes. Proc Natl Acad Sci U S A 1991; 88:3753-7. [PMID: 1673791 PMCID: PMC51531 DOI: 10.1073/pnas.88.9.3753] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Yeast artificial chromosomes (YACs) have been applied to clone the entire class II region of the human major histocompatibility complex (MHC), including its flanking regions, in a contig over 1.5 million base pairs (bp) long. The human DNA inserts in the YACs have a size between 60 and 1300 kbp and were isolated from two EcoRI partial digest libraries. The gaps between DRA and DRB, DRB and DQA, and DOB and DPA, which had not been cloned by other means, have been bridged with YAC clones. The contig extends through the 400 kpb of DNA between the DRA and C4 genes, thus linking the class II region with the complement gene cluster in the class III region. The cloning in YACs has been supported by a conventional cosmid walk of 290 kbp in the C4-DRA region. Restriction enzyme sites in the YAC clones were compared to the sites in the cosmid walk, to published cosmid clones, and to the already existing physical maps, leading to a detailed characterization of a region of the human genome over 1500 kbp. The YAC clones will be valuable for functional analysis of the MHC.
Collapse
Affiliation(s)
- J Ragoussis
- Imperial Cancer Research Fund, Holborn, London, United Kingdom
| | | | | | | | | | | |
Collapse
|