1
|
Classification des populations humaines dans les essais cliniques randomisés : une revue systématique [Classification of human populations in randomized clinical trials: a systematic review]. Rev Med Interne 2020. [DOI: 10.1016/j.revmed.2020.10.126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/22/2022]
|
2
|
The global impact of Wolbachia on mitochondrial diversity and evolution. J Evol Biol 2017; 30:2204-2210. [DOI: 10.1111/jeb.13186] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Received: 07/12/2017] [Revised: 09/01/2017] [Accepted: 09/25/2017] [Indexed: 02/06/2023]
|
3
|
Comparative population genomics in animals uncovers the determinants of genetic diversity. Nature 2014; 515:261-3. [DOI: 10.1038/nature13685] [Citation(s) in RCA: 405] [Impact Index Per Article: 40.5] [Received: 03/03/2014] [Accepted: 07/17/2014] [Indexed: 02/07/2023]
|
4
|
Abstract
In an attempt to understand the origin of CpG islands (CGIs) in mammalian genomes, we have studied their location and structure according to the expression pattern of genes and to the G + C content of isochores in which they are embedded. We show that CGIs located over the transcription start site (named start CGIs) are very different structurally from the others (named no-start CGIs): (1) 61.6% of the no-start CGIs are due to repeated sequences (79% are due to Alus), whereas only 5.6% of the start CGIs are due to such repeats; (2) start CGIs are longer and display a higher CpG o/e (observed/expected) ratio and G + C level than no-start CGIs. The frequency of tissue-specific genes associated with a start CGI varies according to the genomic G + C content, from 25% in G + C-poor isochores to 64% in G + C-rich isochores. Conversely, the frequency of housekeeping genes associated with a start CGI (90%) is independent of the isochore context. Interestingly, the structure of start CGIs is very similar for tissue-specific and housekeeping genes. Moreover, 93% of genes expressed in early embryo are found to exhibit a CpG island over their transcription start point. These observations are consistent with the hypothesis that the occurrence of these CGIs is the consequence of gene expression at this stage, when the methylation pattern is installed.
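The two structural measures named in this abstract, G + C level and the CpG observed/expected ratio, can be illustrated with a minimal sketch. It assumes the commonly used definition o/e = N(CpG) × L / (N(C) × N(G)); the abstract does not state which formula or thresholds the authors applied, and the function names are hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def cpg_obs_exp(seq: str) -> float:
    """CpG observed/expected ratio: N(CpG) * L / (N(C) * N(G)).

    Assumed definition; returns 0.0 when the sequence has no C or no G.
    """
    seq = seq.upper()
    n_c = seq.count("C")
    n_g = seq.count("G")
    if n_c == 0 or n_g == 0:
        return 0.0
    # "CG" cannot overlap with itself, so str.count finds every occurrence.
    n_cpg = seq.count("CG")
    return n_cpg * len(seq) / (n_c * n_g)
```

In practice such statistics are usually computed in sliding windows along a genomic sequence; window size and cut-offs are left to the reader because the abstract does not give them.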
|
5
|
|
6
|
A Medicago truncatula homoglutathione synthetase is derived from glutathione synthetase by gene duplication. Plant Physiol 2001; 126:1706-15. [PMID: 11500568 PMCID: PMC117169 DOI: 10.1104/pp.126.4.1706] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.4] [Received: 01/22/2001] [Revised: 04/03/2001] [Accepted: 05/04/2001] [Indexed: 05/19/2023]
Abstract
Glutathione (GSH) and homo-GSH (hGSH) are the major low-molecular weight thiols synthesized in Medicago truncatula. Two M. truncatula cDNAs (gshs1 and gshs2) corresponding to a putative GSH synthetase (GSHS) and a putative hGSH synthetase (hGSHS) were characterized. Heterologous expression of gshs1 and gshs2 cDNAs in an Escherichia coli strain deficient in GSHS activity showed that GSHS1 and GSHS2 are a GSHS and an hGSHS, respectively. Leucine-534 and proline-535 present in hGSHS were substituted by alanines that are conserved in plant GSHS. These substitutions resulted in a strongly stimulated GSH accumulation in the transformed E. coli strain showing that these residues play a crucial role in the differential recognition of beta-alanine and glycine by hGSHS. Phylogenetic analysis of GSHS2 and GSHS1 with other eukaryotic GSHS sequences indicated that gshs2 and gshs1 are the result of a gene duplication that occurred after the divergence between Fabales, Solanales, and Brassicales. Analysis of the structure of gshs1 and gshs2 genes shows they are both present in a cluster and in the same orientation in the M. truncatula genome, suggesting that the duplication of gshs1 and gshs2 occurred via a tandem duplication.
|
7
|
Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes. Proc Natl Acad Sci U S A 2001; 98:5688-92. [PMID: 11320215 PMCID: PMC33274 DOI: 10.1073/pnas.091427698] [Citation(s) in RCA: 186] [Impact Index Per Article: 8.1] [Indexed: 11/18/2022]
Abstract
Understanding the factors responsible for variations in mutation patterns and selection efficacy along chromosomes is a prerequisite for deciphering genome sequences. Population genetics models predict a positive correlation between the efficacy of selection at a given locus and the local rate of recombination because of Hill-Robertson effects. Codon usage is considered one of the most striking examples that support this prediction at the molecular level. In a wide range of species including Caenorhabditis elegans and Drosophila melanogaster, codon usage is essentially shaped by selection acting for translational efficiency. Codon usage bias correlates positively with recombination rate in Drosophila, apparently supporting the hypothesis that selection on codon usage is improved by recombination. Here we present an exhaustive analysis of codon usage in C. elegans and D. melanogaster complete genomes. We show that in both genomes there is a positive correlation between recombination rate and the frequency of optimal codons. However, we demonstrate that in both species, this effect is due to a mutational bias toward G and C bases in regions of high recombination rate, possibly as a direct consequence of the recombination process. The correlation between codon usage bias and recombination rate in these species appears to be essentially determined by recombination-dependent mutational patterns, rather than selective effects. This result highlights that it is necessary to take into account the mutagenic effect of recombination to understand the evolutionary role and impact of recombination.
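The "frequency of optimal codons" used in this abstract can be sketched as follows. This is a simplified illustration, not the authors' procedure: the set of optimal codons is a hypothetical input supplied by the caller, and every codon other than ATG, TGG, and the stop codons is counted in the denominator.

```python
# Codons with no synonymous alternative (Met, Trp) and stop codons
# carry no information about codon choice and are skipped.
UNINFORMATIVE = {"ATG", "TGG", "TAA", "TAG", "TGA"}


def frequency_of_optimal_codons(cds: str, optimal_codons: set) -> float:
    """Share of informative codons in `cds` that belong to `optimal_codons`.

    `optimal_codons` is an assumption of this sketch: it must be defined
    per species (for example from highly expressed genes).
    """
    cds = cds.upper()
    usable_length = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable_length, 3)]
    informative = [
        codon for codon in codons
        if codon not in UNINFORMATIVE and set(codon) <= set("ACGT")
    ]
    if not informative:
        return 0.0
    n_optimal = sum(codon in optimal_codons for codon in informative)
    return n_optimal / len(informative)
```

For example, with a hypothetical optimal set `{"CTG"}`, the coding sequence `ATGCTGCTGCTATAA` has three informative codons (CTG, CTG, CTA), two of which are optimal.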
|
8
|
The elevated GC content at exonic third sites is not evidence against neutralist models of isochore evolution. Mol Biol Evol 2001; 18:757-62. [PMID: 11319260 DOI: 10.1093/oxfordjournals.molbev.a003858] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.6] [Indexed: 11/14/2022]
Abstract
The human genome is divided into isochores, large stretches (>>300 kb) of genomic DNA with more or less consistent GC content. Mutational/neutralist and selectionist models have been put forward to explain their existence. A major criticism of the mutational models is that they cannot account for the higher GC content at fourfold-redundant silent sites within exons (GC4) than in flanking introns (GCi). Indeed, it has been asserted that it is hard to envisage a mutational bias explanation, as it is difficult to see how repair enzymes might act differently in exons and their flanking introns. However, this rejection, we note, ignores the effects of transposable elements (TEs), which are a major component of introns and tend to cause them to have a GC content different from (usually lower than) that dictated by point mutational processes alone. As TEs tend not to insert at the extremities of introns, this model predicts that GC content at the extremities of introns should be more like that at GC4 than are the intronic interiors. This we show to be true. The model also correctly predicts that small introns should have a composition more like that at GC4 than large introns. We conclude that the logic of the previous rejection of neutralist models is unsafe.
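GC4, the GC content at fourfold-redundant silent sites, can be computed as in the sketch below. It assumes the standard genetic code, in which the eight codon families listed are fourfold degenerate at the third position; the function name and the choice to return `None` for sequences without such sites are illustrative assumptions.

```python
# Two-base prefixes whose third position is fourfold degenerate in the
# standard genetic code: Leu (CT), Val (GT), Ser (TC), Pro (CC),
# Thr (AC), Ala (GC), Arg (CG), Gly (GG).
FOURFOLD_PREFIXES = {"CT", "GT", "TC", "CC", "AC", "GC", "CG", "GG"}


def gc4(cds: str):
    """GC fraction at third positions of fourfold-degenerate codons.

    Expects an in-frame coding sequence; returns None if no
    fourfold-degenerate site is present.
    """
    cds = cds.upper()
    third_bases = [
        cds[i + 2]
        for i in range(0, len(cds) - 2, 3)
        if cds[i:i + 2] in FOURFOLD_PREFIXES and cds[i + 2] in "ACGT"
    ]
    if not third_bases:
        return None
    return sum(base in "GC" for base in third_bases) / len(third_bases)
```

The intronic counterpart (GCi) is simply the overall GC fraction of the intron sequence, so comparing the two only requires an additional plain base count.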
|
9
|
|
10
|
Abstract
In many unicellular organisms, invertebrates, and plants, synonymous codon usage biases result from a coadaptation between codon usage and tRNA abundance to optimize the efficiency of protein synthesis. However, it remains unclear whether natural selection acts at the level of the speed or the accuracy of mRNA translation. Here we show that codon usage can improve the fidelity of protein synthesis in multicellular species. As predicted by the model of selection for translational accuracy, we find that the frequency of codons optimal for translation is significantly higher at codons encoding conserved amino acids than at codons encoding nonconserved amino acids in 548 genes compared between Caenorhabditis elegans and Homo sapiens. Although this model predicts that codon bias correlates positively with gene length, a negative correlation between codon bias and gene length has been observed in eukaryotes. This suggests that selection for fidelity of protein synthesis is not the main factor responsible for codon biases. The relationship between codon bias and gene length remains unexplained. Exploring the differences in gene expression process in eukaryotes and prokaryotes should provide new insights to understand this key question of codon usage.
|
11
|
Transposons but not retrotransposons are located preferentially in regions of high recombination rate in Caenorhabditis elegans. Genetics 2000; 156:1661-9. [PMID: 11102365 PMCID: PMC1461346 DOI: 10.1093/genetics/156.4.1661] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.2] [Indexed: 11/12/2022]
Abstract
We analyzed the distribution of transposable elements (TEs: transposons, LTR retrotransposons, and non-LTR retrotransposons) in the chromosomes of the nematode Caenorhabditis elegans. The density of transposons (DNA-based elements) along the chromosomes was found to be positively correlated with recombination rate, but this relationship was not observed for LTR or non-LTR retrotransposons (RNA-based elements). Gene (coding region) density is higher in regions of low recombination rate. However, the lower TE density in these regions is not due to the counterselection of TE insertions within exons since the same positive correlation between TE density and recombination rate was found in noncoding regions (both in introns and intergenic DNA). These data are not compatible with a global model of selection acting against TE insertions, for which an accumulation of elements in regions of reduced recombination is expected. We also found no evidence for a stronger selection against TE insertions on the X chromosome compared to the autosomes. The difference in distribution of the DNA and RNA-based elements along the chromosomes in relation to recombination rate can be explained by differences in the transposition processes.
|
12
|
The covariation between TpA deficiency, CpG deficiency, and G+C content of human isochores is due to a mathematical artifact. Mol Biol Evol 2000; 17:1620-5. [PMID: 11070050 DOI: 10.1093/oxfordjournals.molbev.a026261] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.8] [Indexed: 11/13/2022]
Abstract
CpG and TpA dinucleotides are underrepresented in the human genome. The CpG deficiency is due to the high mutation rate from C to T in methylated CpG's. The TpA suppression was thought to reflect a counterselection against TpA's destabilizing effect in RNA. Unexpectedly, the TpA and CpG deficiencies vary according to the G+C contents of sequences. It has been proposed that the variation in CpG suppression was correlated with a particular chromatin organization in G+C-rich isochores. Here, we present an improved model of dinucleotide evolution accounting for the overlap between successive dinucleotides. We show that an increased mutation rate from CpG to TpG or CpA induces both an apparent TpA deficiency and a correlation between CpG and TpA deficiencies and G+C content. Moreover, this model shows that the ratio of observed over expected CpG frequency underestimates the real CpG deficiency in G+C-rich sequences. The predictions of our model fit well with observed frequencies in human genomic data. This study suggests that previously published selectionist interpretations of patterns of dinucleotide frequencies should be taken with caution. Moreover, we propose new criteria to identify unmethylated CpG islands taking into account this bias in the measure of CpG depletion.
|
13
|
Abstract
BTG3 belongs to a family of structurally related genes whose biochemical functions remain elusive. In order to investigate the mechanism underlying BTG3-mediated functions, we tried to identify BTG3 potential partners. The use of the yeast 'two-hybrid system', with BTG3 as bait, enabled us to isolate BANP (BTG3 Associated Nuclear Protein). Other commonly used protein-binding assays did not confirm this yeast interaction. However, BANP had never been described before, and this prompted us to further characterise this gene. In this paper, we present data on its molecular organization in mouse, then we speculate on the nature of this nuclear protein, and finally we localise BANP on the human chromosome 16q24 subregion; we discuss the fact that frequent loss of heterozygosity within this region has been observed in different tumours.
MESH Headings
- 3T3 Cells
- Amino Acid Sequence
- Animals
- Base Sequence
- Blotting, Northern
- Cell Cycle Proteins
- Cell Nucleus/metabolism
- Chromosome Mapping
- Chromosomes, Human, Pair 16/genetics
- Cloning, Molecular
- DNA/chemistry
- DNA/genetics
- DNA, Complementary/chemistry
- DNA, Complementary/genetics
- DNA-Binding Proteins
- Epitopes
- Gene Expression
- HeLa Cells
- Humans
- In Situ Hybridization, Fluorescence
- Luciferases/genetics
- Luciferases/metabolism
- Mice
- Microscopy, Fluorescence
- Molecular Sequence Data
- Nuclear Proteins/genetics
- Nuclear Proteins/isolation & purification
- Nuclear Proteins/metabolism
- Oligopeptides
- Peptides/genetics
- Promoter Regions, Genetic/genetics
- Protein Binding
- RNA/genetics
- RNA/metabolism
- Recombinant Fusion Proteins/genetics
- Recombinant Fusion Proteins/metabolism
- Saccharomyces cerevisiae/genetics
- Sequence Alignment
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Tissue Distribution
- Two-Hybrid System Techniques
|
14
|
tRNA gene number and codon usage in the C. elegans genome are co-adapted for optimal translation of highly expressed genes. Trends Genet 2000; 16:287-9. [PMID: 10858656 DOI: 10.1016/s0168-9525(00)02041-2] [Citation(s) in RCA: 276] [Impact Index Per Article: 11.5] [Indexed: 11/19/2022]
|
15
|
Chromosomal distribution and coding capacity of the human endogenous retrovirus HERV-W family. AIDS Res Hum Retroviruses 2000; 16:731-40. [PMID: 10826480 DOI: 10.1089/088922200308738] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.0] [Indexed: 11/13/2022]
Abstract
Some genomic elements of the multicopy HERV-W endogenous retroviral family have been previously identified in databases. One of them, located on chromosome 7, contains a single complete open reading frame (ORF) putatively encoding an envelope protein. We have experimentally investigated the genomic complexity and coding capacity of the HERV-W family. The human haploid genome contains at least 70, 100, and 30 HERV-W-related gag, pro, and env regions, respectively, widely and heterogeneously dispersed among chromosomes. Using in vitro transcription-translation procedures, three putative HERV-W gag, pro, and env ORFs were detected on chromosomes 3, 6, and 7, respectively, and their sequences analyzed. A 363 amino acid gag ORF containing matrix and carboxy-terminal truncated capsid domains encoded a putative 45-kDa protein. No gag-pro ORF was found, but a pro sequence containing a DTG active site was detected. Finally, the previously described 538 amino acid HERV-W env ORF, located on chromosome 7, was shown to be unique and encoded a putative 80-kDa glycosylated protein. Proteins of molecular mass identical to the one obtained by an in vitro transcription-translation procedure were detected in human placenta, using anti HERV-W Gag- and Env-specific antibodies. The absence of an HERV-W replication-competent provirus versus the existence of HERV-W-related Gag and Env proteins in healthy human placenta is discussed with respect to particle formation, physiology, and pathology.
MESH Headings
- Amino Acid Sequence
- Animals
- Blotting, Southern
- Chromosome Mapping
- Chromosomes, Human, Pair 3/genetics
- Chromosomes, Human, Pair 6/genetics
- Chromosomes, Human, Pair 7/genetics
- Endogenous Retroviruses/genetics
- Endogenous Retroviruses/metabolism
- Endopeptidases/genetics
- Gene Products, env/chemistry
- Gene Products, env/genetics
- Genes, Viral
- Genes, env/genetics
- Genes, gag/genetics
- Humans
- Molecular Sequence Data
- Placenta/metabolism
- Polymerase Chain Reaction
|
16
|
Abstract
The human genome is estimated to contain 23,000 to 33,000 retropseudogenes. To study the properties of genes giving rise to these retroelements, we compared the structure and expression of genes with or without known retropseudogenes. Four main features have emerged from the analysis of 181 genes associated with retropseudogenes: Reverse-transcribed genes are (1) widely expressed, (2) highly conserved, (3) short, and (4) GC-poor. The first two properties probably reflect the fact that genes giving rise to retropseudogenes have to be expressed in the germ-line. The two latter points suggest that reverse-transcription and transposition are more efficient for short GC-poor mRNAs. In addition, this analysis allowed us to reject previous hypotheses that widely expressed genes are GC rich. Rather, globally, genes with a wide tissue distribution are GC poor.
|
17
|
Abstract
We present here HOBACGEN, a database system devoted to comparative genomics in bacteria. HOBACGEN contains all available protein genes from bacteria, archaea, and yeast, taken from SWISS-PROT/TrEMBL and classified into families. It also includes multiple alignments and phylogenetic trees built from these families. The database is organized under a client/server architecture with a client written in Java, which may run on any platform. This client integrates a graphical interface allowing users to select families according to various criteria and notably to select homologs common to a given set of taxa. This interface also allows users to visualize multiple alignments and trees associated with families. In tree displays, protein gene names are colored according to the taxonomy of the corresponding organisms. Users may access all information associated with sequences and multiple alignments by clicking on genes. This graphic tool thus gives rapid and simple access to all data required to interpret homology relationships between genes and distinguish orthologs from paralogs. Instructions for installation of the client or the server are available at http://pbil.univ-lyon1.fr/databases/hobacgen.html.
|
18
|
Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. Mol Biol Evol 2000; 17:68-74. [PMID: 10666707 DOI: 10.1093/oxfordjournals.molbev.a026239] [Citation(s) in RCA: 390] [Impact Index Per Article: 16.3] [Indexed: 11/12/2022]
Abstract
To determine whether gene expression patterns affect mutation rates and/or selection intensity in mammalian genes, we studied the relationships between substitution rates and tissue distribution of gene expression. For this purpose, we analyzed 2,400 human/rodent and 834 mouse/rat orthologous genes, and we measured (using expressed sequence tag data) their expression patterns in 19 tissues from three development states. We show that substitution rates at nonsynonymous sites are strongly negatively correlated with tissue distribution breadth: almost threefold lower in ubiquitous than in tissue-specific genes. Nonsynonymous substitution rates also vary considerably according to the tissues: the average rate is twofold lower in brain-, muscle-, retina- and neuron-specific genes than in lymphocyte-, lung-, and liver-specific genes. Interestingly, 5' and 3' untranslated regions (UTRs) show exactly the same trend. These results demonstrate that the expression pattern is an essential factor in determining the selective pressure on functional sites in both coding and noncoding regions. Conversely, silent substitution rates do not vary with expression pattern, even in ubiquitously expressed genes. This latter result thus suggests that synonymous codon usage is not constrained by selection in mammals. Furthermore, this result also indicates that there is no reduction of mutation rates in genes expressed in the germ line, contrary to what had been hypothesized based on the fact that transcribed DNA is more efficiently repaired than nontranscribed DNA.
|
19
|
Human and nematode orthologs--lessons from the analysis of 1800 human genes and the proteome of Caenorhabditis elegans. Gene 1999; 238:163-70. [PMID: 10570994 DOI: 10.1016/s0378-1119(99)00298-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.4] [Indexed: 10/18/2022]
Abstract
Recently, we have defined and analyzed over 1800 orthologous human and rodent genes. Here we extend this work to compare human and Caenorhabditis elegans coding sequences. 1880 human proteins were compared with about 20000 predicted nematode proteins presumably comprising nearly the complete proteome of C. elegans. We found that 44% of human/rodent orthologs have convincing nematode counterparts. On average, the amino acid similarity and identity between aligned human and C. elegans orthologous gene products are 69.3% and 49.1% respectively, and the nucleotide identity is 49.8%. Detailed investigation of our results suggests that some nematode gene predictions are incorrect, leading to erroneous pairing with human genes (e.g. calcineurin and polymerase II elongation factor III). Furthermore, other proteins (i.e. homologs of human ribosomal proteins S20 and L41, thymosin) are missing entirely from the nematode proteome, suggesting that it may not be complete. These results underscore the fact that metazoan gene prediction is a very challenging task and that most computer-predicted nematode genes require supporting evidence of their existence from comparative genomics and/or laboratory investigation.
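Identity percentages such as those quoted in this abstract are computed over pairwise alignments. The sketch below shows one common convention, counting identical residues over aligned columns where neither sequence has a gap; the abstract does not say which denominator the authors used, so this choice and the function name are assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identical residues over gap-free columns of a pairwise alignment.

    Both inputs must already be aligned (same length, gaps as `gap`).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != gap and y != gap
    ]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)
```

Similarity, as opposed to identity, additionally requires a substitution matrix to decide which non-identical residue pairs count as similar, and is not covered here.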
|
20
|
Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. Proc Natl Acad Sci U S A 1999; 96:4482-7. [PMID: 10200288 PMCID: PMC16358 DOI: 10.1073/pnas.96.8.4482] [Citation(s) in RCA: 593] [Impact Index Per Article: 23.7] [Indexed: 11/18/2022]
Abstract
We measured the expression pattern and analyzed codon usage in 8,133, 1,550, and 2,917 genes, respectively, from Caenorhabditis elegans, Drosophila melanogaster, and Arabidopsis thaliana. In those three species, we observed a clear correlation between codon usage and gene expression levels and showed that this correlation is not due to a mutational bias. This provides direct evidence for selection on silent sites in those three distantly related multicellular eukaryotes. Surprisingly, there is a strong negative correlation between codon usage and protein length. This effect is not due to a smaller size of highly expressed proteins. Thus, for the same expression pattern, the selective pressure on codon usage appears to be lower in genes encoding long rather than short proteins. This puzzling observation is not predicted by any of the current models of selection on codon usage and thus raises the question of how translation efficiency affects fitness in multicellular organisms.
|
21
|
Regulation of dauer larva development in Caenorhabditis elegans by daf-18, a homologue of the tumour suppressor PTEN. Curr Biol 1999; 9:329-32. [PMID: 10209098 DOI: 10.1016/s0960-9822(99)80143-2] [Citation(s) in RCA: 96] [Impact Index Per Article: 3.8] [Indexed: 11/23/2022]
Abstract
The tumour suppressor gene PTEN (also called MMAC1 or TEP1) is somatically mutated in a variety of cancer types [1] [2] [3] [4]. In addition, germline mutation of PTEN is responsible for two dominantly inherited, related cancer syndromes called Cowden disease and Bannayan-Ruvalcaba-Riley syndrome [4]. PTEN encodes a dual-specificity phosphatase that inhibits cell spreading and migration partly by inhibiting integrin-mediated signalling [5] [6] [7]. Furthermore, PTEN regulates the levels of phosphatidylinositol 3,4,5-trisphosphate (PIP3) by specifically dephosphorylating position 3 on the inositol ring [8]. We report here that the dauer formation gene daf-18 is the Caenorhabditis elegans homologue of PTEN. DAF-18 is a component of the insulin-like signalling pathway controlling entry into diapause and adult longevity that is regulated by the DAF-2 receptor tyrosine kinase and the AGE-1 PI 3-kinase [9]. Others have shown that mutation of daf-18 suppresses the life extension and constitutive dauer formation associated with daf-2 or age-1 mutants. Similarly, we show that inactivation of daf-18 by RNA-mediated interference mimics this suppression, and that a wild-type daf-18 transgene rescues the dauer defect. These results indicate that PTEN/daf-18 antagonizes the DAF-2-AGE-1 pathway, perhaps by catalyzing dephosphorylation of the PIP3 generated by AGE-1. These data further support the notion that mutations of PTEN contribute to the development of human neoplasia through an aberrant activation of the PI 3-kinase signalling cascade.
|
22
|
Abstract
The multiple sclerosis-associated retrovirus (MSRV) isolated from plasma of MS patients was found to be phylogenetically and experimentally related to human endogenous retroviruses (HERVs). To characterize the MSRV-related HERV family and to test the hypothesis of a replication-competent HERV, we have investigated the expression of MSRV-related sequences in healthy tissues. The expression of MSRV-related transcripts restricted to the placenta led to the isolation of overlapping cDNA clones from a cDNA library. These cDNAs spanned a 7.6-kb region containing gag, pol, and env genes; RU5 and U3R flanking sequences; a polypurine tract; and a primer binding site (PBS). As this PBS showed similarity to avian retrovirus PBSs used by tRNATrp, this new HERV family was named HERV-W. Several genomic elements were identified, one of them containing a complete HERV-W unit, spanning all cDNA clones. Elements of this multicopy family were not replication competent, as gag and pol open reading frames (ORFs) were interrupted by frameshifts and stop codons. A complete ORF putatively coding for an envelope protein was found both on the HERV-W DNA prototype and within an RU5-env-U3R polyadenylated cDNA clone. Placental expression of 8-, 3.1-, and 1.3-kb transcripts was observed, and a putative splicing strategy was described. The apparently tissue-restricted HERV-W long terminal repeat expression is discussed with respect to physiological and pathological contexts.
|
23
|
Highly conserved RNA sequences that are sensors of environmental stress. Mol Cell Biol 1998; 18:7371-82. [PMID: 9819424 PMCID: PMC109319 DOI: 10.1128/mcb.18.12.7371] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.0] [Received: 06/08/1998] [Accepted: 08/19/1998] [Indexed: 01/29/2023]
Abstract
The putative function of highly conserved regions (HCRs) within 3' untranslated regions (3'UTRs) as regulatory RNA sequences was efficiently and quantitatively assessed by using modular retroviral vectors. This strategy led to the identification of HCRs that alter gene expression in response to oxidative or mitogenic stress. Databases were screened for UTR sequences of >100 nucleotides that had retained 70% identity over more than 300 million years of evolution. The effects of 10 such HCRs on a standard reporter mRNA or protein were studied. To this end, we developed a modular retroviral vector that can allow for a direct comparison of the effects of different HCRs on gene expression independent of their gene-intrinsic 5'UTR, promoter, protein coding region, or poly(A) sequence. Five of the HCRs tested decreased mRNA steady-state levels 2- to 10-fold relative to controls, presumably by altering mRNA stability. One HCR increased translation, and one decreased translation. Elevated mitogen levels caused four HCRs to increase protein levels twofold. One HCR increased protein levels fourfold in response to hypoxia. Although nonconserved UTR sequences may also have a role, these results provide evidence that sequences that are highly conserved during evolution are good candidates for RNA motifs with posttranscriptional regulatory functions in gene expression.
|
24
|
New insulin-like proteins with atypical disulfide bond pattern characterized in Caenorhabditis elegans by comparative sequence analysis and homology modeling. Genome Res 1998; 8:348-53. [PMID: 9548970 DOI: 10.1101/gr.8.4.348] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.3] [Indexed: 02/07/2023]
Abstract
We have identified three new families of insulin homologs in Caenorhabditis elegans. In two of these families, concerted mutations suggest that an additional disulfide bond links B and A domains, and that the A-domain internal disulfide bond is substituted by a hydrophobic interaction. Homology modeling remarkably confirms these predictions and shows that despite this atypical disulfide bond pattern and the absence of C-like peptide, all these proteins may adopt the same fold as insulin. Interestingly, whereas we identified 10 insulin-like peptides, only one insulin-like receptor (daf-2) has been found. We propose that these insulin-related peptides may correspond to different activators or inhibitors of the daf-2 insulin-regulating pathway.
|
25
|
[The morphogenesis of the temporomandibular joint considered in its phylogenic aspects in vestibular orientation]. Rev Stomatol Chir Maxillofac 1997; 98 Suppl 1:89-92. [PMID: 9471708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/06/2023]
Abstract
Comparative phylogenic movements of skull points can be obtained using vestibular orientation and skulls from various mammals after reducing the bimediocondyle distance to approximately 100 mm (as in man). With this method, the condyle is seen to move backwardly and rise in a very thin bony area which will be covered by the increasing volume of the encephalus. A "mandibulocerebral" index of less than 100 is a criteria of humanoid species. A part posterior to the fixed zone in primate which will become the temporal condyl in man is eroded by the mandibular condyle. The tympanum migrates upwardly and posteriorly in primates and turns forwardly in man, contributing to limit the glenoid cavity posteriorly. The anterior limit to mandibular propulsion in primates is the intercanine bite; this disappears in man as does the post-glenoid apophysis which is reduced to a tubercle with no limiting effect. In man, function of the temporomandibular joint depends on its more delicate construction than in primates. The major humanoidizing processes alone, i.e. posterior movement of the jaws and their reduced volume, bipedism, and amplification of the cerebral volume, are sufficient to explain this situation which results from the effect of phylogenic evolution.
|
26
|
Cloning and characterization of a gene encoding a novel immunodominant antigen of Trypanosoma cruzi. Mol Biochem Parasitol 1997; 87:193-204. [PMID: 9247930 DOI: 10.1016/s0166-6851(97)00068-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Indexed: 02/04/2023]
Abstract
A Trypanosoma cruzi genomic expression library was screened with a pool of sera obtained from chronic chagasic patients. The recombinant antigen (Tc40) isolated from this library reacted with a large number of serum samples of chronic chagasic patients, suggesting that the presence of anti-Tc40 antibodies may be specifically associated with Chagas' disease. The full-length sequence of the Tc40 gene was determined after isolation of genomic and cDNA clones. The Tc40 cDNA includes a large open reading frame (2745 bp long) that encodes a polypeptide of 100 kDa without any homology with previously described T. cruzi sequences. In contrast with other T. cruzi antigens whose immunodominant B-cell epitopes are composed of repetitive amino acid motifs, Tc40 does not show any amino acid repetition. Antibodies against the Tc40 recombinant protein reacted with three native polypeptides of 100, 41 and 38 kDa which are tightly associated with membranes or cytoskeleton and expressed in all developmental stages of the parasite life cycle. A 3.9-kb transcript, which is large enough to encode a 100 kDa polypeptide, was detected by Northern blot analysis. Tc40 genes were mapped on a chromosomal band of 1.1 Mbp and in a few copies per haploid genome in the G strain.
|
27
|
Abstract
Important progress has been made in the past two years in the identification of Pol II promoters. For most other regulatory elements, however, current biological knowledge is still insufficient to allow the development of prediction tools. The phylogenetic-footprinting strategy, which is based on the comparative analysis of homologous sequences, is a very efficient approach to identify new unknown regulatory elements. The recent organization of large-scale sequencing projects for some model vertebrate organisms will be extremely valuable for the prediction of regulatory elements in the human genome.
|
28
|
Evolutionary affinities of the order Perissodactyla and the phylogenetic status of the superordinal taxa Ungulata and Altungulata. Mol Phylogenet Evol 1997; 7:195-200. [PMID: 9126561 DOI: 10.1006/mpev.1996.0391] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Indexed: 02/04/2023]
Abstract
Contrary to morphological claims, molecular data indicate that the order Perissodactyla (e.g., horses, rhinoceroses, and tapirs) is neither part of the superordinal taxon Paenungulata (Sirenia, Proboscidea, and Hyracoidea) nor an immediate outgroup of the paenungulates. Rather, Perissodactyla is closer to Carnivora and Cetartiodactyla (Cetacea+Artiodactyla) than it is to the paenungulates. Therefore, two morphologically defined superordinal taxa, Altungulata (Proboscidea, Sirenia, Hyracoidea, and Perissodactyla) and Ungulata (Altungulata and Cetartiodactyla), are invalidated. Perissodactyla, Carnivora, and Cetartiodactyla are shown to constitute a rather tight trichotomy. However, a molecular analysis of 36 protein sequences with a total concatenated length of 7885 aligned amino acids indicates that Perissodactyla is closer to Cetartiodactyla than either taxon is to Carnivora. The relationships among Paenungulata, Primates, and the clade consisting of Perissodactyla, Carnivora, and Cetartiodactyla could not be resolved on the basis of the available data.
|
29
|
Cloning of the mouse BTG3 gene and definition of a new gene family (the BTG family) involved in the negative control of the cell cycle. Leukemia 1997; 11:370-5. [PMID: 9067576 DOI: 10.1038/sj.leu.2400599] [Citation(s) in RCA: 101] [Impact Index Per Article: 3.7] [Indexed: 02/03/2023]
Abstract
It is well known that loss of tumor suppressor genes, and more generally of antiproliferative genes, plays a key role in the development of most tumors. We report here the cloning of the mouse BTG3 gene and show that its human counterpart maps on chromosome 21. This evolutionarily conserved gene codes for a 30 kDa protein and is expressed in most adult murine and human tissues analyzed. However, we demonstrate that its expression is cell cycle dependent and peaks at the end of the G1 phase. This gene is homologous to the human BTG1, BTG2 and TOB genes, which were demonstrated to act as inhibitors of cell proliferation. Its description allowed us to better define this seven-gene family (the BTG gene family) at the structural level and to speculate about its physiological role in normal and tumoral cells. This family is mainly characterized by the presence of two conserved domains (BTG boxes A and B) of as yet undetermined function which are separated by a non-conserved 20-25 amino acid sequence.
|
30
|
LALNVIEW: a graphical viewer for pairwise sequence alignments. COMPUTER APPLICATIONS IN THE BIOSCIENCES : CABIOS 1996; 12:507-10. [PMID: 9021269 DOI: 10.1093/bioinformatics/12.6.507] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.0] [Indexed: 02/03/2023]
Abstract
LALNVIEW is a graphical program for visualising local alignments between two sequences (protein or nucleic acids). Sequences are represented by coloured rectangles to give an overall picture of their similarities. LALNVIEW can display sequence features (exon, intron, active site, domain, propeptide, etc.) along with the alignment. When using LALNVIEW through our Web servers, sequence features are automatically extracted from database annotations (SWISS-PROT, GenBank, EMBL or HOVERGEN) and displayed with the alignment. LALNVIEW is a useful tool for analysing pairwise sequence alignments and for making the link between sequence homology and what is known about the structure or function of sequences. LALNVIEW executables for UNIX, Macintosh and PC computers are freely available from our server (http://expasy.hcuge.ch/sprot/lalnview.html).
|
31
|
Abstract
Ever since they have been classified as ruminants in the Old Testament (Leviticus 11:6, Deuteronomy 14:7) and equated with hyraxes in the vulgate Latin translation, rabbits and their relatives (order Lagomorpha) have frequently experienced radical changes in taxonomic rank. By using 91 orthologous protein sequences, we have attempted to answer the classical question "What, if anything, is a rabbit?". Here we show that Lagomorpha is significantly more closely related to Primates and Scandentia (tree shrews) than it is to rodents. This newly determined phylogenetic position invalidates the superordinal taxon Glires (Lagomorpha + Rodentia), and indicates that the morphological 'synapomorphies' previously used to cluster rodents and lagomorphs into Glires, may actually represent symplesiomorphies or homoplasies that are of no phylogenetic value. This raises the possibility that the ancestral eutherian morphotype may have possessed many rodent-like morphological characters.
|
32
|
Abstract
We report the isolation and characterization of a chicken cDNA which putatively encodes an actin-like protein (chACTL). This 394-amino-acid (aa) polypeptide shares sequence homology (81, 70 and 67% identical aa, respectively) with three actin-related proteins (ARP) described for Drosophila melanogaster (ARP14D), Caenorhabditis elegans (ACTL) and Saccharomyces cerevisiae (ACT2). At least six chACTL transcripts were detected in different tissues during chick embryogenesis. Sequence analysis suggests that at least three groups of ARP have been evolutionarily conserved.
|
33
|
Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores. J Mol Evol 1995; 40:308-17. [PMID: 7723057 DOI: 10.1007/bf00163235] [Citation(s) in RCA: 186] [Impact Index Per Article: 6.4] [Indexed: 01/26/2023]
Abstract
We compared the exon/intron organization of vertebrate genes belonging to different isochore classes, as predicted by their GC content at third codon position. Two main features have emerged from the analysis of sequences published in GenBank: (1) genes coding for long proteins (i.e., ≥ 500 aa) are almost two times more frequent in GC-poor than in GC-rich isochores; (2) intervening sequences (= sum of introns) are on average three times longer in GC-poor than in GC-rich isochores. These patterns are observed among human, mouse, rat, cow, and even chicken genes and are therefore likely to be common to all warm-blooded vertebrates. Analysis of Xenopus sequences suggests that the same patterns exist in cold-blooded vertebrates. It could be argued that such results do not reflect reality because sequence databases are not representative of entire genomes. However, analysis of biases in GenBank revealed that the observed discrepancies between GC-rich and GC-poor isochores are not artifactual, and are probably largely underestimated. We investigated the distribution of microsatellites and interspersed repeats in introns of human and mouse genes from different isochores. This analysis confirmed previous studies showing that L1 repeats are almost absent from GC-rich isochores. Microsatellites and SINEs (Alu, B1, B2) are found at roughly equal frequencies in introns from all isochore classes. Globally, the presence of repeated sequences does not account for the increased intron length in GC-poor isochores. The relationships between gene structure and global genome organization and evolution are discussed.
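As a minimal illustration of the predictor named in this abstract (GC content at third codon positions, often abbreviated GC3), the sketch below computes that fraction for a single coding sequence. The function name `gc3` and the example sequence are hypothetical and are not taken from the paper; the authors' actual procedure and any thresholds used to assign isochore classes are not given here.

```python
def gc3(cds: str) -> float:
    """Return the fraction of G or C bases at third codon positions.

    Assumes `cds` is an in-frame coding sequence starting at codon
    position 1; any trailing partial codon is ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3      # drop an incomplete final codon
    thirds = seq[2:usable:3]              # every third base: positions 3, 6, 9, ...
    if not thirds:
        raise ValueError("sequence shorter than one codon")
    return sum(base in "GC" for base in thirds) / len(thirds)


# Hypothetical example: codons ATG GCC AAG TTT GGC have third bases G, C, G, T, C.
example = "ATGGCCAAGTTTGGC"
print(gc3(example))
```

A gene-level GC3 value of this kind could then be compared between genes to group them into GC-poor and GC-rich classes; where the class boundaries lie is left open here.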
|
34
|
Abstract
Comparison of homologous genes is a major step for many studies related to genome structure, function or evolution. Similarity search programs easily find genes homologous to a given sequence. However, only very tedious manual procedures allow the retrieval of all sets of homologous genes sequenced for a given set of species. Moreover, this search often generates errors due to the complexity of data to be managed simultaneously: phylogenetic trees, alignments, taxonomy, sequences and related information. HOVERGEN helps to solve these problems by integrating all this information. HOVERGEN corresponds to GenBank sequences from all vertebrate species, with some data corrected, clarified, or completed, notably to address the problem of redundancy. Coding sequences have been classified in gene families. Protein multiple alignments and phylogenetic trees have been calculated for each family. Sequences and related information have been structured in an ACNUC database which permits complex selections. A graphical interface has been developed to visualize and edit trees. Genes are displayed in color, according to their taxonomy. Users have direct access to all information attached to sequences and to multiple alignments simply by clicking on genes. This graphical tool thus gives rapid and simple access to all data necessary to interpret homology relationships between genes. HOVERGEN allows the user to easily select sets of homologous vertebrate genes, and thus is particularly useful for comparative sequence analysis or molecular evolution studies.
|
35
|
Rearrangement of CCND1 (BCL1/PRAD1) 3' untranslated region in mantle-cell lymphomas and t(11q13)-associated leukemias. Blood 1994; 83:3689-96. [PMID: 8204893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/29/2023] Open
Abstract
Rearrangement and overexpression of CCND1 (BCL1/PRAD1), a member of the cyclin G1 gene family, are consistent features of t(11q13)-bearing B-lymphoid tumors (particularly mantle-cell lymphoma [MCL]). Its deregulation is thought to perturb the G1-S transition of the cell cycle and thereby to contribute to tumor development. As suggested by previously published studies, rearrangement of the 3' untranslated region (3' UTR) of CCND1 may contribute to its activation in some lymphoid tumors. To define further the prevalence of such rearrangements, we report here the result of the molecular study of 34 MCL and six t(11q13)-associated leukemias using a set of probes specific to the different parts of the CCND1 transcript. We also sequenced the entire cDNA of the overexpressed CCND1 transcripts in a t(11q13)-associated leukemia. DNA from four of these 40 patients showed rearrangement of the 3' UTR of CCND1 coexisting with major translocation cluster (MTC) rearrangement. Southern blot and sequence analyses showed that, as a result of these rearrangements, the 3' AU-rich region containing sequences involved in mRNA stability and in translational control is eliminated. Moreover, the finding that the CCND1 mRNA half-life was greater than 3 hours (normal tissues, 0.5 hours) in three t(11q13)-associated cell lines stresses the importance of posttranscriptional derangement in the activation of CCND1. Finally, we did not observe any mutation in the coding frame of the CCND1 cDNA analyzed.
|
36
|
Sequence analysis reveals that the BTG1 anti-proliferative gene is conserved throughout evolution in its coding and 3' non-coding regions. Gene 1993; 129:303-6. [PMID: 8325512 DOI: 10.1016/0378-1119(93)90284-a] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Indexed: 01/29/2023]
Abstract
The human BTG1 gene (expressing an anti-proliferative function) is an evolutionarily conserved gene homologous to the murine PC3/TIS21 genes. Here, we report the cloning and sequencing of the murine BTG1 coding region and chicken BTG1 cDNA. The putative human and mouse BTG1 proteins are 100% identical; the chicken BTG1 cDNA contains an open reading frame of 170 amino acids with a 91% identity to its human and murine counterparts. The 3'-untranslated region of BTG1 is also highly conserved (82% homology between human and chicken), suggesting that it plays a key role in the regulation of BTG1 expression. These data confirm that BTG1 is phylogenetically highly conserved and that BTG1 and PC3/TIS21 may constitute the first members of a new family of functionally related genes.
|
37
|
Strong conservation of non-coding sequences during vertebrates evolution: potential involvement in post-transcriptional regulation of gene expression. Nucleic Acids Res 1993; 21:2315-22. [PMID: 8506129 PMCID: PMC309526 DOI: 10.1093/nar/21.10.2315] [Citation(s) in RCA: 135] [Impact Index Per Article: 4.4] [Indexed: 01/31/2023] Open
Abstract
Comparison of nucleotide sequences from different classes of vertebrates that diverged more than 300 million years ago revealed the existence of highly conserved regions (HCRs) with more than 70% similarity over 100 to 1450 nt in non-coding parts of genes. Such conservation is unexpected because it is much longer and stronger than what is necessary for specifying the binding of a regulatory protein. HCRs are relatively frequent, particularly in genes that are essential to cell life. In multigene families, conserved regions are specific to each isotype and are probably involved in the control of their specific pattern of expression. Studying the distribution of HCRs within genes showed that functional constraints are generally much stronger in 3'-non-coding regions than in promoters or introns. The 3'-HCRs are particularly A + T-rich and are always located in the transcribed untranslated regions of genes, which suggests that they are involved in post-transcriptional processes. However, current knowledge of mechanisms that regulate mRNA export, localisation, translation, or degradation is not sufficient to explain the strong functional constraints that we have characterised.
|
38
|
Abstract
The BTG1 gene locus has been shown to be involved in a t(8;12)(q24;q22) chromosomal translocation in a case of B-cell chronic lymphocytic leukemia. We report here the cloning and sequencing of the human BTG1 cDNA and establish the genomic organization of this gene. The full-length cDNA isolated from a lymphoblastoid cell line contains an open reading frame of 171 amino acids. BTG1 expression is maximal in the G0/G1 phases of the cell cycle and is down-regulated when cells progress throughout G1. Furthermore, transfection experiments of NIH3T3 cells indicate that BTG1 negatively regulates cell proliferation. The BTG1 open reading frame is 60% homologous to PC3, an immediate early gene induced by nerve growth factor in rat PC12 cells. Sequence and Northern blot analyses indicate that BTG1 and PC3 are not cognate genes. We then postulate that these two genes are the first members of a new family of antiproliferative genes.
MESH Headings
- 3T3 Cells
- Amino Acid Sequence
- Animals
- Base Sequence
- Cell Cycle
- Cell Division/genetics
- Chromosomes, Human, Pair 12
- Chromosomes, Human, Pair 8
- Cloning, Molecular
- DNA/genetics
- DNA/isolation & purification
- DNA Probes
- Gene Library
- Humans
- Leukemia, Lymphocytic, Chronic, B-Cell/genetics
- Lymphocytes/physiology
- Mice
- Molecular Sequence Data
- Multigene Family
- Neoplasm Proteins/genetics
- RNA/genetics
- RNA/isolation & purification
- Restriction Mapping
- Sequence Homology, Nucleic Acid
- Transfection
- Translocation, Genetic
|