1
|
Negative information for building phylogenies. RECENT PATENTS ON DNA & GENE SEQUENCES 2013; 7:128-136. [PMID: 22974263 DOI: 10.2174/1872215611307020007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2012] [Accepted: 09/07/2012] [Indexed: 06/01/2023]
Abstract
An absent word (also called a forbidden word or an unword in other contexts) in a sequence is a segment that does not appear in the given sequence. It is a minimal absent word if all its proper factors occur in the given sequence. In this article, we review the concept of minimal absent words, which includes the notion of shortest absent words but is much stronger. We present an efficient method for computing the minimal absent words of bounded length for DNA sequence using a Trie of bounded depth, representing bounded length factors. This method outputs the whole set of minimal absent words and furthermore our technique provides a linear-time algorithm with less memory usage than previous solutions. We also present an approach to distinguish sequences of different organisms using their minimal absent words. Our solution applies a length-weighted index to discriminate sequences and the results show that we can build phylogenetic tree based on the patent collected information.
Collapse
|
2
|
Abstract
Results of electrophoretic surveys have suggested that hemoglobin polymorphism may be maintained by balancing selection in natural populations of house mice, Mus musculus. Here we report a survey of nucleotide variation in the adult globin genes of house mice from South America. We surveyed nucleotide polymorphism in two closely linked alpha-globin paralogs and two closely linked beta-globin paralogs to test whether patterns of variation are consistent with a model of long-term balancing selection. Surprisingly high levels of nucleotide polymorphism at the two beta-globin paralogs were attributable to the segregation of two highly divergent haplotypes, Hbbs (which carries two identical beta-globin paralogs) and Hbbd (which carries two functionally divergent beta-globin paralogs). Interparalog gene conversion on the Hbbs haplotype has produced a highly unusual situation in which the two paralogs are more similar to one another than either one is to its allelic counterpart on the Hbbd haplotype. Levels of nucleotide polymorphism and linkage disequilibrium at the two beta-globin paralogs suggest a complex history of diversity-enhancing selection that may be responsible for long-term maintenance of alternative protein alleles. The alternative two-locus beta-globin haplotypes are associated with pronounced differences in intraerythrocyte glutathione and nitric oxide metabolism, suggesting a possible mechanism for selection on hemoglobin function.
Collapse
|
3
|
Abstract
Background Globins occur in all three kingdoms of life: they can be classified into single-domain globins and chimeric globins. The latter comprise the flavohemoglobins with a C-terminal FAD-binding domain and the gene-regulating globin coupled sensors, with variable C-terminal domains. The single-domain globins encompass sequences related to chimeric globins and «truncated» hemoglobins with a 2-over-2 instead of the canonical 3-over-3 α-helical fold. Results A census of globins in 26 archaeal, 245 bacterial and 49 eukaryote genomes was carried out. Only ~25% of archaea have globins, including globin coupled sensors, related single domain globins and 2-over-2 globins. From one to seven globins per genome were found in ~65% of the bacterial genomes: the presence and number of globins are positively correlated with genome size. Globins appear to be mostly absent in Bacteroidetes/Chlorobi, Chlamydia, Lactobacillales, Mollicutes, Rickettsiales, Pastorellales and Spirochaetes. Single domain globins occur in metazoans and flavohemoglobins are found in fungi, diplomonads and mycetozoans. Although red algae have single domain globins, including 2-over-2 globins, the green algae and ciliates have only 2-over-2 globins. Plants have symbiotic and nonsymbiotic single domain hemoglobins and 2-over-2 hemoglobins. Over 90% of eukaryotes have globins: the nematode Caenorhabditis has the most putative globins, ~33. No globins occur in the parasitic, unicellular eukaryotes such as Encephalitozoon, Entamoeba, Plasmodium and Trypanosoma. Conclusion Although Bacteria have all three types of globins, Archaeado not have flavohemoglobins and Eukaryotes lack globin coupled sensors. Since the hemoglobins in organisms other than animals are enzymes or sensors, it is likely that the evolution of an oxygen transport function accompanied the emergence of multicellular animals.
Collapse
|
4
|
The mammalian αD-globin gene lineage and a new model for the molecular evolution of α-globin gene clusters at the stem of the mammalian radiation. Mol Phylogenet Evol 2006; 38:439-48. [PMID: 16024261 DOI: 10.1016/j.ympev.2005.05.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2005] [Revised: 05/30/2005] [Accepted: 05/31/2005] [Indexed: 10/25/2022]
Abstract
We have explored the evolution of the alpha-globin gene family by comparative sequence and phylogenetic analyses of mammalian alpha-globin genes. Our analyses reveal the existence of a new alpha-globin gene lineage in mammals that is related to the alpha(D)-globin genes of birds, squamates and turtles. The gene is located in the middle of the alpha-globin gene cluster of a marsupial, Sminthopsis macroura and of humans. It exists in a wide variety of additional mammals, including pigs, cows, cats, and dogs, but is a pseudogene in American marsupials. Evolutionary analyses suggest that the gene has generally evolved under purifying selection, indicative of a functional gene. The presence of mRNA products in humans, pigs, and cows also suggest that the gene is expressed and likely to be functional. The analyses support the hypothesis that the alpha(D)-globin gene lineage has an ancient evolutionary origin that predates the divergence of amniotes. The structural similarity of alpha-globin gene clusters of marsupials and humans suggest that an eight gene cluster (5'-zeta2-zeta1-alpha(D)-alpha3-alpha2-alpha1-theta-omega-3'), including seven alpha-like genes and one beta-like globin gene (omega-globin) existed in the common ancestor of all marsupial and eutherian mammals. This basic structure has remained relatively stable in marsupials and in the lineage leading to humans, although omega-globin has been lost from the alpha-globin gene cluster of humans.
Collapse
|
5
|
Abstract
Proteins evolved through the shuffling of functional domains, and therefore, the same domain can be found in different proteins and species. Interactions between such conserved domains often involve specific, well-determined binding surfaces reflecting their important biological role in a cell. To find biologically relevant interactions we developed a method of systematically comparing and classifying protein domain interactions from the structural data. As a result, a set of conserved binding modes (CBMs) was created using the atomic detail of structure alignment data and the protein domain classification of the Conserved Domain Database. A conserved binding mode is inferred when different members of interacting domain families dock in the same way, such that their structural complexes superimpose well. Such domain interactions with recurring structural themes have greater significance to be biologically relevant, unlike spurious crystal packing interactions. Consequently, this study gives lower and upper bounds on the number of different types of interacting domain pairs in the structure database on the order of 1000-2000. We use CBMs to create domain interaction networks, which highlight functionally significant connections by avoiding many infrequent links between highly connected nodes. The CBMs also constitute a library of docking templates that may be used in molecular modeling to infer the characteristics of an unknown binding surface, just as conserved domains may be used to infer the structure of an unknown protein. The method's ability to sort through and classify large numbers of putative interacting domain pairs is demonstrated on the oligomeric interactions of globins.
Collapse
|
6
|
Axolotl hemoglobin: cDNA-derived amino acid sequences of two α globins and a β globin from an adult Ambystoma mexicanum. Comp Biochem Physiol B Biochem Mol Biol 2005; 142:258-68. [PMID: 16143550 DOI: 10.1016/j.cbpc.2005.07.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2005] [Revised: 07/03/2005] [Accepted: 07/05/2005] [Indexed: 10/25/2022]
Abstract
Erythrocytes of the adult axolotl, Ambystoma mexicanum, have multiple hemoglobins. We separated and purified two kinds of hemoglobin, termed major hemoglobin (Hb M) and minor hemoglobin (Hb m), from a five-year-old male by hydrophobic interaction column chromatography on Alkyl Superose. The hemoglobins have two distinct alpha type globin polypeptides (alphaM and alpham) and a common beta globin polypeptide, all of which were purified in FPLC on a reversed-phase column after S-pyridylethylation. The complete amino acid sequences of the three globin chains were determined separately using nucleotide sequencing with the assistance of protein sequencing. The mature globin molecules were composed of 141 amino acid residues for alphaM globin, 143 for alpham globin and 146 for beta globin. Comparing primary structures of the five kinds of axolotl globins, including two previously established alpha type globins from the same species, with other known globins of amphibians and representatives of other vertebrates, we constructed phylogenetic trees for amphibian hemoglobins and tetrapod hemoglobins. The molecular trees indicated that alphaM, alpham, beta and the previously known alpha major globin were adult types of globins and the other known alpha globin was a larval type. The existence of two to four more globins in the axolotl erythrocyte is predicted.
Collapse
|
7
|
Characterizing conserved structural contacts by pair-wise relative contacts and relative packing groups. J Mol Biol 2005; 354:706-21. [PMID: 16269154 DOI: 10.1016/j.jmb.2005.09.081] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2005] [Revised: 09/06/2005] [Accepted: 09/26/2005] [Indexed: 10/25/2022]
Abstract
To adequately deal with the inherent complexity of interactions between protein side-chains, we develop and describe here a novel method for characterizing protein packing within a fold family. Instead of approaching side-chain interactions absolutely from one residue to another, we instead consider the relative interactions of contacting residue pairs. The basic element, the pair-wise relative contact, is constructed from a sequence alignment and contact analysis of a set of structures and consists of a cluster of similarly oriented, interacting, side-chain pairs. To demonstrate this construct's usefulness in analyzing protein structure, we used the pair-wise relative contacts to analyze two sets of protein structures as defined by SCOP: the diverse globin-like superfamily (126 structures) and the more uniform heme binding globin family (a 94 structure subset of the globin-like superfamily). The superfamily structure set produced 1266 unique pair-wise relative contacts, whereas the family structure subset gave 1001 unique pair-wise relative contacts. For both sets, we show that these constructs can be used to accurately and automatically differentiate between fold classes. Furthermore, these pair-wise relative contacts correlate well with sequence identity and thus provide a direct relationship between changes in sequence and changes in structure. To capture the complexity of protein packing, these pair-wise relative contacts can be superimposed around a single residue to create a multi-body construct called a relative packing group. Construction of convex hulls around the individual packing groups provides a measure of the variation in packing around a residue and defines an approximate volume of space occupied by the groups interacting with a residue. We find that these relative packing groups are useful in understanding the structural quality of sequence or structure alignments. Moreover, they provide context to calculate a value for structural randomness, which is important in properly assessing the quality of a structural alignment. The results of this study provide the framework for future analysis for correlating sequence changes to specific structure changes.
Collapse
|
8
|
Neuroglobin and cytoglobin in search of their role in the vertebrate globin family. J Inorg Biochem 2005; 99:110-9. [PMID: 15598495 DOI: 10.1016/j.jinorgbio.2004.11.009] [Citation(s) in RCA: 253] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2004] [Revised: 10/01/2004] [Accepted: 10/24/2004] [Indexed: 11/16/2022]
Abstract
Neuroglobin and cytoglobin are two recent additions to the family of heme-containing respiratory proteins of man and other vertebrates. Here, we review the present state of knowledge of the structures, ligand binding kinetics, evolution and expression patterns of these two proteins. These data provide a first glimpse into the possible physiological roles of these globins in the animal's metabolism. Both, neuroglobin and cytoglobin are structurally similar to myoglobin, although they contain distinct cavities that may be instrumental in ligand binding. Kinetic and structural studies show that neuroglobin and cytoglobin belong to the class of hexa-coordinated globins with a biphasic ligand-binding kinetics. Nevertheless, their oxygen affinities resemble that of myoglobin. While neuroglobin is evolutionarily related to the invertebrate nerve-globins, cytoglobin shares a more recent common ancestry with myoglobin. Neuroglobin expression is confined mainly to brain and a few other tissues, with the highest expression observed in the retina. Present evidence points to an important role of neuroglobin in neuronal oxygen homeostasis and hypoxia protection, though other functions are still conceivable. Cytoglobin is predominantly expressed in fibroblasts and related cell types, but also in distinct nerve cell populations. Much less is known about its function, although in fibroblasts it might be involved in collagen synthesis.
Collapse
|
9
|
Automatic generation and evaluation of sparse protein signatures for families of protein structural domains. Protein Sci 2005; 14:13-23. [PMID: 15608116 PMCID: PMC2253312 DOI: 10.1110/ps.04929005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
We identified key residues from the structural alignment of families of protein domains from SCOP which we represented in the form of sparse protein signatures. A signature-generating algorithm (SigGen) was developed and used to automatically identify key residues based on several structural and sequence-based criteria. The capacity of the signatures to detect related sequences from the SWISSPROT database was assessed by receiver operator characteristic (ROC) analysis and jack-knife testing. Test signatures for families from each of the main SCOP classes are described in relation to the quality of the structural alignments, the SigGen parameters used, and their diagnostic performance. We show that automatically generated signatures are potently diagnostic for their family (ROC50 scores typically >0.8), consistently outperform random signatures, and can identify sequence relationships in the "twilight zone" of protein sequence similarity (<40%). Signatures based on 15%-30% of alignment positions occurred most frequently among the best-performing signatures. When alignment quality is poor, sparser signatures perform better, whereas signatures generated from higher-quality alignments of fewer structures require more positions to be diagnostic. Our validation of signatures from the Globin family shows that when sequences from the structural alignment are removed and new signatures generated, the omitted sequences are still detected. The positions highlighted by the signature often correspond (alignment specificity >0.7) to the key positions in the original (non-jack-knifed) alignment. We discuss potential applications of sparse signatures in sequence annotation and homology modeling.
Collapse
|
10
|
Neuroglobin and other hexacoordinated hemoglobins show a weak temperature dependence of oxygen binding. Biophys J 2005; 87:1196-204. [PMID: 15298922 PMCID: PMC1304458 DOI: 10.1529/biophysj.104.042168] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Mouse and human neuroglobins, as well as the hemoglobins from Drosophila melanogaster and Arabidopsis thaliana, were recombinantly expressed in Escherichia coli, and their ligand-binding properties were studied versus temperature. These globins have a common feature of being hexacoordinated (via the distal histidine) under deoxy conditions, as evidenced by a large amplitude for the alpha absorption band at 560 nm and the Soret band at 426 nm. The transition from the hexacoordinated form to the CO bound species is slow, as expected for a replacement reaction Fe-His --> Fe --> FeCO. The intrinsic binding rates would indicate a high oxygen affinity for the pentacoordinated form, due to rapid association and slow (100 ms-1 s) dissociation. However, the competing protein ligand results in a much lower affinity, on the order of magnitude of 1 torr. In addition to decreasing the affinity for external ligand, the competitive internal ligand leads to a weaker observed temperature dependence of the ligand affinity, since the difference in equilibrium energy for the two ligands is much lower than that of ligand binding to pentacoordinated hemoglobin. This effect could be of biological relevance for certain organisms, since it could provide a globin with an oxygen affinity that is nearly independent of temperature.
Collapse
|
11
|
[Quantitative analysis of human globin gene expression in beta-thalassemia using real-time RT-PCR]. YI CHUAN = HEREDITAS 2005; 27:57-64. [PMID: 15730961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]
Abstract
Whole blood samples were collected from 100 normal healthy adults, from umbilical cord of 33 newborn infants, 111 individuals with beta-thalassemia minor (beta(T)/beta(A),alphaalpha/alphaalpha) and 39 with beta-thalassemia major (beta(T)/beta(T),alphaalpha/alphaalpha). Prior to quantitative analysis of globin gene expression, DNA was extracted from all blood samples and used for beta-thalassemia genotype analysis. Different types of beta globin gene mutations were analyzed using reverse dot blotting (RDB) method. Total RNA were extracted and subjected to real-time RT-PCR for quantitative measurement of alpha, beta and gamma globin mRNA using three sets of primers and fluorescent-labeled probes, designed according to the sequences of alpha, beta and gamma human globin gene. Real-time RT-PCR was performed in ABI 7700 system. Following the real-time RT-PCR, the mean values of alpha, beta and gamma globin mRNA were calculated and the ratios of alpha/beta, alpha/(beta + gamma ) and gamma /(beta + gamma ) were determined to characterize the relative expression levels of different globin genes among normal adult, infant, beta-thalassemia minor and beta-thalassemia major patients. The resultant data were analyzed using SPSS 10.0 software to determine statistical significance of human globin gene expression among normal controls and beta-thalassemia patients. Due to vast variations of the mean globin gene mRNA levels among different groups, log conversion of alpha/beta + 1, alpha/(beta + gamma ) + 1 and gamma /(beta + gamma ) +1 was used for statistical analyses and intergroup comparison. The alpha/beta globin gene mRNA ratios were determined to be 4.62+/-1.20, 7.81+/-2.89, 13.51+/-5.12, and 188.24+/-374.04 for normal healthy adult (beta(A)/beta(A),alphaalpha/alphaalpha), infant (beta(A)/beta(A),alphaalpha/alphaalpha), beta- thalassemia minor (beta(T)/beta(A),alphaalpha/alphaalpha) and beta-thalassemia major(beta(T)/beta(T),alphaalpha/alphaalpha) respectively. The alpha/(beta+ gamma ) ratios were 4.43+/-1.17, 0.56+/-0.49, 9.62+/-4.37, and 2.14+/-1.58 for normal healthy adult (beta(A)/beta(A),alphaalpha/alphaalpha), infant (beta(A)/beta(A),alphaalpha/alphaalpha), beta- thalassemia minor (beta(T)/beta(A),alphaalpha/alphaalpha) and beta- thalassemia major(beta(T)/beta(T),alphaalpha/alphaalpha) respectively. The gamma /(beta+ gamma ) ratios were 0.04+/-0.03, 0.92+/-0.06, 0.28+/-0.18, and 0.95+/-0.04 for normal healthy adult (beta(A)/beta(A),alphaalpha/alphaalpha), infant (beta(A)/beta(A),alphaalpha/alphaalpha), beta- thalassemia minor (beta(T)/beta(A),alphaalpha/alphaalpha) and beta- thalassemia major (beta(T)/beta(T),alphaalpha/alphaalpha) respectively. Following statistical analyses, the alpha/beta and alpha/(beta+ gamma ) globin gene mRNA ratios were significantly different among four different groups (normal adult, normal infant, beta- thalassemia minor and beta- thalassemia major). The gamma /(beta + gamma ) globin gene mRNA ratio was significantly different among all groups except for between infant and beta- thalassemia major patients. Human beta globin gene mRNA levels decrease progressively and dramatically from normal adults to beta-thalassemia patients with beta-thalassemia major having the lowest levels. On the other hand, the gamma globin gene mRNA levels increase progressively from normal adult to beta-thalassemia patients with beta-thalassemia major having the highest levels. Infants have relatively lower levels of beta but higher levels of gamma globin gene mRNA as compared to those in normal adults. Thus, the relative expression levels of alpha, beta or gamma globin genes varied but inter-related among different ages of normal individuals and different beta-thalassemia genotypes.
Collapse
|
12
|
The complete amino acid sequences of four globins from the land leech Haemadipsa zeylanica var. japonica. Comp Biochem Physiol B Biochem Mol Biol 2004; 137:443-54. [PMID: 15081996 DOI: 10.1016/j.cbpc.2004.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2003] [Revised: 01/15/2004] [Accepted: 01/17/2004] [Indexed: 11/16/2022]
Abstract
The amino acid sequences of four globins from the land leech, Haemadipsa zeylanica var. japonica, were determined using nucleotide sequencing and protein sequencing. The mature globin-molecules were composed of 146 amino acid residues for M-1 globin, 156 for M-2 globin, 143 for D-1 globin, and 149 for D-2 globin. Alignment of the four kinds of globins by Clustal X revealed 22 invariant amino acids. The four globins were 26-33% identical. A striking feature of amino acid alteration was: the replacement of the E7 distal-His of D-1 globin by phenylalanine because histidine is conserved among the rest of the globins of H. zeylanica, those of other representative species (Lumbricus and Tylorrhynchus) of Annelida and most other hemoglobins. A phylogenetic tree constructed of 18 globin structures including two species of leeches, H. zeylanica (a land leech) and Macrobdella decora (a freshwater leech), T. heterochaetus (a representative species of polychaetes), L. terrestris (a representative species of oligochaetes), and human alpha and beta globins strongly indicated that the leech globins first separated from globin lineage of annelids.
Collapse
|
13
|
Abstract
The key position of the Ciona intestinalis basal to the vertebrate phylogenetic tree brings up the question of which respiratory proteins are used by the tunicate to facilitate oxygen transport and storage. The publication of the Ciona draft genome sequence suggests that globin genes are completely missing and that-like some molluscs and arthropods-the sea squirt uses hemocyanin instead of hemoglobin for respiration. However, we report here the presence and expression of at least four distinct globin gene/protein sequences in Ciona. This finding is in agreement with the ancestral phylogeny of the vertebrate globins. Moreover, it seems likely that the Ciona hemocyanin-like sequences have enzymatic instead of respiratory functions.
Collapse
|
14
|
Abstract
Vertebrates possess multiple respiratory globins that differ in terms of structure, function, and tissue distribution. Three types of globins have been described so far: hemoglobin facilitates the transport of oxygen in the blood, myoglobin serves oxygen transport and storage in the muscle, and neuroglobin has a yet unidentified function in nerve cells. Here we report the identification of a fourth and novel type of globin in mouse, man, and zebrafish. It is expressed in apparently all types of human tissue and therefore has been called cytoglobin (CYGB). Mouse and human CYGBs comprise 190 amino acids; the zebrafish CYGB, 174 amino acids. The human CYGB gene is located on chromosome 17q25. The mammalian genes display a unique exon-intron pattern with an additional exon resulting in a C-terminal extension of the protein, which is absent in the fish CYGB. Phylogenetic analyses suggest that the CYGBs had a common ancestor with vertebrate myoglobins. This indicates that the vertebrate myoglobins are in fact a specialized intracellular globin that evolved in adaptation to the special needs of muscle cells.
Collapse
|
15
|
Neuroglobins from the zebrafish Danio rerio and the pufferfish Tetraodon nigroviridis. Biochem Biophys Res Commun 2001; 287:418-21. [PMID: 11554744 DOI: 10.1006/bbrc.2001.5614] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Neuroglobin is a recently discovered respiratory, porphyrin-containing protein that is expressed in the brain of mouse and man. Here we show that neuroglobin is also present in the teleost fish. Complete cDNA sequences are reported from the pufferfish Tetraodon nigroviridis and the zebrafish Danio rerio. In addition, the neuroglobin gene of T. nigroviridis was sequenced, demonstrating the conservation of the B12.2, E11.0 and G7.0 introns plus the presence of an additional intron in the 5' noncoding region. The fish neuroglobins each comprise 159 amino acids and are 84.3% identical. Phylogenetic analyses show a basal position of the neuroglobins within the metazoan globin tree. An enhanced amino acid substitution rate was estimated for the fish neuroglobins ( approximately 0.93 x 10(-9) amino acid substitutions per site and year) compared with their mammalian proteins ( approximately 0.39 x 10(-9) replacements per site and year).
Collapse
|
16
|
Abstract
The mammaglobin gene has been shown to be preferentially expressed in breast tissue. Few genes match its specificity. Mammaglobin has generated much interest, and studies are ongoing to develop diagnostic tests for breast cancer based on the detection of mammaglobin. While searching the Incyte Genomics Lifeseq database for tissue-specific markers, we observed a second secretoglobin, BU101, also known as lipophilin B. We report here that mammaglobin, in breast tissue, is found as a complex with BU101. The complex was isolated from breast cancer tissue and was characterized as the biologically relevant form of mammaglobin.
Collapse
|
17
|
|
18
|
New views of evolution and regulation of vertebrate beta-like globin gene clusters from an orphaned gene in marsupials. Proc Natl Acad Sci U S A 2001; 98:1327-9. [PMID: 11171947 PMCID: PMC33376 DOI: 10.1073/pnas.98.4.1327] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
19
|
Abstract
Haemoglobins and myoglobins constitute related protein families that function in oxygen transport and storage in humans and other vertebrates. Here we report the identification of a third globin type in man and mouse. This protein is predominantly expressed in the brain, and therefore we have called it neuroglobin. Mouse neuroglobin is a monomer with a high oxygen affinity (half saturation pressure, P50 approximately 2 torr). Analogous to myoglobin, neuroglobin may increase the availability of oxygen to brain tissue. The human neuroglobin gene (NGB), located on chromosome 14q24, has a unique exon-intron structure. Neuroglobin represents a distinct protein family that diverged early in metazoan evolution, probably before the Protostomia/Deuterostomia split.
Collapse
|
20
|
The rat beta (b miny)-globin promoter: nuclear protein factors and erythroid-specific induction of transcription. Cell Mol Life Sci 1999; 56:871-81. [PMID: 11212345 DOI: 10.1007/pl00000617] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
We show that the rat adult beta (b miny)-globin gene is transcriptionally active. The - 100-bp promoter region contains control elements that are important for the induction of transcription in rat erythroleukemia (REL) cells. By using DNaseI footprinting and gel mobility shift assays, we have shown that the CCAAT box, a regulatory element from an analyzed promoter region, binds NF-Y and GATA-1 transcription factors. Another regulatory element from this region, betaDRE, binds erythroid-specific and ubiquitous factors from REL cells. Although both the CCAAT box region and betaDRE element bind the same protein factors before and after induction of REL cells, we show by South-Western blot analysis that the concentrations of 150-kDa, 70-kDa and 60-kDa factors binding to the betaDRE are increased in DMSO-induced REL cells.
Collapse
|
21
|
Beta-globin haplotypes analysis in Afro-Brazilians from the Amazon region: evidence for a significant gene flow from Atlantic West Africa. Ann Hum Biol 1999; 26:365-73. [PMID: 10462156 DOI: 10.1080/030144699282679] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Beta-globin gene cluster haplotypes were analysed in betaA-, betaS- and betaC-globin gene-bearing chromosomes in black people from Curiau, Pacoval and Trombetas, three communities made up of descendants of African slaves, located in the Northern region of Brazil. The betaA haplotype distribution is consistent with the African origin of the populations, with some degree of local differentiation and admixture with people of Caucasian ancestry and/or Amerindians. In addition, the betaS haplotype distribution (60% Bantu; 30% Senegal and 10% Benin) suggests that although African slaves brought to Northern region have been predominantly from regions where the Bantu haplotype predominates, there is also evidence of the presence of slaves from West Africa, particularly from the Atlantic West.
Collapse
|
22
|
Relative levels of alpha-, beta-, and gamma-mRNA from patients with severe and intermediate beta-thalassemia major. Acta Haematol 1997; 97:205-10. [PMID: 9158662 DOI: 10.1159/000203684] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
We have determined the relative quantities of gamma- and beta-mRNAs and the alpha/beta-mRNA ratios in 37 patients with beta-thalassemia major with specific genotypes, namely 8 with a homozygosity for codon (CD) 39 (C-->T), 7 with a homozygosity for IVS-I-110 (G-->A), 5 with a homozygosity for IVS-I-6 (T-->C), for 15 patients with compound heterozygosities for 2 of these 3 mutations, and for 2 patients with the IVS-I-110 (G-->A)/-87 (C-->G) mutations. None had an alpha-thalassemia. Twelve patients had thalassemia intermedia and the remainder, transfusion-dependent severe conditions. Differences in phenotype were observed for compound heterozygotes involving the IVS-I-6 (T-->C) mutation in combination with either the IVS-I-110 (G-->A) or the CD 39 (C-->T) mutations: patients with thalassemia intermedia had a lower alpha/beta-mRNA ratio, about half of that of the patients with severe beta-thalassemia major. This might suggest a higher beta-mRNA synthesis in some patients than in others with the same genotype; mutations in promoter, enhancer, and/or locus control region sequences may be responsible for these differences. In vitro chain synthesis data were too incomplete to be helpful in this study. The RT-PCR procedure allowed the separation of abnormal (extended) mRNA from normal beta-RNA in subjects carrying the IVS-I-110 (G-->T) mutation. The relative quantities of this beta Th-mRNA (% of beta A + beta Th) were determined by scanning of the appropriate autoradiograms; they averaged 25% for homozygotes and about 4% for heterozygotes, indicating a considerable instability of the message.
Collapse
|
23
|
Amino acid sequence of myoglobin from the chiton Liolophura japonica and a phylogenetic tree for molluscan globins. JOURNAL OF PROTEIN CHEMISTRY 1993; 12:45-50. [PMID: 8427632 DOI: 10.1007/bf01024913] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Myoglobin was isolated from the radular muscle of the chiton Liolophura japonica, a primitive archigastropodic mollusc. Liolophura contains three monomeric myoglobins (I, II, and III), and the complete amino acid sequence of myoglobin I has been determined. It is composed of 145 amino acid residues, and the molecular mass was calculated to be 16,070 D. The E7 distal histidine, which is replaced by valine or glutamine in several molluscan globins, is conserved in Liolophura myoglobin. The autoxidation rate at physiological conditions indicated that Liolophura oxymyoglobin is fairly stable when compared with other molluscan myoglobins. The amino acid sequence of Liolophura myoglobin shows low homology (11-21%) with molluscan dimeric myoglobins and hemoglobins, but shows higher homology (26-29%) with monomeric myoglobins from the gastropodic molluscs Aplysia, Dolabella, and Bursatella. A phylogenetic tree was constructed from 19 molluscan globin sequences. The tree separated them into two distinct clusters, a cluster for muscle myoglobins and a cluster for erythrocyte or gill hemoglobins. The myoglobin cluster is divided further into two subclusters, corresponding to monomeric and dimeric myoglobins, respectively. Liolophura myoglobin was placed on the branch of monomeric myoglobin lineage, showing that it diverged earlier from other monomeric myoglobins. The hemoglobin cluster is also divided into two subclusters. One cluster contains homodimeric, heterodimeric, tetrameric, and didomain chains of erythrocyte hemoglobins of the blood clams Anadara, Scapharca, and Barbatia. Of special interest is the other subcluster. It consists of three hemoglobin chains derived from the bacterial symbiontharboring clams Calyptogena and Lucina, in which hemoglobins are supposed to play an important role in maintaining the symbiosis with sulfide bacteria.
Collapse
|
24
|
A comparison of several similarity indices used in the classification of protein sequences: a multivariate analysis. Nucleic Acids Res 1992; 20:3631-7. [PMID: 1641329 PMCID: PMC334011 DOI: 10.1093/nar/20.14.3631] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The present work describes an attempt to identify reliable criteria which could be used as distance indices between protein sequences. Seven different criteria have been tested: i and ii) the scores of the alignments as given by the BESTFIT and the FASTA programs; iii) the ratio parameter, i.e. the BESTFIT score divided by the length of the aligned peptides; iv and v) the statistical significance (Z-scores) of the scores calculated by BESTFIT and FASTA, as obtained by comparison with shuffled sequences; vi) the Z-scores provided by the program RELATE which performs a segment-by-segment comparison of 2 sequences, and vii) an original distance index calculated by the program DOCMA from all the pairwise dotplots between the sequences. These 7 criteria have been tested against the aminoacid sequences of 39 globins and those of the 20 aminoacyl-tRNA synthetases from E. coli. The distances between the sequences were analyzed by the multivariate analysis techniques. The results show that the distances calculated from the scores of the pairwise alignments are not adequately sensitive. The Z-score from RELATE is not selective enough and too demanding in computer time. Three criteria gave a classification consistent with the known similarities between the sequences in the sets, namely the Z-scores from BESTFIT and FASTA and the multiple dotplot comparison distance index from DOCMA.
Collapse
|
25
|
Abstract
A simple isoelectric focusing technique is presented for the screening of alpha- and beta-globin chain variants on polyacrylamide gel in the presence of 8 M urea. No detergent was used. Variants focused at discrete pH values, depending on the charge variation (-2, -1, 0, +1 or +2) introduced by the mutation. A linear relationship was evidenced between the charge variation and the shift of the apparent isoelectric point of the variants. The practical interest of this procedure is illustrated.
Collapse
|