151
|
Bouck J, Miller W, Gorrell JH, Muzny D, Gibbs RA. Analysis of the quality and utility of random shotgun sequencing at low redundancies. Genome Res 1998; 8:1074-84. [PMID: 9799794 PMCID: PMC310787 DOI: 10.1101/gr.8.10.1074] [Citation(s) in RCA: 48] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The currently favored approach for sequencing the human genome involves selecting representative large-insert clones (100-200 kb), randomly shearing this DNA to construct shotgun libraries, and then sequencing many different isolates from the library. This method, entitled directed random shotgun sequencing, requires highly redundant sequencing to obtain a complete and accurate finished consensus sequence. Recently it has been suggested that a rapidly generated lower redundancy sequence might be of use to the scientific community. Low-redundancy sequencing has been examined previously using simulated data sets. Here we utilize trace data from a number of projects submitted to GenBank to perform reconstruction experiments that mimic low-redundancy sequencing. These low-redundancy sequences have been examined for the completeness and quality of the consensus product, information content, and usefulness for interspecies comparisons. The data presented here suggest three different sequencing strategies, each with different utilities. (1) Nearly complete sequence data can be obtained by sequencing a random shotgun library at sixfold redundancy. This may therefore represent a good point to switch from a random to directed approach. (2) Sequencing can be performed with as little as twofold redundancy to find most of the information about exons, EST hits, and putative exon similarity matches. (3) To obtain contiguity of coding regions, sequencing at three- to fourfold redundancy would be appropriate. From these results, we suggest that a useful intermediate product for genome sequencing might be obtained by three- to fourfold redundancy. Such a product would allow a large amount of biologically useful data to be extracted while postponing the majority of work involved in producing a high quality consensus sequence.
Collapse
|
152
|
Florea L, Hartzell G, Zhang Z, Rubin GM, Miller W. A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res 1998; 8:967-74. [PMID: 9750195 PMCID: PMC310774 DOI: 10.1101/gr.8.9.967] [Citation(s) in RCA: 559] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We address the problem of efficiently aligning a transcribed and spliced DNA sequence with a genomic sequence containing that gene, allowing for introns in the genomic sequence and a relatively small number of sequencing errors. A freely available computer program, described herein, solves the problem for a 100-kb genomic sequence in a few seconds on a workstation.
Collapse
|
153
|
Zhang Z, Schäffer AA, Miller W, Madden TL, Lipman DJ, Koonin EV, Altschul SF. Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res 1998; 26:3986-90. [PMID: 9705509 PMCID: PMC147803 DOI: 10.1093/nar/26.17.3986] [Citation(s) in RCA: 217] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Protein families often are characterized by conserved sequence patterns or motifs. A researcher frequently wishes to evaluate the significance of a specific pattern within a protein, or to exploit knowledge of known motifs to aid the recognition of greatly diverged but homologous family members. To assist in these efforts, the pattern-hit initiated BLAST (PHI-BLAST) program described here takes as input both a protein sequence and a pattern of interest that it contains. PHI-BLAST searches a protein database for other instances of the input pattern, and uses those found as seeds for the construction of local alignments to the query sequence. The random distribution of PHI-BLAST alignment scores is studied analytically and empirically. In many instances, the program is able to detect statistically significant similarity between homologous proteins that are not recognizably related using traditional single-pass database search methods. PHI-BLAST is applied to the analysis of CED4-like cell death regulators, HS90-type ATPase domains, archaeal tRNA nucleotidyltransferases and archaeal homologs of DnaG-type DNA primases.
Collapse
|
154
|
Abstract
Given a strong match between regions of two sequences, how far can the match be meaningfully extended if gaps are allowed in the resulting alignment? The aim is to avoid searching beyond the point that a useful extension of the alignment is likely to be found. Without loss of generality, we can restrict attention to the suffixes of the sequences that follow the strong match, which leads to the following formal problem. Given two sequences and a fixed X > 0, align initial portions of the sequences subject to the constraint that no section of the alignment scores below -X. Our results indicate that computing an optimal alignment under this constraint is very expensive. However, less rigorous conditions on the alignment can be guaranteed by quite efficient algorithms. One of these variants has been implemented in a new release of the Blast suite of database search programs.
Collapse
|
155
|
Cragg L, Hebbel RP, Miller W, Solovey A, Selby S, Enright H. The iron chelator L1 potentiates oxidative DNA damage in iron-loaded liver cells. Blood 1998; 92:632-8. [PMID: 9657765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Iron-mediated carcinogenesis is thought to occur through the generation of oxygen radicals. Iron chelators are used in attempts to prevent the long term consequences of iron overload. In particular, 1,2-dimethyl-3-hydroxypyrid-4-one (L1), has shown promise as an effective chelator. Using an established hepatocellular model of iron overload, we studied the generation of iron-catalyzed oxidative DNA damage and the influence of iron chelators, including L1, on such damage. Iron loading of HepG2 cells was found to greatly exacerbate hydrogen peroxide-mediated DNA damage. Desferrithiocin was protective against iron/hydrogen peroxide-induced DNA damage; deferoxamine had no effect. In contrast, L1 exposure markedly potentiated hydrogen peroxide-mediated oxidative DNA damage in iron-loaded liver cells. However, when exposure to L1 was maintained during incubation with hydrogen peroxide, L1 exerted a protective effect. We interpret this as indicating that L1's potential toxicity is highly dependent on the L1:iron ratio. In vitro studies examining iron-mediated ascorbate oxidation in the presence of L1 showed that an L1:iron ratio must be at least 3 to 1 for L1 to inhibit the generation of free radicals; at lower concentrations of L1 increased oxygen radical generation occurs. In the clinical setting, such potentiation of iron-catalyzed oxidative DNA damage at low L1:iron ratios may lead to long-term toxicities that might preclude administration of L1 as an iron chelator. Whether this implication in fact extends to the in vivo situation will have to be verified in animal studies.
Collapse
|
156
|
McGuigan C, Perry A, Yarnold CJ, Sutton PW, Lowe D, Miller W, Rahim SG, Slater MJ. Synthesis and evaluation of some masked phosphate esters of the anti-herpesvirus drug 882C (netivudine) as potential antiviral agents. Antivir Chem Chemother 1998; 9:233-43. [PMID: 9875402 DOI: 10.1177/095632029800900304] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
A number of symmetric and asymmetric 5'-phosphate esters of the potent anti-varicella-zoster virus (VZV) agent 1-(beta-D-arabinofuranosyl)-5-prop-1-ynyluracil (882C; netivudine) were prepared as potential lipophilic, membrane-soluble prodrugs of the bio-active phosphate forms. The compounds were prepared by the base-catalysed coupling of various phosphorochloridates with the free nucleoside analogue. Compounds were fully characterized by a range of spectroscopic and analytical methods and were studied for their inhibition of several viruses in tissue culture. All of the phosphate esters were inactive against human cytomegalovirus, herpes simplex virus type 2, VZV, human immunodeficiency virus type 1 and influenza A virus (EC50 > 100 microM) except the 5'-(4-nitrophenyl phenyl) phosphate, which inhibited influenza A virus. The relative rate of esterase-mediated hydrolysis of one of the lead target structures was measured in order to rationalize the poor antiviral action, and data were collected on possible metabolites in support of this analysis. Cell-specific esterases are implicated as key determinants of the antiviral potency of prodrugs of this type.
Collapse
|
157
|
Abstract
This study was undertaken to examine differences in caffeine and nicotine use between the psychiatric population and the addicted population in a private psychiatric inpatient facility. Eighty-six patients on an adult addictive disease inpatient unit and 80 patients on an adult psychiatry unit in a private psychiatric hospital were interviewed with regard to their use of nicotine and caffeine. In addition, demographic information and primary diagnoses were obtained from the psychiatric admission assessment in the medical record as listed by the admitting psychiatrist. Although there was little difference in psychiatric patients vs. chemically dependent patients with regard to the percentage of caffeine users, the chemically dependent individuals drank more coffee, soft drinks, and tea. A much greater percentage of the chemically dependent individuals also smoked cigarettes, although not in a greater amount than the psychiatric patients who smoked. Because group assignment was not random, ordinary least squares (OLS) regression analyses were conducted to determine the independent associations of age, sex, education, and treatment population in predicting levels of caffeine and tobacco use. Even after controlling for demographic differences between the two samples, chemically dependent patients still reported higher levels of daily caffeine and tobacco use than patients on the general psychiatric unit.
Collapse
|
158
|
Li J, Noguchi CT, Miller W, Hardison R, Schechter AN. Multiple regulatory elements in the 5'-flanking sequence of the human epsilon-globin gene. J Biol Chem 1998; 273:10202-9. [PMID: 9553070 DOI: 10.1074/jbc.273.17.10202] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We have previously reported, on the basis of transfection experiments, the existence of a silencer element in the 5'-flanking region of the human embryonic (epsilon) globin gene, located at -270 base pairs 5' to the cap site, which provides negative regulation for this gene. Experiments in transgenic mice suggest the physiological importance of this epsilon-globin silencer, but also suggest that down-regulation of epsilon-globin gene expression may involve other negative elements flanking the epsilon-globin gene. We have now extended the analysis of epsilon-globin gene regulation to include the flanking region spanning up to 6 kilobase pairs 5' to the locus control region using reporter gene constructs with deletion mutations and transient transfection assays. We have identified and characterized other strong negative regulatory regions, as well as several positive regions that affect transcription activation. The negative regulatory regions at -3 kilobase pairs (epsilonNRA-I and epsilonNRA-II), flanked by a positive control element, has a strong effect on the epsilon-globin promoter both in erythroid K562 and nonerythroid HeLa cells and contains several binding sites for transcription factor GATA-1, as evidenced from DNA-protein binding assays. The GATA-1 sites within epsilonNRA-II are directly needed for negative control. Both epsilonNRA-I and epsilonNRA-II are active on a heterologous promoter and hence appear to act as transcription silencers. Another negative control region located at -1.7 kilobase pairs (epsilonNRB) does not exhibit general silencer activity as epsilonNRB does not affect transcription activity when used in conjunction with an epsilon-globin minimal promoter. The negative effect of epsilonNRB is erythroid specific, but not stage-specific as it can repress transcription activity in both K562 erythroid cells as well as in primary cultures of adult erythroid cells. Phylogenetic DNA sequence comparisons with other primate and other mammalian species show unusual degree of flanking sequence homology for the epsilon-globin gene, including in several of the regions identified in these functional and DNA-protein binding analyses, providing alternate evidence for their potential importance. We suggest that the down-regulation of epsilon-globin gene expression as development progresses involves complex, cooperative interactions of these negative regulatory elements, epsilonNRA-I/epsilonNRA-II, epsilonNRB, the epsilon-globin silencer and probably other negative and positive elements in the 5'-flanking region of the epsilon-globin gene.
Collapse
|
159
|
Chui DH, Hardison R, Riemer C, Miller W, Carver MF, Molchanova TP, Efremov GD, Huisman TH. An electronic database of human hemoglobin variants on the World Wide Web. Blood 1998; 91:2643-4. [PMID: 9531571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
160
|
Hardison RC, Chui DH, Riemer CR, Miller W, Carver MF, Molchanova TP, Efremov GD, Huisman TH. Access to a syllabus of human hemoglobin variants (1996) via the World Wide Web. Hemoglobin 1998; 22:113-27. [PMID: 9576329 DOI: 10.3109/03630269809092136] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Information on mutations in human hemoglobin is important in many efforts, including understanding the pathophysiology of hemoglobin diseases, developing therapies, elucidating the dynamics of sequence alterations inhuman populations, and dissecting the details of protein structure/function relationships. Currently, information is available on a large number of mutations and variants, but is distributed among thousands of papers. In an effort to organize this voluminous data set, two Syllabi have been prepared compiling succinct information on human hemoglobin abnormalities. In both of these, each entry provides amino acid and/or DNA sequence alterations, hematological and clinical data, methodology used for characterization, ethnic distribution, and functional properties and stability of the hemoglobin, together with appropriate literature references. A Syllabus of Human Hemoglobin Variants (1996) describes 693 abnormal hemoglobins resulting from alterations in the alpha-, beta-, gamma-, and delta-globin chains, including special abnormalities such as double mutations, hybrid chains, elongated chains, deletions, and insertions. We have converted this resource to an electronic form that is accessible via the World Wide Web at the Globin Gene Server (http://globin.cse.psu.edu). Hyperlinks are provided from each entry in the tables of variants to the corresponding full description. In addition, a simple query interface allows the user to find all entries containing a designated word or phrase. We are in the process of converting A Syllabus of Thalassemia Mutations (1997) to a similar electronic format.
Collapse
|
161
|
Hardison R, Riemer C, Chui DH, Huisman TH, Miller W. Electronic access to sequence alignments, experimental results, and human mutations as an aid to studying globin gene regulation. Genomics 1998; 47:429-37. [PMID: 9480762 DOI: 10.1006/geno.1997.5147] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
162
|
Meisenberg B, Brehm T, Schmeckel A, Miller W, McMillan R. A combination of low-dose cyclophosphamide and colony-stimulating factors is more cost-effective than granulocyte-colony-stimulating factors alone in mobilizing peripheral blood stem and progenitor cells. Transfusion 1998; 38:209-15. [PMID: 9531956 DOI: 10.1046/j.1537-2995.1998.38298193107.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
BACKGROUND The use of peripheral blood progenitor cells (PBPCs) instead of autologous bone marrow leads to more rapid engraftment following high-dose chemotherapy. Mobilization regimens differ with respect to toxicity, efficiency, and cost. STUDY DESIGN AND METHODS Two cohorts of patients with breast cancer received one of two mobilization regimens: granulocyte-colony-stimulating factor (G-CSF) at 10 micrograms per kg was given subcutaneously for 5 days, with leukapheresis begun on Day 6, or low-dose cyclophosphamide followed by sequential granulocyte-macrophage-CSF (GM-CSF) at 5 micrograms per kg for 5 days and by G-CSF at 10 micrograms per kg, with leukapheresis begun on Day 11. Results of CD34+ cell collection, engraftment, and costs of mobilization were determined. RESULTS The combination chemotherapy and growth factor regimen was more efficient in mobilizing CD34+ cells. Sixty-six percent of patients reached a target 4 x 10(6) CD34+ cells per kg in a single leukapheresis session with the combination regimen, compared to 14 percent who received G-CSF alone (p < 0.01). The mean number of leukapheresis sessions required to reach a target of 4 x 10(6) CD34+ cells per kg was 1.3 for the combination regimen and 2.7 for the regimen of G-CSF alone (p < 0.01). One patient in the chemotherapy and growth factor group developed febrile neutropenia. Engraftment was similar in both cohorts of patients. The cost of mobilization, including all supplies and cryopreservation, was $7381 for the G-CSF regimen and $5508 for the chemotherapy regimen (p < 0.05). This reduction was attributed to the lower number of leukapheresis and cryopreservation sessions, which outweighed the slight increase in expense for chemotherapy and growth factor in the combination regimen. CONCLUSION This combination mobilization regimen allowed the predictable and efficient collection of CD34+ cells from the peripheral blood in a limited number of leukapheresis sessions, which reduced the cost of mobilization by approximately 25 percent.
Collapse
|
163
|
Miller W, Riehl E, Napier M, Barber K, Dabideen H. Use of physician assistants as surgery/trauma house staff at an American College of Surgeons-verified Level II trauma center. THE JOURNAL OF TRAUMA 1998; 44:372-6. [PMID: 9498514 DOI: 10.1097/00005373-199802000-00025] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
BACKGROUND Historically, surgical physicians staff trauma centers, which provide trauma patients with improved outcomes. Such benefits fuel the expansion of designated trauma centers. Cutbacks in residency programs of surgical specialties, however, necessitate substitutions for traditional trauma providers. METHODS A literature and record review was conducted to examine the use of physician assistants in a large community hospital's verified trauma center. Current and historical outcomes were analyzed regarding the trauma surgeon/physician assistant model. RESULTS Injury Severity Scores increased 19%, transfer time to the operating room decreased 43%, transfer time to the intensive care unit decreased 51%, and transfer time to the floor decreased 20%. The length of stay for admissions decreased 13%, and the length of stay for neurotrauma intensive care unit patients decreased 33%. CONCLUSION The Hurley Medical Center trauma surgeon/physician assistant model is a viable alternative for verified trauma centers unable to maintain a surgical residency program. Consistency and quality of care indicated by shortened length of stay is a hallmark of such a model providing the highest quality of care.
Collapse
|
164
|
Ansari-Lari MA, Oeltjen JC, Schwartz S, Zhang Z, Muzny DM, Lu J, Gorrell JH, Chinault AC, Belmont JW, Miller W, Gibbs RA. Comparative sequence analysis of a gene-rich cluster at human chromosome 12p13 and its syntenic region in mouse chromosome 6. Genome Res 1998; 8:29-40. [PMID: 9445485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The Human Genome Project has created a formidable challenge: the extraction of biological information from extensive amounts of raw sequence. With the increasing availability of genomic sequence from other species, one approach to extracting coding and regulatory element information is through cross-species sequence comparison. To assess the strengths and weaknesses of this methodology for large-scale sequence analysis, 227 kb of mouse sequence syntenic to a gene-rich cluster on human chromosome 12p13 was obtained. Primarily through percent identity plots (PIPs) of SIM comparative sequence alignments, the sequence of coding regions, putative alternative exons, conserved noncoding regions, and correlation in repetitive element insertions were easily determined. The analysis demonstrated that the number, order, and orientation of all 17 genes are conserved between the two species, whereas two human pseudogenes are absent in mouse. In addition, apart from MIRs, no direct correlation of distribution or position of the majority of repetitive elements between the two species is seen. Finally, in examining the synonymous and nonsynonymous substitution rates in the conserved genes, a large variation in nonsynonymous rates is observed indicating that the genes in this region are diverging at different rates. This study indicates the utility and strength of large-scale cross-species sequence comparisons in the extraction of biological information from raw sequence, especially when combined with other computational tools such as GRAIL and BLAST.
Collapse
|
165
|
Hardison R, Slightom JL, Gumucio DL, Goodman M, Stojanovic N, Miller W. Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene X 1997; 205:73-94. [PMID: 9461381 DOI: 10.1016/s0378-1119(97)00474-5] [Citation(s) in RCA: 175] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Locus control regions (LCRs) are cis-acting DNA segments needed for activation of an entire locus or gene cluster. They are operationally defined as DNA sequences needed to achieve a high level of gene expression regardless of the position of integration in transgenic mice or stably transfected cells. This review brings together the large amount of DNA sequence data from the beta-globin LCR with the vast amount of functional data obtained through the use of biochemical, cellular and transgenic experimental systems. Alignment of orthologous LCR sequences from five mammalian species locates numerous conserved regions, including previously identified cis-acting elements within the cores of nuclease hypersensitive sites (HSs) as well as conserved regions located between the HS cores. The distribution of these conserved sequences, combined with the effects of LCR fragments utilized in expression studies, shows that important sites are more widely distributed in the LCR than previously anticipated, especially in and around HS2 and HS3. We propose that the HS cores plus HS flanking DNAs comprise a 'unit' to which proteins bind and form an optimally functional structure. Multiple HS units (at least three: HS2, HS3 and HS4 cores plus flanking DNAs) together establish a chromatin structure that allows the proper developmental regulation of genes within the cluster.
Collapse
|
166
|
Kundel HL, Gefter W, Aronchick J, Miller W, Hatabu H, Whitfill CH, Miller W. Accuracy of bedside chest hard-copy screen-film versus hard- and soft-copy computed radiographs in a medical intensive care unit: receiver operating characteristic analysis. Radiology 1997; 205:859-63. [PMID: 9393548 DOI: 10.1148/radiology.205.3.9393548] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
PURPOSE To compare the clinical diagnostic accuracy of hard-copy readings of screen-film bedside chest radiographs and both hard- and soft-copy readings of bedside chest computed radiographs obtained in a medical intensive care unit. MATERIALS AND METHODS Two samples of 95 cases were assembled from chest images obtained in 541 patients with either screen-film radiography or computed radiography. The cases were stratified according to the clinical problem for which the examination was ordered; the corresponding diagnosis was verified by a panel of two or three radiologists. Four radiologists blindly read the hard-copy images obtained with screen-film or computed radiography. Six months later, the radiologists read the computed radiographs by using an 8-bit, 1,684 x 2,048-pixel display. The data were analyzed by using multireader-multicase receiver operating characteristic (ROC) analysis of variance. RESULTS No statistically significant differences in the area under the ROC curve were found between any of the methods. CONCLUSION The results provide some justification for using bedside chest computed radiography and for reading soft-copy images from a high-quality display.
Collapse
|
167
|
Langens-Gerrits M, Lilien-Kipnis H, Croes T, Miller W, Kollöffel C, de Klerk GJ. BULB GROWTH IN LILY REGENERATED IN VITRO. ACTA ACUST UNITED AC 1997. [DOI: 10.17660/actahortic.1997.430.39] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
168
|
Abstract
The FASTA package of sequence comparison programs has been expanded to include FASTX and FASTY, which compare a DNA sequence to a protein sequence database, translating the DNA sequence in three frames and aligning the translated DNA sequence to each sequence in the protein database, allowing gaps and frameshifts. Also new are TFASTX and TFASTY, which compare a protein sequence to a DNA sequence database, translating each sequence in the DNA database in six frames and scoring alignments with gaps and frameshifts. FASTX and TFASTX allow only frameshifts between codons, while FASTY and TFASTY allow substitutions or frameshifts within a codon. We examined the performance of FASTX and FASTY using different gap-opening, gap-extension, frameshift, and nucleotide substitution penalties. In general, FASTX and FASTY perform equivalently when query sequences contain 0-10% errors. We also evaluated the statistical estimates reported by FASTX and FASTY. These estimates are quite accurate, except when an out-of-frame translation produces a low-complexity protein sequence. We used FASTX to scan the Mycoplasma genitalium, Haemophilus influenzae, and Methanococcus jannaschii genomes for unidentified or misidentified protein-coding genes. We found at least 9 new protein-coding genes in the three genomes and at least 35 genes with potentially incorrect boundaries.
Collapse
|
169
|
Hardison RC, Oeltjen J, Miller W. Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res 1997; 7:959-66. [PMID: 9331366 DOI: 10.1101/gr.7.10.959] [Citation(s) in RCA: 209] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
170
|
Abstract
We develop several algorithms for the problem of aligning DNA sequence with a protein sequence. Our methods account for frameshift errors, but not for introns in the DNA sequence. Thus, they are particularly appropriate for comparing a cDNA sequence that suffers from sequencing errors with an amino acid sequence or a protein sequence database. We describe algorithms for computing optimal alignments for several definitions of DNA-protein alignment, verify sufficient conditions for equivalence of certain definitions, describe techniques for efficient implementation, and discuss experience with these ideas in a new release of the FASTA suite of database-searching programs.
Collapse
|
171
|
Weisdorf D, Miller J, Verfaillie C, Burns L, Wagner J, Blazar B, Davies S, Miller W, Hannan P, Steinbuch M, Ramsay N, McGlave P. Cytokine-primed bone marrow stem cells vs. peripheral blood stem cells for autologous transplantation: a randomized comparison of GM-CSF vs. G-CSF. Biol Blood Marrow Transplant 1997; 3:217-23. [PMID: 9360784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Autologous transplantation for non-Hodgkins lymphoma and Hodgkin's disease is widely used as standard therapy for those with high-risk or relapsed tumor. Peripheral blood stem cell (PBSC) collections have nearly completely replaced bone marrow stem cell (BMSC) harvests because of the perceived advantages of more rapid engraftment, less tumor contamination in the inoculum, and better survival after therapy. The advantage of PBSC, however, may derive from the hematopoietic stimulating cytokines used for PBSC mobilization. Therefore, we tested a randomized comparison of GM-CSF vs. G-CSF used to prime either BMSC or PBSC before collection for use in autologous transplantation. Sixty-two patients receiving transplants (31 PBSC; 31 BMSC) for non-Hodgkin's lymphoma (n = 51) or Hodgkin's disease (n = 11) were treated. All patients received 6 days of randomly assigned cytokine. Those with cellular marrow in morphologic remission underwent BMSC harvest, while those with hypocellular marrow or microscopic marrow tumor involvement had PBSC collected. Neutrophil recovery was similarly rapid in all groups (median 14 days; range 10-23 days), though two patients had delayed neutrophil recovery using GM-CSF primed PBSC (p = 0.01). Red cell and platelet recovery were significantly quicker after BMSC mobilized with GM-CSF or PBSC mobilized with G-CSF. This speedier hematologic recovery resulted in earlier hospital discharge as well. However, in multivariate analysis, neither the stem cell source nor randomly assigned G-CSF vs. GM-CSF was independently associated with earlier multilineage hematologic recovery or shorter hospital stay. Relapse-free survival was not independently affected by either the assigned stem cell source or the randomly assigned priming cytokine, though malignant relapse was more frequent in those assigned to PBSC (RR of relapse 3.15, p = 0.03). These data document that BMSC, when collected following cytokine priming, can yield a similarly rapid hematologic recovery and short hospital stay compared with cytokine-primed PBSC. Using primed BMSC, no difference in malignant relapse or relapse-free survival was observed. These findings suggest that despite widespread use of PBSC for transplantation, BMSC, when collected following hematopoietically stimulating cytokines, may remain a satisfactory source of stem cells for autologous transplantation. G-CSF and GM-CSF are both effective in priming autologous PBSC or BMSC for collection.
Collapse
|
172
|
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997; 25:3389-402. [PMID: 9254694 PMCID: PMC146917 DOI: 10.1093/nar/25.17.3389] [Citation(s) in RCA: 50897] [Impact Index Per Article: 1885.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.
Collapse
|
173
|
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997. [PMID: 9254694 DOI: 10.1093/naar/25.17.3389] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023] Open
Abstract
The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.
Collapse
|
174
|
McPhelim J, Bell C, Stewart H, Miller W, Summerhill L, Milroy R. 822 Stobhill lung cancer support service. Lung Cancer 1997. [DOI: 10.1016/s0169-5002(97)80202-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
175
|
Chen GQ, Shi XG, Tang W, Xiong SM, Zhu J, Cai X, Han ZG, Ni JH, Shi GY, Jia PM, Liu MM, He KL, Niu C, Ma J, Zhang P, Zhang TD, Paul P, Naoe T, Kitamura K, Miller W, Waxman S, Wang ZY, de The H, Chen SJ, Chen Z. Use of arsenic trioxide (As2O3) in the treatment of acute promyelocytic leukemia (APL): I. As2O3 exerts dose-dependent dual effects on APL cells. Blood 1997; 89:3345-53. [PMID: 9129041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Recent clinical studies in China showed that As2O3 is an effective and relatively safe drug in the treatment of acute promyelocytic leukemia (APL). We found previously that As2O3 can trigger apoptosis of APL cell line NB4 cells, which is associated with downregulation of bcl-2 gene expression and modulation of PML-RAR alpha chimeric protein. To further understand the mechanisms of this alternative therapy for APL, we investigated in this report the effects of a wide range of concentrations of As2O2 on cultured primary APL cells, all-trans retinoic acid (ATRA)-susceptible (NB4 cells) and ATRA-resistant (MR2 subclone) APL cell lines. The results indicated that As2O3 had dose-dependent dual effects on APL cells: inducing preferentially apoptosis at relatively high concentrations (0.5 to 2 micromol/L) and inducing partial differentiation at low concentrations (0.1 to 0.5 micromol/L). The rapid modulation and degradation of PML-RAR alpha proteins, which was induced by As2O3 at 0.1 to 2 micromol/L, could contribute to these two effects. Bone marrow and peripheral blood examination showed that myelocyte-like cells, probably as a result of partial in vivo differentiation, and degenerative cells increased after 2 to 3 weeks of continuous in vivo As2O3 treatment when leukemic promyelocytes decreased. In conclusion, combination of induction of apoptosis and partial differention could be the main cellular mechanisms of As2O3 in the treatment of APL, and PML-RAR alpha could play an important role in determining the specific effects of As2O3 on APL cells.
Collapse
|