1
|
Ku M, Authié P, Bourgine M, Anna F, Noirat A, Moncoq F, Vesin B, Nevo F, Lopez J, Souque P, Blanc C, Fert I, Chardenoux S, Lafosse L, Cussigh D, Hardy D, Nemirov K, Guinet F, Langa Vives F, Majlessi L, Charneau P. Brain cross-protection against SARS-CoV-2 variants by a lentiviral vaccine in new transgenic mice. EMBO Mol Med 2021; 13:e14459. [PMID: 34647691 PMCID: PMC8646827 DOI: 10.15252/emmm.202114459] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 10/07/2021] [Accepted: 10/08/2021] [Indexed: 12/24/2022] Open
Abstract
COVID-19 vaccines already in use or in clinical development may have reduced efficacy against emerging SARS-CoV-2 variants. In addition, although the neurotropism of SARS-CoV-2 is well established, the vaccine strategies currently developed have not taken into account protection of the central nervous system. Here, we generated a transgenic mouse strain expressing the human angiotensin-converting enzyme 2, and displaying unprecedented brain permissiveness to SARS-CoV-2 replication, in addition to high permissiveness levels in the lung. Using this stringent transgenic model, we demonstrated that a non-integrative lentiviral vector, encoding for the spike glycoprotein of the ancestral SARS-CoV-2, used in intramuscular prime and intranasal boost elicits sterilizing protection of lung and brain against both the ancestral virus, and the Gamma (P.1) variant of concern, which carries multiple vaccine escape mutations. Beyond induction of strong neutralizing antibodies, the mechanism underlying this broad protection spectrum involves a robust protective T-cell immunity, unaffected by the recent mutations accumulated in the emerging SARS-CoV-2 variants.
Collapse
Affiliation(s)
- Min‐Wen Ku
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Pierre Authié
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Maryline Bourgine
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - François Anna
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Amandine Noirat
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Fanny Moncoq
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Benjamin Vesin
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Fabien Nevo
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Jodie Lopez
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Philippe Souque
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Catherine Blanc
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Ingrid Fert
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Sébastien Chardenoux
- Plate‐Forme Centre d'Ingénierie Génétique Murine CIGMInstitut PasteurParisFrance
| | - llta Lafosse
- Plate‐Forme Centre d'Ingénierie Génétique Murine CIGMInstitut PasteurParisFrance
| | - Delphine Cussigh
- Plate‐Forme Centre d'Ingénierie Génétique Murine CIGMInstitut PasteurParisFrance
| | - David Hardy
- Experimental Neuropatholgy UnitInstitut PasteurParisFrance
| | - Kirill Nemirov
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Françoise Guinet
- Lymphocytes and Immunity UnitImmunology DepartmentInstitut PasteurParisFrance
| | - Francina Langa Vives
- Plate‐Forme Centre d'Ingénierie Génétique Murine CIGMInstitut PasteurParisFrance
| | - Laleh Majlessi
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| | - Pierre Charneau
- Virology DepartmentInstitut Pasteur‐TheraVectys Joint LabParisFrance
| |
Collapse
|
2
|
Masedunskas A, Appaduray MA, Lucas CA, Lastra Cagigas M, Heydecker M, Holliday M, Meiring JCM, Hook J, Kee A, White M, Thomas P, Zhang Y, Adelstein RS, Meckel T, Böcking T, Weigert R, Bryce NS, Gunning PW, Hardeman EC. Parallel assembly of actin and tropomyosin, but not myosin II, during de novo actin filament formation in live mice. J Cell Sci 2018; 131:jcs.212654. [PMID: 29487177 DOI: 10.1242/jcs.212654] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 02/12/2018] [Indexed: 01/04/2023] Open
Abstract
Many actin filaments in animal cells are co-polymers of actin and tropomyosin. In many cases, non-muscle myosin II associates with these co-polymers to establish a contractile network. However, the temporal relationship of these three proteins in the de novo assembly of actin filaments is not known. Intravital subcellular microscopy of secretory granule exocytosis allows the visualisation and quantification of the formation of an actin scaffold in real time, with the added advantage that it occurs in a living mammal under physiological conditions. We used this model system to investigate the de novo assembly of actin, tropomyosin Tpm3.1 (a short isoform of TPM3) and myosin IIA (the form of non-muscle myosin II with its heavy chain encoded by Myh9) on secretory granules in mouse salivary glands. Blocking actin polymerization with cytochalasin D revealed that Tpm3.1 assembly is dependent on actin assembly. We used time-lapse imaging to determine the timing of the appearance of the actin filament reporter LifeAct-RFP and of Tpm3.1-mNeonGreen on secretory granules in LifeAct-RFP transgenic, Tpm3.1-mNeonGreen and myosin IIA-GFP (GFP-tagged MYH9) knock-in mice. Our findings are consistent with the addition of tropomyosin to actin filaments shortly after the initiation of actin filament nucleation, followed by myosin IIA recruitment.
Collapse
Affiliation(s)
| | | | | | | | - Marco Heydecker
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia.,Membrane Dynamics, Department of Biology, Technische Universität Darmstadt, Schnittspahnstrasse 3, 64287 Darmstadt, Germany
| | - Mira Holliday
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| | | | - Jeff Hook
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| | - Anthony Kee
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| | - Melissa White
- South Australian Genome Editing, Facility Robinson Research Institute, University of Adelaide, Adelaide, SA 5005, Australia
| | - Paul Thomas
- South Australian Genome Editing, Facility Robinson Research Institute, University of Adelaide, Adelaide, SA 5005, Australia
| | - Yingfan Zhang
- NHLBI, National Institutes of Health, Bethesda, MD 20892, USA
| | | | - Tobias Meckel
- Membrane Dynamics, Department of Biology, Technische Universität Darmstadt, Schnittspahnstrasse 3, 64287 Darmstadt, Germany
| | - Till Böcking
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| | - Roberto Weigert
- Laboratory of Cellular and Molecular Biology, CCR, National Cancer Institute, Bethesda, MD 20892, USA
| | - Nicole S Bryce
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| | - Peter W Gunning
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| | - Edna C Hardeman
- School of Medical Sciences, UNSW Sydney, NSW 2052, Australia
| |
Collapse
|
3
|
Kamath U, De Jong K, Shehu A. Effective automated feature construction and selection for classification of biological sequences. PLoS One 2014; 9:e99982. [PMID: 25033270 PMCID: PMC4102475 DOI: 10.1371/journal.pone.0099982] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2013] [Accepted: 05/21/2014] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Many open problems in bioinformatics involve elucidating underlying functional signals in biological sequences. DNA sequences, in particular, are characterized by rich architectures in which functional signals are increasingly found to combine local and distal interactions at the nucleotide level. Problems of interest include detection of regulatory regions, splice sites, exons, hypersensitive sites, and more. These problems naturally lend themselves to formulation as classification problems in machine learning. When classification is based on features extracted from the sequences under investigation, success is critically dependent on the chosen set of features. METHODOLOGY We present an algorithmic framework (EFFECT) for automated detection of functional signals in biological sequences. We focus here on classification problems involving DNA sequences which state-of-the-art work in machine learning shows to be challenging and involve complex combinations of local and distal features. EFFECT uses a two-stage process to first construct a set of candidate sequence-based features and then select a most effective subset for the classification task at hand. Both stages make heavy use of evolutionary algorithms to efficiently guide the search towards informative features capable of discriminating between sequences that contain a particular functional signal and those that do not. RESULTS To demonstrate its generality, EFFECT is applied to three separate problems of importance in DNA research: the recognition of hypersensitive sites, splice sites, and ALU sites. Comparisons with state-of-the-art algorithms show that the framework is both general and powerful. In addition, a detailed analysis of the constructed features shows that they contain valuable biological information about DNA architecture, allowing biologists and other researchers to directly inspect the features and potentially use the insights obtained to assist wet-laboratory studies on retainment or modification of a specific signal. Code, documentation, and all data for the applications presented here are provided for the community at http://www.cs.gmu.edu/~ashehu/?q=OurTools.
Collapse
Affiliation(s)
- Uday Kamath
- Computer Science, George Mason University, Fairfax, Virginia, United States of America
| | - Kenneth De Jong
- Computer Science, George Mason University, Fairfax, Virginia, United States of America
- Krasnow Institute, George Mason University, Fairfax, Virginia, United States of America
| | - Amarda Shehu
- Computer Science, George Mason University, Fairfax, Virginia, United States of America
- Bioengineering, George Mason University, Fairfax, Virginia, United States of America
- School of Systems Biology, George Mason University, Fairfax, Virginia, United States of America
| |
Collapse
|
4
|
Two new methods for DNA splice site prediction based on neuro-fuzzy network and clustering. Neural Comput Appl 2013. [DOI: 10.1007/s00521-012-1257-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
5
|
Kharazmi J, Moshfegh C. Investigation of dmyc Promoter and Regulatory Regions. GENE REGULATION AND SYSTEMS BIOLOGY 2013; 7:85-102. [PMID: 23761963 PMCID: PMC3663572 DOI: 10.4137/grsb.s10751] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Products of the myc gene family integrate extracellular signals by modulating a wide range of their targets involved in cellular biogenesis and metabolism; the purpose of this integration is to regulate cell death, proliferation, and differentiation. However, understanding the regulation of myc at the transcription level remains a challenge. We performed rapid amplification of dmyc cDNA ends (5' RACE) and mapped the transcription start site at P1 promoter, 18 base pairs upstream of the start of the known EST GM01143 and within the 5' UTR. Our data show that the first TATA box, previously computationally predicted, is utilized to generate dmyc full length mRNA. The largest transcript contains all three exons, generated after the removal of the introns by constitutively regulated splicing events. Further investigation of Downstream Promoter Element (DPE) was achieved by studying lacZ reporter activity; investigation revealed that this element and its upstream cluster of binding sites are required for the dmyc intron 2 activity. These findings may provide valuable tools for further analysis of dmyc cis-elements.
Collapse
Affiliation(s)
- Jasmine Kharazmi
- Bio-Technopark Zurich, Molecular Biology Laboratory, Zurich, Switzerland. ; Institute of Molecular Life Sciences, University of Zurich-Irchel, Zurich, Switzerland
| | | |
Collapse
|
6
|
Kamath U, Compton J, Islamaj-Doğan R, De Jong KA, Shehu A. An evolutionary algorithm approach for feature generation from sequence data and its application to DNA splice site prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1387-1398. [PMID: 22508909 DOI: 10.1109/tcbb.2012.53] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Associating functional information with biological sequences remains a challenge for machine learning methods. The performance of these methods often depends on deriving predictive features from the sequences sought to be classified. Feature generation is a difficult problem, as the connection between the sequence features and the sought property is not known a priori. It is often the task of domain experts or exhaustive feature enumeration techniques to generate a few features whose predictive power is then tested in the context of classification. This paper proposes an evolutionary algorithm to effectively explore a large feature space and generate predictive features from sequence data. The effectiveness of the algorithm is demonstrated on an important component of the gene-finding problem, DNA splice site prediction. This application is chosen due to the complexity of the features needed to obtain high classification accuracy and precision. Our results test the effectiveness of the obtained features in the context of classification by Support Vector Machines and show significant improvement in accuracy and precision over state-of-the-art approaches.
Collapse
Affiliation(s)
- Uday Kamath
- Department of Computer Science, George Mason University, Ashburn, VA 20147, USA.
| | | | | | | | | |
Collapse
|
7
|
Natural rules for Arabidopsis thaliana pre-mRNA splicing site selection. Open Life Sci 2012. [DOI: 10.2478/s11535-012-0060-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
AbstractThe accurate prediction of plant pre-mRNA splicing sites has been studied extensively. The rules for plant pre-mRNA splicing still remain unknown. This study, based on confirmed sequence data, systematically analyzed all expressed genes on Arabidopsis thaliana chromosome IV to quantitatively explore the natural splicing rules. The results indicated that defining Arabidopsis thaliana pre-mRNA splicing sites required a combination of multiple factors including (1) relative conserved consensus sequence at splicing site; (2) individual nucleotide distribution pattern in 50 nucleotides up- and down-stream regions of splicing site; (3) quantitative analysis of individual nucleotide distribution by using the formulations concluded from this study. The combination of all these factors together can bring the accuracy of Arabidopsis thaliana splicing site recognition over 99%. The results provide additional information to the future of plant pre-mRNA splicing research.
Collapse
|
8
|
Kamath U, Shehu A, De Jong KA. A two-stage evolutionary approach for effective classification of hypersensitive DNA sequences. J Bioinform Comput Biol 2011; 9:399-413. [PMID: 21714132 DOI: 10.1142/s0219720011005586] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2011] [Revised: 04/06/2011] [Accepted: 04/14/2011] [Indexed: 11/18/2022]
Abstract
Hypersensitive (HS) sites in genomic sequences are reliable markers of DNA regulatory regions that control gene expression. Annotation of regulatory regions is important in understanding phenotypical differences among cells and diseases linked to pathologies in protein expression. Several computational techniques are devoted to mapping out regulatory regions in DNA by initially identifying HS sequences. Statistical learning techniques like Support Vector Machines (SVM), for instance, are employed to classify DNA sequences as HS or non-HS. This paper proposes a method to automate the basic steps in designing an SVM that improves the accuracy of such classification. The method proceeds in two stages and makes use of evolutionary algorithms. An evolutionary algorithm first designs optimal sequence motifs to associate explicit discriminating feature vectors with input DNA sequences. A second evolutionary algorithm then designs SVM kernel functions and parameters that optimally separate the HS and non-HS classes. Results show that this two-stage method significantly improves SVM classification accuracy. The method promises to be generally useful in automating the analysis of biological sequences, and we post its source code on our website.
Collapse
Affiliation(s)
- Uday Kamath
- Department of Computer Science, George Mason University, Fairfax, Virginia 20123, USA.
| | | | | |
Collapse
|
9
|
Aberrant Single Exon Skipping is not Altered by Age in Exons of NF1, RABAC1, AATF or PCGF2 in Human Blood Cells and Fibroblasts. Genes (Basel) 2011; 2:562-77. [PMID: 24710210 PMCID: PMC3927615 DOI: 10.3390/genes2030562] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Revised: 07/21/2011] [Accepted: 07/25/2011] [Indexed: 11/17/2022] Open
Abstract
In human pre-mRNA splicing, infrequent errors occur resulting in erroneous splice products as shown in a genome-wide approach. One characteristic subgroup consists of products lacking one cassette exon. The noise in the splicing process, represented by those misspliced products, can be increased by cold shock treatment or by inhibiting the nonsense mediated decay. Here, we investigated whether the splicing noise frequency increases with age in vivo in peripheral bloods cells or in vitro in cultured and aged fibroblasts from healthy donors. Splicing noise frequency was measured for four erroneously skipped NF1 exons and one exon of RABAC1, AATF and PCGF2 by RT-qPCR. Measurements were validated in cultured fibroblasts treated with cold shock or puromycin. Intragenic but not interpersonal differences were detected in splicing noise frequencies in vivo in peripheral blood cells of 11 healthy donors (15 y–85 y) and in in vitro senescent fibroblasts from three further donors. No correlation to the age of the donors was found in the splicing noise frequencies. Our data demonstrates that splicing error frequencies are not altered by age in peripheral blood cells or in vitro aged fibroblasts in the tested exons of the four investigated genes, indicating a high importance of correct splicing in these proliferating aged cells.
Collapse
|
10
|
Vorechovsky I. Transposable elements in disease-associated cryptic exons. Hum Genet 2009; 127:135-54. [PMID: 19823873 DOI: 10.1007/s00439-009-0752-4] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 09/27/2009] [Indexed: 11/28/2022]
Abstract
Transposable elements (TEs) make up a half of the human genome, but the extent of their contribution to cryptic exon activation that results in genetic disease is unknown. Here, a comprehensive survey of 78 mutation-induced cryptic exons previously identified in 51 disease genes revealed the presence of TEs in 40 cases (51%). Most TE-containing exons were derived from short interspersed nuclear elements (SINEs), with Alus and mammalian interspersed repeats (MIRs) covering >18 and >16% of the exonized sequences, respectively. The majority of SINE-derived cryptic exons had splice sites at the same positions of the Alu/MIR consensus as existing SINE exons and their inclusion in the mRNA was facilitated by phylogenetically conserved changes that improved both traditional and auxiliary splicing signals, thus marking intronic TEs amenable for pathogenic exonization. The overrepresentation of MIRs among TE exons is likely to result from their high average exon inclusion levels, which reflect their strong splice sites, a lack of splicing silencers and a high density of enhancers, particularly (G)AA(G) motifs. These elements were markedly depleted in antisense Alu exons, had the most prominent position on the exon-intron gradient scale and are proposed to promote exon definition through enhanced tertiary RNA interactions involving unpaired (di)adenosines. The identification of common mechanisms by which the most dynamic parts of the genome contribute both to new exon creation and genetic disease will facilitate detection of intronic mutations and the development of computational tools that predict TE hot-spots of cryptic exon activation.
Collapse
Affiliation(s)
- Igor Vorechovsky
- Division of Human Genetics, University of Southampton School of Medicine, MP808, Tremona Road, Southampton SO16 6YD, UK.
| |
Collapse
|
11
|
SpliceIT: a hybrid method for splice signal identification based on probabilistic and biological inference. J Biomed Inform 2009; 43:208-17. [PMID: 19800027 DOI: 10.1016/j.jbi.2009.09.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2008] [Revised: 08/25/2009] [Accepted: 09/21/2009] [Indexed: 11/23/2022]
Abstract
Splice sites define the boundaries of exonic regions and dictate protein synthesis and function. The splicing mechanism involves complex interactions among positional and compositional features of different lengths. Computational modeling of the underlying constructive information is especially challenging, in order to decipher splicing-inducing elements and alternative splicing factors. SpliceIT (Splice Identification Technique) introduces a hybrid method for splice site prediction that couples probabilistic modeling with discriminative computational or experimental features inferred from published studies in two subsequent classification steps. The first step is undertaken by a Gaussian support vector machine (SVM) trained on the probabilistic profile that is extracted using two alternative position-dependent feature selection methods. In the second step, the extracted predictions are combined with known species-specific regulatory elements, in order to induce a tree-based modeling. The performance evaluation on human and Arabidopsis thaliana splice site datasets shows that SpliceIT is highly accurate compared to current state-of-the-art predictors in terms of the maximum sensitivity, specificity tradeoff without compromising space complexity and in a time-effective way. The source code and supplementary material are available at: http://www.med.auth.gr/research/spliceit/.
Collapse
|
12
|
Zhou L, Pertea M, Delcher AL, Florea L. Sim4cc: a cross-species spliced alignment program. Nucleic Acids Res 2009; 37:e80. [PMID: 19429899 PMCID: PMC2699533 DOI: 10.1093/nar/gkp319] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Advances in sequencing technologies have accelerated the sequencing of new genomes, far outpacing the generation of gene and protein resources needed to annotate them. Direct comparison and alignment of existing cDNA sequences from a related species is an effective and readily available means to determine genes in the new genomes. Current spliced alignment programs are inadequate for comparing sequences between different species, owing to their low sensitivity and splice junction accuracy. A new spliced alignment tool, sim4cc, overcomes problems in the earlier tools by incorporating three new features: universal spaced seeds, to increase sensitivity and allow comparisons between species at various evolutionary distances, and powerful splice signal models and evolutionarily-aware alignment techniques, to improve the accuracy of gene models. When tested on vertebrate comparisons at diverse evolutionary distances, sim4cc had significantly higher sensitivity compared to existing alignment programs, more than 10% higher than the closest competitor for some comparisons, while being comparable in speed to its predecessor, sim4. Sim4cc can be used in one-to-one or one-to-many comparisons of genomic and cDNA sequences, and can also be effectively incorporated into a high-throughput annotation engine, as demonstrated by the mapping of 64 000 Fagus grandifolia 454 ESTs and unigenes to the poplar genome.
Collapse
Affiliation(s)
- Leming Zhou
- Department of Computer Science, George Washington University, Washington, DC 20052, USA
| | | | | | | |
Collapse
|
13
|
Královičová J, Vořechovský I. Global control of aberrant splice-site activation by auxiliary splicing sequences: evidence for a gradient in exon and intron definition. Nucleic Acids Res 2007; 35:6399-413. [PMID: 17881373 PMCID: PMC2095810 DOI: 10.1093/nar/gkm680] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Auxiliary splicing signals play a major role in the regulation of constitutive and alternative pre-mRNA splicing, but their relative importance in selection of mutation-induced cryptic or de novo splice sites is poorly understood. Here, we show that exonic sequences between authentic and aberrant splice sites that were activated by splice-site mutations in human disease genes have lower frequencies of splicing enhancers and higher frequencies of splicing silencers than average exons. Conversely, sequences between authentic and intronic aberrant splice sites have more enhancers and less silencers than average introns. Exons that were skipped as a result of splice-site mutations were smaller, had lower SF2/ASF motif scores, a decreased availability of decoy splice sites and a higher density of silencers than exons in which splice-site mutation activated cryptic splice sites. These four variables were the strongest predictors of the two aberrant splicing events in a logistic regression model. Elimination or weakening of predicted silencers in two reporters consistently promoted use of intron-proximal splice sites if these elements were maintained at their original positions, with their modular combinations producing expected modification of splicing. Together, these results show the existence of a gradient in exon and intron definition at the level of pre-mRNA splicing and provide a basis for the development of computational tools that predict aberrant splicing outcomes.
Collapse
Affiliation(s)
| | - Igor Vořechovský
- *To whom correspondence should be addressed. +44 2380 796425+44 2380 794264
| |
Collapse
|