1
|
Investigating resistance in clinical Mycobacterium tuberculosis complex isolates with genomic and phenotypic antimicrobial susceptibility testing: a multicentre observational study. THE LANCET. MICROBE 2022; 3:e672-e682. [PMID: 35907429 PMCID: PMC9436784 DOI: 10.1016/s2666-5247(22)00116-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Received: 11/17/2021] [Revised: 03/10/2022] [Accepted: 04/14/2022] [Indexed: 01/01/2023]
Abstract
BACKGROUND Whole-genome sequencing (WGS) of Mycobacterium tuberculosis complex has become an important tool in diagnosis and management of drug-resistant tuberculosis. However, data correlating resistance genotype with quantitative phenotypic antimicrobial susceptibility testing (AST) are scarce. METHODS In a prospective multicentre observational study, 900 clinical M tuberculosis complex isolates were collected from adults with drug-resistant tuberculosis in five high-endemic tuberculosis settings around the world (Georgia, Moldova, Peru, South Africa, and Viet Nam) between Dec 5, 2014, and Dec 12, 2017. Minimum inhibitory concentrations (MICs) and resulting binary phenotypic AST results for up to nine antituberculosis drugs were determined and correlated with resistance-conferring mutations identified by WGS. FINDINGS Considering WHO-endorsed critical concentrations as reference, WGS had high accuracy for prediction of resistance to isoniazid (sensitivity 98·8% [95% CI 98·5-99·0]; specificity 96·6% [95% CI 95·2-97·9]), levofloxacin (sensitivity 94·8% [93·3-97·6]; specificity 97·1% [96·7-97·6]), kanamycin (sensitivity 96·1% [95·4-96·8]; specificity 95·0% [94·4-95·7]), amikacin (sensitivity 97·2% [96·4-98·1]; specificity 98·6% [98·3-98·9]), and capreomycin (sensitivity 93·1% [90·0-96·3]; specificity 98·3% [98·0-98·7]). For rifampicin, pyrazinamide, and ethambutol, the specificity of resistance prediction was suboptimal (64·0% [61·0-67·1], 83·8% [81·0-86·5], and 40·1% [37·4-42·9], respectively). Specificity for rifampicin increased to 83·9% when borderline mutations with MICs overlapping with the critical concentration were excluded. Consequently, we highlighted mutations in M tuberculosis complex isolates that are often falsely identified as susceptible by phenotypic AST, and we identified potential novel resistance-conferring mutations. 
INTERPRETATION The combined analysis of mutations and quantitative phenotypes shows the potential of WGS to produce a refined interpretation of resistance, which is needed for individualised therapy, and eventually could allow differential drug dosing. However, variability of MIC data for some M tuberculosis complex isolates carrying identical mutations also reveals limitations of our understanding of the genotype and phenotype relationships (eg, including epistasis and strain genetic background). FUNDING Bill & Melinda Gates Foundation, German Centre for Infection Research, German Research Foundation, Excellence Cluster Precision Medicine of Inflammation (EXC 2167), and Leibniz ScienceCampus EvoLUNG.
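As an illustration of how the sensitivity and specificity figures above are defined, the following minimal Python sketch computes both from a 2x2 table, taking the phenotypic AST result as the reference. The counts are hypothetical and are not taken from the study, and the Wilson score interval is an assumed choice, since the abstract does not state how the confidence intervals were derived.

```python
from math import sqrt


def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) of a genotypic prediction.

    tp/fn: phenotypically resistant isolates predicted resistant/susceptible.
    tn/fp: phenotypically susceptible isolates predicted susceptible/resistant.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (about 95% for z=1.96)."""
    p = successes / total
    denom = 1 + z ** 2 / total
    centre = (p + z ** 2 / (2 * total)) / denom
    half_width = z * sqrt(p * (1 - p) / total + z ** 2 / (4 * total ** 2)) / denom
    return centre - half_width, centre + half_width


# Hypothetical counts for one drug (illustration only, not study data).
tp, fn, tn, fp = 90, 10, 160, 40
sens, spec = sensitivity_specificity(tp, fn, tn, fp)
sens_ci = wilson_interval(tp, tp + fn)
spec_ci = wilson_interval(tn, tn + fp)
print(f"sensitivity {sens:.1%} (CI {sens_ci[0]:.1%} to {sens_ci[1]:.1%})")
print(f"specificity {spec:.1%} (CI {spec_ci[0]:.1%} to {spec_ci[1]:.1%})")
```

A low specificity in this framing means many phenotypically susceptible isolates carry a mutation called as resistance-conferring, which is the pattern the abstract reports for rifampicin, pyrazinamide, and ethambutol.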
|
2
|
Integration of Two In-depth Quantitative Proteomics Approaches Determines the Kallikrein-related Peptidase 7 (KLK7) Degradome in Ovarian Cancer Cell Secretome. Mol Cell Proteomics 2019; 18:818-836. [PMID: 30705123 DOI: 10.1074/mcp.ra118.001304] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 12/31/2018] [Indexed: 12/31/2022]
Abstract
Kallikrein-related peptidase 7 (KLK7) is a serine peptidase that is overexpressed in ovarian cancer. In vitro functional analyses have suggested that KLK7 plays a cancer-progressive role, although monitoring of KLK7 expression has suggested a contradictory protective role for KLK7 in ovarian cancer patients. To help delineate its mechanism of action and thereby its functional roles, information on its substrate repertoire is crucial. Therefore, in this study a quantitative proteomics approach, PROtein TOpography and Migration Analysis Platform (PROTOMAP), coupled with SILAC was used for in-depth analysis of putative KLK7 substrates among the proteins secreted by a representative ovarian cancer cell line, SKOV-3. The Terminal Amine Isotopic Labeling of Substrates (TAILS) approach was used to determine the exact cleavage sites and to validate qPROTOMAP-identified putative substrates. By employing these two technically divergent approaches, exact cleavage sites on 16 novel putative substrates and two established substrates, matrix metalloprotease (MMP) 2 and insulin-like growth factor binding protein 3 (IGFBP3), were identified in the SKOV-3 secretome. Eight of these substrates were also identified on TAILS analysis of another ovarian cancer cell (OVMZ-6) secretome, with a further seven OVMZ-6 substrates common to the SKOV-3 qPROTOMAP profile. Identified substrates were significantly associated with the common processes of cell adhesion, extracellular matrix remodeling and cell migration according to the gene ontology (GO) biological process analysis. Biochemical validation supports a role for KLK7 in directly activating pro-MMP10, hydrolysis of IGFBP6 and cleavage of thrombospondin 1 with generation of a potentially bioactive N-terminal fragment. Overall, this study constitutes the most comprehensive analysis of the putative KLK7 degradome in any cancer to date, thereby opening new avenues for KLK7 research.
|
3
|
Mass-spectrometry data for Rhizoctonia solani proteins produced during infection of wheat and vegetative growth. Data Brief 2016; 8:267-71. [PMID: 27331100 PMCID: PMC4906030 DOI: 10.1016/j.dib.2016.05.042] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 03/23/2016] [Revised: 05/11/2016] [Accepted: 05/19/2016] [Indexed: 11/02/2022]
Abstract
Rhizoctonia solani is an important root-infecting pathogen of a range of food staples worldwide including wheat, rice, maize, soybean, potato, legumes and others. Conventional resistance breeding strategies are hindered by the absence of tractable genetic resistance in any crop host. Understanding the biology and pathogenicity mechanisms of this fungus is important for addressing these disease issues; however, little is known about how R. solani causes disease. The data described in this article are derived from applying mass spectrometry-based proteomics to identify soluble, membrane-bound and culture filtrate proteins produced under wheat infection and vegetative growth conditions. Comparisons of the data for sample types in this set will be useful to identify metabolic pathway changes as the fungus switches from a saprophytic to a pathogenic lifestyle, or pathogenicity-related proteins contributing to the ability to cause disease on wheat. The data set is deposited in the PRIDE archive under identifier PRIDE: PXD002806.
|
4
|
Proteomic Analysis of Rhizoctonia solani Identifies Infection-specific, Redox Associated Proteins and Insight into Adaptation to Different Plant Hosts. Mol Cell Proteomics 2016; 15:1188-203. [PMID: 26811357 PMCID: PMC4824849 DOI: 10.1074/mcp.m115.054502] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Received: 08/30/2015] [Indexed: 11/22/2022]
Abstract
Rhizoctonia solani is an important root-infecting pathogen of a range of food staples worldwide including wheat, rice, maize, soybean, potato and others. Conventional resistance breeding strategies are hindered by the absence of tractable genetic resistance in any crop host. Understanding the biology and pathogenicity mechanisms of this fungus is important for addressing these disease issues; however, little is known about how R. solani causes disease. This study capitalizes on recent genomic studies by applying mass spectrometry-based proteomics to identify soluble, membrane-bound and culture filtrate proteins produced under wheat infection and vegetative growth conditions. Many of the proteins found in the culture filtrate had predicted functions relating to modification of the plant cell wall, a major activity required for pathogenesis on the plant host, including a number found only under infection conditions. Other infection-related proteins included a high proportion of proteins with redox-associated functions and many novel proteins without functional classification. The majority of infection-only proteins tested were confirmed to show transcript up-regulation during infection, including a thaumatin which increased susceptibility to R. solani when expressed in Nicotiana benthamiana. In addition, analysis of expression during infection of different plant hosts highlighted how the infection strategy of this broad host range pathogen can be adapted to the particular host being encountered. Data are available via ProteomeXchange with identifier PXD002806.
|
5
|
Management and dissemination of MS proteomic data with PROTICdb: example of a quantitative comparison between methods of protein extraction. Proteomics 2013; 13:1457-66. [PMID: 23468041 DOI: 10.1002/pmic.201200564] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Received: 12/15/2012] [Revised: 02/21/2013] [Accepted: 02/23/2013] [Indexed: 01/28/2023]
Abstract
High-throughput MS-based proteomic experiments generate large volumes of complex data and necessitate bioinformatics tools to facilitate their handling. Needs include means to archive data, to disseminate them to the scientific communities, and to organize and annotate them to facilitate their interpretation. We present here an evolution of PROTICdb, a database software that now handles MS data, including quantification. PROTICdb has been developed to be as independent as possible from tools used to produce the data. Biological samples and proteomics data are described using ontology terms. A Taverna workflow is embedded, allowing information related to identified proteins to be retrieved automatically by querying external databases. Stored data can be displayed graphically and a "Query Builder" allows users to make sophisticated queries without knowledge of the underlying database structure. All resources can be accessed programmatically using a Java client API or RESTful web services, allowing the integration of PROTICdb in any portal. An example application is presented, in which proteins extracted from a maize leaf sample by four different methods were compared using a label-free shotgun method. Data are available at http://moulon.inra.fr/protic/public. PROTICdb thus provides means for data storage, enrichment, and dissemination of proteomics data.
|
6
|
EasyProt - An easy-to-use graphical platform for proteomics data analysis. J Proteomics 2013; 79:146-60. [DOI: 10.1016/j.jprot.2012.12.012] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Received: 09/12/2012] [Revised: 12/08/2012] [Accepted: 12/18/2012] [Indexed: 11/17/2022]
|
7
|
Abstract
Despite continuous advances in hyperglycemia treatments, precise control through monitoring of glucose and glycated hemoglobin remains the diagnosis/prognosis tool in most diabetic patients. An alternative perspective could be the discovery and quantitation of new blood glycated proteins formed by nonenzymatic reaction with circulatory glucose. As a result, the human hemolysate is an incomparable source of glycated proteins to further monitor glycemia and interpret changes at the level of this post-translational modification. The human hemolysate is here studied based on the differential labeling of proteins with isotopically labeled glucose ([¹³C₆]glucose), named glycation isotopic labeling. Due to the chemoselectivity of glycation, only preferential targets are labeled by this protocol. The approach provides qualitative data through the detection of preferential protein glycation sites as well as quantitative information to evaluate the abundance of this modification. This strategy was applied to human hemolysate samples corresponding to different glycemic states estimated by laboratory-certified concentrations of glycated hemoglobin. The glycation level of each protein can then be employed to interpret the effect of glucose exposure as a consequence of glycemic imbalance. This information should provide new molecular insights into protein glycation mechanisms that might generate new hypotheses for clinicians to improve the understanding of underlying pathologies associated with prolonged hyperglycemia.
|
8
|
Quantitative analysis of human cerebrospinal fluid proteins using a combination of cysteine tagging and amine-reactive isobaric labeling. J Proteome Res 2010; 10:249-58. [PMID: 20973565 DOI: 10.1021/pr100535f] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Indexed: 11/30/2022]
Abstract
Highly complex and dynamic protein mixtures are rarely resolved comprehensively by direct shotgun proteomic analysis. As many proteins of biological interest are of low abundance, numerous analytical methodologies have been developed to reduce sample complexity and go deeper into proteomes. The present work describes an analytical strategy to perform cysteinyl-peptide subset enrichment and relative quantification through successive cysteine and amine-isobaric tagging. A cysteine-reactive covalent capture tag (C³T) allowed derivatization of cysteines and specific isolation on a covalent capture (CC) resin. The 6-plex amine-reactive tandem mass tags (TMT) served for relative quantification of the targeted peptides. The strategy was first evaluated on a model protein mixture with increasing concentrations to assess the specificity of the enrichment and the quantitative performance of the workflow. It was then applied to human cerebrospinal fluid (CSF) from post-mortem and ante-mortem samples. These studies confirmed the specificity of the C³T and the CC technique for cysteine-containing peptides. The model protein mixture analysis showed high precision and accuracy of the quantification, with coefficients of variation and mean absolute errors of less than 10% on average. The CSF experiments demonstrated the potential of the strategy to study complex biological samples and identify differential brain-related proteins. In addition, the quantification data were highly correlated with a classical TMT experiment (i.e., without C³T cysteine-tagging and enrichment steps). Altogether, these results support the use of this quantitative C³T strategy to enrich and relatively quantify cysteine-containing peptides in complex mixtures.
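The precision and accuracy measures mentioned above can be illustrated with a short Python sketch. It assumes the coefficient of variation is the sample standard deviation divided by the mean, and that the mean absolute error is expressed relative to the expected ratio; the abstract does not spell out either definition, and the ratios below are hypothetical values, not data from the study.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Precision: sample standard deviation as a percentage of the mean."""
    return 100 * stdev(values) / mean(values)


def mean_absolute_error_percent(measured, expected):
    """Accuracy: mean absolute deviation from the expected value, in percent."""
    return 100 * mean(abs(m - expected) / expected for m in measured)


# Hypothetical reporter-ion ratios for a protein spiked at a 2:1 ratio.
measured_ratios = [1.9, 2.1, 2.0, 2.2, 1.8]
expected_ratio = 2.0

cv = coefficient_of_variation(measured_ratios)
mae = mean_absolute_error_percent(measured_ratios, expected_ratio)
print(f"CV = {cv:.1f}%, mean absolute error = {mae:.1f}%")
```

Both values fall under 10% for these made-up ratios, which is the kind of threshold the abstract reports on average for the model protein mixture.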
|
9
|
The gel electrophoresis markup language (GelML) from the Proteomics Standards Initiative. Proteomics 2010; 10:3073-81. [PMID: 20677327 PMCID: PMC3193076 DOI: 10.1002/pmic.201000120] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Received: 02/26/2010] [Accepted: 06/09/2010] [Indexed: 11/11/2022]
Abstract
The Human Proteome Organisation's Proteomics Standards Initiative has developed the GelML (gel electrophoresis markup language) data exchange format for representing gel electrophoresis experiments performed in proteomics investigations. The format closely follows the reporting guidelines for gel electrophoresis, which are part of the Minimum Information About a Proteomics Experiment (MIAPE) set of modules. GelML supports the capture of metadata (such as experimental protocols) and data (such as gel images) resulting from gel electrophoresis so that laboratories can be compliant with the MIAPE Gel Electrophoresis guidelines, while allowing such data sets to be exchanged or downloaded from public repositories. The format is sufficiently flexible to capture data from a broad range of experimental processes, and complements other PSI formats for MS data and the results of protein and peptide identifications to capture entire gel-based proteome workflows. GelML has resulted from the open standardisation process of PSI consisting of both public consultation and anonymous review of the specifications.
|
10
|
Managing the data explosion. A report on the HUPO-PSI Workshop. August 2008, Amsterdam, The Netherlands. Proteomics 2009; 9:499-501. [PMID: 19132688 DOI: 10.1002/pmic.200800838] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Indexed: 11/06/2022]
Abstract
The plenary session of the Proteomics Standards Initiative (PSI) of the Human Proteome Organisation at the 7th annual HUPO world congress updated the delegates on the current status of the ongoing work of this group. The release of the new MS interchange format, mzML, was formally announced and delegates were also updated on the advances in the area of molecular interactions, protein separations, proteomics informatics and also on PEFF, a common sequence database format currently under review in the PSI documentation process. Community input on this initiative was requested. Finally, the impact these new data standards are having on the data submission process, which increasingly is an integral part of the publication process, was reviewed and discussed.
|
11
|
Abstract
Bioinformatics tools may assist scientists in all steps of a typical 2-DE gel analysis workflow, from the description of the sample preparation protocols, through gel image analysis and protein identification, to the publication of Internet-ready 2-DE gel databases. This short communication presents, in a single summarised view, this workflow and the current bioinformatics solutions developed by the Proteome Informatics Group at the Swiss Institute of Bioinformatics.
|
12
|
MIAPEGelDB, a web-based submission tool and public repository for MIAPE gel electrophoresis documents. J Proteomics 2008; 71:249-51. [PMID: 18590991 DOI: 10.1016/j.jprot.2008.06.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Received: 01/15/2008] [Revised: 06/06/2008] [Accepted: 06/09/2008] [Indexed: 10/21/2022]
Abstract
The HUPO Proteomics Standards Initiative (PSI) defines standards for data representation in proteomics to facilitate data exchange and comparison, and quality assessment. A set of minimum reporting requirements, called MIAPE (for Minimum Information About a Proteomics Experiment) is provided to ensure consistency of data set annotation. Like the MIAME reporting requirements for transcriptomics, it is anticipated that journal editors will soon require such annotation for published data sets, simplifying further mining of data. Therefore, tools for data entry and public repositories for long-term storage will be needed. MIAPEGelDB is a public repository and a web-based data entry tool for documents conforming to the MIAPE gel electrophoresis guidelines. It aims to guide authors through the publication of the minimal set of information for their proteomics experiments using a clear, sequential interface. After publication by their author, documents in MIAPEGelDB can be viewed in HTML or plain text formats, and further used through stable URL links from remote resources. MIAPEGelDB is accessible at: http://miapegeldb.expasy.org/.
|
13
|
EuPA achieves visibility - an activity report on the first three years. J Proteomics 2008; 71:11-8. [PMID: 18541470 DOI: 10.1016/j.jprot.2008.03.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 03/12/2008] [Revised: 03/28/2008] [Accepted: 03/31/2008] [Indexed: 10/22/2022]
Abstract
Plans for the European Proteomics Association (EuPA) were conceived and established during 2004 and 2005, and culminated in the formal inception of the organisation during the 4th HUPO World Congress held in Munich in 2005. The mission from the outset has been three-tiered: i) to strengthen the national Proteomics organizations in their efforts; ii) to co-ordinate and provide educational programs; and iii) to advance the networking of scientists through meetings, workshops and student exchange. Linked to the mission were objectives to emphasise the benefits and contributions of Proteomics to biological and industrial researchers, the general public and science policy makers in Europe. In addition, the EuPA set out to promote scientific exchange for all applications and technology development related to Proteomics, and to coordinate joint activities of national Proteomics societies at the European level. To achieve these tasks an organisational structure was conceived whereby four Activity Committees (Conferences/Communications, Education, EuPA-HUPO-Interactions and Funding) were implemented alongside a General Council consisting of all member countries. The remarkable rise and progress the EuPA has achieved in this short time frame is reported here.
|
14
|
The World-2DPAGE Constellation to promote and publish gel-based proteomics data through the ExPASy server. J Proteomics 2008; 71:245-8. [PMID: 18617148 DOI: 10.1016/j.jprot.2008.02.005] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Received: 01/25/2008] [Revised: 02/15/2008] [Accepted: 02/18/2008] [Indexed: 10/22/2022]
Abstract
Since it was launched in 1993, the ExPASy server has been and is still a reference in the proteomics world. ExPASy users access various databases, many dedicated tools, and lists of resources, among other services. A significant part of the available resources is devoted to two-dimensional electrophoresis data. Our latest contribution to the expansion of the pool of on-line proteomics data is the World-2DPAGE Constellation, accessible at http://world-2dpage.expasy.org/. It is composed of the established WORLD-2DPAGE List of 2-D PAGE database servers, the World-2DPAGE Portal that simultaneously queries world-wide proteomics databases, and the recently created World-2DPAGE Repository. The latter component is a public standards-compliant repository for gel-based proteomics data linked to protein identifications published in the literature. It has been set up using the Make2D-DB package, a software tool that helps build SWISS-2DPAGE-like databases on one's own Web site. The lack of the informatics infrastructure needed to build and run a dedicated website is no longer an obstacle to making proteomics data publicly accessible on the Internet.
|
15
|
Using bioinformatic resources in the proteomic analysis of biological fluids. Proteomics Clin Appl 2007; 1:900-15. [PMID: 21136743 DOI: 10.1002/prca.200700188] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Received: 02/20/2007] [Indexed: 12/24/2022]
Abstract
On-line databases targeted towards protein contents in biological fluids are scarce. Consequently, the investigation of proteins identified in a biological fluid most importantly depends on crosschecking information gathered from less specific resources. This review summarises the key databases and tools for collecting information on tissue specificity or expression profiles. It also emphasises the high connectivity between databases fruitfully used to corroborate and piece information together. Finally, selected issues related to appropriate bioinformatics tools in the context of clinical applications are succinctly discussed.
|
16
|
ID: 058 The C-terminal globular domain of fibrinogen gamma chain suppresses angiogenesis and tumor growth. J Thromb Haemost 2006. [DOI: 10.1111/j.1538-7836.2006.00058.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022]
|
17
|
Autumn 2005 Workshop of the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) Geneva, September, 4-6, 2005. Proteomics 2006; 6:738-41. [PMID: 16411267 DOI: 10.1002/pmic.200500868] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Indexed: 11/09/2022]
Abstract
The autumn workshop of the Proteomics Standards Initiative of the Human Proteome Organisation met to further advance the development of the existing standards in the fields of molecular interactions and mass spectrometry. In addition, new areas were addressed, in particular developing standards for the description and exchange of data from gel electrophoresis experiments. The General Proteomics Standards group is now working closely with the FuGE (Functional Genomics Experiment) efforts to define a general standard in which to encode data that will enable a systems biology approach to data analysis. Common to all these efforts is the field of protein modifications, and work has been initiated to establish an ontology in this field that can be used by both workers in the field of proteomics and the wider scientific community.
|
18
|
SPS' Digest: The Swiss Proteomics Society selection of proteomics articles. Proteomics 2005; 5:3045-7. [PMID: 16041674 DOI: 10.1002/pmic.200402071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022]
Abstract
Despite the consolidation of the specialized proteomics literature around a few established journals, such as Proteomics, Molecular and Cellular Proteomics, and the Journal of Proteome Research, much information is still spread across many publications from different fields, such as analytical sciences, MS, and bioinformatics. The purpose of SPS' Digest is to gather a selection of proteomics articles, to categorize them, and to make the list available on a periodic basis through a web page and email alerts.
|
19
|
Abstract
The SWISS-2DPAGE database was established in 1993 and is maintained collaboratively by the Swiss Institute of Bioinformatics (SIB) and the Biomedical Proteomics Research Group (BPRG) of the Geneva University Hospital. Since then, SWISS-2DPAGE has undergone constant modification and improvement. Current content includes about 4000 identified spots corresponding to 1200 different protein entries in 36 reference maps from human, mouse, Arabidopsis thaliana, Dictyostelium discoideum, Escherichia coli, Saccharomyces cerevisiae and Staphylococcus aureus origins. With a high level of annotation and integration with other relevant databases, SWISS-2DPAGE is a reference source in the proteomics world. Queries to the SWISS-2DPAGE database currently reach 1000 hits per day.
|
20
|
Correlation of proteomic and transcriptomic profiles of Staphylococcus aureus during the post-exponential phase of growth. METHODS IN MICROBIOLOGY 2005; 60:247-57. [PMID: 15590099 DOI: 10.1016/j.mimet.2004.09.017] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Received: 08/20/2004] [Revised: 09/17/2004] [Accepted: 09/30/2004] [Indexed: 11/25/2022]
Abstract
A combined proteomic and transcriptomic analysis of Staphylococcus aureus strain N315 was performed to study a sequenced strain at the system level. Total protein and membrane protein extracts were prepared and analyzed using various proteomic workflows including 2-DE, SDS-PAGE combined with microcapillary LC-MALDI-MS/MS, and multidimensional liquid chromatography. The presence of a protein was then correlated with its respective transcript level from S. aureus cells grown under the same conditions. Gene-expression data revealed that 97% of the 2596 ORFs were detected during the post-exponential phase. At the protein level, 23% of these ORFs (591 proteins) were identified. Correlation of the two datasets revealed that 42% of the identified proteins (248 proteins) were amongst the top 25% of genes with highest mRNA signal intensities, and 69% of the identified proteins (406 proteins) were amongst the top 50% with the highest mRNA signal intensities. The fact that the remaining 31% of proteins were not strongly expressed at the RNA level indicates either that some low-abundance proteins were identified or that some transcripts or proteins showed extended half-lives. The most abundant classes identified with the combined proteomic and transcriptomic approach involved energy production, translational activities and nucleotide transport, reflecting an active metabolism. The simultaneous large-scale analysis of transcriptomes and proteomes enables a global and holistic view of the S. aureus biology, allowing the parallel study of multiple active events in an organism.
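The percentages quoted in this abstract can be cross-checked against its own counts. The short Python sketch below is only an arithmetic consistency check on the figures stated above (2596 ORFs, 591 identified proteins, 248 and 406 proteins in the top mRNA intensity tiers); the variable names are illustrative and nothing here reproduces the authors' analysis.

```python
# Counts as stated in the abstract.
total_orfs = 2596
identified_proteins = 591
in_top_25_percent_mrna = 248
in_top_50_percent_mrna = 406


def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to an integer."""
    return round(100 * part / whole)


share_identified = pct(identified_proteins, total_orfs)
share_top25 = pct(in_top_25_percent_mrna, identified_proteins)
share_top50 = pct(in_top_50_percent_mrna, identified_proteins)
share_remaining = pct(identified_proteins - in_top_50_percent_mrna,
                      identified_proteins)

print(share_identified, share_top25, share_top50, share_remaining)
```

The rounded values agree with the 23%, 42%, 69% and 31% reported in the text, whether the 23% is taken relative to all ORFs or to the roughly 97% of them that were detected at the transcript level.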
MESH Headings
- Bacterial Proteins/biosynthesis
- Bacterial Proteins/genetics
- Bacterial Proteins/isolation & purification
- Chromatography, Liquid
- Computational Biology
- Electrophoresis, Gel, Two-Dimensional
- Electrophoresis, Polyacrylamide Gel
- Gene Expression Regulation, Developmental
- Nucleic Acid Hybridization
- Oligonucleotide Array Sequence Analysis
- Proteomics/methods
- RNA, Messenger/genetics
- RNA, Messenger/isolation & purification
- Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization
- Staphylococcus aureus/genetics
- Staphylococcus aureus/growth & development
- Staphylococcus aureus/metabolism
- Transcription, Genetic/genetics
- Transcription, Genetic/physiology
|
21
|
Further steps towards data standardisation: The Proteomic Standards Initiative HUPO 3rd annual congress, Beijing 25-27th October, 2004. Proteomics 2005; 5:337-9. [PMID: 15700244 DOI: 10.1002/pmic.200401158] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Indexed: 11/06/2022]
Abstract
The growing volume of proteomics data generated by increasingly high-throughput methodologies has created a need for methods by which such data can be accurately described, stored and exchanged between experimental researchers and data repositories. Work by the Proteomics Standards Initiative of the Human Proteome Organisation has laid the foundation for the development of standards by which experimental design can be described and data exchange facilitated. The progress of these efforts, and the direct benefits already accruing from them, were described at a plenary session of the 3rd Annual HUPO congress. Parallel sessions allowed the three work groups to present their progress to interested parties and to collect feedback from groups already implementing the available formats.
|
22
|
Protein Identification and Analysis Tools on the ExPASy Server. THE PROTEOMICS PROTOCOLS HANDBOOK 2005:571-607. [DOI: 10.1385/1-59259-890-0:571] [Citation(s) in RCA: 3659] [Impact Index Per Article: 192.6] [Indexed: 05/22/2023]
|
23
|
Abstract
Approaches aimed at deciphering the proteome have illustrated the need for relatively complex and highly sensitive methodologies. The major elements of proteome analysis, such as powerful protein separation and enzymatic processing, mass spectrometry and dedicated bioinformatics have been assembled in the development of the molecular scanner. This highly flexible and data-rich approach has combined the power of electrophoretic protein separation, the simultaneous digestion and transfer of proteins through an enzymatic membrane, the immediate use of the MALDI mass spectrometer to scan a collecting membrane, and the development of dedicated bioinformatics tools to perform protein identification and molecular imaging of the proteome. Clinical applications of the molecular scanner have also started to be developed for disease diagnosis in biological material.
MESH Headings
- Animals
- Biotechnology/instrumentation
- Biotechnology/methods
- Biotechnology/trends
- Electrophoresis, Gel, Two-Dimensional/instrumentation
- Electrophoresis, Gel, Two-Dimensional/methods
- Equipment Design
- Gene Expression Profiling/instrumentation
- Gene Expression Profiling/methods
- Gene Expression Profiling/trends
- Humans
- Proteins/analysis
- Proteins/chemistry
- Proteomics/instrumentation
- Proteomics/methods
- Proteomics/trends
- Sequence Analysis, Protein/instrumentation
- Sequence Analysis, Protein/methods
- Sequence Analysis, Protein/trends
- Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/instrumentation
- Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods
- Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/trends
- Systems Integration
- User-Computer Interface
|
24
|
The Make 2D-DB II package: conversion of federated two-dimensional gel electrophoresis databases into a relational format and interconnection of distributed databases. Proteomics 2003; 3:1441-4. [PMID: 12923769 DOI: 10.1002/pmic.200300483] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.2] [Indexed: 11/09/2022]
Abstract
The Make 2D-DB tool has been previously developed to help build federated two-dimensional gel electrophoresis (2-DE) databases on one's own web site. The purpose of our work is to extend the strength of the first package and to build a more efficient environment. Such an environment should be able to fulfill the different needs and requirements arising from both the growing use of 2-DE techniques and the increasing amount of distributed experimental data.
|
25
|
Abstract
The ExPASy (Expert Protein Analysis System) World Wide Web server (http://www.expasy.org) is provided as a service to the life science community by a multidisciplinary team at the Swiss Institute of Bioinformatics (SIB). It provides access to a variety of databases and analytical tools dedicated to proteins and proteomics. ExPASy databases include SWISS-PROT and TrEMBL, SWISS-2DPAGE, PROSITE, ENZYME and the SWISS-MODEL repository. Analysis tools are available for specific tasks relevant to proteomics, similarity searches, pattern and profile searches, post-translational modification prediction, topology prediction, primary, secondary and tertiary structure analysis and sequence alignment. These databases and tools are tightly interlinked: a special emphasis is placed on integration of database entries with related resources developed at the SIB and elsewhere, and the proteomics tools have been designed to read the annotations in SWISS-PROT in order to enhance their predictions. ExPASy started to operate in 1993, as the first WWW server in the field of life sciences. In addition to the main site in Switzerland, seven mirror sites in different continents currently serve the user community.
|
26
|
Hydrogen/deuterium exchange for higher specificity of protein identification by peptide mass fingerprinting. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2002; 16:616-626. [PMID: 11870900 DOI: 10.1002/rcm.616] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Indexed: 05/23/2023]
Abstract
Genome sequencing projects produce large amounts of information that could be translated into potential protein sequences. Such amounts of material continuously increase protein database sizes. At present, 22 times more protein sequences are available in the SWISS-PROT and TrEMBL databases than 8 years ago in SWISS-PROT. One of the methods of choice for protein identification makes use of specific endoproteolytic cleavage followed by matrix-assisted laser desorption/ionisation mass spectrometric (MALDI-MS) analysis of the digested product. Since 1993, when this technique was first demonstrated, the conditions required for a correct identification have changed dramatically. Whilst 4-5 peptides with an uncertainty of 2-3 Da were sufficient for a correct identification in 1993, 10-13 peptides with less than 60 ppm mass error are now required for human and E. coli proteins. This evolution is directly related to the continuous increase in protein database sizes, which causes an increase in the number of false positive matches in identification results. Use of an information complement deduced from the primary protein sequence, in the process of identification by peptide mass fingerprints, can help to increase confidence in the identification results. In this article, we propose the exchange of labile hydrogen atoms with deuterium atoms to provide an alternative information complement. The exchange reaction with optimised techniques has shown an average 95% of hydrogen/deuterium (H/D) exchange on tryptic peptides. This level of exchange was sufficient to single out one or more peptides from a list of potential candidate proteins due to the dependence of H/D exchange on the peptide primary structure. This technique also has clear advantages in the identification of small proteins where direct protein identification is impaired by the limited number of endoproteolytic peptides. 
Information related to the primary sequence obtained with this technique could thus help to identify proteins with high confidence without expensive tandem mass spectrometry instruments.
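The ppm tolerance mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' software: the function names and the example masses are hypothetical, and only the 60 ppm threshold comes from the abstract.

```python
def ppm_error(observed: float, theoretical: float) -> float:
    """Signed mass error of an observed peak, in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def count_matches(observed, theoretical, tol_ppm=60.0):
    """Count theoretical peptide masses matched by at least one observed peak."""
    return sum(
        any(abs(ppm_error(o, t)) <= tol_ppm for o in observed)
        for t in theoretical
    )


# Hypothetical masses (Da), chosen only to illustrate the tolerance check.
theoretical = [1000.0, 1500.0, 2000.0]
observed = [1000.05, 1500.2, 2500.0]

# Only the first pair agrees within 60 ppm (about 50 ppm); the second is off by about 133 ppm.
matched = count_matches(observed, theoretical)
```

For comparison, a 2 Da window at 1000 Da corresponds to 2000 ppm, which shows how much tighter the later requirement is than the 1993 one.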
|
27
|
Abstract
Escherichia coli is a model organism for biochemical and biological studies as it is one of the best characterised prokaryotes. Two-dimensional polyacrylamide gel electrophoresis, computer image analysis and different protein identification techniques gave rise, in 1995, to the Escherichia coli SWISS-2D PAGE database (http://www.expasy.ch/ch2d/). In the E. coli 3.5-10 SWISS-2D PAGE map, 40% of the E. coli proteome was displayed. The present study demonstrated that narrow range pH gradients can potentially display proteins present at only a few copies per E. coli cell. Moreover, the six new E. coli SWISS-2D PAGE maps (pH 4-5, 4.5-5.5, 5-6, 5.5-6.7, 6-9 and 6-11) presented here displayed altogether more than 70% of the entire E. coli proteome.
|
28
|
Abstract
A number of two-dimensional electrophoresis (2-DE) reference maps from mouse samples have been established and can be accessed through the internet. An up-to-date list can be found in WORLD-2D PAGE (http://www.expasy.ch/ch2d/2d-index.html), an index of 2-DE databases and services. None of them were established from mouse white and brown adipose tissues, pancreatic islets, liver nuclei and skeletal muscle. This publication describes the mouse SWISS-2D PAGE database. Proteins present in samples of mouse (C57BL/6J) liver, liver nuclei, muscle, white and brown adipose tissue and pancreatic islets are assembled and described in an accessible uniform format. SWISS-2D PAGE can be accessed through the World Wide Web (WWW) network on the ExPASy molecular biology server (http://www.expasy.ch/ch2d/).
|
29
|
Abstract
This short communication describes the establishment of a two-dimensional electrophoresis (2-DE) reference map of nuclear proteins isolated from human liver. The human liver nuclei 2-DE reference map contains 1497 spots. In an initial identification study using peptide mass fingerprinting as a means of protein identification we were able to identify 26 spots corresponding to 15 different proteins. The human liver nuclei 2-DE reference map is now included in the SWISS-2DPAGE database, which can be accessed through the ExPASy server (http://www.expasy.ch/ch2d/).
|
30
|
Abstract
SWISS-2DPAGE (http://www.expasy.ch/ch2d/) is an annotated two-dimensional polyacrylamide gel electrophoresis (2-DE) database established in 1993. The current release contains 24 reference maps from human and mouse biological samples, as well as from Saccharomyces cerevisiae, Escherichia coli and Dictyostelium discoideum origin. These reference maps now have 2824 identified spots, corresponding to 614 separate protein entries in the database, in addition to virtual entries for each SWISS-PROT sequence or any user-entered amino acid sequence. Last year's improvements in the SWISS-2DPAGE database are as follows: three new maps have been created and several others have been updated; cross-references to newly built federated 2-DE databases have been added; new functions to access the data have been provided through the ExPASy proteomics server.
|
31
|
Improving protein identification from peptide mass fingerprinting through a parameterized multi-level scoring algorithm and an optimized peak detection. Electrophoresis 1999; 20:3535-50. [PMID: 10612280 DOI: 10.1002/(sici)1522-2683(19991201)20:18<3535::aid-elps3535>3.0.co;2-j] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.3] [Indexed: 11/10/2022]
Abstract
We have developed a new algorithm to identify proteins by means of peptide mass fingerprinting. Starting from the matrix-assisted laser desorption/ionization-time-of-flight (MALDI-TOF) spectra and environmental data such as species, isoelectric point and molecular weight, as well as chemical modifications or number of missed cleavages of a protein, the program performs a fully automated identification of the protein. The first step is a peak detection algorithm, which allows precise and fast determination of peptide masses, even if the peaks are of low intensity or they overlap. In the second step the masses and environmental data are used by the identification algorithm to search in protein sequence databases (SWISS-PROT and/or TrEMBL) for protein entries that match the input data. Consequently, a list of candidate proteins is selected from the database, and a score calculation provides a ranking according to the quality of the match. To define the most discriminating scoring calculation we analyzed the respective role of each parameter in two directions. The first one is based on filtering and exploratory effects, while the second direction focuses on the levels where the parameters intervene in the identification process. Thus, according to our analysis, all input parameters contribute to the score, albeit with different weights. Since it is difficult to estimate the weights in advance, they have been computed with a genetic algorithm, using a training set of 91 protein spectra with their environmental data. We tested the resulting scoring calculation on a test set of ten proteins and compared the identification results with those of other peptide mass fingerprinting programs.
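The idea of ranking candidate proteins by a weighted score over several input parameters can be sketched as follows. This is an illustrative toy, not the published algorithm: the `Candidate` class, the `closeness` helper, the linear score, the hand-picked weights and all numeric values are assumptions made for the example, whereas the paper fits its weights on a training set.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    peptide_masses: list  # theoretical peptide masses (Da)
    pi: float             # isoelectric point
    mw: float             # molecular weight (Da)


def closeness(expected, actual, scale):
    """1.0 for a perfect match, falling linearly to 0.0 at `scale` distance."""
    return max(0.0, 1.0 - abs(expected - actual) / scale)


def score(cand, peaks, pi, mw, weights, tol_ppm=60.0):
    """Weighted sum of peptide coverage, pI agreement and MW agreement."""
    hits = sum(
        any(abs(p - m) / m * 1e6 <= tol_ppm for p in peaks)
        for m in cand.peptide_masses
    )
    coverage = hits / len(cand.peptide_masses)
    return (weights["mass"] * coverage
            + weights["pi"] * closeness(pi, cand.pi, scale=2.0)
            + weights["mw"] * closeness(mw, cand.mw, scale=0.5 * mw))


def rank(cands, peaks, pi, mw, weights):
    """Return candidates sorted from best to worst score."""
    return sorted(cands, key=lambda c: score(c, peaks, pi, mw, weights),
                  reverse=True)


# Hypothetical inputs: hand-picked weights and made-up database entries.
weights = {"mass": 0.7, "pi": 0.15, "mw": 0.15}
candidates = [
    Candidate("B", [1000.0, 1800.0, 2200.0], pi=8.0, mw=60000.0),
    Candidate("A", [1000.0, 1200.0, 1500.0], pi=5.0, mw=30000.0),
]
peaks = [1000.02, 1200.03, 1500.01]
ranked = rank(candidates, peaks, pi=5.2, mw=31000.0, weights=weights)
```

Here candidate A matches all three peaks and has a similar pI and molecular weight, so it ranks ahead of B, which matches a single peak.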
|
32
|
Abstract
This paper describes the set of two-dimensional electrophoresis (2-DE) resources currently available from the ExPASy proteomics Web server. These resources include the SWISS-2DPAGE database, 2-DE software packages, 2-DE technical and educational services, as well as indexes and search engines for 2-DE related sites over the Internet.
|
33
|
Abstract
Identification and characterization of all proteins expressed by a genome in biological samples represent major challenges in proteomics. Today's commonly used high-throughput approaches combine two-dimensional electrophoresis (2-DE) with peptide mass fingerprinting (PMF) analysis. Although automation is often possible, a number of limitations still adversely affect the rate of protein identification and annotation in 2-DE databases: the sequential excision process of pieces of gel containing protein; the enzymatic digestion step; the interpretation of mass spectra (reliability of identifications); and the manual updating of 2-DE databases. We present a highly automated method that generates a fully annotated 2-DE map. Using a parallel process, all proteins of a 2-DE gel are first simultaneously digested proteolytically and electro-transferred onto a poly(vinylidene difluoride) membrane. The membrane is then directly scanned by MALDI-TOF MS. After automated protein identification from the obtained peptide mass fingerprints using PeptIdent software (http://www.expasy.ch/tools/peptident.html), a fully annotated 2-D map is created on-line. It is a multidimensional representation of a proteome that contains interpreted PMF data in addition to protein identification results. This "MS-imaging" method represents a major step toward the development of a clinical molecular scanner.
|
34
|
|
35
|
Abstract
SWISS-2DPAGE (http://www.expasy.ch/ch2d/) is an annotated two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) database established in 1993. The current release contains 21 reference maps from human and mouse biological samples, as well as from Saccharomyces cerevisiae, Escherichia coli and Dictyostelium discoideum origin. These reference maps now have 2480 identified spots, corresponding to 528 separate protein entries in the database, in addition to virtual entries for each SWISS-PROT sequence. During the last year, SWISS-2DPAGE has undergone major changes. Six new maps have been added, and new functions to access the data have been provided through the ExPASy server. Finally, an important change concerns the database funding source.
|
36
|
Abstract
The combination of two-dimensional polyacrylamide gel electrophoresis (2-D PAGE), computer image analysis and several protein identification techniques allowed the Escherichia coli SWISS-2DPAGE database to be established. This is part of the ExPASy molecular biology server accessible through the WWW at the URL address http://www.expasy.ch/ch2d/ch2d-top.html. Here we report recent progress in the development of the E. coli SWISS-2DPAGE database. Proteins were separated with immobilized pH gradients in the first dimension and sodium dodecyl sulfate-polyacrylamide gel electrophoresis in the second dimension. To increase the resolution of the separation and thus the number of identified proteins, a variety of wide and narrow range immobilized pH gradients were used in the first dimension. Micropreparative gels were electroblotted onto polyvinylidene difluoride membranes and spots were visualized by amido black staining. Protein identification techniques such as amino acid composition analysis, gel comparison and microsequencing were used, as well as a recently described Edman "sequence tag" approach. Some of the above identification techniques were coupled with database searching tools. Currently 231 polypeptides are identified on the E. coli SWISS-2DPAGE map: 64 have been identified by N-terminal microsequencing, 39 by amino acid composition, and 82 by sequence tag. Of 153 proteins putatively identified by gel comparison, 65 have been confirmed. Many proteins have been identified using more than one technique. Faster progress in the E. coli proteome project will now be possible with advances in biochemical methodology and with the completion of the entire E. coli genome.
|
37
|
Maintenance of transposable element copy number in natural populations of Drosophila melanogaster and D. simulans. Genetica 1998; 100:161-6. [PMID: 9440269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/05/2023]
Abstract
To investigate the main forces controlling the containment of transposable elements (TE) in natural populations, we analyzed the copia, mdg1, and 412 elements in various populations of Drosophila melanogaster and D. simulans. A lower proportion of insertion sites on the X chromosome in comparison with the autosomes suggests that selection against the detrimental effects of TE insertions is the major force containing TE copies in populations of Drosophila. This selection effect hypothesis is strengthened by the absence of the negative correlation between recombination rate and TE copy number along the chromosomes, which was expected under the alternative ectopic exchange model (selection against the deleterious rearrangements promoted by recombination between TE insertions). A cline in 412 copy number in relation to latitude was observed among the natural populations of D. simulans, with very high numbers existing in some local populations (around 60 copies in a sample from Canberra, Australia). An apparent absence of selection effects in this Canberra sample and a value of transposition rate equal to 1-2 × 10⁻³ whatever the population and its copy number agree with the idea of recent but temporarily drastic TE movements in local populations. The high values of transposition rate in D. simulans clearly disfavor the hypothesis that the low amount of transposable elements in this species could result from a low transposition rate.
|
38
|
Abstract
The SWISS-2DPAGE database (http://www.expasy.ch/ch2d/ch2d-top.html) consists of two-dimensional polyacrylamide gel electrophoresis images, as well as textual descriptions of the proteins that have been identified on them. The current release contains 15 reference maps from human biological samples, as well as from Saccharomyces cerevisiae, Escherichia coli and Dictyostelium discoideum origin. These reference maps have 2088 identified spots, corresponding to 410 separate protein entries in the database, in addition to virtual entries for each SWISS-PROT sequence.
|
39
|
Make2ddb: a simple package to set up a two-dimensional electrophoresis database for the World Wide Web. Electrophoresis 1997; 18:2755-8. [PMID: 9504807 DOI: 10.1002/elps.1150181509] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Indexed: 01/26/2023]
Abstract
Two-dimensional electrophoresis (2-DE) has become a highly reproducible protein separation technique that currently serves as the main basis for proteome research and in particular for protein identification. The Internet also provides extensive facilities for exchanging data, and scientists show increased interest in building remote 2-DE databases, since many members of the community concerned are now able to access the data. By preparing the data and programs that are required to create a federated 2-DE database, the Make2ddb package, described here, helps to build such a database on the user's own World Wide Web site.
|
40
|
|
41
|
Chromosomal distribution of the 412 retrotransposon in natural populations of Drosophila simulans. Heredity (Edinb) 1997; 79 ( Pt 2):128-34. [PMID: 9279009 DOI: 10.1038/hdy.1997.135] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Indexed: 02/05/2023] Open
Abstract
The insertion site localization of the 412 retrotransposable element was analysed by in situ hybridization to the polytene chromosomes of 57 individual genomes from 25 natural populations of Drosophila simulans. The 412 insertion sites along the chromosomes show a tendency to aggregate in the distal and proximal ends of the 2R arm, and in several local regions along the 3R arm. The distribution of the 412 insertion sites, weighted by DNA content, along the chromosome arms reveals an overall tendency for the site number to increase from the middle of the arm to the base and tip, with a decrease at the tips, especially pronounced for the X chromosome. Such a distribution differs slightly from that of D. melanogaster, which globally shows an increase of the 412 site number from base to tip of the chromosome arms, indicating differing behaviour of the 412 element in the two species. These results are discussed in connection with the recombination rate along the chromosome arms.
|
42
|
DROSOPOSON: a knowledge base on chromosomal localization of transposable element insertions in Drosophila. COMPUTER APPLICATIONS IN THE BIOSCIENCES : CABIOS 1997; 13:61-8. [PMID: 9088710 DOI: 10.1093/bioinformatics/13.1.61] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Indexed: 02/04/2023]
Abstract
MOTIVATION Which forces maintain transposable elements (TEs) in genomes and populations is one of the main questions in understanding the dynamics of these elements, but the exact nature of these forces is still a matter of speculation. To test theoretical models of TE population dynamics, we need extensive data on the genomic distributions of various elements. These data are now accumulating for the species Drosophila melanogaster, but they are scattered in the literature. RESULTS The knowledge base DROSOPOSON thus brings together: (1) data available on Drosophila chromosomal localizations of TE insertions and on features of the polytene chromosomes (DNA content, recombination rate, break-points, etc); (2) statistical methods aimed at analysing the distribution of the TE insertions along the chromosomes. In this paper, we present the structure of the base, the data and the statistical methods. Theoretical models of containment of TE copy number in Drosophila can thus be tested.
|
43
|
|
44
|
Chromosomal distribution of transposable elements in Drosophila melanogaster: test of the ectopic recombination model for maintenance of insertion site number. Genetics 1996; 144:197-204. [PMID: 8878685 PMCID: PMC1207493 DOI: 10.1093/genetics/144.1.197] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.1] [Indexed: 02/02/2023] Open
Abstract
Data on insertion site localization and site occupancy frequency of P, hobo, I, copia, mdg1, mdg3, 412, 297, and roo transposable elements (TEs) on the polytene chromosomes of Drosophila melanogaster were extracted from the literature. We show that TE insertion site number per chromosomal division was significantly correlated with the amount of DNA. The insertion site number weighted by DNA content was not correlated with recombination rate for any TE except hobo, for which a positive correlation was detected. No global tendency emerged in the relationship between TE site occupancy frequency, weighted by DNA content, and recombination rate; a strong negative correlation was, however, found for the 3L arm. A possible dominant deleterious effect of chromosomal rearrangements due to recombination between TE insertions is thus not the main factor explaining the dynamics of TEs, since this hypothesis implies a negative relationship between recombination rate and both TE insertion site number and site occupancy frequency. The alternative hypothesis of selection against deleterious effects of insertional mutations is discussed.
|
45
|
Chromosomal distribution and population dynamics of the 412 retrotransposon in a natural population of Drosophila melanogaster. Chromosoma 1995; 103:693-9. [PMID: 7664616 DOI: 10.1007/bf00344230] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Indexed: 01/26/2023]
Abstract
The localization of the insertion sites of the 412 retrotransposable element was analysed by in situ hybridization to the polytene chromosomes of the genomes of males from a natural population of Drosophila melanogaster. Non-parametric statistical tests do not reveal any particular distribution of the insertion sites over the chromosomes, suggesting an apparently random distribution of the 412 element. Aggregation and dispersion tests were highly significant with data of copy number (when all genomes are pooled, many copies may be at a given site), suggesting the existence of sites with high insertion frequency. Comparison with other data from the literature confirms the tendency for a low proportion of insertions on the X chromosome in comparison with the autosomes, a result in agreement with selection acting against the detrimental effect of the 412 element insertions.
|