1
|
Li W, Dong X, Zhang X, Cao J, Liu M, Zhou X, Long H, Cao H, Lin H, Zhang L. Genome assembly and resequencing shed light on evolution, population selection, and sex identification in Vernicia montana. HORTICULTURE RESEARCH 2024; 11:uhae141. [PMID: 38988615 PMCID: PMC11233859 DOI: 10.1093/hr/uhae141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 05/08/2024] [Indexed: 07/12/2024]
Abstract
Vernicia montana is a dioecious plant widely cultivated for high-quality tung oil production and ornamental purposes in the Euphorbiaceae family. The lack of genomic information has severely hindered molecular breeding for genetic improvement and early sex identification in V. montana. Here, we present a chromosome-level reference genome of a male V. montana with a total size of 1.29 Gb and a contig N50 of 3.69 Mb. Genome analysis revealed that different repeat lineages drove the expansion of genome size. The model of chromosome evolution in the Euphorbiaceae family suggests that polyploidization-induced genomic structural variation reshaped the chromosome structure, giving rise to the diverse modern chromosomes. Based on whole-genome resequencing data and analyses of selective sweep and genetic diversity, several genes associated with stress resistance and flavonoid synthesis such as CYP450 genes and members of the LRR-RLK family, were identified and presumed to have been selected during the evolutionary process. Genome-wide association studies were conducted and a putative sex-linked insertion and deletion (InDel) (Chr 2: 102 799 917-102 799 933 bp) was identified and developed as a polymorphic molecular marker capable of effectively detecting the gender of V. montana. This InDel is located in the second intron of VmBASS4, suggesting a possible role of VmBASS4 in sex determination in V. montana. This study sheds light on the genome evolution and sex identification of V. montana, which will facilitate research on the development of agronomically important traits and genomics-assisted breeding.
Collapse
Affiliation(s)
- Wenying Li
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
- College of Biology and Agricultural Resources, Huanggang Normal University, No.146 Xingang 2nd Road, Huangzhou District, Huanggang, Hubei 438000, China
| | - Xiang Dong
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| | - Xingtan Zhang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, No.7 Pengfei Road, Dapeng New District, Shenzhen 518120, China
| | - Jie Cao
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| | - Meilan Liu
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| | - Xu Zhou
- College of Landscape Architecture, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| | - Hongxu Long
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| | - Heping Cao
- U.S. Department of Agriculture, Agricultural Research Service, Southern Regional Research Center, 1100 Allen Toussaint Blvd, New Orleans, LA 70124-4305, USA
| | - Hai Lin
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| | - Lin Zhang
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees of the Ministry of Education and Key Laboratory of Non-Wood Forest Products of the Forestry Ministry, Central South University of Forestry and Technology, Shaoshan South Road, No.498, Tianxin District, Changsha, Hunan 410004, China
| |
Collapse
|
2
|
Zhao J, Xu Y, Li H, An W, Yin Y, Wang B, Wang L, Wang B, Duan L, Ren X, Liang X, Wang Y, Wan R, Huang T, Zhang B, Li Y, Luo J, Cao Y. Metabolite-based genome-wide association studies enable the dissection of the genetic bases of flavonoids, betaine and spermidine in wolfberry (Lycium). PLANT BIOTECHNOLOGY JOURNAL 2024; 22:1435-1452. [PMID: 38194521 PMCID: PMC11123438 DOI: 10.1111/pbi.14278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 10/28/2023] [Accepted: 12/14/2023] [Indexed: 01/11/2024]
Abstract
Wolfberry is a plant with medicinal and food values. However, its bioactive ingredients and the corresponding genetic bases have not been determined. Here, we de novo generated a chromosome-level genome assembly for wolfberry, yielding a genome sequence of ~1.77 Gb with contig N50 of 50.55 Mb and 39 224 predicted gene models. A variation map, using 307 re-sequenced accessions, was called based on this genome assembly. Furthermore, the fruit metabolome of these accessions was profiled using 563 annotated metabolites, which separated Lycium barbarum L. and non-L. barbarum L. The flavonoids, coumarins, alkaloids and nicotinic acid contents were higher in the former than in the latter. A metabolite-based genome-wide association study mapped 156 164 significant single nucleotide polymorphisms corresponding to 340 metabolites. This included 19 219 unique lead single nucleotide polymorphisms in 1517 significant association loci, of which three metabolites, flavonoids, betaine and spermidine, were highlighted. Two candidate genes, LbUGT (evm.TU.chr07.2692) and LbCHS (evm.TU.chr07.2738), with non-synonymous mutations, were associated with the flavonoids content. LbCHS is a structural gene that interacts with a nearby MYB transcription factor (evm.TU.chr07.2726) both in L. barbarum and L. ruthenicum. Thus, these three genes might be involved in the biosynthesis/metabolism of flavonoids. LbSSADH (evm.TU.chr09.627) was identified as possibly participating in betaine biosynthesis/metabolism. Four lycibarbarspermidines (E-G and O) were identified, and only the lycibarbarspermidines O content was higher in L. barbarum varieties than in non-L. barbarum varieties. The evm.TU.chr07.2680 gene associated with lycibarbarspermidines O was annotated as an acetyl-CoA-benzylalcohol acetyltransferase, suggesting that it is a candidate gene for spermidine biosynthesis. These results provide novel insights into the specific metabolite profile of non-L. barbarum L. and the genetic bases of flavonoids, betaine and spermidine biosynthesis/metabolism.
Collapse
Affiliation(s)
- Jianhua Zhao
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Yuhui Xu
- Adsen Biotechnology Co., Ltd.UrumchiChina
| | - Haoxia Li
- Desertification Control Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Wei An
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Yue Yin
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Bin Wang
- Wuhan Matware Biotechnology Co., Ltd.WuhanChina
| | - Liping Wang
- School of breeding and multiplcation (Sanya Institute of Breeding and Multiplication)Hainan, UniversitySanyaChina
| | - Bi Wang
- School of breeding and multiplcation (Sanya Institute of Breeding and Multiplication)Hainan, UniversitySanyaChina
| | - Linyuan Duan
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Xiaoyue Ren
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Xiaojie Liang
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Yajun Wang
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Ru Wan
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Ting Huang
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Bo Zhang
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Yanlong Li
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| | - Jie Luo
- School of breeding and multiplcation (Sanya Institute of Breeding and Multiplication)Hainan, UniversitySanyaChina
| | - Youlong Cao
- National Wolfberry Engineering Research Center/Wolfberry Science Research Institute, Ningxia Academy of Agriculture and Forestry SciencesYinchuanChina
| |
Collapse
|
3
|
Bartas M, Volna A, Cerven J, Pucker B. Identification of annotation artifacts concerning the chalcone synthase (CHS). BMC Res Notes 2023; 16:109. [PMID: 37340477 DOI: 10.1186/s13104-023-06386-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/13/2023] [Indexed: 06/22/2023] Open
Abstract
OBJECTIVE Chalcone synthase (CHS) catalyzes the initial step of the flavonoid biosynthesis. The CHS encoding gene is well studied in numerous plant species. Rapidly growing sequence databases contain hundreds of CHS entries that are the result of automatic annotation. In this study, we evaluated apparent multiplication of CHS domains in CHS gene models of four plant species. MAIN FINDINGS CHS genes with an apparent triplication of the CHS domain encoding part were discovered through database searches. Such genes were found in Macadamia integrifolia, Musa balbisiana, Musa troglodytarum, and Nymphaea colorata. A manual inspection of the CHS gene models in these four species with massive RNA-seq data suggests that these gene models are the result of artificial fusions in the annotation process. While there are hundreds of seemingly correct CHS records in the databases, it is not clear why these annotation artifacts appeared.
Collapse
Affiliation(s)
- Martin Bartas
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Adriana Volna
- Department of Physics, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Jiri Cerven
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Boas Pucker
- Institute of Plant Biology & BRICS, TU Braunschweig, Braunschweig, Germany.
| |
Collapse
|
4
|
Genome-Wide Characterization and Analysis of bHLH Transcription Factors Related to Anthocyanin Biosynthesis in Cinnamomum camphora ('Gantong 1'). Int J Mol Sci 2023; 24:ijms24043498. [PMID: 36834907 PMCID: PMC9959432 DOI: 10.3390/ijms24043498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 01/19/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open
Abstract
Cinnamomum camphora is one of the most commonly used tree species in landscaping. Improving its ornamental traits, particularly bark and leaf colors, is one of the key breeding goals. The basic helix-loop-helix (bHLH) transcription factors (TFs) are crucial in controlling anthocyanin biosynthesis in many plants. However, their role in C. camphora remains largely unknown. In this study, we identified 150 bHLH TFs (CcbHLHs) using natural mutant C. camphora 'Gantong 1', which has unusual bark and leaf colors. Phylogenetic analysis revealed that 150 CcbHLHs were divided into 26 subfamilies which shared similar gene structures and conserved motifs. According to the protein homology analysis, we identified four candidate CcbHLHs that were highly conserved compared to the TT8 protein in A. thaliana. These TFs are potentially involved in anthocyanin biosynthesis in C. camphora. RNA-seq analysis revealed specific expression patterns of CcbHLHs in different tissue types. Furthermore, we verified expression patterns of seven CcbHLHs (CcbHLH001, CcbHLH015, CcbHLH017, CcbHLH022, CcbHLH101, CcbHLH118, and CcbHLH134) in various tissue types at different growth stages using qRT-PCR. This study opens a new avenue for subsequent research on anthocyanin biosynthesis regulated by CcbHLH TFs in C. camphora.
Collapse
|
5
|
Danneels B, Blignaut M, Marti G, Sieber S, Vandamme P, Meyer M, Carlier A. Cyclitol metabolism is a central feature of Burkholderia leaf symbionts. Environ Microbiol 2023; 25:454-472. [PMID: 36451580 DOI: 10.1111/1462-2920.16292] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 11/28/2022] [Indexed: 12/03/2022]
Abstract
The symbioses between plants of the Rubiaceae and Primulaceae families with Burkholderia bacteria represent unique and intimate plant-bacterial relationships. Many of these interactions have been identified through PCR-dependent typing methods, but there is little information available about their functional and ecological roles. We assembled 17 new endophyte genomes representing endophytes from 13 plant species, including those of two previously unknown associations. Genomes of leaf endophytes belonging to Burkholderia s.l. show extensive signs of genome reduction, albeit to varying degrees. Except for one endophyte, none of the bacterial symbionts could be isolated on standard microbiological media. Despite their taxonomic diversity, all endophyte genomes contained gene clusters linked to the production of specialized metabolites, including genes linked to cyclitol sugar analog metabolism and in one instance non-ribosomal peptide synthesis. These genes and gene clusters are unique within Burkholderia s.l. and are likely horizontally acquired. We propose that the acquisition of secondary metabolite gene clusters through horizontal gene transfer is a prerequisite for the evolution of a stable association between these endophytes and their hosts.
Collapse
Affiliation(s)
- Bram Danneels
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
- Computational Biology Unit, Department of Informatics, University of Bergen, Norway
| | - Monique Blignaut
- Department of Plant and Soil Sciences, University of Pretoria, Pretoria, South Africa
| | - Guillaume Marti
- Metatoul-AgromiX Platform, LRSV, Université de Toulouse, CNRS, UT3, INP, Toulouse, France
- MetaboHUB-MetaToul, National Infrastructure of Metabolomics and Fluxomics, Toulouse, France
| | - Simon Sieber
- Department of Chemistry, University of Zurich, Zurich, Switzerland
| | - Peter Vandamme
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Marion Meyer
- Department of Plant and Soil Sciences, University of Pretoria, Pretoria, South Africa
| | - Aurélien Carlier
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| |
Collapse
|
6
|
Yang MH, Lu YS, Ho TC, Shen DHY, Huang YF, Chuang KP, Yuan CH, Tyan YC. Utilizing Proteomic Approach to Analyze Potential Antioxidant Proteins in Plant against Irradiation. Antioxidants (Basel) 2022; 11:antiox11122498. [PMID: 36552706 PMCID: PMC9774815 DOI: 10.3390/antiox11122498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/08/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022] Open
Abstract
Gamma-ray irradiation is an effective and clean method of sterilization by inactivating microorganisms. It can also be applied to induce anti-oxidants for future application. In this study, the mung bean (Vigna radiata) was exposed to gamma-ray irradiation under the dose of 0, 5 or 10 kGy. With increasing irradiation doses, the concentrations of malondiadehyde decreased while the levels of total flavonoids and DPPH (1,1-diphenyl-2-picrylhydrazyl) radical scavenging activity increased. It has been shown that consuming flavonoids can provide protective effects. In addition, proteomic analysis identified several proteins having anti-oxidant activities in the 5 kGy irradiated group. These proteins are Apocytochrome f, Systemin receptor SR 160, DELLA protein DWARF8, DEAD-box ATP-dependent RNA helicase 9, ζ-carotene desaturase (ZDS), and Floral homeotic protein AGAMOUS. Our findings indicate that plants contain a variety of phytochemicals and antioxidant proteins which may effectively prevent oxidative stress caused by irradiated peroxidation.
Collapse
Affiliation(s)
- Ming-Hui Yang
- Department of Medical Education and Research, Kaohsiung Veterans General Hospital, Kaohsiung 813, Taiwan
- Center of General Education, Shu-Zen Junior College of Medicine and Management, Kaohsiung 821, Taiwan
| | - Yi-Shan Lu
- Office of Safety, Health and Environment, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| | - Tzu-Chuan Ho
- Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| | - Daniel Hueng-Yuan Shen
- Department of Nuclear Medicine, Kaohsiung Veterans General Hospital, Kaohsiung 813, Taiwan
| | - Ying-Fong Huang
- Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- Department of Nuclear Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan
| | - Kuo-Pin Chuang
- Graduate Institute of Animal Vaccine Technology, College of Veterinary Medicine, National Pingtung University of Science and Technology, Pingtung 912, Taiwan
| | - Cheng-Hui Yuan
- Mass Spectrometry Laboratory, Department of Chemistry, National University of Singapore, Singapore 119077, Singapore
| | - Yu-Chang Tyan
- Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- Department of Nuclear Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan
- Graduate Institute of Animal Vaccine Technology, College of Veterinary Medicine, National Pingtung University of Science and Technology, Pingtung 912, Taiwan
- School of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan
- Center for Cancer Research, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- Research Center for Precision Environmental Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- Correspondence:
| |
Collapse
|
7
|
Ying N, Wang Y, Song X, Qin B, Wu Y, Yang L, Fang W. Transcriptome analysis of Macrobrachium rosenbergii: Identification of precocious puberty and slow-growing information. J Invertebr Pathol 2022; 190:107752. [DOI: 10.1016/j.jip.2022.107752] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Revised: 03/18/2022] [Accepted: 03/24/2022] [Indexed: 01/29/2023]
|
8
|
Chen S, Su H, Xing H, Mao J, Sun P, Li M. Comparative Proteomics Reveals the Difference in Root Cold Resistance between Vitis. riparia × V. labrusca and Cabernet Sauvignon in Response to Freezing Temperature. PLANTS (BASEL, SWITZERLAND) 2022; 11:971. [PMID: 35406951 PMCID: PMC9003149 DOI: 10.3390/plants11070971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 03/28/2022] [Accepted: 04/01/2022] [Indexed: 06/14/2023]
Abstract
Grapevines, bearing fruit containing large amounts of bioactive metabolites that offer health benefits, are widely cultivated around the world. However, the cold damage incurred when grown outside in extremely low temperatures during the overwintering stage limits the expansion of production. Although the morphological, biochemical, and molecular levels in different Vitis species exposed to different temperatures have been investigated, differential expression of proteins in roots is still limited. Here, the roots of cold-resistant (Vitis. riparia × V. labrusca, T1) and cold-sensitive varieties (Cabernet Sauvignon, T3) at -4 °C, and also at -15 °C for the former (T2), were measured by iTRAQ-based proteomic analysis. Expression levels of genes encoding candidate proteins were validated by qRT-PCR, and the root activities during different treatments were determined using a triphenyl tetrazolium chloride method. The results show that the root activity of the cold-resistant variety was greater than that of the cold-sensitive variety, and it declined with the decrease in temperature. A total of 25 proteins were differentially co-expressed in T2 vs. T1 and T1 vs. T3, and these proteins were involved in stress response, bio-signaling, metabolism, energy, and translation. The relative expression levels of the 13 selected genes were consistent with their fold-change values of proteins. The signature translation patterns for the roots during spatio-temporal treatments of different varieties at different temperatures provide insight into the differential mechanisms of cold resistance of grapevine.
Collapse
Affiliation(s)
- Sijin Chen
- State Key Laboratory of Aridland Crop Science, Gansu Agricultural University, Lanzhou 730070, China; (S.C.); (H.S.); (H.X.)
| | - Hongyan Su
- State Key Laboratory of Aridland Crop Science, Gansu Agricultural University, Lanzhou 730070, China; (S.C.); (H.S.); (H.X.)
| | - Hua Xing
- State Key Laboratory of Aridland Crop Science, Gansu Agricultural University, Lanzhou 730070, China; (S.C.); (H.S.); (H.X.)
| | - Juan Mao
- College of Horticulture, Gansu Agricultural University, Lanzhou 730070, China;
| | - Ping Sun
- State Key Laboratory of Aridland Crop Science, Gansu Agricultural University, Lanzhou 730070, China; (S.C.); (H.S.); (H.X.)
| | - Mengfei Li
- State Key Laboratory of Aridland Crop Science, Gansu Agricultural University, Lanzhou 730070, China; (S.C.); (H.S.); (H.X.)
| |
Collapse
|
9
|
Cui Y, Fan B, Xu X, Sheng S, Xu Y, Wang X. A High-Density Genetic Map Enables Genome Synteny and QTL Mapping of Vegetative Growth and Leaf Traits in Gardenia. Front Genet 2022; 12:802738. [PMID: 35132310 PMCID: PMC8817757 DOI: 10.3389/fgene.2021.802738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 12/13/2021] [Indexed: 11/13/2022] Open
Abstract
The gardenia is a traditional medicinal horticultural plant in China, but its molecular genetic research has been largely hysteretic. Here, we constructed an F1 population with 200 true hybrid individuals. Using the genotyping-by-sequencing method, a high-density sex-average genetic map was generated that contained 4,249 SNPs with a total length of 1956.28 cM and an average genetic distance of 0.46 cM. We developed 17 SNP-based Kompetitive Allele-Specific PCR markers and found that 15 SNPs were successfully genotyped, of which 13 single-nucleotide polymorphism genotypings of 96 F1 individuals showed genotypes consistent with GBS-mined genotypes. A genomic collinearity analysis between gardenia and the Rubiaceae species Coffea arabica, Coffea canephora and Ophiorrhiza pumila showed the relativity strong conservation of LG11 with NC_039,919.1, HG974438.1 and Bliw01000011.1, respectively. Lastly, a quantitative trait loci analysis at three phenotyping time points (2019, 2020, and 2021) yielded 18 QTLs for growth-related traits and 31 QTLs for leaf-related traits, of which qBSBN7-1, qCD8 and qLNP2-1 could be repeatably detected. Five QTL regions (qCD8 and qSBD8, qBSBN7 and qSI7, qCD4-1 and qLLLS4, qLNP10 and qSLWS10-2, qSBD10 and qLLLS10) with potential pleiotropic effects were also observed. This study provides novel insight into molecular genetic research and could be helpful for further gene cloning and marker-assisted selection for early growth and development traits in the gardenia.
Collapse
Affiliation(s)
- Yang Cui
- Research Center for Traditional Chinese Medicine Resources and Ethnic Minority Medicine, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Baolian Fan
- Research Center for Traditional Chinese Medicine Resources and Ethnic Minority Medicine, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Xu Xu
- Research Center for Traditional Chinese Medicine Resources and Ethnic Minority Medicine, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Shasha Sheng
- Research Center for Traditional Chinese Medicine Resources and Ethnic Minority Medicine, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Yuhui Xu
- Adsen Biotechnology Co., Ltd., Urumchi, China
| | - Xiaoyun Wang
- Research Center for Traditional Chinese Medicine Resources and Ethnic Minority Medicine, Jiangxi University of Chinese Medicine, Nanchang, China
| |
Collapse
|
10
|
Jia LN, Yan X, You ZH, Zhou X, Li LP, Wang L, Song KJ. NLPEI: A Novel Self-Interacting Protein Prediction Model Based on Natural Language Processing and Evolutionary Information. Evol Bioinform Online 2020; 16:1176934320984171. [PMID: 33488064 PMCID: PMC7768313 DOI: 10.1177/1176934320984171] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 12/01/2020] [Indexed: 12/13/2022] Open
Abstract
The study of protein self-interactions (SIPs) can not only reveal the function of proteins at the molecular level, but is also crucial to understand activities such as growth, development, differentiation, and apoptosis, providing an important theoretical basis for exploring the mechanism of major diseases. With the rapid advances in biotechnology, a large number of SIPs have been discovered. However, due to the long period and high cost inherent to biological experiments, the gap between the identification of SIPs and the accumulation of data is growing. Therefore, fast and accurate computational methods are needed to effectively predict SIPs. In this study, we designed a new method, NLPEI, for predicting SIPs based on natural language understanding theory and evolutionary information. Specifically, we first understand the protein sequence as natural language and use natural language processing algorithms to extract its features. Then, we use the Position-Specific Scoring Matrix (PSSM) to represent the evolutionary information of the protein and extract its features through the Stacked Auto-Encoder (SAE) algorithm of deep learning. Finally, we fuse the natural language features of proteins with evolutionary features and make accurate predictions by Extreme Learning Machine (ELM) classifier. In the SIPs gold standard data sets of human and yeast, NLPEI achieved 94.19% and 91.29% prediction accuracy. Compared with different classifier models, different feature models, and other existing methods, NLPEI obtained the best results. These experimental results indicated that NLPEI is an effective tool for predicting SIPs and can provide reliable candidates for biological experiments.
Collapse
Affiliation(s)
- Li-Na Jia
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang, China
| | - Xin Yan
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
- School of Foreign Languages, Zaozhuang University, Zaozhuang, China
| | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China
| | - Xi Zhou
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China
| | - Li-Ping Li
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China
| | - Lei Wang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang, China
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China
- Lei Wang, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China.
| | - Ke-Jian Song
- School of information engineering, Jiangxi University of Science and Technology, Ganzhou, China
| |
Collapse
|
11
|
Kumar PS, Dabdoub SM, Ganesan SM. Probing periodontal microbial dark matter using metataxonomics and metagenomics. Periodontol 2000 2020; 85:12-27. [PMID: 33226714 DOI: 10.1111/prd.12349] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Our view of the periodontal microbial community has been shaped by a century or more of cultivation-based and microscopic investigations. While these studies firmly established the infection-mediated etiology of periodontal diseases, it was apparent from the very early days that periodontal microbiology suffered from what Staley and Konopka described as the "great plate count anomaly", in that these culturable bacteria were only a minor part of what was visible under the microscope. For nearly a century, much effort has been devoted to finding the right tools to investigate this uncultivated majority, also known as "microbial dark matter". The discovery that DNA was an effective tool to "see" microbial dark matter was a significant breakthrough in environmental microbiology, and oral microbiologists were among the earliest to capitalize on these advances. By identifying the order in which nucleotides are arranged in a stretch of DNA (DNA sequencing) and creating a repository of these sequences, sequence databases were created. Computational tools that used probability-driven analysis of these sequences enabled the discovery of new and unsuspected species and ascribed novel functions to these species. This review will trace the development of DNA sequencing as a quantitative, open-ended, comprehensive approach to characterize microbial communities in their native environments, and explore how this technology has shifted traditional dogmas on how the oral microbiome promotes health and its role in disease causation and perpetuation.
Collapse
Affiliation(s)
- Purnima S Kumar
- Department of Periodontology, College of Dentistry, The Ohio State University, Columbus, Ohio, USA
| | - Shareef M Dabdoub
- Department of Periodontology, College of Dentistry, The Ohio State University, Columbus, Ohio, USA
| | - Sukirth M Ganesan
- Department of Periodontics, College of Dentistry and Dental Clinics, The University of Iowa, Iowa City, Iowa, USA
| |
Collapse
|
12
|
Proença DN, Heine T, Senges CHR, Bandow JE, Morais PV, Tischler D. Bacterial Metabolites Produced Under Iron Limitation Kill Pinewood Nematode and Attract Caenorhabditis elegans. Front Microbiol 2019; 10:2166. [PMID: 31608025 PMCID: PMC6761702 DOI: 10.3389/fmicb.2019.02166] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Accepted: 09/03/2019] [Indexed: 11/19/2022] Open
Abstract
Pine Wilt Disease (PWD) is caused by Bursaphelenchus xylophilus, the pinewood nematode, and affects several species of pine trees worldwide. The ecosystem of the Pinus pinaster trees was investigated as a source of bacteria producing metabolites affecting this ecosystem: P. pinaster trees as target-plant, nematode as disease effector and its insect-vector as shuttle. For example, metals and metal-carrying compounds contribute to the complex tree-ecosystems. This work aimed to detect novel secondary metabolites like metallophores and related molecules produced under iron limitation by PWD-associated bacteria and to test their activity on nematodes. After screening 357 bacterial strains from Portugal and United States, two promising metallophore-producing strains Erwinia sp. A41C3 and Rouxiella sp. Arv20#4.1 were chosen and investigated in more detail. The genomes of these strains were sequenced, analyzed, and used to detect genetic potential for secondary metabolite production. A combinatorial approach of liquid chromatography-coupled tandem mass spectrometry (LC-MS) linked to molecular networking was used to describe these compounds. Two major metabolites were detected by HPLC analyses and described. One HPLC fraction of strain Arv20#4.1 showed to be a hydroxamate-type siderophore with higher affinity for chelation of Cu. The HPLC fraction of strain A41C3 with highest metal affinity showed to be a catecholate-type siderophore with higher affinity for chelation of Fe. LC-MS allowed the identification of several desferrioxamines from strain Arv20#4.1, in special desferrioxamine E, but no hit was obtained in case of strain A41C3 which might indicate that it is something new. Bacteria and their culture supernatants showed ability to attract C. elegans. HPLC fractions of those supernatant-extracts of Erwinia strain A41C3, enriched with secondary metabolites such as siderophores, were able to kill pinewood nematode. These results suggest that metabolites secreted under iron limitation have potential to biocontrol B. xylophilus and for management of Pine Wilt Disease.
Collapse
Affiliation(s)
- Diogo Neves Proença
- Department of Life Sciences and Laboratory of Environmental Microbiology of CEMMPRE, University of Coimbra, Coimbra, Portugal
| | - Thomas Heine
- Environmental Microbiology, TU Bergakademie Freiberg, Freiberg, Germany
| | - Christoph H. R. Senges
- Applied Microbiology, Faculty of Biology and Biotechnology, Ruhr University Bochum, Bochum, Germany
| | - Julia E. Bandow
- Applied Microbiology, Faculty of Biology and Biotechnology, Ruhr University Bochum, Bochum, Germany
| | - Paula V. Morais
- Department of Life Sciences and Laboratory of Environmental Microbiology of CEMMPRE, University of Coimbra, Coimbra, Portugal
| | - Dirk Tischler
- Environmental Microbiology, TU Bergakademie Freiberg, Freiberg, Germany
- Microbial Biotechnology, Faculty of Biology and Biotechnology, Ruhr University Bochum, Bochum, Germany
| |
Collapse
|
13
|
Tian L, Zhang D, Su P, Wei Y, Wang Z, Wang PX, Dai CJ, Gong GL. Design, recombinant expression, and antibacterial activity of a novel hybrid magainin-thanatin antimicrobial peptide. Prep Biochem Biotechnol 2019; 49:427-434. [PMID: 30861356 DOI: 10.1080/10826068.2018.1476875] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Antimicrobial peptides are small molecule polypeptides with biological activity, which can avoid the drug resistance. Magainin and thanatin are antimicrobial peptides with a broad spectrum of inhibitory microbes, and the core sequence of magainin is linked to a core sequence of thanatin. Here, the hybrid magainin-thanatin (MT) antimicrobial peptide was designed through bioinformatics analysis. The recombinant MT antimicrobial peptide was successfully expressed and purified in Escherichia coli BL21 (DE3). The molecular weight of the hybrid MT antimicrobial peptide was about 3.35 kDa. Moreover, the target protein indeed has an inhibitory effect on Staphylococcus aureus, E. coli DH5α, and Bacillus subtilis, with the minimum inhibitory concentrations 16.5, 20, and 9 μM, respectively. The rational designed hybrid MT antimicrobial peptide will hopefully provide large-scale fermentable antimicrobial peptides in the industrial production in the future.
Collapse
Affiliation(s)
- Lu Tian
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Di Zhang
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Peng Su
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Yuan Wei
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Zhongzhong Wang
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Pan Xue Wang
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Chun Ji Dai
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| | - Guo Li Gong
- a School of Food and Bioengineering , Shaanxi University of Science & Technology , Xi'an , China
| |
Collapse
|
14
|
Yuan Z, Fang Y, Zhang T, Fei Z, Han F, Liu C, Liu M, Xiao W, Zhang W, Wu S, Zhang M, Ju Y, Xu H, Dai H, Liu Y, Chen Y, Wang L, Zhou J, Guan D, Yan M, Xia Y, Huang X, Liu D, Wei H, Zheng H. The pomegranate (Punica granatum L.) genome provides insights into fruit quality and ovule developmental biology. PLANT BIOTECHNOLOGY JOURNAL 2018; 16:1363-1374. [PMID: 29271050 PMCID: PMC5999313 DOI: 10.1111/pbi.12875] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2017] [Revised: 11/26/2017] [Accepted: 12/18/2017] [Indexed: 05/18/2023]
Abstract
Pomegranate (Punica granatum L.) has an ancient cultivation history and has become an emerging profitable fruit crop due to its attractive features such as the bright red appearance and the high abundance of medicinally valuable ellagitannin-based compounds in its peel and aril. However, the limited genomic resources have restricted further elucidation of genetics and evolution of these interesting traits. Here, we report a 274-Mb high-quality draft pomegranate genome sequence, which covers approximately 81.5% of the estimated 336-Mb genome, consists of 2177 scaffolds with an N50 size of 1.7 Mb and contains 30 903 genes. Phylogenomic analysis supported that pomegranate belongs to the Lythraceae family rather than the monogeneric Punicaceae family, and comparative analyses showed that pomegranate and Eucalyptus grandis share the paleotetraploidy event. Integrated genomic and transcriptomic analyses provided insights into the molecular mechanisms underlying the biosynthesis of ellagitannin-based compounds, the colour formation in both peels and arils during pomegranate fruit development, and the unique ovule development processes that are characteristic of pomegranate. This genome sequence provides an important resource to expand our understanding of some unique biological processes and to facilitate both comparative biology studies and crop breeding.
Collapse
Affiliation(s)
- Zhaohe Yuan
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - Yanming Fang
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of Biology and the EnvironmentNanjing Forestry UniversityNanjingChina
| | - Taikui Zhang
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - Zhangjun Fei
- Boyce Thompson InstituteCornell UniversityIthacaNYUSA
- USDA Robert W. Holley Center for Agriculture and HealthIthacaNYUSA
| | | | - Cuiyu Liu
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - Min Liu
- Biomarker Technologies CorporationBeijingChina
| | - Wei Xiao
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | | | - Shan Wu
- Boyce Thompson InstituteCornell UniversityIthacaNYUSA
| | - Mengwei Zhang
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - Youhui Ju
- Biomarker Technologies CorporationBeijingChina
| | - Huili Xu
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - He Dai
- Biomarker Technologies CorporationBeijingChina
| | - Yujun Liu
- College of Biological Sciences and BiotechnologyBeijing Forestry UniversityBeijingChina
| | - Yanhui Chen
- College of HorticultureHenan Agricultural UniversityZhengzhouChina
| | - Lili Wang
- Biomarker Technologies CorporationBeijingChina
| | - Jianqing Zhou
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - Dian Guan
- Biomarker Technologies CorporationBeijingChina
| | - Ming Yan
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | - Yanhua Xia
- Biomarker Technologies CorporationBeijingChina
| | - Xianbin Huang
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | | | - Hongmin Wei
- Co‐Innovation Center for Sustainable Forestry in Southern ChinaNanjing Forestry UniversityNanjingChina
- College of ForestryNanjing Forestry UniversityNanjingChina
| | | |
Collapse
|
15
|
Wen F, Wu X, Li T, Jia M, Liu X, Li P, Zhou X, Ji X, Yue X. Genome-wide survey of heat shock factors and heat shock protein 70s and their regulatory network under abiotic stresses in Brachypodium distachyon. PLoS One 2017; 12:e0180352. [PMID: 28683139 PMCID: PMC5500289 DOI: 10.1371/journal.pone.0180352] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2017] [Accepted: 06/14/2017] [Indexed: 11/18/2022] Open
Abstract
The heat shock protein 70s (Hsp70s) and heat shock factors (Hsfs) play key roles in protecting plant cells or tissues from various abiotic stresses. Brachypodium distachyon, recently developed an excellent model organism for functional genomics research, is related to the major cereal grain species. Although B. distachyon genome has been fully sequenced, the information of Hsf and Hsp70 genes and especially the regulatory network between Hsfs and Hsp70s remains incomplete. Here, a total of 24 BdHsfs and 29 BdHsp70s were identified in the genome by bioinformatics analysis and the regulatory network between Hsfs and Hsp70s were performed in this study. Based on highly conserved domain and motif analysis, BdHsfs were grouped into three classes, and BdHsp70s divided into six groups, respectively. Most of Hsf proteins contain five conserved domains: DBD, HR-A/B region, NLS and NES motifs and AHA domain, while Hsp70 proteins have three conserved domains: N-terminal nucleotide binding domain, peptide binding domain and a variable C-terminal lid region. Expression data revealed a large number of BdHsfs and BdHsp70s were induced by HS challenge, and a previous heat acclimation could induce the acquired thermotolerance to help seedling suffer the severe HS challenge, suggesting that the BdHsfs and BdHsp70s played a role in alleviating the damage by HS. The comparison revealed that, most BdHsfs and BdHsp70s genes responded to multiple abiotic stresses in an overlapping relationship, while some of them were stress specific response genes. Moreover, co-expression relationships and predicted protein-protein interaction network implied that class A and B Hsfs played as activator and repressors, respectively, suggesting that BdHsp70s might be regulated by both the activation and the repression mechanisms under stress condition. Our genomics analysis of BdHsfs and BdHsp70s provides important evolutionary and functional characterization for further investigation of the accurate regulatory mechanisms among Hsfs and Hsp70s in herbaceous plants.
Collapse
Affiliation(s)
- Feng Wen
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
- * E-mail:
| | - Xiaozhu Wu
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Tongjian Li
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Mingliang Jia
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Xinshen Liu
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Peng Li
- Shanghai Chenshan Plant Science Research Center, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS). Shanghai Chenshan Botanic Garden, Songjiang, Shanghai, China
| | - Xiaojian Zhou
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Xinxin Ji
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Xiaomin Yue
- School of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| |
Collapse
|
16
|
Kumari V, Kumar V, Bhalla TC. Functional interpretation and structural insights of Arabidopsis lyrata cytochrome P450 CYP71A13 involved in auxin synthesis. Bioinformation 2015; 11:330-5. [PMID: 26339148 PMCID: PMC4546991 DOI: 10.6026/97320630011330] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Revised: 12/11/2014] [Accepted: 12/14/2014] [Indexed: 11/29/2022] Open
Abstract
Cytochrome P450 CYP71A13 of Arabidopsis lyrata is a heme protein involved in biosynthesis of indole-3-acetonitrile which leads to the formation of indolyl-3-acetic acid. It catalyzes a unique reaction: formation of a carbon-nitrogen triple bond and dehydration of indolyl-3-acetaldoxime. Homology model of this 57 kDa polypeptide revealed that the heme existed between H-helix and J- helix in the hydrophobic pocket, although both helixes are involved in catalytic activity, where Gly305 and Thr308, 311 of H- helix were involved in its stabilization. The substrate indole-3-acetaldoxime was tightly fitted into the substrate pocket with the aromatic ring being surrounded by amino acid residues creating a hydrophobic environment. The smaller size of the substrate binding pocket in cytochrome P450 CYP71A13 was due to the bulkiness of the two amino acid residues Phe182 and Trp315 pointing into the substrate binding cavity. The apparent role of the heme in cytochrome P450 CYP71A13 was to tether the substrate in the catalysis by indole-3-acetaldoxime dehydratase. Since the crystal structure of cytochrome P450 CYP71A13 has not yet been solved, the modeled structure revealed mechanism of substrate recognition and catalysis.
Collapse
Affiliation(s)
- Vijaya Kumari
- Department of Biotechnology, Himachal Pradesh University, Shimla-171005
| | - Vijay Kumar
- Department of Biotechnology, Himachal Pradesh University, Shimla-171005
| | - Tek Chand Bhalla
- Department of Biotechnology, Himachal Pradesh University, Shimla-171005
| |
Collapse
|
17
|
Wiwanitkit V. Utilization of multiple "omics" studies in microbial pathogeny for microbiology insights. Asian Pac J Trop Biomed 2015; 3:330-3. [PMID: 23620861 DOI: 10.1016/s2221-1691(13)60073-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2013] [Accepted: 02/20/2013] [Indexed: 11/28/2022] Open
Abstract
In the present day, bioinformatics becomes the modern science with several advantages. Several new "omics" sciences have been introduced for a few years and those sciences can be applied in biomedical work. Here, the author will summarize and discuss on important applications of omics studies in microbiology focusing on microbial pathogeny. It can be seen that genomics and proteinomics can be well used in this area of biomedical studies.
Collapse
|
18
|
Mohayeji M, Capriotti AL, Cavaliere C, Piovesana S, Samperi R, Stampachiacchiere S, Toorchi M, Lagana A. Heterosis profile of sunflower leaves: A label free proteomics approach. J Proteomics 2014; 99:101-10. [DOI: 10.1016/j.jprot.2014.01.028] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Revised: 01/21/2014] [Accepted: 01/27/2014] [Indexed: 12/24/2022]
|
19
|
Bitra A, Hussain B, Tanwar AS, Anand R. Identification of Function and Mechanistic Insights of Guanine Deaminase from Nitrosomonas europaea: Role of the C-Terminal Loop in Catalysis. Biochemistry 2013; 52:3512-22. [DOI: 10.1021/bi400068g] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Affiliation(s)
- Aruna Bitra
- Department of Chemistry, IIT Bombay, Mumbai, India 400076
| | - Bhukya Hussain
- Department of Chemistry, IIT Bombay, Mumbai, India 400076
| | | | - Ruchi Anand
- Department of Chemistry, IIT Bombay, Mumbai, India 400076
| |
Collapse
|
20
|
Autore F, Pfuhl M, Quan X, Williams A, Roberts RG, Shanahan CM, Fraternali F. Large-scale modelling of the divergent spectrin repeats in nesprins: giant modular proteins. PLoS One 2013; 8:e63633. [PMID: 23671687 PMCID: PMC3646009 DOI: 10.1371/journal.pone.0063633] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Accepted: 04/09/2013] [Indexed: 11/29/2022] Open
Abstract
Nesprin-1 and nesprin-2 are nuclear envelope (NE) proteins characterized by a common structure of an SR (spectrin repeat) rod domain and a C-terminal transmembrane KASH [Klarsicht-ANC-Syne-homology] domain and display N-terminal actin-binding CH (calponin homology) domains. Mutations in these proteins have been described in Emery-Dreifuss muscular dystrophy and attributed to disruptions of interactions at the NE with nesprins binding partners, lamin A/C and emerin. Evolutionary analysis of the rod domains of the nesprins has shown that they are almost entirely composed of unbroken SR-like structures. We present a bioinformatical approach to accurate definition of the boundaries of each SR by comparison with canonical SR structures, allowing for a large-scale homology modelling of the 74 nesprin-1 and 56 nesprin-2 SRs. The exposed and evolutionary conserved residues identify important pbs for protein-protein interactions that can guide tailored binding experiments. Most importantly, the bioinformatics analyses and the 3D models have been central to the design of selected constructs for protein expression. 1D NMR and CD spectra have been performed of the expressed SRs, showing a folded, stable, high content α-helical structure, typical of SRs. Molecular Dynamics simulations have been performed to study the structural and elastic properties of consecutive SRs, revealing insights in the mechanical properties adopted by these modules in the cell.
Collapse
Affiliation(s)
- Flavia Autore
- Randall Division of Cell and Molecular Biophysics, School of Physical Sciences and Engineering, King's College London, London, United Kingdom
- Division of Cardiovascular Medicine, BHF Centre of Research Excellence, King's College London, London, United Kingdom
| | - Mark Pfuhl
- Randall Division of Cell and Molecular Biophysics, School of Physical Sciences and Engineering, King's College London, London, United Kingdom
| | - Xueping Quan
- Randall Division of Cell and Molecular Biophysics, School of Physical Sciences and Engineering, King's College London, London, United Kingdom
| | - Aisling Williams
- Randall Division of Cell and Molecular Biophysics, School of Physical Sciences and Engineering, King's College London, London, United Kingdom
- Division of Cardiovascular Medicine, BHF Centre of Research Excellence, King's College London, London, United Kingdom
| | - Roland G. Roberts
- Division of Medical and Molecular Genetics, Kings College London, Guy's Hospital, London, United Kingdom
| | - Catherine M. Shanahan
- Division of Cardiovascular Medicine, BHF Centre of Research Excellence, King's College London, London, United Kingdom
| | - Franca Fraternali
- Randall Division of Cell and Molecular Biophysics, School of Physical Sciences and Engineering, King's College London, London, United Kingdom
- The Thomas Young Centre for Theory and Simulation of Materials, London, United Kingdom
| |
Collapse
|
21
|
Blais EM, Chavali AK, Papin JA. Linking genome-scale metabolic modeling and genome annotation. Methods Mol Biol 2013; 985:61-83. [PMID: 23417799 DOI: 10.1007/978-1-62703-299-5_4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Genome-scale metabolic network reconstructions, assembled from annotated genomes, serve as a platform for integrating data from heterogeneous sources and generating hypotheses for further experimental validation. Implementing constraint-based modeling techniques such as flux balance analysis (FBA) on network reconstructions allows for interrogating metabolism at a systems level, which aids in identifying and rectifying gaps in knowledge. With genome sequences for various organisms from prokaryotes to eukaryotes becoming increasingly available, a significant bottleneck lies in the structural and functional annotation of these sequences. Using topologically based and biologically inspired metabolic network refinement, we can better characterize enzymatic functions present in an organism and link annotation of these functions to candidate transcripts; both steps can be experimentally validated.
Collapse
Affiliation(s)
- Edik M Blais
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | | | | |
Collapse
|
22
|
Lu HM, Yin DC, Liu YM, Guo WH, Zhou RB. Correlation between protein sequence similarity and crystallization reagents in the biological macromolecule crystallization database. Int J Mol Sci 2012; 13:9514-9526. [PMID: 22949812 PMCID: PMC3431810 DOI: 10.3390/ijms13089514] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 07/09/2012] [Accepted: 07/10/2012] [Indexed: 11/25/2022] Open
Abstract
The protein structural entries grew far slower than the sequence entries. This is partly due to the bottleneck in obtaining diffraction quality protein crystals for structural determination using X-ray crystallography. The first step to achieve protein crystallization is to find out suitable chemical reagents. However, it is not an easy task. Exhausting trial and error tests of numerous combinations of different reagents mixed with the protein solution are usually necessary to screen out the pursuing crystallization conditions. Therefore, any attempts to help find suitable reagents for protein crystallization are helpful. In this paper, an analysis of the relationship between the protein sequence similarity and the crystallization reagents according to the information from the existing databases is presented. We extracted information of reagents and sequences from the Biological Macromolecule Crystallization Database (BMCD) and the Protein Data Bank (PDB) database, classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the crystallization reagents. The results showed that there is a pronounced positive correlation between them. Therefore, according to the correlation, prediction of feasible chemical reagents that are suitable to be used in crystallization screens for a specific protein is possible.
Collapse
Affiliation(s)
- Hui-Meng Lu
- Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China; E-Mails: (H.-M.L.); (Y.-M.L.); (W.-H.G.); (R.-B.Z.)
- Key Laboratory for Space Bioscience and Biotechnology, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China
| | - Da-Chuan Yin
- Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China; E-Mails: (H.-M.L.); (Y.-M.L.); (W.-H.G.); (R.-B.Z.)
- Key Laboratory for Space Bioscience and Biotechnology, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China
| | - Yong-Ming Liu
- Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China; E-Mails: (H.-M.L.); (Y.-M.L.); (W.-H.G.); (R.-B.Z.)
- Key Laboratory for Space Bioscience and Biotechnology, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China
| | - Wei-Hong Guo
- Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China; E-Mails: (H.-M.L.); (Y.-M.L.); (W.-H.G.); (R.-B.Z.)
- Key Laboratory for Space Bioscience and Biotechnology, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China
| | - Ren-Bin Zhou
- Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China; E-Mails: (H.-M.L.); (Y.-M.L.); (W.-H.G.); (R.-B.Z.)
| |
Collapse
|
23
|
Seaver SMD, Henry CS, Hanson AD. Frontiers in metabolic reconstruction and modeling of plant genomes. JOURNAL OF EXPERIMENTAL BOTANY 2012; 63:2247-58. [PMID: 22238452 DOI: 10.1093/jxb/err371] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
A major goal of post-genomic biology is to reconstruct and model in silico the metabolic networks of entire organisms. Work on bacteria is well advanced, and is now under way for plants and other eukaryotes. Genome-scale modelling in plants is much more challenging than in bacteria. The challenges come from features characteristic of higher organisms (subcellular compartmentation, tissue differentiation) and also from the particular severity in plants of a general problem: genome content whose functions remain undiscovered. This problem results in thousands of genes for which no function is known ('undiscovered genome content') and hundreds of enzymatic and transport functions for which no gene is yet identified. The severity of the undiscovered genome content problem in plants reflects their genome size and complexity. To bring the challenges of plant genome-scale modelling into focus, we first summarize the current status of plant genome-scale models. We then highlight the challenges - and ways to address them - in three areas: identifying genes for missing processes, modelling tissues as opposed to single cells, and finding metabolic functions encoded by undiscovered genome content. We also discuss the emerging view that a significant fraction of undiscovered genome content encodes functions that counter damage to metabolites inflicted by spontaneous chemical reactions or enzymatic mistakes.
Collapse
Affiliation(s)
- Samuel M D Seaver
- Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | | | | |
Collapse
|
24
|
Abstract
Several complex diseases are caused by the malfunction of human metabolism, and deciphering the underlying molecular mechanisms can elucidate their aetiology. Systems biology is an integrative approach combining experimental and computational biology to identify and describe the molecular mechanisms of complex biological systems. Systems medicine has the potential to elucidate the onset and progression of complex metabolic diseases through the use of computational approaches. Advances in biotechnology have resulted in the provision of high-throughput data, which provide information about different metabolic processes. The systems medicine approach can utilize such data to reconstruct genome-scale metabolic models that can be used to study the function of specific enzymes and pathways in the context of the complete metabolic network. In this review, we outline the importance of genome-scale models in systems medicine and discuss how they may contribute towards the development of personalized medicine.
Collapse
Affiliation(s)
- A Mardinoglu
- Department of Chemical and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | | |
Collapse
|
25
|
Abstract
Annotation of prokaryotic sequences can be separated into structural and functional annotation. Structural annotation is dependent on algorithmic interrogation of experimental evidence to discover the physical characteristics of a gene. This is done in an effort to construct accurate gene models, so understanding function or evolution of genes among organisms is not impeded. Functional annotation is dependent on sequence similarity to other known genes or proteins in an effort to assess the function of the gene. Combining structural and functional annotation across genomes in a comparative manner promotes higher levels of accurate annotation as well as an advanced understanding of genome evolution. As the availability of bacterial sequences increases and annotation methods improve, the value of comparative annotation will increase.
Collapse
Affiliation(s)
- Nicholas Beckloff
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | | | | | | |
Collapse
|
26
|
Liu F, Wang H, Li J. An integrated bioinformatics analysis of mouse testis protein profiles with new understanding. BMB Rep 2011; 44:347-51. [PMID: 21615991 DOI: 10.5483/bmbrep.2011.44.5.347] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The testis is major male gonad responsible for spermatogenesis and steroidogenesis. Much knowledge is still remained to be learned about the control of these events. In this study, we performed a comprehensive bioinformatics analysis on 1,196 mouse testis proteins screened from public protein database. Integrated function and pathway analysis were performed through Database for Annotation, Visualization and Integrated Discovery (DAVID) and ingenuity Pathway Analysis (IPA), and significant features were clustered. Protein membrane organization and gene density on chromosomes were analyzed and discussed. The enriched bioinformatics analysis could provide clues and basis to the development of diagnostic markers and therapeutic targets for infertility and male contraception.
Collapse
Affiliation(s)
- FuJun Liu
- Shandong Research Centre for Stem Cell Engineering, Yu-Huang-Ding Hospital, Yantai, Shandong Province, China.
| | | | | |
Collapse
|
27
|
Chaudhuri R, Ansari FA, Raghunandanan MV, Ramachandran S. FungalRV: adhesin prediction and immunoinformatics portal for human fungal pathogens. BMC Genomics 2011; 12:192. [PMID: 21496229 PMCID: PMC3224177 DOI: 10.1186/1471-2164-12-192] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2010] [Accepted: 04/15/2011] [Indexed: 01/11/2023] Open
Abstract
Background The availability of sequence data of human pathogenic fungi generates opportunities to develop Bioinformatics tools and resources for vaccine development towards benefitting at-risk patients. Description We have developed a fungal adhesin predictor and an immunoinformatics database with predicted adhesins. Based on literature search and domain analysis, we prepared a positive dataset comprising adhesin protein sequences from human fungal pathogens Candida albicans, Candida glabrata, Aspergillus fumigatus, Coccidioides immitis, Coccidioides posadasii, Histoplasma capsulatum, Blastomyces dermatitidis, Pneumocystis carinii, Pneumocystis jirovecii and Paracoccidioides brasiliensis. The negative dataset consisted of proteins with high probability to function intracellularly. We have used 3945 compositional properties including frequencies of mono, doublet, triplet, and multiplets of amino acids and hydrophobic properties as input features of protein sequences to Support Vector Machine. Best classifiers were identified through an exhaustive search of 588 parameters and meeting the criteria of best Mathews Correlation Coefficient and lowest coefficient of variation among the 3 fold cross validation datasets. The "FungalRV adhesin predictor" was built on three models whose average Mathews Correlation Coefficient was in the range 0.89-0.90 and its coefficient of variation across three fold cross validation datasets in the range 1.2% - 2.74% at threshold score of 0. We obtained an overall MCC value of 0.8702 considering all 8 pathogens, namely, C. albicans, C. glabrata, A. fumigatus, B. dermatitidis, C. immitis, C. posadasii, H. capsulatum and P. brasiliensis thus showing high sensitivity and specificity at a threshold of 0.511. In case of P. brasiliensis the algorithm achieved a sensitivity of 66.67%. A total of 307 fungal adhesins and adhesin like proteins were predicted from the entire proteomes of eight human pathogenic fungal species. The immunoinformatics analysis data on these proteins were organized for easy user interface analysis. A Web interface was developed for analysis by users. The predicted adhesin sequences were processed through 18 immunoinformatics algorithms and these data have been organized into MySQL backend. A user friendly interface has been developed for experimental researchers for retrieving information from the database. Conclusion FungalRV webserver facilitating the discovery process for novel human pathogenic fungal adhesin vaccine has been developed.
Collapse
Affiliation(s)
- Rupanjali Chaudhuri
- G.N Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, Delhi 110007, India
| | | | | | | |
Collapse
|
28
|
Hartman AL, Norais C, Badger JH, Delmas S, Haldenby S, Madupu R, Robinson J, Khouri H, Ren Q, Lowe TM, Maupin-Furlow J, Pohlschroder M, Daniels C, Pfeiffer F, Allers T, Eisen JA. The complete genome sequence of Haloferax volcanii DS2, a model archaeon. PLoS One 2010; 5:e9605. [PMID: 20333302 PMCID: PMC2841640 DOI: 10.1371/journal.pone.0009605] [Citation(s) in RCA: 195] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2009] [Accepted: 02/11/2010] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Haloferax volcanii is an easily culturable moderate halophile that grows on simple defined media, is readily transformable, and has a relatively stable genome. This, in combination with its biochemical and genetic tractability, has made Hfx. volcanii a key model organism, not only for the study of halophilicity, but also for archaeal biology in general. METHODOLOGY/PRINCIPAL FINDINGS We report here the sequencing and analysis of the genome of Hfx. volcanii DS2, the type strain of this species. The genome contains a main 2.848 Mb chromosome, three smaller chromosomes pHV1, 3, 4 (85, 438, 636 kb, respectively) and the pHV2 plasmid (6.4 kb). CONCLUSIONS/SIGNIFICANCE The completed genome sequence, presented here, provides an invaluable tool for further in vivo and in vitro studies of Hfx. volcanii.
Collapse
Affiliation(s)
- Amber L. Hartman
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, United States of America
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- UC Davis Genome Center, University of California Davis, Davis, California, United States of America
| | - Cédric Norais
- Institut de Génétique et Microbiologie, Université Paris-Sud, Paris, France
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Jonathan H. Badger
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
| | - Stéphane Delmas
- Institute of Genetics, University of Nottingham, Nottingham, United Kingdom
| | - Sam Haldenby
- Institute of Genetics, University of Nottingham, Nottingham, United Kingdom
| | - Ramana Madupu
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
| | - Jeffrey Robinson
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
| | - Hoda Khouri
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
| | - Qinghu Ren
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
| | - Todd M. Lowe
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Julie Maupin-Furlow
- Department of Microbiology and Cell Science, University of Florida, Gainesville, Florida, United States of America
| | - Mecky Pohlschroder
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Charles Daniels
- Department of Microbiology, Ohio State University, Columbus, Ohio, United States of America
| | - Friedhelm Pfeiffer
- Department of Membrane Biochemistry, Max-Planck-Institute of Biochemistry, Martinsried, Germany
| | - Thorsten Allers
- Institute of Genetics, University of Nottingham, Nottingham, United Kingdom
| | - Jonathan A. Eisen
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- UC Davis Genome Center, University of California Davis, Davis, California, United States of America
- Department of Medical Microbiology and Immunology, University of California Davis, Davis, California, United States of America
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| |
Collapse
|
29
|
Chandramouli K, Qian PY. Proteomics: challenges, techniques and possibilities to overcome biological sample complexity. HUMAN GENOMICS AND PROTEOMICS : HGP 2009; 2009. [PMID: 20948568 PMCID: PMC2950283 DOI: 10.4061/2009/239204] [Citation(s) in RCA: 218] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2009] [Accepted: 08/28/2009] [Indexed: 01/12/2023]
Abstract
Proteomics is the large-scale study of the structure and function of proteins in complex biological sample. Such an approach has the potential value to understand the complex nature of the organism. Current proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of proteome. Advances in protein fractionation and labeling techniques have improved protein identification to include the least abundant proteins. In addition, proteomics has been complemented by the analysis of posttranslational modifications and techniques for the quantitative comparison of different proteomes. However, the major limitation of proteomic investigations remains the complexity of biological structures and physiological processes, rendering the path of exploration paved with various difficulties and pitfalls. The quantity of data that is acquired with new techniques places new challenges on data processing and analysis. This article provides a brief overview of currently available proteomic techniques and their applications, followed by detailed description of advantages and technical challenges. Some solutions to circumvent technical difficulties are proposed.
Collapse
|
30
|
Abstract
We present FIGfams, a new collection of over 100 000 protein families that are the product of manual curation and close strain comparison. Using the Subsystem approach the manual curation is carried out, ensuring a previously unattained degree of throughput and consistency. FIGfams are based on over 950 000 manually annotated proteins and across many hundred Bacteria and Archaea. Associated with each FIGfam is a two-tiered, rapid, accurate decision procedure to determine family membership for new proteins. FIGfams are freely available under an open source license. These can be downloaded at ftp://ftp.theseed.org/FIGfams/. The web site for FIGfams is http://www.theseed.org/wiki/FIGfams/
Collapse
Affiliation(s)
- Folker Meyer
- Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, USA.
| | | | | |
Collapse
|
31
|
Iwaniak A, Dziuba J. Analysis of Domains in Selected Plant and Animal Food Proteins - Precursors of Biologically Active Peptides - In Silico Approach. FOOD SCI TECHNOL INT 2009. [DOI: 10.1177/1082013208106320] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In silico methods are useful tool in protein structure-functional relationships analysis. BIOPEP and InterPro databases were applied to analyze the presence of bioactive fragments in the domains occurring in the sequences representing the major groups of proteins. Domains found in the proteins analyzed had mostly transporting (bovine β-lactoglobulin), immunoglobulin-like (chicken connectin), alpha-amylase inhibitor (a/β-wheat gliadin), calcium binding (chicken myosin) functions, or allowed straightly to assign the protein to an appropriate superfamily (bovine casein). It confirmed the thesis about the existence of the functional relations between the structure (sequence) and the domains with identified conformation. Amongst the domains present in the protein sequences we revealed the presence of fragments with the activities: antihypertensive, opioid, dipeptidylpeptidase IV inhibitors, immunomodulating, and neuropeptides. In the chicken connectin within the immunoglobulin-like domain we found immunomodulating fragments. InterPro analysis did not reveal the existence of any domains in a soybean globulin. It can be explained by the lack of the key structure information helpful in the defining the structure-function relationships. As the number of information in the applied databases will continue to increase we can expect to find stronger relationships between bioactivity of fragments encrypted in proteins and the functionality of domains. This might allow in the future to find evolutionary similarity between different origin food proteins - sources of bioactive peptides.
Collapse
Affiliation(s)
- A. Iwaniak
- rWarmia and Mazury University in Olsztyn, Chair of Food Biochemistry ul. Pl. Cieszyński 1, 10-726 Olsztyn, Poland
| | - J. Dziuba
- Warmia and Mazury University in Olsztyn, Chair of Food Biochemistry ul. Pl. Cieszyński 1, 10-726 Olsztyn, Poland,
| |
Collapse
|
32
|
Arakawa K, Tamaki S, Kono N, Kido N, Ikegami K, Ogawa R, Tomita M. Genome Projector: zoomable genome map with multiple views. BMC Bioinformatics 2009; 10:31. [PMID: 19166610 PMCID: PMC2636772 DOI: 10.1186/1471-2105-10-31] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2008] [Accepted: 01/23/2009] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Molecular biology data exist on diverse scales, from the level of molecules to -omics. At the same time, the data at each scale can be categorised into multiple layers, such as the genome, transcriptome, proteome, metabolome, and biochemical pathways. Due to the highly multi-layer and multi-dimensional nature of biological information, software interfaces for database browsing should provide an intuitive interface that allows for rapid migration across different views and scales. The Zoomable User Interface (ZUI) and tabbed browsing have proven successful for this purpose in other areas, especially to navigate the vast information in the World Wide Web. RESULTS This paper presents Genome Projector, a Web-based gateway for genomics information with a zoomable user interface using Google Maps API, equipped with four seamlessly accessible and searchable views: a circular genome map, a traditional genome map, a biochemical pathways map, and a DNA walk map. The Web application for 320 bacterial genomes is available at http://www.g-language.org/GenomeProjector/. All data and software including the source code, documentations, and development API are freely available under the GNU General Public License. Zoomable maps can be easily created from any image file using the development API, and an online data mapping service for Genome Projector is also available at our Web site. CONCLUSION Genome Projector is an intuitive Web application for browsing genomics information, implemented with a zoomable user interface and tabbed browsing utilising Google Maps API and Asynchronous JavaScript and XML (AJAX) technology.
Collapse
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Fujisawa, 252-8520, Japan.
| | | | | | | | | | | | | |
Collapse
|
33
|
Thybert D, Avner S, Lucchetti-Miganeh C, Chéron A, Barloy-Hubler F. OxyGene: an innovative platform for investigating oxidative-response genes in whole prokaryotic genomes. BMC Genomics 2008; 9:637. [PMID: 19117520 PMCID: PMC2631583 DOI: 10.1186/1471-2164-9-637] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2008] [Accepted: 12/31/2008] [Indexed: 11/25/2022] Open
Abstract
Background Oxidative stress is a common stress encountered by living organisms and is due to an imbalance between intracellular reactive oxygen and nitrogen species (ROS, RNS) and cellular antioxidant defence. To defend themselves against ROS/RNS, bacteria possess a subsystem of detoxification enzymes, which are classified with regard to their substrates. To identify such enzymes in prokaryotic genomes, different approaches based on similarity, enzyme profiles or patterns exist. Unfortunately, several problems persist in the annotation, classification and naming of these enzymes due mainly to some erroneous entries in databases, mistake propagation, absence of updating and disparity in function description. Description In order to improve the current annotation of oxidative stress subsystems, an innovative platform named OxyGene has been developed. It integrates an original database called OxyDB, holding thoroughly tested anchor-based signatures associated to subfamilies of oxidative stress enzymes, and a new anchor-driven annotator, for ab initio detection of ROS/RNS response genes. All complete Bacterial and Archaeal genomes have been re-annotated, and the results stored in the OxyGene repository can be interrogated via a Graphical User Interface. Conclusion OxyGene enables the exploration and comparative analysis of enzymes belonging to 37 detoxification subclasses in 664 microbial genomes. It proposes a new classification that improves both the ontology and the annotation of the detoxification subsystems in prokaryotic whole genomes, while discovering new ORFs and attributing precise function to hypothetical annotated proteins. OxyGene is freely available at:
Collapse
Affiliation(s)
- David Thybert
- CNRS UMR 6026, Interactions Cellulaires et Moléculaires, Equipe B@SIC, Université de Rennes 1, IFR140 GFAS, Campus de Beaulieu, Av. du Général Leclerc, 35042 Rennes, France.
| | | | | | | | | |
Collapse
|
34
|
Ivanciuc O, Schein CH, Garcia T, Oezguen N, Negi SS, Braun W. Structural analysis of linear and conformational epitopes of allergens. Regul Toxicol Pharmacol 2008; 54:S11-9. [PMID: 19121639 DOI: 10.1016/j.yrtph.2008.11.007] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2008] [Revised: 11/06/2008] [Accepted: 11/06/2008] [Indexed: 11/17/2022]
Abstract
In many countries regulatory agencies have adopted safety guidelines, based on bioinformatics rules from the WHO/FAO and EFSA recommendations, to prevent potentially allergenic novel foods or agricultural products from reaching consumers. We created the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to combine data that had previously been available only as flat files on Web pages or in the literature. SDAP was designed to be user friendly, to be of maximum use to regulatory agencies, clinicians, as well as to scientists interested in assessing the potential allergenic risk of a protein. We developed methods, unique to SDAP, to compare the physicochemical properties of discrete areas of allergenic proteins to known IgE epitopes. We developed a new similarity measure, the property distance (PD) value that can be used to detect related segments in allergens with clinical observed cross-reactivity. We have now expanded this work to obtain experimental validation of the PD index as a quantitative predictor of IgE cross-reactivity, by designing peptide variants with predetermined PD scores relative to known IgE epitopes. In complementary work we show how sequence motifs characteristic of allergenic proteins in protein families can be used as fingerprints for allergenicity.
Collapse
Affiliation(s)
- Ovidiu Ivanciuc
- Sealy Center for Structural Biology and Molecular Biophysics, Departments of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-0857, USA
| | | | | | | | | | | |
Collapse
|
35
|
Schiller MR. Minimotif miner: a computational tool to investigate protein function, disease, and genetic diversity. ACTA ACUST UNITED AC 2008; Chapter 2:Unit 2.12. [PMID: 18429315 DOI: 10.1002/0471140864.ps0212s48] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The Minimotif Miner Web site contains information on several hundred short functional motifs in a single database, and allows the user to search protein queries for the presence of these motifs. Scoring based on evolutionary conservation, protein surface prediction, and motif frequency can be used in conjunction with other motif programs and the known biology of the query to reduce false-positive predictions and select short motifs for experimental pursuit.
Collapse
Affiliation(s)
- Martin R Schiller
- University of Connecticut Health Center, Farmington, Connecticut, USA
| |
Collapse
|
36
|
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 2008; 9:75. [PMID: 18261238 PMCID: PMC2265698 DOI: 10.1186/1471-2164-9-75] [Citation(s) in RCA: 8609] [Impact Index Per Article: 538.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Accepted: 02/08/2008] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND The number of prokaryotic genome sequences becoming available is growing steadily and is growing faster than our ability to accurately annotate them. DESCRIPTION We describe a fully automated service for annotating bacterial and archaeal genomes. The service identifies protein-encoding, rRNA and tRNA genes, assigns functions to the genes, predicts which subsystems are represented in the genome, uses this information to reconstruct the metabolic network and makes the output easily downloadable for the user. In addition, the annotated genome can be browsed in an environment that supports comparative analysis with the annotated genomes maintained in the SEED environment. The service normally makes the annotated genome available within 12-24 hours of submission, but ultimately the quality of such a service will be judged in terms of accuracy, consistency, and completeness of the produced annotations. We summarize our attempts to address these issues and discuss plans for incrementally enhancing the service. CONCLUSION By providing accurate, rapid annotation freely to the community we have created an important community resource. The service has now been utilized by over 120 external users annotating over 350 distinct genomes.
Collapse
Affiliation(s)
- Ramy K Aziz
- Fellowship for Interpretation of Genomes, Burr Ridge, IL 60527, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Meinnel T, Giglione C. Tools for analyzing and predicting N-terminal protein modifications. Proteomics 2008; 8:626-49. [DOI: 10.1002/pmic.200700592] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
38
|
Tocharoentanaphol C, Promso S, Zelenika D, Lowhnoo T, Tongsima S, Sura T, Chantratita W, Matsuda F, Mooney S, Sakuntabhai A. Evaluation of resequencing on number of tag SNPs of 13 atherosclerosis-related genes in Thai population. J Hum Genet 2007; 53:74-86. [PMID: 18043865 DOI: 10.1007/s10038-007-0220-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2007] [Accepted: 10/29/2007] [Indexed: 11/26/2022]
Abstract
In the candidate gene approach, information about the distribution of single nucleotide polymorphisms (SNPs) is a crucial requirement for choosing efficient markers necessary for a case-control association study. To obtain such information, we discovered SNPs in 13 genes related to atherosclerosis by resequencing exon-flanking regions of 32 healthy Thai individuals. In total, 194 polymorphisms were identified, 184 of them SNPs, four insertions, and the rest deletions. Fifty-nine of the SNPs were characterized as novel polymorphisms, and these accounted for 30% of the identified SNPs. Comparing allele frequency distributions of the Thai population with other Asian populations shows similar patterns. In contrast, a low correlation pattern (r = 0.521) was found when comparing with either Caucasian or African populations. However, some rare alleles (rs11574541 and rs10874913) are found in the Thai population but not in other Asian populations. Most of the novel SNPs found were located outside the haplotype blocks generated by known SNPs in the Thai population. Only 5.77% of the novel SNPs lies in these defined haplotype blocks. The selection of haplotype-tagging SNPs shows that 8 of 13 genes benefited from the ethnic-specific genotype information. That is, when at least one novel SNP was present, the tagging SNPs chosen were altered. Functional prediction of 16 nonsynonymous SNPs (nsSNPs) by three different algorithm tools demonstrated that five nsSNPs possibly alter their corresponding protein functions. These results provide necessary information for conducting further genetic association studies involving the Thai population and demonstrate that resequencing of candidate genes provides more complete information for full genetic studies.
Collapse
Affiliation(s)
- Chintana Tocharoentanaphol
- Centre National de Génotypage, 2 rue Gaston Crémieux, CP 5721, 91057, Evry Cedex, France.
- Chulabhorn Cancer Centre, Viphavadee-Rangsit road, Laksi, Bangkok, 10120, Thailand.
| | - Somying Promso
- Department of Pathology, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, 10400, Thailand
- Center for Computational Biology and Bioinformatics, Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Dianna Zelenika
- Centre National de Génotypage, 2 rue Gaston Crémieux, CP 5721, 91057, Evry Cedex, France
| | - Tassanee Lowhnoo
- Centre National de Génotypage, 2 rue Gaston Crémieux, CP 5721, 91057, Evry Cedex, France
- Research center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, 10400, Thailand
| | - Sissades Tongsima
- National Center for Genetic Engineering and Biotechnology, Pathumthani, Thailand
| | - Thanyachai Sura
- Department of Medicine, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, 10400, Thailand
| | - Wasun Chantratita
- Department of Pathology, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, 10400, Thailand
| | - Fumihiko Matsuda
- Centre National de Génotypage, 2 rue Gaston Crémieux, CP 5721, 91057, Evry Cedex, France
| | - Sean Mooney
- Center for Computational Biology and Bioinformatics, Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Anavaj Sakuntabhai
- Department of Medicine, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, 10400, Thailand
- Laboratoire de Génétique de la réponse aux infections chez l'homme, Institut Pasteur, 75724, Paris Cedex 15, France
| |
Collapse
|
39
|
Peng FY, Reid KE, Liao N, Schlosser J, Lijavetzky D, Holt R, Martínez Zapater JM, Jones S, Marra M, Bohlmann J, Lund ST. Generation of ESTs in Vitis vinifera wine grape (Cabernet Sauvignon) and table grape (Muscat Hamburg) and discovery of new candidate genes with potential roles in berry development. Gene 2007; 402:40-50. [PMID: 17761391 DOI: 10.1016/j.gene.2007.07.016] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2007] [Revised: 06/26/2007] [Accepted: 07/17/2007] [Indexed: 11/30/2022]
Abstract
We report the generation and analysis of a total of 77,583 expressed sequence tags (ESTs) from two grapevine (Vitis vinifera L.) cultivars, Cabernet Sauvignon (wine grape) and Muscat Hamburg (table grape) with a focus on EST sequence quality and assembly optimization. The majority of the ESTs were derived from normalized cDNA libraries representing berry pericarp and seed developmental series, pooled non-berry tissues including root, flower, and leaf in Cabernet Sauvignon, and pooled tissues of berry, seed, and flower in Muscat Hamburg. EST and unigene sequence quality were determined by computational filtering coupled with small-scale contig reassembly, manual review, and BLAST analyses. EST assembly was optimized to better discriminate among closely related paralogs using two independent grape sequence sets, a previously published set of Vitis spp. gene families and our EST dataset derived from pooled leaf, flower, and root tissues of Cabernet Sauvignon. Sequence assembly within individual libraries indicated that those prepared from pooled tissues contributed the most to gene discovery. Annotations based upon searches against multiple databases including tomato and strawberry sequences helped to identify putative functions of ESTs and unigenes, particularly with respect to fleshy fruit development. Sequence comparison among the three wine grape libraries identified a number of genes preferentially expressed in the pericarp tissue, including transcription factors, receptor-like protein kinases, and hexose transporters. Gene ontology (GO) classification in the biological process aspect showed that GO categories corresponding to 'transport' and 'cell organization and biogenesis', which are associated with metabolite movement and cell wall structural changes during berry ripening, were higher in pericarp than in other tissues in the wine grape studied. The sequence data were used to characterize potential roles of new genes in berry development and composition.
Collapse
Affiliation(s)
- Fred Y Peng
- Wine Research Centre, Faculty of Land and Food Systems, University of British Columbia, 2205 East Mall, Vancouver, British Columbia, Canada V6T 1Z4
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Overbeek R, Bartels D, Vonstein V, Meyer F. Annotation of bacterial and archaeal genomes: improving accuracy and consistency. Chem Rev 2007; 107:3431-47. [PMID: 17658903 DOI: 10.1021/cr068308h] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Ross Overbeek
- Fellowship for Interpretation of Genomes, Burr Ridge, Illinois 60527, USA
| | | | | | | |
Collapse
|
41
|
Wang G, Deng J, Yang J, Zheng JJ, Wang HZ, Hu Q, Zhang ZM, Chen C, Wang D, Li ZP. Analysis on the protein structure and function of retinoic acid induced 16. Shijie Huaren Xiaohua Zazhi 2007; 15:1342-1346. [DOI: 10.11569/wcjd.v15.i12.1342] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
AIM: To analyze the protein structure and function of Homo sapiens retinoic acid induced 16 (RAI16) cDNA nucleotides sequence.
METHODS: Using some related computer software and database, RAI16 cDNA sequence and the deduced amino acid (AA) sequence were analyzed and compared with those of other species, and the secondary structure and hydrophobicity of the protein were determined and predicted.
RESULTS: RAI16 cDNA sequence consisted of 2863 nucleotides encoding a 759-AA protein. RAI16 protein included a 31-AA signal peptide. The protein had no transmembrane domain, and its subcellular localization was in endoplasmic reticulum. The secondary structure contained several kinds of conformations. RAI16 protein appeared as a compact and globular domain, and was a secretory and hydrophilicity protein (grand average of hydropathicity: -0.136). The protein was also unstable (the unstable coefficient: 53.11), with many phosphorylation and proteinase sites.
CONCLUSION: As a new signal protein, RAI16 may be involved in the signal transduction and transcription of hepatoma cell differentiation by interacting with Tec protein.
Collapse
|
42
|
Abstract
The Swiss Institute of Bioinformatics (SIB), the European Bioinformatics Institute (EBI), and the Protein Information Resource (PIR) form the Universal Protein Resource (UniProt) consortium. Its main goal is to provide the scientific community with a central resource for protein sequences and functional information. The UniProt consortium maintains the UniProt KnowledgeBase (UniProtKB) and several supplementary databases including the UniProt Reference Clusters (UniRef) and the UniProt Archive (UniParc). (1) UniProtKB is a comprehensive protein sequence knowledgebase that consists of two sections: UniProtKB/Swiss-Prot, which contains manually annotated entries, and UniProtKB/TrEMBL, which contains computer-annotated entries. UniProtKB/Swiss-Prot entries contain information curated by biologists and provide users with cross-links to about 100 external databases and with access to additional information or tools. (2) The UniRef databases (UniRef100, UniRef90, and UniRef50) define clusters of protein sequences that share 100, 90, or 50% identity. (3) The UniParc database stores and maps all publicly available protein sequence data, including obsolete data excluded from UniProtKB. The UniProt databases can be accessed online (http://www.uniprot.org/) or downloaded in several formats (ftp://ftp.uniprot.org/pub). New releases are published every 2 weeks. The purpose of this chapter is to present a guided tour of a UniProtKB/Swiss-Prot entry, paying particular attention to the specificities of plant protein annotation. We will also present some of the tools and databases that are linked to each entry.
Collapse
Affiliation(s)
- Emmanuel Boutet
- Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva, Switzerland
| | | | | | | | | |
Collapse
|
43
|
Heazlewood JL, Verboom RE, Tonti-Filippini J, Small I, Millar AH. SUBA: the Arabidopsis Subcellular Database. Nucleic Acids Res 2007; 35:D213-8. [PMID: 17071959 PMCID: PMC1635339 DOI: 10.1093/nar/gkl863] [Citation(s) in RCA: 350] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2006] [Revised: 09/14/2006] [Accepted: 10/03/2006] [Indexed: 12/02/2022] Open
Abstract
Knowledge of protein localisation contributes towards our understanding of protein function and of biological inter-relationships. A variety of experimental methods are currently being used to produce localisation data that need to be made accessible in an integrated manner. Chimeric fluorescent fusion proteins have been used to define subcellular localisations with at least 1100 related experiments completed in Arabidopsis. More recently, many studies have employed mass spectrometry to undertake proteomic surveys of subcellular components in Arabidopsis yielding localisation information for approximately 2600 proteins. Further protein localisation information may be obtained from other literature references to analysis of locations (AmiGO: approximately 900 proteins), location information from Swiss-Prot annotations (approximately 2000 proteins); and location inferred from gene descriptions (approximately 2700 proteins). Additionally, an increasing volume of available software provides location prediction information for proteins based on amino acid sequence. We have undertaken to bring these various data sources together to build SUBA, a SUBcellular location database for Arabidopsis proteins. The localisation data in SUBA encompasses 10 distinct subcellular locations, >6743 non-redundant proteins and represents the proteins encoded in the transcripts responsible for 51% of Arabidopsis expressed sequence tags. The SUBA database provides a powerful means by which to assess protein subcellular localisation in Arabidopsis (http://www.suba.bcs.uwa.edu.au).
Collapse
Affiliation(s)
- Joshua L Heazlewood
- ARC Centre of Excellence in Plant Energy Biology, School of Biomedical, Biomolecular and Chemical Sciences, The University of Western Australia 35 Stirling Highway, Crawley 6009, Western Australia, Australia.
| | | | | | | | | |
Collapse
|
44
|
Taylor PD, Attwood TK, Flower DR. Combining algorithms to predict bacterial protein sub-cellular location: Parallel versus concurrent implementations. Bioinformation 2006; 1:285-9. [PMID: 17597909 PMCID: PMC1891705 DOI: 10.6026/97320630001285] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 12/05/2006] [Indexed: 11/23/2022] Open
Abstract
We describe a novel and potentially important tool for candidate subunit vaccine selection through in silico reverse-vaccinology. A set of Bayesian networks able to make individual predictions for specific subcellular locations is implemented in three pipelines with different architectures: a parallel implementation with a confidence level-based decision engine and two serial implementations with a hierarchical decision structure, one initially rooted by prediction between membrane types and another rooted by soluble versus membrane prediction. The parallel pipeline outperformed the serial pipeline, but took twice as long to execute. The soluble-rooted serial pipeline outperformed the membrane-rooted predictor. Assessment using genomic test sets was more equivocal, as many more predictions are made by the parallel pipeline, yet the serial pipeline identifies 22 more of the 74 proteins of known location.
Collapse
Affiliation(s)
- Paul D Taylor
- The Jenner Institute, University of Oxford, Compton, Newbury, Berkshire, RG20 7NN, UK
| | - Teresa K Attwood
- Faculty of Life Sciences & School of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Darren R Flower
- The Jenner Institute, University of Oxford, Compton, Newbury, Berkshire, RG20 7NN, UK
- Darren R Flower
E-mail:
; Phone: +44 1635 577954; Fax: +44 1635 577908; Corresponding author
| |
Collapse
|
45
|
Taylor PD, Attwood TK, Flower DR. Toward bacterial protein sub-cellular location prediction: single-class discrimminant models for all gram- and gram+ compartments. Bioinformation 2006; 1:276-80. [PMID: 17597907 PMCID: PMC1891713 DOI: 10.6026/97320630001276] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 12/02/2006] [Indexed: 11/23/2022] Open
Abstract
Based on Bayesian Networks, methods were created that address protein sequence-based bacterial subcellular location prediction. Distinct predictive
algorithms for the eight bacterial subcellular locations were created. Several variant methods were explored. These variations included differences in
the number of residues considered within the query sequence - which ranged from the N-terminal 10 residues to the whole sequence - and residue representation -
which took the form of amino acid composition, percentage amino acid composition, or normalised amino acid composition. The accuracies of the best performing
networks were then compared to PSORTB. All individual location methods outperform PSORTB except for the Gram+ cytoplasmic protein predictor, for which accuracies
were essentially equal, and for outer membrane protein prediction, where PSORTB outperforms the binary predictor. The method described here is an important new
approach to method development for subcellular location prediction. It is also a new, potentially valuable tool for candidate subunit vaccine selection.
Collapse
Affiliation(s)
- Paul D Taylor
- The Jenner Institute, University of Oxford, Compton,Newbury, Berkshire, RG20 7NN, UK
| | - Teresa K Attwood
- Faculty of Life Sciences & School of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Darren R Flower
- The Jenner Institute, University of Oxford, Compton,Newbury, Berkshire, RG20 7NN, UK
- Darren R Flower
E-mail:
; Phone: +44 1635 577954; Fax: +44 1635 577908; Corresponding author
| |
Collapse
|
46
|
Taylor PD, Attwood TK, Flower DR. Multi-class subcellular location prediction for bacterial proteins. Bioinformation 2006; 1:260-4. [PMID: 17597904 PMCID: PMC1891703 DOI: 10.6026/97320630001260] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2006] [Accepted: 11/22/2006] [Indexed: 11/23/2022] Open
Abstract
Two algorithms, based onBayesian Networks (BNs), for bacterial subcellular location prediction, are explored in this paper: one predicts all locations for Gram+ bacteria and the other all locations for Gram- bacteria. Methods were evaluated using different numbers of residues (from the N-terminal 10 residues to the whole sequence) and residue representation (amino acid-composition, percentage amino acid-composition or normalised amino acid-composition). The accuracy of the best resulting BN was compared to PSORTB. The accuracy of this multi-location BN was roughly comparable to PSORTB; the difference in predictions is low, often less than 2%. The BN method thus represents both an important new avenue of methodological development for subcellular location prediction and a potentially value new tool of true utilitarian value for candidate subunit vaccine selection.
Collapse
Affiliation(s)
- Paul D Taylor
- The Jenner Institute, University of Oxford, Compton,Newbury, Berkshire, RG20 7NN, UK
- Faculty of Life Sciences & School of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Teresa K Attwood
- Faculty of Life Sciences & School of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Darren R Flower
- The Jenner Institute, University of Oxford, Compton,Newbury, Berkshire, RG20 7NN, UK
| |
Collapse
|
47
|
Badger JH, Hoover TR, Brun YV, Weiner RM, Laub MT, Alexandre G, Mrázek J, Ren Q, Paulsen IT, Nelson KE, Khouri HM, Radune D, Sosa J, Dodson RJ, Sullivan SA, Rosovitz MJ, Madupu R, Brinkac LM, Durkin AS, Daugherty SC, Kothari SP, Giglio MG, Zhou L, Haft DH, Selengut JD, Davidsen TM, Yang Q, Zafar N, Ward NL. Comparative genomic evidence for a close relationship between the dimorphic prosthecate bacteria Hyphomonas neptunium and Caulobacter crescentus. J Bacteriol 2006; 188:6841-50. [PMID: 16980487 PMCID: PMC1595504 DOI: 10.1128/jb.00111-06] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The dimorphic prosthecate bacteria (DPB) are alpha-proteobacteria that reproduce in an asymmetric manner rather than by binary fission and are of interest as simple models of development. Prior to this work, the only member of this group for which genome sequence was available was the model freshwater organism Caulobacter crescentus. Here we describe the genome sequence of Hyphomonas neptunium, a marine member of the DPB that differs from C. crescentus in that H. neptunium uses its stalk as a reproductive structure. Genome analysis indicates that this organism shares more genes with C. crescentus than it does with Silicibacter pomeroyi (a closer relative according to 16S rRNA phylogeny), that it relies upon a heterotrophic strategy utilizing a wide range of substrates, that its cell cycle is likely to be regulated in a similar manner to that of C. crescentus, and that the outer membrane complements of H. neptunium and C. crescentus are remarkably similar. H. neptunium swarmer cells are highly motile via a single polar flagellum. With the exception of cheY and cheR, genes required for chemotaxis were absent in the H. neptunium genome. Consistent with this observation, H. neptunium swarmer cells did not respond to any chemotactic stimuli that were tested, which suggests that H. neptunium motility is a random dispersal mechanism for swarmer cells rather than a stimulus-controlled navigation system for locating specific environments. In addition to providing insights into bacterial development, the H. neptunium genome will provide an important resource for the study of other interesting biological processes including chromosome segregation, polar growth, and cell aging.
Collapse
Affiliation(s)
- Jonathan H Badger
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Rohl C, Price Y, Fischer TB, Paczkowski M, Zettel MF, Tsai J. Cataloging the relationships between proteins: a review of interaction databases. Mol Biotechnol 2006; 34:69-93. [PMID: 16943573 DOI: 10.1385/mb:34:1:69] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/1999] [Revised: 11/30/1999] [Accepted: 11/30/1999] [Indexed: 11/11/2022]
Abstract
By organizing and making widely accessible the increasing amounts of data from high-throughput analyses, protein interaction databases have become an integral resource for the biological community in relating sequence data with higher-order function. To provide a sense of the use and applicability of these databases, we describe each of the major comprehensive interaction databases as well as some of the more specialized ones. Content description, search/browse functionalities, and data presentation are discussed. A succinct explanation of database contents helps the user quickly identify whether the database contains applicable information to their research interest. Broad levels of search/browse functions as well as descriptions/examples allow users to quickly find and access pertinent data. At this point, clear presentation of search results as well as the primary content is necessary. Many databases display information graphically or divided into smaller digestible parts over a number of tabbed/linked pages. In addition, cross-linking between the databases promotes interconnectivity of the data and is an added layer of relational data for the user. Overall, although these protein interaction databases are under continual improvement, their current state shows that much time and effort has gone into organizing and presenting these large sets of data-describing protein interactions.
Collapse
Affiliation(s)
- Carol Rohl
- Rosetta Inpharmatics LLC, Seattle, WA, USA
| | | | | | | | | | | |
Collapse
|
49
|
Heazlewood JL, Tonti-Filippini J, Verboom RE, Millar AH. Combining experimental and predicted datasets for determination of the subcellular location of proteins in Arabidopsis. PLANT PHYSIOLOGY 2005; 139:598-609. [PMID: 16219920 PMCID: PMC1255979 DOI: 10.1104/pp.105.065532] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2005] [Revised: 08/03/2005] [Accepted: 08/08/2005] [Indexed: 05/04/2023]
Abstract
Substantial experimental datasets defining the subcellular location of Arabidopsis (Arabidopsis thaliana) proteins have been reported in the literature in the form of organelle proteomes built from mass spectrometry data (approximately 2,500 proteins). Subcellular location for specific proteins has also been published based on imaging of chimeric fluorescent fusion proteins in intact cells (approximately 900 proteins). Further, the more diverse history of biochemical determination of subcellular location is stored in the entries of the Swiss-Prot database for the products of many Arabidopsis genes (approximately 1,800 proteins). Combined with the range of bioinformatic targeting prediction tools and comparative genomic analysis, these experimental datasets provide a powerful basis for defining the final location of proteins within the wide variety of subcellular structures present inside Arabidopsis cells. We have analyzed these published experimental and prediction data to answer a range of substantial questions facing researchers about the veracity of these approaches to determining protein location and their interrelatedness. We have merged these data to form the subcellular location database for Arabidopsis proteins (SUBA), providing an integrated understanding of protein location, encompassing the plastid, mitochondrion, peroxisome, nucleus, plasma membrane, endoplasmic reticulum, vacuole, Golgi, cytoskeleton structures, and cytosol (www.suba.bcs.uwa.edu.au). This includes data on more than 4,400 nonredundant Arabidopsis protein sequences. We also provide researchers with an online resource that may be used to query protein sets or protein families and determine whether predicted or experimental location data exist; to analyze the nature of contamination between published proteome sets; and/or for building theoretical subcellular proteomes in Arabidopsis using the latest experimental data.
Collapse
Affiliation(s)
- Joshua L Heazlewood
- Australian Research Council Centre of Excellence in Plant Energy Biology, University of Western Australia, Crawley
| | | | | | | |
Collapse
|
50
|
Affiliation(s)
- Paul Bertone
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | | |
Collapse
|