Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li P, Piao Y, Shon HS, Ryu KH. Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data. BMC Bioinformatics 2015;16:347. [PMID: 26511205 PMCID: PMC4625728 DOI: 10.1186/s12859-015-0778-7] [Citation(s) in RCA: 103] [Impact Index Per Article: 11.4] [Received: 06/03/2015] [Accepted: 10/14/2015] [Indexed: 01/08/2023] Open

Cited by Other Article(s)

Paton V, Ramirez Flores RO, Gabor A, Badia-I-Mompel P, Tanevski J, Garrido-Rodriguez M, Saez-Rodriguez J. Assessing the impact of transcriptomics data analysis pipelines on downstream functional enrichment results. Nucleic Acids Res 2024:gkae552. [PMID: 38943333 DOI: 10.1093/nar/gkae552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/27/2023] [Revised: 06/03/2024] [Accepted: 06/19/2024] [Indexed: 07/01/2024] Open

Jiang G, Zheng JY, Ren SN, Yin W, Xia X, Li Y, Wang HL. A comprehensive workflow for optimizing RNA-seq data analysis. BMC Genomics 2024;25:631. [PMID: 38914930 PMCID: PMC11197194 DOI: 10.1186/s12864-024-10414-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/26/2024] [Accepted: 05/15/2024] [Indexed: 06/26/2024] Open

Abstract

BACKGROUND

Current RNA-seq analysis software tends to use similar parameters across different species without considering species-specific differences. However, the suitability and accuracy of these tools may vary when analyzing data from different species, such as humans, animals, plants, fungi, and bacteria. For most laboratory researchers lacking a background in information science, determining how to construct an analysis workflow that meets their specific needs from the array of complex analytical tools available poses a significant challenge.

RESULTS

By utilizing RNA-seq data from plants, animals, and fungi, it was observed that different analytical tools demonstrate some variations in performance when applied to different species. A comprehensive experiment was conducted specifically for analyzing plant pathogenic fungal data, focusing on differential gene analysis as the ultimate goal. In this study, 288 pipelines using different tools were applied to analyze five fungal RNA-seq datasets, and the performance of their results was evaluated based on simulation. This led to the establishment of a relatively universal and superior fungal RNA-seq analysis pipeline that can serve as a reference, and certain standards for selecting analysis tools were derived for reference. Additionally, we compared various tools for alternative splicing analysis. The results based on simulated data indicated that rMATS remained the optimal choice, although consideration could be given to supplementing with tools such as SpliceWiz.

CONCLUSION

The experimental results demonstrate that, in comparison to the default software parameter configurations, the analysis combination results after tuning can provide more accurate biological insights. It is beneficial to carefully select suitable analysis software based on the data, rather than indiscriminately choosing tools, in order to achieve high-quality analysis results more efficiently.

Xiao J, Yao X, Guan X, Xiong J, Fang Y, Zhang J, Zhang Y, Moming A, Su Z, Jin J, Ge Y, Wang J, Fan Z, Tang S, Shen S, Deng F. Viromes of Haemaphysalis longicornis reveal different viral abundance and diversity in free and engorged ticks. Virol Sin 2024;39:194-204. [PMID: 38360150 DOI: 10.1016/j.virs.2024.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/09/2023] [Accepted: 02/08/2024] [Indexed: 02/17/2024] Open

Affiliation(s)

Jian Xiao Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China; University of Chinese Academy of Sciences, Beijing, 101408, China
Xuan Yao Hubei Provincial Center for Disease Control and Prevention, Wuhan, 430070, China
Xuhua Guan Hubei Provincial Center for Disease Control and Prevention, Wuhan, 430070, China
Jinfeng Xiong Hubei Provincial Center for Disease Control and Prevention, Wuhan, 430070, China
Yaohui Fang Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Jingyuan Zhang Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
You Zhang Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China; Current address: Department of Medical Laboratory, The Second Affiliated Hospital, Hainan Medical University, Haikou, 57000, China
Abulimiti Moming Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China; Xinjiang Key Laboratory of Vector-borne Infectious Diseases, Urumqi, 830002, China
Zhengyuan Su Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Jiayin Jin Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Yingying Ge Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Jun Wang Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Zhaojun Fan Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Shuang Tang Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
Shu Shen Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China; Hubei Jiangxia Laboratory, Wuhan, 430200, China; Xinjiang Key Laboratory of Vector-borne Infectious Diseases, Urumqi, 830002, China.
Fei Deng Key Laboratory of Virology and Biosafety and National Virus Resource Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China.

Singh V, Kirtipal N, Song B, Lee S. Normalization of RNA-Seq data using adaptive trimmed mean with multi-reference. Brief Bioinform 2024;25:bbae241. [PMID: 38770720 PMCID: PMC11107385 DOI: 10.1093/bib/bbae241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/09/2024] [Revised: 04/04/2024] [Accepted: 05/07/2024] [Indexed: 05/22/2024] Open

Zou C, Tan H, Huang K, Zhai R, Yang M, Huang A, Wei X, Mo R, Xiong F. Physiological Characteristic Changes and Transcriptome Analysis of Maize (Zea mays L.) Roots under Drought Stress. Int J Genomics 2024;2024:5681174. [PMID: 38269194 PMCID: PMC10807950 DOI: 10.1155/2024/5681174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/13/2023] [Revised: 10/08/2023] [Accepted: 12/18/2023] [Indexed: 01/26/2024] Open

Wang G, Tian X, Peng R, Huang Y, Li Y, Li Z, Hu X, Luo Z, Zhang Y, Cui X, Niu L, Lu G, Yang F, Gao L, Chan JFW, Jin Q, Yin F, Tang C, Ren Y, Du J. Genomic and phylogenetic profiling of RNA of tick-borne arboviruses in Hainan Island, China. Microbes Infect 2024;26:105218. [PMID: 37714509 DOI: 10.1016/j.micinf.2023.105218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/01/2023] [Revised: 09/11/2023] [Accepted: 09/11/2023] [Indexed: 09/17/2023]

Affiliation(s)

Gaoyu Wang Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Xiuying Tian Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Ruoyan Peng Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Yi Huang Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Youyou Li Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Zihan Li NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China; Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Xiaoyuan Hu Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Zufen Luo Department of Infectious Disease, the Second Affiliated Hospital of Hainan Medical University, Haikou, 570216, China
Yun Zhang Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Xiuji Cui Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Lina Niu Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Gang Lu Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Fan Yang NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China
Lei Gao NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China
Jasper Fuk-Woo Chan State Key Laboratory of Emerging Infectious Diseases, Department of Microbiology, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong Special Administrative Region, China
Qi Jin NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China
Feifei Yin Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China
Chuanning Tang Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China.
Yi Ren Haikou Maternal and Child Health Hospital, Haikou, 570102, China.
Jiang Du NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China; Hainan Medical University-The University of Hong Kong Joint Laboratory of Tropical Infectious Diseases, Key Laboratory of Tropical Translational Medicine of Ministry of Education, Hainan Medical University, Haikou, 571199, China.

Xia Y. Statistical normalization methods in microbiome data with application to microbiome cancer research. Gut Microbes 2023;15:2244139. [PMID: 37622724 PMCID: PMC10461514 DOI: 10.1080/19490976.2023.2244139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/20/2023] [Revised: 07/12/2023] [Accepted: 07/31/2023] [Indexed: 08/26/2023] Open

Hu Y, Wang L, Yang G, Wang S, Guo M, Lu H, Zhang T. VDR promotes testosterone synthesis in mouse Leydig cells via regulation of cholesterol side chain cleavage cytochrome P450 (Cyp11a1) expression. Genes Genomics 2023;45:1377-1387. [PMID: 37747642 DOI: 10.1007/s13258-023-01444-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/10/2022] [Accepted: 09/30/2022] [Indexed: 09/26/2023]

Abstract

BACKGROUND

The vitamin D receptor (VDR) mediates the pleiotropic biological actions that include osteoporosis, immune responses and androgen synthesis. VDR is widely expressed in testis cells such as Leydig cells, Sertoli cells, and sperm. The levels of steroids are critical for sexual development. In the early stage of steroidogenesis, cholesterol is converted to pregnenolone (precursor of most steroid hormones) by cholesterol side-chain lyase (CYP11A1), which eventually synthesizes the male hormone testosterone.

OBJECTIVE

This study aims to reveal how VDR regulates CYP11A1 expression and affects testosterone synthesis in murine Leydig cells.

METHODS

The levels of VDR and CYP11A1 were determined by quantitative real-time polymerase chain reaction (RT-qPCR) or western blot. The targeting relationship between VDR and Cyp11a1 was evaluated by dual-luciferase reporter assay. Testosterone concentrations in cell culture media and serum were measured by enzyme-linked immunosorbent assay (ELISA).

RESULTS

Phylogenetic and motif analysis showed that the Cyp11a1 family had sequence loss, which may have special biological functions during evolution. The results of promoter prediction showed that vitamin D response element (VDRE) existed in the upstream promoter region of murine Cyp11a1. Dual-luciferase assay confirmed that VDR could bind candidate VDREs in the upstream region of Cyp11a1 and enhance gene expression. Tissue distribution and localization analysis showed that Cyp11a1 was mainly expressed in testis, and dominantly existed in murine Leydig cells. Furthermore, over-expression of VDR and CYP11A1 significantly increased testosterone synthesis in mouse Leydig cells.

CONCLUSIONS

Active vitamin D3 (VD3) and Vdr interference treatment showed that VD3/VDR had a positive regulatory effect on Cyp11a1 expression and testosterone secretion. VDR promotes testosterone synthesis in male mice by up-regulating Cyp11a1 expression, which plays an important role in male reproduction.

Stokes T, Cen HH, Kapranov P, Gallagher IJ, Pitsillides AA, Volmar C, Kraus WE, Johnson JD, Phillips SM, Wahlestedt C, Timmons JA. Transcriptomics for Clinical and Experimental Biology Research: Hang on a Seq. Advanced Genetics (Hoboken, N.J.) 2023;4:2200024. [PMID: 37288167 PMCID: PMC10242409 DOI: 10.1002/ggn2.202200024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2022] [Indexed: 06/09/2023]

Altay G, Zapardiel-Gonzalo J, Peters B. RNA-seq preprocessing and sample size considerations for gene network inference. bioRxiv: The Preprint Server for Biology 2023:2023.01.02.522518. [PMID: 36711979 PMCID: PMC9881880 DOI: 10.1101/2023.01.02.522518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/18/2023]

Abstract

Background

Gene network inference (GNI) methods have the potential to reveal functional relationships between different genes and their products. Most GNI algorithms have been developed for microarray gene expression datasets and their application to RNA-seq data is relatively recent. As the characteristics of RNA-seq data are different from microarray data, it is an unanswered question what preprocessing methods for RNA-seq data should be applied prior to GNI to attain optimal performance, or what the required sample size for RNA-seq data is to obtain reliable GNI estimates.

Results

We ran 9144 analyses of 7 different RNA-seq datasets to evaluate 300 different preprocessing combinations that include data transformations, normalizations and association estimators. We found that there was no single best performing preprocessing combination but that there were several good ones. The performance varied widely over various datasets, which emphasized the importance of choosing an appropriate preprocessing configuration before GNI. Two preprocessing combinations appeared promising in general: First, Log-2 TPM (transcript per million) with Variance-stabilizing transformation (VST) and Pearson Correlation Coefficient (PCC) association estimator. Second, raw RNA-seq count data with PCC. Along with these two, we also identified 18 other good preprocessing combinations. Any of these algorithms might perform best in different datasets. Therefore, the GNI performances of these approaches should be measured on any new dataset to select the best performing one for it. In terms of the required biological sample size of RNA-seq data, we found that between 30 and 85 samples were required to generate reliable GNI estimates.

Conclusions

This study provides practical recommendations on default choices for data preprocessing prior to GNI analysis of RNA-seq data to obtain optimal performance results.
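
As a rough illustration of the first combination named in the abstract above, the sketch below computes log2 TPM values and a gene-by-gene Pearson correlation matrix with NumPy. It is not the authors' pipeline: the VST step is omitted, and the function names, the pseudocount of 1, and the genes-by-samples input layout are assumptions made for this example.

```python
import numpy as np

def tpm(counts, lengths_bp):
    """Transcripts per million from raw counts.

    counts: genes x samples array of read counts (assumed layout).
    lengths_bp: one gene length in base pairs per row of counts.
    """
    counts = np.asarray(counts, dtype=float)
    lengths_kb = np.asarray(lengths_bp, dtype=float) / 1000.0
    # Reads per kilobase, then rescale each sample (column) to sum to 1e6.
    rate = counts / lengths_kb[:, None]
    return rate / rate.sum(axis=0, keepdims=True) * 1e6

def log2_tpm_pcc(counts, lengths_bp, pseudocount=1.0):
    """Gene x gene Pearson correlation on log2(TPM + pseudocount).

    The pseudocount is an assumption of this sketch; it avoids log2(0).
    A gene with constant expression across samples yields NaN entries.
    """
    log_tpm = np.log2(tpm(counts, lengths_bp) + pseudocount)
    # np.corrcoef treats each row as one variable, so rows (genes) are correlated.
    return np.corrcoef(log_tpm)
```

The resulting matrix can serve as the association scores that a network inference step would threshold or rank; how that is done is outside the scope of this sketch.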

Lucena-Leandro VS, Abreu EFA, Vidal LA, Torres CR, Junqueira CICVF, Dantas J, Albuquerque ÉVS. Current Scenario of Exogenously Induced RNAi for Lepidopteran Agricultural Pest Control: From dsRNA Design to Topical Application. Int J Mol Sci 2022;23:ijms232415836. [PMID: 36555476 PMCID: PMC9785151 DOI: 10.3390/ijms232415836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/07/2022] [Revised: 11/24/2022] [Accepted: 11/25/2022] [Indexed: 12/24/2022] Open

Abstract

Invasive insects cost the global economy around USD 70 billion per year. Moreover, increasing agricultural insect pests raise concerns about constraints on global food security and rising infestation under climate change. Current agricultural pest management largely relies on plant breeding (with or without transgenes) and chemical pesticides. Both approaches face serious technological obsolescence in the field due to plant resistance breakdown or development of insecticide resistance. The need for new modes of action (MoA) for managing crop health is growing each year, driven by market demands to reduce economic losses and by consumer demand for phytosanitary measures. The disabling of pest genes through sequence-specific expression silencing is a promising tool in the development of environmentally friendly and safe biopesticides. The specificity conferred by long dsRNA-based solutions helps minimize effects on off-target genes in the insect pest genome and on the target gene in non-target organisms (NTOs). In this review, we summarize the status of gene silencing by RNA interference (RNAi) for agricultural control. More specifically, we focus on the engineering, development and application of gene silencing to control Lepidoptera through non-transforming dsRNA technologies. Despite some delivery and stability drawbacks of topical applications, we reviewed works showing convincing proof-of-concept results that point to innovative solutions. Considerations about the regulation of the ongoing research on dsRNA-based pesticides to produce commercialized products for exogenous application are discussed. Academic and industry initiatives have revealed a worthy effort to control Lepidoptera pests with this new mode of action, which provides more sustainable and reliable technologies for field management. New data on the genomics of this taxon may contribute to a future customized target gene portfolio. As a case study, we illustrate how dsRNA and associated methodologies could be applied to control an important lepidopteran coffee pest.

Costa-Silva J, Domingues DS, Menotti D, Hungria M, Lopes FM. Temporal progress of gene expression analysis with RNA-Seq data: A review on the relationship between computational methods. Comput Struct Biotechnol J 2022;21:86-98. [PMID: 36514333 PMCID: PMC9730150 DOI: 10.1016/j.csbj.2022.11.051] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 08/15/2022] [Revised: 11/25/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022] Open

Kochhar P, Vukku M, Rajashekhar R, Mukhopadhyay A. microRNA signatures associated with fetal growth restriction: a systematic review. Eur J Clin Nutr 2022;76:1088-1102. [PMID: 34741137 DOI: 10.1038/s41430-021-01041-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 08/05/2020] [Revised: 10/17/2021] [Accepted: 10/19/2021] [Indexed: 12/20/2022]

Thawng CN, Smith GB. A transcriptome software comparison for the analyses of treatments expected to give subtle gene expression responses. BMC Genomics 2022;23:452. [PMID: 35725382 PMCID: PMC9208185 DOI: 10.1186/s12864-022-08673-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 01/11/2022] [Accepted: 05/26/2022] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

In this comparative study we evaluate the performance of four software tools: DNAstar-D (DESeq2), DNAstar-E (edgeR), CLC Genomics and Partek Flow for identification of differentially expressed genes (DEGs) using a transcriptome of E. coli. The RNA-seq data are from the effect of below-background radiation 5.5 nGy total dose (0.2nGy/hr) on E. coli grown shielded from natural radiation 655 m below ground in a pre-World War II steel vault. The gene expression response to three supplemented sources of radiation designed to mimic natural background, 1952 - 5720 nGy in total dose (71-208 nGy/hr), are compared to this "radiation-deprived" treatment. In addition, RNA-seq data of Caenorhabditis elegans nematode from similar radiation treatments was analyzed by three of the software packages.

RESULTS

In E. coli, the four software programs identified one of the supplementary sources of radiation (KCl) to evoke about 5 times more transcribed genes than the minus-radiation treatment (69-114 differentially expressed genes, DEGs), and so the rest of the analyses used this KCl vs "Minus" comparison. After imposing a 30-read minimum cutoff, one of the DNAStar options shared two of the three steps (mapping, normalization, and statistic) with Partek Flow (they both used median of ratios to normalize and the DESeq2 statistical package), and these two programs identified the highest number of DEGs in common with each other (53). In contrast, when the programs used different approaches in each of the three steps, between 31 and 40 DEGs were found in common. Regarding the extent of expression differences, three of the four programs gave high fold-change results (15-178 fold), but one (DNAstar's DESeq2) resulted in more conservative fold-changes (1.5-3.5). In a parallel study comparing three qPCR commercial validation software programs, these programs also gave variable results as to which genes were significantly regulated. Similarly, the C. elegans analysis showed exaggerated fold-changes in CLC and DNAstar's edgeR while DNAstar-D was more conservative.

CONCLUSIONS

Regarding the extent of expression (fold-change), and considering the subtlety of the very low level radiation treatments, in E. coli three of the four programs gave what we consider exaggerated fold-change results (15 - 178 fold), but one (DNAstar's DESeq2) gave more realistic fold-changes (1.5-3.5). When RT-qPCR validation comparisons to transcriptome results were carried out, they supported the more conservative DNAstar-D's expression results. When another model organism's (nematode) response to these radiation differences was similarly analyzed, DNAstar-D also resulted in the most conservative expression patterns. Therefore, we would propose DESeq2 ("DNAstar-D") as an appropriate software tool for differential gene expression studies for treatments expected to give subtle transcriptome responses.
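
The abstract above mentions median-of-ratios normalization without describing it. The sketch below shows the general idea as it is commonly presented for DESeq-style size factors; it is an assumption-laden illustration in NumPy, not the code used by DNAstar, Partek Flow, or DESeq2 itself, and the function names are hypothetical.

```python
import numpy as np

def median_of_ratios_size_factors(counts):
    """Per-sample size factors by the median-of-ratios idea.

    counts: genes x samples array of raw counts (assumed layout).
    Only genes with a nonzero count in every sample are used, since the
    geometric mean of a row containing a zero is zero. If no gene passes
    this filter, the result is NaN.
    """
    counts = np.asarray(counts, dtype=float)
    keep = (counts > 0).all(axis=1)
    log_counts = np.log(counts[keep])
    # Log of the per-gene geometric mean across samples (the pseudo-reference).
    log_geo_mean = log_counts.mean(axis=1, keepdims=True)
    # Size factor = median over genes of count / pseudo-reference, per sample.
    return np.exp(np.median(log_counts - log_geo_mean, axis=0))

def normalize_counts(counts):
    """Divide each sample (column) by its size factor."""
    size_factors = median_of_ratios_size_factors(counts)
    return np.asarray(counts, dtype=float) / size_factors
```

For two samples where the second has exactly twice the sequencing depth of the first, the size factors differ by a factor of two and the normalized counts of the shared genes coincide.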

Analysis of Gut Microbiome Structure Based on GMPR+Spectrum. Applied Sciences-Basel 2022. [DOI: 10.3390/app12125895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/01/2023]

Sun X, Xu H, Liu G, Chen J, Xu J, Li M, Liu L. A Robust Immuno-Prognostic Model of Non-Muscle-Invasive Bladder Cancer Indicates Dynamic Interaction in Tumor Immune Microenvironment Contributes to Cancer Progression. Front Genet 2022;13:833989. [PMID: 35719408 PMCID: PMC9205430 DOI: 10.3389/fgene.2022.833989] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/12/2021] [Accepted: 04/28/2022] [Indexed: 12/24/2022] Open

Vandenbon A. Evaluation of critical data processing steps for reliable prediction of gene co-expression from large collections of RNA-seq data. PLoS One 2022;17:e0263344. [PMID: 35089979 PMCID: PMC8797241 DOI: 10.1371/journal.pone.0263344] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 09/22/2021] [Accepted: 01/16/2022] [Indexed: 11/19/2022] Open

Khan RIN, Sahu AR, Malla WA, Praharaj MR, Hosamani N, Kumar S, Gupta S, Sharma S, Saxena A, Varshney A, Singh P, Verma V, Kumar P, Singh G, Pandey A, Saxena S, Gandham RK, Tiwari AK. Systems biology under heat stress in Indian cattle. Gene 2021;805:145908. [PMID: 34411649 DOI: 10.1016/j.gene.2021.145908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2021] [Revised: 08/11/2021] [Accepted: 08/13/2021] [Indexed: 11/26/2022]

Affiliation(s)

Raja Ishaq Nabi Khan Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Amit Ranjan Sahu Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Waseem Akram Malla Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Manas Ranjan Praharaj Computational Biology and Genomics, National Institute of Animal Biotechnology, Hyderabad, India
Neelima Hosamani Computational Biology and Genomics, National Institute of Animal Biotechnology, Hyderabad, India
Shakti Kumar Computational Biology and Genomics, National Institute of Animal Biotechnology, Hyderabad, India
Smita Gupta Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Shweta Sharma Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Archana Saxena Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Anshul Varshney Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Pragya Singh Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Vinay Verma Division of Physiology and Climatology, Indian Veterinary Research Institute, Bareilly, India
Puneet Kumar Division of Physiology and Climatology, Indian Veterinary Research Institute, Bareilly, India
Gyanendra Singh Division of Physiology and Climatology, Indian Veterinary Research Institute, Bareilly, India
Aruna Pandey Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Shikha Saxena Division of Veterinary Biotechnology, Indian Veterinary Research Institute, Bareilly, India
Ravi Kumar Gandham Computational Biology and Genomics, National Institute of Animal Biotechnology, Hyderabad, India.
Ashok Kumar Tiwari Division of Biological Standardization, Indian Veterinary Research Institute, Bareilly, India.

Helmy M, Agrawal R, Ali J, Soudy M, Bui TT, Selvarajoo K. GeneCloudOmics: A Data Analytic Cloud Platform for High-Throughput Gene Expression Analysis. Frontiers in Bioinformatics 2021;1:693836. [PMID: 36303746 PMCID: PMC9581002 DOI: 10.3389/fbinf.2021.693836] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/07/2021] [Accepted: 10/14/2021] [Indexed: 11/18/2022] Open

Abstract

Gene expression profiling techniques, such as DNA microarray and RNA-Sequencing, have had a significant impact on our understanding of biological systems. They contribute to almost all aspects of biomedical research, including studying developmental biology, host-parasite relationships, disease progression and drug effects. However, high-throughput data generation presents challenges for many wet-lab experimentalists who want to analyze and take full advantage of such rich and complex data. Here we present GeneCloudOmics, an easy-to-use web server for high-throughput gene expression analysis that extends the functionality of our previous ABioTrans with several new tools, including protein datasets analysis, and a web interface. GeneCloudOmics allows both microarray and RNA-Seq data analysis with a comprehensive range of data analytics tools in one package, which no other current standalone software or web-based tool offers. In total, GeneCloudOmics provides the user access to 23 different data analytical and bioinformatics tasks including reads normalization, scatter plots, linear/non-linear correlations, PCA, clustering (hierarchical, k-means, t-SNE, SOM), differential expression analyses, pathway enrichments, evolutionary analyses, pathological analyses, and protein-protein interaction (PPI) identifications. Furthermore, GeneCloudOmics allows the direct import of gene expression data from the NCBI Gene Expression Omnibus database. The user can perform all tasks rapidly through an intuitive graphical user interface that overcomes the hassle of coding, installing tools/packages/libraries and dealing with operating systems compatibility and version issues, complications that make data analysis tasks challenging for biologists. Thus, GeneCloudOmics is a one-stop open-source tool for gene expression data analysis and visualization. It is freely available at http://combio-sifbi.org/GeneCloudOmics.

Constructing a Defined Starter for Multispecies Vinegar Fermentation via Evaluating the Vitality and Dominance of Functional Microbes in Autochthonous Starter. Appl Environ Microbiol 2021;88:e0217521. [PMID: 34818103 DOI: 10.1128/aem.02175-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

Mature vinegar culture has usually been used as a type of autochthonous starter for rapidly initiating the next batch of acetic acid fermentation (AAF) and maintaining the batch-to-batch uniformity of AAF in the production of traditional cereal vinegar. However, the vitality and dominance of functional microbes in autochthonous starters remain unclear, which hinders further improvement of fermentation yield and production. Here, based on metagenomic (MG), metatranscriptomic (MT), and 16S rRNA gene sequencing, 11 bacterial operational taxonomic units (OTUs) with significant metabolic activity (MT/MG ratio >1) and dominance (relative abundance >1%) were targeted in the autochthonous vinegar starter, all of which were assigned to 4 species (Acetobacter pasteurianus, Lactobacillus acetotolerans, L. helveticus, Acetilactobacillus jinshanensis). Then, we evaluated the successions and interactions of these 11 bacterial OTUs at different AAF stages. Last, a defined starter was constructed with 4 core species isolated from the autochthonous starter (A. pasteurianus, L. acetotolerans, L. helveticus, Ac. jinshanensis). The defined starter culture could rapidly initiate the AAF in a sterile or unsterilized environment, and similar dynamics of metabolites (ethanol, titratable acidity, acetic acid, lactic acid, and volatile compounds) and environmental indexes (temperature, pH) of fermentation were observed compared with those of the autochthonous starter (P > 0.05). This work provides a method to construct a defined microbiota from a complex system while preserving its metabolic function. IMPORTANCE Complex microorganisms are beneficial to the flavor formation in natural food fermentation, but they also pose challenges to the mass production of standardized products. It is attractive to construct a defined starter to rapidly initiate the fermentation process and significantly improve fermentation yield. 
This study provides a comprehensive understanding of vital and dominant species in the autochthonous vinegar starter via multi-omics, and designs a defined microbial community for the efficient fermentation of cereal vinegar.

Sobreiro MB, Collevatti RG, Dos Santos YLA, Bandeira LF, Lopes FJF, Novaes E. RNA-Seq reveals different responses to drought in Neotropical trees from savannas and seasonally dry forests. BMC PLANT BIOLOGY 2021;21:463. [PMID: 34641780 PMCID: PMC8507309 DOI: 10.1186/s12870-021-03244-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 09/24/2021] [Indexed: 05/24/2023]

Abstract

BACKGROUND

Water is one of the main limiting factors for plant growth and crop productivity. Plants constantly monitor water availability and can rapidly adjust their metabolism by altering gene expression. This leads to phenotypic plasticity, which aids rapid adaptation to climate changes. Here, we address phenotypic plasticity under drought stress by analyzing differentially expressed genes (DEG) in four phylogenetically related neotropical Bignoniaceae tree species: two from savanna, Handroanthus ochraceus and Tabebuia aurea, and two from seasonally dry tropical forests (SDTF), Handroanthus impetiginosus and Handroanthus serratifolius. To the best of our knowledge, this is the first report of an RNA-Seq study comparing tree species from seasonally dry tropical forest and savanna ecosystems.

RESULTS

Using a completely randomized block design with 4 species × 2 treatments (drought and wet) × 3 blocks (24 plants) and an RNA-seq approach, we detected a higher number of DEGs between treatments for the SDTF species H. serratifolius (3153 up-regulated and 2821 down-regulated under drought) and H. impetiginosus (332 and 207), than for the savanna species. H. ochraceus showed the lowest number of DEGs, with only five up- and nine down-regulated genes, while T. aurea exhibited 242 up- and 96 down-regulated genes. The number of shared DEGs among species was not related to habitat of origin or phylogenetic relationship, since both T. aurea and H. impetiginosus shared a similar number of DEGs with H. serratifolius. All four species shared a low number of enriched gene ontology (GO) terms and, in general, exhibited different mechanisms of response to water deficit. We also found 175 down-regulated and 255 up-regulated transcription factors from several families, indicating the importance of these master regulators in drought response.

CONCLUSION

Our findings show that phylogenetically related species may respond differently at the gene expression level to drought stress. Savanna species seem to be less responsive to drought at the transcriptional level, likely due to morphological and anatomical adaptations to seasonal drought. The species with the largest geographic range and widest edaphic-climatic niche, H. serratifolius, was the most responsive, exhibiting the highest number of DEGs and up- and down-regulated transcription factors (TFs).

Zhang Y, Hu B, Agwanda B, Fang Y, Wang J, Kuria S, Yang J, Masika M, Tang S, Lichoti J, Fan Z, Shi Z, Ommeh S, Wang H, Deng F, Shen S. Viromes and surveys of RNA viruses in camel-derived ticks revealing transmission patterns of novel tick-borne viral pathogens in Kenya. Emerg Microbes Infect 2021;10:1975-1987. [PMID: 34570681 PMCID: PMC8525980 DOI: 10.1080/22221751.2021.1986428] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Affiliation(s)

You Zhang State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China; University of Chinese Academy of Sciences, Beijing, People's Republic of China
Ben Hu CAS Key Laboratory of Special Pathogens and Biosafety, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Bernard Agwanda Department of Zoology, National Museums of Kenya, Nairobi, Kenya
Yaohui Fang State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China; University of Chinese Academy of Sciences, Beijing, People's Republic of China
Jun Wang State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Stephen Kuria Institute For Biotechnology Research (IBR), Jomo Kenyatta University of Agriculture and Technology (JKUAT), Nairobi, Kenya
Juan Yang State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Moses Masika Department of Medical Microbiology, University of Nairobi, Nairobi, Kenya
Shuang Tang State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Jacqueline Lichoti Directorate of Veterinary Services, State Department of Livestock, Ministry of Agriculture, Livestock, Fisheries and Irrigation, Nairobi, Kenya
Zhaojun Fan State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Zhengli Shi CAS Key Laboratory of Special Pathogens and Biosafety, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Sheila Ommeh Institute For Biotechnology Research (IBR), Jomo Kenyatta University of Agriculture and Technology (JKUAT), Nairobi, Kenya
Hualin Wang State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Fei Deng State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China
Shu Shen State Key Laboratory of Virology and National Virus Resource Centre, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, People's Republic of China

Genome-Wide Identification and Transcriptional Expression Profiles of Transcription Factor WRKY in Common Walnut (Juglans regia L.). Genes (Basel) 2021;12:genes12091444. [PMID: 34573426 PMCID: PMC8466090 DOI: 10.3390/genes12091444] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 09/07/2021] [Accepted: 09/17/2021] [Indexed: 11/16/2022] Open

Abstract

The transcription factor WRKY is widely distributed in the plant kingdom, playing a significant role in plant growth, development and response to stresses. Walnut is an economically important temperate tree species valued for both its edible nuts and high-quality wood, and its response to various stresses is an important factor that determines the quality of its fruit. However, in walnut trees themselves, information about the WRKY gene family remains scarce. In this paper, we perform a comprehensive study of the WRKY gene family in walnut. In total, we identified 103 WRKY genes in the common walnut that are clustered into 4 groups and distributed on 14 chromosomes. The conserved domains all contained a WRKY domain, and motif 2 was observed in most WRKYs, suggesting a high degree of conservation and similar functions within each subfamily. However, gene structure was significantly differentiated between different subfamilies. Synteny analysis indicates that there were 56 gene pairs in J. regia and A. thaliana, 76 in J. regia and J. mandshurica, 75 in J. regia and J. microcarpa, 76 in J. regia and P. trichocarpa, and 33 in J. regia and Q. robur, indicating that the WRKY gene family may come from a common ancestor. GO and KEGG enrichment analysis showed that the WRKY gene family was involved in resistance traits and the plant-pathogen interaction pathway. In anthracnose-resistant F26 fruits (AR) and anthracnose-susceptible F423 fruits (AS), transcriptome and qPCR analysis results showed that JrWRKY83, JrWRKY73 and JrWRKY74 were expressed significantly more highly in resistant cultivars, indicating that these three genes may be important contributors to stress resistance in walnut trees. Furthermore, we investigate how these three genes potentially target miRNAs and interact with proteins. JrWRKY73 was targeted by the miR156 family, including 12 miRNAs; this miRNA family targets WRKY genes to enhance plant defense. 
JrWRKY73 also interacted with the resistance gene AtMPK6, showing that it may play a crucial role in walnut defense.

Assessment of reference genes at six different developmental stages of Schistosoma mansoni for quantitative RT-PCR. Sci Rep 2021;11:16816. [PMID: 34413342 PMCID: PMC8376997 DOI: 10.1038/s41598-021-96055-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 07/31/2021] [Indexed: 12/13/2022] Open

Carmona-Mora P, Ander BP, Jickling GC, Dykstra-Aiello C, Zhan X, Ferino E, Hamade F, Amini H, Hull H, Sharp FR, Stamova B. Distinct peripheral blood monocyte and neutrophil transcriptional programs following intracerebral hemorrhage and different etiologies of ischemic stroke. J Cereb Blood Flow Metab 2021;41:1398-1416. [PMID: 32960689 PMCID: PMC8142129 DOI: 10.1177/0271678x20953912] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 07/07/2020] [Accepted: 07/29/2020] [Indexed: 12/25/2022]

Van Houtven J, Cuypers B, Meysman P, Hooyberghs J, Laukens K, Valkenborg D. Constrained Standardization of Count Data from Massive Parallel Sequencing. J Mol Biol 2021;433:166966. [PMID: 33794260 DOI: 10.1016/j.jmb.2021.166966] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 02/26/2021] [Accepted: 03/23/2021] [Indexed: 11/22/2022]

Stupnikov A, McInerney CE, Savage KI, McIntosh SA, Emmert-Streib F, Kennedy R, Salto-Tellez M, Prise KM, McArt DG. Robustness of differential gene expression analysis of RNA-seq. Comput Struct Biotechnol J 2021;19:3470-3481. [PMID: 34188784 PMCID: PMC8214188 DOI: 10.1016/j.csbj.2021.05.040] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 05/25/2021] [Accepted: 05/25/2021] [Indexed: 01/05/2023] Open

Yang J, Wang D, Yang Y, Yang W, Jin W, Niu X, Gong J. A systematic comparison of normalization methods for eQTL analysis. Brief Bioinform 2021;22:6278608. [PMID: 34015824 DOI: 10.1093/bib/bbab193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 04/14/2021] [Accepted: 04/28/2021] [Indexed: 11/15/2022] Open

Identification of transcriptional subtypes in lung adenocarcinoma and squamous cell carcinoma through integrative analysis of microarray and RNA sequencing data. Sci Rep 2021;11:8709. [PMID: 33888829 PMCID: PMC8062554 DOI: 10.1038/s41598-021-88209-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 04/08/2021] [Indexed: 02/02/2023] Open

Abstract

Classification of tumors into subtypes can inform personalized approaches to treatment including the choice of targeted therapies. The two most common lung cancer histological subtypes, lung adenocarcinoma and lung squamous cell carcinoma, have been previously divided into transcriptional subtypes using microarray data, and corresponding signatures were subsequently used to classify RNA-seq data. Cross-platform unsupervised classification facilitates the identification of robust transcriptional subtypes by combining vast amounts of publicly available microarray and RNA-seq data. However, cross-platform classification is challenging because of intrinsic differences in data generated using the two gene expression profiling technologies. In this report, we show that robust gene expression subtypes can be identified in integrated data representing over 3500 normal and tumor lung samples profiled using two widely used platforms, Affymetrix HG-U133 Plus 2.0 Array and Illumina HiSeq RNA sequencing. We tested and analyzed consensus clustering for 384 combinations of data processing methods. The agreement between subtypes identified in single-platform and cross-platform normalized data was then evaluated using a variety of statistics. Results show that unsupervised learning can be achieved with combined microarray and RNA-seq data using selected preprocessing, cross-platform normalization, and unsupervised feature selection methods. Our analysis confirmed three lung adenocarcinoma transcriptional subtypes, but only two consistent subtypes in squamous cell carcinoma, as opposed to four subtypes previously identified. Further analysis showed that tumor subtypes were associated with distinct patterns of genomic alterations in genes coding for therapeutic targets. Importantly, by integrating quantitative proteomics data, we were able to identify tumor subtype biomarkers that effectively classify samples on the basis of both gene and protein expression. 
This study provides the basis for further integrative data analysis across gene and protein expression profiling platforms.

de Vries JJC, Brown JR, Couto N, Beer M, Le Mercier P, Sidorov I, Papa A, Fischer N, Oude Munnink BB, Rodriquez C, Zaheri M, Sayiner A, Hönemann M, Cataluna AP, Carbo EC, Bachofen C, Kubacki J, Schmitz D, Tsioka K, Matamoros S, Höper D, Hernandez M, Puchhammer-Stöckl E, Lebrand A, Huber M, Simmonds P, Claas ECJ, López-Labrador FX. Recommendations for the introduction of metagenomic next-generation sequencing in clinical virology, part II: bioinformatic analysis and reporting. J Clin Virol 2021;138:104812. [PMID: 33819811 DOI: 10.1016/j.jcv.2021.104812] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 03/20/2021] [Indexed: 12/11/2022]

Affiliation(s)

Jutte J C de Vries Clinical Microbiological Laboratory, department of Medical Microbiology, Leiden University Medical Center, Leiden, the Netherlands.
Julianne R Brown Microbiology, Virology and Infection Prevention & Control, Great Ormond Street Hospital for Children NHS Foundation Trust, London, United Kingdom.
Natacha Couto Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom.
Martin Beer Friedrich-Loeffler-Institute, Institute of Diagnostic Virology, Greifswald, Germany.
Philippe Le Mercier Swiss Institute of Bioinformatics, Geneva, Switzerland.
Igor Sidorov Clinical Microbiological Laboratory, department of Medical Microbiology, Leiden University Medical Center, Leiden, the Netherlands.
Anna Papa Department of Microbiology, Medical School, Aristotle University of Thessaloniki, Greece.
Nicole Fischer University Medical Center Hamburg-Eppendorf, UKE Institute for Medical Microbiology, Virology and Hygiene, Germany.
Bas B Oude Munnink Viroscience, Erasmus Medical Center, Rotterdam, the Netherlands.
Christophe Rodriquez Department of Virology, University hospital Henri Mondor, Assistance Public des Hopitaux de Paris, Créteil, France.
Maryam Zaheri Institute of Medical Virology, University of Zurich, Switzerland.
Arzu Sayiner Dokuz Eylul University, Medical Faculty, Department of Medical Microbiology, Izmir, Turkey.
Mario Hönemann Institute of Virology, Leipzig University, Leipzig, Germany.
Alba Perez Cataluna Department of Preservation and Food Safety Technologies, IATA-CSIC, Paterna, Valencia, Spain.
Ellen C Carbo Clinical Microbiological Laboratory, department of Medical Microbiology, Leiden University Medical Center, Leiden, the Netherlands.
Claudia Bachofen Institute of Virology, University of Zurich, Switzerland.
Jakub Kubacki Institute of Virology, University of Zurich, Switzerland.
Dennis Schmitz RIVM National Institute for Public Health and Environment, Bilthoven, the Netherlands.
Katerina Tsioka Department of Microbiology, Medical School, Aristotle University of Thessaloniki, Greece.
Sébastien Matamoros Medical Microbiology and Infection Control, Amsterdam UMC, Amsterdam, the Netherlands.
Dirk Höper Friedrich-Loeffler-Institute, Institute of Diagnostic Virology, Greifswald, Germany.
Marta Hernandez Laboratory of Molecular Biology and Microbiology, Instituto Tecnologico Agrario de Castilla y Leon, Valladolid, Spain.
Elisabeth Puchhammer-Stöckl Center of Virology, Medical University Vienna, Vienna, Austria.
Aitana Lebrand Swiss Institute of Bioinformatics, Geneva, Switzerland.
Michael Huber Institute of Medical Virology, University of Zurich, Switzerland.
Peter Simmonds Nuffield Department of Medicine, University of Oxford, Oxford, UK.
Eric C J Claas Clinical Microbiological Laboratory, department of Medical Microbiology, Leiden University Medical Center, Leiden, the Netherlands.
F Xavier López-Labrador Virology Laboratory, Genomics and Health Area, Centre for Public Health Research (FISABIO-Public Health), Valencia, Spain; Department of Microbiology, Medical School, University of Valencia, Spain; CIBERESP, Instituto de Salud Carlos III, Madrid, Spain.

Giraud D, Lima O, Rousseau-Gueutin M, Salmon A, Aïnouche M. Gene and Transposable Element Expression Evolution Following Recent and Past Polyploidy Events in Spartina (Poaceae). Front Genet 2021;12:589160. [PMID: 33841492 PMCID: PMC8027259 DOI: 10.3389/fgene.2021.589160] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 02/23/2021] [Indexed: 12/18/2022] Open

Abstract

Gene expression dynamics is a key component of polyploid evolution, varying in nature, intensity, and temporal scales, most particularly in allopolyploids, where two or more sub-genomes from differentiated parental species and different repeat contents are merged. Here, we investigated transcriptome evolution at different evolutionary time scales among tetraploid, hexaploid, and neododecaploid Spartina species (Poaceae, Chloridoideae) that successively diverged in the last 6-10 my, at the origin of differential phenotypic and ecological traits. Of particular interest are the recent (19th century) hybridizations between the two hexaploids Spartina alterniflora (2n = 6x = 62) and S. maritima (2n = 6x = 60) that resulted in two sterile F1 hybrids: Spartina × townsendii (2n = 6x = 62) in England and Spartina × neyrautii (2n = 6x = 62) in France. Whole genome duplication of S. × townsendii gave rise to the invasive neo-allododecaploid species Spartina anglica (2n = 12x = 124). New transcriptome assemblies and annotations for tetraploids and the enrichment of previously published reference transcriptomes for hexaploids and the allododecaploid allowed identifying 42,423 clusters of orthologs and distinguishing 21 transcribed transposable element (TE) lineages across the seven investigated Spartina species. In 4x and 6x mesopolyploids, gene and TE expression changes were consistent with phylogenetic relationships and divergence, revealing weak expression differences in the tetraploid sister species Spartina bakeri and Spartina versicolor (<2 my divergence time) compared to marked transcriptome divergence between the hexaploids S. alterniflora and S. maritima that diverged 2-4 mya. Differentially expressed genes were involved in glycolysis, post-transcriptional protein modifications, epidermis development, and biosynthesis of carotenoids. 
Most detected TE lineages (except SINE elements) were more highly expressed in hexaploids than in tetraploids, in line with their abundance in the corresponding genomes. Comparatively, an astonishing (52%) expression repatterning and deviation from parental additivity were observed following recent reticulate evolution (involving the F1 hybrids and the neo-allododecaploid S. anglica), with various patterns of biased homoeologous gene expression, including genes involved in epigenetic regulation. Downregulation of TEs was observed in both hybrids and accentuated in the neo-allopolyploid. Our results reinforce the view that allopolyploidy represents a springboard to new regulatory patterns, offering worldwide invasive species, such as S. anglica, the opportunity to colonize stressful and fluctuating environments on saltmarshes.

A comprehensive analysis of tumor microenvironment-related genes in colon cancer. Clin Transl Oncol 2021;23:1769-1781. [PMID: 33689097 DOI: 10.1007/s12094-021-02578-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 02/23/2021] [Indexed: 12/19/2022]

Lim DK, Rashid NU, Ibrahim JG. MODEL-BASED FEATURE SELECTION AND CLUSTERING OF RNA-SEQ DATA FOR UNSUPERVISED SUBTYPE DISCOVERY. Ann Appl Stat 2021;15:481-508. [PMID: 34457104 PMCID: PMC8386505 DOI: 10.1214/20-aoas1407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Systematic comparison and assessment of RNA-seq procedures for gene expression quantitative analysis. Sci Rep 2020;10:19737. [PMID: 33184454 PMCID: PMC7665074 DOI: 10.1038/s41598-020-76881-x] [Citation(s) in RCA: 85] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Accepted: 11/03/2020] [Indexed: 01/16/2023] Open

Elolimy AA, Washam C, Byrum S, Chen C, Dawson H, Bowlin AK, Randolph CE, Saraf MK, Yeruva L. Formula Diet Alters the Ileal Metagenome and Transcriptome at Weaning and during the Postweaning Period in a Porcine Model. mSystems 2020;5:e00457-20. [PMID: 32753508 PMCID: PMC7406227 DOI: 10.1128/msystems.00457-20] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 07/21/2020] [Indexed: 01/05/2023] Open

Abstract

Exclusive breastfeeding impacts the intestinal microbiome and is associated with a better immune function than is seen with milk formula (MF) feeding in infants and yet with mechanisms poorly defined. The porcine model was used to evaluate the impact of MF on ileum microbial communities and gene expression relative to human milk (HM)-fed piglets. Fifty-two Dutch Landrace male piglets were fed an isocaloric diet of either HM (n = 26) or MF (n = 26) from day 2 through day 21 of age and weaned to a solid diet until day 51. Eleven piglets from each group were euthanized at day 21, while the remaining piglets (HM, n = 15; MF, n = 15) were euthanized at day 51 to collect ileal epithelium (EP) scrapings and ileal (IL) tissues. The epithelial mucosa was subjected to shotgun metagenome sequencing, and EP and IL tissues were used for transcriptome analysis. On day 21, transcriptome data revealed that the levels of pathways involved in inflammation and apoptosis were significantly higher in MF piglets than in HM piglets, whereas the levels of tight junctions and pathogen detection systems were lower in MF piglets than in HM piglets. The MF impacts on the small intestine were maintained over the postweaning period (day 51) as indicated by higher levels of Dialister invisus bacteria and higher levels of expression of genes associated with inflammation and apoptosis pathways relative to the HM group. The current study demonstrated that MF might impact local intestinal inflammation, apoptosis, and tight junctions and might suppress pathogen recognition in the small intestine compared with HM. IMPORTANCE Exclusive human milk (HM) breastfeeding for the first 6 months of age in infants is recommended to improve health outcomes during early life and beyond. When women are unable to provide sufficient HM, milk formula (MF) is often recommended as a complementary or alternative source of nutrition. 
Previous studies in piglets demonstrated that MF alters the gut microbiome and induces inflammatory cytokine production. The links between MF feeding, gut microbiome, and inflammation status are unclear due to challenges associated with the collection of intestinal samples from human infants. The current report provides the first insight into MF-microbiome-inflammation connections in the small intestine compared with HM feeding using a porcine model. The present results showed that, compared with HM, MF might impact immune function through the induction of ileal inflammation, apoptosis, and tight junction disruptions and likely compromised immune defense against pathogen detection in the small intestine relative to piglets that were fed HM.

Zhao S, Ye Z, Stanton R. Misuse of RPKM or TPM normalization when comparing across samples and sequencing protocols. RNA (NEW YORK, N.Y.) 2020;26:903-909. [PMID: 32284352 PMCID: PMC7373998 DOI: 10.1261/rna.074922.120] [Citation(s) in RCA: 191] [Impact Index Per Article: 47.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Richard M, Decamps C, Chuffart F, Brambilla E, Rousseaux S, Khochbin S, Jost D. PenDA, a rank-based method for personalized differential analysis: Application to lung cancer. PLoS Comput Biol 2020;16:e1007869. [PMID: 32392248 PMCID: PMC7274464 DOI: 10.1371/journal.pcbi.1007869] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 06/05/2020] [Accepted: 04/11/2020] [Indexed: 12/27/2022] Open

Abstract

The hopes of precision medicine rely on our capacity to measure various high-throughput genomic information of a patient and to integrate them for personalized diagnosis and adapted treatment. Reaching these ambitious objectives will require the development of efficient tools for the detection of molecular defects at the individual level. Here, we propose a novel method, PenDA, to perform Personalized Differential Analysis at the scale of a single sample. PenDA is based on the local ordering of gene expressions within individual cases and infers the deregulation status of genes in a sample of interest compared to a reference dataset. Based on realistic simulations of RNA-seq data of tumors, we showed that PenDA outcompetes existing approaches with very high specificity and sensitivity and is robust to normalization effects. Applying the method to lung cancer cohorts, we observed that deregulated genes in tumors exhibit a cancer-type-specific commitment towards up- or down-regulation. Based on the individual information of deregulation given by PenDA, we were able to define two new molecular histologies for lung adenocarcinoma cancers strongly correlated to survival. In particular, we identified 37 biomarkers whose up-regulation leads to bad prognosis and that we validated on two independent cohorts. PenDA provides a robust, generic tool to extract personalized deregulation patterns that can then be used for the discovery of therapeutic targets and for personalized diagnosis. An open-access, user-friendly R package is available at https://github.com/bcm-uga/penda.

The hopes of precision medicine rely on our capacity to measure individual molecular information for personalized diagnosis and treatment. These challenging perspectives will be possible only with the development of efficient methodological tools to identify patient-specific molecular defects from the wealth of precise molecular information that one can access at the single-individual, single-tissue or even single-cell levels. Such methods will provide a better understanding of disease-specific biological mechanisms and will promote the development of personalized therapeutic strategies. Here we describe a novel method, named PenDA, to perform differential analysis of gene expression at the individual level. Based on a realistic benchmark of simulated tumors, we demonstrated that PenDA reaches very high efficiency in detecting sample-specific deregulated genes. We then applied the method to two large cohorts associated with lung cancer. A detailed statistical analysis of the results allowed us to isolate genes with specific deregulation patterns, like genes that are up-regulated in all tumors or genes that are expressed but never deregulated in any tumors. Given their specificities, these genes are likely to be of interest in therapeutic research. In particular, we were able to identify 37 new biomarkers associated with bad prognosis that we validated on two independent cohorts.

Wang B. A Zipf-plot based normalization method for high-throughput RNA-seq data. PLoS One 2020;15:e0230594. [PMID: 32271772 PMCID: PMC7144957 DOI: 10.1371/journal.pone.0230594] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Accepted: 03/03/2020] [Indexed: 12/02/2022] Open

Hu G, Grover CE, Arick MA, Liu M, Peterson DG, Wendel JF. Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids. Brief Bioinform 2020;22:1819-1835. [PMID: 32219306 PMCID: PMC7986634 DOI: 10.1093/bib/bbaa035] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 02/06/2020] [Accepted: 02/24/2020] [Indexed: 12/29/2022] Open

Patient-Tailored Radiation Therapy for Rectal Cancer: The Devil Is in the Details. Dis Colon Rectum 2020;63:265-266. [PMID: 32032138 DOI: 10.1097/dcr.0000000000001567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Yang S, Wachtel MS, Wu J. DFseq: Distribution-Free Method to Detect Differential Gene Expression for RNA-Sequencing Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:558-565. [PMID: 30176602 DOI: 10.1109/tcbb.2018.2866994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Li X, Cooper NGF, O'Toole TE, Rouchka EC. Choice of library size normalization and statistical methods for differential gene expression analysis in balanced two-group comparisons for RNA-seq studies. BMC Genomics 2020;21:75. [PMID: 31992223 PMCID: PMC6986029 DOI: 10.1186/s12864-020-6502-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 01/16/2020] [Indexed: 12/20/2022] Open

Abstract

Background

High-throughput RNA sequencing (RNA-seq) has evolved into an important analytical tool in molecular biology. Although the utility and importance of this technique have grown, uncertainties regarding the proper analysis of RNA-seq data remain. Of primary concern, there is no consensus regarding which normalization and statistical methods are the most appropriate for analyzing these data. The lack of standardized analytical methods leads to uncertainties in data interpretation and study reproducibility, especially with studies reporting high false discovery rates. In this study, we compared a recently developed normalization method, UQ-pgQ2, with three of the most frequently used alternatives, RLE (relative log expression), TMM (trimmed mean of M values) and UQ (upper quartile normalization), in the analysis of RNA-seq data. We evaluated the performance of these methods for gene-level differential expression analysis by considering three factors: 1) normalization combined with the choice of a Wald test from DESeq2 or an exact test/QL (quasi-likelihood) F-test from edgeR; 2) sample sizes in two balanced two-group comparisons; and 3) sequencing read depths.

Results

Using the MAQC RNA-seq datasets with small numbers of sample replicates, we found that UQ-pgQ2 normalization combined with an exact test can achieve better performance in terms of power and specificity in differential gene expression analysis. However, using an intra-group analysis of false positives from real and simulated data, we found that a Wald test performs better than an exact test when the number of sample replicates is large and that a QL F-test performs the best given sample sizes of 5, 10 and 15 for any normalization. The RLE, TMM and UQ methods performed similarly for a given sample size.

Conclusion

We found that the UQ-pgQ2 method combined with an exact test/QL F-test is the best choice for controlling false positives when the sample size is small. When the sample size is large, UQ-pgQ2 with a QL F-test is a better choice for type I error control in an intra-group analysis. We observed that read depth has a minimal impact on differential gene expression analysis based on the simulated data.
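
As a rough illustration of two of the normalization methods named in this abstract, the sketch below computes per-sample scaling factors in plain Python. It is a simplified sketch under stated assumptions, not the edgeR or DESeq2 implementation: the function names `upper_quartile_factors` and `rle_factors` are hypothetical, the UQ factors are rescaled by their arithmetic mean for readability, and the gene filtering is cruder than in either package. UQ-pgQ2 and TMM are not covered.

```python
import math
from statistics import median, quantiles


def upper_quartile_factors(samples):
    """Simplified upper-quartile (UQ) scaling factors.

    samples: list of per-sample count lists, all in the same gene order.
    Assumption: zero counts are dropped per sample before taking the
    75th percentile, and factors are divided by their arithmetic mean.
    """
    uqs = []
    for counts in samples:
        nonzero = [c for c in counts if c > 0]
        # quantiles(..., n=4) returns the three quartile cut points.
        uqs.append(quantiles(nonzero, n=4, method="inclusive")[2])
    mean_uq = sum(uqs) / len(uqs)
    return [uq / mean_uq for uq in uqs]


def rle_factors(samples):
    """Simplified RLE (median-of-ratios) size factors.

    For each gene with a positive count in every sample, take the geometric
    mean across samples as a pseudo-reference; a sample's factor is the
    median of its count-to-reference ratios over those genes.
    """
    n_genes = len(samples[0])
    usable = [g for g in range(n_genes) if all(s[g] > 0 for s in samples)]
    log_ref = {
        g: sum(math.log(s[g]) for s in samples) / len(samples) for g in usable
    }
    return [
        median([s[g] / math.exp(log_ref[g]) for g in usable]) for s in samples
    ]


# Toy data: the second sample is the first sequenced at twice the depth.
sample_a = [10, 20, 30, 40]
sample_b = [20, 40, 60, 80]
uq = upper_quartile_factors([sample_a, sample_b])
rle = rle_factors([sample_a, sample_b])
```

On this toy input both methods assign the second sample a factor twice that of the first, which is the behaviour a depth correction should show when the only difference between samples is sequencing depth.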

Genome-Wide Analysis of Cyclophilin Proteins in 21 Oomycetes. Pathogens 2019;9:pathogens9010024. [PMID: 31888032 PMCID: PMC7168621 DOI: 10.3390/pathogens9010024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 12/12/2019] [Accepted: 12/20/2019] [Indexed: 12/20/2022] Open

Abrams ZB, Johnson TS, Huang K, Payne PRO, Coombes K. A protocol to evaluate RNA sequencing normalization methods. BMC Bioinformatics 2019;20:679. [PMID: 31861985 PMCID: PMC6923842 DOI: 10.1186/s12859-019-3247-x] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open

Jiang S, Cheng SJ, Ren LC, Wang Q, Kang YJ, Ding Y, Hou M, Yang XX, Lin Y, Liang N, Gao G. An expanded landscape of human long noncoding RNA. Nucleic Acids Res 2019;47:7842-7856. [PMID: 31350901 PMCID: PMC6735957 DOI: 10.1093/nar/gkz621] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Revised: 06/18/2019] [Accepted: 07/11/2019] [Indexed: 12/21/2022] Open

Affiliation(s)

Shuai Jiang Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Si-Jin Cheng Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Li-Chen Ren Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Qian Wang Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Yu-Jian Kang Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Yang Ding Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Mei Hou Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Xiao-Xu Yang Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Yuan Lin Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Nan Liang Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
Ge Gao Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China

Liu W, Jacquiod S, Brejnrod A, Russel J, Burmølle M, Sørensen SJ. Deciphering links between bacterial interactions and spatial organization in multispecies biofilms. THE ISME JOURNAL 2019;13:3054-3066. [PMID: 31455806 PMCID: PMC6864094 DOI: 10.1038/s41396-019-0494-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 07/26/2019] [Accepted: 07/29/2019] [Indexed: 01/23/2023]

van Rooij J, Mandaviya PR, Claringbould A, Felix JF, van Dongen J, Jansen R, Franke L, 't Hoen PAC, Heijmans B, van Meurs JBJ. Evaluation of commonly used analysis strategies for epigenome- and transcriptome-wide association studies through replication of large-scale population studies. Genome Biol 2019;20:235. [PMID: 31727104 PMCID: PMC6857161 DOI: 10.1186/s13059-019-1878-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 11/02/2019] [Indexed: 12/15/2022] Open

Mandelboum S, Manber Z, Elroy-Stein O, Elkon R. Recurrent functional misinterpretation of RNA-seq data caused by sample-specific gene length bias. PLoS Biol 2019;17:e3000481. [PMID: 31714939 PMCID: PMC6850523 DOI: 10.1371/journal.pbio.3000481] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Accepted: 10/08/2019] [Indexed: 11/19/2022] Open

Abstract

Data normalization is a critical step in RNA sequencing (RNA-seq) analysis, aiming to remove systematic effects from the data to ensure that technical biases have minimal impact on the results. Analyzing numerous RNA-seq datasets, we detected a prevalent sample-specific length effect that leads to a strong association between gene length and fold-change estimates between samples. This stochastic sample-specific effect is not corrected by common normalization methods, including reads per kilobase of transcript length per million reads (RPKM), Trimmed Mean of M values (TMM), relative log expression (RLE), and quantile and upper-quartile normalization. Importantly, we demonstrate that this bias causes recurrent false positive calls by gene-set enrichment analysis (GSEA) methods, thereby leading to frequent functional misinterpretation of the data. Gene sets characterized by markedly short genes (e.g., ribosomal protein genes) or long genes (e.g., extracellular matrix genes) are particularly prone to such false calls. This sample-specific length bias is effectively removed by the conditional quantile normalization (cqn) and EDASeq methods, which allow the integration of gene length as a sample-specific covariate. Consequently, using these normalization methods led to substantial reduction in GSEA false results while retaining true ones. In addition, we found that application of gene-set tests that take into account gene–gene correlations attenuates false positive rates caused by the length bias, but statistical power is reduced as well. Our results advocate the inspection and correction of sample-specific length biases as default steps in RNA-seq analysis pipelines and reiterate the need to account for intergene correlations when performing gene-set enrichment tests to lessen false interpretation of transcriptomic data.

Analysis of numerous RNA-seq datasets reveals a recurrent sample-specific length bias that causes frequent false positive calls by gene-set enrichment analyses, leading to functional misinterpretation of the data. Its removal requires methods that allow the integration of gene length as sample-specific covariate.
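
The inspection step this abstract advocates can be approximated with a rank correlation between gene length and the log fold change between two samples. The following is a minimal diagnostic sketch, assuming already normalized counts and a pseudocount of 1; the helper names `ranks`, `spearman` and `length_bias` are hypothetical, and the code only detects a length trend. It does not perform the cqn or EDASeq correction.

```python
import math


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation computed as the Pearson correlation of ranks."""
    rx, ry = ranks(x), ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


def length_bias(lengths, counts_a, counts_b, pseudocount=1.0):
    """Correlate gene length with log2 fold change of sample b over sample a."""
    lfc = [
        math.log2((b + pseudocount) / (a + pseudocount))
        for a, b in zip(counts_a, counts_b)
    ]
    return spearman(lengths, lfc)


# Toy data in which longer genes show systematically larger fold changes.
gene_lengths = [500, 1000, 2000, 4000]
rho = length_bias(gene_lengths, [100, 100, 100, 100], [50, 100, 200, 400])
```

A correlation far from zero between two samples that should be comparable would suggest a sample-specific length effect worth correcting before gene-set enrichment analysis; what counts as "far" is a judgment call this sketch does not settle.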

Reyes ALP, Silva TC, Coetzee SG, Plummer JT, Davis BD, Chen S, Hazelett DJ, Lawrenson K, Berman BP, Gayther SA, Jones MR. GENAVi: a shiny web application for gene expression normalization, analysis and visualization. BMC Genomics 2019;20:745. [PMID: 31619158 PMCID: PMC6796420 DOI: 10.1186/s12864-019-6073-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 08/29/2019] [Indexed: 01/13/2023] Open

Abstract

BACKGROUND

The development of next-generation sequencing (NGS) methods led to a rapid rise in the generation of large genomic datasets, but user-friendly tools to analyze and visualize these datasets have not developed at the same pace. This presents a two-fold challenge to biologists: the expertise to select an appropriate data analysis pipeline, and the need for bioinformatics or programming skills to apply this pipeline. The development of graphical user interface (GUI) applications hosted on web-based servers such as Shiny can make complex workflows accessible across operating systems and internet browsers to those without programming knowledge.

RESULTS

We have developed GENAVi (Gene Expression Normalization Analysis and Visualization) to provide a user-friendly interface for normalization and differential expression analysis (DEA) of human or mouse feature count level RNA-Seq data. GENAVi is a GUI-based tool that combines Bioconductor packages in a format for scientists without bioinformatics expertise. We provide a panel of 20 cell lines commonly used for the study of breast and ovarian cancer within GENAVi as a foundation for users to bring their own data to the application. Users can visualize expression across samples, cluster samples based on gene expression or correlation, calculate and plot the results of principal components analysis, perform DEA and gene set enrichment, and produce plots for each of these analyses. To allow scalability for large datasets, we have provided local install via three methods. We improve on available tools by offering a range of normalization methods and a simple-to-use interface that provides clear and complete session reporting for reproducible analysis.

CONCLUSION

The development of tools using a GUI makes them practical and accessible to scientists without bioinformatics expertise or access to a data analyst with relevant skills. While several GUI-based tools are currently available for RNA-Seq analysis, we improve on these existing tools. This user-friendly application provides a convenient platform for the normalization, analysis and visualization of gene expression data for scientists without bioinformatics expertise.

Improved cellulase production in recombinant Saccharomyces cerevisiae by disrupting the cell wall protein-encoding gene CWP2. J Biosci Bioeng 2019;129:165-171. [PMID: 31537451 DOI: 10.1016/j.jbiosc.2019.08.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 08/23/2019] [Accepted: 08/23/2019] [Indexed: 12/27/2022]