1
|
Khodaee F, Zandie R, Edelman ER. Multimodal Learning for Mapping the Genotype-Phenotype Dynamics. RESEARCH SQUARE 2024:rs.3.rs-4355413. [PMID: 38798675 PMCID: PMC11118704 DOI: 10.21203/rs.3.rs-4355413/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
How complex phenotypes emerge from intricate gene expression patterns is a fundamental question in biology. Quantitative characterization of this relationship, however, is challenging due to the vast combinatorial possibilities and dynamic interplay between genotype and phenotype landscapes. Integrating high-content genotyping approaches such as single-cell RNA sequencing and advanced learning methods such as language models offers an opportunity for dissecting this complex relationship. Here, we present a computational integrated genetics framework designed to analyze and interpret the high-dimensional landscape of genotypes and their associated phenotypes simultaneously. We applied this approach to develop a multimodal foundation model to explore the genotype-phenotype relationship manifold for human transcriptomics at the cellular level. Analyzing this joint manifold showed a refined resolution of cellular heterogeneity, enhanced precision in phenotype annotating, and uncovered potential cross-tissue biomarkers that are undetectable through conventional gene expression analysis alone. Moreover, our results revealed that the gene networks are characterized by scale-free patterns and show context-dependent gene-gene interactions, both of which result in significant variations in the topology of the gene network, particularly evident during aging. Finally, utilizing contextualized embeddings, we investigated gene polyfunctionality which illustrates the multifaceted roles that genes play in different biological processes, and demonstrated that for VWF gene in endothelial cells. Overall, this study advances our understanding of the dynamic interplay between gene expression and phenotypic manifestation and demonstrates the potential of integrated genetics in uncovering new dimensions of cellular function and complexity.
Collapse
Affiliation(s)
- Farhan Khodaee
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
| | - Rohola Zandie
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
| | - Elazer R. Edelman
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
- Department of Medicine (Cardiovascular Medicine), Brigham and Women’s Hospital, Boston, 02115, MA, USA
| |
Collapse
|
2
|
Sun JR, Lou YN, Huang R, Li KX, Jia LQ. Predictive value of TCM tongue characteristics for chemotherapy-induced myelosuppression in patients with lung cancer. Medicine (Baltimore) 2024; 103:e37636. [PMID: 38608065 PMCID: PMC11018151 DOI: 10.1097/md.0000000000037636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 02/26/2024] [Indexed: 04/14/2024] Open
Abstract
This study aimed to investigate the clinical predictors, including traditional Chinese medicine tongue characteristics and other clinical parameters for chemotherapy-induced myelosuppression (CIM), and then to develop a clinical prediction model and construct a nomogram. A total of 103 patients with lung cancer were prospectively enrolled in this study. All of them were scheduled to receive first-line chemotherapy regimens. Participants were randomly assigned to either the training group (n = 52) or the test group (n = 51). Tongue characteristics and clinical parameters were collected before the start of chemotherapy, and then the incidence of myelosuppression was assessed after treatment. We used univariate logistic regression analysis to identify the risk predictors for assessing the incidence of CIM. Moreover, we developed a predictive model and a nomogram using multivariate logistic regression analysis. Finally, we evaluated the predictive performance of the model by examining the area under the curve value of the receiver operating characteristic, calibration curve, and decision curve analysis. As a result, a total of 3 independent predictors were found to be associated with the CIM in multivariate regression analysis: the fat tongue (OR = 3.67), Karnofsky performance status score (OR = 0.11), and the number of high-toxic drugs in chemotherapy regimens (OR = 4.78). Then a model was constructed using these 3 predictors and it exhibited a robust predictive performance with an area under the curve of 0.82 and the consistent calibration curves. Besides, the decision curve analysis results suggested that applying this predictive model can result in more net clinical benefit for patients. We established a traditional Chinese medicine prediction model based on the tongue characteristics and clinical parameters, which could serve as a useful tool for assessing the risk of CIM.
Collapse
Affiliation(s)
- Jian-Rong Sun
- Department of Clinical Medicine, Beijing University of Chinese Medicine, Beijing, PR China
- Oncology Department of Traditional Chinese Medicine, China-Japan Friendship Hospital, Beijing, PR China
| | - Yan-Ni Lou
- Oncology Department of Traditional Chinese Medicine, China-Japan Friendship Hospital, Beijing, PR China
| | - Rong Huang
- Department of Clinical Medicine, Beijing University of Chinese Medicine, Beijing, PR China
- Oncology Department of Traditional Chinese Medicine, China-Japan Friendship Hospital, Beijing, PR China
| | - Kai-Xuan Li
- Department of Clinical Medicine, Beijing University of Chinese Medicine, Beijing, PR China
- Oncology Department of Traditional Chinese Medicine, China-Japan Friendship Hospital, Beijing, PR China
| | - Li-Qun Jia
- Oncology Department of Traditional Chinese Medicine, China-Japan Friendship Hospital, Beijing, PR China
| |
Collapse
|
3
|
Wang RR, Chen JL, Duan SJ, Lu YX, Chen P, Zhou YC, Yao SK. Noninvasive Diagnostic Technique for Nonalcoholic Fatty Liver Disease Based on Features of Tongue Images. Chin J Integr Med 2024; 30:203-212. [PMID: 38051474 DOI: 10.1007/s11655-023-3616-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/07/2023] [Indexed: 12/07/2023]
Abstract
OBJECTIVE To investigate a new noninvasive diagnostic model for nonalcoholic fatty liver disease (NAFLD) based on features of tongue images. METHODS Healthy controls and volunteers confirmed to have NAFLD by liver ultrasound were recruited from China-Japan Friendship Hospital between September 2018 and May 2019, then the anthropometric indexes and sampled tongue images were measured. The tongue images were labeled by features, based on a brief protocol, without knowing any other clinical data, after a series of corrections and data cleaning. The algorithm was trained on images using labels and several anthropometric indexes for inputs, utilizing machine learning technology. Finally, a logistic regression algorithm and a decision tree model were constructed as 2 diagnostic models for NAFLD. RESULTS A total of 720 subjects were enrolled in this study, including 432 patients with NAFLD and 288 healthy volunteers. Of them, 482 were randomly allocated into the training set and 238 into the validation set. The diagnostic model based on logistic regression exhibited excellent performance: in validation set, it achieved an accuracy of 86.98%, sensitivity of 91.43%, and specificity of 80.61%; with an area under the curve (AUC) of 0.93 [95% confidence interval (CI) 0.68-0.98]. The decision tree model achieved an accuracy of 81.09%, sensitivity of 91.43%, and specificity of 66.33%; with an AUC of 0.89 (95% CI 0.66-0.92) in validation set. CONCLUSIONS The features of tongue images were associated with NAFLD. Both the 2 diagnostic models, which would be convenient, noninvasive, lightweight, rapid, and inexpensive technical references for early screening, can accurately distinguish NAFLD and are worth further study.
Collapse
Affiliation(s)
- Rong-Rui Wang
- Graduate School of Beijing University of Chinese Medicine, Beijing, 100029, China
- Department of Gastroenterology, China-Japan Friendship Hospital, Beijing, 100029, China
| | - Jia-Liang Chen
- Center of Integrative Medicine, Beijing Ditan Hospital, Capital Medical University, Beijing, 100015, China
| | - Shao-Jie Duan
- Graduate School of Beijing University of Chinese Medicine, Beijing, 100029, China
- Department of Gastroenterology, China-Japan Friendship Hospital, Beijing, 100029, China
| | - Ying-Xi Lu
- Nanjing Linkwah Micro-electronics Institute, Beijing, 100191, China
- Institute of Microelectronics, Tsinghua University, Beijing, 100084, China
| | - Ping Chen
- Institute of Microelectronics, Tsinghua University, Beijing, 100084, China
| | - Yuan-Chen Zhou
- Peking University China-Japan Friendship School of Clinical Medicine, Beijing, 100029, China
| | - Shu-Kun Yao
- Graduate School of Beijing University of Chinese Medicine, Beijing, 100029, China.
- Department of Gastroenterology, China-Japan Friendship Hospital, Beijing, 100029, China.
| |
Collapse
|
4
|
Shi Y, Wang H, Yao X, Li J, Liu J, Chen Y, Liu L, Xu J. Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study. BMC Med Inform Decis Mak 2023; 23:197. [PMID: 37773123 PMCID: PMC10542664 DOI: 10.1186/s12911-023-02266-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 08/17/2023] [Indexed: 09/30/2023] Open
Abstract
OBJECTIVE To analyze the tongue feature of NSCLC at different stages, as well as the correlation between tongue feature and tumor marker, and investigate the feasibility of establishing prediction models for NSCLC at different stages based on tongue feature and tumor marker. METHODS Tongue images were collected from non-advanced NSCLC patients (n = 109) and advanced NSCLC patients (n = 110), analyzed the tongue images to obtain tongue feature, and analyzed the correlation between tongue feature and tumor marker in different stages of NSCLC. On this basis, six classifiers, decision tree, logistic regression, SVM, random forest, naive bayes, and neural network, were used to establish prediction models for different stages of NSCLC based on tongue feature and tumor marker. RESULTS There were statistically significant differences in tongue feature between the non-advanced and advanced NSCLC groups. In the advanced NSCLC group, the number of indexes with statistically significant correlations between tongue feature and tumor marker was significantly higher than in the non-advanced NSCLC group, and the correlations were stronger. Support Vector Machine (SVM), decision tree, and logistic regression among the machine learning methods performed poorly in models with different stages of NSCLC. Neural network, random forest and naive bayes had better classification efficiency for the data set of tongue feature and tumor marker and baseline. The models' classification accuracies were 0.767 ± 0.081, 0.718 ± 0.062, and 0.688 ± 0.070, respectively, and the AUCs were 0.793 ± 0.086, 0.779 ± 0.075, and 0.771 ± 0.072, respectively. CONCLUSIONS There were statistically significant differences in tongue feature between different stages of NSCLC, with advanced NSCLC tongue feature being more closely correlated with tumor marker. Due to the limited information, single data sources including baseline, tongue feature, and tumor marker cannot be used to identify the different stages of NSCLC in this pilot study. In addition to the logistic regression method, other machine learning methods, based on tumor marker and baseline data sets, can effectively improve the differential diagnosis efficiency of different stages of NSCLC by adding tongue image data, which requires further verification based on large sample studies in the future.
Collapse
Affiliation(s)
- Yulin Shi
- The Office of Academic Affairs, Shanghai, 201203, China
| | - Hao Wang
- College of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China
| | - Xinghua Yao
- College of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China
| | - Jun Li
- College of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China
| | - Jiayi Liu
- College of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China
| | - Yuan Chen
- Longhua Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Lingshuang Liu
- Longhua Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China.
| | - Jiatuo Xu
- College of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China.
| |
Collapse
|
5
|
Zhou J, Li S, Wang X, Yang Z, Hou X, Lai W, Zhao S, Deng Q, Zhou W. Weakly Supervised Deep Learning for Tooth-Marked Tongue Recognition. Front Physiol 2022; 13:847267. [PMID: 35492602 PMCID: PMC9039050 DOI: 10.3389/fphys.2022.847267] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 03/08/2022] [Indexed: 11/29/2022] Open
Abstract
The recognition of tooth-marked tongues has important value for clinical diagnosis of traditional Chinese medicine. Tooth-marked tongue is often related to spleen deficiency, cold dampness, sputum, effusion, and blood stasis. The clinical manifestations of patients with tooth-marked tongue include loss of appetite, borborygmus, gastric distention, and loose stool. Traditional clinical tooth-marked tongue recognition is conducted subjectively based on the doctor’s visual observation, and its performance is affected by the doctor’s subjectivity, experience, and environmental lighting changes. In addition, the tooth marks typically have various shapes and colors on the tongue, which make it very challenging for doctors to identify tooth marks. The existing methods based on deep learning have made great progress for tooth-marked tongue recognition, but there are still shortcomings such as requiring a large amount of manual labeling of tooth marks, inability to detect and locate the tooth marks, and not conducive to clinical diagnosis and interpretation. In this study, we propose an end-to-end deep neural network for tooth-marked tongue recognition based on weakly supervised learning. Note that the deep neural network only requires image-level annotations of tooth-marked or non-tooth marked tongues. In this method, a deep neural network is trained to classify tooth-marked tongues with the image-level annotations. Then, a weakly supervised tooth-mark detection network (WSTDN) as an architecture variant of the pre-trained deep neural network is proposed for the tooth-marked region detection. Finally, the WSTDN is re-trained and fine-tuned using only the image-level annotations to simultaneously realize the classification of the tooth-marked tongue and the positioning of the tooth-marked region. Experimental results of clinical tongue images demonstrate the superiority of the proposed method compared with previously reported deep learning methods for tooth-marked tongue recognition. The proposed tooth-marked tongue recognition model may provide important syndrome diagnosis and efficacy evaluation methods, and contribute to the understanding of ethnopharmacological mechanisms.
Collapse
Affiliation(s)
- Jianguo Zhou
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Shangxuan Li
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Xuesong Wang
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
| | - Zizhu Yang
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Xinyuan Hou
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Wei Lai
- Beijing Yikang Medical Technology Co., Ltd., Beijing, China
| | - Shifeng Zhao
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
| | - Qingqiong Deng
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
- *Correspondence: Qingqiong Deng, ; Wu Zhou,
| | - Wu Zhou
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China
- *Correspondence: Qingqiong Deng, ; Wu Zhou,
| |
Collapse
|