1
|
Mahapatra D, Bozorgtabar B, Ge Z, Reyes M. GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification. Med Image Anal 2024; 93:103075. [PMID: 38199069 DOI: 10.1016/j.media.2023.103075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 11/26/2023] [Accepted: 12/29/2023] [Indexed: 01/12/2024]
Abstract
Informative sample selection in an active learning (AL) setting helps a machine learning system attain optimum performance with minimum labeled samples, thus reducing annotation costs and boosting performance of computer-aided diagnosis systems in the presence of limited labeled data. Another effective technique to enlarge datasets in a small labeled data regime is data augmentation. An intuitive active learning approach thus consists of combining informative sample selection and data augmentation to leverage their respective advantages and improve the performance of AL systems. In this paper, we propose a novel approach called GANDALF (Graph-based TrANsformer and Data Augmentation Active Learning Framework) to combine sample selection and data augmentation in a multi-label setting. Conventional sample selection approaches in AL have mostly focused on the single-label setting where a sample has only one disease label. These approaches do not perform optimally when a sample can have multiple disease labels (e.g., in chest X-ray images). We improve upon state-of-the-art multi-label active learning techniques by representing disease labels as graph nodes and use graph attention transformers (GAT) to learn more effective inter-label relationships. We identify the most informative samples by aggregating GAT representations. Subsequently, we generate transformations of these informative samples by sampling from a learned latent space. From these generated samples, we identify informative samples via a novel multi-label informativeness score, which beyond the state of the art, ensures that (i) generated samples are not redundant with respect to the training data and (ii) make important contributions to the training stage. We apply our method to two public chest X-ray datasets, as well as breast, dermatology, retina and kidney tissue microscopy MedMNIST datasets, and report improved results over state-of-the-art multi-label AL techniques in terms of model performance, learning rates, and robustness.
Collapse
Affiliation(s)
- Dwarikanath Mahapatra
- Inception Institute of AI, Abu Dhabi, United Arab Emirates; Faculty of IT, Monash University, Melbourne, Australia.
| | - Behzad Bozorgtabar
- École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Lausanne University Hospital (CHUV), Lausanne, Switzerland
| | - Zongyuan Ge
- Faculty of IT, Monash University, Melbourne, Australia
| | - Mauricio Reyes
- ARTORG Center for Biomedical Engineering Research, University of Bern, Bern, Switzerland
| |
Collapse
|
2
|
Azevedo PHRDA, Peçanha BRDB, Flores-Junior LAP, Alves TF, Dias LRS, Muri EMF, Lima CHDS. In silico drug repurposing by combining machine learning classification model and molecular dynamics to identify a potential OGT inhibitor. J Biomol Struct Dyn 2024; 42:1417-1428. [PMID: 37054524 DOI: 10.1080/07391102.2023.2199868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 04/01/2023] [Indexed: 04/15/2023]
Abstract
O-linked N-acetylglucosamine (O-GlcNAc) is a unique intracellular post-translational glycosylation at the hydroxyl group of serine or threonine residues in nuclear, cytoplasmic and mitochondrial proteins. The enzyme O-GlcNAc transferase (OGT) is responsible for adding GlcNAc, and anomalies in this process can lead to the development of diseases associated with metabolic imbalance, such as diabetes and cancer. Repurposing approved drugs can be an attractive tool to discover new targets reducing time and costs in the drug design. This work focuses on drug repurposing to OGT targets by virtual screening of FDA-approved drugs through consensus machine learning (ML) models from an imbalanced dataset. We developed a classification model using docking scores and ligand descriptors. The SMOTE approach to resampling the dataset showed excellent statistical values in five of the seven ML algorithms to create models from the training set, with sensitivity, specificity and accuracy over 90% and Matthew's correlation coefficient greater than 0.8. The pose analysis obtained by molecular docking showed only H-bond interaction with the OGT C-Cat domain. The molecular dynamics simulation showed the lack of H-bond interactions with the C- and N-catalytic domains allowed the drug to exit the binding site. Our results showed that the non-steroidal anti-inflammatory celecoxib could be a potentially OGT inhibitor.
Collapse
Affiliation(s)
| | | | | | - Tatiana Fialho Alves
- Laboratório de Química Medicinal, Faculdade de Farmácia, Universidade Federal Fluminense, Niterói, RJ, Brazil
| | - Luiza Rosaria Sousa Dias
- Laboratório de Química Medicinal, Faculdade de Farmácia, Universidade Federal Fluminense, Niterói, RJ, Brazil
| | - Estela Maris Freitas Muri
- Laboratório de Química Medicinal, Faculdade de Farmácia, Universidade Federal Fluminense, Niterói, RJ, Brazil
| | | |
Collapse
|
3
|
Regan T, McCredie MN, Harris B, Clark S. Using classification trees to identify psychotherapy patients at risk for poor treatment adherence. Psychother Res 2024; 34:159-170. [PMID: 36881612 PMCID: PMC10483023 DOI: 10.1080/10503307.2023.2183911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 02/15/2023] [Accepted: 02/17/2023] [Indexed: 03/08/2023] Open
Abstract
To determine the relative importance of a wide variety of personality and psychopathology variables in influencing patients' adherence to psychotherapy treatment. Two classification trees were trained to predict patients' (1) treatment utilization (i.e., their likelihood of missing a given appointment) and (2) termination status (i.e., their likelihood of dropping out of therapy prematurely). Each tree was then validated in an external dataset to examine performance accuracy. Patients' social detachment was most influential in predicting their treatment utilization, followed by affective instability and activity/energy levels. Patients' interpersonal warmth was most influential in predicting their termination status, followed by levels of disordered thought and resentment. The overall accuracy rating for the tree for termination status was 71.4%, while the tree for treatment utilization had a 38.7% accuracy rating. Classification trees are a practical tool for clinicians to determine patients at risk of premature termination. More research is needed to develop trees that predict treatment utilization with high accuracy across different types of patients and settings.
Collapse
Affiliation(s)
- Timothy Regan
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health
| | | | - Bethany Harris
- Department of Psychological & Brain Sciences, Texas A&M University
| | - Shaunna Clark
- Department of Psychiatry & Behavioral Sciences, Texas A&M College of Medicine
| |
Collapse
|
4
|
Haj-Hosseini N, Lindblad J, Hasséus B, Kumar VV, Subramaniam N, Hirsch JM. Early Detection of Oral Potentially Malignant Disorders: A Review on Prospective Screening Methods with Regard to Global Challenges. J Maxillofac Oral Surg 2024; 23:23-32. [PMID: 38312957 PMCID: PMC10831018 DOI: 10.1007/s12663-022-01710-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 03/10/2022] [Indexed: 11/28/2022] Open
Abstract
Oral cancer is a cancer type that is widely prevalent in low-and middle-income countries with a high mortality rate, and poor quality of life for patients after treatment. Early treatment of cancer increases patient survival, improves quality of life and results in less morbidity and a better prognosis. To reach this goal, early detection of malignancies using technologies that can be used in remote and low resource areas is desirable. Such technologies should be affordable, accurate, and easy to use and interpret. This review surveys different technologies that have the potentials of implementation in primary health and general dental practice, considering global perspectives and with a focus on the population in India, where oral cancer is highly prevalent. The technologies reviewed include both sample-based methods, such as saliva and blood analysis and brush biopsy, and more direct screening of the oral cavity including fluorescence, Raman techniques, and optical coherence tomography. Digitalisation, followed by automated artificial intelligence based analysis, are key elements in facilitating wide access to these technologies, to non-specialist personnel and in rural areas, increasing quality and objectivity of the analysis while simultaneously reducing the labour and need for highly trained specialists.
Collapse
Affiliation(s)
- Neda Haj-Hosseini
- Department of Biomedical Engineering, Linköping University, Linköping, Sweden
- Centre for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
| | - Joakim Lindblad
- Centre for Image Analysis, Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Bengt Hasséus
- Department of Oral Medicine and Pathology, Institute of Odontology, University of Gothenburg, The Sahlgrenska Academy, Gothenburg, Sweden
- Clinic of Oral Medicine, Public Dental Service, Gothenburg, Region Västra Götaland Sweden
| | - Vinay Vijaya Kumar
- Department of Head and Neck Oncology, Sri Shankara Cancer Hospital and Research Centre, Bangalore, India
- Department of Surgical Sciences, Odontology and Maxillofacial Surgery, Medical Faculty, Uppsala University, Uppsala, Sweden
| | - Narayana Subramaniam
- Department of Head and Neck Oncology, Sri Shankara Cancer Hospital and Research Centre, Bangalore, India
| | - Jan-Michaél Hirsch
- Department of Surgical Sciences, Odontology and Maxillofacial Surgery, Medical Faculty, Uppsala University, Uppsala, Sweden
- Department of Research & Development, Public Dental Services Region Stockholm, Stockholm, Sweden
| |
Collapse
|
5
|
Hurair M, Ju J, Han J. Environmental-Driven Approach towards Level 5 Self-Driving. Sensors (Basel) 2024; 24:485. [PMID: 38257577 PMCID: PMC10820702 DOI: 10.3390/s24020485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 01/03/2024] [Accepted: 01/05/2024] [Indexed: 01/24/2024]
Abstract
As technology advances in almost all areas of life, many companies and researchers are working to develop fully autonomous vehicles. Such level 5 autonomous driving, unlike levels 0 to 4, is a driverless vehicle stage and so the leap from level 4 to level 5 autonomous driving requires much more research and experimentation. For autonomous vehicles to safely drive in complex environments, autonomous cars should ensure end-to-end delay deadlines of sensor systems and car-controlling algorithms including machine learning modules, which are known to be very computationally intensive. To address this issue, we propose a new framework, i.e., an environment-driven approach for autonomous cars. Specifically, we identify environmental factors that we cannot control at all, and controllable internal factors such as sensing frequency, image resolution, prediction rate, car speed, and so on. Then, we design an admission control module that allows us to control internal factors such as image resolution and detection period to determine whether given parameters are acceptable or not for supporting end-to-end deadlines in the current environmental scenario while maintaining the accuracy of autonomous driving. The proposed framework has been verified with an RC car and a simulator.
Collapse
Affiliation(s)
| | | | - Junghee Han
- School of Electronics and Information Engineering, Korea Aerospace University, 76 Hanggongdaehang-ro, Goyang-si 412-791, Gyeonggi-do, Republic of Korea; (M.H.); (J.J.)
| |
Collapse
|
6
|
Feng M, Fang T, He C, Li M, Liu J. Affect and stress detection based on feature fusion of LSTM and 1DCNN. Comput Methods Biomech Biomed Engin 2024; 27:512-520. [PMID: 36919485 DOI: 10.1080/10255842.2023.2188988] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 02/24/2023] [Indexed: 03/16/2023]
Abstract
The impact of emotions on health, especially stress, is receiving increasing attention. It is important to provide a non-invasive affect detection system that can be continuously monitored for a long period of time. Multi-sensor fusion strategies can better improve the performance of affect detection models, but there are also problems such as insufficient feature extraction and poor spatiotemporal feature fusion. Therefore, this study proposes a feature-level fusion method based on long short-term memory and one-dimensional convolutional neural network to extract temporal and spatial features of electrocardiogram, electromyogram, electrical activity, temperature, accelerator and response data, respectively, and then fuse them in a summation fashion for affect and stress detection. In particular, we added the tanh activation function before feature fusion, which can improve the model's performance. We used the wearable affect and stress detection dataset to train the model, which includes three different emotion states (neutral, stress, and amusement) for three-class emotion classification with accuracy and F1-scores of 87.82% and 86.68%, respectively. Due to the importance of stress, we also studied binary classification for stress detection, where neutral and amusement were combined as non-stress, with accuracy and F1-scores of 94.9% and 94.98%, respectively. The performance of the proposed model outperforms other control models and can effectively improve the performance of affect and stress detection, and promote medical care, health care and elderly care.
Collapse
Affiliation(s)
- Mingxu Feng
- Nanchang Key Laboratory of Medical and Technology Research, Nanchang University, Nanchang, Jiangxi, China
- School of Industrial Engineering, Ningxia Polytechnic, Yinchuan, Ningxia, China
| | - Tianshu Fang
- Nanchang Key Laboratory of Medical and Technology Research, Nanchang University, Nanchang, Jiangxi, China
| | - Chaozhu He
- Nanchang Key Laboratory of Medical and Technology Research, Nanchang University, Nanchang, Jiangxi, China
| | - Mengqian Li
- Department of Psychosomatic medicine, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Jizhong Liu
- Nanchang Key Laboratory of Medical and Technology Research, Nanchang University, Nanchang, Jiangxi, China
| |
Collapse
|
7
|
Martos R, Guerra R, Navarro F, Peruch M, Neuwirth K, Valsecchi A, Jankauskas R, Ibáñez O. Computer-aided craniofacial superimposition validation study: the identification of the leaders and participants of the Polish-Lithuanian January Uprising (1863-1864). Int J Legal Med 2024; 138:107-121. [PMID: 36520206 PMCID: PMC10772000 DOI: 10.1007/s00414-022-02929-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 12/04/2022] [Indexed: 12/23/2022]
Abstract
In 2017, a series of human remains corresponding to the executed leaders of the "January Uprising" of 1863-1864 were uncovered at the Upper Castle of Vilnius (Lithuania). During the archeological excavations, 14 inhumation pits with the human remains of 21 individuals were found at the site. The subsequent identification process was carried out, including the analysis and cross-comparison of post-mortem data obtained in situ and in the lab with ante-mortem data obtained from historical archives. In parallel, three anthropologists with diverse backgrounds in craniofacial identification and two students without previous experience attempted to identify 11 of these 21 individuals using the craniofacial superimposition technique. To do this, the five participants had access to 18 3D scanned skulls and 14 photographs of 11 different candidates. The participants faced a cross-comparison problem involving 252 skull-face overlay scenarios. The methodology follows the main agreements of the European project MEPROCS and uses the software Skeleton-ID™. Based on MEPROCS standard, a final decision was provided within a scale, assigning a value in terms of strong, moderate, or limited support to the claim that the skull and the facial image belonged (or not) to the same person for each case. The problem of binary classification, positive/negative, with an identification rate for each participant was revealed. The results obtained in this study make the authors think that both the quality of the materials used and the previous experience of the analyst play a fundamental role when reaching conclusions using the CFS technique.
Collapse
Affiliation(s)
- Rubén Martos
- Panacea Cooperative Research S. Coop, Ponferrada, Spain
- Physical Anthropology Lab, Department of Legal Medicine, Toxicology and Physical Anthropology, University of Granada, Granada, Spain
| | - Rosario Guerra
- Panacea Cooperative Research S. Coop, Ponferrada, Spain
- Physical Anthropology Lab, Department of Legal Medicine, Toxicology and Physical Anthropology, University of Granada, Granada, Spain
| | - Fernando Navarro
- Physical Anthropology Lab, Department of Legal Medicine, Toxicology and Physical Anthropology, University of Granada, Granada, Spain
| | - Michela Peruch
- Department of Medicine and Surgery, University of Trieste, Trieste, Italy
| | - Kevin Neuwirth
- Institute for Prehistory, Early History and Medieval Archaeology, University of Tübingen, Tübingen, Germany
| | - Andrea Valsecchi
- Panacea Cooperative Research S. Coop, Ponferrada, Spain
- Andalusian Research Institute in Data Science and Computational Intelligence, University of Granada, Granada, Spain
| | - Rimantas Jankauskas
- Department of Anatomy, Histology and Anthropology, Vilnius University, Vilnius, Lithuania
| | - Oscar Ibáñez
- Panacea Cooperative Research S. Coop, Ponferrada, Spain.
- Faculty of Computer Science, CITIC, University of A Coruna, 15071, La Coruña, Spain.
| |
Collapse
|
8
|
Moosa H, Ali M, Alaswad H, Elmedany W, Balakrishna C. A combined Blockchain and zero-knowledge model for healthcare B2B and B2C data sharing. Arab Journal of Basic and Applied Sciences 2023. [DOI: 10.1080/25765299.2023.2188701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023] Open
|
9
|
Yang Q, Wang X, Cao X, Liu S, Xie F, Li Y. Multi-classification of national fitness test grades based on statistical analysis and machine learning. PLoS One 2023; 18:e0295674. [PMID: 38134133 PMCID: PMC10745189 DOI: 10.1371/journal.pone.0295674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Accepted: 11/27/2023] [Indexed: 12/24/2023] Open
Abstract
Physical fitness is a key element of a healthy life, and being overweight or lacking physical exercise will lead to health problems. Therefore, assessing an individual's physical health status from a non-medical, cost-effective perspective is essential. This paper aimed to evaluate the national physical health status through national physical examination data, selecting 12 indicators to divide the physical health status into four levels: excellent, good, pass, and fail. The existing challenge lies in the fact that most literature on physical fitness assessment mainly focuses on the two major groups of sports athletes and school students. Unfortunately, there is no reasonable index system has been constructed. The evaluation method has limitations and cannot be applied to other groups. This paper builds a reasonable health indicator system based on national physical examination data, breaks group restrictions, studies national groups, and hopes to use machine learning models to provide helpful health suggestions for citizens to measure their physical status. We analyzed the significance of the selected indicators through nonparametric tests and exploratory statistical analysis. We used seven machine learning models to obtain the best multi-classification model for the physical fitness test level. Comprehensive research showed that MLP has the best classification effect, with macro-precision reaching 74.4% and micro-precision reaching 72.8%. Furthermore, the recall rates are also above 70%, and the Hamming loss is the smallest, i.e., 0.272. The practical implications of these findings are significant. Individuals can use the classification model to understand their physical fitness level and status, exercise appropriately according to the measurement indicators, and adjust their lifestyle, which is an important aspect of health management.
Collapse
Affiliation(s)
- Qian Yang
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, China
| | - Xueli Wang
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, China
| | - Xianbing Cao
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, China
| | - Shuai Liu
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, China
| | - Feng Xie
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, China
| | - Yumei Li
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, China
| |
Collapse
|
10
|
Abstract
The growth of databases in the healthcare domain opens multiple doors for machine learning and artificial intelligence technology. Many medical devices are available in the medical field; however, medical errors remain a severe challenge. Different algorithms are developed to identify and solve medical errors, such as detecting anomalous readings, anomalous health conditions of a patient, etc. However, they fail to answer why those entries are considered an anomaly. This research gap leads to an outlying aspect mining problem. The problem of outlying aspect mining aims to discover the set of features (a.k.a subspace) in which the given data point is dramatically different than others. In this paper, we present a framework that detects anomalies in healthcare data and then provides an explanation of anomalies. This paper aims to effectively and efficiently detect anomalies and explain why they are considered anomalies by detecting outlying aspects. First, we re-introduced four anomaly detection techniques and outlying aspect mining algorithms. Then, we evaluate the performance of anomaly detection techniques and choose the best anomaly detection algorithm. Later, we detect the top k anomaly as a query and detect their outlying aspect. Lastly, we evaluate their performance on 16 real-world healthcare datasets. The experimental results show that the latest isolation-based outlying aspect mining measure, SiNNE, has outstanding performance on this task and has promising results.
Collapse
Affiliation(s)
- Durgesh Samariya
- Institute of Innovation, Science and Sustainability, Federation University, Berwick, VIC Australia
| | - Jiangang Ma
- Institute of Innovation, Science and Sustainability, Federation University, Berwick, VIC Australia
| | - Sunil Aryal
- School of Information Technology, Deakin University, Geelong, VIC Australia
| | - Xiaohui Zhao
- Institute of Innovation, Science and Sustainability, Federation University, Ballarat, VIC Australia
| |
Collapse
|
11
|
Palla IA, Singson M. How do researchers perceive research misbehaviors? A case study of Indian researchers. Account Res 2023; 30:707-724. [PMID: 35584318 DOI: 10.1080/08989621.2022.2078712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Despite ample evidence of increasing research misconduct in India, little attention has been paid to understanding researchers' perception of research integrity and research misconduct among young Indian researchers. Interviews among 30 research scholars were conducted at Pondicherry University in India to understand their experience and perception of research misconduct. The top three influencing factors for scientific misconduct, according to the participants, were unavailability of adequate funds (35%), pressure from research supervisors (29%), and desperation to publish articles (25%). The participants had witnessed research misconduct in different forms i.e., data fabrication, falsification, and plagiarism. However, plagiarism was the most often cited cause of misbehavior in our interviews. Majority of participants have witnessed or personally encountered multiple instances where authorship conflicts occurred. The other questionable research practices highlighted in the study were improper citations, authorship disputes like gift and ghost authorships, misrepresentation of statistical data, failure to publish negative results. In an increasingly diverse and changing research environment, our research calls for practical research guidelines based on honesty, openness, and accountability that can help articulate and strengthen scientists' core values. More importantly, scientific misconduct can only be prevented by using a multifaceted strategy that includes identifying instances of scientific misconduct and implementing suitable deterrents and treatments that could change the behavior associated with such misconduct.
Collapse
Affiliation(s)
- Ishfaq Ahmad Palla
- Department of Library and Information Science, Pondicherry University, Puducherry, India
| | - Mangkhollen Singson
- Department of Library and Information Science, Pondicherry University, Puducherry, India
| |
Collapse
|
12
|
Lu H, Uddin S. Embedding-based link predictions to explore latent comorbidity of chronic diseases. Health Inf Sci Syst 2023; 11:2. [PMID: 36593862 PMCID: PMC9803807 DOI: 10.1007/s13755-022-00206-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 12/13/2022] [Indexed: 12/31/2022] Open
Abstract
Purpose Comorbidity is a term used to describe when a patient simultaneously has more than one chronic disease. Comorbidity is a significant health issue that affects people worldwide. This study aims to use machine learning and graph theory to predict the comorbidity of chronic diseases. Methods A patient-disease bipartite graph is constructed based on the administrative claim data. The bipartite graph projection approach was used to create the comorbidity network. For the link prediction task, three graph machine learning embedding-based models (node2vec, graph neural networks and hand-crafted approach) with different variants were used on the comorbidity network to compare their performance. This study also considered three commonly used similarity-based link prediction approaches (Jaccard coefficient, Adamic-Adar index and Resource allocation index) for performance comparison. Results The results showed that the embedding-based hand-crafted features technique achieved outstanding performance compared with the remaining similarity-based and embedding-based models. Especially, the hand-crafted technique with the extreme gradient boosting classifier achieved the highest accuracy (91.67%), followed by the same technique with the Logistic regression classifier (90.26%). For this shallow embedding method, the Jaccard coefficient and the degree centrality of the original chronic disease were the most important features for comorbidity prediction. Conclusion The proposed framework can be used to predict the comorbidity of chronic disease at an early stage of hospital admission. Thus, the prediction outcome could be valuable for medical practice, giving healthcare providers more control over their services and lowering expenses.
Collapse
Affiliation(s)
- Haohui Lu
- School of Project Management, Faculty of Engineering, The University of Sydney, Level 2, 21 Ross Street, Forest Lodge, NSW 2037 Australia
| | - Shahadat Uddin
- School of Project Management, Faculty of Engineering, The University of Sydney, Level 2, 21 Ross Street, Forest Lodge, NSW 2037 Australia
| |
Collapse
|
13
|
Li Z, An Z, Cheng W, Zhou J, Zheng F, Hu B. MHA: a multimodal hierarchical attention model for depression detection in social media. Health Inf Sci Syst 2023; 11:6. [PMID: 36660408 PMCID: PMC9846704 DOI: 10.1007/s13755-022-00197-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Accepted: 09/06/2022] [Indexed: 01/19/2023] Open
Abstract
As a serious mental disease, depression causes great harm to the physical and mental health of individuals, and becomes an important cause of suicide. Therefore, it is necessary to accurately identify and treat depressed patients. Compared with traditional clinical diagnosis methods, a large amount of real and different types of data on social media provides new ideas for depression detection research. In this paper, we construct a depression detection data set based on Weibo, and propose a Multimodal Hierarchical Attention (MHA) model for social media depression detection. Multimodal data is fed into the model and the attention mechanism is applied within and between modalities at the same time. Experimental results show that the proposed model achieves the best classification performance. In addition, we propose a distribution normalization method, which can optimize the data distribution and improve the accuracy of depression detection.
Collapse
Affiliation(s)
- Zepeng Li
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 Gansu China
| | - Zhengyi An
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 Gansu China
| | - Wenchuan Cheng
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 Gansu China
| | - Jiawei Zhou
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 Gansu China
| | - Fang Zheng
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 Gansu China
| | - Bin Hu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 Gansu China
- School of Medical Technology, Beijing Institute of Technology, Beijing, 100081 Beijing China
- CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200000 Shanghai China
| |
Collapse
|
14
|
Koh AS, Bos HMW, Rothblum ED, Carone N, Gartrell NK. Donor sibling relations among adult offspring conceived via insemination by lesbian parents. Hum Reprod 2023; 38:2166-2174. [PMID: 37697711 DOI: 10.1093/humrep/dead175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 08/13/2023] [Indexed: 09/13/2023] Open
Abstract
STUDY QUESTION How do adult offspring in planned lesbian-parent families feel about and relate to their donor (half) sibling(s) (DS)? SUMMARY ANSWER A majority of offspring had found DS and maintained good ongoing relationships, and all offspring (regardless of whether a DS had been identified) were satisfied with their knowledge of and contact level with the DS. WHAT IS KNOWN ALREADY The first generation of donor insemination offspring of intended lesbian-parent families is now in their 30s. Coincident with this is an increased use of DNA testing and genetic ancestry websites, facilitating the discovery of donor siblings from a common sperm donor. Few studies of offspring and their DS include sexual minority parent (SMP) families, and only sparse data separately analyze the offspring of SMP families or extend the analyses to established adult offspring. STUDY DESIGN, SIZE, DURATION This cohort study included 75 adult offspring, longitudinally followed since conception in lesbian-parent families. Quantitative analyses were performed from online surveys of the offspring in the seventh wave of the 36-year study, with a 90% family retention rate. The data were collected from March 2021 to November 2022. PARTICIPANTS/MATERIALS, SETTING, METHODS Participants were 30- to 33-year-old donor insemination offspring whose lesbian parents enrolled in a US prospective longitudinal study when these offspring were conceived. Offspring who knew of a DS were asked about their numbers found, characteristics or motivations for meeting, DS terminology, relationship quality and maintenance, and impact of the DS contact on others. All offspring (with or without known DS) were asked about the importance of knowing if they have DS and their terminology, satisfaction with information about DS, and feelings about future contact. MAIN RESULTS AND THE ROLE OF CHANCE Of offspring, 53% (n = 40) had found DS in modest numbers, via a DS or sperm bank registry in 45% of cases, and most of these offspring had made contact. The offspring had their meeting motivations fulfilled, viewed the DS as acquaintances more often than siblings or friends, and maintained good relationships via meetings, social media, and cell phone communication. They disclosed their DS meetings to most relatives with neutral impact. The offspring, whether with known or unknown DS, felt neutral about the importance of knowing if they had DS, were satisfied with what they knew (or did not know) of the DS, and were satisfied with their current level of DS contact. This study is the largest, longest-running longitudinal study of intended lesbian-parent families and their offspring, and due to its prospective nature, is not biased by over-sampling offspring who were already satisfied with their DS. LIMITATIONS, REASONS FOR CAUTION The sample was from the USA, and mostly White, highly educated individuals, not representative of the diversity of donor insemination offspring of lesbian-parent families. WIDER IMPLICATIONS OF THE FINDINGS While about half of the offspring found out about DS, the other half did not. Regardless of knowing of a DS, these adult offspring of lesbian parents were satisfied with their level of DS contact. Early disclosure and identity formation about being donor-conceived in a lesbian-parent family may distinguish these study participants from donor insemination offspring and adoptees in the general population, who may be more compelled to seek genetic relatives. The study participants who sought DS mostly found a modest number of them, in contrast to reports in studies that have found large numbers of DS. This may be because one-third of study offspring had donors known to the families since conception, who may have been less likely to participate in commercial sperm banking or internet donation sites, where quotas are difficult to enforce or nonexistent. The study results have implications for anyone considering gamete donation, gamete donors, donor-conceived offspring, and/or gamete banks, as well as the medical and public policy professionals who advise them. STUDY FUNDING/COMPETING INTEREST(S) No funding was provided for this project. The authors have no competing interests. TRIAL REGISTRATION NUMBER N/A.
Collapse
Affiliation(s)
- Audrey S Koh
- Department of Obstetrics, Gynecology and Reproductive Sciences, School of Medicine, University of California, San Francisco, San Francisco, CA, USA
| | - Henny M W Bos
- Research Institute of Child Development and Education, University of Amsterdam, Amsterdam, The Netherlands
| | - Esther D Rothblum
- Department of Women's Studies, San Diego State University, San Diego, CA, USA
- Williams Institute, UCLA School of Law, Los Angeles, CA, USA
| | - Nicola Carone
- Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy
| | - Nanette K Gartrell
- Research Institute of Child Development and Education, University of Amsterdam, Amsterdam, The Netherlands
- Williams Institute, UCLA School of Law, Los Angeles, CA, USA
| |
Collapse
|
15
|
Adams Z, Osman M, Bechlivanidis C, Meder B. (Why) Is Misinformation a Problem? Perspect Psychol Sci 2023; 18:1436-1463. [PMID: 36795592 PMCID: PMC10623619 DOI: 10.1177/17456916221141344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
In the last decade there has been a proliferation of research on misinformation. One important aspect of this work that receives less attention than it should is exactly why misinformation is a problem. To adequately address this question, we must first look to its speculated causes and effects. We examined different disciplines (computer science, economics, history, information science, journalism, law, media, politics, philosophy, psychology, sociology) that investigate misinformation. The consensus view points to advancements in information technology (e.g., the Internet, social media) as a main cause of the proliferation and increasing impact of misinformation, with a variety of illustrations of the effects. We critically analyzed both issues. As to the effects, misbehaviors are not yet reliably demonstrated empirically to be the outcome of misinformation; correlation as causation may have a hand in that perception. As to the cause, advancements in information technologies enable, as well as reveal, multitudes of interactions that represent significant deviations from ground truths through people's new way of knowing (intersubjectivity). This, we argue, is illusionary when understood in light of historical epistemology. Both doubts we raise are used to consider the cost to established norms of liberal democracy that come from efforts to target the problem of misinformation.
Collapse
Affiliation(s)
- Zoë Adams
- Department of Linguistics, School of Languages, Linguistics and Film, Queen Mary University London
| | - Magda Osman
- Centre for Science and Policy, University of Cambridge
- Judge Business School, University of Cambridge
- Leeds Business School, University of Leeds
| | | | - Björn Meder
- Department of Psychology, Health and Medical University, Potsdam, Germany
- Max Planck Research Group iSearch, Max Planck Institute for Human Development, Berlin, Germany
| |
Collapse
|
16
|
Qian C, Liu Y, Barnett-Neefs C, Salgia S, Serbetci O, Adalja A, Acharya J, Zhao Q, Ivanek R, Wiedmann M. A perspective on data sharing in digital food safety systems. Crit Rev Food Sci Nutr 2023; 63:12513-12529. [PMID: 35880485 DOI: 10.1080/10408398.2022.2103086] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
In this age of data, digital tools are widely promoted as having tremendous potential for enhancing food safety. However, the potential of these digital tools depends on the availability and quality of data, and a number of obstacles need to be overcome to achieve the goal of digitally enabled "smarter food safety" approaches. One key obstacle is that participants in the food system and in food safety often lack the willingness to share data, due to fears of data abuse, bad publicity, liability, and the need to keep certain data (e.g., human illness data) confidential. As these multifaceted concerns lead to tension between data utility and privacy, the solutions to these challenges need to be multifaceted. This review outlines the data needs in digital food safety systems, exemplified in different data categories and model types, and key concerns associated with sharing of food safety data, including confidentiality and privacy of shared data. To address the data privacy issue a combination of innovative strategies to protect privacy as well as legal protection against data abuse need to be pursued. Existing solutions for maximizing data utility, while not compromising data privacy, are discussed, most notably differential privacy and federated learning.
Collapse
Affiliation(s)
- Chenhao Qian
- Department of Food Science, Cornell University, Ithaca, NY, USA
| | - Yuhan Liu
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA
| | - Cecil Barnett-Neefs
- Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, NY, USA
| | - Sudeep Salgia
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA
| | - Omer Serbetci
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA
| | - Aaron Adalja
- SC Johnson College of Business, Cornell University, Ithaca, NY, USA
| | - Jayadev Acharya
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA
| | - Qing Zhao
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA
| | - Renata Ivanek
- Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, NY, USA
| | - Martin Wiedmann
- Department of Food Science, Cornell University, Ithaca, NY, USA
| |
Collapse
|
17
|
Kita K, Uemura K, Takao M, Fujimori T, Tamura K, Nakamura N, Wakabayashi G, Kurakami H, Suzuki Y, Wataya T, Nishigaki D, Okada S, Tomiyama N, Kido S. Use of artificial intelligence to identify data elements for The Japanese Orthopaedic Association National Registry from operative records. J Orthop Sci 2023; 28:1392-1399. [PMID: 36163118 DOI: 10.1016/j.jos.2022.09.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Revised: 08/09/2022] [Accepted: 09/06/2022] [Indexed: 11/19/2022]
Abstract
BACKGROUND The Japanese Orthopaedic Association National Registry (JOANR) was recently launched in Japan and is expected to improve the quality of medical care. However, surgeons must register ten detailed features for total hip arthroplasty, which is labor intensive. One possible solution is to use a system that automatically extracts information about the surgeries. Although it is not easy to extract features from an operative record consisting of free-text data, natural language processing has been used to extract features from operative records. This study aimed to evaluate the best natural language processing method for building a system that automatically detects some elements in the JOANR from the operative records of total hip arthroplasty. METHODS We obtained operative records of total hip arthroplasty (n = 2574) in three hospitals and targeted two items: surgical approach and fixation technique. We compared the accuracy of three natural language processing methods: rule-based algorithms, machine learning, and bidirectional encoder representations from transformers (BERT). RESULTS In the surgical approach task, the accuracy of BERT was superior to that of the rule-based algorithm (99.6% vs. 93.6%, p < 0.001), comparable to machine learning. In the fixation technique task, the accuracy of BERT was superior to the rule-based algorithm and machine learning (96% vs. 74%, p < 0.0001 and 94%, p = 0.0004). CONCLUSIONS BERT is the most appropriate method for building a system that automatically detects the surgical approach and fixation technique.
Collapse
Affiliation(s)
- Kosuke Kita
- Department of Artificial Intelligence Diagnostic Radiology, Graduate School of Medicine, Osaka University, Osaka, Japan.
| | - Keisuke Uemura
- Department of Orthopaedic Surgery, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Masaki Takao
- Department of Orthopaedic Surgery, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Takahito Fujimori
- Department of Orthopaedic Surgery, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Kazunori Tamura
- Department of Orthopaedic Surgery, Kyowakai Hospital, Osaka, Japan
| | - Nobuo Nakamura
- Department of Orthopaedic Surgery, Kyowakai Hospital, Osaka, Japan
| | - Gen Wakabayashi
- Department of Orthopaedic Surgery, Ikeda City Hospital, Osaka, Japan
| | - Hiroyuki Kurakami
- Department of Medical Innovation, Osaka University Hospital, Osaka, Japan
| | - Yuki Suzuki
- Department of Artificial Intelligence Diagnostic Radiology, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Tomohiro Wataya
- Department of Artificial Intelligence Diagnostic Radiology, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Daiki Nishigaki
- Department of Artificial Intelligence Diagnostic Radiology, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Seiji Okada
- Department of Orthopaedic Surgery, Graduate School of Medicine, Osaka University, Osaka, Japan
| | | | - Shoji Kido
- Department of Artificial Intelligence Diagnostic Radiology, Graduate School of Medicine, Osaka University, Osaka, Japan
| |
Collapse
|
18
|
Ayday E, Vaidya J, Jiang X, Telenti A. Ensuring Trust in Genomics Research. IEEE Int Conf Trust Priv Secur Intell Syst Appl 2023; 2023:1-12. [PMID: 38562180 PMCID: PMC10981793 DOI: 10.1109/tps-isa58951.2023.00011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Reproducibility, transparency, representation, and privacy underpin the trust on genomics research in general and genome-wide association studies (GWAS) in particular. Concerns about these issues can be mitigated by technologies that address privacy protection, quality control, and verifiability of GWAS. However, many of the existing technological solutions have been developed in isolation and may address one aspect of reproducibility, transparency, representation, and privacy of GWAS while unknowingly impacting other aspects. As a consequence, the current patchwork of technological tools only partially and in an overlapping manner address issues with GWAS, sometimes even creating more problems. This paper addresses the progress in a field that creates technological solutions that augment the acceptance and security of population genetic analyses. The text identifies areas that are falling behind in technical implementation or where there is insufficient research. We make the case that a full understanding of the different GWAS settings, technological tools and new research directions can holistically address the requirements for the acceptance of GWAS.
Collapse
Affiliation(s)
- Erman Ayday
- Department of Computer and Data Sciences Case Western Reserve University Cleveland, OH
| | - Jaideep Vaidya
- Management Science and Information Systems Department Rutgers University Newark, NJ
| | - Xiaoqian Jiang
- Department of Data Science and Artificial Intelligence University of Texas - Health Houston, TX
| | - Amalio Telenti
- Dept. of Integrative Structural and Computational Biology Scripps Institute La Jolla, CA
| |
Collapse
|
19
|
Xie L, Yu H, Xu K, Yang T, Wang M, Lu H, Xiong R, Wang Y. Learning a simulation-based visual policy for real-world peg in unseen holes. Rev Sci Instrum 2023; 94:105107. [PMID: 37812051 DOI: 10.1063/5.0168544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 09/15/2023] [Indexed: 10/10/2023]
Abstract
This paper proposes a learning-based visual peg-in-hole that enables training with several shapes in simulation and adapting to arbitrary unseen shapes in the real world with minimal sim-to-real cost. The core idea is to decouple the generalization of the sensory-motor policy from the design of a fast-adaptable perception module and a simulated generic policy module. The framework consists of a segmentation network (SN), a virtual sensor network (VSN), and a controller network (CN). Concretely, the VSN is trained to measure the pose of the unseen shape from a segmented image. After that, given the shape-agnostic pose measurement, the CN is trained to achieve a generic peg-in-hole. Finally, when applying to real unseen holes, we only have to fine-tune the SN required by the simulated VSN + CN. To further minimize the transfer cost, we propose to automatically collect and annotate the data for the SN after one-minute human teaching. Simulated and real-world results are presented under the configuration of eye-to/in-hand. An electric vehicle charging system with the proposed policy inside achieves a 10/10 success rate in 2-3 s, using only hundreds of auto-labeled samples for the SN transfer.
Collapse
Affiliation(s)
- Liang Xie
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| | - Hongxiang Yu
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| | - Kechun Xu
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| | - Tong Yang
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| | - Minhang Wang
- The Application Innovate Lab, Huawei Incorporated Company, Shenzhen, China
| | - Haojian Lu
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| | - Rong Xiong
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| | - Yue Wang
- College of Control Science and Engineering, Zhejiang University, Zhejiang, China
| |
Collapse
|
20
|
Nikam R, Yugandhar K, Gromiha MM. DeepBSRPred: deep learning-based binding site residue prediction for proteins. Amino Acids 2023; 55:1305-1316. [PMID: 36574037 DOI: 10.1007/s00726-022-03228-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 12/15/2022] [Indexed: 12/28/2022]
Abstract
MOTIVATION Proteins-protein interactions (PPIs) are important to govern several cellular activities. Amino acid residues, which are located at the interface are known as the binding sites and the information about binding sites helps to understand the binding affinities and functions of protein-protein complexes. RESULTS We have developed a deep neural network-based method, DeepBSRPred, for predicting the binding sites using protein sequence information and predicted structures from AlphaFold2. Specific sequence and structure-based features include position-specific scoring matrix (PSSM), solvent accessible surface area, conservation score and amino acid properties, and residue depth, respectively. Our method predicted the binding sites with an average F1 score of 0.73 in a dataset of 1236 proteins. Further, we compared the performance with other existing methods in the literature using four benchmark datasets and our method outperformed those methods. AVAILABILITY AND IMPLEMENTATION The DeepBSRPred web server can be found at https://web.iitm.ac.in/bioinfo2/deepbsrpred/index.html , along with all datasets used in this study. The trained models, the DeepBSRPred standalone source code, and the feature computation pipeline are freely available at https://web.iitm.ac.in/bioinfo2/deepbsrpred/download.html .
Collapse
Affiliation(s)
- Rahul Nikam
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Kumar Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
- Department of Computational Biology, Cornell University, New York, NY, USA
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India.
- Department of Computer Science, Tokyo Institute of Technology, Yokohama, Japan.
| |
Collapse
|
21
|
Sahiner B, Chen W, Samala RK, Petrick N. Data drift in medical machine learning: implications and potential remedies. Br J Radiol 2023; 96:20220878. [PMID: 36971405 PMCID: PMC10546450 DOI: 10.1259/bjr.20220878] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 02/16/2023] [Accepted: 02/20/2023] [Indexed: 03/29/2023] Open
Abstract
Data drift refers to differences between the data used in training a machine learning (ML) model and that applied to the model in real-world operation. Medical ML systems can be exposed to various forms of data drift, including differences between the data sampled for training and used in clinical operation, differences between medical practices or context of use between training and clinical use, and time-related changes in patient populations, disease patterns, and data acquisition, to name a few. In this article, we first review the terminology used in ML literature related to data drift, define distinct types of drift, and discuss in detail potential causes within the context of medical applications with an emphasis on medical imaging. We then review the recent literature regarding the effects of data drift on medical ML systems, which overwhelmingly show that data drift can be a major cause for performance deterioration. We then discuss methods for monitoring data drift and mitigating its effects with an emphasis on pre- and post-deployment techniques. Some of the potential methods for drift detection and issues around model retraining when drift is detected are included. Based on our review, we find that data drift is a major concern in medical ML deployment and that more research is needed so that ML models can identify drift early, incorporate effective mitigation strategies and resist performance decay.
Collapse
Affiliation(s)
- Berkman Sahiner
- Center for Devices and Radiological Health, U.S. Food and Drug Administration 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002
| | - Weijie Chen
- Center for Devices and Radiological Health, U.S. Food and Drug Administration 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002
| | - Ravi K. Samala
- Center for Devices and Radiological Health, U.S. Food and Drug Administration 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002
| | - Nicholas Petrick
- Center for Devices and Radiological Health, U.S. Food and Drug Administration 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002
| |
Collapse
|
22
|
Knudsen NS, Andersen TB. Morphology of possible regions in elite soccer players. Sports Biomech 2023; 22:1334-1347. [PMID: 32935633 DOI: 10.1080/14763141.2020.1797862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 06/19/2020] [Indexed: 10/23/2022]
Abstract
The popularity of spatio-temporal analyses in soccer is increasing. As many of these analyses depend on the regions a player can occupy in a certain amount of time (the possible regions), the understanding of this concepts is important for analyses to produce usable results. This study investigated how possible regions of soccer morph with varying times and running speeds. Twenty-four players from the Danish Superliga participated, and 13 players were analysed. The possible regions were analysed with times from 0.5 to 4 s (0.5 s increments) and initial velocities from 1 to 7 m/s (1 m/s increments). In this study, we showed that the possible regions can be described by ellipses (eccentricity of 0.5348 ± 0.1912). When comparing the possible region ellipses at every time and velocity pair, 1.95 % of the ellipses were not significantly different from the others. In conclusion, possible regions are unique in shape and size depending on player running speed and time available. However, as only few strikers participated, the results for this group should be interpreted with caution. Coaches can predict possible regions based on these parameters increasing precision of post-game analyses.
Collapse
|
23
|
Karnuta JM, Murphy MP, Luu BC, Ryan MJ, Haeberle HS, Brown NM, Iorio R, Chen AF, Ramkumar PN. Artificial Intelligence for Automated Implant Identification in Total Hip Arthroplasty: A Multicenter External Validation Study Exceeding Two Million Plain Radiographs. J Arthroplasty 2023; 38:1998-2003.e1. [PMID: 35271974 DOI: 10.1016/j.arth.2022.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 02/23/2022] [Accepted: 03/01/2022] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND The surgical management of complications after total hip arthroplasty (THA) necessitates accurate identification of the femoral implant manufacturer and model. Automated image processing using deep learning has been previously developed and internally validated; however, external validation is necessary prior to responsible application of artificial intelligence (AI)-based technologies. METHODS We trained, validated, and externally tested a deep learning system to classify femoral-sided THA implants as one of the 8 models from 2 manufacturers derived from 2,954 original, deidentified, retrospectively collected anteroposterior plain radiographs across 3 academic referral centers and 13 surgeons. From these radiographs, 2,117 were used for training, 249 for validation, and 588 for external testing. Augmentation was applied to the training set (n = 2,117,000) to increase model robustness. Performance was evaluated by area under the receiver operating characteristic curve, sensitivity, specificity, and accuracy. Implant identification processing speed was calculated. RESULTS The training and testing sets were drawn from statistically different populations of implants (P < .001). After 1,000 training epochs by the deep learning system, the system discriminated 8 implant models with a mean area under the receiver operating characteristic curve of 0.991, accuracy of 97.9%, sensitivity of 88.6%, and specificity of 98.9% in the external testing dataset of 588 anteroposterior radiographs. The software classified implants at a mean speed of 0.02 seconds per image. CONCLUSION An AI-based software demonstrated excellent internal and external validation. Although continued surveillance is necessary with implant library expansion, this software represents responsible and meaningful clinical application of AI with immediate potential to globally scale and assist in preoperative planning prior to revision THA.
Collapse
Affiliation(s)
- Jaret M Karnuta
- Orthopaedic Machine Learning Laboratory, Orthopaedic Intelligence LLC, Cleveland Heights, OH; Department of Orthopaedic Surgery, University of Pennsylvania, Philadelphia, PA
| | - Michael P Murphy
- Department of Orthopaedic Surgery & Rehabilitation, Loyola University Medical Center, Chicago, IL
| | - Bryan C Luu
- Orthopaedic Machine Learning Laboratory, Orthopaedic Intelligence LLC, Cleveland Heights, OH; Department of Orthopaedic Surgery, Baylor College of Medicine, Houston, TX
| | - Michael J Ryan
- Orthopaedic Machine Learning Laboratory, Orthopaedic Intelligence LLC, Cleveland Heights, OH
| | - Heather S Haeberle
- Orthopaedic Machine Learning Laboratory, Orthopaedic Intelligence LLC, Cleveland Heights, OH; Sports Medicine Institute, Hospital for Special Surgery, New York, NY
| | - Nicholas M Brown
- Department of Orthopaedic Surgery & Rehabilitation, Loyola University Medical Center, Chicago, IL
| | - Richard Iorio
- Department of Orthopaedic Surgery, Brigham & Women's Hospital, Boston, MA
| | - Antonia F Chen
- Department of Orthopaedic Surgery, Brigham & Women's Hospital, Boston, MA
| | - Prem N Ramkumar
- Orthopaedic Machine Learning Laboratory, Orthopaedic Intelligence LLC, Cleveland Heights, OH; Sports Medicine Institute, Hospital for Special Surgery, New York, NY; Department of Orthopaedic Surgery, Brigham & Women's Hospital, Boston, MA
| |
Collapse
|
24
|
Situmorang DDB. "Rapid tele-psychotherapy" with single-session music therapy in the metaverse: An alternative solution for mental health services in the future. Palliat Support Care 2023; 21:944-945. [PMID: 36218066 DOI: 10.1017/s1478951522001420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
25
|
Malik S, Khan MJ, Khan MA, El-Sayed H. Collaborative Perception-The Missing Piece in Realizing Fully Autonomous Driving. Sensors (Basel) 2023; 23:7854. [PMID: 37765911 PMCID: PMC10535382 DOI: 10.3390/s23187854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 05/23/2023] [Accepted: 05/25/2023] [Indexed: 09/29/2023]
Abstract
Environment perception plays a crucial role in enabling collaborative driving automation, which is considered to be the ground-breaking solution to tackling the safety, mobility, and sustainability challenges of contemporary transportation systems. Despite the fact that computer vision for object perception is undergoing an extraordinary evolution, single-vehicle systems' constrained receptive fields and inherent physical occlusion make it difficult for state-of-the-art perception techniques to cope with complex real-world traffic settings. Collaborative perception (CP) based on various geographically separated perception nodes was developed to break the perception bottleneck for driving automation. CP leverages vehicle-to-vehicle and vehicle-to-infrastructure communication to enable vehicles and infrastructure to combine and share information to comprehend the surrounding environment beyond the line of sight and field of view to enhance perception accuracy, lower latency, and remove perception blind spots. In this article, we highlight the need for an evolved version of the collaborative perception that should address the challenges hindering the realization of level 5 AD use cases by comprehensively studying the transition from classical perception to collaborative perception. In particular, we discuss and review perception creation at two different levels: vehicle and infrastructure. Furthermore, we also study the communication technologies and three different collaborative perception message-sharing models, their comparison analyzing the trade-off between the accuracy of the transmitted data and the communication bandwidth used for data transmission, and the challenges therein. Finally, we discuss a range of crucial challenges and future directions of collaborative perception that need to be addressed before a higher level of autonomy hits the roads.
Collapse
Affiliation(s)
- Sumbal Malik
- College of Information Technology, United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
- Emirates Center for Mobility Research (ECMR), United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
| | - Muhammad Jalal Khan
- College of Information Technology, United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
- Emirates Center for Mobility Research (ECMR), United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
| | - Manzoor Ahmed Khan
- College of Information Technology, United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
- Emirates Center for Mobility Research (ECMR), United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
| | - Hesham El-Sayed
- College of Information Technology, United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
- Emirates Center for Mobility Research (ECMR), United Arab Emirates University, Abu Dhabi 15551, United Arab Emirates
| |
Collapse
|
26
|
Mohseni N, Ghaniee Zarich M, Afshar S, Hosseini M. Identification of Novel Biomarkers for Response to Preoperative Chemoradiation in Locally Advanced Rectal Cancer with Genetic Algorithm-Based Gene Selection. J Gastrointest Cancer 2023; 54:937-950. [PMID: 36534304 DOI: 10.1007/s12029-022-00873-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/05/2022] [Indexed: 12/23/2022]
Abstract
BACKGROUND The conventional treatment for patients with locally advanced colorectal tumors is preoperative chemo-radiotherapy (PCRT) preceding surgery. This treatment strategy has some long-term side effects, and some patients do not respond to it. Therefore, an evaluation of biomarkers that may help predict patients' response to PCRT is essential. METHODS We took advantage of genetic algorithm to search the space of possible combinations of features to choose subsets of genes that would yield convenient performance in differentiating PCRT responders from non-responders using a logistic regression model as our classifier. RESULTS We developed two gene signatures; first, to achieve the maximum prediction accuracy, the algorithm yielded 39 genes, and then, aiming to reduce the feature numbers as much as possible (while maintaining acceptable performance), a 5-gene signature was chosen. The performance of the two gene signatures was (accuracy = 0.97 and 0.81, sensitivity = 0.96 and 0.83, and specificity = 86 and 0.77) using a logistic regression classifier. Through analyzing bias and variance decomposition of the model error, we further investigated the involved genes by discovering and validating another 28-gene signature which possibly points towards two different sub-systems involved in the response of the patients to treatment. CONCLUSIONS Using genetic algorithm as our gene selection method, we have identified two groups of genes that can differentiate PCRT responders from non-responders in patients of the studied dataset with considerable performance. IMPACT After passing standard requirements, our gene signatures may be applicable as a robust and effective PCRT response prediction tool for colorectal cancer patients in clinical settings and may also help future studies aiming to further investigate involved pathways gain a clearer picture for the course of their research.
Collapse
Affiliation(s)
- Nima Mohseni
- Department of Biology, Faculty of Science, Lund University, Skåne, Sweden
| | | | - Saeid Afshar
- Research Center for Molecular Medicine, Hamadan University of Medical Sciences, Hamadan, Iran.
| | | |
Collapse
|
27
|
Mitchell ARJ, Ahlert D, Brown C, Birge M, Gibbs A. Electrocardiogram-based biometrics for user identification - Using your heartbeat as a digital key. J Electrocardiol 2023; 80:1-6. [PMID: 37058746 DOI: 10.1016/j.jelectrocard.2023.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 02/22/2023] [Accepted: 04/04/2023] [Indexed: 04/16/2023]
Abstract
External biometrics such as thumbprint and facial recognition have become standard tools for securing our digital devices and protecting our data. These systems, however, are potentially prone to copying and cybercrime access. Researchers have therefore explored internal biometrics, such as the electrical patterns within an electrocardiogram (ECG). The heart's electrical signals carry sufficient distinctiveness to allow the ECG to be used as an internal biometric for user authentication and identification. Using the ECG in this way has many potential advantages and limitations. This article reviews the history of ECG biometrics and explores some of the technical and security considerations. It also explores current and future uses of the ECG as an internal biometric.
Collapse
Affiliation(s)
| | | | - Chris Brown
- The Allan Lab, Jersey General Hospital, Jersey
| | - Max Birge
- The Allan Lab, Jersey General Hospital, Jersey
| | | |
Collapse
|
28
|
Timmons AC, Duong JB, Fiallo NS, Lee T, Vo HPQ, Ahle MW, Comer JS, Brewer LC, Frazier SL, Chaspari T. A Call to Action on Assessing and Mitigating Bias in Artificial Intelligence Applications for Mental Health. Perspect Psychol Sci 2023; 18:1062-1096. [PMID: 36490369 PMCID: PMC10250563 DOI: 10.1177/17456916221134490] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Advances in computer science and data-analytic methods are driving a new era in mental health research and application. Artificial intelligence (AI) technologies hold the potential to enhance the assessment, diagnosis, and treatment of people experiencing mental health problems and to increase the reach and impact of mental health care. However, AI applications will not mitigate mental health disparities if they are built from historical data that reflect underlying social biases and inequities. AI models biased against sensitive classes could reinforce and even perpetuate existing inequities if these models create legacies that differentially impact who is diagnosed and treated, and how effectively. The current article reviews the health-equity implications of applying AI to mental health problems, outlines state-of-the-art methods for assessing and mitigating algorithmic bias, and presents a call to action to guide the development of fair-aware AI in psychological science.
Collapse
Affiliation(s)
- Adela C. Timmons
- University of Texas at Austin Institute for Mental Health Research
- Colliga Apps Corporation
| | | | | | | | | | | | | | - LaPrincess C. Brewer
- Department of Cardiovascular Medicine, May Clinic College of Medicine, Rochester, Minnesota, United States
- Center for Health Equity and Community Engagement Research, Mayo Clinic, Rochester, Minnesota, United States
| | | | | |
Collapse
|
29
|
Feier JS, Nguyen K, Choi JS. Twitter Perspectives on Cochlear Implantation: Sentiment and Thematic Analysis. Otolaryngol Head Neck Surg 2023; 169:642-650. [PMID: 36939425 DOI: 10.1002/ohn.292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 01/13/2023] [Accepted: 01/21/2023] [Indexed: 02/07/2023]
Abstract
OBJECTIVE To identify themes that contribute to the most positive and negative perspectives of cochlear implants (CIs) on Twitter. STUDY DESIGN A cross-sectional qualitative study. SETTING Social media (Twitter). METHODS In this study, all English-language original tweets mentioning "cochlear implant" from 2019 to 2021 were collected from Twitter's Academic Research Database using a custom Python script. Sentiment analysis was performed using VADER, a sentiment analysis tool built to analyze the inherent positivity or negativity of social media posts. Tweets were quantitatively sorted by compound sentiment score (range -1 [most negative] to 1 [most positive]). Thematic analysis based on grounded theory was performed on the most positive, negative, and "liked" tweets. RESULTS Of the 19,376 tweets included, the average (standard deviation) compound sentiment score was 0.21 (0.46). A total of 10,375 (53.5%) tweets had a positive tone, 4965 (25.6%) were neutral and 4036 (20.8%) were negative. Of the 100 most negative tweets, the most prominent themes were media representation (21.9%), the controversy of CI within the Deaf community (19.8%), and unrelated to direct patient experience (16.7%). Of the 100 most positive tweets, the most prominent themes were tweets of happiness and support (25.0%), tweets unrelated to direct patient experience (18.0%), and tweets about hearing new sounds (10.0%). CONCLUSION While the majority of tweets on CI carried a positive tone, there are ongoing challenges with the stigma surrounding CI. Public perspectives of CI on social media may help clinicians counsel CI patients and elucidate issues that lead to newfound acceptance or ongoing stigma of CI in the general population.
Collapse
Affiliation(s)
- Joel S Feier
- Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA
| | - Kenny Nguyen
- Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA
| | - Janet S Choi
- Department of Otolaryngology-Head and Neck Surgery, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
30
|
Dwivedi YK, Kshetri N, Hughes L, Slade EL, Jeyaraj A, Kar AK, Baabdullah AM, Koohang A, Raghavan V, Ahuja M, Albanna H, Albashrawi MA, Al-Busaidi AS, Balakrishnan J, Barlette Y, Basu S, Bose I, Brooks L, Buhalis D, Carter L, Chowdhury S, Crick T, Cunningham SW, Davies GH, Davison RM, Dé R, Dennehy D, Duan Y, Dubey R, Dwivedi R, Edwards JS, Flavián C, Gauld R, Grover V, Hu MC, Janssen M, Jones P, Junglas I, Khorana S, Kraus S, Larsen KR, Latreille P, Laumer S, Malik FT, Mardani A, Mariani M, Mithas S, Mogaji E, Nord JH, O’Connor S, Okumus F, Pagani M, Pandey N, Papagiannidis S, Pappas IO, Pathak N, Pries-Heje J, Raman R, Rana NP, Rehm SV, Ribeiro-Navarrete S, Richter A, Rowe F, Sarker S, Stahl BC, Tiwari MK, van der Aalst W, Venkatesh V, Viglia G, Wade M, Walton P, Wirtz J, Wright R. “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy. International Journal of Information Management 2023. [DOI: 10.1016/j.ijinfomgt.2023.102642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2023]
|
31
|
Kun S, Qiuying W, Xiaofei L. An interpretable measure of semantic similarity for predicting eye movements in reading. Psychon Bull Rev 2023; 30:1227-1242. [PMID: 36732445 PMCID: PMC10482772 DOI: 10.3758/s13423-022-02240-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/12/2022] [Indexed: 02/04/2023]
Abstract
Predictions about upcoming content play an important role during language comprehension and processing. Semantic similarity as a metric has been used to predict how words are processed in context in language comprehension and processing tasks. This study proposes a novel, dynamic approach for computing contextual semantic similarity, evaluates the extent to which the semantic similarity measures computed using this approach can predict fixation durations in reading tasks recorded in a corpus of eye-tracking data, and compares the performance of these measures to that of semantic similarity measures computed using the cosine and Euclidean methods. Our results reveal that the semantic similarity measures generated by our approach are significantly predictive of fixation durations on reading and outperform those generated by the two existing approaches. The findings of this study contribute to a better understanding of how humans process words in context and make predictions in language comprehension and processing. The effective and interpretable approach to computing contextual semantic similarity proposed in this study can also facilitate further explorations of other experimental data on language comprehension and processing.
Collapse
Affiliation(s)
- Sun Kun
- Department of Linguistics, University of Tübingen, Tübingen, Germany.
| | - Wang Qiuying
- School of Teaching, Learning and Educational Sciences, Oklahoma State University, Stillwater, United States
| | - Lu Xiaofei
- Department of Applied Linguistics, The Pennsylvania State University, University Park, United States
| |
Collapse
|
32
|
Wang S, Dang Y, Sun Z, Ding Y, Pathak J, Tao C, Xiao Y, Peng Y. An NLP approach to identify SDoH-related circumstance and suicide crisis from death investigation narratives. J Am Med Inform Assoc 2023; 30:1408-1417. [PMID: 37040620 PMCID: PMC10354765 DOI: 10.1093/jamia/ocad068] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 03/10/2023] [Accepted: 03/29/2023] [Indexed: 04/13/2023] Open
Abstract
OBJECTIVES Suicide presents a major public health challenge worldwide, affecting people across the lifespan. While previous studies revealed strong associations between Social Determinants of Health (SDoH) and suicide deaths, existing evidence is limited by the reliance on structured data. To resolve this, we aim to adapt a suicide-specific SDoH ontology (Suicide-SDoHO) and use natural language processing (NLP) to effectively identify individual-level SDoH-related social risks from death investigation narratives. MATERIALS AND METHODS We used the latest National Violent Death Report System (NVDRS), which contains 267 804 victim suicide data from 2003 to 2019. After adapting the Suicide-SDoHO, we developed a transformer-based model to identify SDoH-related circumstances and crises in death investigation narratives. We applied our model retrospectively to annotate narratives whose crisis variables were not coded in NVDRS. The crisis rates were calculated as the percentage of the group's total suicide population with the crisis present. RESULTS The Suicide-SDoHO contains 57 fine-grained circumstances in a hierarchical structure. Our classifier achieves AUCs of 0.966 and 0.942 for classifying circumstances and crises, respectively. Through the crisis trend analysis, we observed that not everyone is equally affected by SDoH-related social risks. For the economic stability crisis, our result showed a significant increase in crisis rate in 2007-2009, parallel with the Great Recession. CONCLUSIONS This is the first study curating a Suicide-SDoHO using death investigation narratives. We showcased that our model can effectively classify SDoH-related social risks through NLP approaches. We hope our study will facilitate the understanding of suicide crises and inform effective prevention strategies.
Collapse
Affiliation(s)
- Song Wang
- Cockrell School of Engineering, The University of Texas at Austin, Austin, Texas, USA
| | - Yifang Dang
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Zhaoyi Sun
- Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Ying Ding
- School of Information, The University of Texas at Austin, Austin, Texas, USA
| | - Jyotishman Pathak
- Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Cui Tao
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Yunyu Xiao
- Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Yifan Peng
- Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| |
Collapse
|
33
|
Morís DI, de Moura J, Marcos PJ, Rey EM, Novo J, Ortega M. Comprehensive analysis of clinical data for COVID-19 outcome estimation with machine learning models. Biomed Signal Process Control 2023; 84:104818. [PMID: 36915863 PMCID: PMC9995330 DOI: 10.1016/j.bspc.2023.104818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 11/22/2022] [Accepted: 03/05/2023] [Indexed: 03/11/2023]
Abstract
COVID-19 is a global threat for the healthcare systems due to the rapid spread of the pathogen that causes it. In such situation, the clinicians must take important decisions, in an environment where medical resources can be insufficient. In this task, the computer-aided diagnosis systems can be very useful not only in the task of supporting the clinical decisions but also to perform relevant analyses, allowing them to understand better the disease and the factors that can identify the high risk patients. For those purposes, in this work, we use several machine learning algorithms to estimate the outcome of COVID-19 patients given their clinical information. Particularly, we perform 2 different studies: the first one estimates whether the patient is at low or at high risk of death whereas the second estimates if the patient needs hospitalization or not. The results of the analyses of this work show the most relevant features for each studied scenario, as well as the classification performance of the considered machine learning models. In particular, the XGBoost algorithm is able to estimate the need for hospitalization of a patient with an AUC-ROC of 0 . 8415 ± 0 . 0217 while it can also estimate the risk of death with an AUC-ROC of 0 . 7992 ± 0 . 0104 . Results have demonstrated the great potential of the proposal to determine those patients that need a greater amount of medical resources for being at a higher risk. This provides the healthcare services with a tool to better manage their resources.
Collapse
Affiliation(s)
- Daniel I Morís
- Centro de Investigación CITIC, Universidade da Coruña, Campus de Elviña, s/n, 15071 A Coruña, Spain.,Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, Xubias de Arriba, 84, 15006 A Coruña, Spain
| | - Joaquim de Moura
- Centro de Investigación CITIC, Universidade da Coruña, Campus de Elviña, s/n, 15071 A Coruña, Spain.,Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, Xubias de Arriba, 84, 15006 A Coruña, Spain
| | - Pedro J Marcos
- Dirección Asistencial y Servicio de Neumología, Complejo Hospitalario Universitario de A Coruña (CHUAC), Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, Sergas, 15006 A Coruña, Spain
| | - Enrique Míguez Rey
- Grupo de Investigación en Virología Clínica, Sección de Enfermedades Infecciosas, Servicio de Medicina Interna, Instituto de Investigación Biomédica de A Coruña (INIBIC), Área Sanitaria A Coruña y CEE (ASCC), SERGAS, 15006 A Coruña, Spain
| | - Jorge Novo
- Centro de Investigación CITIC, Universidade da Coruña, Campus de Elviña, s/n, 15071 A Coruña, Spain.,Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, Xubias de Arriba, 84, 15006 A Coruña, Spain
| | - Marcos Ortega
- Centro de Investigación CITIC, Universidade da Coruña, Campus de Elviña, s/n, 15071 A Coruña, Spain.,Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, Xubias de Arriba, 84, 15006 A Coruña, Spain
| |
Collapse
|
34
|
Hu C, Wang H, Zhang W, Xie Y, Jiao L, Cui S. TrDosePred: A deep learning dose prediction algorithm based on transformers for head and neck cancer radiotherapy. J Appl Clin Med Phys 2023; 24:e13942. [PMID: 36867441 PMCID: PMC10338766 DOI: 10.1002/acm2.13942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 01/18/2023] [Accepted: 01/24/2023] [Indexed: 03/04/2023] Open
Abstract
BACKGROUND Intensity-Modulated Radiation Therapy (IMRT) has been the standard of care for many types of tumors. However, treatment planning for IMRT is a time-consuming and labor-intensive process. PURPOSE To alleviate this tedious planning process, a novel deep learning based dose prediction algorithm (TrDosePred) was developed for head and neck cancers. METHODS The proposed TrDosePred, which generated the dose distribution from a contoured CT image, was a U-shape network constructed with a convolutional patch embedding and several local self-attention based transformers. Data augmentation and ensemble approach were used for further improvement. It was trained based on the dataset from Open Knowledge-Based Planning Challenge (OpenKBP). The performance of TrDosePred was evaluated with two mean absolute error (MAE) based scores utilized by OpenKBP challenge (i.e., Dose score and DVH score) and compared to the top three approaches of the challenge. In addition, several state-of-the-art methods were implemented and compared to TrDosePred. RESULTS The TrDosePred ensemble achieved the dose score of 2.426 Gy and the DVH score of 1.592 Gy on the test dataset, ranking at 3rd and 9th respectively in the leaderboard on CodaLab as of writing. In terms of DVH metrics, on average, the relative MAE against the clinical plans was 2.25% for targets and 2.17% for organs at risk. CONCLUSIONS A transformer-based framework TrDosePred was developed for dose prediction. The results showed a comparable or superior performance as compared to the previous state-of-the-art approaches, demonstrating the potential of transformer to boost the treatment planning procedures.
Collapse
Affiliation(s)
- Chenchen Hu
- Institute of Radiation MedicineChinese Academy of Medical Sciences and Peking Union Medical CollegeTianjinChina
| | - Haiyun Wang
- Institute of Radiation MedicineChinese Academy of Medical Sciences and Peking Union Medical CollegeTianjinChina
| | - Wenyi Zhang
- Institute of Radiation MedicineChinese Academy of Medical Sciences and Peking Union Medical CollegeTianjinChina
| | - Yaoqin Xie
- Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenChina
| | - Ling Jiao
- Institute of Radiation MedicineChinese Academy of Medical Sciences and Peking Union Medical CollegeTianjinChina
| | - Songye Cui
- Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenChina
- Department of Medical PhysicsMemorial Sloan Kettering Cancer CenterNew YorkUSA
| |
Collapse
|
35
|
Bai H, Yu X, Yan Z, Zhang J, Yang LT. VeriORouting: Verification on intelligent routing outsourced to the cloud. Inf Sci (N Y) 2023; 633:410-430. [DOI: 10.1016/j.ins.2023.03.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2023]
|
36
|
Tolosa G, Mallia A. Many are Better than One: Algorithm Selection for Faster Top-K Retrieval. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2023.103359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
|
37
|
Hu Z, Yang P, Li B, Sun Y, Yang B. Biomedical extractive question answering based on dynamic routing and answer voting. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2023.103367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/07/2023]
|
38
|
Dey A, Kumar BR, Das B, Ghoshal AK. Outlier detection in social networks leveraging community structure. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.03.120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
|
39
|
Yang Y, Zhao J. Which part of a picture is worth a thousand words: A joint framework for finding and visualizing critical linear features from images. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2023.103370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2023]
|
40
|
Amshi AH, Prasad R. Time series analysis and forecasting of cholera disease using discrete wavelet transform and seasonal autoregressive integrated moving average model. Scientific African 2023. [DOI: 10.1016/j.sciaf.2023.e01652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023] Open
|
41
|
Gao J, Peng P, Lu F, Claramunt C, Xu Y. Towards travel recommendation interpretability: Disentangling tourist decision-making process via knowledge graph. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2023.103369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]
|
42
|
Sadhuka S, Fridman D, Berger B, Cho H. Assessing transcriptomic reidentification risks using discriminative sequence models. Genome Res 2023; 33:1101-1112. [PMID: 37541758 PMCID: PMC10538488 DOI: 10.1101/gr.277699.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 04/19/2023] [Indexed: 08/06/2023]
Abstract
Gene expression data provide molecular insights into the functional impact of genetic variation, for example, through expression quantitative trait loci (eQTLs). With an improving understanding of the association between genotypes and gene expression comes a greater concern that gene expression profiles could be matched to genotype profiles of the same individuals in another data set, known as a linking attack. Prior works show such a risk could analyze only a fraction of eQTLs that is independent owing to restrictive model assumptions, leaving the full extent of this risk incompletely understood. To address this challenge, we introduce the discriminative sequence model (DSM), a novel probabilistic framework for predicting a sequence of genotypes based on gene expression data. By modeling the joint distribution over all known eQTLs in a genomic region, DSM improves the power of linking attacks with necessary calibration for linkage disequilibrium and redundant predictive signals. We show greater linking accuracy of DSM compared with existing approaches across a range of attack scenarios and data sets including up to 22,288 individuals, suggesting that DSM helps uncover a substantial additional risk overlooked by previous studies. Our work provides a unified framework for assessing the privacy risks of sharing diverse omics data sets beyond transcriptomics.
Collapse
Affiliation(s)
- Shuvom Sadhuka
- Computer Science and AI Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Daniel Fridman
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Bonnie Berger
- Computer Science and AI Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Hyunghoon Cho
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
| |
Collapse
|
43
|
Kim J, Jeong HJ, Lim S, Kim J. Effective and efficient core computation in signed networks. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.03.097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
44
|
Sahmoud S, Topcuoglu HR. Dynamic multi-objective evolutionary algorithms in noisy environments. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.03.094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
45
|
Liu W, Zhang Y, Yang H, Meng Q. A Survey on Differential Privacy for Medical Data Analysis. Ann Data Sci 2023; 11:1-15. [PMID: 38625247 PMCID: PMC10257172 DOI: 10.1007/s40745-023-00475-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/16/2023] [Accepted: 05/22/2023] [Indexed: 12/01/2023]
Abstract
Machine learning methods promote the sustainable development of wise information technology of medicine (WITMED), and a variety of medical data brings high value and convenience to medical analysis. However, the applications of medical data have also been confronted with the risk of privacy leakage that is hard to avoid, especially when conducting correlation analysis or data sharing among multiple institutions. Data security and privacy preservation have recently played an essential role in the field of secure and private medical data analysis, where many differential privacy strategies are applied to medical data publishing and mining. In this paper, we survey research work on the applications of differential privacy for medical data analysis, discussing the necessity of medical privacy-preserving, the advantages of differential privacy, and their applications to typical medical data, such as genomic data and wearable device data. Furthermore, we discuss the challenges and potential future research directions for differential privacy in medical applications.
Collapse
Affiliation(s)
- WeiKang Liu
- Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou, China
| | - Yanchun Zhang
- Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou, China
- Institute of Sustainable Industries and Liveable Cities, Victoria University, Melbourne, Australia
| | - Hong Yang
- Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou, China
| | - Qinxue Meng
- College of Information Engineering, Suzhou University, Suzhou, China
| |
Collapse
|
46
|
Zhang X, Min F, Song G, Yu H. LSTC: When label-specific features meet third-order label correlations. Inf Sci (N Y) 2023; 632:617-636. [DOI: 10.1016/j.ins.2023.03.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2023]
|
47
|
Rimbeck M, Reil H, Stumpf-Wollersheim J, Leyer M. How the Internet of Things is reshaping teamwork: An experimental study. COMPUT IND 2023. [DOI: 10.1016/j.compind.2023.103902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
|
48
|
Yuan F, Zhou W, Dodge HH, Zhao X. Short: Causal structural learning of conversational engagement for socially isolated older adults. Smart Health (Amst) 2023; 28:100384. [PMID: 37065441 PMCID: PMC10101035 DOI: 10.1016/j.smhl.2023.100384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
Abstract
Social isolation has become a growing public health concern in older adults and older adults with mild cognitive impairment. Coping strategies must be developed to increase social contact for socially isolated older adults. In this paper, we explored the conversational strategy between trained conversation moderators and socially isolated adults during a conversational engagement clinical trial (Clinicaltrials.gov: NCT02871921). We carried out structural learning and causality analysis to investigate the conversation strategies used by the trained moderators to engage socially isolated adults in the conversation and the causal effects of the strategy on engagement. Causal relations and effects were inferred between participants' emotions, the dialogue strategies used by moderators, and participants' following emotions. The results found in this paper may be used to support the development of cost-efficient, trustworthy AI- and/or robot-based platform to promote conversational engagement for older adults to address the challenges in social interaction.
Collapse
Affiliation(s)
- Fengpei Yuan
- Department of Mechanical, Aerospace and Biomedical Engineering, The University of Tennessee Knoxville, 1512 Middle Drive, Knoxville, TN, 37996, USA
| | - Wenjun Zhou
- Department of Business Analytics and Statistics, The University of Tennessee Knoxville, 916 Volunteer Blvd., Knoxville, TN, 37996, USA
| | - Hiroko H. Dodge
- Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA
| | - Xiaopeng Zhao
- Department of Mechanical, Aerospace and Biomedical Engineering, The University of Tennessee Knoxville, 1512 Middle Drive, Knoxville, TN, 37996, USA
| |
Collapse
|
49
|
Du L, He J, Li T, Wang Y, Lan X, Huang Y. DBWE-Corbat: Background network traffic generation using dynamic word embedding and contrastive learning for cyber range. Comput Secur 2023. [DOI: 10.1016/j.cose.2023.103202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
|
50
|
Gyngell C, Lynch F, Vears D, Bowman-Smart H, Savulescu J, Christodoulou J. Storing paediatric genomic data for sequential interrogation across the lifespan. J Med Ethics 2023:jme-2022-108471. [PMID: 37263770 DOI: 10.1136/jme-2022-108471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 03/02/2023] [Indexed: 06/03/2023]
Abstract
Genomic sequencing (GS) is increasingly used in paediatric medicine to aid in screening, research and treatment. Some health systems are trialling GS as a first-line test in newborn screening programmes. Questions about what to do with genomic data after it has been generated are becoming more pertinent. While other research has outlined the ethical reasons for storing deidentified genomic data to be used in research, the ethical case for storing data for future clinical use has not been explicated. In this paper, we examine the ethical case for storing genomic data with the intention of using it as a lifetime health resource. In this model, genomic data would be stored with the intention of reanalysis at certain points through one's life. We argue this could benefit individuals and create an important public resource. However, several ethical challenges must first be met to achieve these benefits. We explore issues related to privacy, consent, justice and equality. We conclude by arguing that health systems should be moving towards futures that allow for the sequential interrogation of genomic data throughout the lifespan.
Collapse
Affiliation(s)
- Christopher Gyngell
- Biomedical Ethics Research Group, Murdoch Children's Research Institute, Parkville, Victoria, Australia
- Department of Paediatrics, The University of Melbourne, Melbourne, Victoria, Australia
| | - Fiona Lynch
- Biomedical Ethics Research Group, Murdoch Children's Research Institute, Parkville, Victoria, Australia
- Melbourne Law School, The University of Melbourne, Parkville, VIC, Australia
| | - Danya Vears
- Biomedical Ethics Research Group, Murdoch Children's Research Institute, Parkville, Victoria, Australia
- Department of Paediatrics, The University of Melbourne, Melbourne, Victoria, Australia
| | - Hilary Bowman-Smart
- Biomedical Ethics Research Group, Murdoch Children's Research Institute, Parkville, Victoria, Australia
- University of South Australia, Adeliade, South Australia, Australia
| | - Julian Savulescu
- Biomedical Ethics Research Group, Murdoch Children's Research Institute, Parkville, Victoria, Australia
- Faculty of Philosophy, University of Oxford, Oxford, UK
- Centre for Biomedical Ethics - Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - John Christodoulou
- Department of Paediatrics, The University of Melbourne, Melbourne, Victoria, Australia
- Brain and Mitochondrial Research Group, Murdoch Children's Research Institute, Parkville, VIC, Australia
| |
Collapse
|