1
|
Deep learning-based method for sentiment analysis for patients' drug reviews. PeerJ Comput Sci 2024; 10:e1976. [PMID: 38699208 PMCID: PMC11065412 DOI: 10.7717/peerj-cs.1976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 03/11/2024] [Indexed: 05/05/2024]
Abstract
This article explores the application of deep learning techniques for sentiment analysis of patients' drug reviews. The main focus is to evaluate the effectiveness of bidirectional long-short-term memory (LSTM) and a hybrid model (bidirectional LSTM-CNN) for sentiment classification based on the entire review text, medical conditions, and rating scores. This study also investigates the impact of using GloVe word embeddings on the model's performance. Two different drug review datasets were used to train and test the models. The proposed methodology involves the implementation and evaluation of both deep learning models with the GloVe word embeddings for sentiment analysis of drug reviews. The experimental results indicate that Model A (Bi-LSTM-CNN) achieved an accuracy of 96% and Model B (Bi-LSTM-CNN) performs consistently at 87% for accuracy. Notably, the incorporation of GloVe word representations improves the overall performance of the models, as supported by Cohen's Kappa coefficient, indicating a high level of agreement. These findings showed the efficacy of deep learning-based approaches, particularly bidirectional LSTM and bidirectional LSTM-CNN, for sentiment analysis of patients' drug reviews.
Collapse
|
2
|
Temporal meta-optimiser based sensitivity analysis (TMSA) for agent-based models and applications in children's services. Sci Rep 2024; 14:9105. [PMID: 38643325 PMCID: PMC11032329 DOI: 10.1038/s41598-024-59743-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 04/15/2024] [Indexed: 04/22/2024] Open
Abstract
With current and predicted economic pressures within English Children's Services in the UK, there is a growing discourse around the development of methods of analysis using existing data to make more effective interventions and policy decisions. Agent-Based modelling shows promise in aiding in this, with limitations that require novel methods to overcome. This can include challenges in managing model complexity, transparency, and validation; which may deter analysts from implementing such Agent-Based simulations. Children's Services specifically can gain from the expansion of modelling techniques available to them. Sensitivity analysis is a common step when analysing models that currently has methods with limitations regarding Agent-Based Models. This paper outlines an improved method of conducting Sensitivity Analysis to enable better utilisation of Agent-Based models (ABMs) within Children's Services. By using machine learning based regression in conjunction with the Nomadic Peoples Optimiser (NPO) a method of conducting sensitivity analysis tailored for ABMs is achieved. This paper demonstrates the effectiveness of the approach by drawing comparisons with common existing methods of sensitivity analysis, followed by a demonstration of an improved ABM design in the target use case.
Collapse
|
3
|
Laser communications system with drones as relay medium for healthcare applications. PeerJ Comput Sci 2024; 10:e1759. [PMID: 38435606 PMCID: PMC10909153 DOI: 10.7717/peerj-cs.1759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 11/26/2023] [Indexed: 03/05/2024]
Abstract
This article introduces a prototype laser communication system integrated with uncrewed aerial vehicles (UAVs), aimed at enhancing data connectivity in remote healthcare applications. Traditional radio frequency systems are limited by their range and reliability, particularly in challenging environments. By leveraging UAVs as relay points, the proposed system seeks to address these limitations, offering a novel solution for real-time, high-speed data transmission. The system has been empirically tested, showcasing its ability to maintain data transmission integrity under various conditions. Results indicate a substantial improvement in connectivity, with high data transmission success rate (DTSR) scores, even amidst environmental disturbances. This study underscores the system's potential for critical applications such as emergency response, public health monitoring, and extending services to remote or underserved areas.
Collapse
|
4
|
Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases. Diagnostics (Basel) 2024; 14:144. [PMID: 38248021 PMCID: PMC10813849 DOI: 10.3390/diagnostics14020144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 12/21/2023] [Accepted: 12/25/2023] [Indexed: 01/23/2024] Open
Abstract
Cardiovascular diseases present a significant global health challenge that emphasizes the critical need for developing accurate and more effective detection methods. Several studies have contributed valuable insights in this field, but it is still necessary to advance the predictive models and address the gaps in the existing detection approaches. For instance, some of the previous studies have not considered the challenge of imbalanced datasets, which can lead to biased predictions, especially when the datasets include minority classes. This study's primary focus is the early detection of heart diseases, particularly myocardial infarction, using machine learning techniques. It tackles the challenge of imbalanced datasets by conducting a comprehensive literature review to identify effective strategies. Seven machine learning and deep learning classifiers, including K-Nearest Neighbors, Support Vector Machine, Logistic Regression, Convolutional Neural Network, Gradient Boost, XGBoost, and Random Forest, were deployed to enhance the accuracy of heart disease predictions. The research explores different classifiers and their performance, providing valuable insights for developing robust prediction models for myocardial infarction. The study's outcomes emphasize the effectiveness of meticulously fine-tuning an XGBoost model for cardiovascular diseases. This optimization yields remarkable results: 98.50% accuracy, 99.14% precision, 98.29% recall, and a 98.71% F1 score. Such optimization significantly enhances the model's diagnostic accuracy for heart disease.
Collapse
|
5
|
Application of Machine Learning to Predict COVID-19 Spread via an Optimized BPSO Model. Biomimetics (Basel) 2023; 8:457. [PMID: 37887588 PMCID: PMC10604133 DOI: 10.3390/biomimetics8060457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 09/21/2023] [Accepted: 09/21/2023] [Indexed: 10/28/2023] Open
Abstract
During the pandemic of the coronavirus disease (COVID-19), statistics showed that the number of affected cases differed from one country to another and also from one city to another. Therefore, in this paper, we provide an enhanced model for predicting COVID-19 samples in different regions of Saudi Arabia (high-altitude and sea-level areas). The model is developed using several stages and was successfully trained and tested using two datasets that were collected from Taif city (high-altitude area) and Jeddah city (sea-level area) in Saudi Arabia. Binary particle swarm optimization (BPSO) is used in this study for making feature selections using three different machine learning models, i.e., the random forest model, gradient boosting model, and naive Bayes model. A number of predicting evaluation metrics including accuracy, training score, testing score, F-measure, recall, precision, and receiver operating characteristic (ROC) curve were calculated to verify the performance of the three machine learning models on these datasets. The experimental results demonstrated that the gradient boosting model gives better results than the random forest and naive Bayes models with an accuracy of 94.6% using the Taif city dataset. For the dataset of Jeddah city, the results demonstrated that the random forest model outperforms the gradient boosting and naive Bayes models with an accuracy of 95.5%. The dataset of Jeddah city achieved better results than the dataset of Taif city in Saudi Arabia using the enhanced model for the term of accuracy.
Collapse
|
6
|
A Kullback-Liebler divergence-based representation algorithm for malware detection. PeerJ Comput Sci 2023; 9:e1492. [PMID: 37810364 PMCID: PMC10557483 DOI: 10.7717/peerj-cs.1492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 06/27/2023] [Indexed: 10/10/2023]
Abstract
Background Malware, malicious software, is the major security concern of the digital realm. Conventional cyber-security solutions are challenged by sophisticated malicious behaviors. Currently, an overlap between malicious and legitimate behaviors causes more difficulties in characterizing those behaviors as malicious or legitimate activities. For instance, evasive malware often mimics legitimate behaviors, and evasion techniques are utilized by legitimate and malicious software. Problem Most of the existing solutions use the traditional term of frequency-inverse document frequency (TF-IDF) technique or its concept to represent malware behaviors. However, the traditional TF-IDF and the developed techniques represent the features, especially the shared ones, inaccurately because those techniques calculate a weight for each feature without considering its distribution in each class; instead, the generated weight is generated based on the distribution of the feature among all the documents. Such presumption can reduce the meaning of those features, and when those features are used to classify malware, they lead to a high false alarms. Method This study proposes a Kullback-Liebler Divergence-based Term Frequency-Probability Class Distribution (KLD-based TF-PCD) algorithm to represent the extracted features based on the differences between the probability distributions of the terms in malware and benign classes. Unlike the existing solution, the proposed algorithm increases the weights of the important features by using the Kullback-Liebler Divergence tool to measure the differences between their probability distributions in malware and benign classes. Results The experimental results show that the proposed KLD-based TF-PCD algorithm achieved an accuracy of 0.972, the false positive rate of 0.037, and the F-measure of 0.978. Such results were significant compared to the related work studies. Thus, the proposed KLD-based TF-PCD algorithm contributes to improving the security of cyberspace. Conclusion New meaningful characteristics have been added by the proposed algorithm to promote the learned knowledge of the classifiers, and thus increase their ability to classify malicious behaviors accurately.
Collapse
|
7
|
Parkinson's Disease Detection Using Filter Feature Selection and a Genetic Algorithm with Ensemble Learning. Diagnostics (Basel) 2023; 13:2816. [PMID: 37685354 PMCID: PMC10486479 DOI: 10.3390/diagnostics13172816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 08/17/2023] [Accepted: 08/25/2023] [Indexed: 09/10/2023] Open
Abstract
Parkinson's disease (PD) is a neurodegenerative disorder marked by motor and non-motor symptoms that have a severe impact on the quality of life of the affected individuals. This study explores the effect of filter feature selection, followed by ensemble learning methods and genetic selection, on the detection of PD patients from attributes extracted from voice clips from both PD patients and healthy patients. Two distinct datasets were employed in this study. Filter feature selection was carried out by eliminating quasi-constant features. Several classification models were then tested on the filtered data. Decision tree, random forest, and XGBoost classifiers produced remarkable results, especially on Dataset 1, where 100% accuracy was achieved by decision tree and random forest. Ensemble learning methods (voting, stacking, and bagging) were then applied to the best-performing models to see whether the results could be enhanced further. Additionally, genetic selection was applied to the filtered data and evaluated using several classification models for their accuracy and precision. It was found that in most cases, the predictions for PD patients showed more precision than those for healthy individuals. The overall performance was also better on Dataset 1 than on Dataset 2, which had a greater number of features.
Collapse
|
8
|
A convolutional neural network-based decision support system for neonatal quiet sleep detection. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:17018-17036. [PMID: 37920045 DOI: 10.3934/mbe.2023759] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/04/2023]
Abstract
Sleep plays an important role in neonatal brain and physical development, making its detection and characterization important for assessing early-stage development. In this study, we propose an automatic and computationally efficient algorithm to detect neonatal quiet sleep (QS) using a convolutional neural network (CNN). Our study used 38-hours of electroencephalography (EEG) recordings, collected from 19 neonates at Fudan Children's Hospital in Shanghai, China (Approval No. (2020) 22). To train and test the CNN, we extracted 12 prominent time and frequency domain features from 9 bipolar EEG channels. The CNN architecture comprised two convolutional layers with pooling and rectified linear unit (ReLU) activation. Additionally, a smoothing filter was applied to hold the sleep stage for 3 minutes. Through performance testing, our proposed method achieved impressive results, with 94.07% accuracy, 89.70% sensitivity, 94.40% specificity, 79.82% F1-score and a 0.74 kappa coefficient when compared to human expert annotations. A notable advantage of our approach is its computational efficiency, with the entire training and testing process requiring only 7.97 seconds. The proposed algorithm has been validated using leave one subject out (LOSO) validation, which demonstrates its consistent performance across a diverse range of neonates. Our findings highlight the potential of our algorithm for real-time neonatal sleep stage classification, offering a fast and cost-effective solution. This research opens avenues for further investigations in early-stage development monitoring and the assessment of neonatal health.
Collapse
|
9
|
Empowering Precision Medicine: Unlocking Revolutionary Insights through Blockchain-Enabled Federated Learning and Electronic Medical Records. SENSORS (BASEL, SWITZERLAND) 2023; 23:7476. [PMID: 37687931 PMCID: PMC10490801 DOI: 10.3390/s23177476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 08/10/2023] [Accepted: 08/10/2023] [Indexed: 09/10/2023]
Abstract
Precision medicine has emerged as a transformative approach to healthcare, aiming to deliver personalized treatments and therapies tailored to individual patients. However, the realization of precision medicine relies heavily on the availability of comprehensive and diverse medical data. In this context, blockchain-enabled federated learning, coupled with electronic medical records (EMRs), presents a groundbreaking solution to unlock revolutionary insights in precision medicine. This abstract explores the potential of blockchain technology to empower precision medicine by enabling secure and decentralized data sharing and analysis. By leveraging blockchain's immutability, transparency, and cryptographic protocols, federated learning can be conducted on distributed EMR datasets without compromising patient privacy. The integration of blockchain technology ensures data integrity, traceability, and consent management, thereby addressing critical concerns associated with data privacy and security. Through the federated learning paradigm, healthcare institutions and research organizations can collaboratively train machine learning models on locally stored EMR data, without the need for data centralization. The blockchain acts as a decentralized ledger, securely recording the training process and aggregating model updates while preserving data privacy at its source. This approach allows the discovery of patterns, correlations, and novel insights across a wide range of medical conditions and patient populations. By unlocking revolutionary insights through blockchain-enabled federated learning and EMRs, precision medicine can revolutionize healthcare delivery. This paradigm shift has the potential to improve diagnosis accuracy, optimize treatment plans, identify subpopulations for clinical trials, and expedite the development of novel therapies. Furthermore, the transparent and auditable nature of blockchain technology enhances trust among stakeholders, enabling greater collaboration, data sharing, and collective intelligence in the pursuit of advancing precision medicine. In conclusion, this abstract highlights the transformative potential of blockchain-enabled federated learning in empowering precision medicine. By unlocking revolutionary insights from diverse and distributed EMR datasets, this approach paves the way for a future where healthcare is personalized, efficient, and tailored to the unique needs of each patient.
Collapse
|
10
|
Securing Secrets in Cyber-Physical Systems: A Cutting-Edge Privacy Approach with Consortium Blockchain. SENSORS (BASEL, SWITZERLAND) 2023; 23:7162. [PMID: 37631699 PMCID: PMC10458996 DOI: 10.3390/s23167162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 07/28/2023] [Accepted: 07/29/2023] [Indexed: 08/27/2023]
Abstract
In the era of interconnected and intelligent cyber-physical systems, preserving privacy has become a paramount concern. This paper aims a groundbreaking proof-of-concept (PoC) design that leverages consortium blockchain technology to address privacy challenges in cyber-physical systems (CPSs). The proposed design introduces a novel approach to safeguarding sensitive information and ensuring data integrity while maintaining a high level of trust among stakeholders. By harnessing the power of consortium blockchain, the design establishes a decentralized and tamper-resistant framework for privacy preservation. However, ensuring the security and privacy of sensitive information within CPSs poses significant challenges. This paper proposes a cutting-edge privacy approach that leverages consortium blockchain technology to secure secrets in CPSs. Consortium blockchain, with its permissioned nature, provides a trusted framework for governing the network and validating transactions. By employing consortium blockchain, secrets in CPSs can be securely stored, shared, and accessed by authorized entities only, mitigating the risks of unauthorized access and data breaches. The proposed approach offers enhanced security, privacy preservation, increased trust and accountability, as well as interoperability and scalability. This paper aims to address the limitations of traditional security mechanisms in CPSs and harness the potential of consortium blockchain to revolutionize the management of secrets, contributing to the advancement of CPS security and privacy. The effectiveness of the design is demonstrated through extensive simulations and performance evaluations. The results indicate that the proposed approach offers significant advancements in privacy protection, paving the way for secure and trustworthy cyber-physical systems in various domains.
Collapse
|
11
|
Motion Capture Technologies for Ergonomics: A Systematic Literature Review. Diagnostics (Basel) 2023; 13:2593. [PMID: 37568956 PMCID: PMC10416907 DOI: 10.3390/diagnostics13152593] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 07/25/2023] [Accepted: 08/02/2023] [Indexed: 08/13/2023] Open
Abstract
Muscular skeletal disorder is a difficult challenge faced by the working population. Motion capture (MoCap) is used for recording the movement of people for clinical, ergonomic and rehabilitation solutions. However, knowledge barriers about these MoCap systems have made them difficult to use for many people. Despite this, no state-of-the-art literature review on MoCap systems for human clinical, rehabilitation and ergonomic analysis has been conducted. A medical diagnosis using AI applies machine learning algorithms and motion capture technologies to analyze patient data, enhancing diagnostic accuracy, enabling early disease detection and facilitating personalized treatment plans. It revolutionizes healthcare by harnessing the power of data-driven insights for improved patient outcomes and efficient clinical decision-making. The current review aimed to investigate: (i) the most used MoCap systems for clinical use, ergonomics and rehabilitation, (ii) their application and (iii) the target population. We used preferred reporting items for systematic reviews and meta-analysis guidelines for the review. Google Scholar, PubMed, Scopus and Web of Science were used to search for relevant published articles. The articles obtained were scrutinized by reading the abstracts and titles to determine their inclusion eligibility. Accordingly, articles with insufficient or irrelevant information were excluded from the screening. The search included studies published between 2013 and 2023 (including additional criteria). A total of 40 articles were eligible for review. The selected articles were further categorized in terms of the types of MoCap used, their application and the domain of the experiments. This review will serve as a guide for researchers and organizational management.
Collapse
|
12
|
Identification and characterization of RNA polymerase II (RNAP) C-Terminal domain phosphatase-like 3 (SlCPL3) in tomato under biotic stress. Mol Biol Rep 2023; 50:6783-6793. [PMID: 37392286 DOI: 10.1007/s11033-023-08564-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 05/31/2023] [Indexed: 07/03/2023]
Abstract
BACKGROUND Bacterial diseases are a huge threat to the production of tomatoes. During infection intervals, pathogens affect biochemical, oxidant and molecular properties of tomato. Therefore, it is necessary to study the antioxidant enzymes, oxidation state and genes involved during bacterial infection in tomato. METHODS AND RESULTS Different bioinformatic analyses were performed to conduct homology, gene promoter analysis and determined protein structure. Antioxidant, MDA and H2O2 response was measured in Falcon, Rio grande and Sazlica tomato cultivars. In this study, RNA Polymerase II (RNAP) C-Terminal Domain Phosphatase-like 3 (SlCPL-3) gene was identified and characterized. It contained 11 exons, and encoded for two protein domains i.e., CPDCs and BRCT. SOPMA and Phyre2, online bioinformatic tools were used to predict secondary structure. For the identification of protein pockets CASTp web-based tool was used. Netphos and Pondr was used for prediction of phosphorylation sites and protein disordered regions. Promoter analysis revealed that the SlCPL-3 is involved in defense-related mechanisms. We further amplified two different regions of SlCPL-3 and sequenced them. It showed homology respective to the reference tomato genome. Our results showed that SlCPL-3 gene was triggered during bacterial stress. SlCPL-3 expression was upregulated in response to bacterial stress during different time intervals. Rio grande showed a high level of SICPL-3 gene expression after 72 hpi. Biochemical and gene expression analysis showed that under biotic stress Rio grande cultivar is more sensitive to Pst DC 3000 bacteria. CONCLUSION This study lays a solid foundation for the functional characterization of SlCPL-3 gene in tomato cultivars. All these findings would be beneficial for further analysis of SlCPL-3 gene and may be helpful for the development of resilient tomato cultivars.
Collapse
|
13
|
An Adaptive Early Stopping Technique for DenseNet169-Based Knee Osteoarthritis Detection Model. Diagnostics (Basel) 2023; 13:diagnostics13111903. [PMID: 37296755 DOI: 10.3390/diagnostics13111903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 05/23/2023] [Accepted: 05/25/2023] [Indexed: 06/12/2023] Open
Abstract
Knee osteoarthritis (OA) detection is an important area of research in health informatics that aims to improve the accuracy of diagnosing this debilitating condition. In this paper, we investigate the ability of DenseNet169, a deep convolutional neural network architecture, for knee osteoarthritis detection using X-ray images. We focus on the use of the DenseNet169 architecture and propose an adaptive early stopping technique that utilizes gradual cross-entropy loss estimation. The proposed approach allows for the efficient selection of the optimal number of training epochs, thus preventing overfitting. To achieve the goal of this study, the adaptive early stopping mechanism that observes the validation accuracy as a threshold was designed. Then, the gradual cross-entropy (GCE) loss estimation technique was developed and integrated to the epoch training mechanism. Both adaptive early stopping and GCE were incorporated into the DenseNet169 for the OA detection model. The performance of the model was measured using several metrics including accuracy, precision, and recall. The obtained results were compared with those obtained from the existing works. The comparison shows that the proposed model outperformed the existing solutions in terms of accuracy, precision, recall, and loss performance, which indicates that the adaptive early stopping coupled with GCE improved the ability of DenseNet169 to accurately detect knee OA.
Collapse
|
14
|
Assessment of proline function in higher plants under extreme temperatures. PLANT BIOLOGY (STUTTGART, GERMANY) 2023; 25:379-395. [PMID: 36748909 DOI: 10.1111/plb.13510] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
Climate change and abiotic stress factors are key players in crop losses worldwide. Among which, extreme temperatures (heat and cold) disturb plant growth and development, reduce productivity and, in severe cases, lead to plant death. Plants have developed numerous strategies to mitigate the detrimental impact of temperature stress. Exposure to stress leads to the accumulation of various metabolites, e.g. sugars, sugar alcohols, organic acids and amino acids. Plants accumulate the amino acid 'proline' in response to several abiotic stresses, including temperature stress. Proline abundance may result from de novo synthesis, hydrolysis of proteins, reduced utilization or degradation. Proline also leads to stress tolerance by maintaining the osmotic balance (still controversial), cell turgidity and indirectly modulating metabolism of reactive oxygen species. Furthermore, the crosstalk of proline with other osmoprotectants and signalling molecules, e.g. glycine betaine, abscisic acid, nitric oxide, hydrogen sulfide, soluble sugars, helps to strengthen protective mechanisms in stressful environments. Development of less temperature-responsive cultivars can be achieved by manipulating the biosynthesis of proline through genetic engineering. This review presents an overview of plant responses to extreme temperatures and an outline of proline metabolism under such temperatures. The exogenous application of proline as a protective molecule under extreme temperatures is also presented. Proline crosstalk and interaction with other molecules is also discussed. Finally, the potential of genetic engineering of proline-related genes is explained to develop 'temperature-smart' plants. In short, exogenous application of proline and genetic engineering of proline genes promise ways forward for developing 'temperature-smart' future crop plants.
Collapse
|
15
|
Machine Learning in Metastatic Cancer Research: Potentials, Possibilities, and Prospects. Comput Struct Biotechnol J 2023; 21:2454-2470. [PMID: 37077177 PMCID: PMC10106342 DOI: 10.1016/j.csbj.2023.03.046] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 03/26/2023] [Accepted: 03/27/2023] [Indexed: 03/31/2023] Open
Abstract
Cancer has received extensive recognition for its high mortality rate, with metastatic cancer being the top cause of cancer-related deaths. Metastatic cancer involves the spread of the primary tumor to other body organs. As much as the early detection of cancer is essential, the timely detection of metastasis, the identification of biomarkers, and treatment choice are valuable for improving the quality of life for metastatic cancer patients. This study reviews the existing studies on classical machine learning (ML) and deep learning (DL) in metastatic cancer research. Since the majority of metastatic cancer research data are collected in the formats of PET/CT and MRI image data, deep learning techniques are heavily involved. However, its black-box nature and expensive computational cost are notable concerns. Furthermore, existing models could be overestimated for their generality due to the non-diverse population in clinical trial datasets. Therefore, research gaps are itemized; follow-up studies should be carried out on metastatic cancer using machine learning and deep learning tools with data in a symmetric manner.
Collapse
|
16
|
A possible link between air pollution and suicide? L'ENCEPHALE 2023; 49:94-95. [PMID: 34916076 DOI: 10.1016/j.encep.2021.08.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 07/06/2021] [Accepted: 08/20/2021] [Indexed: 01/21/2023]
|
17
|
Developing future heat-resilient vegetable crops. Funct Integr Genomics 2023; 23:47. [PMID: 36692535 PMCID: PMC9873721 DOI: 10.1007/s10142-023-00967-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 01/06/2023] [Accepted: 01/06/2023] [Indexed: 01/25/2023]
Abstract
Climate change seriously impacts global agriculture, with rising temperatures directly affecting the yield. Vegetables are an essential part of daily human consumption and thus have importance among all agricultural crops. The human population is increasing daily, so there is a need for alternative ways which can be helpful in maximizing the harvestable yield of vegetables. The increase in temperature directly affects the plants' biochemical and molecular processes; having a significant impact on quality and yield. Breeding for climate-resilient crops with good yields takes a long time and lots of breeding efforts. However, with the advent of new omics technologies, such as genomics, transcriptomics, proteomics, and metabolomics, the efficiency and efficacy of unearthing information on pathways associated with high-temperature stress resilience has improved in many of the vegetable crops. Besides omics, the use of genomics-assisted breeding and new breeding approaches such as gene editing and speed breeding allow creation of modern vegetable cultivars that are more resilient to high temperatures. Collectively, these approaches will shorten the time to create and release novel vegetable varieties to meet growing demands for productivity and quality. This review discusses the effects of heat stress on vegetables and highlights recent research with a focus on how omics and genome editing can produce temperature-resilient vegetables more efficiently and faster.
Collapse
|
18
|
A Novel Explainable CNN Model for Screening COVID-19 on X-ray Images. COMPUTER SYSTEMS SCIENCE AND ENGINEERING 2023; 46:1789-1809. [DOI: 10.32604/csse.2023.034022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 11/13/2022] [Indexed: 06/15/2023]
|
19
|
Abstract
Global climate changes cause extreme temperatures and a significant reduction in crop production, leading to food insecurity worldwide. Temperature extremes (including both heat and cold stresses) is one of the most limiting factors in plant growth and development and severely affect plant physiology, biochemical, and molecular processes. Biostimulants like melatonin (MET) have a multifunctional role that acts as a "defense molecule" to safeguard plants against the noxious effects of temperature stress. MET treatment improves plant growth and temperature tolerance by improving several defense mechanisms. Current research also suggests that MET interacts with other molecules, like phytohormones and gaseous molecules, which greatly supports plant adaptation to temperature stress. Genetic engineering via overexpression or CRISPR/Cas system of MET biosynthetic genes uplifts the MET levels in transgenic plants and enhances temperature stress tolerance. This review highlights the critical role of MET in plant production and tolerance against temperature stress. We have documented how MET interacts with other molecules to alleviate temperature stress. MET-mediated molecular breeding would be great potential in helping the adverse effects of temperature stress by creating transgenic plants.
Collapse
|
20
|
Enhanced Tooth Region Detection Using Pretrained Deep Learning Models. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:15414. [PMID: 36430133 PMCID: PMC9692549 DOI: 10.3390/ijerph192215414] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 11/16/2022] [Accepted: 11/17/2022] [Indexed: 06/15/2023]
Abstract
The rapid development of artificial intelligence (AI) has led to the emergence of many new technologies in the healthcare industry. In dentistry, the patient's panoramic radiographic or cone beam computed tomography (CBCT) images are used for implant placement planning to find the correct implant position and eliminate surgical risks. This study aims to develop a deep learning-based model that detects missing teeth's position on a dataset segmented from CBCT images. Five hundred CBCT images were included in this study. After preprocessing, the datasets were randomized and divided into 70% training, 20% validation, and 10% test data. A total of six pretrained convolutional neural network (CNN) models were used in this study, which includes AlexNet, VGG16, VGG19, ResNet50, DenseNet169, and MobileNetV3. In addition, the proposed models were tested with/without applying the segmentation technique. Regarding the normal teeth class, the performance of the proposed pretrained DL models in terms of precision was above 0.90. Moreover, the experimental results showed the superiority of DenseNet169 with a precision of 0.98. In addition, other models such as MobileNetV3, VGG19, ResNet50, VGG16, and AlexNet obtained a precision of 0.95, 0.94, 0.94, 0.93, and 0.92, respectively. The DenseNet169 model performed well at the different stages of CBCT-based detection and classification with a segmentation accuracy of 93.3% and classification of missing tooth regions with an accuracy of 89%. As a result, the use of this model may represent a promising time-saving tool serving dental implantologists with a significant step toward automated dental implant planning.
Collapse
|
21
|
Numerical Investigation of Photo-Generated Carrier Recombination Dynamics on the Device Characteristics for the Perovskite/Carbon Nitride Absorber-Layer Solar Cell. NANOMATERIALS (BASEL, SWITZERLAND) 2022; 12:4012. [PMID: 36432297 PMCID: PMC9699136 DOI: 10.3390/nano12224012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 11/02/2022] [Accepted: 11/09/2022] [Indexed: 06/16/2023]
Abstract
The nitrogenated holey two-dimensional carbon nitride (C2N) has been efficaciously utilized in the fabrication of transistors, sensors, and batteries in recent years, but lacks application in the photovoltaic industry. The C2N possesses favorable optoelectronic properties. To investigate its potential feasibility for solar cells (as either an absorber layer/interface layer), we foremost detailed the numerical modeling of the double-absorber-layer−methyl ammonium lead iodide (CH3NH3PbI3) −carbon nitride (C2N) layer solar cell and subsequently provided in-depth insight into the active-layer-associated recombination losses limiting the efficiency (η) of the solar cell. Under the recombination kinetics phenomena, we explored the influence of radiative recombination, Auger recombination, Shockley Read Hall recombination, the energy distribution of defects, Band Tail recombination (Hoping Model), Gaussian distribution, and metastable defect states, including single-donor (0/+), single-acceptor (−/0), double-donor (0/+/2+), double-acceptor (2/−/0−), and the interface-layer defects on the output characteristics of the solar cell. Setting the defect (or trap) density to 1015cm−3 with a uniform energy distribution of defects for all layers, we achieved an η of 24.16%. A considerable enhancement in power-conversion efficiency ( η~27%) was perceived as we reduced the trap density to 1014cm−3 for the absorber layers. Furthermore, it was observed that, for the absorber layer with double-donor defect states, the active layer should be carefully synthesized to reduce crystal-order defects to keep the total defect density as low as 1017cm−3 to achieve efficient device characteristics.
Collapse
|
22
|
Cecal duplication cyst mimicking intussusception in a female: A case report. Int J Surg Case Rep 2022; 100:107767. [DOI: 10.1016/j.ijscr.2022.107767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 10/20/2022] [Accepted: 10/21/2022] [Indexed: 11/06/2022] Open
|
23
|
Convolutional Neural Network Model Based on 2D Fingerprint for Bioactivity Prediction. Int J Mol Sci 2022; 23:13230. [PMID: 36362018 PMCID: PMC9657591 DOI: 10.3390/ijms232113230] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/22/2022] [Accepted: 10/27/2022] [Indexed: 10/15/2023] Open
Abstract
Determining and modeling the possible behaviour and actions of molecules requires investigating the basic structural features and physicochemical properties that determine their behaviour during chemical, physical, biological, and environmental processes. Computational approaches such as machine learning methods are alternatives to predicting the physiochemical properties of molecules based on their structures. However, the limited accuracy and high error rates of such predictions restrict their use. In this paper, a novel technique based on a deep learning convolutional neural network (CNN) for the prediction of chemical compounds' bioactivity is proposed and developed. The molecules are represented in the new matrix format Mol2mat, a molecular matrix representation adapted from the well-known 2D-fingerprint descriptors. To evaluate the performance of the proposed methods, a series of experiments were conducted using two standard datasets, namely the MDL Drug Data Report (MDDR) and Sutherland, datasets comprising 10 homogeneous and 14 heterogeneous activity classes. After analysing the eight fingerprints, all the probable combinations were investigated using the five best descriptors. The results showed that a combination of three fingerprints, ECFP4, EPFP4, and ECFC4, along with a CNN activity prediction process, achieved the highest performance of 98% AUC when compared to the state-of-the-art ML algorithms NaiveB, LSVM, and RBFN.
Collapse
|
24
|
Secondary Tumors of the Kidney: A Comprehensive Clinicopathologic Analysis. Adv Anat Pathol 2022; 29:241-251. [PMID: 35249993 DOI: 10.1097/pap.0000000000000338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Metastases to the kidney are rare and were historically described in autopsy series, and the incidence ranged between 2.36% and 12.6%. However, in the contemporary literature with the improvements in imaging modalities (computed tomography scan and magnetic resonance imaging) and other health care screening services, metastatic tumors to the kidney are being diagnosed more frequently in surgical specimens. The utility of needle core biopsies in the primary evaluation of renal masses has also increased the number of sampled metastases, and as a result, only limited histologic material is available for evaluation in some cases and may potentially lead to diagnostic pitfalls. In the last decade, a few large clinical series have been published. In these series, the majority of metastatic tumors to the kidney are carcinomas, with the lung being the most common primary site. A significant number of the various tumor types with metastasis to the kidney are also associated with widespread metastases to other organs, and the renal metastasis may present several years after diagnosis of the primary tumor. The majority of secondary tumors of the kidney are asymptomatic, incidentally discovered, and solitary. There should be a high index of suspicion of metastasis to the kidney in patients with an associated enlarging renal lesion with minimal to no enhancement on imaging and tumor progression of a known high-grade nonrenal malignancy. Secondary tumors of the kidney can be accurately diagnosed by correlating histopathologic features with clinical and radiographic findings and the judicious use of ancillary studies.
Collapse
|
25
|
Stock Analysis of Shrimp Scad (Alepes djedaba) Fishery from Northern Arabian Sea, Balochistan Coast, Pakistan. PAK J ZOOL 2022. [DOI: 10.17582/journal.pjz/20210329070315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
26
|
Analysis of the vulnerability estimation and neighbor value prediction in autonomous systems. Sci Rep 2022; 12:9457. [PMID: 35676409 PMCID: PMC9177568 DOI: 10.1038/s41598-022-13613-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 05/04/2022] [Indexed: 11/18/2022] Open
Abstract
The security within autonomous systems (AS)s is one of the important measures to keep network users safe and stable from the various type of Distributed Denial of Service (DDoS) attacks. Similar to the other existing attack types Internet control message protocol (ICMP) based attacks are remained open challenge on the Internet environment. In this study, we have proposed a method to estimate the vulnerability of 600 AS provider edge (PE) routers by sending ICMP packets and predicted AS neighbor values using least square regression (LSR) approach. The results of our study show that 265 AS PE routers are vulnerable due to ICMP flood attack from the 600 ASs which were analyzed. Additionally, we have predicted that about 60% of total AS neighbors will be reduced in the next 3 years. Our results indicate that some ASs still did not deploy the firewall system in the boundary of their networks. Similarly, we also observed that the majority of ASs which expected to have less neighbor values in the next 3 years is due to change their routing paths to find adjacent paths.
Collapse
|
27
|
Multi-constraints based deep learning model for automated segmentation and diagnosis of coronary artery disease in X-ray angiographic images. PeerJ Comput Sci 2022; 8:e993. [PMID: 35721418 PMCID: PMC9202622 DOI: 10.7717/peerj-cs.993] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 05/05/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND The detection of coronary artery disease (CAD) from the X-ray coronary angiography is a crucial process which is hindered by various issues such as presence of noise, insufficient contrast of the input images along with the uncertainties caused by the motion due to respiration and variation of angles of vessels. METHODS In this article, an Automated Segmentation and Diagnosis of Coronary Artery Disease (ASCARIS) model is proposed in order to overcome the prevailing challenges in detection of CAD from the X-ray images. Initially, the preprocessing of the input images was carried out by using the modified wiener filter for the removal of both internal and external noise pixels from the images. Then, the enhancement of contrast was carried out by utilizing the optimized maximum principal curvature to preserve the edge information thereby contributing to increasing the segmentation accuracy. Further, the binarization of enhanced images was executed by the means of OTSU thresholding. The segmentation of coronary arteries was performed by implementing the Attention-based Nested U-Net, in which the attention estimator was incorporated to overcome the difficulties caused by intersections and overlapped arteries. The increased segmentation accuracy was achieved by performing angle estimation. Finally, the VGG-16 based architecture was implemented to extract threefold features from the segmented image to perform classification of X-ray images into normal and abnormal classes. RESULTS The experimentation of the proposed ASCARIS model was carried out in the MATLAB R2020a simulation tool and the evaluation of the proposed model was compared with several existing approaches in terms of accuracy, sensitivity, specificity, revised contrast to noise ratio, mean square error, dice coefficient, Jaccard similarity, Hausdorff distance, Peak signal-to-noise ratio (PSNR), segmentation accuracy and ROC curve. DISCUSSION The results obtained conclude that the proposed model outperforms the existing approaches in all the evaluation metrics thereby achieving optimized classification of CAD. The proposed method removes the large number of background artifacts and obtains a better vascular structure.
Collapse
|
28
|
Moving Beyond DNA Sequence to Improve Plant Stress Responses. Front Genet 2022; 13:874648. [PMID: 35518351 PMCID: PMC9061961 DOI: 10.3389/fgene.2022.874648] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 03/31/2022] [Indexed: 01/25/2023] Open
Abstract
Plants offer a habitat for a range of interactions to occur among different stress factors. Epigenetics has become the most promising functional genomics tool, with huge potential for improving plant adaptation to biotic and abiotic stresses. Advances in plant molecular biology have dramatically changed our understanding of the molecular mechanisms that control these interactions, and plant epigenetics has attracted great interest in this context. Accumulating literature substantiates the crucial role of epigenetics in the diversity of plant responses that can be harnessed to accelerate the progress of crop improvement. However, harnessing epigenetics to its full potential will require a thorough understanding of the epigenetic modifications and assessing the functional relevance of these variants. The modern technologies of profiling and engineering plants at genome-wide scale provide new horizons to elucidate how epigenetic modifications occur in plants in response to stress conditions. This review summarizes recent progress on understanding the epigenetic regulation of plant stress responses, methods to detect genome-wide epigenetic modifications, and disentangling their contributions to plant phenotypes from other sources of variations. Key epigenetic mechanisms underlying stress memory are highlighted. Linking plant response with the patterns of epigenetic variations would help devise breeding strategies for improving crop performance under stressed scenarios.
Collapse
|
29
|
Hypersensitivity Reactions in Men with Prostate Cancer Undergoing Docetaxel Chemotherapy. Clin Oncol (R Coll Radiol) 2022. [DOI: 10.1016/j.clon.2021.11.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
30
|
Machine learning classifiers with pre-processing techniques for rumour detection on social media: an empirical study. INTERNATIONAL JOURNAL OF CLOUD COMPUTING 2022; 11:330. [DOI: 10.1504/ijcc.2022.10049539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
31
|
Ensembling of Efficient Deep Convolutional Networks and Machine Learning Algorithms for Resource Effective Detection of Tuberculosis Using Thoracic (Chest) Radiography. IEEE ACCESS 2022; 10:85442-85458. [DOI: 10.1109/access.2022.3194152] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
32
|
An Ensemble Learning Based Approach for Detecting and Tracking COVID19 Rumors. COMPUTERS, MATERIALS & CONTINUA 2022; 70:1721-1747. [DOI: 10.32604/cmc.2022.018972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Accepted: 05/07/2021] [Indexed: 06/15/2023]
|
33
|
Designing an Arabic Google Play Store User Review Dataset for Detecting App Requirement Issues. ADVANCES ON SMART AND SOFT COMPUTING 2022:133-143. [DOI: 10.1007/978-981-16-5559-3_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
34
|
Combining CNN and Grad-Cam for COVID-19 Disease Prediction and Visual Explanation. INTELLIGENT AUTOMATION & SOFT COMPUTING 2022; 32:723-745. [DOI: 10.32604/iasc.2022.022179] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 09/02/2021] [Indexed: 06/15/2023]
|
35
|
Enhancing Parkinson's Disease Prediction Using Machine Learning and Feature Selection Methods. COMPUTERS, MATERIALS & CONTINUA 2022; 71:5639-5658. [DOI: 10.32604/cmc.2022.023124] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 11/19/2021] [Indexed: 06/15/2023]
|
36
|
Arabic Sentiment Analysis of Users’ Opinions of Governmental Mobile Applications. COMPUTERS, MATERIALS & CONTINUA 2022; 72:4675-4689. [DOI: 10.32604/cmc.2022.027311] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 03/08/2022] [Indexed: 06/15/2023]
|
37
|
Machine learning classifiers with pre-processing techniques for rumour detection on social media: an empirical study. INTERNATIONAL JOURNAL OF CLOUD COMPUTING 2022; 11:330. [DOI: 10.1504/ijcc.2022.124797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
38
|
Hybrid Filter and Wrapper Feature Selection Method for Enhancing Detection Process of Phishing Websites. ADVANCES ON SMART AND SOFT COMPUTING 2022:419-426. [DOI: 10.1007/978-981-16-5559-3_34] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
39
|
COCP: Coupling Parameters Content Placement Strategy for In-Network Caching-Based Content-Centric Networking. COMPUTERS, MATERIALS & CONTINUA 2022; 71:5523-5543. [DOI: 10.32604/cmc.2022.020587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 07/30/2021] [Indexed: 09/02/2023]
|
40
|
Phishing Websites Detection by Using Optimized Stacking Ensemble Model. COMPUTER SYSTEMS SCIENCE AND ENGINEERING 2022; 41:109-125. [DOI: 10.32604/csse.2022.020414] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Accepted: 06/23/2021] [Indexed: 06/15/2023]
|
41
|
An Aggregated Mutual Information Based Feature Selection with Machine Learning Methods for Enhancing IoT Botnet Attack Detection. SENSORS (BASEL, SWITZERLAND) 2021; 22:185. [PMID: 35009725 PMCID: PMC8749651 DOI: 10.3390/s22010185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Revised: 12/23/2021] [Accepted: 12/24/2021] [Indexed: 06/14/2023]
Abstract
Due to the wide availability and usage of connected devices in Internet of Things (IoT) networks, the number of attacks on these networks is continually increasing. A particularly serious and dangerous type of attack in the IoT environment is the botnet attack, where the attackers can control the IoT systems to generate enormous networks of "bot" devices for generating malicious activities. To detect this type of attack, several Intrusion Detection Systems (IDSs) have been proposed for IoT networks based on machine learning and deep learning methods. As the main characteristics of IoT systems include their limited battery power and processor capacity, maximizing the efficiency of intrusion detection systems for IoT networks is still a research challenge. It is important to provide efficient and effective methods that use lower computational time and have high detection rates. This paper proposes an aggregated mutual information-based feature selection approach with machine learning methods to enhance detection of IoT botnet attacks. In this study, the N-BaIoT benchmark dataset was used to detect botnet attack types using real traffic data gathered from nine commercial IoT devices. The dataset includes binary and multi-class classifications. The feature selection method incorporates Mutual Information (MI) technique, Principal Component Analysis (PCA) and ANOVA f-test at finely-granulated detection level to select the relevant features for improving the performance of IoT Botnet classifiers. In the classification step, several ensemble and individual classifiers were used, including Random Forest (RF), XGBoost (XGB), Gaussian Naïve Bayes (GNB), k-Nearest Neighbor (k-NN), Logistic Regression (LR) and Support Vector Machine (SVM). The experimental results showed the efficiency and effectiveness of the proposed approach, which outperformed other techniques using various evaluation metrics.
Collapse
|
42
|
A Novel Feature-Engineered-NGBoost Machine-Learning Framework for Fraud Detection in Electric Power Consumption Data. SENSORS 2021; 21:s21248423. [PMID: 34960516 PMCID: PMC8704372 DOI: 10.3390/s21248423] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 12/10/2021] [Accepted: 12/13/2021] [Indexed: 11/19/2022]
Abstract
This study presents a novel feature-engineered–natural gradient descent ensemble-boosting (NGBoost) machine-learning framework for detecting fraud in power consumption data. The proposed framework was sequentially executed in three stages: data pre-processing, feature engineering, and model evaluation. It utilized the random forest algorithm-based imputation technique initially to impute the missing data entries in the acquired smart meter dataset. In the second phase, the majority weighted minority oversampling technique (MWMOTE) algorithm was used to avoid an unequal distribution of data samples among different classes. The time-series feature-extraction library and whale optimization algorithm were utilized to extract and select the most relevant features from the kWh reading of consumers. Once the most relevant features were acquired, the model training and testing process was initiated by using the NGBoost algorithm to classify the consumers into two distinct categories (“Healthy” and “Theft”). Finally, each input feature’s impact (positive or negative) in predicting the target variable was recognized with the tree SHAP additive-explanations algorithm. The proposed framework achieved an accuracy of 93%, recall of 91%, and precision of 95%, which was greater than all the competing models, and thus validated its efficacy and significance in the studied field of research.
Collapse
|
43
|
A robust approach for industrial small-object detection using an improved faster regional convolutional neural network. Sci Rep 2021; 11:23390. [PMID: 34862417 PMCID: PMC8642523 DOI: 10.1038/s41598-021-02805-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Accepted: 11/09/2021] [Indexed: 11/09/2022] Open
Abstract
With the increasing pace in the industrial sector, the need for a smart environment is also increasing and the production of industrial products in terms of quality always matters. There is a strong burden on the industrial environment to continue to reduce impulsive downtime, concert deprivation, and safety risks, which needs an efficient solution to detect and improve potential obligations as soon as possible. The systems working in industrial environments for generating industrial products are very fast and generate products rapidly, sometimes leading to faulty products. Therefore, this problem needs to be solved efficiently. Considering this problem in terms of faulty small-object detection, this study proposed an improved faster regional convolutional neural network-based model to detect the faults in the product images. We introduced a novel data-augmentation method along with a bi-cubic interpolation-based feature amplification method. A center loss is also introduced in the loss function to decrease the inter-class similarity issue. The experimental results show that the proposed improved model achieved better classification accuracy for detecting our small faulty objects. The proposed model performs better than the state-of-the-art methods.
Collapse
|
44
|
Noninvasive Detection of Respiratory Disorder Due to COVID-19 at the Early Stages in Saudi Arabia. ELECTRONICS 2021; 10:2701. [DOI: 10.3390/electronics10212701] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The Kingdom of Saudi Arabia has suffered from COVID-19 disease as part of the global pandemic due to severe acute respiratory syndrome coronavirus 2. The economy of Saudi Arabia also suffered a heavy impact. Several measures were taken to help mitigate its impact and stimulate the economy. In this context, we present a safe and secure WiFi-sensing-based COVID-19 monitoring system exploiting commercially available low-cost wireless devices that can be deployed in different indoor settings within Saudi Arabia. We extracted different activities of daily living and respiratory rates from ubiquitous WiFi signals in terms of channel state information (CSI) and secured them from unauthorized access through permutation and diffusion with multiple substitution boxes using chaos theory. The experiments were performed on healthy participants. We used the variances of the amplitude information of the CSI data and evaluated their security using several security parameters such as the correlation coefficient, mean-squared error (MSE), peak-signal-to-noise ratio (PSNR), entropy, number of pixel change rate (NPCR), and unified average change intensity (UACI). These security metrics, for example, lower correlation and higher entropy, indicate stronger security of the proposed encryption method. Moreover, the NPCR and UACI values were higher than 99% and 30, respectively, which also confirmed the security strength of the encrypted information.
Collapse
|
45
|
A new affinity matrix weighted k-nearest neighbors graph to improve spectral clustering accuracy. PeerJ Comput Sci 2021; 7:e692. [PMID: 34604521 PMCID: PMC8444071 DOI: 10.7717/peerj-cs.692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 08/04/2021] [Indexed: 06/13/2023]
Abstract
Researchers have thought about clustering approaches that incorporate traditional clustering methods and deep learning techniques. These approaches normally boost the performance of clustering. Getting knowledge from large data-sets is quite an interesting task. In this case, we use some dimensionality reduction and clustering techniques. Spectral clustering is gaining popularity recently because of its performance. Lately, numerous techniques have been introduced to boost spectral clustering performance. One of the most significant part of these techniques is to construct a similarity graph. We introduced weighted k-nearest neighbors technique for the construction of similarity graph. Using this new metric for the construction of affinity matrix, we achieved good results as we tested it both on real and artificial data-sets.
Collapse
|
46
|
Skene gland adenocarcinoma: Clinicopathologic features, comprehensive biomarker analysis, and review of the literature. Pathol Int 2021; 71:712-714. [PMID: 34215022 DOI: 10.1111/pin.13145] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2021] [Accepted: 06/15/2021] [Indexed: 11/29/2022]
|
47
|
A Deep Learning Artificial Neural Network Algorithm for Instance-based Arabic Language Authorship Attribution. ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS 2021; 13. [DOI: 10.1142/s2424922x21430026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
One of the common examples of cybercrime are identity theft and violating of intellectual property that commonly occur in social media. Authorship attribution (AA) techniques are used to extract and use several features of the text in order to identify the original author. These features are used to differentiate the writing style of one author from others. Several machine learning methods have been used to identify the AA using different languages. Few studies were conducted for Arabic AA. This paper aims to investigate the performance of deep learning-based artificial neural network (ANN) for identifying the attribution of authors using Arabic text. The applied model helps protect users in social media from identity theft and violating of their intellectual property. The experiments of this study used a dataset that includes 4,686 Arabic texts for 15 different authors. The performance of the deep learning method was compared with several machine learning methods. The experimental results showed the superior performance of deep learning for AA in Arabic language using different evaluation criteria such as F-score, accuracy, precision, and recall measures.
Collapse
|
48
|
Association of Body Mass Index and Waist Circumference With Dental Caries and Consequences of Untreated Dental Caries Among 12- to 14-Year-old Boys: A Cross-Sectional Study. Int Dent J 2021; 71:522-529. [PMID: 33622545 PMCID: PMC9275107 DOI: 10.1016/j.identj.2021.01.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
OBJECTIVE To evaluate the association between body mass index (BMI) and waist circumference (WC) and dental caries (DC), and the clinical consequences of untreated dental caries (U-DC) among 12- to 14-year-old male public-school adolescents in the Northern Province, Saudi Arabia. METHODS The demographic and anthropometric measurements of 302 boys 12 to 14 years of age (mean: 12.5 years) were recorded. BMI and central obesity (based on WC) were measured. The decayed-missing-filled teeth (DMFT) index was used to record DC. The pulp involvement, ulceration, fistula, abscess (PUFA) index was used to quantify the clinical consequences of U-DC. Multiple logistic regression analysis was performed to evaluate the risk factors related to DC and clinical consequences of U-DC. RESULTS A high prevalence of DC was found in adolescents who were underweight according to BMI and nonobese based on WC (46.7% vs 34.5%). The association between underweight (BMI) and obese (WC) with DC (odds ratio [95% CI]) was 1.91 (0.87, 4.18) and 0.34 (0.18, 0.63), respectively, while with PUFA (adjusted odds ratio [AOR]; 95%CI), it was 1.76 (0.76, 4.09) and 0.19 (0.06, 0.63) respectively. The logistic regression model showed that consuming sugar more than once a day led to a 2.87-fold greater likelihood of DC (AOR [95% CI] = 2.87 [1.68, 4.88]) and a 3.91-fold greater likelihood of mean PUFA score (AOR [95% CI] = 3.91 [2.05, 7.44]. CONCLUSION High risks for DC and clinical consequences of U-DC were observed among underweight and nonobese adolescent males. The frequency of sugar consumption was significantly associated with both conditions.
Collapse
|
49
|
Tuning Hyper-Parameters of Machine Learning Methods for Improving the Detection of Hate Speech. ADVANCES ON SMART AND SOFT COMPUTING 2021:71-78. [DOI: 10.1007/978-981-15-6048-4_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
50
|
Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks. Molecules 2020; 26:E128. [PMID: 33383976 PMCID: PMC7795308 DOI: 10.3390/molecules26010128] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 12/24/2020] [Accepted: 12/25/2020] [Indexed: 11/24/2022] Open
Abstract
Virtual screening (VS) is a computational practice applied in drug discovery research. VS is popularly applied in a computer-based search for new lead molecules based on molecular similarity searching. In chemical databases similarity searching is used to identify molecules that have similarities to a user-defined reference structure and is evaluated by quantitative measures of intermolecular structural similarity. Among existing approaches, 2D fingerprints are widely used. The similarity of a reference structure and a database structure is measured by the computation of association coefficients. In most classical similarity approaches, it is assumed that the molecular features in both biological and non-biologically-related activity carry the same weight. However, based on the chemical structure, it has been found that some distinguishable features are more important than others. Hence, this difference should be taken consideration by placing more weight on each important fragment. The main aim of this research is to enhance the performance of similarity searching by using multiple descriptors. In this paper, a deep learning method known as deep belief networks (DBN) has been used to reweight the molecule features. Several descriptors have been used for the MDL Drug Data Report (MDDR) dataset each of which represents different important features. The proposed method has been implemented with each descriptor individually to select the important features based on a new weight, with a lower error rate, and merging together all new features from all descriptors to produce a new descriptor for similarity searching. Based on the extensive experiments conducted, the results show that the proposed method outperformed several existing benchmark similarity methods, including Bayesian inference networks (BIN), the Tanimoto similarity method (TAN), adapted similarity measure of text processing (ASMTP) and the quantum-based similarity method (SQB). The results of this proposed multi-descriptor-based on Stack of deep belief networks method (SDBN) demonstrated a higher accuracy compared to existing methods on structurally heterogeneous datasets.
Collapse
|