Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dias Canedo E, Cordeiro Mendes B. Software Requirements Classification Using Machine Learning Algorithms. Entropy (Basel) 2020;22:E1057. [PMID: 33286826 PMCID: PMC7597130 DOI: 10.3390/e22091057] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 09/02/2020] [Accepted: 09/03/2020] [Indexed: 11/16/2022]

For:	Dias Canedo E, Cordeiro Mendes B. Software Requirements Classification Using Machine Learning Algorithms. Entropy (Basel) 2020;22:E1057. [PMID: 33286826 PMCID: PMC7597130 DOI: 10.3390/e22091057] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 09/02/2020] [Accepted: 09/03/2020] [Indexed: 11/16/2022]

Number

Cited by Other Article(s)

Saleem S, Asim MN, Van Elst L, Junker M, Dengel A. MLR-predictor: a versatile and efficient computational framework for multi-label requirements classification. Front Artif Intell 2024;7:1481581. [PMID: 39664103 PMCID: PMC11632133 DOI: 10.3389/frai.2024.1481581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Accepted: 11/05/2024] [Indexed: 12/13/2024] Open

Abstract

Introduction

Requirements classification is an essential task for development of a successful software by incorporating all relevant aspects of users' needs. Additionally, it aids in the identification of project failure risks and facilitates to achieve project milestones in more comprehensive way. Several machine learning predictors are developed for binary or multi-class requirements classification. However, a few predictors are designed for multi-label classification and they are not practically useful due to less predictive performance.

Method

MLR-Predictor makes use of innovative OkapiBM25 model to transforms requirements text into statistical vectors by computing words informative patterns. Moreover, predictor transforms multi-label requirements classification data into multi-class classification problem and utilize logistic regression classifier for categorization of requirements. The performance of the proposed predictor is evaluated and compared with 123 machine learning and 9 deep learning-based predictive pipelines across three public benchmark requirements classification datasets using eight different evaluation measures.

Results

The large-scale experimental results demonstrate that proposed MLR-Predictor outperforms 123 adopted machine learning and 9 deep learning predictive pipelines, as well as the state-of-the-art requirements classification predictor. Specifically, in comparison to state-of-the-art predictor, it achieves a 13% improvement in macro F1-measure on the PROMISE dataset, a 1% improvement on the EHR-binary dataset, and a 2.5% improvement on the EHR-multiclass dataset.

Discussion

As a case study, the generalizability of proposed predictor is evaluated on softwares customer reviews classification data. In this context, the proposed predictor outperformed the state-of-the-art BERT language model by F-1 score of 1.4%. These findings underscore the robustness and effectiveness of the proposed MLR-Predictor in various contexts, establishing its utility as a promising solution for requirements classification task.

Collapse

Bagies T. Classifying software security requirements into confidentiality, integrity, and availability using machine learning approaches. PeerJ Comput Sci 2024;10:e2554. [PMID: 39650452 PMCID: PMC11623117 DOI: 10.7717/peerj-cs.2554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Accepted: 11/05/2024] [Indexed: 12/11/2024]

Al-Fraihat D, Sharrab Y, Al-Ghuwairi AR, Sbaih N, Qahmash A. Detecting refactoring type of software commit messages based on ensemble machine learning algorithms. Sci Rep 2024;14:21367. [PMID: 39266651 PMCID: PMC11392950 DOI: 10.1038/s41598-024-72307-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 09/05/2024] [Indexed: 09/14/2024] Open

Laison EKE, Hamza Ibrahim M, Boligarla S, Li J, Mahadevan R, Ng A, Muthuramalingam V, Lee WY, Yin Y, Nasri BR. Identifying Potential Lyme Disease Cases Using Self-Reported Worldwide Tweets: Deep Learning Modeling Approach Enhanced With Sentimental Words Through Emojis. J Med Internet Res 2023;25:e47014. [PMID: 37843893 PMCID: PMC10616745 DOI: 10.2196/47014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/26/2023] [Accepted: 08/31/2023] [Indexed: 10/17/2023] Open

Abstract

BACKGROUND

Lyme disease is among the most reported tick-borne diseases worldwide, making it a major ongoing public health concern. An effective Lyme disease case reporting system depends on timely diagnosis and reporting by health care professionals, and accurate laboratory testing and interpretation for clinical diagnosis validation. A lack of these can lead to delayed diagnosis and treatment, which can exacerbate the severity of Lyme disease symptoms. Therefore, there is a need to improve the monitoring of Lyme disease by using other data sources, such as web-based data.

OBJECTIVE

We analyzed global Twitter data to understand its potential and limitations as a tool for Lyme disease surveillance. We propose a transformer-based classification system to identify potential Lyme disease cases using self-reported tweets.

METHODS

Our initial sample included 20,000 tweets collected worldwide from a database of over 1.3 million Lyme disease tweets. After preprocessing and geolocating tweets, tweets in a subset of the initial sample were manually labeled as potential Lyme disease cases or non-Lyme disease cases using carefully selected keywords. Emojis were converted to sentiment words, which were then replaced in the tweets. This labeled tweet set was used for the training, validation, and performance testing of DistilBERT (distilled version of BERT [Bidirectional Encoder Representations from Transformers]), ALBERT (A Lite BERT), and BERTweet (BERT for English Tweets) classifiers.

RESULTS

The empirical results showed that BERTweet was the best classifier among all evaluated models (average F1-score of 89.3%, classification accuracy of 90.0%, and precision of 97.1%). However, for recall, term frequency-inverse document frequency and k-nearest neighbors performed better (93.2% and 82.6%, respectively). On using emojis to enrich the tweet embeddings, BERTweet had an increased recall (8% increase), DistilBERT had an increased F1-score of 93.8% (4% increase) and classification accuracy of 94.1% (4% increase), and ALBERT had an increased F1-score of 93.1% (5% increase) and classification accuracy of 93.9% (5% increase). The general awareness of Lyme disease was high in the United States, the United Kingdom, Australia, and Canada, with self-reported potential cases of Lyme disease from these countries accounting for around 50% (9939/20,000) of the collected English-language tweets, whereas Lyme disease-related tweets were rare in countries from Africa and Asia. The most reported Lyme disease-related symptoms in the data were rash, fatigue, fever, and arthritis, while symptoms, such as lymphadenopathy, palpitations, swollen lymph nodes, neck stiffness, and arrythmia, were uncommon, in accordance with Lyme disease symptom frequency.

CONCLUSIONS

The study highlights the robustness of BERTweet and DistilBERT as classifiers for potential cases of Lyme disease from self-reported data. The results demonstrated that emojis are effective for enrichment, thereby improving the accuracy of tweet embeddings and the performance of classifiers. Specifically, emojis reflecting sadness, empathy, and encouragement can reduce false negatives.

Collapse

A machine learning approach for hierarchical classification of software requirements. MACHINE LEARNING WITH APPLICATIONS 2023. [DOI: 10.1016/j.mlwa.2023.100457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2023] Open

Huang X, Hu Y. Recognition of Continuous Music Segments Based on the Phase Space Reconstruction Method. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:4099505. [PMID: 36238675 PMCID: PMC9553418 DOI: 10.1155/2022/4099505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 12/15/2021] [Indexed: 11/24/2022]

Abstract

Piano score recognition is one of the important research contents in the field of music information retrieval, and it plays an important role in information processing. In order to reduce the influence of vocals on the progress of piano notes and restore the harmonic information corresponding to piano notes, the article models the harmonic information and vocal information corresponding to piano notes in the frequency spectrum. We use the phase space reconstruction method to extract the nonlinear feature parameters in the note audio and use some of the parameters as the training set to construct the support vector machine (SVM) classifier and the other part as the test set to test the recognition effect. Therefore, the method of adaptive signal decomposition and SVM is introduced into the signal preprocessing link, and the corresponding recognition process is established. In order to improve the performance of the support vector machine, the article uses measurement learning method to obtain the measurement learning and uses the measurement learning to replace the Euclidean distance of the Gaussian kernel function of the support vector machine. The SVM method of adaptive signal decomposition and the SVM method of principal component analysis are introduced into the preprocessing process of the note signal, and then the preprocessed signal is reconstructed in phase space, and the corresponding recognition process is established. The method of directly reconstructing the phase space of the original signal has higher accuracy and can be applied to the note recognition of continuous music segments. The final experimental results show that, compared with the current popular piano score recognition algorithm, the recognition accuracy of the proposed piano score recognition algorithm is improved by 3.5% to 12.2%.

Collapse

Research on Product Core Component Acquisition Based on Patent Semantic Network. ENTROPY 2022;24:e24040549. [PMID: 35455212 PMCID: PMC9026476 DOI: 10.3390/e24040549] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Revised: 04/06/2022] [Accepted: 04/07/2022] [Indexed: 02/01/2023]

Khurshid I, Imtiaz S, Boulila W, Khan Z, Abbasi A, Javed AR, Jalil Z. Classification of Non-Functional Requirements From IoT Oriented Healthcare Requirement Document. Front Public Health 2022;10:860536. [PMID: 35372217 PMCID: PMC8974737 DOI: 10.3389/fpubh.2022.860536] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Accepted: 02/07/2022] [Indexed: 01/03/2023] Open

Abstract

Internet of Things (IoT) involves a set of devices that aids in achieving a smart environment. Healthcare systems, which are IoT-oriented, provide monitoring services of patients' data and help take immediate steps in an emergency. Currently, machine learning-based techniques are adopted to ensure security and other non-functional requirements in smart health care systems. However, no attention is given to classifying the non-functional requirements from requirement documents. The manual process of classifying the non-functional requirements from documents is erroneous and laborious. Missing non-functional requirements in the Requirement Engineering (RE) phase results in IoT oriented healthcare system with compromised security and performance. In this research, an experiment is performed where non-functional requirements are classified from the IoT-oriented healthcare system's requirement document. The machine learning algorithms considered for classification are Logistic Regression (LR), Support Vector Machine (SVM), Multinomial Naive Bayes (MNB), K-Nearest Neighbors (KNN), ensemble, Random Forest (RF), and hybrid KNN rule-based machine learning (ML) algorithms. The results show that our novel hybrid KNN rule-based machine learning algorithm outperforms others by showing an average classification accuracy of 75.9% in classifying non-functional requirements from IoT-oriented healthcare requirement documents. This research is not only novel in its concept of using a machine learning approach for classification of non-functional requirements from IoT-oriented healthcare system requirement documents, but it also proposes a novel hybrid KNN-rule based machine learning algorithm for classification with better accuracy. A new dataset is also created for classification purposes, comprising requirements related to IoT-oriented healthcare systems. However, since this dataset is small and consists of only 104 requirements, this might affect the generalizability of the results of this research.

Collapse

Peketi V, Satti S. ARCORE: A Requirements Dataset for Service Identification. BIG DATA ANALYTICS 2022. [DOI: 10.1007/978-3-031-24094-2_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

One- and Two-Phase Software Requirement Classification Using Ensemble Deep Learning. ENTROPY 2021;23:e23101264. [PMID: 34681988 PMCID: PMC8535052 DOI: 10.3390/e23101264] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 09/27/2021] [Accepted: 09/27/2021] [Indexed: 12/11/2022]

Enhancing Software Feature Extraction Results Using Sentiment Analysis to Aid Requirements Reuse. COMPUTERS 2021. [DOI: 10.3390/computers10030036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Dhindsa A, Bhatia S, Agrawal S, Sohi BS. An Improvised Machine Learning Model Based on Mutual Information Feature Selection Approach for Microbes Classification. ENTROPY (BASEL, SWITZERLAND) 2021;23:257. [PMID: 33672252 PMCID: PMC7927045 DOI: 10.3390/e23020257] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 02/10/2021] [Accepted: 02/20/2021] [Indexed: 12/11/2022]

Assi K. Traffic Crash Severity Prediction-A Synergy by Hybrid Principal Component Analysis and Machine Learning Models. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:E7598. [PMID: 33086567 PMCID: PMC7589286 DOI: 10.3390/ijerph17207598] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 10/14/2020] [Accepted: 10/17/2020] [Indexed: 12/24/2022]