1
|
Wang Y, Hou Z, Shen L, Wu T, Wang J, Huang H, Zhang H, Zhang D. Towards Natural Language-Based Visualization Authoring. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2023; 29:1222-1232. [PMID: 36197854 DOI: 10.1109/tvcg.2022.3209357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
A key challenge to visualization authoring is the process of getting familiar with the complex user interfaces of authoring tools. Natural Language Interface (NLI) presents promising benefits due to its learnability and usability. However, supporting NLIs for authoring tools requires expertise in natural language processing, while existing NLIs are mostly designed for visual analytic workflow. In this paper, we propose an authoring-oriented NLI pipeline by introducing a structured representation of users' visualization editing intents, called editing actions, based on a formative study and an extensive survey on visualization construction tools. The editing actions are executable, and thus decouple natural language interpretation and visualization applications as an intermediate layer. We implement a deep learning-based NL interpreter to translate NL utterances into editing actions. The interpreter is reusable and extensible across authoring tools. The authoring tools only need to map the editing actions into tool-specific operations. To illustrate the usages of the NL interpreter, we implement an Excel chart editor and a proof-of-concept authoring tool, VisTalk. We conduct a user study with VisTalk to understand the usage patterns of NL-based authoring systems. Finally, we discuss observations on how users author charts with natural language, as well as implications for future research.
Collapse
|
2
|
Towards lifelong human assisted speaker diarization. COMPUT SPEECH LANG 2022. [DOI: 10.1016/j.csl.2022.101437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
3
|
Active Correction for Incremental Speaker Diarization of a Collection with Human in the Loop. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12041782] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
State of the art diarization systems now achieve decent performance but those performances are often not good enough to deploy them without any human supervision. Additionally, most approaches focus on single audio files while many use cases involving multiple recordings with recurrent speakers require the incremental processing of a collection. In this paper, we propose a framework that solicits a human in the loop to correct the clustering by answering simple questions. After defining the nature of the questions for both single file and collection of files, we propose two algorithms to list those questions and associated stopping criteria that are necessary to limit the work load on the human in the loop. Experiments performed on the ALLIES dataset show that a limited interaction with a human expert can lead to considerable improvement of up to 36.5% relative diarization error rate (DER) for single files and 33.29% for a collection.
Collapse
|
4
|
Visual Active Learning for Labeling: A Case for Soundscape Ecology Data. INFORMATION 2021. [DOI: 10.3390/info12070265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Labeling of samples is a recurrent and time-consuming task in data analysis and machine learning and yet generally overlooked in terms of visual analytics approaches to improve the process. As the number of tailored applications of learning models increases, it is crucial that more effective approaches to labeling are developed. In this paper, we report the development of a methodology and a framework to support labeling, with an application case as background. The methodology performs visual active learning and label propagation with 2D embeddings as layouts to achieve faster and interactive labeling of samples. The framework is realized through SoundscapeX, a tool to support labeling in soundscape ecology data. We have applied the framework to a set of audio recordings collected for a Long Term Ecological Research Project in the Cantareira-Mantiqueira Corridor (LTER CCM), localized in the transition between northeastern São Paulo state and southern Minas Gerais state in Brazil. We employed a pre-label data set of groups of animals to test the efficacy of the approach. The results showed the best accuracy at 94.58% in the prediction of labeling for birds and insects; and 91.09% for the prediction of the sound event as frogs and insects.
Collapse
|
5
|
Choi Y, Park S, Lee S. Identifying emerging technologies to envision a future innovation ecosystem: A machine learning approach to patent data. Scientometrics 2021. [DOI: 10.1007/s11192-021-04001-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
6
|
De Angeli K, Gao S, Alawad M, Yoon HJ, Schaefferkoetter N, Wu XC, Durbin EB, Doherty J, Stroup A, Coyle L, Penberthy L, Tourassi G. Deep active learning for classifying cancer pathology reports. BMC Bioinformatics 2021; 22:113. [PMID: 33750288 PMCID: PMC7941989 DOI: 10.1186/s12859-021-04047-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 02/23/2021] [Indexed: 11/18/2022] Open
Abstract
Background Automated text classification has many important applications in the clinical setting; however, obtaining labelled data for training machine learning and deep learning models is often difficult and expensive. Active learning techniques may mitigate this challenge by reducing the amount of labelled data required to effectively train a model. In this study, we analyze the effectiveness of 11 active learning algorithms on classifying subsite and histology from cancer pathology reports using a Convolutional Neural Network as the text classification model. Results We compare the performance of each active learning strategy using two differently sized datasets and two different classification tasks. Our results show that on all tasks and dataset sizes, all active learning strategies except diversity-sampling strategies outperformed random sampling, i.e., no active learning. On our large dataset (15K initial labelled samples, adding 15K additional labelled samples each iteration of active learning), there was no clear winner between the different active learning strategies. On our small dataset (1K initial labelled samples, adding 1K additional labelled samples each iteration of active learning), marginal and ratio uncertainty sampling performed better than all other active learning techniques. We found that compared to random sampling, active learning strongly helps performance on rare classes by focusing on underrepresented classes. Conclusions Active learning can save annotation cost by helping human annotators efficiently and intelligently select which samples to label. Our results show that a dataset constructed using effective active learning techniques requires less than half the amount of labelled data to achieve the same performance as a dataset constructed using random sampling. Supplementary Information The online version supplementary material available at 10.1186/s12859-021-04047-1.
Collapse
Affiliation(s)
- Kevin De Angeli
- Oak Ridge National Lab, Oak Ridge, TN, USA.,The Bredesen Center, The University of Tennessee, Knoxville, TN, US
| | - Shang Gao
- Oak Ridge National Lab, Oak Ridge, TN, USA.
| | | | | | | | - Xiao-Cheng Wu
- Louisiana Tumor Registry, Louisiana State University Health Sciences Center, School of Public Health, New Orleans, LA, USA
| | - Eric B Durbin
- College of Medicine, University of Kentucky, Lexington, KY, USA
| | - Jennifer Doherty
- Utah Cancer Registry, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Antoinette Stroup
- New Jersey State Cancer Registry, New Jersey Department of Health, Trenton, NJ, USA
| | - Linda Coyle
- Information Management Services Inc., Calverton, MD, USA
| | - Lynne Penberthy
- Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, USA
| | | |
Collapse
|
7
|
Abstract
One of the major aspects affecting the performance of the classification algorithms is the amount of labeled data which is available during the training phase. It is widely accepted that the labeling procedure of vast amounts of data is both expensive and time-consuming since it requires the employment of human expertise. For a wide variety of scientific fields, unlabeled examples are easy to collect but hard to handle in a useful manner, thus improving the contained information for a subject dataset. In this context, a variety of learning methods have been studied in the literature aiming to efficiently utilize the vast amounts of unlabeled data during the learning process. The most common approaches tackle problems of this kind by individually applying active learning or semi-supervised learning methods. In this work, a combination of active learning and semi-supervised learning methods is proposed, under a common self-training scheme, in order to efficiently utilize the available unlabeled data. The effective and robust metrics of the entropy and the distribution of probabilities of the unlabeled set, to select the most sufficient unlabeled examples for the augmentation of the initial labeled set, are used. The superiority of the proposed scheme is validated by comparing it against the base approaches of supervised, semi-supervised, and active learning in the wide range of fifty-five benchmark datasets.
Collapse
|
8
|
A Method of Speech Coding for Speech Recognition Using a Convolutional Neural Network. Symmetry (Basel) 2019. [DOI: 10.3390/sym11091185] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
This work presents a new approach to speech recognition, based on the specific coding of time and frequency characteristics of speech. The research proposed the use of convolutional neural networks because, as we know, they show high resistance to cross-spectral distortions and differences in the length of the vocal tract. Until now, two layers of time convolution and frequency convolution were used. A novel idea is to weave three separate convolution layers: traditional time convolution and the introduction of two different frequency convolutions (mel-frequency cepstral coefficients (MFCC) convolution and spectrum convolution). This application takes into account more details contained in the tested signal. Our idea assumes creating patterns for sounds in the form of RGB (Red, Green, Blue) images. The work carried out research for isolated words and continuous speech, for neural network structure. A method for dividing continuous speech into syllables has been proposed. This method can be used for symmetrical stereo sound.
Collapse
|
9
|
Huang H, Huang J, Feng Y, Zhang J, Liu Z, Wang Q, Chen L. On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem. PLoS One 2019; 14:e0217408. [PMID: 31216289 PMCID: PMC6583946 DOI: 10.1371/journal.pone.0217408] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 05/10/2019] [Indexed: 12/03/2022] Open
Abstract
As a promising research direction in recent decades, active learning allows an oracle to assign labels to typical examples for performance improvement in learning systems. Existing works mainly focus on designing criteria for screening examples of high value to be labeled in a handcrafted manner. Instead of manually developing strategies of querying the user to access labels for the desired examples, we utilized the reinforcement learning algorithm parameterized with the neural network to automatically explore query strategies in active learning when addressing stream-based one-shot classification problems. With the involvement of cross-entropy in the loss function of Q-learning, an efficient policy to decide when and where to predict or query an instance is learned through the developed framework. Compared with a former influential work, the advantages of our method are demonstrated experimentally with two image classification tasks, and it exhibited better performance, quick convergence, relatively good stability and fewer requests for labels.
Collapse
Affiliation(s)
- Honglan Huang
- College of Systems Engineering, National University of Defense Technology, Changsha, Hunan, China
| | - Jincai Huang
- College of Systems Engineering, National University of Defense Technology, Changsha, Hunan, China
| | - Yanghe Feng
- College of Systems Engineering, National University of Defense Technology, Changsha, Hunan, China
- * E-mail:
| | - Jiarui Zhang
- College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, Hunan, China
| | - Zhong Liu
- College of Systems Engineering, National University of Defense Technology, Changsha, Hunan, China
| | - Qi Wang
- College of Systems Engineering, National University of Defense Technology, Changsha, Hunan, China
| | - Li Chen
- College of Systems Engineering, National University of Defense Technology, Changsha, Hunan, China
| |
Collapse
|
10
|
Yu H, Yang X, Zheng S, Sun C. Active Learning From Imbalanced Data: A Solution of Online Weighted Extreme Learning Machine. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:1088-1103. [PMID: 30137013 DOI: 10.1109/tnnls.2018.2855446] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
It is well known that active learning can simultaneously improve the quality of the classification model and decrease the complexity of training instances. However, several previous studies have indicated that the performance of active learning is easily disrupted by an imbalanced data distribution. Some existing imbalanced active learning approaches also suffer from either low performance or high time consumption. To address these problems, this paper describes an efficient solution based on the extreme learning machine (ELM) classification model, called active online-weighted ELM (AOW-ELM). The main contributions of this paper include: 1) the reasons why active learning can be disrupted by an imbalanced instance distribution and its influencing factors are discussed in detail; 2) the hierarchical clustering technique is adopted to select initially labeled instances in order to avoid the missed cluster effect and cold start phenomenon as much as possible; 3) the weighted ELM (WELM) is selected as the base classifier to guarantee the impartiality of instance selection in the procedure of active learning, and an efficient online updated mode of WELM is deduced in theory; and 4) an early stopping criterion that is similar to but more flexible than the margin exhaustion criterion is presented. The experimental results on 32 binary-class data sets with different imbalance ratios demonstrate that the proposed AOW-ELM algorithm is more effective and efficient than several state-of-the-art active learning algorithms that are specifically designed for the class imbalance scenario.
Collapse
|
11
|
Uteva E, Graham RS, Wilkinson RD, Wheatley RJ. Active learning in Gaussian process interpolation of potential energy surfaces. J Chem Phys 2018; 149:174114. [DOI: 10.1063/1.5051772] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Affiliation(s)
- Elena Uteva
- School of Chemistry, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Richard S. Graham
- School of Mathematical Sciences, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Richard D. Wilkinson
- School of Mathematics and Statistics, University of Sheffield, Sheffield S10 2TN, United Kingdom
| | - Richard J. Wheatley
- School of Chemistry, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| |
Collapse
|
12
|
Kholghi M, Phillips Y, Towsey M, Sitbon L, Roe P. Active learning for classifying long‐duration audio recordings of the environment. Methods Ecol Evol 2018. [DOI: 10.1111/2041-210x.13042] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | | | - Michael Towsey
- Queensland University of Technology Brisbane Qld Australia
| | | | - Paul Roe
- Queensland University of Technology Brisbane Qld Australia
| |
Collapse
|
13
|
|
14
|
Integration of Text Classification Model with Speech to Text System. BIG DATA ANALYTICS 2017. [DOI: 10.1007/978-3-319-72413-3_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
15
|
Han W, Coutinho E, Ruan H, Li H, Schuller B, Yu X, Zhu X. Semi-Supervised Active Learning for Sound Classification in Hybrid Learning Environments. PLoS One 2016; 11:e0162075. [PMID: 27627768 PMCID: PMC5023122 DOI: 10.1371/journal.pone.0162075] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2015] [Accepted: 08/17/2016] [Indexed: 11/19/2022] Open
Abstract
Coping with scarcity of labeled data is a common problem in sound classification tasks. Approaches for classifying sounds are commonly based on supervised learning algorithms, which require labeled data which is often scarce and leads to models that do not generalize well. In this paper, we make an efficient combination of confidence-based Active Learning and Self-Training with the aim of minimizing the need for human annotation for sound classification model training. The proposed method pre-processes the instances that are ready for labeling by calculating their classifier confidence scores, and then delivers the candidates with lower scores to human annotators, and those with high scores are automatically labeled by the machine. We demonstrate the feasibility and efficacy of this method in two practical scenarios: pool-based and stream-based processing. Extensive experimental results indicate that our approach requires significantly less labeled instances to reach the same performance in both scenarios compared to Passive Learning, Active Learning and Self-Training. A reduction of 52.2% in human labeled instances is achieved in both of the pool-based and stream-based scenarios on a sound classification task considering 16,930 sound instances.
Collapse
Affiliation(s)
- Wenjing Han
- Language Computing Lab, Samsung R&D Institute of China - Beijing (SRC-B), Beijing, China
| | - Eduardo Coutinho
- Department of Music, University of Liverpool, Liverpool, United Kingdom
- Department of Computing, Imperial College London, London, United Kingdom
| | - Huabin Ruan
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
- * E-mail:
| | - Haifeng Li
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Björn Schuller
- Department of Computing, Imperial College London, London, United Kingdom
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
- Complex Systems Engineering, University of Passau, Passau, Germany
| | - Xiaojie Yu
- Language Computing Lab, Samsung R&D Institute of China - Beijing (SRC-B), Beijing, China
| | - Xuan Zhu
- Language Computing Lab, Samsung R&D Institute of China - Beijing (SRC-B), Beijing, China
| |
Collapse
|
16
|
Sousa AF, Prudêncio RB, Ludermir TB, Soares C. Active learning and data manipulation techniques for generating training examples in meta-learning. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.02.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
17
|
Le TB, Kim SW. Modified criterion to select useful unlabeled data for improving semi-supervised support vector machines. Pattern Recognit Lett 2015. [DOI: 10.1016/j.patrec.2015.04.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
18
|
|
19
|
Ma L, Reisert M, Burkhardt H. RENNSH: a novel α-helix identification approach for intermediate resolution electron density maps. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:228-239. [PMID: 21383418 DOI: 10.1109/tcbb.2011.52] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Accurate identification of protein secondary structures is beneficial to understand three-dimensional structures of biological macromolecules. In this paper, a novel refined classification framework is proposed, which treats alpha-helix identification as a machine learning problem by representing each voxel in the density map with its Spherical Harmonic Descriptors (SHD). An energy function is defined to provide statistical analysis of its identification performance, which can be applied to all the α-helix identification approaches. Comparing with other existing α-helix identification methods for intermediate resolution electron density maps, the experimental results demonstrate that our approach gives the best identification accuracy and is more robust to the noise.
Collapse
|
20
|
Yu D, Varadarajan B, Deng L, Acero A. Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion. COMPUT SPEECH LANG 2010. [DOI: 10.1016/j.csl.2009.03.004] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
21
|
Shinozaki T, Ostendorf M, Atlas L. Characteristics of speaking style and implications for speech recognition. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 126:1500-1510. [PMID: 19739763 DOI: 10.1121/1.3183593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Differences in speaking style are associated with more or less spectral variability, as well as different modulation characteristics. The greater variation in some styles (e.g., spontaneous speech and infant-directed speech) poses challenges for recognition but possibly also opportunities for learning more robust models, as evidenced by prior work and motivated by child language acquisition studies. In order to investigate this possibility, this work proposes a new method for characterizing speaking style (the modulation spectrum), examines spontaneous, read, adult-directed, and infant-directed styles in this space, and conducts pilot experiments in style detection and sampling for improved speech recognizer training. Speaking style classification is improved by using the modulation spectrum in combination with standard pitch and energy variation. Speech recognition experiments on a small vocabulary conversational speech recognition task show that sampling methods for training with a small amount of data benefit from the new features.
Collapse
Affiliation(s)
- Takahiro Shinozaki
- Department of Electrical Engineering, University of Washington, Seattle, WA 98195-2500, USA
| | | | | |
Collapse
|
22
|
Kuo JS, Li H, Yang YK. Active learning for constructing transliteration lexicons from the Web. ACTA ACUST UNITED AC 2007. [DOI: 10.1002/asi.20737] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
23
|
Abstract
State of the art data-driven speech and language processing systems require a large amount of human intervention ranging from data annotation to system prototyping. In the traditional
supervised passive approach
, the system is trained on a given number of annotated data samples and evaluated using a separate test set. Then more data is collected arbitrarily, annotated, and the whole cycle is repeated. In this article, we propose the
active approach
where the system itself selects its own training data, evaluates itself and re-trains when necessary. We first employ
active learning
which aims to automatically select the examples that are likely to be the most informative for a given task. We use active learning for both selecting the examples to label and the examples to re-label in order to correct labeling errors. Furthermore, the system automatically evaluates itself using
active evaluation
to keep track of the unexpected events and decides on-demand to label more examples. The active approach enables dynamic adaptation of spoken language processing systems to unseen or unexpected events for nonstationary input while reducing the manual annotation effort significantly. We have evaluated the active approach with the AT&T spoken dialog system used for customer care applications. In this article, we present our results for both automatic speech recognition and spoken language understanding.
Collapse
|
24
|
Zhao Y, Xu C, Cao Y. Research on Query-by-Committee Method of Active Learning and Application. ADVANCED DATA MINING AND APPLICATIONS 2006. [DOI: 10.1007/11811305_107] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|