1
|
Zhang Y, Shirakawa M, Hara T. Generalized durative event detection on social media. J Intell Inf Syst 2023; 60:73-95. [PMID: 36818487 PMCID: PMC9927034 DOI: 10.1007/s10844-022-00730-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 07/15/2022] [Accepted: 07/18/2022] [Indexed: 11/26/2022]
Abstract
Given the recent availability of large volumes of social media discussions, finding temporal unusual phenomena, which can be called events, from such data is of great interest. Previous works on social media event detection either assume a specific type of event, or assume certain behavior of observed variables. In this paper, we propose a general method for event detection on social media that makes few assumptions. The main assumption we make is that when an event occurs, affected semantic aspects will behave differently from their usual behavior, for a sustained period. We generalize the representation of time units based on word embeddings of social media text, and propose an algorithm to detect durative events in time series in a general sense. In addition, we also provide an incremental version of the algorithm for the purpose of real-time detection. We test our approaches on synthetic data and two real-world tasks. With the synthetic dataset, we compare the performance of retrospective and incremental versions of the algorithm. In the first real-world task, we use a novel setting to test if our method and baseline methods can exhaustively catch all real-world news in the test period. The evaluation results show that when the event is quite unusual with regard to the base social media discussion, it can be captured more effectively with our method. In the second real-world task, we use the event captured to help improve the accuracy of stock market movement prediction. We show that our event-based approach has a clear advantage compared to other ways of adding social media information.
Collapse
Affiliation(s)
- Yihong Zhang
- Graduate School of Information Science and Technology, Multimedia Data Engineering Lab, Osaka University, Osaka, Japan
| | - Masumi Shirakawa
- Graduate School of Information Science and Technology, Multimedia Data Engineering Lab, Osaka University, Osaka, Japan
| | - Takahiro Hara
- Graduate School of Information Science and Technology, Multimedia Data Engineering Lab, Osaka University, Osaka, Japan
| |
Collapse
|
2
|
Mredula MS, Dey N, Rahman MS, Mahmud I, Cho YZ. A Review on the Trends in Event Detection by Analyzing Social Media Platforms' Data. SENSORS (BASEL, SWITZERLAND) 2022; 22:4531. [PMID: 35746313 PMCID: PMC9231398 DOI: 10.3390/s22124531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 06/08/2022] [Accepted: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Social media platforms have many users who share their thoughts and use these platforms to organize various events collectively. However, different upsetting incidents have occurred in recent years by taking advantage of social media, raising significant concerns. Therefore, considerable research has been carried out to detect any disturbing event and take appropriate measures. This review paper presents a thorough survey to acquire in-depth knowledge about the current research in this field and provide a guideline for future research. We systematically review 67 articles on event detection by sensing social media data from the last decade. We summarize their event detection techniques, tools, technologies, datasets, performance metrics, etc. The reviewed papers mainly address the detection of events, such as natural disasters, traffic, sports, real-time events, and some others. As these detected events can quickly provide an overview of the overall condition of the society, they can significantly help in scrutinizing events disrupting social security. We found that compatibility with different languages, spelling, and dialects is one of the vital challenges the event detection algorithms face. On the other hand, the event detection algorithms need to be robust to process different media, such as texts, images, videos, and locations. We outline that the event detection techniques compatible with heterogeneous data, language, and the platform are still missing. Moreover, the event and its location with a 24 × 7 real-time detection system will bolster the overall event detection performance.
Collapse
Affiliation(s)
- Motahara Sabah Mredula
- Institute of Information Technology, Jahangirnagar University, Savar 1342, Bangladesh; (M.S.M.); (N.D.)
| | - Noyon Dey
- Institute of Information Technology, Jahangirnagar University, Savar 1342, Bangladesh; (M.S.M.); (N.D.)
| | - Md. Sazzadur Rahman
- Institute of Information Technology, Jahangirnagar University, Savar 1342, Bangladesh; (M.S.M.); (N.D.)
| | - Imtiaz Mahmud
- School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Korea;
| | - You-Ze Cho
- School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Korea;
| |
Collapse
|
3
|
Chowdhury SR, Basu S, Maulik U. A survey on event and subevent detection from microblog data towards crisis management. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2022. [DOI: 10.1007/s41060-022-00335-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
4
|
Abdella JA, Zaki NM, Shuaib K, Khan F. Airline ticket price and demand prediction: A survey. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2021. [DOI: 10.1016/j.jksuci.2019.02.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
5
|
Singh T, Kumari M. Burst: real-time events burst detection in social text stream. THE JOURNAL OF SUPERCOMPUTING 2021; 77:11228-11256. [PMID: 33776205 PMCID: PMC7982883 DOI: 10.1007/s11227-021-03717-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 02/25/2021] [Indexed: 06/12/2023]
Abstract
Gigantic growth of social media and unbeatable trend of progress in the direction of the web seeking user's interests have generated a storm of social text streams. Seeking information to know the phenomenon of various events in the early stages is quite interesting. Various kinds of social media live streams attract users to participate in real-time events to become a part of an immense crowd. However, the vast amount of text is present on social media, the unnecessary information bogs a social text stream filtering to extract the appropriate topics and events effectively. Therefore, detecting, classifying, and identifying burst events is quite challenging due to the sparse and noisy text of Twitter. The researchers' significant open challenges are the effective cleaning and profound representation of the text stream data. This research article's main contribution is to provide a detailed study and explore bursty event detection in the social text stream. Thus, this work's main motive is to present a concise approach that classifies and detects the event keywords and maintains the record of the event based on related features. These features permit the approach to successfully determine the booming pattern of events scrupulously at different time span. Experiments are conducted and compared with the state-of-the-art methods, which reveals that the proposed approach is proficient to detect valuable patterns of interest and also achieve better scoresto extract burst events on social media posted by various users.
Collapse
Affiliation(s)
- Tajinder Singh
- Sant Longowal Institute of Engineering & Technology, Sangrur, Punjab India
| | | |
Collapse
|
6
|
Tree decomposition based anomalous connected subgraph scanning for detecting and forecasting events in attributed social media networks. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.04.064] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
7
|
Rosa RL, De Silva MJ, Silva DH, Ayub MS, Carrillo D, Nardelli PHJ, Rodriguez DZ. Event Detection System Based on User Behavior Changes in Online Social Networks: Case of the COVID-19 Pandemic. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:158806-158825. [PMID: 34812354 PMCID: PMC8545310 DOI: 10.1109/access.2020.3020391] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Accepted: 08/25/2020] [Indexed: 05/13/2023]
Abstract
People use Online Social Networks (OSNs) to express their opinions and feelings about many topics. Depending on the nature of an event and its dissemination rate in OSNs, and considering specific regions, the users' behavior can drastically change over a specific period of time. In this context, this work aims to propose an event detection system at the early stages of an event based on changes in the users' behavior in an OSN. This system can detect an event of any subject, and thus, it can be used for different purposes. The proposed event detection system is composed of the following main modules: (1) determination of the user's location, (2) message extraction from an OSN, (3) topic identification using natural language processing (NLP) based on the Deep Belief Network (DBN), (4) the user behavior change analyzer in the OSN, and (5) affective analysis for emotion identification based on a tree-convolutional neural network (tree-CNN). In the case of public health, the early event detection is very relevant for the population and the authorities in order to be able take corrective actions. Hence, the new coronavirus disease (COVID-19) is used as a case study in this work. For performance validation, the modules related to the topic identification and affective analysis were compared with other similar solutions or implemented with other machine learning algorithms. In the performance assessment, the proposed event detection system achieved an accuracy higher than 0.90, while other similar methods reached accuracy values less than 0.74. Additionally, our proposed system was able to detect an event almost three days earlier than the other methods. Furthermore, the information provided by the system permits to understand the predominant characteristics of an event, such as keywords and emotion type of messages.
Collapse
Affiliation(s)
- Renata Lopes Rosa
- Department of Computer ScienceUniversidade Federal de Lavras (UFLA) Lavras 37200 Brazil
| | | | | | - Muhammad Shoaib Ayub
- Department of Electrical EngineeringChulalongkorn University Bangkok 10330 Thailand
| | - Dick Carrillo
- School of Energy SystemsLappeenranta-Lahti University University of Technology 53850 Lappeenranta Finland
| | - Pedro H J Nardelli
- School of Energy SystemsLappeenranta-Lahti University University of Technology 53850 Lappeenranta Finland
| | | |
Collapse
|
8
|
Aslan S, Kaya B. Time-aware link prediction based on strengthened projection in bipartite networks. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2019.08.025] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
9
|
Ghani NA, Hamid S, Targio Hashem IA, Ahmed E. Social media big data analytics: A survey. COMPUTERS IN HUMAN BEHAVIOR 2019. [DOI: 10.1016/j.chb.2018.08.039] [Citation(s) in RCA: 155] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
10
|
Wang H, Li L, Pan P, Wang Y, Jin Y. Online detection of abnormal passenger out-flow in urban metro system. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.04.075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
11
|
Șerban O, Thapen N, Maginnis B, Hankin C, Foot V. Real-time processing of social media with SENTINEL: A syndromic surveillance system incorporating deep learning for health classification. Inf Process Manag 2019. [DOI: 10.1016/j.ipm.2018.04.011] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
12
|
Zhang Y, Eick CF. Tracking Events in Twitter by Combining an LDA-Based Approach and a Density–Contour Clustering Approach. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING 2019. [DOI: 10.1142/s1793351x19400051] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Nowadays, Twitter has become one of the fastest-growing microblogging services; consequently, analyzing this rich and continuously user-generated content can reveal unprecedentedly valuable knowledge. In this paper, we propose a novel two-stage system to detect and track events from tweets by integrating a Latent Dirichlet Allocation (LDA)-based approach and an efficient density–contour-based spatio-temporal clustering approach. In the proposed system, we first divide the geotagged tweet stream into temporal time windows; next, events are identified as topics in tweets using an LDA-based topic discovery step; then, each tweet is assigned an event label; next, a density–contour-based spatio-temporal clustering approach is employed to identify spatio-temporal event clusters. In our approach, topic continuity is established by calculating KL-divergences between topics and spatio-temporal continuity is established by a family of newly formulated spatial cluster distance functions. Moreover, the proposed density–contour clustering approach considers two types of densities: “absolute” density and “relative” density to identify event clusters where either there is a high density of event tweets or there is a high percentage of event tweets. We evaluate our approach using real-world data collected from Twitter, and the experimental results show that the proposed system can not only detect and track events effectively but also discover interesting patterns from geotagged tweets.
Collapse
Affiliation(s)
- Yongli Zhang
- Department of Computer Science, University of Houston, Houston, TX 77204-3010, USA
| | - Christoph F. Eick
- Department of Computer Science, University of Houston, Houston, TX 77204-3010, USA
| |
Collapse
|
13
|
Chen X, Wang S, Tang Y, Hao T. A bibliometric analysis of event detection in social media. ONLINE INFORMATION REVIEW 2019. [DOI: 10.1108/oir-03-2018-0068] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The purpose of this paper is to explore the research status and development trend of the field of event detection in social media (ED in SM) through a bibliometric analysis of academic publications.
Design/methodology/approach
First, publication distributions are analyzed including the trends of publications and citations, subject distribution, predominant journals, affiliations, authors, etc. Second, an indicator of collaboration degree is used to measure scientific connective relations from different perspectives. A network analysis method is then applied to reveal scientific collaboration relations. Furthermore, based on keyword co-occurrence analysis, major research themes and their evolutions throughout time span are discovered. Finally, a network analysis method is applied to visualize the analysis results.
Findings
The area of ED in SM has received increasing attention and interest in academia with Computer Science and Engineering as two major research subjects. The USA and China contribute the most to the area development. Affiliations and authors tend to collaborate more with those within the same country. Among the 14 identified research themes, newly emerged themes such as Pharmacovigilance event detection are discovered.
Originality/value
This study is the first to comprehensively illustrate the research status of ED in SM by conducting a bibliometric analysis. Up-to-date findings are reported, which can help relevant researchers understand the research trend, seek scientific collaborators and optimize research topic choices.
Collapse
|
14
|
dos Santos ED, Quiles MG, Faria FA. A correlation-based approach for event detection in Instagram. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2018. [DOI: 10.3233/jifs-169482] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Elder Donizetti dos Santos
- GIBIS Lab., Instituto de Ciência e Tecnologia, Universidade Federal de São Paulo, Campus de São José dos Campos, CEP 12231-280, São José dos Campos, SP, Brazil
| | - Marcos Gonçalves Quiles
- GIBIS Lab., Instituto de Ciência e Tecnologia, Universidade Federal de São Paulo, Campus de São José dos Campos, CEP 12231-280, São José dos Campos, SP, Brazil
| | - Fabio Augusto Faria
- GIBIS Lab., Instituto de Ciência e Tecnologia, Universidade Federal de São Paulo, Campus de São José dos Campos, CEP 12231-280, São José dos Campos, SP, Brazil
| |
Collapse
|
15
|
Zubiaga A, Kochkina E, Liakata M, Procter R, Lukasik M, Bontcheva K, Cohn T, Augenstein I. Discourse-aware rumour stance classification in social media using sequential classifiers. Inf Process Manag 2018. [DOI: 10.1016/j.ipm.2017.11.009] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
16
|
Hu W, Wang H, Peng C, Liang H, Du B. RETRACTED: An event detection method for social networks based on link prediction. INFORM SYST 2017. [DOI: 10.1016/j.is.2017.06.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
17
|
Srijith P, Hepple M, Bontcheva K, Preotiuc-Pietro D. Sub-story detection in Twitter with hierarchical Dirichlet processes. Inf Process Manag 2017. [DOI: 10.1016/j.ipm.2016.10.004] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|