1
|
Num-Symbolic Homophonic Social Net-Words. INFORMATION 2022. [DOI: 10.3390/info13040174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Many excellent studies about social networks and text analyses can be found in the literature, facilitating the rapid development of automated text analysis technology. Due to the lack of natural separators in Chinese, the text numbers and symbols also have their original literal meaning. Thus, combining Chinese characters with numbers and symbols in user-generated content is a challenge for the current analytic approaches and procedures. Therefore, we propose a new hybrid method for detecting blended numeric and symbolic homophony Chinese neologisms (BNShCNs). Interpretation of the words’ actual semantics was performed according to their independence and relative position in context. This study obtained a shortlist using a probability approach from internet-collected user-generated content; subsequently, we evaluated the shortlist by contextualizing word-embedded vectors for BNShCN detection. The experiments show that the proposed method efficiently extracted BNShCNs from user-generated content.
Collapse
|
2
|
GE C, SHI H, JIANG J, XU X. Investigating the Demand for Blockchain Talents in the Recruitment Market: Evidence from Topic Modeling Analysis on Job Postings. INFORMATION & MANAGEMENT 2021. [DOI: 10.1016/j.im.2021.103513] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
3
|
Deng W, Yang Y. Cross-Platform Comparative Study of Public Concern on Social Media during the COVID-19 Pandemic: An Empirical Study Based on Twitter and Weibo. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:6487. [PMID: 34208483 PMCID: PMC8296381 DOI: 10.3390/ijerph18126487] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 05/31/2021] [Accepted: 06/12/2021] [Indexed: 11/24/2022]
Abstract
The COVID-19 pandemic has created a global health crisis that has affected economies and societies worldwide. During these times of uncertainty and crisis, people have turned to social media platforms as communication tools and primary information sources. Online discourse is conducted under the influence of many different factors, such as background, culture, politics, etc. However, parallel comparative research studies conducted in different countries to identify similarities and differences in online discourse are still scarce. In this study, we combine the crisis lifecycle and opinion leader concepts and use data mining and a set of predefined search terms (coronavirus and COVID-19) to investigate discourse on Twitter (101,271 tweets) and Sina Weibo (92,037 posts). Then, we use a topic modeling technique, Latent Dirichlet Allocation (LDA), to identify the most common issues posted by users and temporal analysis to research the issue's trend. Social Network Analysis (SNA) allows us to discover the opinion leader on the two different platforms. Finally, we find that online discourse reflects the crisis lifecycle according to the stage of COVID-19 in China and the US. Regarding the status of the COVID-19 pandemic, users of Twitter tend to pay more attention to the economic situation while users of Weibo pay more attention to public health. The issues focused on in online discourse have a strong relationship with the development of the crisis in different countries. Additionally, on the Twitter platform many political actors act as opinion leaders, while on the Weibo platform official media and government accounts control the release of information.
Collapse
Affiliation(s)
| | - Yi Yang
- College of Public Administration, Huazhong University of Science and Technology, Wuhan 430074, China;
| |
Collapse
|
4
|
Xie J, Zhang Z, Liu X, Zeng D. Unveiling the Hidden Truth of Drug Addiction: A Social Media Approach Using Similarity Network-Based Deep Learning. J MANAGE INFORM SYST 2021. [DOI: 10.1080/07421222.2021.1870388] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Jiaheng Xie
- Department of Accounting and MIS, Lerner College of Business & Economics, University of Delaware, Newark, DE, USA
| | - Zhu Zhang
- State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences
- Shenzhen Artificial Intelligence and Data Science Research Institute (Longhua)
| | - Xiao Liu
- Arizona State University, Tempe, AZ, USA
| | - Daniel Zeng
- State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences
- University of Chinese Academy of Sciences
- Shenzhen Artificial Intelligence and Data Science Research Institute (Longhua)
| |
Collapse
|
5
|
Guangce R, Lei X. Knowledge Discovery of News Text Based on Artificial Intelligence. ACM T ASIAN LOW-RESO 2021. [DOI: 10.1145/3418062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
The explosion of news text and the development of artificial intelligence provide a new opportunity and challenge to provide high-quality media monitoring service. In this article, we propose a semantic analysis approach based on the Latent Dirichlet Allocation (LDA) and Apriori algorithm, and we realize application to improve media monitoring reports by mining large-scale news text. First, we propose to use LDA model to mine news text topic words and reducing news dimensionality. Then, we propose to use Apriori algorithm to discovering the relationship of topic words. Finally, we discovery the relevance of news text topic words and show the intensity and dependency among topic words through drawing. This application can realize to extract the news topics and discover the correlation and dependency among news topics in mass news text. The results show that the method based on LDA and Apriori can help the media monitoring staff to better understand the hidden knowledge in the news text and improve the media analysis report.
Collapse
Affiliation(s)
- Ruan Guangce
- Information Management Department, East China Normal University, Minhang, Shanghai, China
| | - Xia Lei
- Lecture & Exhibition Center, Shanghai Library, Huai Hai Zhong Lu, Shanghai, China
| |
Collapse
|
6
|
Sharma A, Kumar S. Bayesian rough set based information retrieval. JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS 2020. [DOI: 10.1080/09720510.2020.1799575] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Anil Sharma
- University School of Information Communication & Technology, Guru Gobind Singh Indraprastha University, Dwarka, New Delhi 110078, India
| | - Suresh Kumar
- Department of Computer Science & Engineering, Ambedkar Institute of Advanced, Communication Technologies & Research, Geeta Colony, New Delhi 110031, India
| |
Collapse
|
7
|
Exploiting Open Data to analyze discussion and controversy in online citizen participation. Inf Process Manag 2020. [DOI: 10.1016/j.ipm.2020.102301] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
8
|
Wu C, Kanoulas E, Rijke MD. Learning entity-centric document representations using an entity facet topic model. Inf Process Manag 2020. [DOI: 10.1016/j.ipm.2020.102216] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
9
|
Unwanted advances in higher education:Uncovering sexual harassment experiences in academia with text mining. Inf Process Manag 2020. [DOI: 10.1016/j.ipm.2019.102167] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
10
|
A content-location-aware public welfare activity information push system based on microblog. Inf Process Manag 2020. [DOI: 10.1016/j.ipm.2019.102137] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
11
|
Belkahla Driss O, Mellouli S, Trabelsi Z. From citizens to government policy-makers: Social media data analysis. GOVERNMENT INFORMATION QUARTERLY 2019. [DOI: 10.1016/j.giq.2019.05.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
12
|
Lashkari F, Bagheri E, Ghorbani AA. Neural embedding-based indices for semantic search. Inf Process Manag 2019. [DOI: 10.1016/j.ipm.2018.10.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
13
|
Elia G, Solazzo G, Lorenzo G, Passiante G. Assessing learners’ satisfaction in collaborative online courses through a big data approach. COMPUTERS IN HUMAN BEHAVIOR 2019. [DOI: 10.1016/j.chb.2018.04.033] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
14
|
De Mauro A, Greco M, Grimaldi M, Ritala P. Human resources for Big Data professions: A systematic classification of job roles and required skill sets. Inf Process Manag 2018. [DOI: 10.1016/j.ipm.2017.05.004] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
15
|
Qian Y, Du Y, Deng X, Ma B, Ye Q, Yuan H. Detecting new Chinese words from massive domain texts with word embedding. J Inf Sci 2018. [DOI: 10.1177/0165551518786676] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Textual information retrieval (TIR) is based on the relationship between word units. Traditional word segmentation techniques attempt to discern the word units accurately from texts; however, they are unable to appropriately and efficiently identify all new words. Identification of new words, especially in languages such as Chinese, remains a challenge. In recent years, word embedding methods have used numerical word vectors to retain the semantic and correlated information between words in a corpus. In this article, we propose the word-embedding-based method (WEBM), a novel method that combines word embedding and frequent n-gram string mining for discovering new words from domain corpora. First, we mapped all word units in a domain corpus to a high-dimension word vector space. Second, we used a frequent n-gram word string mining method to identify a set of candidates for new words. We designed a pruning strategy based on the word vectors to quantify the possibility of a word string being a new word, thereby allowing the evaluation of candidates based on the similarity of word units in the same string. In a comparative study, our experimental results revealed that WEBM had a great advantage in detecting new words from massive Chinese corpora.
Collapse
Affiliation(s)
- Yu Qian
- School of Management and Economics, University of Electronic Science and Technology of China, P.R. China
| | - Yang Du
- School of Management and Economics, University of Electronic Science and Technology of China, P.R. China
| | - Xiongwen Deng
- School of Management and Economics, University of Electronic Science and Technology of China, P.R. China
| | - Baojun Ma
- Research Center for Big Data Management & Intelligent Decision, School of Economics and Management, Beijing University of Posts and Telecommunications, P.R. China
| | - Qiongwei Ye
- School of Business, Yunnan University of Finance and Economics, P.R. China; School of Economics and Management, Tsinghua University, P.R. China
| | - Hua Yuan
- School of Management and Economics, University of Electronic Science and Technology of China, P.R. China
| |
Collapse
|
16
|
Ahmadian S, Meghdadi M, Afsharchi M. A social recommendation method based on an adaptive neighbor selection mechanism. Inf Process Manag 2018. [DOI: 10.1016/j.ipm.2017.03.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
17
|
Zhang W, Du Y, Yoshida T, Wang Q. DRI-RCNN: An approach to deceptive review identification using recurrent convolutional neural network. Inf Process Manag 2018. [DOI: 10.1016/j.ipm.2018.03.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
18
|
|
19
|
Luo J, Pan X, Zhu X. Discovery of repost patterns by topic analysis in enterprise social networking. ASLIB J INFORM MANAG 2017. [DOI: 10.1108/ajim-08-2016-0128] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
An increasing number of users are inspired by enterprises to repost social media messages, which greatly contributes to the dissemination of such messages in an online social network. The purpose of this paper is to discover the repost patterns of users regarding enterprise social media messages to help enterprises improve information management abilities for social media.
Design/methodology/approach
This paper proposes a novel method to discover the repost patterns of users in enterprise social networking (ESN) at the macro-level through topic analysis. Specifically, it proposes the message-diversity metric to measure the latent topic diversity degree of the social media messages. Through this technique, the paper analyzes the message-diversity characteristics of the enterprise social media messages and then explores the repost patterns of users.
Findings
The experimental results show that a high repost rate is more prominent for the messages with diverse latent topics, where message-diversity is as high as 0.5.
Practical implications
The findings have great potential in several management areas, such as employing social media marketing, predicting popular messages, helping enterprises strengthen their online presence, and gathering more potential customers.
Originality/value
This study explores how the repost patterns of users in ESN can be determined through general macro-level behavior of users instead of their micro-level processes. The patterns can also lead to a deeper understanding of which contents can drive people to diffuse information. This study gives an important insight into the information behavior of social media users for enterprise management researchers.
Collapse
|