1
|
Hussain K, Saeed Z, Abbasi R, Sindhu M, Khattak A, Arafat S, Daud A, Mushtaq M. Towards understanding the role of content-based and contextualized features in detecting abuse on Twitter. Heliyon 2024; 10:e29593. [PMID: 38665572 PMCID: PMC11043940 DOI: 10.1016/j.heliyon.2024.e29593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 04/03/2024] [Accepted: 04/10/2024] [Indexed: 04/28/2024] Open
Abstract
This paper presents a novel approach for detecting abuse on Twitter. Abusive posts have become a major problem for social media platforms like Twitter. It is important to identify abuse to mitigate its potential harm. Many researchers have proposed methods to detect abuse on Twitter. However, most of the existing approaches for detecting abuse look only at the content of the abusive tweet in isolation and do not consider its contextual information, particularly the tweets posted before the abusive tweet. In this paper, we propose a new method for detecting abuse that uses contextual information from the tweets that precede and follow the abusive tweet. We hypothesize that this contextual information can be used to better understand the intent of the abusive tweet and to identify abuse that content-based methods would otherwise miss. We performed extensive experiments to identify the best combination of features and machine learning algorithms to detect abuse on Twitter. We test eight different machine learning classifiers on content- and context-based features for the experiments. The proposed method is compared with existing abuse detection methods and achieves an absolute improvement of around 7%. The best results are obtained by combining the content and context-based features. The highest accuracy of the proposed method is 86%, whereas the existing methods used for comparison have highest accuracy of 79.2%.
Collapse
Affiliation(s)
- Kamal Hussain
- Instituto Superior Técnico, Universidade de Lisboa, Portugal
| | - Zafar Saeed
- Dipartimento di Informatica, Università degli Studi di Bari, Bari, Italy
| | - Rabeeh Abbasi
- Department of Computer Science, Quaid-i-Azam University, Islamabad, Pakistan
| | - Muddassar Sindhu
- Department of Computer Science, Quaid-i-Azam University, Islamabad, Pakistan
| | - Akmal Khattak
- Department of Computer Science, Quaid-i-Azam University, Islamabad, Pakistan
| | - Sachi Arafat
- Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Ali Daud
- Faculty of Resilience, Rabdan Academy, Abu Dhabi, United Arab Emirates
| | - Mubashar Mushtaq
- Department of Computer Science, Forman Christian College, Lahore, Pakistan
| |
Collapse
|
2
|
Said A, Shabbir M, Hassan SU, Hassan ZR, Ahmed A, Koutsoukos X. On augmenting topological graph representations for attributed graphs. Appl Soft Comput 2023. [DOI: 10.1016/j.asoc.2023.110104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
3
|
Cao R, Liu XF, Fang Z, Xu XK, Wang X. How do scientific papers from different journal tiers gain attention on social media? Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2022.103152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
4
|
User engagement with scholarly tweets of scientific papers: a large-scale and cross-disciplinary analysis. Scientometrics 2022. [DOI: 10.1007/s11192-022-04468-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
AbstractThis study investigates the extent to which scholarly tweets of scientific papers are engaged with by Twitter users through four types of user engagement behaviors, i.e., liking, retweeting, quoting, and replying. Based on a sample consisting of 7 million scholarly tweets of Web of Science papers, our results show that likes is the most prevalent engagement metric, covering 44% of scholarly tweets, followed by retweets (36%), whereas quotes and replies are only present for 9% and 7% of all scholarly tweets, respectively. From a disciplinary point of view, scholarly tweets in the field of Social Sciences and Humanities are more likely to trigger user engagement over other subject fields. The presence of user engagement is more associated with other Twitter-based factors (e.g., number of mentioned users in tweets and number of followers of users) than with science-based factors (e.g., citations and Mendeley readers of tweeted papers). Building on these findings, this study sheds light on the possibility to apply user engagement metrics in measuring deeper levels of Twitter reception of scholarly information.
Collapse
|
5
|
Liu J, Liu Y. Exploring the User Interaction Network in an Anxiety Disorder Online Community: An Exponential Random Graph Model with Topical and Emotional Effects. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19116354. [PMID: 35681939 PMCID: PMC9180229 DOI: 10.3390/ijerph19116354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 05/16/2022] [Accepted: 05/18/2022] [Indexed: 11/16/2022]
Abstract
The increasing number of people with anxiety disorders presents challenges when gathering health information. Users in anxiety disorder online communities (ADOCs) share and obtain a variety of health information, such as treatment experience, drug efficacy, and emotional support. This interaction alleviates the difficulties involved in obtaining health information. Users engage in community interaction via posts, comments, and replies, which promotes the development of an online community as well as the wellbeing of community users, and research concerning the formation mechanism of the user interaction network in ADOCs could be beneficial to users. Taking the Anxiety Disorder Post Bar as the research object, this study constructed an ADOC user interaction network based on users' posts, comments, and personal information data. With the help of exponential random graph models (ERGMs), we studied the effects of the network structure, user attributes, topics, and emotional intensity on user interaction networks. We found that there was significant reciprocity in the user interaction network in ADOCs. In terms of user attributes, gender homogeneity had no impact on the formation of the user interaction network. Experienced users in the community had obvious advantages, and experienced users could obtain replies more easily from other members. In terms of topics, pathology popularization showed obvious homogeneity, and symptoms of generalized anxiety disorder showed obvious heterogeneity. In terms of emotional intensity, users with polarized emotions were more likely to receive replies from users with positive emotions. The probability of interaction between two users with negative emotions was small, and users with opposite emotional polarity tended to interact, especially when the interaction was initiated by users with positive emotions.
Collapse
|
6
|
Pal A, Rees TJ. Introducing the EMPIRE Index: A novel, value-based metric framework to measure the impact of medical publications. PLoS One 2022; 17:e0265381. [PMID: 35377894 PMCID: PMC8979442 DOI: 10.1371/journal.pone.0265381] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 03/01/2022] [Indexed: 12/13/2022] Open
Abstract
Article-level measures of publication impact (alternative metrics or altmetrics) can help authors and other stakeholders assess engagement with their research and the success of their communication efforts. The wide variety of altmetrics can make interpretation and comparative assessment difficult; available summary tools are either narrowly focused or do not reflect the differing values of metrics from a stakeholder perspective. We created the EMPIRE (EMpirical Publication Impact and Reach Evaluation) Index, a value-based, multi-component metric framework for medical publications. Metric weighting and grouping were informed by a statistical analysis of 2891 Phase III clinical trial publications and by a panel of stakeholders who provided value assessments. The EMPIRE Index comprises three component scores (social, scholarly, and societal impact), each incorporating related altmetrics indicating a different aspect of engagement with the publication. These are averaged to provide a total impact score and benchmarked so that a score of 100 equals the mean scores of Phase III clinical trial publications in the New England Journal of Medicine (NEJM) in 2016. Predictor metrics are defined to estimate likely long-term impact. The social impact component correlated strongly with the Altmetric Attention Score and the scholarly impact component correlated modestly with CiteScore, with the societal impact component providing unique insights. Analysis of fresh metrics collected 1 year after the initial dataset, including an independent sample, showed that scholarly and societal impact scores continued to increase, whereas social impact scores did not. Analysis of NEJM ‘notable articles’ showed that observational studies had the highest total impact and component scores, except for societal impact, for which surgical studies had the highest score. The EMPIRE Index provides a richer assessment of publication value than standalone traditional and alternative metrics and may enable medical researchers to assess the impact of publications easily and to understand what characterizes impactful research.
Collapse
|
7
|
Ye YE, Na JC, Oh P. Are automated accounts driving scholarly communication on Twitter? a case study of dissemination of COVID-19 publications. Scientometrics 2022; 127:2151-2172. [PMID: 35370323 PMCID: PMC8956136 DOI: 10.1007/s11192-022-04343-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2020] [Accepted: 03/08/2022] [Indexed: 11/08/2022]
Abstract
From a network perspective, this study analyzes 659 users mentioning sampled COVID-19 articles 10 or more times on Twitter with a focus on their roles in facilitating the process of scholarly communication. Different from existing studies, we consider both the user types and the automation of accounts to profile influential users in the network of research dissemination. Our study found that similar to academic users, non-academic users can also be active players in communicating scientific publications. The results highlight the intensive interactions between human users and automated accounts, including bots and cyborgs, which accounted for 45% of connections among the top users. This study also demonstrates the important role of automated accounts in initiating and facilitating research dissemination. Specifically, (1) bot-assisted academic publishers showed the highest amplifier scores, which measures a user’s tendency of being the first to share information and reach out to others within their trusted networks, (2) 5.28% of the selected articles was first tweeted by automated research feeds, ranking the fourth among the 22 classified user groups, and (3) bot-assisted publishers and automated feeds of generic topics and news alerts were highly ranked in authority, a network measure to quantify the degree to which a user consumes important resources of relevant topics. In the conclusion section, we discuss future directions to improve the validity of Twitter metrics in assessing research impacts.
Collapse
|
8
|
Shams F, Abbas A, Khan W, Khan US, Nawaz R. A death, infection, and recovery (DIR) model to forecast the COVID-19 spread. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE UPDATE 2021; 2:100047. [PMID: 34977844 PMCID: PMC8713423 DOI: 10.1016/j.cmpbup.2021.100047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 11/14/2021] [Accepted: 12/27/2021] [Indexed: 06/14/2023]
Abstract
BACKGROUND The SARS-Cov-2 virus (commonly known as COVID-19) has resulted in substantial casualties in many countries. The first case of COVID-19 was reported in China towards the end of 2019. Cases started to appear in several other countries (including Pakistan) by February 2020. To analyze the spreading pattern of the disease, several researchers used the Susceptible-Infectious-Recovered (SIR) model. However, the classical SIR model cannot predict the death rate. OBJECTIVE In this article, we present a Death-Infection-Recovery (DIR) model to forecast the virus spread over a window of one (minimum) to fourteen (maximum) days. Our model captures the dynamic behavior of the virus and can assist authorities in making decisions on non-pharmaceutical interventions (NPI), like travel restrictions, lockdowns, etc. METHOD The size of training dataset used was 134 days. The Auto Regressive Integrated Moving Average (ARIMA) model was implemented using XLSTAT (add-in for Microsoft Excel), whereas the SIR and the proposed DIR model was implemented using python programming language. We compared the performance of DIR model with the SIR model and the ARIMA model by computing the Percentage Error and Mean Absolute Percentage Error (MAPE). RESULTS Experimental results demonstrate that the maximum% error in predicting the number of deaths, infections, and recoveries for a period of fourteen days using the DIR model is only 2.33%, using ARIMA model is 10.03% and using SIR model is 53.07%. CONCLUSION This percentage of error obtained in forecasting using DIR model is significantly less than the% error of the compared models. Moreover, the MAPE of the DIR model is sufficiently below the two compared models that indicates its effectiveness.
Collapse
Affiliation(s)
- Fazila Shams
- Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan
| | - Assad Abbas
- Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan
| | - Wasiq Khan
- Department of Computing and Mathematics, Liverpool John Moores University, Liverpool L3 3AF, United Kingdom
| | - Umar Shahbaz Khan
- National Centre of Robotics and Automation, H-12, Islamabad, Pakistan
| | - Raheel Nawaz
- Department of Operations, Technology, Events and Hospitality Management, Manchester Metropolitan University (MMU), Manchester M156BH, United Kingdom
| |
Collapse
|
9
|
A Novel Deep Neural Network-Based Approach to Measure Scholarly Research Dissemination Using Citations Network. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app112210970] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We investigated the scientific research dissemination by analyzing the publications and citation data, implying that not all citations are significantly important. Therefore, as alluded to existing state-of-the-art models that employ feature-based techniques to measure the scholarly research dissemination between multiple entities, our model implements the convolutional neural network (CNN) with fastText-based pre-trained embedding vectors, utilizes only the citation context as its input to distinguish between important and non-important citations. Moreover, we speculate using focal-loss and class weight methods to address the inherited class imbalance problems in citation classification datasets. Using a dataset of 10 K annotated citation contexts, we achieved an accuracy of 90.7% along with a 90.6% f1-score, in the case of binary classification. Finally, we present a case study to measure the comprehensiveness of our deployed model on a dataset of 3100 K citations taken from the ACL Anthology Reference Corpus. We employed state-of-the-art graph visualization open-source tool Gephi to analyze the various aspects of citation network graphs, for each respective citation behavior.
Collapse
|
10
|
Understanding and predicting the dissemination of scientific papers on social media: a two-step simultaneous equation modeling–artificial neural network approach. Scientometrics 2021. [DOI: 10.1007/s11192-021-04051-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
11
|
Yeung AWK, Kletecka-Pulker M, Eibensteiner F, Plunger P, Völkl-Kernstock S, Willschke H, Atanasov AG. Implications of Twitter in Health-Related Research: A Landscape Analysis of the Scientific Literature. Front Public Health 2021; 9:654481. [PMID: 34307273 PMCID: PMC8299201 DOI: 10.3389/fpubh.2021.654481] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 06/09/2021] [Indexed: 01/07/2023] Open
Abstract
Background: Twitter, representing a big social media network, is broadly used for the communication of health-related information. In this work, we aimed to identify and analyze the scientific literature on Twitter use in context of health by utilizing a bibliometric approach, in order to obtain quantitative information on dominant research topics, trending themes, key publications, scientific institutions, and prolific researchers who contributed to this scientific area. Methods: Web of Science electronic database was searched to identify relevant papers on Twitter and health. Basic bibliographic data was obtained utilizing the "Analyze" function of the database. Full records and cited references were exported to VOSviewer, a dedicated bibliometric software, for further analysis. A term map and a keyword map were synthesized to visualize recurring words within titles, abstracts and keywords. Results: The analysis was based on the data from 2,582 papers. The first papers were published in 2009, and the publication count increased rapidly since 2015. Original articles and reviews were published in a ratio of 10.6:1. The Journal of Medical Internet Research was the top journal, and the United States had contributions to over half (52%) of these publications, being the home-country of eight of the top ten most productive institutions. Keyword analysis identified six topically defined clusters, with professional education in healthcare being the top theme cluster (consisting of 66 keywords). The identified papers often investigated Twitter together with other social media, such as YouTube and Facebook. Conclusions: A great diversity of themes was found in the identified papers, including: professional education in healthcare, big data and sentiment analysis, social marketing and substance use, physical and emotional well-being of young adults, and public health and health communication. Our quantitative analysis outlines Twitter as both, an increasingly popular data source, and a highly versatile tool for health-related research.
Collapse
Affiliation(s)
- Andy Wai Kan Yeung
- Oral and Maxillofacial Radiology, Applied Oral Sciences and Community Dental Care, Faculty of Dentistry, The University of Hong Kong, Hong Kong, China
| | - Maria Kletecka-Pulker
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Institute for Ethics and Law in Medicine, University of Vienna, Vienna, Austria
| | - Fabian Eibensteiner
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Division of Pediatric Nephrology and Gastroenterology, Department of Pediatrics and Adolescent Medicine, Comprehensive Center for Pediatrics, Medical University of Vienna, Vienna, Austria
| | - Petra Plunger
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
| | - Sabine Völkl-Kernstock
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
| | - Harald Willschke
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Medical University Vienna, Vienna, Austria
| | - Atanas G Atanasov
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Magdalenka, Poland.,Institute of Neurobiology, Bulgarian Academy of Sciences, Sofia, Bulgaria.,Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
| |
Collapse
|
12
|
Pamplona da Costa J, Sica de Campos AL, Cintra PR, Greco LF, Poker JH. The nature of rapid response to COVID-19 in Latin America: an examination of Argentina, Brazil, Chile, Colombia and Mexico. ONLINE INFORMATION REVIEW 2021. [DOI: 10.1108/oir-09-2020-0391] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
PurposeThe coronavirus-19 (COVID-19) pandemic mobilized the international scientific community in the search for its cure and containment. The purpose of this paper is to examine the nature of the rapid response to the COVID-19 of the scientific community in selected Latin American countries (Argentina, Brazil, Chile, Colombia and Mexico) in the period running from January to August 2020. Rapid response is reconceptualized from its original meaning in health policy, as the swift mobilization of existing scientific resources to address an emergency (DeVita et al., 2017).Design/methodology/approachThe paper explores the rapid response of the Argentinian, Brazilian, Chilean, Colombian and Mexican scientific communities from the perspective of bibliometric and altmetric data. The authors will examine scientific publications indexed to the Web of Science (WoS) dealing with COVID-19. Besides patterns of scientific output and impact as measured by citations, the authors complement the analysis with altmetric analysis. The aim is to verify whether or not factors that explain the extent of scientific impact can also be identified with respect to the wider impact made evident by altmetric indicators (Haustein, 2016).FindingsThe authors identified a somewhat limited response of the Argentinian, Brazilian, Chilean, Colombian and Mexican scientific communities to COVID-19 in terms of quantity of publications. The authorship of publications in the topic of COVID-19 was associated with authorship of publications dealing with locally relevant diseases. Some factors appear to contribute to visibility of scientific outputs. Papers that involved wider international collaborations and authors with previous publications in arboviruses were associated with higher levels of citations. Previous work on arbovirus was also associated with higher altmetric attention. The country of origin of authors exerted a positive effect on altmetric indicators.Research limitations/implicationsA limitation in the analysis is that, due to the nature of the data source (WoS), the authors were unable to verify the career status and the productivity of the authors in the sample. Nonetheless, the results appear to suggest that there is some overlapping in authors conducting research in Arboviruses and COVID-19. Career status and productivity should be the focus of future research.Practical implicationsIn the context of countries with limited scientific resources, like the ones investigated in our Latin American sample, previous efforts in the study of locally relevant diseases may contribute to the creation of an expertise that can be applied when a health emergency brings about a novel disease.Originality/valueThe originality of the paper rests on the fact that the authors identified that previous work on arbovirus contributed to the scientific visibility of publications on COVID-19.
Collapse
|
13
|
Hassan SU, Shabbir M, Iqbal S, Said A, Kamiran F, Nawaz R, Saif U. Leveraging Deep Learning and SNA approaches for Smart City Policing in the Developing World. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT 2021. [DOI: 10.1016/j.ijinfomgt.2019.102045] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
14
|
Fang Z, Costas R, Tian W, Wang X, Wouters P. How is science clicked on Twitter? Click metrics for Bitly short links to scientific publications. J Assoc Inf Sci Technol 2021. [DOI: 10.1002/asi.24458] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Zhichao Fang
- Centre for Science and Technology Studies (CWTS) Leiden University Leiden The Netherlands
| | - Rodrigo Costas
- Centre for Science and Technology Studies (CWTS) Leiden University Leiden The Netherlands
- DST‐NRF Centre of Excellence in Scientometrics and Science Technology and Innovation Policy, Stellenbosch University Stellenbosch South Africa
| | - Wencan Tian
- WISE Lab, Institute of Science of Science and S&T Management Dalian University of Technology Dalian China
| | - Xianwen Wang
- WISE Lab, Institute of Science of Science and S&T Management Dalian University of Technology Dalian China
| | - Paul Wouters
- Centre for Science and Technology Studies (CWTS) Leiden University Leiden The Netherlands
| |
Collapse
|
15
|
Said A, Janjua MU, Hassan SU, Muzammal Z, Saleem T, Thaipisutikul T, Tuarob S, Nawaz R. Detailed analysis of Ethereum network on transaction behavior, community structure and link prediction. PeerJ Comput Sci 2021; 7:e815. [PMID: 34977356 PMCID: PMC8670368 DOI: 10.7717/peerj-cs.815] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 11/23/2021] [Indexed: 05/04/2023]
Abstract
Ethereum, the second-largest cryptocurrency after Bitcoin, has attracted wide attention in the last few years and accumulated significant transaction records. However, the underlying Ethereum network structure is still relatively unexplored. Also, very few attempts have been made to perform link predictability on the Ethereum transactions network. This paper presents a Detailed Analysis of the Ethereum Network on Transaction Behavior, Community Structure, and Link Prediction (DANET) framework to investigate various valuable aspects of the Ethereum network. Specifically, we explore the change in wealth distribution and accumulation on Ethereum Featured Transactional Network (EFTN) and further study its community structure. We further hunt for a suitable link predictability model on EFTN by employing state-of-the-art Variational Graph Auto-Encoders. The link prediction experimental results demonstrate the superiority of outstanding prediction accuracy on Ethereum networks. Moreover, the statistic usages of the Ethereum network are visualized and summarized through the experiments allowing us to formulate conjectures on the current use of this technology and future development.
Collapse
Affiliation(s)
- Anwar Said
- Department of Computer Science, Information Technology University, Lahore, Pakistan
| | - Muhammad Umar Janjua
- Department of Computer Science, Information Technology University, Lahore, Pakistan
| | - Saeed-Ul Hassan
- Department of Computing and Mathematics, The Manchester Metropolitan University, Manchester, United Kingdom
| | - Zeeshan Muzammal
- Department of Computer Science, Information Technology University, Lahore, Pakistan
| | - Tania Saleem
- Department of Computer Science, Information Technology University, Lahore, Pakistan
| | - Tipajin Thaipisutikul
- Faculty of Information and Communication Technology, Mahidol University, Salaya, Nakhon Pathom, Thailand
| | - Suppawong Tuarob
- Faculty of Information and Communication Technology, Mahidol University, Salaya, Nakhon Pathom, Thailand
| | - Raheel Nawaz
- Department of Operations, Technology, Events and Hospitality Management, Manchester Metropolitan University, Manchester, United Kingdom
| |
Collapse
|
16
|
Copiello S. Other than detecting impact in advance, alternative metrics could act as early warning signs of retractions: tentative findings of a study into the papers retracted by PLoS ONE. Scientometrics 2020. [DOI: 10.1007/s11192-020-03698-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
17
|
Costas R, Rijcke S, Marres N. “Heterogeneous couplings”: Operationalizing network perspectives to study science‐society interactions through social media metrics. J Assoc Inf Sci Technol 2020. [DOI: 10.1002/asi.24427] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Rodrigo Costas
- Centre for Science and Technology Studies (CWTS) Leiden University Leiden The Netherlands
- DST‐NRF Centre of Excellence in Scientometrics and Science, Technology and Innovation Policy Stellenbosch University Stellenbosch South Africa
| | - Sarah Rijcke
- Centre for Science and Technology Studies (CWTS) Leiden University Leiden The Netherlands
| | - Noortje Marres
- Centre for Science and Technology Studies (CWTS) Leiden University Leiden The Netherlands
- Centre for Interdisciplinary Methodologies University of Warwick Coventry UK
| |
Collapse
|
18
|
Drongstrup D, Malik S, Aljohani NR, Alelyani S, Safder I, Hassan SU. Can social media usage of scientific literature predict journal indices of AJG, SNIP and JCR? An altmetric study of economics. Scientometrics 2020. [DOI: 10.1007/s11192-020-03613-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
19
|
Yan W, Liu Q, Chen R, Yi S. Social networks formed by follower–followee relationships on academic social networking sites: an examination of corporation users. Scientometrics 2020. [DOI: 10.1007/s11192-020-03553-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
20
|
|
21
|
Hassan SU, Saleem A, Soroya SH, Safder I, Iqbal S, Jamil S, Bukhari F, Aljohani NR, Nawaz R. Sentiment analysis of tweets through Altmetrics: A machine learning approach. J Inf Sci 2020. [DOI: 10.1177/0165551520930917] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The purpose of the study is to (a) contribute to annotating an Altmetrics dataset across five disciplines, (b) undertake sentiment analysis using various machine learning and natural language processing–based algorithms, (c) identify the best-performing model and (d) provide a Python library for sentiment analysis of an Altmetrics dataset. First, the researchers gave a set of guidelines to two human annotators familiar with the task of related tweet annotation of scientific literature. They duly labelled the sentiments, achieving an inter-annotator agreement (IAA) of 0.80 (Cohen’s Kappa). Then, the same experiments were run on two versions of the dataset: one with tweets in English and the other with tweets in 23 languages, including English. Using 6388 tweets about 300 papers indexed in Web of Science, the effectiveness of employed machine learning and natural language processing models was measured by comparing with well-known sentiment analysis models, that is, SentiStrength and Sentiment140, as the baseline. It was proved that Support Vector Machine with uni-gram outperformed all the other classifiers and baseline methods employed, with an accuracy of over 85%, followed by Logistic Regression at 83% accuracy and Naïve Bayes at 80%. The precision, recall and F1 scores for Support Vector Machine, Logistic Regression and Naïve Bayes were (0.89, 0.86, 0.86), (0.86, 0.83, 0.80) and (0.85, 0.81, 0.76), respectively.
Collapse
Affiliation(s)
| | | | - Saira Hanif Soroya
- Department of Information Management, University of the Punjab, Pakistan
| | | | | | - Saqib Jamil
- Department of Management Sciences, University of Okara, Pakistan
| | - Faisal Bukhari
- Punjab University College for Information Technology (PUCIT), University of the Punjab, Pakistan
| | - Naif Radi Aljohani
- Faculty of Computing and Information Technology, King Abdulaziz University, Kingdom of Saudi Arabia
| | - Raheel Nawaz
- School of Computer Science, Manchester Metropolitan University, UK
| |
Collapse
|
22
|
Hassan SU, Iqbal S, Aljohani NR, Alelyani S, Zuccala A. Introducing the ‘alt-index’ for measuring the social visibility of scientific research. Scientometrics 2020. [DOI: 10.1007/s11192-020-03447-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
23
|
Haneef F, Abbasi RA, Sindhu MA, Khattak AS, Noor MN, Aljohani NR, Daud A, Arafat S. Using network science to understand the link between subjects and professions. COMPUTERS IN HUMAN BEHAVIOR 2020. [DOI: 10.1016/j.chb.2019.106228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
24
|
Aljohani NR, Fayoumi A, Hassan SU. Bot prediction on social networks of Twitter in altmetrics using deep graph convolutional networks. Soft comput 2020. [DOI: 10.1007/s00500-020-04689-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
25
|
Wang X, Ran Y, Jia T. Measuring similarity in co-occurrence data using ego-networks. CHAOS (WOODBURY, N.Y.) 2020; 30:013101. [PMID: 32013468 DOI: 10.1063/1.5129036] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 12/14/2019] [Indexed: 06/10/2023]
Abstract
The co-occurrence association is widely observed in many empirical data. Mining the information in co-occurrence data is essential for advancing our understanding of systems such as social networks, ecosystems, and brain networks. Measuring similarity of entities is one of the important tasks, which can usually be achieved using a network-based approach. Here, we show that traditional methods based on the aggregated network can bring unwanted indirect relationships. To cope with this issue, we propose a similarity measure based on the ego network of each entity, which effectively considers the change of an entity's centrality from one ego network to another. The index proposed is easy to calculate and has a clear physical meaning. Using two different data sets, we compare the new index with other existing ones. We find that the new index outperforms the traditional network-based similarity measures, and it can sometimes surpass the embedding method. In the meanwhile, the measure by the new index is weakly correlated with those by other methods, hence providing a different dimension to quantify similarities in co-occurrence data. Altogether, our work makes an extension in the network-based similarity measure and can be potentially applied in several related tasks.
Collapse
Affiliation(s)
- Xiaomeng Wang
- College of Computer and Information Science, Southwest University, Beibei, Chongqing 400715, People's Republic of China
| | - Yijun Ran
- College of Computer and Information Science, Southwest University, Beibei, Chongqing 400715, People's Republic of China
| | - Tao Jia
- College of Computer and Information Science, Southwest University, Beibei, Chongqing 400715, People's Republic of China
| |
Collapse
|