1
|
Yan HY, Yang KC, Shanahan J, Menczer F. Exposure to social bots amplifies perceptual biases and regulation propensity. Sci Rep 2023; 13:20707. [PMID: 38001150 PMCID: PMC10673860 DOI: 10.1038/s41598-023-46630-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 11/03/2023] [Indexed: 11/26/2023] Open
Abstract
Automated accounts on social media that impersonate real users, often called "social bots," have received a great deal of attention from academia and the public. Here we present experiments designed to investigate public perceptions and policy preferences about social bots, in particular how they are affected by exposure to bots. We find that before exposure, participants have some biases: they tend to overestimate the prevalence of bots and see others as more vulnerable to bot influence than themselves. These biases are amplified after bot exposure. Furthermore, exposure tends to impair judgment of bot-recognition self-efficacy and increase propensity toward stricter bot-regulation policies among participants. Decreased self-efficacy and increased perceptions of bot influence on others are significantly associated with these policy preference changes. We discuss the relationship between perceptions about social bots and growing dissatisfaction with the polluted social media environment.
Collapse
Affiliation(s)
- Harry Yaojun Yan
- The Media School, Indiana University, Bloomington, IN, 47405, USA.
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, 47408, USA.
- Observatory on Social Media, Indiana University, Bloomington, IN, 47408, USA.
| | - Kai-Cheng Yang
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, 47408, USA
- Observatory on Social Media, Indiana University, Bloomington, IN, 47408, USA
| | - James Shanahan
- The Media School, Indiana University, Bloomington, IN, 47405, USA
- Observatory on Social Media, Indiana University, Bloomington, IN, 47408, USA
| | - Filippo Menczer
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, 47408, USA
- Observatory on Social Media, Indiana University, Bloomington, IN, 47408, USA
| |
Collapse
|
2
|
Pierri F, DeVerna MR, Yang KC, Axelrod D, Bryden J, Menczer F. One Year of COVID-19 Vaccine Misinformation on Twitter: Longitudinal Study. J Med Internet Res 2023; 25:e42227. [PMID: 36735835 PMCID: PMC9970010 DOI: 10.2196/42227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Revised: 01/18/2023] [Accepted: 01/30/2023] [Indexed: 02/05/2023] Open
Abstract
BACKGROUND Vaccinations play a critical role in mitigating the impact of COVID-19 and other diseases. Past research has linked misinformation to increased hesitancy and lower vaccination rates. Gaps remain in our knowledge about the main drivers of vaccine misinformation on social media and effective ways to intervene. OBJECTIVE Our longitudinal study had two primary objectives: (1) to investigate the patterns of prevalence and contagion of COVID-19 vaccine misinformation on Twitter in 2021, and (2) to identify the main spreaders of vaccine misinformation. Given our initial results, we further considered the likely drivers of misinformation and its spread, providing insights for potential interventions. METHODS We collected almost 300 million English-language tweets related to COVID-19 vaccines using a list of over 80 relevant keywords over a period of 12 months. We then extracted and labeled news articles at the source level based on third-party lists of low-credibility and mainstream news sources, and measured the prevalence of different kinds of information. We also considered suspicious YouTube videos shared on Twitter. We focused our analysis of vaccine misinformation spreaders on verified and automated Twitter accounts. RESULTS Our findings showed a relatively low prevalence of low-credibility information compared to the entirety of mainstream news. However, the most popular low-credibility sources had reshare volumes comparable to those of many mainstream sources, and had larger volumes than those of authoritative sources such as the US Centers for Disease Control and Prevention and the World Health Organization. Throughout the year, we observed an increasing trend in the prevalence of low-credibility news about vaccines. We also observed a considerable amount of suspicious YouTube videos shared on Twitter. Tweets by a small group of approximately 800 "superspreaders" verified by Twitter accounted for approximately 35% of all reshares of misinformation on an average day, with the top superspreader (@RobertKennedyJr) responsible for over 13% of retweets. Finally, low-credibility news and suspicious YouTube videos were more likely to be shared by automated accounts. CONCLUSIONS The wide spread of misinformation around COVID-19 vaccines on Twitter during 2021 shows that there was an audience for this type of content. Our findings are also consistent with the hypothesis that superspreaders are driven by financial incentives that allow them to profit from health misinformation. Despite high-profile cases of deplatformed misinformation superspreaders, our results show that in 2021, a few individuals still played an outsized role in the spread of low-credibility vaccine content. As a result, social media moderation efforts would be better served by focusing on reducing the online visibility of repeat spreaders of harmful content, especially during public health crises.
Collapse
Affiliation(s)
- Francesco Pierri
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Milano, Italy
- Observatory on Social Media, Indiana University, Bloomington, IN, United States
| | - Matthew R DeVerna
- Observatory on Social Media, Indiana University, Bloomington, IN, United States
| | - Kai-Cheng Yang
- Observatory on Social Media, Indiana University, Bloomington, IN, United States
| | - David Axelrod
- Observatory on Social Media, Indiana University, Bloomington, IN, United States
| | - John Bryden
- Observatory on Social Media, Indiana University, Bloomington, IN, United States
| | - Filippo Menczer
- Observatory on Social Media, Indiana University, Bloomington, IN, United States
| |
Collapse
|
3
|
Wilhelm E, Ballalai I, Belanger ME, Benjamin P, Bertrand-Ferrandis C, Bezbaruah S, Briand S, Brooks I, Bruns R, Bucci LM, Calleja N, Chiou H, Devaria A, Dini L, D'Souza H, Dunn AG, Eichstaedt JC, Evers SMAA, Gobat N, Gissler M, Gonzales IC, Gruzd A, Hess S, Ishizumi A, John O, Joshi A, Kaluza B, Khamis N, Kosinska M, Kulkarni S, Lingri D, Ludolph R, Mackey T, Mandić-Rajčević S, Menczer F, Mudaliar V, Murthy S, Nazakat S, Nguyen T, Nilsen J, Pallari E, Pasternak Taschner N, Petelos E, Prinstein MJ, Roozenbeek J, Schneider A, Srinivasan V, Stevanović A, Strahwald B, Syed Abdul S, Varaidzo Machiri S, van der Linden S, Voegeli C, Wardle C, Wegwarth O, White BK, Willie E, Yau B, Purnat TD. Measuring the Burden of Infodemics: Summary of the Methods and Results of the Fifth WHO Infodemic Management Conference. JMIR Infodemiology 2023; 3:e44207. [PMID: 37012998 PMCID: PMC9989916 DOI: 10.2196/44207] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 01/09/2023] [Accepted: 01/26/2023] [Indexed: 01/27/2023]
Abstract
Background An infodemic is excess information, including false or misleading information, that spreads in digital and physical environments during a public health emergency. The COVID-19 pandemic has been accompanied by an unprecedented global infodemic that has led to confusion about the benefits of medical and public health interventions, with substantial impact on risk-taking and health-seeking behaviors, eroding trust in health authorities and compromising the effectiveness of public health responses and policies. Standardized measures are needed to quantify the harmful impacts of the infodemic in a systematic and methodologically robust manner, as well as harmonizing highly divergent approaches currently explored for this purpose. This can serve as a foundation for a systematic, evidence-based approach to monitoring, identifying, and mitigating future infodemic harms in emergency preparedness and prevention. Objective In this paper, we summarize the Fifth World Health Organization (WHO) Infodemic Management Conference structure, proceedings, outcomes, and proposed actions seeking to identify the interdisciplinary approaches and frameworks needed to enable the measurement of the burden of infodemics. Methods An iterative human-centered design (HCD) approach and concept mapping were used to facilitate focused discussions and allow for the generation of actionable outcomes and recommendations. The discussions included 86 participants representing diverse scientific disciplines and health authorities from 28 countries across all WHO regions, along with observers from civil society and global public health-implementing partners. A thematic map capturing the concepts matching the key contributing factors to the public health burden of infodemics was used throughout the conference to frame and contextualize discussions. Five key areas for immediate action were identified. Results The 5 key areas for the development of metrics to assess the burden of infodemics and associated interventions included (1) developing standardized definitions and ensuring the adoption thereof; (2) improving the map of concepts influencing the burden of infodemics; (3) conducting a review of evidence, tools, and data sources; (4) setting up a technical working group; and (5) addressing immediate priorities for postpandemic recovery and resilience building. The summary report consolidated group input toward a common vocabulary with standardized terms, concepts, study designs, measures, and tools to estimate the burden of infodemics and the effectiveness of infodemic management interventions. Conclusions Standardizing measurement is the basis for documenting the burden of infodemics on health systems and population health during emergencies. Investment is needed into the development of practical, affordable, evidence-based, and systematic methods that are legally and ethically balanced for monitoring infodemics; generating diagnostics, infodemic insights, and recommendations; and developing interventions, action-oriented guidance, policies, support options, mechanisms, and tools for infodemic managers and emergency program managers.
Collapse
Affiliation(s)
- Elisabeth Wilhelm
- US Centers for Disease Control and Prevention Atlanta, GA United States
| | | | - Marie-Eve Belanger
- Department of Political Science and International Relations Université de Genève Geneva Switzerland
| | | | | | - Supriya Bezbaruah
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Sylvie Briand
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Ian Brooks
- Center for Health Informatics School of Information Sciences University of Illinois Champaign, IL United States
| | - Richard Bruns
- Johns Hopkins Center for Health Security Baltimore, MD United States
| | - Lucie M Bucci
- Immunize Canada Canadian Public Health Association Ottawa, ON Canada
| | - Neville Calleja
- Directorate for Health Information and Research Ministry for Health Valletta Malta
| | - Howard Chiou
- US Centers for Disease Control and Prevention Atlanta, GA United States
- US Public Health Service Commissioned Corps Rockville, MD United States
| | | | - Lorena Dini
- Working Group Health Policy and Systems Research and Innovation Institute for General Practice Charité Universitätsmedizin Berlin Berlin Germany
| | - Hyjel D'Souza
- The George Institute for Global Health New Delhi India
| | - Adam G Dunn
- Biomedical Informatics and Digital Health Faculty of Medicine and Health University of Sydney Sydney Australia
| | - Johannes C Eichstaedt
- Department of Psychology Stanford University Stanford, CA United States
- Institute for Human-Centered AI Stanford University Stanford, CA United States
| | - Silvia M A A Evers
- Department of Health Services Research Maastricht University Maastricht Netherlands
| | - Nina Gobat
- Department of Country Readiness Strengthening World Health Organization Geneva Switzerland
| | - Mika Gissler
- Department of Knowledge Brokers THL Finnish Institute for Health and Welfare Helsinki Finland
| | - Ian Christian Gonzales
- Field Epidemiology Training Program Epidemiology Bureau Department of Health Manila Philippines
| | - Anatoliy Gruzd
- Ted Rogers School of Management Toronto Metropolitan University Toronto, ON Canada
| | - Sarah Hess
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Atsuyoshi Ishizumi
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Oommen John
- The George Institute for Global Health New Delhi India
| | - Ashish Joshi
- Department of Epidemiology and Biostatistics Graduate School of Public Health and Health Policy City University of New York New York, NY United States
| | - Benjamin Kaluza
- Department Technological Analysis and Strategic Planning Fraunhofer Institute for Technological Trend Analysis INT Euskirchen Germany
| | - Nagwa Khamis
- Infection Prevention and Control Department Children's Cancer Hospital Egypt-57357 Ain Shams University Specialized Hospital Cairo Egypt
| | - Monika Kosinska
- Department of Social Determinants World Health Organization Geneva Switzerland
| | - Shibani Kulkarni
- US Centers for Disease Control and Prevention Atlanta, GA United States
| | - Dimitra Lingri
- European Healthcare Fraud and Corruption Network Aristotle Universtity of Thessaloniki Brussels Belgium
| | - Ramona Ludolph
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Tim Mackey
- Global Health Program Department of Anthropology University of California San Diego, CA United States
| | | | - Filippo Menczer
- Observatory on Social Media Luddy School of Informatics, Computing, and Engineering Indiana University Bloomington, IN United States
| | | | - Shruti Murthy
- The George Institute for Global Health New Delhi India
| | - Syed Nazakat
- DataLEADS (Health Analytics Asia) New Delhi India
| | - Tim Nguyen
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Jennifer Nilsen
- Technology and Social Change Project Harvard University Cambridge, MA United States
| | - Elena Pallari
- Health Innovation Network Guy's and St Thomas' Hospital London United Kingdom
| | - Natalia Pasternak Taschner
- Center of Science and Society Columbia University New York, NY United States
- Instituto Questão de Ciência São Paulo Brazil
| | - Elena Petelos
- Department of Health Services Research Care and Public Health Research Institute Maastricht University Maastricht Netherlands
- Clinic of Social and Family Medicine Faculty of Medicine University of Crete Heraklion Greece
| | - Mitchell J Prinstein
- American Psychological Association Washington DC, DC United States
- Department of Psychology and Neuroscience University of North Carolina at Chapel Hill Chapel Hill, NC United States
| | - Jon Roozenbeek
- Department of Psychology University of Cambridge Cambridge United Kingdom
| | - Anton Schneider
- Bureau for Global Health Office of Infectious Disease United States Agency for International Development Washington DC, DC United States
| | | | - Aleksandar Stevanović
- Institute of Social Medicine Faculty of Medicine University of Belgrade Belgrade Serbia
| | - Brigitte Strahwald
- Pettenkofer School of Public Health Ludwig-Maximilians-Universität München Munich Germany
| | - Shabbir Syed Abdul
- The George Institute for Global Health New Delhi India
- Graduate Institute of Biomedical Informatics Taipei Medical University Taipei Taiwan
| | | | | | - Christopher Voegeli
- Office of the Director National Center for Immunization and Respiratory Diseases US Centers for Disease Control and Prevention Atlanta, GA United States
| | - Claire Wardle
- Information Futures Lab School of Public Health Brown University Providence, RI United States
| | - Odette Wegwarth
- Heisenberg Chair for Medical Risk Literacy & Evidence-Based Decisions Charite - Universitätsmedizin Berlin Berlin Germany
| | - Becky K White
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Estelle Willie
- Communications, Policy, Advocacy The Rockefeller Foundation New York, NY United States
| | - Brian Yau
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| | - Tina D Purnat
- Department of Epidemic and Pandemic Preparedness and Prevention World Health Organization Geneva Switzerland
| |
Collapse
|
4
|
Yang KC, Ferrara E, Menczer F. Botometer 101: social bot practicum for computational social scientists. J Comput Soc Sci 2022; 5:1511-1528. [PMID: 36035522 PMCID: PMC9391657 DOI: 10.1007/s42001-022-00177-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 08/01/2022] [Indexed: 05/16/2023]
Abstract
Social bots have become an important component of online social media. Deceptive bots, in particular, can manipulate online discussions of important issues ranging from elections to public health, threatening the constructive exchange of information. Their ubiquity makes them an interesting research subject and requires researchers to properly handle them when conducting studies using social media data. Therefore, it is important for researchers to gain access to bot detection tools that are reliable and easy to use. This paper aims to provide an introductory tutorial of Botometer, a public tool for bot detection on Twitter, for readers who are new to this topic and may not be familiar with programming and machine learning. We introduce how Botometer works, the different ways users can access it, and present a case study as a demonstration. Readers can use the case study code as a template for their own research. We also discuss recommended practice for using Botometer.
Collapse
Affiliation(s)
- Kai-Cheng Yang
- Observatory on Social Media, Indiana University Bloomington, Bloomington, IN 47408 USA
| | - Emilio Ferrara
- Information Sciences Institute, University of Southern California, Marina del Rey, CA 90292 USA
| | - Filippo Menczer
- Observatory on Social Media, Indiana University Bloomington, Bloomington, IN 47408 USA
| |
Collapse
|
5
|
Yang KC, Hui PM, Menczer F. How Twitter data sampling biases U.S. voter behavior characterizations. PeerJ Comput Sci 2022; 8:e1025. [PMID: 35875635 PMCID: PMC9299280 DOI: 10.7717/peerj-cs.1025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Online social media are key platforms for the public to discuss political issues. As a result, researchers have used data from these platforms to analyze public opinions and forecast election results. The literature has shown that due to inauthentic actors such as malicious social bots and trolls, not every message is a genuine expression from a legitimate user. However, the prevalence of inauthentic activities in social data streams is still unclear, making it difficult to gauge biases of analyses based on such data. In this article, we aim to close this gap using Twitter data from the 2018 U.S. midterm elections. We propose an efficient and low-cost method to identify voters on Twitter and systematically compare their behaviors with different random samples of accounts. We find that some accounts flood the public data stream with political content, drowning the voice of the majority of voters. As a result, these hyperactive accounts are over-represented in volume samples. Hyperactive accounts are more likely to exhibit various suspicious behaviors and to share low-credibility information compared to likely voters. Our work provides insights into biased voter characterizations when using social media data to analyze political issues.
Collapse
|
6
|
Bhadani S, Yamaya S, Flammini A, Menczer F, Ciampaglia GL, Nyhan B. Political audience diversity and news reliability in algorithmic ranking. Nat Hum Behav 2022; 6:495-505. [PMID: 35115677 DOI: 10.1038/s41562-021-01276-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 12/09/2021] [Indexed: 11/09/2022]
Abstract
Newsfeed algorithms frequently amplify misinformation and other low-quality content. How can social media platforms more effectively promote reliable information? Existing approaches are difficult to scale and vulnerable to manipulation. In this paper, we propose using the political diversity of a website's audience as a quality signal. Using news source reliability ratings from domain experts and web browsing data from a diverse sample of 6,890 US residents, we first show that websites with more extreme and less politically diverse audiences have lower journalistic standards. We then incorporate audience diversity into a standard collaborative filtering framework and show that our improved algorithm increases the trustworthiness of websites suggested to users-especially those who most frequently consume misinformation-while keeping recommendations relevant. These findings suggest that partisan audience diversity is a valuable signal of higher journalistic standards that should be incorporated into algorithmic ranking decisions.
Collapse
Affiliation(s)
- Saumya Bhadani
- Department of Computer Science and Engineering, University of South Florida, Tampa, FL, USA
| | - Shun Yamaya
- Department of Political Science, Stanford University, Stanford, CA, USA
| | | | - Filippo Menczer
- Observatory on Social Media, Indiana University, Bloomington, IN, USA
| | | | - Brendan Nyhan
- Department of Government, Dartmouth College, Hanover, NH, USA
| |
Collapse
|
7
|
Chen W, Pacheco D, Yang KC, Menczer F. Publisher Correction: Neutral bots probe political bias on social media. Nat Commun 2022; 13:264. [PMID: 34987155 PMCID: PMC8733012 DOI: 10.1038/s41467-021-27855-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- Wen Chen
- Observatory on Social Media, Indiana University, Bloomington, IN, USA
| | - Diogo Pacheco
- Observatory on Social Media, Indiana University, Bloomington, IN, USA.,Department of Computer Science, University of Exeter, Exeter, UK
| | - Kai-Cheng Yang
- Observatory on Social Media, Indiana University, Bloomington, IN, USA
| | - Filippo Menczer
- Observatory on Social Media, Indiana University, Bloomington, IN, USA.
| |
Collapse
|
8
|
Silva FN, Tandon A, Amancio DR, Flammini A, Menczer F, Milojević S, Fortunato S. Recency predicts bursts in the evolution of author citations. Quantitative Science Studies 2020. [DOI: 10.1162/qss_a_00070] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
The citations process for scientific papers has been studied extensively. But while the citations accrued by authors are the sum of the citations of their papers, translating the dynamics of citation accumulation from the paper to the author level is not trivial. Here we conduct a systematic study of the evolution of author citations, and in particular their bursty dynamics. We find empirical evidence of a correlation between the number of citations most recently accrued by an author and the number of citations they receive in the future. Using a simple model where the probability for an author to receive new citations depends only on the number of citations collected in the previous 12–24 months, we are able to reproduce both the citation and burst size distributions of authors across multiple decades.
Collapse
Affiliation(s)
| | - Aditya Tandon
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, USA
| | - Diego Raphael Amancio
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, Brazil
| | - Alessandro Flammini
- Indiana University Network Science Institute, Indiana University, Bloomington, USA
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, USA
| | - Filippo Menczer
- Indiana University Network Science Institute, Indiana University, Bloomington, USA
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, USA
| | - Staša Milojević
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, USA
| | - Santo Fortunato
- Indiana University Network Science Institute, Indiana University, Bloomington, USA
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, USA
| |
Collapse
|
9
|
Yang K, Varol O, Davis CA, Ferrara E, Flammini A, Menczer F. Arming the public with artificial intelligence to counter social bots. Human Behav and Emerg Tech 2019. [DOI: 10.1002/hbe2.115] [Citation(s) in RCA: 167] [Impact Index Per Article: 33.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Kai‐Cheng Yang
- Center for Complex Networks and Systems Research Indiana University Bloomington Indiana
| | - Onur Varol
- Center for Complex Networks Research Northeastern University Boston Massachusetts
| | - Clayton A. Davis
- Center for Complex Networks and Systems Research Indiana University Bloomington Indiana
| | - Emilio Ferrara
- Information Sciences Institute University of Southern California Los Angeles California
| | - Alessandro Flammini
- Center for Complex Networks and Systems Research Indiana University Bloomington Indiana
- Indiana University Network Science Institute Bloomington Indiana
| | - Filippo Menczer
- Center for Complex Networks and Systems Research Indiana University Bloomington Indiana
- Indiana University Network Science Institute Bloomington Indiana
| |
Collapse
|
10
|
Qiu X, Oliveira DFM, Shirazi AS, Flammini A, Menczer F. Retraction Note: Limited individual attention and online virality of low-quality information. Nat Hum Behav 2019. [DOI: 10.1038/s41562-018-0507-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
11
|
Affiliation(s)
- Dimitar Nikolov
- Center for Complex Networks and Systems ResearchIndiana University 919 E 10th Street Bloomington IN 47408
| | | | - Alessandro Flammini
- Center for Complex Networks and Systems ResearchIndiana University 919 E 10th Street Bloomington IN 47408
| | - Filippo Menczer
- Center for Complex Networks and Systems ResearchIndiana University 919 E 10th Street Bloomington IN 47408
| |
Collapse
|
12
|
Ciampaglia GL, Nematzadeh A, Menczer F, Flammini A. How algorithmic popularity bias hinders or promotes quality. Sci Rep 2018; 8:15951. [PMID: 30374134 PMCID: PMC6206065 DOI: 10.1038/s41598-018-34203-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Accepted: 10/01/2018] [Indexed: 11/19/2022] Open
Abstract
Algorithms that favor popular items are used to help us select among many choices, from top-ranked search engine results to highly-cited scientific papers. The goal of these algorithms is to identify high-quality items such as reliable news, credible information sources, and important discoveries–in short, high-quality content should rank at the top. Prior work has shown that choosing what is popular may amplify random fluctuations and lead to sub-optimal rankings. Nonetheless, it is often assumed that recommending what is popular will help high-quality content “bubble up” in practice. Here we identify the conditions in which popularity may be a viable proxy for quality content by studying a simple model of a cultural market endowed with an intrinsic notion of quality. A parameter representing the cognitive cost of exploration controls the trade-off between quality and popularity. Below and above a critical exploration cost, popularity bias is more likely to hinder quality. But we find a narrow intermediate regime of user attention where an optimal balance exists: choosing what is popular can help promote high-quality items to the top. These findings clarify the effects of algorithmic popularity bias on quality outcomes, and may inform the design of more principled mechanisms for techno-social cultural markets.
Collapse
Affiliation(s)
| | - Azadeh Nematzadeh
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana, USA
| | - Filippo Menczer
- Indiana University Network Science Institute, Bloomington, Indiana, USA.,School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana, USA
| | - Alessandro Flammini
- Indiana University Network Science Institute, Bloomington, Indiana, USA.,School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana, USA
| |
Collapse
|
13
|
Abstract
Massive amounts of fake news and conspiratorial content have spread over social media before and after the 2016 US Presidential Elections despite intense fact-checking efforts. How do the spread of misinformation and fact-checking compete? What are the structural and dynamic characteristics of the core of the misinformation diffusion network, and who are its main purveyors? How to reduce the overall amount of misinformation? To explore these questions we built Hoaxy, an open platform that enables large-scale, systematic studies of how misinformation and fact-checking spread and compete on Twitter. Hoaxy captures public tweets that include links to articles from low-credibility and fact-checking sources. We perform k-core decomposition on a diffusion network obtained from two million retweets produced by several hundred thousand accounts over the six months before the election. As we move from the periphery to the core of the network, fact-checking nearly disappears, while social bots proliferate. The number of users in the main core reaches equilibrium around the time of the election, with limited churn and increasingly dense connections. We conclude by quantifying how effectively the network can be disrupted by penalizing the most central nodes. These findings provide a first look at the anatomy of a massive online misinformation diffusion network.
Collapse
Affiliation(s)
- Chengcheng Shao
- College of Computer, National University of Defense Technology, Changsha, Hunan, China
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, United States of America
- * E-mail:
| | - Pik-Mai Hui
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, United States of America
| | - Lei Wang
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, United States of America
| | - Xinwen Jiang
- The MOE Key Laboratory of Intelligent Computing and Information Processing, Xiangtan University, Xiangtan, Hunan, China
| | - Alessandro Flammini
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, United States of America
| | - Filippo Menczer
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, United States of America
| | | |
Collapse
|
14
|
Abstract
The deluge of online and offline misinformation is overloading the exchange of ideas upon which democracies depend. Fake news, conspiracy theories, and deceptive social bots proliferate, facilitating the manipulation of public opinion. Countering misinformation while protecting freedom of speech will require collaboration across industry, journalism, and academia. The Workshop on Digital Misinformation — held in May 2017 in conjunction with the International Conference on Web and Social Media in Montréal, Québec, Canada — was intended to foster these efforts. The meeting brought together more than 100 stakeholders from academia, media, and tech companies to discuss the research challenges implicit in building a trustworthy Web. Below we outline the main findings from the discussion.
Collapse
|
15
|
Lazer DMJ, Baum MA, Benkler Y, Berinsky AJ, Greenhill KM, Menczer F, Metzger MJ, Nyhan B, Pennycook G, Rothschild D, Schudson M, Sloman SA, Sunstein CR, Thorson EA, Watts DJ, Zittrain JL. The science of fake news. Science 2018; 359:1094-1096. [DOI: 10.1126/science.aao2998] [Citation(s) in RCA: 1359] [Impact Index Per Article: 226.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
16
|
An J, Ciampaglia GL, Grinberg N, Joseph K, Mantzarlis A, Maus G, Menczer F, Proferes N, Welles BF. Reports of the Workshops Held at the 2017 International AAAI Conference on Web and Social Media. AI MAG 2017. [DOI: 10.1609/aimag.v38i4.2772] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
The Workshop Program of the Association for the Advancement of Artificial Intelligence’s International Conference on Web and Social Media (AAAI-17) was held in Montréal, Québec, Canada on Tuesday, May 15, 2017. There were eight workshops in the program: Digital Misinformation, Events Analytics Using Social Media Data, News and Public Opinion, Observational Studies through Social Media, Perceptual Biases and Social Media, Social Media and Demographic Research, Studying User Perceptions and Experiences with Algorithms, The ICWSM Science Slam. Workshops were held on the first day of the conference. Workshop participants met and discussed issues with a selected focus — providing an informal setting for active exchange among researchers, developers, and users on topics of current interest. Organizers from two of the workshop chose to include papers in the AAAI Technical Reports series (Observational Studies through Social Media and News and Public Opinion). Their papers were included as a nonarchival part of the ICWSM proceedings. Organizers from four of the workshops (Digital Misinformation, News and Public Opinion, Perceptual Biases and Social Media, and Studying User Perceptions and Experiences with Algorithms) submitted reports, which are reproduced in this report. Brief summaries of the other four workshops have been reproduced from their website descriptions.
Collapse
|
17
|
Qiu X, F. M. Oliveira D, Sahami Shirazi A, Flammini A, Menczer F. Limited individual attention and online virality of low-quality information. Nat Hum Behav 2017. [DOI: 10.1038/s41562-017-0132] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
18
|
Suárez-Serrato P, Roberts ME, Davis C, Menczer F. On the Influence of Social Bots in Online Protests. Lecture Notes in Computer Science 2016. [DOI: 10.1007/978-3-319-47874-6_19] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
19
|
|
20
|
|
21
|
Ciampaglia GL, Shiralkar P, Rocha LM, Bollen J, Menczer F, Flammini A. Computational Fact Checking from Knowledge Networks. PLoS One 2015; 10:e0128193. [PMID: 26083336 PMCID: PMC4471100 DOI: 10.1371/journal.pone.0128193] [Citation(s) in RCA: 148] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 04/24/2015] [Indexed: 11/19/2022] Open
Abstract
Traditional fact checking by expert journalists cannot keep up with the enormous volume of information that is now generated online. Computational fact checking may significantly enhance our ability to evaluate the veracity of dubious information. Here we show that the complexities of human fact checking can be approximated quite well by finding the shortest path between concept nodes under properly defined semantic proximity metrics on knowledge graphs. Framed as a network problem this approach is feasible with efficient computational techniques. We evaluate this approach by examining tens of thousands of claims related to history, entertainment, geography, and biographical information using a public knowledge graph extracted from Wikipedia. Statements independently known to be true consistently receive higher support via our method than do false ones. These findings represent a significant step toward scalable computational fact-checking methods that may one day mitigate the spread of harmful misinformation.
Collapse
Affiliation(s)
- Giovanni Luca Ciampaglia
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, Indiana, United States of America
- * E-mail:
| | - Prashant Shiralkar
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, Indiana, United States of America
| | - Luis M. Rocha
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, Indiana, United States of America
- Instituto Gulbenkian de Ciencia, Oeiras, Portugal
| | - Johan Bollen
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, Indiana, United States of America
| | - Filippo Menczer
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, Indiana, United States of America
| | - Alessandro Flammini
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, Indiana, United States of America
| |
Collapse
|
22
|
Abstract
Online traces of human activity offer novel opportunities to study the dynamics of complex knowledge exchange networks, in particular how emergent patterns of collective attention determine what new information is generated and consumed. Can we measure the relationship between demand and supply for new information about a topic? We propose a normalization method to compare attention bursts statistics across topics with heterogeneous distribution of attention. Through analysis of a massive dataset on traffic to Wikipedia, we find that the production of new knowledge is associated to significant shifts of collective attention, which we take as proxy for its demand. This is consistent with a scenario in which allocation of attention toward a topic stimulates the demand for information about it, and in turn the supply of further novel information. However, attention spikes only for a limited time span, during which new content has higher chances of receiving traffic, compared to content created later or earlier on. Our attempt to quantify demand and supply of information, and our finding about their temporal ordering, may lead to the development of the fundamental laws of the attention economy, and to a better understanding of social exchange of knowledge information networks.
Collapse
Affiliation(s)
- Giovanni Luca Ciampaglia
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA
| | - Alessandro Flammini
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA
| | - Filippo Menczer
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA
| |
Collapse
|
23
|
Abstract
We have a limited understanding of the factors that make people influential and topics popular in social media. Are users who comment on a variety of matters more likely to achieve high influence than those who stay focused? Do general subjects tend to be more popular than specific ones? Questions like these demand a way to detect the topics hidden behind messages associated with an individual or a keyword, and a gauge of similarity among these topics. Here we develop such an approach to identify clusters of similar hashtags in Twitter by detecting communities in the hashtag co-occurrence network. Then the topical diversity of a user’s interests is quantified by the entropy of her hashtags across different topic clusters. A similar measure is applied to hashtags, based on co-occurring tags. We find that high topical diversity of early adopters or co-occurring tags implies high future popularity of hashtags. In contrast, low diversity helps an individual accumulate social influence. In short, diverse messages and focused messengers are more likely to gain impact.
Collapse
Affiliation(s)
- Lilian Weng
- Center for Complex Networks and Systems Research School of Informatics and Computing, Indiana University Bloomington, Bloomington, USA
- * E-mail:
| | - Filippo Menczer
- Center for Complex Networks and Systems Research School of Informatics and Computing, Indiana University Bloomington, Bloomington, USA
| |
Collapse
|
24
|
|
25
|
Abstract
How does network structure affect diffusion? Recent studies suggest that the answer depends on the type of contagion. Complex contagions, unlike infectious diseases (simple contagions), are affected by social reinforcement and homophily. Hence, the spread within highly clustered communities is enhanced, while diffusion across communities is hampered. A common hypothesis is that memes and behaviors are complex contagions. We show that, while most memes indeed spread like complex contagions, a few viral memes spread across many communities, like diseases. We demonstrate that the future popularity of a meme can be predicted by quantifying its early spreading pattern in terms of community concentration. The more communities a meme permeates, the more viral it is. We present a practical method to translate data about community structure into predictive knowledge about what information will spread widely. This connection contributes to our understanding in computational social science, social media analytics, and marketing applications.
Collapse
Affiliation(s)
- Lilian Weng
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA
| | | | | |
Collapse
|
26
|
|
27
|
|
28
|
Abstract
We examine the temporal evolution of digital communication activity relating to the American anti-capitalist movement Occupy Wall Street. Using a high-volume sample from the microblogging site Twitter, we investigate changes in Occupy participant engagement, interests, and social connectivity over a fifteen month period starting three months prior to the movement's first protest action. The results of this analysis indicate that, on Twitter, the Occupy movement tended to elicit participation from a set of highly interconnected users with pre-existing interests in domestic politics and foreign social movements. These users, while highly vocal in the months immediately following the birth of the movement, appear to have lost interest in Occupy related communication over the remainder of the study period.
Collapse
Affiliation(s)
- Michael D Conover
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, Indiana, USA.
| | | | | | | |
Collapse
|
29
|
Conover MD, Davis C, Ferrara E, McKelvey K, Menczer F, Flammini A. The geospatial characteristics of a social movement communication network. PLoS One 2013; 8:e55957. [PMID: 23483885 PMCID: PMC3590214 DOI: 10.1371/journal.pone.0055957] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Accepted: 01/04/2013] [Indexed: 12/02/2022] Open
Abstract
Social movements rely in large measure on networked communication technologies to organize and disseminate information relating to the movements' objectives. In this work we seek to understand how the goals and needs of a protest movement are reflected in the geographic patterns of its communication network, and how these patterns differ from those of stable political communication. To this end, we examine an online communication network reconstructed from over 600,000 tweets from a thirty-six week period covering the birth and maturation of the American anticapitalist movement, Occupy Wall Street. We find that, compared to a network of stable domestic political communication, the Occupy Wall Street network exhibits higher levels of locality and a hub and spoke structure, in which the majority of non-local attention is allocated to high-profile locations such as New York, California, and Washington D.C. Moreover, we observe that information flows across state boundaries are more likely to contain framing language and references to the media, while communication among individuals in the same state is more likely to reference protest action and specific places and times. Tying these results to social movement theory, we propose that these features reflect the movement's efforts to mobilize resources at the local level and to develop narrative frames that reinforce collective purpose at the national level.
Collapse
Affiliation(s)
- Michael D Conover
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, IN, USA.
| | | | | | | | | | | |
Collapse
|
30
|
Abstract
The birth and decline of disciplines are critical to science and society. How do scientific disciplines emerge? No quantitative model to date allows us to validate competing theories on the different roles of endogenous processes, such as social collaborations, and exogenous events, such as scientific discoveries. Here we propose an agent-based model in which the evolution of disciplines is guided mainly by social interactions among agents representing scientists. Disciplines emerge from splitting and merging of social communities in a collaboration network. We find that this social model can account for a number of stylized facts about the relationships between disciplines, scholars, and publications. These results provide strong quantitative support for the key role of social interactions in shaping the dynamics of science. While several "science of science" theories exist, this is the first account for the emergence of disciplines that is validated on the basis of empirical data.
Collapse
Affiliation(s)
- Xiaoling Sun
- Department of Computer Science and Technology, Dalian University of Technology, China
| | | | | | | | | |
Collapse
|
31
|
Kaur J, Hoang DT, Sun X, Possamai L, Jafariasbagh M, Patil S, Menczer F. Scholarometer: a social framework for analyzing impact across disciplines. PLoS One 2012; 7:e43235. [PMID: 22984414 PMCID: PMC3440403 DOI: 10.1371/journal.pone.0043235] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2012] [Accepted: 07/18/2012] [Indexed: 11/25/2022] Open
Abstract
The use of quantitative metrics to gauge the impact of scholarly publications, authors, and disciplines is predicated on the availability of reliable usage and annotation data. Citation and download counts are widely available from digital libraries. However, current annotation systems rely on proprietary labels, refer to journals but not articles or authors, and are manually curated. To address these limitations, we propose a social framework based on crowdsourced annotations of scholars, designed to keep up with the rapidly evolving disciplinary and interdisciplinary landscape. We describe a system called Scholarometer, which provides a service to scholars by computing citation-based impact measures. This creates an incentive for users to provide disciplinary annotations of authors, which in turn can be used to compute disciplinary metrics. We first present the system architecture and several heuristics to deal with noisy bibliographic and annotation data. We report on data sharing and interactive visualization services enabled by Scholarometer. Usage statistics, illustrating the data collected and shared through the framework, suggest that the proposed crowdsourcing approach can be successful. Secondly, we illustrate how the disciplinary bibliometric indicators elicited by Scholarometer allow us to implement for the first time a universal impact measure proposed in the literature. Our evaluation suggests that this metric provides an effective means for comparing scholarly impact across disciplinary boundaries.
Collapse
Affiliation(s)
- Jasleen Kaur
- Center for Complex Networks and Systems Research, School of Informatics & Computing, Indiana University, Bloomington, United States of America.
| | | | | | | | | | | | | |
Collapse
|
32
|
Abstract
The wide adoption of social media has increased the competition among ideas for our finite attention. We employ a parsimonious agent-based model to study whether such a competition may affect the popularity of different memes, the diversity of information we are exposed to, and the fading of our collective interests for specific topics. Agents share messages on a social network but can only pay attention to a portion of the information they receive. In the emerging dynamics of information diffusion, a few memes go viral while most do not. The predictions of our model are consistent with empirical data from Twitter, a popular microblogging platform. Surprisingly, we can explain the massive heterogeneity in the popularity and persistence of memes as deriving from a combination of the competition for our limited attention and the structure of the social network, without the need to assume different intrinsic values among ideas.
Collapse
Affiliation(s)
- L. Weng
- Center for Complex Networks and Systems Research School of Informatics and Computing Indiana University, Bloomington, USA
| | - A. Flammini
- Center for Complex Networks and Systems Research School of Informatics and Computing Indiana University, Bloomington, USA
| | - A. Vespignani
- Department of Health Sciences, Department of Physics and College of Computer and Information Sciences, Northeastern University, USA
- Institute for Quantitative Social Sciences, Harvard University, USA
- Institute for Scientific Interchange (ISI), Torino, Italy
| | - F. Menczer
- Center for Complex Networks and Systems Research School of Informatics and Computing Indiana University, Bloomington, USA
| |
Collapse
|
33
|
Ratkiewicz J, Fortunato S, Flammini A, Menczer F, Vespignani A. Characterizing and modeling the dynamics of online popularity. Phys Rev Lett 2010; 105:158701. [PMID: 21230945 DOI: 10.1103/physrevlett.105.158701] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2010] [Revised: 07/18/2010] [Indexed: 05/30/2023]
Abstract
Online popularity has an enormous impact on opinions, culture, policy, and profits. We provide a quantitative, large scale, temporal analysis of the dynamics of online content popularity in two massive model systems: the Wikipedia and an entire country's Web space. We find that the dynamics of popularity are characterized by bursts, displaying characteristic features of critical systems such as fat-tailed distributions of magnitude and interevent time. We propose a minimal model combining the classic preferential popularity increase mechanism with the occurrence of random popularity shifts due to exogenous factors. The model recovers the critical features observed in the empirical analysis of the systems analyzed here, highlighting the key factors needed in the description of popularity dynamics.
Collapse
Affiliation(s)
- Jacob Ratkiewicz
- School of Informatics and Computing, Indiana University, Bloomington, Indiana 47406, USA
| | | | | | | | | |
Collapse
|
34
|
Abstract
Collaborative query routing is a new paradigm for Web search that treats both established search engines and other publicly available indices as intelligent peer agents in a search network. The approach makes it transparent for anyone to build their own (micro) search engine, by integrating established Web search services, desktop search, and topical crawling techniques. The challenge in this model is that each of these agents must learn about its environment— the existence, knowledge, diversity, reliability, and trustworthiness of other agents — by analyzing the queries received from and results exchanged with these other agents. We present the 6S peer network, which uses machine learning techniques to learn about the changing query environment. We show that simple reinforcement learning algorithms are sufficient to detect and exploit semantic locality in the network, resulting in efficient routing and high-quality search results. A prototype of 6S is available for public use and is intended to assist in the evaluation of different AI techniques employed by the networked agents.
Collapse
|
35
|
|
36
|
|
37
|
Abstract
Search engines have become key media for our scientific, economic, and social activities by enabling people to access information on the web despite its size and complexity. On the down side, search engines bias the traffic of users according to their page ranking strategies, and it has been argued that they create a vicious cycle that amplifies the dominance of established and already popular sites. This bias could lead to a dangerous monopoly of information. We show that, contrary to intuition, empirical data do not support this conclusion; popular sites receive far less traffic than predicted. We discuss a model that accurately predicts traffic data patterns by taking into consideration the topical interests of users and their searching behavior in addition to the way search engines rank pages. The heterogeneity of user interests explains the observed mitigation of search engines' popularity bias.
Collapse
Affiliation(s)
- S. Fortunato
- *School of Informatics, Indiana University, Bloomington, IN 47406
- Fakultät für Physik, Universität Bielefeld, D-33501 Bielefeld, Germany
- Complex Networks Lagrange Laboratory, Institute for Scientific Interchange, 10133 Torino, Italy
| | - A. Flammini
- *School of Informatics, Indiana University, Bloomington, IN 47406
| | - F. Menczer
- *School of Informatics, Indiana University, Bloomington, IN 47406
- Department of Computer Science, Indiana University, Bloomington, IN 47405; and
- To whom correspondence should be addressed. E-mail:
| | - A. Vespignani
- *School of Informatics, Indiana University, Bloomington, IN 47406
- Complex Networks Lagrange Laboratory, Institute for Scientific Interchange, 10133 Torino, Italy
| |
Collapse
|
38
|
Abstract
Network growth is currently explained through mechanisms that rely on node prestige measures, such as degree or fitness. In many real networks, those who create and connect nodes do not know the prestige values of existing nodes but only their ranking by prestige. We propose a criterion of network growth that explicitly relies on the ranking of the nodes according to any prestige measure, be it topological or not. The resulting network has a scale-free degree distribution when the probability to link a target node is any power-law function of its rank, even when one has only partial information of node ranks. Our criterion may explain the frequency and robustness of scale-free degree distributions in real networks, as illustrated by the special case of the Web graph.
Collapse
Affiliation(s)
- Santo Fortunato
- School of Informatics, Indiana University, Bloomington, Indiana 47406, USA
| | | | | |
Collapse
|
39
|
Abstract
How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.
Collapse
Affiliation(s)
- Filippo Menczer
- School of Informatics, Indiana University, Bloomington, IN 47408, USA.
| |
Collapse
|
40
|
|
41
|
Affiliation(s)
- YongSeog Kim
- Department of Business Information Systems, Utah State University, Logan, UT 84322-3515, USA
| | - W. Nick Street
- Department of Business Information Systems, Utah State University, Logan, UT 84322-3515, USA
| | - Filippo Menczer
- Department of Business Information Systems, Utah State University, Logan, UT 84322-3515, USA
| |
Collapse
|
42
|
Abstract
Can we model the scale-free distribution of Web hypertext degree under realistic assumptions about the behavior of page authors? Can a Web crawler efficiently locate an unknown relevant page? These questions are receiving much attention due to their potential impact for understanding the structure of the Web and for building better search engines. Here I investigate the connection between the linkage and content topology of Web pages. The relationship between a text-induced distance metric and a link-based neighborhood probability distribution displays a phase transition between a region where linkage is not determined by content and one where linkage decays according to a power law. This relationship is used to propose a Web growth model that is shown to accurately predict the distribution of Web page degree, based on textual content and assuming only local knowledge of degree for existing pages. A qualitatively similar phase transition is found between linkage and semantic distance, with an exponential decay tail. Both relationships suggest that efficient paths can be discovered by decentralized Web navigation algorithms based on textual and/or categorical cues.
Collapse
Affiliation(s)
- Filippo Menczer
- Department of Management Sciences, University of Iowa, Iowa City, IA 52242, USA.
| |
Collapse
|
43
|
Dagorn L, Menczer F, Bach P, Olson RJ. Co-evolution of movement behaviours by tropical pelagic predatory fishes in response to prey environment: a simulation model. Ecol Modell 2000. [DOI: 10.1016/s0304-3800(00)00374-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
44
|
|
45
|
Abstract
Local selection is a simple selection scheme in evolutionary computation. Individual fitnesses are accumulated over time and compared to a fixed threshold, rather than to each other, to decide who gets to reproduce. Local selection, coupled with fitness functions stemming from the consumption of finite shared environmental resources, maintains diversity in a way similar to fitness sharing. However, it is more efficient than fitness sharing and lends itself to parallel implementations for distributed tasks. While local selection is not prone to premature convergence, it applies minimal selection pressure to the population. Local selection is, therefore, particularly suited to Pareto optimization or problem classes where diverse solutions must be covered. This paper introduces ELSA, an evolutionary algorithm employing local selection and outlines three experiments in which ELSA is applied to multiobjective problems: a multimodal graph search problem, and two Pareto optimization problems. In all these experiments, ELSA significantly outperforms other well-known evolutionary algorithms. The paper also discusses scalability, parameter dependence, and the potential distributed applications of the algorithm.
Collapse
Affiliation(s)
- F Menczer
- Management Sciences Department, University of Iowa, Iowa City 52242, USA.
| | | | | |
Collapse
|