301
|
Abstract
Collaboration among researchers is an essential component of the modern scientific enterprise, playing a particularly important role in multidisciplinary research. However, we continue to wrestle with allocating credit to the coauthors of publications with multiple authors, because the relative contribution of each author is difficult to determine. At the same time, the scientific community runs an informal field-dependent credit allocation process that assigns credit in a collective fashion to each work. Here we develop a credit allocation algorithm that captures the coauthors' contribution to a publication as perceived by the scientific community, reproducing the informal collective credit allocation of science. We validate the method by identifying the authors of Nobel-winning papers that are credited for the discovery, independent of their positions in the author list. The method can also compare the relative impact of researchers working in the same field, even if they did not publish together. The ability to accurately measure the relative credit of researchers could affect many aspects of credit allocation in science, potentially impacting hiring, funding, and promotion decisions.
Collapse
|
302
|
|
303
|
Wang D, Song C, Shen HW, Barabási AL. Science communication. Response to Comment on "Quantifying long-term scientific impact". Science 2014; 345:149. [PMID: 25013056 DOI: 10.1126/science.1248961] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Wang, Mei, and Hicks claim that they observed large mean prediction errors when using our model. We find that their claims are a simple consequence of overfitting, which can be avoided by standard regularization methods. Here, we show that our model provides an effective means to identify papers that may be subject to overfitting, and the model, with or without prior treatment, outperforms the proposed naïve approach.
Collapse
Affiliation(s)
- Dashun Wang
- IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598, USA
| | - Chaoming Song
- Department of Physics, University of Miami, Coral Gables, FL 33146, USA
| | - Hua-Wei Shen
- Center for Complex Network Research and Departments of Physics, Computer Science and Biology, Northeastern University, Boston, MA 02115, USA. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Albert-László Barabási
- Center for Complex Network Research and Departments of Physics, Computer Science and Biology, Northeastern University, Boston, MA 02115, USA.
| |
Collapse
|
304
|
Wang J, Mei Y, Hicks D. Science communication. Comment on "Quantifying long-term scientific impact". Science 2014; 345:149. [PMID: 25013055 DOI: 10.1126/science.1248770] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Wang et al. (Reports, 4 October 2013, p. 127) claimed high prediction power for their model of citation dynamics. We replicate their analysis but find discouraging results: 14.75% papers are estimated with unreasonably large μ (>5) and λ (>10) and correspondingly enormous prediction errors. The prediction power is even worse than simply using short-term citations to approximate long-term citations.
Collapse
Affiliation(s)
- Jian Wang
- Center for R&D Monitoring (ECOOM) and Department of Managerial Economics, Strategy and Innovation, University of Leuven, Waaistraat 6, Bus 3536, 3000 Leuven, Belgium. Institute for Research Information and Quality Assurance (iFQ), Schuetzenstrasse 6a, 10117 Berlin, Germany.
| | - Yajun Mei
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, 755 Ferst Drive Northwest, Atlanta, GA 30332-0205, USA
| | - Diana Hicks
- School of Public Policy, Georgia Institute of Technology, 685 Cherry Street, Atlanta, GA 30332-0345, USA
| |
Collapse
|
305
|
Abstract
Human activities increasingly take place in online environments, providing novel opportunities for relating individual behaviors to population-level outcomes. In this paper, we introduce a simple generative model for the collective behavior of millions of social networking site users who are deciding between different software applications. Our model incorporates two distinct mechanisms: one is associated with recent decisions of users, and the other reflects the cumulative popularity of each application. Importantly, although various combinations of the two mechanisms yield long-time behavior that is consistent with data, the only models that reproduce the observed temporal dynamics are those that strongly emphasize the recent popularity of applications over their cumulative popularity. This demonstrates--even when using purely observational data without experimental design--that temporal data-driven modeling can effectively distinguish between competing microscopic mechanisms, allowing us to uncover previously unidentified aspects of collective online behavior.
Collapse
|
306
|
Abstract
For the study of information propagation, one fundamental problem is uncovering universal laws governing the dynamics of information propagation. This problem, from the microscopic perspective, is formulated as estimating the propagation probability that a piece of information propagates from one individual to another. Such a propagation probability generally depends on two major classes of factors: the intrinsic attractiveness of information and the interactions between individuals. Despite the fact that the temporal effect of attractiveness is widely studied, temporal laws underlying individual interactions remain unclear, causing inaccurate prediction of information propagation on evolving social networks. In this report, we empirically study the dynamics of information propagation, using the dataset from a population-scale social media website. We discover a temporal scaling in information propagation: the probability a message propagates between two individuals decays with the length of time latency since their latest interaction, obeying a power-law rule. Leveraging the scaling law, we further propose a temporal model to estimate future propagation probabilities between individuals, reducing the error rate of information propagation prediction from 6.7% to 2.6% and improving viral marketing with 9.7% incremental customers.
Collapse
|
307
|
Winnink JJ, Tijssen RJW. R&D dynamics and scientific breakthroughs in HIV/AIDS drugs development: the case of Integrase Inhibitors. Scientometrics 2014. [DOI: 10.1007/s11192-014-1330-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
308
|
Author Impact Factor: tracking the dynamics of individual scientific impact. Sci Rep 2014; 4:4880. [PMID: 24814674 PMCID: PMC4017244 DOI: 10.1038/srep04880] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Accepted: 03/18/2014] [Indexed: 11/08/2022] Open
Abstract
The impact factor (IF) of scientific journals has acquired a major role in the evaluations of the output of scholars, departments and whole institutions. Typically papers appearing in journals with large values of the IF receive a high weight in such evaluations. However, at the end of the day one is interested in assessing the impact of individuals, rather than papers. Here we introduce Author Impact Factor (AIF), which is the extension of the IF to authors. The AIF of an author A in year t is the average number of citations given by papers published in year t to papers published by A in a period of Δt years before year t. Due to its intrinsic dynamic character, AIF is capable to capture trends and variations of the impact of the scientific output of scholars in time, unlike the h-index, which is a growing measure taking into account the whole career path.
Collapse
|
309
|
Abstract
Starting from the dataset of the publication corpus of the APS during the period 1955–2009, we reconstruct the individual researchers trajectories, namely the list of the consecutive affiliations for each scholar. Crossing this information with different geographic datasets we embed these trajectories in a spatial framework. Using methods from network theory and complex systems analysis we characterise these patterns in terms of topological network properties and we analyse the dependence of an academic path across different dimensions: the distance between two subsequent positions, the relative importance of the institutions (in terms of number of publications) and some socio–cultural traits. We show that distance is not always a good predictor for the next affiliation while other factors like “the previous steps” of the career of the researchers (in particular the first position) or the linguistic and historical similarity between two countries can have an important impact. Finally we show that the dataset exhibit a memory effect, hence the fate of a career strongly depends from the first two affiliations.
Collapse
|
310
|
Deville P, Wang D, Sinatra R, Song C, Blondel VD, Barabási AL. Career on the move: geography, stratification, and scientific impact. Sci Rep 2014; 4:4770. [PMID: 24759743 PMCID: PMC3998072 DOI: 10.1038/srep04770] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Accepted: 04/07/2014] [Indexed: 11/22/2022] Open
Abstract
Changing institutions is an integral part of an academic life. Yet little is known about the mobility patterns of scientists at an institutional level and how these career choices affect scientific outcomes. Here, we examine over 420,000 papers, to track the affiliation information of individual scientists, allowing us to reconstruct their career trajectories over decades. We find that career movements are not only temporally and spatially localized, but also characterized by a high degree of stratification in institutional ranking. When cross-group movement occurs, we find that while going from elite to lower-rank institutions on average associates with modest decrease in scientific performance, transitioning into elite institutions does not result in subsequent performance gain. These results offer empirical evidence on institutional level career choices and movements and have potential implications for science policy.
Collapse
Affiliation(s)
- Pierre Deville
- 1] Department of Applied Mathematics, Université catholique de Louvain, Belgium [2] CCNR and Physics Department, Northeastern University, Boston, MA 02115, USA
| | - Dashun Wang
- 1] CCNR and Physics Department, Northeastern University, Boston, MA 02115, USA [2] IBM Thomas J. Watson Research Center, Yorktown Heights, NY, 10598, USA
| | - Roberta Sinatra
- 1] CCNR and Physics Department, Northeastern University, Boston, MA 02115, USA [2] Center for Cancer Systems Biology, Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | - Chaoming Song
- 1] CCNR and Physics Department, Northeastern University, Boston, MA 02115, USA [2] Department of Physics, University of Miami, Coral Gables, FL 33124, USA
| | - Vincent D Blondel
- Department of Applied Mathematics, Université catholique de Louvain, Belgium
| | - Albert-László Barabási
- 1] CCNR and Physics Department, Northeastern University, Boston, MA 02115, USA [2] Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 [3] Center for Network Science, Central European University, Budapest, Hungary
| |
Collapse
|
311
|
Medo M. Statistical validation of high-dimensional models of growing networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:032801. [PMID: 24730893 DOI: 10.1103/physreve.89.032801] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2013] [Indexed: 06/03/2023]
Abstract
The abundance of models of complex networks and the current insufficient validation standards make it difficult to judge which models are strongly supported by data and which are not. We focus here on likelihood maximization methods for models of growing networks with many parameters and compare their performance on artificial and real datasets. While high dimensionality of the parameter space harms the performance of direct likelihood maximization on artificial data, this can be improved by introducing a suitable penalization term. Likelihood maximization on real data shows that the presented approach is able to discriminate among available network models. To make large-scale datasets accessible to this kind of analysis, we propose a subset sampling technique and show that it yields substantial model evidence in a fraction of time necessary for the analysis of the complete data.
Collapse
Affiliation(s)
- Matúš Medo
- Physics Department, Chemin du Musée 3, University of Fribourg, 1700 Fribourg, Switzerland
| |
Collapse
|
312
|
|
313
|
|
314
|
|