1
|
Yang KC, Aronson B, Ahn YY. BiRank: Fast and Flexible Ranking on Bipartite Networks with R and Python. JOURNAL OF OPEN SOURCE SOFTWARE 2020; 5:2315. [PMID: 34729449 PMCID: PMC8559594 DOI: 10.21105/joss.02315] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Affiliation(s)
- Kai-Cheng Yang
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN
| | - Brian Aronson
- Department of Sociology, Indiana University, Bloomington, IN
| | - Yong-Yeol Ahn
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN
| |
Collapse
|
2
|
Hoppe TA, Litovitz A, Willis KA, Meseroll RA, Perkins MJ, Hutchins BI, Davis AF, Lauer MS, Valantine HA, Anderson JM, Santangelo GM. Topic choice contributes to the lower rate of NIH awards to African-American/black scientists. SCIENCE ADVANCES 2019; 5:eaaw7238. [PMID: 31633016 PMCID: PMC6785250 DOI: 10.1126/sciadv.aaw7238] [Citation(s) in RCA: 344] [Impact Index Per Article: 68.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 09/14/2019] [Indexed: 05/18/2023]
Abstract
Despite efforts to promote diversity in the biomedical workforce, there remains a lower rate of funding of National Institutes of Health R01 applications submitted by African-American/black (AA/B) scientists relative to white scientists. To identify underlying causes of this funding gap, we analyzed six stages of the application process from 2011 to 2015 and found that disparate outcomes arise at three of the six: decision to discuss, impact score assignment, and a previously unstudied stage, topic choice. Notably, AA/B applicants tend to propose research on topics with lower award rates. These topics include research at the community and population level, as opposed to more fundamental and mechanistic investigations; the latter tend to have higher award rates. Topic choice alone accounts for over 20% of the funding gap after controlling for multiple variables, including the applicant's prior achievements. Our findings can be used to inform interventions designed to close the funding gap.
Collapse
Affiliation(s)
- Travis A. Hoppe
- Office of Portfolio Analysis, National Institutes of Health, Bethesda, MD, USA
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
| | - Aviva Litovitz
- Office of Portfolio Analysis, National Institutes of Health, Bethesda, MD, USA
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
| | - Kristine A. Willis
- National Institute of General Medical Sciences, National Institutes of Health, Bethesda, MD, USA
| | - Rebecca A. Meseroll
- Office of Portfolio Analysis, National Institutes of Health, Bethesda, MD, USA
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
| | - Matthew J. Perkins
- Office of Portfolio Analysis, National Institutes of Health, Bethesda, MD, USA
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
| | - B. Ian Hutchins
- Office of Portfolio Analysis, National Institutes of Health, Bethesda, MD, USA
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
| | - Alison F. Davis
- Scientific Workforce Diversity, National Institutes of Health, Bethesda, MD, USA
| | - Michael S. Lauer
- Office of Extramural Research, National Institutes of Health, Bethesda, MD, USA
| | - Hannah A. Valantine
- Scientific Workforce Diversity, National Institutes of Health, Bethesda, MD, USA
| | - James M. Anderson
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
| | - George M. Santangelo
- Office of Portfolio Analysis, National Institutes of Health, Bethesda, MD, USA
- Division of Program Coordination, Planning, and Strategic Initiatives, National Institutes of Health, Bethesda, MD, USA
- Corresponding author.
| |
Collapse
|
3
|
A Novel Entropy-Based Centrality Approach for Identifying Vital Nodes in Weighted Networks. ENTROPY 2018; 20:e20040261. [PMID: 33265352 PMCID: PMC7512776 DOI: 10.3390/e20040261] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 03/30/2018] [Accepted: 04/07/2018] [Indexed: 12/25/2022]
Abstract
Measuring centrality has recently attracted increasing attention, with algorithms ranging from those that simply calculate the number of immediate neighbors and the shortest paths to those that are complicated iterative refinement processes and objective dynamical approaches. Indeed, vital nodes identification allows us to understand the roles that different nodes play in the structure of a network. However, quantifying centrality in complex networks with various topological structures is not an easy task. In this paper, we introduce a novel definition of entropy-based centrality, which can be applicable to weighted directed networks. By design, the total power of a node is divided into two parts, including its local power and its indirect power. The local power can be obtained by integrating the structural entropy, which reveals the communication activity and popularity of each node, and the interaction frequency entropy, which indicates its accessibility. In addition, the process of influence propagation can be captured by the two-hop subnetworks, resulting in the indirect power. In order to evaluate the performance of the entropy-based centrality, we use four weighted real-world networks with various instance sizes, degree distributions, and densities. Correspondingly, these networks are adolescent health, Bible, United States (US) airports, and Hep-th, respectively. Extensive analytical results demonstrate that the entropy-based centrality outperforms degree centrality, betweenness centrality, closeness centrality, and the Eigenvector centrality.
Collapse
|
4
|
Adeyemi IR, Razak SA, Salleh M, Venter HS. Observing Consistency in Online Communication Patterns for User Re-Identification. PLoS One 2016; 11:e0166930. [PMID: 27918593 PMCID: PMC5137900 DOI: 10.1371/journal.pone.0166930] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2015] [Accepted: 11/07/2016] [Indexed: 11/19/2022] Open
Abstract
Comprehension of the statistical and structural mechanisms governing human dynamics in online interaction plays a pivotal role in online user identification, online profile development, and recommender systems. However, building a characteristic model of human dynamics on the Internet involves a complete analysis of the variations in human activity patterns, which is a complex process. This complexity is inherent in human dynamics and has not been extensively studied to reveal the structural composition of human behavior. A typical method of anatomizing such a complex system is viewing all independent interconnectivity that constitutes the complexity. An examination of the various dimensions of human communication pattern in online interactions is presented in this paper. The study employed reliable server-side web data from 31 known users to explore characteristics of human-driven communications. Various machine-learning techniques were explored. The results revealed that each individual exhibited a relatively consistent, unique behavioral signature and that the logistic regression model and model tree can be used to accurately distinguish online users. These results are applicable to one-to-one online user identification processes, insider misuse investigation processes, and online profiling in various areas.
Collapse
Affiliation(s)
- Ikuesan Richard Adeyemi
- Information Assurance and Security Research Group, Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor Bahru, Malaysia
- Information and Computer Security Architecture Research Group, Department of Computer Science, University of Pretoria, Lynnwood, South Africa
| | - Shukor Abd Razak
- Information Assurance and Security Research Group, Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor Bahru, Malaysia
| | - Mazleena Salleh
- Information Assurance and Security Research Group, Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor Bahru, Malaysia
| | - Hein S. Venter
- Information and Computer Security Architecture Research Group, Department of Computer Science, University of Pretoria, Lynnwood, South Africa
| |
Collapse
|
5
|
Relative Citation Ratio (RCR): A New Metric That Uses Citation Rates to Measure Influence at the Article Level. PLoS Biol 2016; 14:e1002541. [PMID: 27599104 PMCID: PMC5012559 DOI: 10.1371/journal.pbio.1002541] [Citation(s) in RCA: 266] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 08/01/2016] [Indexed: 11/19/2022] Open
Abstract
Despite their recognized limitations, bibliometric assessments of scientific productivity have been widely adopted. We describe here an improved method to quantify the influence of a research article by making novel use of its co-citation network to field-normalize the number of citations it has received. Article citation rates are divided by an expected citation rate that is derived from performance of articles in the same field and benchmarked to a peer comparison group. The resulting Relative Citation Ratio is article level and field independent and provides an alternative to the invalid practice of using journal impact factors to identify influential papers. To illustrate one application of our method, we analyzed 88,835 articles published between 2003 and 2010 and found that the National Institutes of Health awardees who authored those papers occupy relatively stable positions of influence across all disciplines. We demonstrate that the values generated by this method strongly correlate with the opinions of subject matter experts in biomedical research and suggest that the same approach should be generally applicable to articles published in all areas of science. A beta version of iCite, our web tool for calculating Relative Citation Ratios of articles listed in PubMed, is available at https://icite.od.nih.gov.
Collapse
|
6
|
Zhang FG, Zeng A. Information Filtering via Heterogeneous Diffusion in Online Bipartite Networks. PLoS One 2015; 10:e0129459. [PMID: 26125631 PMCID: PMC4488376 DOI: 10.1371/journal.pone.0129459] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Accepted: 05/10/2015] [Indexed: 11/18/2022] Open
Abstract
The rapid expansion of Internet brings us overwhelming online information, which is impossible for an individual to go through all of it. Therefore, recommender systems were created to help people dig through this abundance of information. In networks composed by users and objects, recommender algorithms based on diffusion have been proven to be one of the best performing methods. Previous works considered the diffusion process from user to object, and from object to user to be equivalent. We show in this work that it is not the case and we improve the quality of the recommendation by taking into account the asymmetrical nature of this process. We apply this idea to modify the state-of-the-art recommendation methods. The simulation results show that the new methods can outperform these existing methods in both recommendation accuracy and diversity. Finally, this modification is checked to be able to improve the recommendation in a realistic case.
Collapse
Affiliation(s)
- Fu-Guo Zhang
- School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, P.R. China
- Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang, 330013, P. R. China
| | - An Zeng
- School of Systems Science, Beijing Normal University, Beijing, 100875, P. R. China
- * E-mail:
| |
Collapse
|
7
|
Predicting Positive and Negative Relationships in Large Social Networks. PLoS One 2015; 10:e0129530. [PMID: 26075404 PMCID: PMC4468140 DOI: 10.1371/journal.pone.0129530] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2014] [Accepted: 05/11/2015] [Indexed: 11/27/2022] Open
Abstract
In a social network, users hold and express positive and negative attitudes (e.g. support/opposition) towards other users. Those attitudes exhibit some kind of binary relationships among the users, which play an important role in social network analysis. However, some of those binary relationships are likely to be latent as the scale of social network increases. The essence of predicting latent binary relationships have recently began to draw researchers' attention. In this paper, we propose a machine learning algorithm for predicting positive and negative relationships in social networks inspired by structural balance theory and social status theory. More specifically, we show that when two users in the network have fewer common neighbors, the prediction accuracy of the relationship between them deteriorates. Accordingly, in the training phase, we propose a segment-based training framework to divide the training data into two subsets according to the number of common neighbors between users, and build a prediction model for each subset based on support vector machine (SVM). Moreover, to deal with large-scale social network data, we employ a sampling strategy that selects small amount of training data while maintaining high accuracy of prediction. We compare our algorithm with traditional algorithms and adaptive boosting of them. Experimental results of typical data sets show that our algorithm can deal with large social networks and consistently outperforms other methods.
Collapse
|
8
|
Energy Spectral Behaviors of Communication Networks of Open-Source Communities. PLoS One 2015; 10:e0128251. [PMID: 26047331 PMCID: PMC4457875 DOI: 10.1371/journal.pone.0128251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Accepted: 04/23/2015] [Indexed: 11/19/2022] Open
Abstract
Large-scale online collaborative production activities in open-source communities must be accompanied by large-scale communication activities. Nowadays, the production activities of open-source communities, especially their communication activities, have been more and more concerned. Take CodePlex C # community for example, this paper constructs the complex network models of 12 periods of communication structures of the community based on real data; then discusses the basic concepts of quantum mapping of complex networks, and points out that the purpose of the mapping is to study the structures of complex networks according to the idea of quantum mechanism in studying the structures of large molecules; finally, according to this idea, analyzes and compares the fractal features of the spectra in different quantum mappings of the networks, and concludes that there are multiple self-similarity and criticality in the communication structures of the community. In addition, this paper discusses the insights and application conditions of different quantum mappings in revealing the characteristics of the structures. The proposed quantum mapping method can also be applied to the structural studies of other large-scale organizations.
Collapse
|