1
|
He C, Liu F, Dong K, Wu J, Zhang Q. Research on the formation mechanism of research leadership relations: An exponential random graph model analysis approach. J Informetr 2023. [DOI: 10.1016/j.joi.2023.101401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
|
2
|
Deep representation learning of scientific paper reveals its potential scholarly impact. J Informetr 2023. [DOI: 10.1016/j.joi.2023.101376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
3
|
A multi-view method of scientific paper classification via heterogeneous graph embeddings. Scientometrics 2022. [DOI: 10.1007/s11192-022-04419-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
4
|
Wang Z, Liu Y. SEA-PS: Semantic embedding with attention to measuring patent similarity by leveraging various text fields. J Inf Sci 2022. [DOI: 10.1177/01655515221106651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Similarity metrics are critical to identifying the relationships between patents. While many bibliometric methods such as co-citation and co-classification fail to use the vast majority of technical information existing in the text, most text mining methods focus on keywords in only one text field of the patent document. This article aims to leverage various text fields to measure pairwise patent similarity according to their technological bases. A novel approach called semantic embedding with attention for patent similarity (SEA-PS) is proposed. First, the method identifies technological bases and models the semantic relatedness. To achieve this, we put forward an additional patent stop-word list to help extract technical terms with an n-gram-based statistical method. The technical terms are then mapped into a vector space using word embedding. Second, we propose a graph-based method to allocate weights to distinguish the technical focus, considering the linkages between technologies. Finally, we assess the feasibility of the text fields, and integrate their semantics at the patent-level with an attention layer to conduct similarity metrics. The validations are from two perspectives: content validity (coverage of technical information, the validity of semantic representations and effectiveness of text field combinations), and external validity against existing methods via an expert panel. The results demonstrate the superiority of SEA-PS to existing methods, and suggest that ‘abstracts’, ‘claims’ and ‘technical descriptions’ are more effective than ‘titles’. SEA-PS is a fundamental tool for patent retrieval and classification. It also has a broad range of practical applications in innovation and strategy studies, including identifying technological frontiers and studying knowledge spillovers.
Collapse
Affiliation(s)
- Zihong Wang
- School of Economics, Huazhong University of Science and Technology, China
| | - Yufei Liu
- Center for Strategic Studies, Chinese Academy of Engineering, China
| |
Collapse
|
5
|
He C, Wu J, Zhang Q. Proximity‐aware research leadership recommendation in research collaboration via deep neural networks. J Assoc Inf Sci Technol 2021. [DOI: 10.1002/asi.24546] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Chaocheng He
- School of Information Management Wuhan University Wuhan Hubei China
- School of Data Science City University of Hong Kong Kowloon, Hong Kong China
| | - Jiang Wu
- School of Information Management Wuhan University Wuhan Hubei China
| | - Qingpeng Zhang
- School of Data Science City University of Hong Kong Kowloon, Hong Kong China
| |
Collapse
|
6
|
Semantic and relational spaces in science of science: deep learning models for article vectorisation. Scientometrics 2021. [DOI: 10.1007/s11192-021-03984-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
AbstractOver the last century, we observe a steady and exponential growth of scientific publications globally. The overwhelming amount of available literature makes a holistic analysis of the research within a field and between fields based on manual inspection impossible. Automatic techniques to support the process of literature review are required to find the epistemic and social patterns that are embedded in scientific publications. In computer sciences, new tools have been developed to deal with large volumes of data. In particular, deep learning techniques open the possibility of automated end-to-end models to project observations to a new, low-dimensional space where the most relevant information of each observation is highlighted. Using deep learning to build new representations of scientific publications is a growing but still emerging field of research. The aim of this paper is to discuss the potential and limits of deep learning for gathering insights about scientific research articles. We focus on document-level embeddings based on the semantic and relational aspects of articles, using Natural Language Processing (NLP) and Graph Neural Networks (GNNs). We explore the different outcomes generated by those techniques. Our results show that using NLP we can encode a semantic space of articles, while GNN we enable us to build a relational space where the social practices of a research community are also encoded.
Collapse
|
7
|
|