Unlock instant, AI-driven research and patent intelligence for your innovation.

Scholar clustering-oriented research interest mining method and device and storage medium

A technology for scholars and interests, applied in the field of big data, it can solve problems such as the inability to track the research interests of scholars, the semantic representation of scholars’ research interests that cannot be directly used to obtain dynamics, the discontinuity of paper publication time and project application time, etc., to achieve accurate scholars Clustering, realizing research interest mining, and improving the effect of clustering accuracy

Active Publication Date: 2021-10-19
BEIJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, none of these dynamic representation models can track scholars' research interests from academic data
On the one hand, the publication time of papers and project application time are discontinuous, which is not suitable for online processing of academic data
On the other hand, existing batch methods are only suitable for single-source or monolingual data, and thus cannot be directly used to obtain dynamic semantic representations of scholars' research interests.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scholar clustering-oriented research interest mining method and device and storage medium
  • Scholar clustering-oriented research interest mining method and device and storage medium
  • Scholar clustering-oriented research interest mining method and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with the embodiments and accompanying drawings. Here, the exemplary embodiments and descriptions of the present invention are used to explain the present invention, but not to limit the present invention.

[0045] Here, it should also be noted that, in order to avoid obscuring the present invention due to unnecessary details, only the structures and / or processing steps closely related to the solution according to the present invention are shown in the drawings, and the related Other details are not relevant to the invention.

[0046] It should be emphasized that the term "comprising / comprising" when used herein refers to the presence of a feature, element, step or component, but does not exclude the presence or addition of one or more other features, elements, steps or components.

[0047] Aiming...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a scholar clustering-oriented research interest mining method and device and a storage medium. The method comprises the following steps of constructing an academic metadata set based on multi-source scholar related academic data; serving the academic metadata as input data to be input into a pre-established research interest mining model, sampling the topic model to obtain scholar interest semantic representation, wherein the scholar interest semantic representation comprises professional field-topic distribution, topic-English word distribution, topic-Chinese word distribution and topic-scholar distribution; performing scholar clustering based on the obtained scholar interest semantic representation to obtain a scholar clustering result; enabling the study interest mining model to share the same subject distribution for data of scholars from the same data source and belonging to the same professional field, and in the study interest mining model, the professional field-subject distribution is modeled as Dirichlet distribution; and modeling the subject-English word distribution, the subject-Chinese word distribution, and the subject-scholar distribution as polynomial distributions.

Description

technical field [0001] The invention relates to the field of big data technology, in particular to a research interest mining method, device and storage medium for scholars clustering. Background technique [0002] Academic data such as scholars, research projects, and papers all have their own areas of expertise. For example, some data belongs to software engineering and some data belongs to artificial intelligence. The research content of different professional fields is different, and the data of the same professional field often have a common theme distribution. For academic data of scholars, discovering research interests of scholars from academic data and clustering scholars according to their research interests are important for many tasks, such as selecting collaborators for scholars, reviewers for journals, and selection for governments expert. Different from general data, scholarly interest-related academic data has its unique properties. On the one hand, it is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/33
CPCG06F16/353G06F16/3331G06F2216/03
Inventor 寇菲菲王文东杜军平李昂薛哲梁美玉
Owner BEIJING UNIV OF POSTS & TELECOMM