Supercharge Your Innovation With Domain-Expert AI Agents!

Research interest mining method, device and storage medium for scholar clustering

A technology for scholars and interests, applied in the field of big data, it can solve the problems such as the inability to track the research interests of scholars, the discontinuity between the time of publication of papers and the application of projects, and the inability to directly obtain dynamic semantic representations of research interests of scholars, so as to improve the aggregation rate. Class accuracy, precise scholar clustering, and the effect of realizing research interest mining

Active Publication Date: 2021-12-07
BEIJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, none of these dynamic representation models can track scholars' research interests from academic data
On the one hand, the publication time of papers and project application time are discontinuous, which is not suitable for online processing of academic data
On the other hand, existing batch methods are only suitable for single-source or monolingual data, and thus cannot be directly used to obtain dynamic semantic representations of scholars' research interests.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Research interest mining method, device and storage medium for scholar clustering
  • Research interest mining method, device and storage medium for scholar clustering
  • Research interest mining method, device and storage medium for scholar clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with the embodiments and accompanying drawings. Here, the exemplary embodiments and descriptions of the present invention are used to explain the present invention, but not to limit the present invention.

[0045] Here, it should also be noted that, in order to avoid obscuring the present invention due to unnecessary details, only the structures and / or processing steps closely related to the solution according to the present invention are shown in the drawings, and the related Other details are not relevant to the invention.

[0046] It should be emphasized that the term "comprising / comprising" when used herein refers to the presence of a feature, element, step or component, but does not exclude the presence or addition of one or more other features, elements, steps or components.

[0047] Aiming...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a research interest mining method, device and storage medium for scholars clustering. The method includes the following steps: constructing an academic metadata set based on academic data related to scholars from multiple sources; inputting academic metadata as input data into a pre-established In the research interest mining model of , the topic model is sampled to obtain the semantic representation of scholar interest, which includes professional field-topic distribution, topic-English word distribution, topic-Chinese word distribution and topic-scholar distribution; Semantic interest representation performs scholar clustering and obtains scholar clustering results; the research interest mining model shares the same topic distribution for the data of scholars from the same data source and belongs to the same professional field. In the research interest mining model, the professional field-subject distribution is modeled as a Dirichlet distribution, and the subject-English word distribution, subject-Chinese word distribution, and subject-scholar distribution are modeled as multinomial distributions.

Description

technical field [0001] The invention relates to the field of big data technology, in particular to a research interest mining method, device and storage medium for scholars clustering. Background technique [0002] Academic data such as scholars, research projects, and papers all have their own areas of expertise. For example, some data belongs to software engineering and some data belongs to artificial intelligence. The research content of different professional fields is different, and the data of the same professional field often have a common theme distribution. For academic data of scholars, discovering research interests of scholars from academic data and clustering scholars according to their research interests are important for many tasks, such as selecting collaborators for scholars, reviewers for journals, and selection for governments expert. Different from general data, scholarly interest-related academic data has its unique properties. On the one hand, it is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F16/33
CPCG06F16/353G06F16/3331G06F2216/03
Inventor 寇菲菲王文东杜军平李昂薛哲梁美玉
Owner BEIJING UNIV OF POSTS & TELECOMM
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More