Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Disambiguation method and device for thesis author and computer equipment

A paper author, paper technology, applied in the disambiguation method, device and computer equipment field of paper author, can solve the problem that the correspondence between paper and author name is not available, and achieve the effect of eliminating clustering errors and improving accuracy

Active Publication Date: 2020-11-03
PING AN TECH (SHENZHEN) CO LTD
View PDF9 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003]The main purpose of this application is to provide a disambiguation method for the author of the paper, aiming to solve the technical problem that the corresponding relationship between the paper and the name of the author in the database cannot reach the usable level

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Disambiguation method and device for thesis author and computer equipment
  • Disambiguation method and device for thesis author and computer equipment
  • Disambiguation method and device for thesis author and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0055] refer to figure 1 , a disambiguation method for the author of a paper in this embodiment, comprising:

[0056]S1: Form the names of authors involved in all papers in the database according to the preset rules to form a name tree;

[0057] S2: Obtain a heterogeneous network of association relationships corresponding to all papers in the database, wherein the heterogeneous network of association relationships includes the association relationship between authors and collaborators, and the association relationship between authors and institutions;

[0058] S3: Obtain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the artificial intelligence technology, and discloses a paper author disambiguation method comprising the following steps: respectively forming author names involved in all papers in a database into name trees according to preset rules; obtaining association relationship heterogeneous networks corresponding to all papers in a database; obtaining paper semantic representations respectively corresponding to all papers in the database; constructing a similar matrix based on the name tree, the association relationship heterogeneous network and the paper semantic representation; clustering the similar matrixes to obtain paper clustering groups corresponding to all papers in a database; judging whether the paper clustering group corresponding to the author to be disambiguated belongs to a paper clustering group corresponding to a specified author or or not; and if not, judging that the author to be disambiguated is different from the specified author. According to the method and device, the author names are preprocessed to construct the name tree, then clustering errors caused by different expression modes of name writing are eliminated according to the name tree, it is guaranteed that the names of the same author are divided into the same group as much as possible, and the name disambiguation accuracy is improved.

Description

technical field [0001] This application relates to the technical field of artificial intelligence, in particular to the author's disambiguation method, device and computer equipment. Background technique [0002] There are a huge number of papers in the paper database, and each paper often involves more than one author. It is difficult to form a unique academic ID for each author based on the database, to achieve a unique correspondence between the papers in the database and the natural person of the author, and to realize the identification of authors with the same name. Distinguish papers and improve the accuracy of database retrieval. However, the existing implementation methods require a high degree of participation by the author. For example, the author uploads the paper and maintains personal information, which makes the author's enthusiasm for using it low, making it difficult to implement. Therefore, the database information is difficult to complete, and the correspo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/30G06F16/35G06K9/62
CPCG06F40/30G06F16/35G06F18/23G06F18/24
Inventor 马文佳林桂倪渊
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products