Unlock instant, AI-driven research and patent intelligence for your innovation.

A Disambiguation Archiving and Storage Method for Data of Scientific Research Achievements

A technology for archiving and storing scientific research results, applied in the field of data processing, it can solve the problems of high cost, misclassification, and no consideration of the data characteristics of other scientific research results, and achieve the effect of improving the accuracy.

Active Publication Date: 2022-06-07
四川优科服科技服务有限公司
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method has an obvious shortcoming. The binary classification method does not consider the global distribution characteristics of the author's literature. If a researcher's scientific research data research direction is completely different from other literature, it will lead to misclassification.
[0004] Usually, the disambiguation method based on supervised learning is better than other methods, but in practical applications, it is impractical and expensive to manually label the data of large-scale scientific research results database
At present, most scientific researchers disambiguate with the same name, and disambiguation is done based on the paper data set, without considering the data characteristics of other scientific research results, it will appear that a researcher's paper is completely different from other documents, but may be related to a certain patent, or Other types of scientific research data belong to the scientific research achievements of the same period

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Disambiguation Archiving and Storage Method for Data of Scientific Research Achievements
  • A Disambiguation Archiving and Storage Method for Data of Scientific Research Achievements
  • A Disambiguation Archiving and Storage Method for Data of Scientific Research Achievements

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The present invention will be further described below with reference to the accompanying drawings.

[0057] like figure 1 The present invention provides a method for disambiguating, archiving and storing scientific research achievement data. First, disambiguation is carried out by using the strong characteristics of various scientific research achievement data, and then based on the attribute characteristics of various scientific research achievement data, combined with clustering and disambiguation based on feature relation graph. method to accurately archive the incremental data of scientific research results, the specific steps are as follows:

[0058] S1. Structural processing and data completion of the scientific research achievement data of the archived scientific research personnel and the scientific research achievement data to be archived, and storing them in the database;

[0059] S2. Obtain the collaborator field of the scientific research result data, calcu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a scientific research achievement data disambiguation archiving storage method, including: S1, processing and completing the scientific research achievement data that has been archived and to be archived, and saving it to the database; S2, calculating the similarity of collaborators, and filing if they match , if not, enter S3; S3, cluster the data that has been archived and the data to be archived; S4, calculate the distance from the center point of each cluster of the data to be archived to the center point of each cluster of each scientific researcher with the same name, and obtain the closest distance Scientists who belong to the cluster; S5. Establish a feature relationship graph; S6. Calculate the similarity probability between the data node to be archived and each data node that has been archived, and calculate the average and variance, and compare it with the threshold to complete the archive. The solution proposed by the present invention does not require data labeling training, and is more practical in most scientific research personnel systems, and can quickly realize data disambiguation and simultaneously improve the accuracy of disambiguation.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a method for archiving and storing scientific research achievement data disambiguation. Background technique [0002] With the popularization of the Internet, different institutions or departments currently have their own online scientific research personnel systems or scientific research personnel information databases, and a large number of scientific research personnel achievement information will be added from time to time. These data must be accurately archived in the system. Authors already exist. Archives, the problem of researchers with the same name is an urgent problem to be solved by this type of system. [0003] Existing disambiguation methods of the same name almost all transform the problem into a related clustering or classification problem for machine learning. In the process of data incremental disambiguation, most scholars currently use the supervised disambiguat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/31G06F16/33G06F16/332G06F16/35
CPCG06F16/316G06F16/332G06F16/3346G06F16/35
Inventor 杨春明郭鑫张晖李波赵旭剑
Owner 四川优科服科技服务有限公司