Method and system for enhancing file entity association degree based on knowledge graph

The technology applies knowledge graphs to enhancing the association degree of archive entities. It addresses the low association degree and low utilization rate of archive data, and aims to improve both.

Pending Publication Date: 2020-10-09
AGRI INFORMATION INST OF CAS

AI Technical Summary

Problems solved by technology

[0004] Therefore, the present invention provides a method and system for enhancing the correlation degree of archive entities base


Examples

Embodiment 1

[0052] An embodiment of the present invention provides a method for enhancing the association degree of archive entities based on knowledge graphs. As shown in Figure 1, the method includes the following steps:

[0053] Step S1: Acquiring archive text data.

[0054] Collecting archive resources is the basis of archive management, whose basic function is intelligent collection and archiving. This function serves as the centralized processing center for electronic files submitted for archiving. It mainly handles the submission of archive data, the receipt of archives, and data conversion between the archive system and other application systems. Intelligent technology is used to collect the information and data generated by different data sources and to extract potentially useful information. After intelligent archiving is complete, the data must be processed and analyzed according to the characteristics of the archive data. As the existing paper ar...

Embodiment 2

[0071] An embodiment of the present invention provides a system for enhancing the association degree of archive entities based on knowledge graphs. As shown in Figure 7, the system includes:

[0072] The data acquisition module 1 is used to acquire text data; this module executes the method described in step S1 in Embodiment 1, which will not be repeated here.

[0073] The entity recognition module 2 is used to identify the archive text data by using the entity recognition model, and to generate the instance data of the defined entity; this module executes the method described in step S2 in Embodiment 1, which will not be repeated here.

[0074] The relationship extraction module 3 is used to identify the instance data of the defined entity by using the relationship extraction model, and generate the smallest unit in the knowledge graph; this module executes the method described in step S3 in Embodiment 1, which will not be repeated here.

[0075] The knowledge fusion module 4 is used to ...
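The way modules 1 to 3 hand data to one another can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the source does not specify the data layout or the models, so the `Entity` type, the `(head, relation, tail)` triple as the "smallest unit" of the graph, the dictionary lookup standing in for the entity recognition model, and the label-pair rules standing in for the relationship extraction model are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Entity:
    """Hypothetical instance of a defined entity: surface text plus type label."""
    text: str
    label: str


# Assumed minimal unit of the knowledge graph: (head, relation, tail).
Triple = Tuple[str, str, str]


def acquire_text(documents: List[str]) -> List[str]:
    """Module 1 stand-in: normalise whitespace and drop empty records."""
    return [" ".join(d.split()) for d in documents if d.strip()]


def recognise_entities(text: str, lexicon: Dict[str, str]) -> List[Entity]:
    """Module 2 stand-in: dictionary lookup instead of a trained model."""
    return [Entity(term, label) for term, label in lexicon.items() if term in text]


def extract_relations(entities: List[Entity],
                      rules: Dict[Tuple[str, str], str]) -> List[Triple]:
    """Module 3 stand-in: emit a triple when a rule exists for the label pair."""
    triples = []
    for head in entities:
        for tail in entities:
            relation = rules.get((head.label, tail.label))
            if relation and head != tail:
                triples.append((head.text, relation, tail.text))
    return triples


def run_pipeline(documents: List[str],
                 lexicon: Dict[str, str],
                 rules: Dict[Tuple[str, str], str]) -> List[Triple]:
    """Chain the three stand-in modules over every acquired document."""
    triples: List[Triple] = []
    for text in acquire_text(documents):
        triples.extend(extract_relations(recognise_entities(text, lexicon), rules))
    return triples


if __name__ == "__main__":
    lexicon = {"Zhang San": "Person", "Archive Bureau": "Organization"}
    rules = {("Person", "Organization"): "works_for"}
    docs = ["Zhang San  joined the Archive Bureau in 1998.", "   "]
    print(run_pipeline(docs, lexicon, rules))
```

In a real system the two stand-in functions would be replaced by the trained models described in Embodiment 1; the sketch only shows that each module consumes the previous module's output.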

Embodiment 3

[0078] An embodiment of the present invention provides a terminal. As shown in Figure 8, it includes: at least one processor 401, such as a CPU (Central Processing Unit), at least one communication interface 403, a memory 404, and at least one communication bus 402. The communication bus 402 is used to realize connection and communication between these components. The communication interface 403 may include a display screen (Display) and a keyboard (Keyboard), and optionally may also include a standard wired interface and a wireless interface. The memory 404 may be a high-speed RAM (Random Access Memory, a volatile memory) or a non-volatile memory, such as at least one disk memory. Optionally, the memory 404 may also be at least one storage device located away from the aforementioned processor 401. The processor 401 may execute the method for enhancing ...

Abstract

The invention discloses a method and a system for enhancing the association degree of archive entities based on a knowledge graph. The method comprises the following steps: obtaining archive text data; identifying the archive text data by using an entity recognition model, and generating instance data of a defined entity; identifying the instance data of the defined entity by using a relationship extraction model, and generating the minimum unit in a knowledge graph; and using a knowledge fusion model to carry out deduplication preprocessing on the minimum units in the knowledge graph, establish partition index sub-documents, search for matched entities according to text similarity or structural similarity, and perform knowledge fusion through a preset entity alignment algorithm to enhance the association degree of archive entities. Entity recognition, relationship extraction and fusion technologies together realize the main functions of intelligent archive collection and filing, data processing and analysis, and semantic enhancement of archive resources. This provides strong support for the semantic association and intelligent development of archive management, and increases both the association degree and the utilization rate of archive data.
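The fusion steps named in the abstract (deduplication, partition indexing, similarity matching, alignment) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented algorithm: the partition key (lowercased first character), the 0.85 threshold, the use of `difflib.SequenceMatcher` as the text similarity measure, and the greedy first-seen grouping that stands in for the "preset entity alignment algorithm" are all hypothetical choices. Structural similarity is not covered.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def deduplicate(triples):
    """Drop exact duplicate triples while keeping first-seen order."""
    return list(dict.fromkeys(triples))


def build_partition_index(names, key=lambda n: n[:1].lower()):
    """Group names into blocks; only names in the same block are compared.
    The first-character key is an assumed, deliberately crude partition rule."""
    index = defaultdict(list)
    for name in names:
        index[key(name)].append(name)
    return index


def text_similarity(a, b):
    """Case-insensitive similarity in [0, 1] from the standard library."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def align_entities(names, threshold=0.85):
    """Map each name to a canonical name: the first-seen name in its block
    whose similarity reaches the (assumed) threshold, else the name itself."""
    canonical = {}
    for block in build_partition_index(names).values():
        representatives = []
        for name in block:
            match = next(
                (r for r in representatives
                 if text_similarity(name, r) >= threshold),
                None,
            )
            if match is None:
                representatives.append(name)
                match = name
            canonical[name] = match
    return canonical


def fuse(triples, threshold=0.85):
    """Deduplicate, align entity names, rewrite triples, deduplicate again."""
    triples = deduplicate(triples)
    names = list(dict.fromkeys(n for h, _, t in triples for n in (h, t)))
    canonical = align_entities(names, threshold)
    return deduplicate([(canonical[h], r, canonical[t]) for h, r, t in triples])


if __name__ == "__main__":
    raw = [
        ("National Archives", "located_in", "Beijing"),
        ("National Archives", "located_in", "Beijing"),
        ("National Archive", "holds", "Land Records"),
        ("National Archives", "holds", "Land Records"),
    ]
    for triple in fuse(raw):
        print(triple)
```

Partitioning keeps the number of pairwise comparisons small, at the cost of missing matches whose names fall into different blocks; a production system would choose the partition key and similarity measure to suit its data.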

Description

Technical field

[0001] The invention relates to the technical field of information resource management, in particular to a method and system for enhancing the association degree of archive entities based on knowledge graphs.

Background

[0002] Traditional archive management is mainly manual: collection relies on "requests", and archive resources remain at a "preservation-based" stage. Resource collection, description, retrieval and query lack in-depth resource development and sharing. As a result, the value of archival data has not been activated, and the urgent needs of archival researchers for information sharing cannot be met. Target design and task description are carried out around the digitization of archives, archive database systems and the construction of digital archives. Archive management is also gradually shifting toward archive informatization, promoting the further opening and sharing of ...

Application Information

IPC(8): G06F16/36, G06F16/31, G06F16/335, G06F16/35, G06F16/28, G06F40/289, G06F40/30
CPC: G06F16/367, G06F16/313, G06F16/335, G06F16/355, G06F16/288, G06F40/289, G06F40/30
Inventor: 雷洁, 赵瑞雪, 鲜国建, 寇远涛, 侯希闻, 仲晓春, 刘杉, 许怡然, 程思梦
Owner: AGRI INFORMATION INST OF CAS