Method and system for automatic construction of knowledge graph for massive unstructured text

An unstructured, knowledge graph technology, applied in the field of computer software, can solve problems such as low degree of automation, difficult maintenance, and large workload, and achieve the effect of reducing human resource costs, improving time efficiency, and improving construction speed

Active Publication Date: 2020-04-28
GLOBAL TONE COMM TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] To sum up, the problems existing in the existing technology are: At present, there are few automatic construction methods of knowledge graphs for massive unstructured texts, and the technical difficulty is relatively high.
The existing methods are mainly manual, with a low degree of automation, requiring heavy manual labor to construct, trim, deduplicate, process, and align the atlas. The entire process is highly professional, heavy workload, and difficult to maintain.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for automatic construction of knowledge graph for massive unstructured text
  • Method and system for automatic construction of knowledge graph for massive unstructured text
  • Method and system for automatic construction of knowledge graph for massive unstructured text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0046] The method for automatically constructing knowledge graphs oriented to unstructured Internet texts provided by the present invention is more universal, and can rapidly build large-scale knowledge graphs.

[0047] The application principle of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0048] like figure 1 As shown, the method for automatically constructing a knowledge map for massive unstructured text provided by the embodiment of the present invention includes the following steps:

[0049] S101: abstract the problem of named entity recognition into a seq...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to the technical field of computer software. Disclosed are a knowledge graph automatic construction method and system for massive unstructured text. A named entity recognition problem is abstracted into one sequence labeling problem: one sentence is given, each word in a sentence sequence is labeled; an effective feature is designed on the basis of training data, various classification models are learned, a trained classifier is used to predict relations; multiple pieces of existing knowledge are linked, one large-scale unified knowledge network is created from the top level; and entity information are grabbed from three major online encyclopedias, open websites, relevant knowledge bases or search engine logs and integrated. The present invention significantly increases the speed of constructing a knowledge graph, increases time efficiency, reduces costs for labor resource by 30% or more. At the same time, the present invention provides improved domain portability, when constructing the knowledge graph, optimization is only required for entities and a relational extraction algorithm in the present invention for rapid implementation.

Description

technical field [0001] The invention belongs to the technical field of computer software, and in particular relates to a method and system for automatically constructing a knowledge graph facing massive unstructured texts. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: Knowledge Graph aims to describe the entities of the objective world and the relationship between them. attribute composition. In 2012, Google first launched the knowledge graph and used it to enhance search results in search engines, which also marked the successful application of large-scale knowledge graphs in Internet semantic search. In other words, the knowledge graph is composed of a large amount of knowledge, and each piece of knowledge is represented by a triplet, for example: (China, capital, Beijing). At present, knowledge graphs are mostly extracted and constructed from encyclopedic structured data; knowledge graphs can serve cust...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/36G06F16/338
CPCG06F16/00
Inventor 李世奇程国艮
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products