Knowledge graph construction system

A knowledge graph and knowledge technology, applied in the field of knowledge graph construction system, can solve problems such as low system efficiency

Active Publication Date: 2016-11-30
EMOTIBOT TECH LTD
View PDF3 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problem of low efficiency caused by poor system design in existing large-scale knowledge mining and knowledge discovery applications, the present invention proposes a knowledge map construction system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Knowledge graph construction system
  • Knowledge graph construction system
  • Knowledge graph construction system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0024] Such as figure 1 As shown, the knowledge map construction system of this embodiment includes a crawler cluster 10, a Hadoop distributed storage cluster 20, a natural language processing cluster 50, a Mahout knowledge mining module 30, and a knowledge database 40; the crawler cluster 10 is used to capture webpage data, and store the webpage data in the webpage HBase table, and the webpage HBase table is stored in the Hadoop distributed storage cluster; the natural language processing cluster 50 is used to obtain the webpage HBase table from the Hadoop distributed storage cluster to generate original knowledge information, And the original knowledge information is stored in the original knowledge HBase table, and the original knowledge HBase table is stored in the Hadoop distributed storage cluster; the Mahout knowledge mining module 30 is used to carry out knowledge mining to the original knowledge information, generate unstructured data, and unstructured data The struct...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention belongs to the technical field of large-scale data mining, and specifically relates to a knowledge graph construction system. The knowledge graph construction system comprises a crawler cluster, a Hadoop distributed storage cluster, a natural language processing cluster, a Mahout knowledge mining module and a knowledge database. The crawler cluster is used for crawling webpage data according to seed addresses and storing the webpage data into a webpage HBase table. The natural language processing cluster is used for obtaining the webpage HBase table from the Hadoop distributed storage cluster, generating original knowledge information, and storing the original knowledge information in an original knowledge HBase table. The Mahout knowledge mining module is used for performing knowledge mining on the original knowledge information, generating unstructured data, and storing the unstructured data in an unstructured data HBase table. The knowledge database is used for constructing a knowledge graph according to the manually reviewed unstructured data.

Description

technical field [0001] The invention belongs to the technical field of large-scale data mining, and in particular relates to a knowledge map construction system. Background technique [0002] The construction of knowledge graph plays a great role in the understanding and accurate answering of intelligent dialogue knowledge questions; therefore, for the background of the dialogue system, how to quickly and effectively mine valuable knowledge information from a large number of regular and irregular data has become The key to building a knowledge graph. Among them, crawlers need to be used to capture and store a large amount of relevant data; data processing is performed on the data captured in the background to extract relevant information; for the extracted information, structured data can be processed and stored in a relatively simple manner. For unstructured information, further data processing should be done through algorithms such as word segmentation, named entity recog...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/36
Inventor 刘涛祖佺
Owner EMOTIBOT TECH LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products