A Deepdive-based field text knowledge extracting method

A technology of knowledge extraction and text, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of lack and difficulty in data utilization, and achieve the effects of cost reduction, strong practicability and flexibility
CN107169079AActive Publication Date: 2017-09-15ZHEJIANG UNIV

Patent Information

Authority / Receiving Office
CN ยท China
Current Assignee / Owner
ZHEJIANG UNIV
Publication Date
2017-09-15

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a Deepdive-based field text knowledge extracting method comprising the steps of: (1) acquiring original texts required by a knowledge base construction system and performing pretreatment on the texts; (2) performing entity connection on the pre-treated texts, finding out target entities corresponding to a preset specific relation, generating entity-relation-entity triads and forming a candidate relation-entity pair set; (3) learning and labeling a plurality of candidate relation-entity pairs by using a weak supervising method and generating training samples of a Deepdive tool; (4) inputting the training samples into the Deepdive tool to train Deepdive, and outputting candidate relation-entity pairs with probability values greater than a threshold value to form an extracted knowledge base. The method can complete the work of construction of a field knowledge base, has great expandability and is of high practical value for utilization and extraction of unstructured data.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to computer natural language processing technology, and specifically designs a method for extracting domain text knowledge based on Deepdive. Background technique

[0002] The construction of knowledge base has practical significance and application prospect in reality. The daily operation of Apple's Siri and Microsoft's Cortana is based on a large knowledge base, and quickly returns correct answers to users' questions. However, in some vertical fields, such as customer service, finance, chat robots, etc., there is a lack of knowledge bases for specific relationships, or lack of knowledge bases with complete information and timely content updates. If the knowledge base can be automatically constructed for a specific field and some specific relationships, and achieve high accuracy, it can effectively reduce the manpower and time costs in knowledge base construction, and provide more downstream applications. good service. [0003...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More