Method and device for checking knowledge base triad

A technique for checking knowledge base triples, applied in the field of knowledge bases, that addresses the problem that checking efficiency struggles to meet the needs of building a large-scale knowledge base.

Active Publication Date: 2018-05-11
NEW FOUNDER HLDG DEV LLC +2

AI Technical Summary

Problems solved by technology

For such errors, it is difficult to meet the needs of building a large-



Examples


Embodiment 1

[0110] Figure 1 is a flowchart of a method for checking knowledge base triples provided by Embodiment 1 of the present invention. As shown in Figure 1, this embodiment provides a method for checking knowledge base triples, with the following specific steps:

[0111] S101. Obtain M words in the corpus that characterize the first relationship as target feature words, and obtain the first weight value of each target feature word. The corpus includes multiple sentences, and each sentence includes at least one word, where M is a positive integer.

[0112] In this embodiment, the corpus is a large-scale electronic text library built through scientific sampling and processing. It stores language material that has actually appeared in real language use, such as literary works and passages from newspapers and magazines. The corpus includes a plurality of sentences, and each sentence includes at least one wor...

Embodiment 2

[0120] Figure 2 is a flowchart of a method for checking knowledge base triples provided by Embodiment 2 of the present invention. As shown in Figure 2, the method for checking knowledge base triples provided in this embodiment specifically includes the following steps:

[0121] S201. Obtain N target triples in the knowledge base whose relationship is the first relationship, where N is a positive integer.

[0122] For convenience of description, in this embodiment the first relationship is set to "teacher-student". First, obtain N target triples whose relationship is "teacher-student" from the knowledge base, such as , , etc. The triples can be acquired at random or according to predetermined rules. The number N of target triples can be chosen according to actual needs. The larger N is, the better the M target feature words obtained represent the "teacher-student" relationship, and the corresponding first weight value is more acc...
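The idea of deriving target feature words from N known triples can be sketched as follows. This is a minimal sketch under stated assumptions, not the procedure of the patent: the excerpt does not say how words are scored, so the sketch assumes a word's weight is its relative frequency in sentences that mention both entities of a target triple. The function name `select_feature_words`, the toy corpus, and the entity names are all hypothetical.

```python
from collections import Counter


def select_feature_words(corpus, target_triples, m):
    """Return up to m co-occurring words with normalized weights.

    corpus: list of sentences, each a list of words.
    target_triples: list of (head, relation, tail) tuples sharing one relation.
    Assumption: weight = word count / total count over the selected words.
    """
    counts = Counter()
    for head, _relation, tail in target_triples:
        for sentence in corpus:
            if head in sentence and tail in sentence:
                # Count every word except the two entities themselves.
                counts.update(w for w in sentence if w not in (head, tail))
    top = counts.most_common(m)
    total = sum(count for _, count in top)
    return {word: count / total for word, count in top} if total else {}


# Hypothetical toy data: entity names are placeholders.
corpus = [
    ["A", "studied", "under", "B"],
    ["A", "was", "taught", "by", "B"],
    ["C", "studied", "under", "D"],
    ["C", "met", "E"],
]
triples = [("A", "teacher-student", "B"), ("C", "teacher-student", "D")]
weights = select_feature_words(corpus, triples, m=2)
print(weights)
```

With this toy data the two selected words are "studied" and "under", each with weight 0.5. A real corpus would need tokenization and stop-word handling, which the sketch leaves out.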

Embodiment 3

[0191] In this embodiment, a specific example is given to illustrate the method for checking knowledge base triples.

[0192] Using the above embodiments, the triples in the knowledge base whose first relationship is "teacher-student" are checked. N=100 triples whose first relationship is "teacher-student" are randomly selected, and after S201-S205 (with M=200, Q=10) the target feature words shown in Table 1 are obtained (only some of the target feature words are shown):

[0193] Table 1

[0194]

[0195] Based on the 200 target feature words and the corresponding first weight values obtained above, S206 is applied to obtain the confidence of each triple to be tested whose first relationship is "teacher-student" in the knowledge base, which yields Table 2.
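A confidence computation of this kind might look like the sketch below. The formula used in S206 is not included in this excerpt, so the scoring rule here is an assumption: for each sentence that mentions both entities of the triple, sum the weights of the feature words it contains, then average over those sentences. The name `triple_confidence`, the weight values, and the corpus are hypothetical.

```python
def triple_confidence(corpus, triple, weights):
    """Score a triple by the feature words found near its two entities.

    Assumption: confidence = mean, over sentences containing both entities,
    of the summed weights of the distinct feature words in that sentence.
    Returns 0.0 when no sentence mentions both entities.
    """
    head, _relation, tail = triple
    scores = []
    for sentence in corpus:
        if head in sentence and tail in sentence:
            scores.append(sum(weights.get(w, 0.0) for w in set(sentence)))
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical weights and corpus for illustration only.
feature_weights = {"studied": 0.5, "under": 0.5}
sentences = [
    ["F", "studied", "under", "G"],
    ["C", "met", "E"],
]
high = triple_confidence(sentences, ("F", "teacher-student", "G"), feature_weights)
low = triple_confidence(sentences, ("C", "teacher-student", "E"), feature_weights)
print(high, low)
```

Here the triple whose entities co-occur with both feature words scores 1.0, while the triple whose only shared sentence contains no feature word scores 0.0.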

[0196] Table 2

[0197]

[0198] Further, by selecting the top L (L=100) triples by confidence as positive triples, and obtaining S triples whose relationsh...
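Selecting the L highest-confidence triples as positive triples can be sketched as a simple ranking. This is only an illustration: the helper name `top_l_triples` and the confidence values are hypothetical, and what the method does with the positive triples afterwards is truncated in this excerpt.

```python
def top_l_triples(confidences, l):
    """Return the l triples with the highest confidence, best first.

    confidences: dict mapping (head, relation, tail) to a float score.
    """
    ranked = sorted(confidences.items(), key=lambda item: item[1], reverse=True)
    return [triple for triple, _score in ranked[:l]]


# Hypothetical confidence values.
scores = {
    ("A", "teacher-student", "B"): 0.9,
    ("C", "teacher-student", "E"): 0.1,
    ("F", "teacher-student", "G"): 0.7,
}
positives = top_l_triples(scores, l=2)
print(positives)
```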


Abstract

The invention provides a method and a device for checking a knowledge base triad. The method comprises: taking M terms in a corpus that represent a first relation as target feature terms, and acquiring first weight values of the target feature terms; acquiring, according to the first weight values, the confidence of a to-be-checked triad with the first relation in a knowledge base; and determining, according to the confidence, whether the to-be-checked triad is credible. Because credibility is determined from the confidence of the to-be-checked triad, triads can be checked individually or in batches. This improves checking efficiency, reduces manual checking cost in practical application, and substantially improves the efficiency of constructing a high-quality knowledge base. Moreover, the confidence gives an accurate measure of how credible a triad is, the method generalizes well across different types of knowledge base triads, and it can be applied to triad checking in any knowledge base.
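The final decision step, applied to one triad or to a batch, can be sketched as below. The abstract only states that credibility is determined according to the confidence, so the fixed threshold used here is an assumption, and `check_triples`, the value 0.5, and the sample confidences are hypothetical.

```python
def check_triples(confidences, threshold):
    """Mark each triple credible when its confidence reaches the threshold.

    confidences: dict mapping (head, relation, tail) to a float score.
    Assumption: a single fixed threshold separates credible from suspect.
    """
    return {triple: score >= threshold for triple, score in confidences.items()}


# Hypothetical batch of already-scored triples.
batch = {
    ("A", "teacher-student", "B"): 0.9,
    ("C", "teacher-student", "E"): 0.1,
}
verdicts = check_triples(batch, threshold=0.5)
print(verdicts)
```

Passing a dict with a single entry covers the separate-checking case, and a larger dict covers batch checking.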

Description

Technical field

[0001] Embodiments of the present invention relate to the field of knowledge bases, and in particular to a method and device for checking knowledge base triples.

Background art

[0002] A knowledge service is a high-level information service process that extracts knowledge from various explicit and tacit knowledge resources according to people's needs and uses it to solve user problems. The knowledge base is an important form of data organization in knowledge services. It usually consists of a number of triples, and the accuracy of its content directly determines the effectiveness of the knowledge service.

[0003] However, the data sources for constructing a knowledge base are complex and diverse. In form alone they include structured, semi-structured, and unstructured data, and the extraction process may introduce erroneous information. A typical type of error is an error in the relationship expressed by a triple, for example: the triple <Li ...


Application Information

IPC(8): G06F17/30
CPC: G06F16/35, G06F16/36
Inventor: 谢海华, 黄肖俊, 吕肖庆, 汤帜
Owner: NEW FOUNDER HLDG DEV LLC