Method and device for mining hypernym-hyponym relation between domain-specific terms

This technology, applied to mining the hypernym-hyponym relation between domain terms, addresses problems of existing approaches: pattern sets that are difficult to construct, vocabulary that is difficult to find, and reduced accuracy of extraction results.

Inactive Publication Date: 2017-04-19
CHINA MOBILE COMM GRP CO LTD

AI Technical Summary

Problems solved by technology

[0007] First, the template-based method: because of the complexity of the Chinese language and the lack of semantic analysis during pattern matching, a large number of useless concept pairs are also matched, which greatly reduces the accuracy of the extraction results; and because Chinese syntax takes many forms, it is difficult to construct a relatively complete set of patterns.
[0008] Second, the dictionary-based method: the cost of building, maintaining, and updating the dictionary is too high, and it is difficult to find many profes...



Examples


Embodiment 1

[0084] According to one aspect of an embodiment of the present invention, a method for mining the hypernym-hyponym relation between domain terms is provided. First, based on a plurality of first predetermined domain terms, the method collects the entry explanation sentence in which a first domain term appears on a word bank page, where the first domain term is a word semantically related to the first predetermined domain term. Then, it uses a hypernym-hyponym relation model file generated in advance with a CRF tool to obtain the hypernym-hyponym relation between the first domain term and the words included in the entry explanation sentence.

[0085] Therefore, the method for mining the hypernym-hyponym relation between domain terms in this embodiment takes the entry explanation sentences of domain terms on word bank pages, trains on them with CRF machine learning, finally establishes a model file, and uses the model to obtain the hypernym-hyponym relation betwe...
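
As a rough illustration of the final step, the sketch below shows one way per-token labels from a sequence labeller could be turned into ISA pairs. It assumes, and the source does not state this, that the CRF model tags each token of the entry explanation sentence with a BIO-style label in which `B-HYPER` and `I-HYPER` mark a hypernym span. The label names, the function `extract_hypernyms`, and the sample sentence are all hypothetical, and the CRF model itself is not shown.

```python
from typing import List, Tuple


def extract_hypernyms(
    term: str, tokens: List[str], labels: List[str]
) -> List[Tuple[str, str]]:
    """Turn per-token labels into (hyponym, hypernym) pairs for one sentence.

    Assumed label scheme (hypothetical, not from the patent text):
    B-HYPER starts a hypernym span, I-HYPER continues it, and any other
    label is outside a span.
    """
    if len(tokens) != len(labels):
        raise ValueError("tokens and labels must have the same length")

    pairs: List[Tuple[str, str]] = []
    span: List[str] = []
    for token, label in zip(tokens, labels):
        if label == "B-HYPER":
            # A new span begins; flush any span still open.
            if span:
                pairs.append((term, " ".join(span)))
            span = [token]
        elif label == "I-HYPER" and span:
            span.append(token)
        else:
            # Outside a span (or a stray I-HYPER): close the open span.
            if span:
                pairs.append((term, " ".join(span)))
            span = []
    if span:
        pairs.append((term, " ".join(span)))
    return pairs


# Hand-written labels standing in for the output of a trained model.
tokens = ["carbon", "dioxide", "is", "a", "greenhouse", "gas"]
labels = ["O", "O", "O", "O", "B-HYPER", "I-HYPER"]
pairs = extract_hypernyms("carbon dioxide", tokens, labels)
print(pairs)
```

Tokens are joined with spaces here for readability in English; for Chinese text the tokens would more naturally be concatenated without a separator.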

Embodiment 2

[0134] According to another aspect of the embodiments of the present invention, a device for mining the hypernym-hyponym relation between domain terms is also provided. As shown in Figure 2, the device 200 includes:

[0135] A collection module 201, configured to collect, based on a plurality of first predetermined domain terms, the entry explanation sentence in which a first domain term appears on a word bank page, where the first domain term is a word semantically related to the first predetermined domain term;

[0136] A relation acquiring module 205, configured to acquire the hypernym-hyponym relation between the first domain term and the words included in the entry explanation sentence by using the hypernym-hyponym relation model file generated in advance with the CRF tool.

[0137] Optionally, as shown in Figure 2, the device also includes:

[0138] A model building module 203, configured to use the CRF tool to generate the hypernym-hyponym rela...
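
The way the collection module and the relation acquiring module fit together might be sketched as follows. This is only a structural illustration under stated assumptions: the class `MiningDevice`, the type aliases, and both stub functions are hypothetical. In particular, `fake_model` is a trivial string-matching stand-in and not a CRF model, and `fake_collect` returns a fixed sentence instead of fetching word bank pages.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical interfaces, introduced only for this sketch.
# Collector: seed terms -> {domain term: entry explanation sentence}
Collector = Callable[[List[str]], Dict[str, str]]
# RelationModel: (domain term, sentence) -> hypernyms found in the sentence
RelationModel = Callable[[str, str], List[str]]


@dataclass
class MiningDevice:
    collect: Collector      # plays the role of collection module 201
    acquire: RelationModel  # plays the role of relation acquiring module 205

    def run(self, seed_terms: List[str]) -> List[Tuple[str, str]]:
        """Collect explanation sentences, then extract (term, hypernym) pairs."""
        pairs: List[Tuple[str, str]] = []
        for term, sentence in self.collect(seed_terms).items():
            for hypernym in self.acquire(term, sentence):
                pairs.append((term, hypernym))
        return pairs


def fake_collect(seed_terms: List[str]) -> Dict[str, str]:
    # Stand-in for fetching entry explanation sentences from word bank pages.
    return {
        "4G business travel package":
            "4G business travel package is a tariff package"
    }


def fake_model(term: str, sentence: str) -> List[str]:
    # Stand-in for the pre-generated model file; NOT a CRF, just a split.
    marker = " is a "
    if marker in sentence:
        return [sentence.split(marker, 1)[1]]
    return []


device = MiningDevice(collect=fake_collect, acquire=fake_model)
result = device.run(["tariff"])
print(result)
```

Passing the two modules in as callables keeps the sketch independent of any particular crawler or CRF toolkit; a real model would replace `fake_model` behind the same interface.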


Abstract

The invention provides a method and device for mining a hypernym-hyponym relation between domain-specific terms. The method includes: acquiring, according to a plurality of first preset domain-specific terms, the entry explanation sentence in which a first domain-specific term appears on a word bank page, wherein the first domain-specific term is a word semantically related to the first preset domain-specific terms; and acquiring the hypernym-hyponym relation between the first domain-specific term and the words included in the entry explanation sentence by using a hypernym-hyponym relation model file generated in advance with a conditional random field (CRF) tool. According to the scheme of the invention, the entry explanation sentences in which domain-specific terms appear on word bank pages are used for training with CRF machine learning, a model file is then established, and the hypernym-hyponym relation between the first domain-specific term and the words included in the entry explanation sentence is acquired by using the model file, so that the accuracy of acquiring the hypernym-hyponym relation can be improved.

Description

Technical field

[0001] The present invention relates to the technical field of data services, and in particular to a method and device for mining the hypernym-hyponym relation between domain terms.

Background art

[0002] The hypernym-hyponym relation is a semantic relation that is often used in building and improving dictionaries, ontologies, and knowledge bases. In ontology learning it is defined as follows: given two words D and U, if the meaning of U contains the meaning of D, then U and D are considered to have a hypernym-hyponym relation, where U is the superordinate concept (hypernym) of D and D is a sub-concept (hyponym) of U, denoted ISA(D, U). For example, ISA(carbon dioxide, greenhouse gas) and ISA(4G business travel package, tariff package). The hypernym-hyponym relation can be applied to query expansion in search engines or to automatic responses. For example, when a user searches for "4G business travel package", the superordinate concept of the sea...
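
The ISA(D, U) notation and its use in query expansion can be illustrated with a small sketch. The class `IsaStore` and its methods are hypothetical names introduced here, and the expansion policy (appending the direct hypernyms to the original query) is an assumption, since the source text is cut off before it describes the behaviour in detail.

```python
from collections import defaultdict
from typing import Dict, List, Set


class IsaStore:
    """Stores ISA(D, U) facts: U is a hypernym of D."""

    def __init__(self) -> None:
        self._hypernyms: Dict[str, Set[str]] = defaultdict(set)

    def add(self, d: str, u: str) -> None:
        """Record ISA(d, u)."""
        self._hypernyms[d].add(u)

    def hypernyms(self, d: str) -> List[str]:
        """Return the direct hypernyms of d in sorted order."""
        return sorted(self._hypernyms.get(d, set()))

    def expand_query(self, query: str) -> List[str]:
        """Return the query followed by its direct hypernyms (assumed policy)."""
        return [query] + self.hypernyms(query)


store = IsaStore()
# The two example facts given in the background section.
store.add("carbon dioxide", "greenhouse gas")
store.add("4G business travel package", "tariff package")

expanded = store.expand_query("4G business travel package")
print(expanded)
```

A query with no recorded hypernym is returned unchanged, so the expansion never removes the user's original search term.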


Application Information

IPC(8): G06F17/27
Inventor: 黄毅, 邓路, 夏爽
Owner CHINA MOBILE COMM GRP CO LTD