Dictionary-based sememe knowledge base construction method and device

A construction method and knowledge base technology, applied in the field of natural language processing, can solve the time-consuming and labor-intensive problems of sememe knowledge base, and achieve the effect of time-consuming, labor-intensive and good practicability

Pending Publication Date: 2021-10-15
TSINGHUA UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, at present, most languages ​​do not have a sememe knowledge base similar to HowNet, and building a sememe knowledge base b...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dictionary-based sememe knowledge base construction method and device
  • Dictionary-based sememe knowledge base construction method and device
  • Dictionary-based sememe knowledge base construction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the present invention. Obviously, the described embodiments are part of the embodiments of the present invention , but not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0039] At present, most languages ​​do not have a sememe knowledge base similar to HowNet, and constructing a sememe knowledge base by manual annotation is time-consuming and laborious. In order to solve this problem, it is very meaningful to use a computer to automatically build a sememe knowledge base. Different languages ​​often have a class of learning dictionaries. These dictionaries gen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a dictionary-based sememe knowledge base construction method and device, and the method comprises the steps: constructing a sememe set according to a controlled word list of a target language dictionary; obtaining a paraphrase word set corresponding to the paraphrase of each semantic item according to the semantic item of each word in the target language dictionary; and according to the sememe set, performing sememe extraction on the paraphrase word set, and according to a sememe extraction result, constructing a sememe knowledge base corresponding to the target language dictionary. Through the dictionary of the target language and the controlled word list corresponding to the dictionary, the primitive knowledge base can be efficiently, economically and automatically constructed for the target language, the problem that time and labor are wasted when the primitive knowledge base is manually constructed is solved, and good practicability is achieved.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a method and device for constructing a dictionary-based sememe knowledge base. Background technique [0002] In linguistics, a word is defined as the smallest meaningful unit that can be used independently, but not the smallest indivisible semantic unit. Thus, words can be further subdivided into smaller semantic elements. For example, "woman" is further split into "human", "female", and "adult". The smallest indivisible semantic unit in human language is called a sememe. Some linguists believe that the semantics of all words, as well as other concepts, can be represented by a finite set of sememes. Through sememe, words can be analyzed in a more fine-grained manner, which can further help people understand the nature of language. [0003] However, for most human languages, sememes are often very obscure. At present, the sememe knowledge base is mainly con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/36G06F16/21
CPCG06F16/374G06F16/211
Inventor 孙茂松岂凡超刘知远王凤玉
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products