Unlock instant, AI-driven research and patent intelligence for your innovation.

A method, device and computer-readable storage medium for constructing a dictionary

A dictionary and real word technology, applied in computer parts, computing, instruments, etc., can solve problems such as not being able to represent semantic information well, candidate word interference, and reducing dictionary accuracy.

Active Publication Date: 2021-02-09
HUAWEI TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, this method of constructing a dictionary based on BoW features only considers the frequency of occurrence of each feature word in the feature words corresponding to the word, and treats each feature word as an independent entity, which cannot well represent the Semantic information, for example, the type tendency of some words is related to its idiomatic usage, common collocation and other information, which cannot be reflected by the BoW features extracted in the interpretation. come more interference, reduce the accuracy of the dictionary

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method, device and computer-readable storage medium for constructing a dictionary
  • A method, device and computer-readable storage medium for constructing a dictionary
  • A method, device and computer-readable storage medium for constructing a dictionary

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The technical solutions in the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0051] The embodiments of the present invention can be applied to the construction and expansion of dictionaries in natural language processing. The dictionaries can be some special-purpose dictionaries, such as sentiment dictionaries and swear words dictionaries, or dictionaries constructed based on actual usage.

[0052] Among them, the general steps of the process of natural language processing are: input language text --> extract features from language text --> build a model based on features --> predict and classify language text, and extract features from language text It is often necessary to use external resources for assistance, and dictionaries are one of the main resources in this regard. That is to say, it is necessary to extract features from language texts based on dictionaries to complete the processing of natural langu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention provide a method and device for constructing a dictionary. The method includes: acquiring a candidate word and the definition of the candidate word; selecting a feature word of the candidate word from the definition of the candidate word; The feature words of the word, through the preset classifier, get the initial judgment result of the candidate word; according to the feature words selected from the interpretation of each middle word in at least one middle word, through the classifier, get each middle word The judgment result of word, wherein, this at least one intermediate word comprises the N-level characteristic words of this candidate word; According to the initial judgment result of this candidate word and the judgment result of this at least one intermediate word, determine the final judgment result of this candidate word, the The final judgment result of the candidate word is used to indicate whether the candidate word can be added to the dictionary. Therefore, the accuracy of the dictionary can be improved.

Description

technical field [0001] Embodiments of the present invention relate to the field of natural language processing, and more specifically, relate to a method and device for constructing a dictionary. Background technique [0002] Dictionaries are key resources in the process of natural language processing. At present, most of the dictionaries are based on artificially constructed dictionaries, that is, dictionaries that are manually recognized in the corpus. However, the disadvantage of artificially constructed dictionaries is that the words in the dictionary are not perfect. , especially for the existing emerging new words on the Internet, the shortcomings of artificially constructed dictionaries are more obvious, and cannot well meet the practical application. [0003] In order to make the construction of the dictionary more perfect, the way of automatically constructing the dictionary is introduced. At present, a known method of constructing a dictionary is to search for the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/242G06F40/30G06F40/211G06K9/62
CPCG06F40/242G06F40/211G06F40/30G06F18/24
Inventor 张旸王雅圣毕舒展颜友亮
Owner HUAWEI TECH CO LTD