A method of word sense disambiguation in Chinese sentences based on a convolutional neural network

Convolutional neural network and word sense disambiguation technology, applied in biological neural network models, semantic analysis, neural architectures, etc., can solve problems such as poor classifier training, and achieves multi-category data processing, a reduced amount of data and number of parameters, and prevention of overfitting.

Pending Publication Date: 2019-01-15
HARBIN UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

Traditional algorithms have several shortcomings: the extracted disambiguation features are limited to the local context, and the classifier trains poorly.


Embodiment Construction

[0057] In order to clearly and completely describe the technical solutions in the embodiments of the present invention, the present invention will be further described in detail below in conjunction with the drawings in the embodiments.

[0058] Take the disambiguation of the ambiguous word "son and daughter" in the Chinese sentence "The excellent traditional culture jointly created by the sons and daughters of all ethnic groups in China has always been an important basis for maintaining the spiritual bond of all Chinese people and achieving peaceful reunification" as an example.

[0059] The flow chart of the convolutional-neural-network-based Chinese word sense disambiguation method in this embodiment is shown in Figure 1 and includes the following steps.

[0060] Step 1: The disambiguation features are extracted as follows:

[0061] Chinese sentence: The excellent traditional culture jointly created by the sons and daughters of al...

Abstract

The invention relates to a word sense disambiguation method based on a convolutional neural network (CNN). The Chinese corpus is first preprocessed: sentences containing ambiguous words undergo word segmentation, part-of-speech tagging and semantic tagging, yielding a training corpus and a test corpus. The CNN model is then trained on the training corpus to obtain an optimized model. The optimized model disambiguates the test corpus, producing for each ambiguous word a probability distribution over its semantic categories; the category with the highest probability is taken as the sense of the ambiguous word. The invention disambiguates ambiguous words well and judges their true meaning more accurately.
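The classification step described in the abstract can be illustrated with a minimal sketch. It is not the patent's actual architecture: it assumes a single convolution layer over token embeddings, max-over-time pooling and a softmax output, and it uses random, untrained weights. The function name `cnn_disambiguate`, the parameter names and the token ids are all hypothetical, and the part-of-speech and semantic-tag features mentioned in the abstract are left out for brevity.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D score vector."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def cnn_disambiguate(token_ids, params):
    """Tiny text-CNN forward pass: embed -> 1D conv -> ReLU -> max-pool -> softmax.

    Returns the probability of each semantic category and the index of the
    most probable one. All shapes and names here are illustrative assumptions.
    """
    E, W, b, U, c = (params[name] for name in ("E", "W", "b", "U", "c"))
    x = E[token_ids]                      # (seq_len, emb_dim)
    k = W.shape[1]                        # W has shape (n_filters, k, emb_dim)
    # Collect every window of k consecutive token embeddings.
    windows = np.stack([x[i:i + k] for i in range(len(token_ids) - k + 1)])
    # Apply all filters to all windows: result is (n_windows, n_filters).
    conv = np.tensordot(windows, W, axes=([1, 2], [1, 2])) + b
    pooled = np.maximum(conv, 0.0).max(axis=0)   # ReLU, then max over time
    probs = softmax(U @ pooled + c)              # (n_senses,)
    return probs, int(np.argmax(probs))

# Hypothetical sizes and random (untrained) parameters, for illustration only.
rng = np.random.default_rng(0)
vocab_size, emb_dim, n_filters, k, n_senses = 50, 8, 4, 3, 3
params = {
    "E": rng.normal(size=(vocab_size, emb_dim)),    # embedding table
    "W": rng.normal(size=(n_filters, k, emb_dim)),  # convolution filters
    "b": np.zeros(n_filters),
    "U": rng.normal(size=(n_senses, n_filters)),    # output layer
    "c": np.zeros(n_senses),
}
token_ids = np.array([3, 17, 5, 42, 8, 21, 9])      # a segmented sentence as ids
probs, sense = cnn_disambiguate(token_ids, params)
print(probs, sense)
```

In a trained model, the parameters would be fitted on the training corpus, and `sense` would index the semantic category assigned to the ambiguous word.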

Description

Technical field

[0001] The invention relates to a Chinese word sense disambiguation method based on a convolutional neural network; the method is well suited to applications in natural language processing.

Background

[0002] In natural language processing, words are often polysemous. The purpose of word sense disambiguation is to determine the meaning of an ambiguous word in a specific context. Word sense disambiguation has important applications in machine translation, automatic summarization, information retrieval and text classification, and the performance of these applications depends closely on it.

[0003] Several common algorithms are used to disambiguate and classify words, such as k-means, naive Bayes, classification methods based on association rules, and artificial neural networks. However, there are some shortcomings and deficiencies in the traditional algorithm. The extracted disambiguation features ar...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06N3/04
CPC: G06F40/211, G06F40/30, G06N3/045
Inventors: 张春祥, 赵凌云, 周雪松
Owner: HARBIN UNIV OF SCI & TECH