Chinese word sense disambiguation method based on graph convolutional neural network

A convolutional neural network and word sense disambiguation technology, which is applied in the field of natural language processing, can solve the problems of insufficient extraction of disambiguation features and poor classification effect of classifiers, so as to achieve improved disambiguation accuracy and good classification effect Effect

Active Publication Date: 2021-07-09
HARBIN UNIV OF SCI & TECH
View PDF14 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these traditional algorithms have some shortcomings. They cannot fully extract disambiguation features or are limited to local disambiguation feature extraction, and the classification effect of the classifier is not very good.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese word sense disambiguation method based on graph convolutional neural network
  • Chinese word sense disambiguation method based on graph convolutional neural network
  • Chinese word sense disambiguation method based on graph convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to clearly and completely describe the technical solutions in the embodiments of the present invention, the present invention will be further described in detail below in conjunction with the drawings in the embodiments.

[0060] Take the disambiguation of the ambiguous word "本" in the Chinese sentence "Rural work insists on helping farmers get rich" as an example.

[0061] The flow chart of the Chinese word sense disambiguation method based on the graph convolutional neural network in the embodiment of the present invention, such as figure 1 shown, including the following steps.

[0062] The extraction process of step 1 disambiguation feature is as follows:

[0063] Chinese sentence "Rural work insists on helping farmers get rich."

[0064] Step 1-1 Use the Chinese word segmentation tool to segment Chinese sentences into words. The result of word segmentation is: Rural work insists on helping farmers get rich.

[0065] Step 1-2 Use the Chinese part-of-speec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Chinese word sense disambiguation method based on a graph convolutional neural network (GCN). According to the invention, firstly, Chinese corpora are preprocessed; word segmentation, part-of-speech tagging and semantic tagging processing are performed on statements, containing ambiguous words, of the training and testing corpora; a word sense disambiguation feature graph is constructed by taking sentences where ambiguous words are located and word forms, part-of-speech and semantics contained in the sentences as disambiguation features and nodes, and weights are embedded into the nodes and edges by using Word2Vec and Doc2Vec tools and point mutual information (PMI) and TF-IDF methods; and the GCN model is trained by using the training corpus, and thus optimizing the model; word sense disambiguation is performed on the test corpus by using the optimized GCN model, so that probability distribution of ambiguous vocabularies under each semantic category can be obtained; and the semantic class corresponding to the maximum probability value is judged as the semantic class of the ambiguous vocabulary. The invention has a good word sense disambiguation effect, and the real meaning of the ambiguous vocabulary is more accurately judged.

Description

Technical field: [0001] The invention relates to a Chinese word sense disambiguation method based on a graph convolutional neural network, which can be well applied in the field of natural language processing. Background technique: [0002] Word sense disambiguation is a fundamental research topic in the field of natural language processing. In natural language, polysemy often exists, which often brings some troubles to applications in text classification, machine translation, and information retrieval. According to the specific language environment of the context, finding out the true semantics of ambiguous words and improving the accuracy of word representation will bring better results to the above application fields. [0003] At present, some common algorithms are often used to disambiguate and classify ambiguous words, such as Naive Bayesian, K-means, classification methods based on association rules and artificial neural networks. However, these traditional algorithm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/30G06F40/289G06F40/268G06F16/35G06N3/04G06N3/08
CPCG06F40/30G06F40/289G06F40/268G06F16/353G06N3/08G06N3/047G06N3/045
Inventor 刘睿仇化平黄长帅
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products