Chinese word sense disambiguation method based on fusion of graph convolutional neural network and support vector machine

A technology of convolutional neural network and support vector machine, applied in biological neural network model, neural architecture, semantic analysis, etc., can solve polysemy and other problems of a word, and achieve good disambiguation features, good classification effect, and accurate disambiguation rate-enhancing effect

Pending Publication Date: 2022-03-15
HARBIN UNIV OF SCI & TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, in order to solve the phenomenon of polysemy in Chinese in the field of natural language processing, the present invention discloses a Chinese word sense disambiguation method based on graph convolutional neural network fusion support vector machine

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese word sense disambiguation method based on fusion of graph convolutional neural network and support vector machine
  • Chinese word sense disambiguation method based on fusion of graph convolutional neural network and support vector machine
  • Chinese word sense disambiguation method based on fusion of graph convolutional neural network and support vector machine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] In order to clearly and completely describe the technical solutions in the embodiments of the present invention, the present invention will be further described in detail below in conjunction with the drawings in the embodiments.

[0062] Take the disambiguation process of the ambiguous word "surface" in the Chinese sentence "this can remove the pesticide residue on the surface of vegetables" as an example.

[0063] The embodiment of the present invention is based on the framework of the Chinese word sense disambiguation method based on the graph convolutional neural network fusion support vector machine, such as figure 1 shown, including the following steps.

[0064] Step 1 The extraction process of disambiguation features is as follows:

[0065] Chinese sentence "This can remove the residual pesticides on the surface of vegetables."

[0066] Step 1-1 uses the Chinese word segmentation tool to perform word segmentation on the Chinese sentence, and the word segmentati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Chinese word sense disambiguation method based on a graph convolutional neural network (GCN) fused with a support vector machine (SVM), in particular to a Chinese word sense disambiguation method based on the graph convolutional neural network (GCN) fused with the support vector machine (SVM) and a Chinese word sense disambiguation method based on the graph convolutional neural network (GCN) fused with the Chinese word sense disambiguation method based on the graph convolutional neural network (GCN) fused with the support vector machine (SVM). The method comprises the steps of firstly preprocessing corpora; and performing word segmentation, part-of-speech tagging and semantic tagging processing on statements of the training and testing corpora. A word sense disambiguation graph is constructed by taking sentences where ambiguous words are located and word forms, part-of-speech and semantic classes of vocabulary units on two sides of the ambiguous words as disambiguation features and taking the disambiguation features as nodes. Weights of nodes and edges in the graph are calculated by using Word2Vec, a Doc2Vec tool, point-by-point mutual information (PMI) and a TF-IDF algorithm. And training the GCN model by the training corpus, and optimizing the model. And calculating disambiguation features of training and testing corpora by using the optimized GCN model, inputting the calculated disambiguation features of the training corpora into an SVM classifier, optimizing the SVM classifier, and classifying the testing corpora to obtain classification conditions of ambiguous vocabularies under semantic categories. The method has a good word sense disambiguation effect, and the real meaning of the ambiguous vocabulary is accurately judged.

Description

Technical field: [0001] The invention relates to a Chinese word sense disambiguation method based on a graph convolutional neural network fusion support vector machine, and the method can be well applied in the field of natural language processing. Background technique: [0002] When it comes to natural language processing, it often involves the phenomenon of polysemy in language, which affects natural language processing such as machine translation with text understanding capabilities, automatic summarization, question answering systems, public opinion analysis, machine writing, information retrieval, and text classification. field application. In order to make the above application fields have better accuracy or obtain results more in line with people's expectations. It is necessary to disambiguate words with multiple semantics, that is, word meaning disambiguation, and find out the true semantics of ambiguous words according to the specific language environment of the co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/30G06K9/62G06N3/04G06F16/35G06F16/33
CPCG06F40/289G06F40/30G06F16/353G06F16/3347G06N3/047G06N3/045G06F18/214G06F18/2411G06F18/253G06F18/29
Inventor 刘睿仇化平赫斌
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products