Multitask learning Chinese language disease diagnosis method based on bidirectional long-short-term memory network

A technology of multi-task learning and long-short-term memory, applied in the field of diagnosis of Chinese language disorders in multi-task learning, can solve problems such as the difficulty of grammatical analysis, and achieve the effect of solving Chinese grammatical errors and solving assistance

Active Publication Date: 2020-03-17
成都中科云集信息技术有限公司
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The difficulty of grammatical error checking lies in the fact that there are no morphological changes in Chinese parts of speech, there is no simple correspondence between parts of speech and syntactic components, and the flexibility of Chinese word order makes Chinese grammatical analysis very difficult.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multitask learning Chinese language disease diagnosis method based on bidirectional long-short-term memory network
  • Multitask learning Chinese language disease diagnosis method based on bidirectional long-short-term memory network
  • Multitask learning Chinese language disease diagnosis method based on bidirectional long-short-term memory network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The implementation of the present invention is divided into two parts: the training of the model and the use of the model. The specific implementation manners of the present invention will be described in further detail below according to the drawings and examples. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0015] figure 2 It is a schematic diagram of the model training framework of an embodiment of the present invention.

[0016] The sentence sentence disease detection model of the multi-task learning of the bidirectional long short-term memory network is as follows: figure 1 shown. Among them, task1 is the task of sentence language disorder classification, and task2 is the task of sentence language disorder type and location detection. The model uses Bi-LSTM, and the hidden layer is a network shared by task1 and task2, which is used to extract common features of the task...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a multitask learning Chinese language disease diagnosis method based on a bidirectional long-short-term memory network. The method comprises the following steps of: a Chinese level examination composition correction data set provided by a Chinese grammar error diagnosis task (CGED) in recent years is obtained; a model input sequence is obtained through fusion word embedding,text features are fully utilized, and the diagnosis effect is improved; the obtained input sequence is input into Bi-LSTM, and a Chinese grammar error diagnosis model is obtained through learning ofa neural network; a multi-task learning method is adopted, whether sentences have morbidity or not is detected as a main task, the morbidity types and error positions of the sentences are detected asauxiliary tasks, hidden layer parameters are shared among multiple tasks, and output layers related to the tasks are reserved; according to the method, the relevance between the task of detecting whether the Chinese statement has the morbidity and the task of detecting the morbidity type and the error position is fully utilized; the problem that the detection effect is poor due to the fact that nogood feature exists in the process of detecting whether the Chinese statement has the morbidity task or not is solved, and meanwhile the generalization ability of the model is improved to a certain degree due to joint optimization of multiple tasks.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method for diagnosing Chinese language disorders in multi-task learning based on a bidirectional long-short-term memory network. Background technique [0002] Chinese is considered to be one of the most difficult languages ​​in the world. Unlike English, Chinese does not have verb tenses and plurals, and there are usually multiple ways of expressing the same meaning in Chinese. Therefore, non-native speakers of Chinese often make various grammatical errors in their writing, and the research on the diagnosis of Chinese grammatical errors has become an urgent topic to be solved. [0003] In terms of grammar, due to the characteristics of Chinese itself and the limitations of Chinese theoretical research, the difficulty of Chinese grammar proofreading is higher than that of English. The difficulty of grammatical error checking lies in the fact that there is no morphological ch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284G06N3/04
CPCG06N3/049
Inventor 田文洪黎在万高印权
Owner 成都中科云集信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products