Multi-round dialogue rewriting method and system based on text editing and grammar error correction

A text editing and grammar technology, which is applied in natural language data processing, digital data information retrieval, special data processing applications, etc. It can solve the problems of incomplete information in multiple rounds of dialogue, and does not take into account the high degree of overlap between the original text and the target text. To achieve the effect of improving accuracy

Pending Publication Date: 2021-11-05
SHANGHAI JIAO TONG UNIV
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention aims at solving the problem of incomplete multi-round dialogue information in the prior art without considering the high overlap between the original text and the target text, and proposes a multi-round dialogue rewriting method and system based on text editing and grammati

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-round dialogue rewriting method and system based on text editing and grammar error correction
  • Multi-round dialogue rewriting method and system based on text editing and grammar error correction
  • Multi-round dialogue rewriting method and system based on text editing and grammar error correction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] Such as figure 2 As shown, it is a multi-round dialogue rewriting system based on text editing and grammatical error correction involved in this embodiment, including: a text editing module and a grammatical error correction module, wherein: the text editing module builds a sequence labeling model, and the sequence labeling model uses BERT acts as an encoder, Transformer acts as a decoder, and edits sentences according to the labels generated by the sequence labeling model; the grammatical error correction module performs grammatical error correction on the edited sentences with complete semantic information to generate the final rewritten sentence.

[0024] The text editing module includes: an original text labeling algorithm, a sequence labeling model and a text editing algorithm, wherein: the original text labeling algorithm generates text labeling data according to the dialogue data text, the sequence prediction model generates a prediction label for the dialogue te...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-round dialogue rewriting method and system based on text editing and grammar error correction, and the method comprises the steps: carrying out word-level labeling of a collected dialogue data text through a text labeling algorithm, generating text labeling data, and carrying out fine adjustment of a deformation-based bidirectional coding representation model, so as to obtain a sequence labeling model; editing dialogue history and incomplete statements according to the classification label of each word in the prediction sequence of the model, and finally carrying out grammar error correction modeling on the rewritten text to improve the fluency of the statements. According to the method, the accuracy of multi-round dialogue rewriting can be improved, the problems of anaphora and omission in a dialogue system are effectively solved by utilizing a text editing and grammar error correction method in dialogues, and the integrity of dialogue statements is improved.

Description

technical field [0001] The present invention relates to a technology for multi-round dialogue rewriting in an intelligent dialogue system, in particular to a multi-round dialogue rewriting (Dialogue Utterance Rewritten via SequenceEditing, DuReSE) method and system based on text editing and grammatical error correction. Background technique [0002] In recent years, with the development of natural language processing (nlp) technology, more and more enterprises use intelligent chatbots to replace human responses. Taking the user's inquiry and the corresponding reply of the robot as a round of dialogue, the reply generation method of a single round of dialogue is very mature, but users usually have multiple rounds of dialogue to achieve their goals. During multiple rounds of dialogue, the user often enters dialogue sentences with incomplete information (reference or omission), that is, there are pronouns pointing to part of the information in the dialogue history or omissions ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/166G06F40/253G06F40/35G06F16/332
CPCG06F40/166G06F40/253G06F40/35G06F16/3329
Inventor 蒋文荟沈备军顾小东
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products