Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for training text classification model for spoken language interaction

A text classification and spoken language technology, applied in the field of training methods and systems for text classification models, can solve the problems of time-consuming and labor-intensive, manual design of text features, and lack of dialogue information, and achieve the effect of improving accuracy.

Active Publication Date: 2022-07-01
AISPEECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to at least solve the time-consuming and labor-intensive task of manually designing text features in the prior art; after the classification model obtains the results, it is necessary to manually design rules to judge the final domain, which is not only time-consuming and labor-intensive, but also not flexible enough; dialog information can help the model judge the domain, Improve the accuracy of domain classification, but there is no problem of adding dialogue information in the existing method model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for training text classification model for spoken language interaction
  • Method and system for training text classification model for spoken language interaction
  • Method and system for training text classification model for spoken language interaction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0031] As an embodiment, the obtaining of the spoken text corpus training set and the dialogue history context information includes:

[0032] Extracting the domain-intent associated in the domain set and the intent set based on the domain set of spoken interaction, the intent set and the reply template set for feedback intent;

[0033] Extracting a reply template matching the domain-intent from the reply template set, and determining a domain-intent-dialogue template;

[0034] Obtaining the domain-intent-dialogue template is determined as dialogue history context information.

[0035] In this embodiment, there are many contents in the historical contextual dialogue information, but the domain of the previous round of dialogue (referred to as pre_domain), the intention of the user in the previous round of dialogue (referred to as pre_intent), and the reply of the previous round of dialogue system (referred to as pre_intent) pre_systemreply) These three are key factors that inf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the present invention provides a training method for a text classification model for oral interaction. The method includes: acquiring a training set of spoken text corpus and dialogue history context information; expanding the spoken text corpus training set through the dialogue history context information to enrich the training set of spoken text corpus; establishing a text classification model based on a bidirectional long-term and short-term memory network , through the dialogue history context information and the expanded oral text corpus training set, the text classification model is trained, so that the text classification model can learn the field classification of the spoken text through the dialogue history context information. The embodiment of the present invention also provides a training system for a text classification model for spoken language interaction. The embodiment of the present invention determines the dialogue history context information, constructs a large number of virtual dialogue texts, and makes up for the lack of corpus; the dialogue history context information is used as a part of the input of the training model, and the dialogue history context information helps the model to improve the accuracy of domain classification.

Description

technical field [0001] The invention relates to the field of intelligent speech dialogue, in particular to a training method and system for a text classification model for spoken language interaction. Background technique [0002] In the text classification of spoken language interaction, a large number of artificially annotated corpora are usually used to train a deep learning model, and the model can automatically obtain text features. After the model outputs the results, it is also necessary to combine the last round of dialogue state design rules to select the final domain output. [0003] In the process of realizing the present invention, the inventor found that there are at least the following problems in the related art: [0004] The text classification method based on feature engineering requires labor-intensive design of text features. The quality of feature design restricts the final performance of the model, and the features used in this method often have problems...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06N3/04
CPCG06F16/355G06N3/044G06N3/045
Inventor 方艳徐华初敏
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products