Unlock instant, AI-driven research and patent intelligence for your innovation.

Data augmentation method and system for spoken language semantic understanding

A semantic understanding and data technology, applied in natural language data processing, semantic analysis, neural learning methods, etc., can solve problems such as inability to generate tags, high space-time complexity, and poor overall performance of sentences

Active Publication Date: 2021-11-12
AISPEECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to at least solve the problems in the prior art that label collection is required, new labels cannot be generated, the calculation process has high space-time complexity, and the overall performance of data-augmented sentences is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data augmentation method and system for spoken language semantic understanding
  • Data augmentation method and system for spoken language semantic understanding
  • Data augmentation method and system for spoken language semantic understanding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0045] As an implementation manner, the atom template matching table includes: a built-in atom template matching table or a user-defined atom template matching table.

[0046] In this embodiment, the atomic template matching table will affect the effect of data enhancement to a certain extent. In order to satisfy developers at various levels, some atomic template matching tables will be built in to help developers deal with more daily dialogue actions. If developers have some unique requirements in a certain field or in a certain scenario, developers are also allowed to customize the atom template matching table to allow developers to adjust themselves.

[0047] It can be seen from this embodiment that a built-in atomic template matching table is provided to help developers handle daily dialogue actions and improve the efficiency of data enhancement for developers. For developers with certain special needs, a custom atomic template matching table is provided at the same time,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An embodiment of the present invention provides a data enhancement method for semantic understanding of spoken language. The method includes: defining an atomic template matching table, in which each atomic template is a natural language description corresponding to a structured basic granularity label; determining a semantic representation according to a given dialog action, and decomposing the semantic representation of the dialog action According to the atomic template matching table, find the structured basic granularity label corresponding to the unit semantic label, and then determine the atomic template matching the basic granularity label, and use the atomic template to convert the given dialogue action into an atomic sample ; using a neural network-based sentence generation model to combine atomic sample sets into natural sentences. The embodiment of the present invention also provides a data enhancement system for semantic understanding of spoken language. The embodiments of the present invention require data-enhanced dialogue action input to generate brand new tags, and the generated new sentences are more in line with natural sentences.

Description

technical field [0001] The invention relates to the field of speech recognition, in particular to a data enhancement method and system for spoken language semantic understanding. Background technique [0002] SLU (Spoken Language Understanding, Spoken Language Understanding) is a key component of a spoken dialogue system, which parses user utterances into corresponding semantic representations in a narrow domain. Typical semantic representations of SLU can be semantic frames or dialogue behaviors. [0003] Deep learning has achieved great success in the field of SLU, but it requires a large amount of labeled data, which limits the scalability of SLU models. Despite the progress and huge research activities in semi-supervised learning and domain adaptation, deep SLU models A large amount of labeled data is still required for training. In order to obtain a large amount of data, the following techniques are usually used, including: zero-shot learning (zero-shot learning), inpu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F16/332G06F40/30G06F40/205G06N3/08
CPCG06N3/084G06F16/3329G06F16/3344
Inventor 俞凯朱苏赵子健
Owner AISPEECH CO LTD