Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for speech recognition punctuation recovery and storage medium

A technology of speech recognition and punctuation, applied in speech recognition, speech analysis, instruments, etc., can solve problems affecting downstream task performance, downstream task performance degradation, ambiguity, etc.

Pending Publication Date: 2022-03-01
BEIJING YOUZHUJU NETWORK TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing commercial speech recognition systems often output text without punctuation, which may lead to misunderstandings and affect the performance of downstream tasks such as machine translation and information extraction
Specifically, on the one hand, text without punctuation is difficult to read, has poor readability, unclear sentences, and ambiguity. On the other hand, downstream tasks such as machine translation and information extraction assume that the input is punctuated. Yes, text without punctuation can cause performance degradation in downstream tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for speech recognition punctuation recovery and storage medium
  • Method and device for speech recognition punctuation recovery and storage medium
  • Method and device for speech recognition punctuation recovery and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The following will clearly and completely describe the technical solutions in the embodiments of the present disclosure with reference to the drawings in the embodiments of the present disclosure, but obviously, the described embodiments are only some of the embodiments of the present disclosure, not all of them. The following descriptions of the embodiments are only illustrative in fact, and by no means limit the present disclosure and its application or use. It should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein.

[0019] It should be understood that the various steps described in the method implementations of the present disclosure may be executed in different orders, and / or executed in parallel. Additionally, method embodiments may include additional steps and / or omit performing illustrated steps. The scope of the present disclosure is not limited in this respec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method and device for speech recognition punctuation recovery and a storage medium. The invention provides a training method of a model for speech recognition punctuation recovery, comprising the following steps: obtaining text samples and corresponding audio samples for model training, the audio samples corresponding to the text samples obtained from non-audio texts being virtual samples; and training a model for speech recognition punctuation recovery based on the acquired text samples for model training and the corresponding audio samples.

Description

technical field [0001] The present disclosure relates to speech recognition, including punctuation recovery in speech recognition. Background technique [0002] Automatic Speech Recognition (ASR), a technology that converts human speech into text, has a wide range of applications and can serve as an upstream component for multiple tasks, such as voice assistants and speech translation, among others. Existing commercial speech recognition systems often output text without punctuation, which may lead to misunderstandings and affect the performance of downstream tasks such as machine translation and information extraction. Specifically, on the one hand, text without punctuation is difficult to read, has poor readability, unclear sentences, and ambiguity. On the other hand, downstream tasks such as machine translation and information extraction assume that the input is punctuated. Yes, text without punctuation can lead to performance degradation in downstream tasks. Contents ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06G10L15/26
CPCG10L15/063G10L15/26
Inventor 吴礼蔚朱耀明程善伯王明轩
Owner BEIJING YOUZHUJU NETWORK TECH CO LTD