Voice processing method and device, and electronic equipment

A voice processing and voice block technology, applied in the computer field, can solve problems such as increased recognition delay, and achieve the effect of reducing output delay

Active Publication Date: 2022-02-11
BEIJING YOUZHUJU NETWORK TECH CO LTD
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the modeling process of SAN, as the future frame information used increases, the recognition accuracy after modeling will increase accordingly, but the recognition delay will also increase accordingly.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice processing method and device, and electronic equipment
  • Voice processing method and device, and electronic equipment
  • Voice processing method and device, and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.

[0019] It should be understood that the various steps described in the method implementations of the present disclosure may be executed in different orders, and / or executed in parallel. Additionally, method embodiments may include additional steps and / or omit performing illustrated steps. The scope of the present disclosure is not limited in this regard. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a voice processing method and device, and electronic equipment. According to a specific embodiment, the method comprises the following steps: receiving a to-be-recognized voice block as a current voice block, wherein the voice block comprises a past frame, a current frame and a future frame; and based on the current voice block, executing the following voice recognition steps: performing voice recognition based on the current voice block to obtain a voice recognition result of the current frame and a voice recognition result of the future frame; determining whether a previous voice block of the current voice block exists; if so, updating a target recognition result by using the voice recognition result of the current frame of the current voice block; and outputting the voice recognition result of the future frame of the current voice block. According to the embodiment, the output delay of the voiced recognition results can be reduced.

Description

technical field [0001] The embodiments of the present disclosure relate to the field of computer technology, and in particular, to a voice processing method, device and electronic equipment. Background technique [0002] Streaming speech recognition, as one of the important application scenarios of speech products, has strong requirements for high accuracy and low latency. In order to improve the recognition accuracy of streaming speech, bidirectional neural networks are often used for acoustic modeling. Self-Attention Networks (SAN), as one of them, is increasingly used in speech products due to its high computing parallelism and strong modeling effects. However, in the modeling process of SAN, as the future frame information used increases, the recognition accuracy after modeling will increase accordingly, but the recognition delay will also increase accordingly. How to generate recognition results with low delay while keeping the recognition accuracy unchanged is a tech...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/26
CPCG10L15/26
Inventor 董林昊蔡猛马泽君
Owner BEIJING YOUZHUJU NETWORK TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products