Speech data processing method and device, computer device and storage medium

A voice data processing technology, applied in voice analysis, voice recognition, instruments, etc., that addresses the problem of low accuracy in voice recognition models

Active Publication Date: 2018-11-23
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0004] Based on this, it is necessary to address the above technical problem by providing a voice data processing method, device, computer equipment and storage medium that solve the low accuracy of voice recognition models in the prior art.



Examples

Embodiment Construction

[0029] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0030] The speech data processing method provided by this application can be applied in an application environment such as the one shown in Figure 1, in which a computer device communicates with a server over a network. The computer device can be, but is not limited to, a personal computer, laptop, smartphone, tablet, or portable wearable device. The server can be implemented as an independent server.

[0031] Specifically, the voice data processing method ...

Abstract

The present invention discloses a speech data processing method and device, a computer device and a storage medium. The method comprises the steps of: obtaining original speech data; employing a VAD (Voice Activity Detection) algorithm to perform framing and segmentation processing on the original speech data to obtain at least two frames of speech data to be tested; employing an ASR (Automatic Speech Recognition) feature extraction algorithm to perform feature extraction on each frame of speech data to be tested to obtain filter speech features to be tested; employing a trained ASR-LSTM (Long Short-Term Memory) speech recognition model to perform recognition on the filter speech features to be tested to obtain a recognition probability value; and, if the recognition probability value is larger than a preset probability value, taking the speech data to be tested as target speech data. The method can effectively remove interference from noise and silence, thereby improving the accuracy of model recognition.
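The steps listed in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the energy-based `energy_vad`, the single-value `log_energy_feature`, and the `toy_recognizer` are hypothetical stand-ins for the VAD algorithm, the ASR feature extraction algorithm, and the trained ASR-LSTM model, none of which are specified in the text shown here. All function and parameter names are invented for the example.

```python
import math
from typing import Callable, List, Sequence

Frame = List[float]


def split_frames(samples: Sequence[float], frame_len: int) -> List[Frame]:
    """Cut the signal into consecutive, non-overlapping frames (a short tail is dropped)."""
    return [list(samples[i:i + frame_len])
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def frame_energy(frame: Frame) -> float:
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def energy_vad(frames: List[Frame], energy_floor: float) -> List[Frame]:
    """Assumed stand-in for the VAD step: keep frames above an energy floor."""
    return [f for f in frames if frame_energy(f) > energy_floor]


def log_energy_feature(frame: Frame) -> List[float]:
    """Assumed stand-in for ASR feature extraction: one log-energy value per frame."""
    return [math.log(frame_energy(frame) + 1e-10)]


def toy_recognizer(features: List[float]) -> float:
    """Hypothetical stand-in for the trained model: maps a feature to a value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-(features[0] + 2.0)))


def select_target_frames(samples: Sequence[float],
                         frame_len: int,
                         energy_floor: float,
                         extract: Callable[[Frame], List[float]],
                         recognize: Callable[[List[float]], float],
                         preset_probability: float) -> List[Frame]:
    """Keep a frame as target speech only if its recognition probability exceeds the preset value."""
    candidates = energy_vad(split_frames(samples, frame_len), energy_floor)
    targets = []
    for frame in candidates:
        probability = recognize(extract(frame))
        if probability > preset_probability:
            targets.append(frame)
    return targets


# Toy signal: one silent frame, one faint noise frame, one loud frame.
signal = [0.0] * 4 + [0.05] * 4 + [0.8, -0.8, 0.8, -0.8]
targets = select_target_frames(signal, 4, 1e-4,
                               log_energy_feature, toy_recognizer, 0.5)
print(len(targets))  # 1: only the loud frame passes both the VAD and the threshold
```

In this toy run the silent frame is removed by the VAD stand-in, while the faint frame passes the VAD but is rejected by the probability threshold, which mirrors the two-stage filtering the abstract describes.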

Description

Technical field

[0001] The invention relates to the technical field of voice recognition, in particular to a voice data processing method, device, computer equipment and storage medium.

Background

[0002] Voice Activity Detection (VAD), also known as voice endpoint detection or voice boundary detection, identifies and eliminates long periods of silence from the sound signal stream, so as to save voice channel resources without reducing service quality.

[0003] At present, training a speech recognition model or performing recognition with it requires relatively pure speech data, but the speech data currently available is often mixed with noise or silence. A speech recognition model trained on such data has low accuracy, which hinders the popularization and application of speech recognition models.

Contents of the invention

[0004] Based on this, it is necessar...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/04, G10L15/05, G10L15/06
CPC: G10L15/04, G10L15/05, G10L15/06, G10L15/063
Inventor: 涂宏
Owner: PING AN TECH (SHENZHEN) CO LTD