Speech data processing method and device, computer device and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of voice data and processing methods, which is applied in voice analysis, voice recognition, instruments, etc., and can solve the problems of low accuracy of voice recognition models

Active Publication Date: 2018-11-23

PING AN TECH (SHENZHEN) CO LTD

View PDF10 Cites 17 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Based on this, it is necessary to address the above-mentioned technical problems and provide a voice data processing method, device, computer equipment and storage medium for solving the technical problem of low accuracy of voice recognition models in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0029] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0030] The speech data processing method provided by this application can be applied in such as figure 1 An application environment in which a computer device communicates with a server over a network. Computer equipment can be, but is not limited to, various personal computers, laptops, smartphones, tablets, and portable wearable devices. The server can be implemented as an independent server.

[0031] Specifically, the voice data processing method ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention discloses a speech data processing method and device, a computer device and a storage medium. The method comprises the steps of: obtaining original speech data; employing a VAD (Voice Activity Detection) algorithm to perform framing and segmentation processing for the original speech data to obtain at least two frames of speech data to be tested; employing an ASR (Automatic Speech Recognition) feature extraction algorithm to perform feature extraction of each frame of speech data to be tested to obtain speech features of a filter to be tested; employing a trained ASR-LSTM(Long-Short Term Memory) speech recognition model to perform recognition for the speech features of the filter to be tested to obtain a recognition probability value; and if the recognition probability value is larger than a preset probability value, taking the speech data to be tested as target speech data. The method can effectively remove the interference of the noise and the mute so as to improve the accuracy of the model recognition.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice data processing method, device, computer equipment and storage medium. Background technique [0002] Voice Activity Detection (VAD), also known as voice endpoint detection or voice boundary detection, is to identify and eliminate long periods of silence from the sound signal stream, so as to save voice channels without reducing service quality. The role of resources. [0003] At present, when training or recognizing speech recognition models, it is necessary to obtain relatively pure speech data for model training, but the current speech data is often mixed with noise or silence, resulting in the use of speech data mixed with noise for training. The accuracy rate of the speech recognition model is low, which is not conducive to the popularization and application of the speech recognition model. Contents of the invention [0004] Based on this, it is necessar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/04G10L15/05G10L15/06

CPCG10L15/04G10L15/05G10L15/06G10L15/063

Inventor 涂宏

Owner PING AN TECH (SHENZHEN) CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speech data processing method and device, computer device and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology