Voice recognition method and device and computer readable storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and speech frame technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of poor performance of direct speaker distinction

Active Publication Date: 2019-03-29

PING AN TECH (SHENZHEN) CO LTD

View PDF5 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the performance of using DNN to extract features for direct speaker discrimination is poor. Therefore, how to improve the performance and effectiveness of the speaker recognition system has become the focus of research.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0044] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0045] The speech recognition method provided by the embodiments of the present invention can be executed by a speech recognition device, wherein, in some embodiments, the speech recognition device can be set on smart terminals such as mobile phones, computers, tablets, and smart watches. In some embodiments, the voice recognition device may be installed on an intelligent terminal. In some embodiments, the voice recognition device may be spatially ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention discloses a voice recognition method and device and a computer readable storage medium. The method comprises the steps of obtaining a to-be-detected first digital voicesignal, wherein the first digital voice signal is composed of digital codes, and the digital codes are composed of multiple digits; conducting preset segmentation processing on the first digital voice signal to obtain multiple second digital voice signals; processing each second digital voice signal according to a preset signal processing method, determining a logarithm Mel power spectrum corresponding to each second digital voice signal, and extracting target feature information of each second digital voice signal from the corresponding logarithm Mel power spectrum; recognizing the target feature information of each second digital voice signal to obtain a target digit corresponding to the second digital voice signal; determining a target digital password corresponding to the first digital voice signal according to the target digits to improve the performance and validity of voice recognition.

Description

technical field [0001] The present invention relates to the technical field of speech recognition, in particular to a speech recognition method, device and computer-readable storage medium. Background technique [0002] The speaker recognition system based on vector (Identity-Vector, I-vector) is a classic method to solve the problem of text-independent speaker recognition. However, in recent years, this field has attracted more and more attention from deep learning. The deep learning methods and techniques for solving acoustic problems can be divided into two categories: (1) using a deep neural network (Deep Neural Network, DNN) connected behind a Hidden Markov Model (HMM) to train Baum-Welch's statistical (2) A training method combining bottleneck features and Mel Frequency Cepstral Coefficient (MFCC) features. Since text-dependent problems are mainly built on the basis of text-independent problems, DNN can also be used to solve text-dependent speaker recognition problems...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/26G10L15/16G10L15/14G10L15/10G10L15/06G10L25/24

CPCG10L15/063G10L15/10G10L15/144G10L15/16G10L25/24G10L15/26

Inventor 贾雪丽程宁王健宗

Owner PING AN TECH (SHENZHEN) CO LTD

Voice recognition method and device and computer readable storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology