Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice recognition method and device and computer readable storage medium

A speech recognition and speech frame technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of poor performance of direct speaker distinction

Active Publication Date: 2019-03-29
PING AN TECH (SHENZHEN) CO LTD
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the performance of using DNN to extract features for direct speaker discrimination is poor. Therefore, how to improve the performance and effectiveness of the speaker recognition system has become the focus of research.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition method and device and computer readable storage medium
  • Voice recognition method and device and computer readable storage medium
  • Voice recognition method and device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0045] The speech recognition method provided by the embodiments of the present invention can be executed by a speech recognition device, wherein, in some embodiments, the speech recognition device can be set on smart terminals such as mobile phones, computers, tablets, and smart watches. In some embodiments, the voice recognition device may be installed on an intelligent terminal. In some embodiments, the voice recognition device may be spatially ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a voice recognition method and device and a computer readable storage medium. The method comprises the steps of obtaining a to-be-detected first digital voicesignal, wherein the first digital voice signal is composed of digital codes, and the digital codes are composed of multiple digits; conducting preset segmentation processing on the first digital voice signal to obtain multiple second digital voice signals; processing each second digital voice signal according to a preset signal processing method, determining a logarithm Mel power spectrum corresponding to each second digital voice signal, and extracting target feature information of each second digital voice signal from the corresponding logarithm Mel power spectrum; recognizing the target feature information of each second digital voice signal to obtain a target digit corresponding to the second digital voice signal; determining a target digital password corresponding to the first digital voice signal according to the target digits to improve the performance and validity of voice recognition.

Description

technical field [0001] The present invention relates to the technical field of speech recognition, in particular to a speech recognition method, device and computer-readable storage medium. Background technique [0002] The speaker recognition system based on vector (Identity-Vector, I-vector) is a classic method to solve the problem of text-independent speaker recognition. However, in recent years, this field has attracted more and more attention from deep learning. The deep learning methods and techniques for solving acoustic problems can be divided into two categories: (1) using a deep neural network (Deep Neural Network, DNN) connected behind a Hidden Markov Model (HMM) to train Baum-Welch's statistical (2) A training method combining bottleneck features and Mel Frequency Cepstral Coefficient (MFCC) features. Since text-dependent problems are mainly built on the basis of text-independent problems, DNN can also be used to solve text-dependent speaker recognition problems...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/26G10L15/16G10L15/14G10L15/10G10L15/06G10L25/24
CPCG10L15/063G10L15/10G10L15/144G10L15/16G10L25/24G10L15/26
Inventor 贾雪丽程宁王健宗
Owner PING AN TECH (SHENZHEN) CO LTD