Human voice recognition algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A recognition algorithm and human voice technology, which is applied in the field of human voice recognition algorithm, can solve the problems of unable to identify the characteristics of the speaker's voice, the acoustic model cannot be accurately matched and calculated, and can only be recognized, so as to achieve perfect input features, low noise, and more noise Effect

Inactive Publication Date: 2019-07-19

李东明

View PDF14 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

This recognition method is mainly aimed at non-specific people, and it can recognize the speech of most people. However, because it is a general-purpose acoustic model, when the user's pronunciation is not standard enough or has a local accent, this The general-purpose acoustic model cannot accurately perform matching calculations. Its substantial defect is that it can only recognize the speaker's speech content, and cannot directly identify the speaker's voice characteristics.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0029] Such as figure 1 As shown, a human voice recognition algorithm includes the following steps:

[0030] S1: Adaptive processing of the volume of the speaker's voice. After the volume of the speaker's voice is trained by the recognition model, the overall normalization process is performed to the same maximum value;

[0031] S2: Adaptive processing of the mute area of the speaker's voice, judge the volume value of the current speaker through the mean value filter, and then filter out the mute area through the threshold;

[0032] S3: filter the background sound to reduce noise, and perform consistency processing on the speaker's voice data;

[0033] S4: Extract the speaker's voice features, and extract the high-dimensional feature vector of the speaker's voice through the trained neural network algorithm model;

[0034] S5: Compare and identify the speaker's voice features with the voiceprint library, use the cosine distance to compare the high-dimensional features extr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a human voice recognition algorithm. The human voice recognition algorithm includes the steps of S1, subjecting the volume of a speaker's voice to self-adaptive processing, carrying out recognition model training on the obtained volume of the speaker's voice and performing overall normalization processing to a same maximum value; S2, subjecting a mute area of the speaker's voice to self-adaptive processing, judging a volume value of a current speaker through mean filtering, and filtering out the mute area through a threshold value; S3, filtering and denoising backgroundsound and performing consistency processing on speaker voice data; S4, extracting voice characteristics of the speaker, and extracting a high-dimensional characteristic vector of the speaker's voice through a trained neural network algorithm model; S5, comparing and identifying the voice characteristics of the speaker with a voiceprint library, and comparing the high-dimensional characteristics extracted by the neural network algorithm model through a cosine distance so as to obtain the similarity of the speaker's characteristics. The human voice recognition algorithm directly identifies the voice characteristics of the speaker, and is low in noise and high in precision.

Description

technical field [0001] The invention belongs to the technical field of voice recognition, and in particular relates to a human voice recognition algorithm. Background technique [0002] Speech recognition technology is an information technology that converts human voices, bytes or phrases into corresponding words or symbols, or gives responses through the process of machine recognition and understanding. With the rapid development of information technology, speech recognition technology has been widely used in people's daily life. For example, when using a terminal device, voice recognition technology can be used to conveniently input information in the terminal device by inputting voice. [0003] Speech recognition technology is essentially a process of pattern recognition. The patterns of unknown speech are compared with reference patterns of known speech one by one, and the best matching reference pattern is output as the recognition result. There are many recognition m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/16G10L21/0208

CPCG10L15/02G10L15/16G10L21/0208

Inventor 史程彭加木

Owner 李东明

Who we serve

R&D Engineer
R&D Manager
IP Professional

Why Patsnap Eureka

Industry Leading Data Capabilities
Powerful AI technology
Patent DNA Extraction

Social media

Patsnap Eureka Blog

Learn More

PatSnap group products

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Human voice recognition algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology