Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Human voice recognition algorithm

A recognition algorithm and human voice technology, which is applied in the field of human voice recognition algorithm, can solve the problems of unable to identify the characteristics of the speaker's voice, the acoustic model cannot be accurately matched and calculated, and can only be recognized, so as to achieve perfect input features, low noise, and more noise Effect

Inactive Publication Date: 2019-07-19
李东明
View PDF14 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This recognition method is mainly aimed at non-specific people, and it can recognize the speech of most people. However, because it is a general-purpose acoustic model, when the user's pronunciation is not standard enough or has a local accent, this The general-purpose acoustic model cannot accurately perform matching calculations. Its substantial defect is that it can only recognize the speaker's speech content, and cannot directly identify the speaker's voice characteristics.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Human voice recognition algorithm
  • Human voice recognition algorithm
  • Human voice recognition algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Such as figure 1 As shown, a human voice recognition algorithm includes the following steps:

[0030] S1: Adaptive processing of the volume of the speaker's voice. After the volume of the speaker's voice is trained by the recognition model, the overall normalization process is performed to the same maximum value;

[0031] S2: Adaptive processing of the mute area of ​​the speaker's voice, judge the volume value of the current speaker through the mean value filter, and then filter out the mute area through the threshold;

[0032] S3: filter the background sound to reduce noise, and perform consistency processing on the speaker's voice data;

[0033] S4: Extract the speaker's voice features, and extract the high-dimensional feature vector of the speaker's voice through the trained neural network algorithm model;

[0034] S5: Compare and identify the speaker's voice features with the voiceprint library, use the cosine distance to compare the high-dimensional features extr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a human voice recognition algorithm. The human voice recognition algorithm includes the steps of S1, subjecting the volume of a speaker's voice to self-adaptive processing, carrying out recognition model training on the obtained volume of the speaker's voice and performing overall normalization processing to a same maximum value; S2, subjecting a mute area of the speaker's voice to self-adaptive processing, judging a volume value of a current speaker through mean filtering, and filtering out the mute area through a threshold value; S3, filtering and denoising backgroundsound and performing consistency processing on speaker voice data; S4, extracting voice characteristics of the speaker, and extracting a high-dimensional characteristic vector of the speaker's voice through a trained neural network algorithm model; S5, comparing and identifying the voice characteristics of the speaker with a voiceprint library, and comparing the high-dimensional characteristics extracted by the neural network algorithm model through a cosine distance so as to obtain the similarity of the speaker's characteristics. The human voice recognition algorithm directly identifies the voice characteristics of the speaker, and is low in noise and high in precision.

Description

technical field [0001] The invention belongs to the technical field of voice recognition, and in particular relates to a human voice recognition algorithm. Background technique [0002] Speech recognition technology is an information technology that converts human voices, bytes or phrases into corresponding words or symbols, or gives responses through the process of machine recognition and understanding. With the rapid development of information technology, speech recognition technology has been widely used in people's daily life. For example, when using a terminal device, voice recognition technology can be used to conveniently input information in the terminal device by inputting voice. [0003] Speech recognition technology is essentially a process of pattern recognition. The patterns of unknown speech are compared with reference patterns of known speech one by one, and the best matching reference pattern is output as the recognition result. There are many recognition m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/16G10L21/0208
CPCG10L15/02G10L15/16G10L21/0208
Inventor 史程彭加木
Owner 李东明
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products