Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Deep Learning Method for Speech Gender Recognition

A gender recognition and deep learning technology, applied in the field of gender recognition, can solve the problems of being easily affected by the environment and low system robustness, and achieve the effect of high recognition rate and rich dimensions.

Active Publication Date: 2021-09-07
杭州杰峰科技有限公司
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current speech gender recognition system usually consists of three parts, which are speech signal preprocessing, feature extraction and classification. Feature extraction is the most important part, and its quality directly affects the recognition results. The speech gender proposed by previous researchers Most of the features are based on the prosody features and sound quality features of speech, which are all artificially designed features. The robustness of the system is not high, and it is easily affected by the environment.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Deep Learning Method for Speech Gender Recognition
  • A Deep Learning Method for Speech Gender Recognition
  • A Deep Learning Method for Speech Gender Recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present invention will be further described below in conjunction with the accompanying drawings, but the present invention is not limited to the following examples.

[0048] Such as Figure 1-3 Shown is a specific embodiment of a voice gender recognition deep learning method, this embodiment is a voice gender recognition deep learning method, the method comprises the following steps:

[0049] a) voice signal collection;

[0050] b) performing speech endpoint detection on the collected speech signal and segmenting the speech signal segment of human voice;

[0051] c) Obtaining N frames after subdividing the speech signal segment into frames, performing multi-resolution cochlear speech feature extraction on each frame, and finally obtaining the speech features of N frames;

[0052] D) Carry out speech gender recognition to output speech gender recognition result, described speech gender recognition comprises the steps:

[0053] d1) Establish a deep learning classif...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice gender recognition deep learning method, comprising the following steps: a. collecting voice signals; b. detecting the endpoints of the collected voice signals and segmenting the human voice voice signal segments; c. The voice signal segment is divided into frames and multi-resolution cochlear voice features are extracted for each frame; d. Input the voice features of each frame into the pre-trained deep learning model of voice gender recognition for classification; e. Determine the output voice gender Voting statistics, according to the final output of the speaker's gender of the voice signal segment, the present invention uses multi-resolution cochlear voice features, which are more in line with the voice feature parameters of human auditory perception analysis, and uses capsule networks as the voice gender recognition acoustic reasoning model , can adapt to the speech environment with low signal-to-noise ratio, and has a higher recognition rate than traditional methods.

Description

technical field [0001] The invention relates to a deep learning method, and more specifically, relates to a deep learning method for speech gender recognition, which belongs to the technical field of gender recognition. Background technique [0002] Speaker recognition is a hotspot in the field of identity authentication and artificial intelligence. Solving the problem of speaker recognition has important theoretical value and far-reaching practical significance. Gender recognition can be applied to occasions where the gender of boys and girls needs to be verified, such as knowing the gender of the person being verified from the first digit of the ID card number, and verifying whether the speaker’s gender is consistent with the gender contained in the ID card number. It can be applied to the front-end of speech recognition. When the gender of the speaker is recognized, the gender-dependent speech model (Gender-Dependent Model) is used for recognition to improve the speech re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/18G10L17/04G10L17/02G06K9/62
CPCG10L17/18G10L17/04G10L17/02G06F18/24
Inventor 陈晋生罗世操
Owner 杭州杰峰科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products