Speaker recognition method and system based on gender and language

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speaker recognition and speaker technology, which is applied in the field of speaker recognition, can solve the problems such as the decline of the accuracy of the single factor recognition method, and achieve the effect of improving the robustness and high recognition accuracy.

Pending Publication Date: 2022-04-15

ZHEJIANG UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a speech recognition method and system based on gender and language, which combines the gender information and language information contained in the voice content to identify the speaker, and solves the problem of a single factor when the tone of the language changes. The technical problem that the accuracy rate of the recognition method drops

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0025] The technical framework of the invention will be described below in conjunction with the accompanying drawings.

[0026] In the prior art, most of the speaker recognition methods focus on a single factor, that is, the speaker’s own recognition. This method requires the speaker to keep the same way of speaking in the two stages of voiceprint registration and voiceprint recognition. When the speaker When different tones are used, the recognition accuracy will decrease.

[0027] In order to solve most of the technical problems in the prior art that are based on a single factor, that is, the identification of the speaker itself, resulting in low robustness of speaker identification, an embodiment of the present invention provides a speaker identification method and system based on gender and language .

[0028] The technical solutions provided by various embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speaker recognition method and system based on gender and language, and belongs to the field of speaker recognition. Comprising the following steps: acquiring to-be-recognized voice data, specifically an audio file containing an audio of an effective speaker; carrying out noise reduction processing on the audio file to obtain a low-noise voice audio; carrying out SMAC feature extraction on the denoised voice audio to obtain a voice spectrum feature map; inputting the speech spectrum feature map into a ResNet model to obtain a speech feature vector; inputting the voice feature vector into a multi-target learning model, and identifying to obtain the identity of the speaker, the gender of the speaker and the language information used by the speaker; and performing weighted fusion on the three recognition task results to obtain a speaker recognition result corresponding to the to-be-recognized voice data. According to the method, the gender information and the language information in the voice are comprehensively utilized, the robustness of speaking recognition is effectively improved, and particularly, the recognition precision is high under the condition that the voice of the speaker changes.

Description

technical field [0001] The invention relates to the field of speaker recognition, in particular to a gender and language-based speaker recognition method and system. Background technique [0002] With the continuous development of artificial intelligence, more and more intelligent identification technologies have been applied in life, including face recognition, fingerprint recognition and voiceprint recognition that have emerged in recent years. Voiceprint recognition, also known as speaker recognition, analyzes a piece of audio content to identify which speaker the audio belongs to. Speakers can be used for identity authentication and have attracted widespread attention because of their convenience. [0003] In the prior art, most of the speaker recognition methods focus on a single factor, that is, the speaker’s own recognition. This method requires the speaker to keep the same way of speaking in the two stages of voiceprint registration and voiceprint recognition. When ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/02G10L17/04G10L17/18

Inventor 徐文渊冀晓宇程雨诗高逸卓

Owner ZHEJIANG UNIV

Speaker recognition method and system based on gender and language

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology