A Large-Scale Speaker Identification Method

A speaker recognition technique, applied in speech analysis, instruments, etc., that addresses problems such as declining accuracy and difficulty in meeting the needs of practical applications.

Inactive Publication Date: 2015-10-21

BEIJING INSTITUTE OF TECHNOLOGY

3 Cites, 0 Cited by

Problems solved by technology

As the number of people to be identified grows, the accuracy of existing methods drops significantly. Beyond a certain scale, they can no longer meet the needs of practical applications. This is an important problem that text-independent speaker recognition technology needs to solve.

Embodiment Construction

[0080] In order to better illustrate the purpose and advantages of the present invention, the implementation of the method of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples.

[0081] All of the following tests were run on the same computer, configured as follows: Intel dual-core CPU (1.8 GHz clock), 1 GB memory, Windows XP SP3 operating system.

[0082] Part one

[0083] This section uses the voice files of the TIMIT speech corpus to describe in detail the speaker registration / training and speaker identification processes of the present invention when the number of target speakers is 600.

[0084] The TIMIT speech corpus is a standard corpus jointly produced by MIT, Stanford Research Institute, and Texas Instruments. It contains recordings of 630 speakers (438 male and 192 female), each with 10 utterances.

[0085] Randomly select 600 of the speakers and take all of their voice data, ...
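The random selection in [0085] can be sketched as follows. This is only an illustration under stated assumptions: the speaker IDs are hypothetical placeholders rather than real TIMIT identifiers, and the fixed seed is added purely so the draw is repeatable. The source text is truncated before it says what happens after the selection, so the sketch stops there.

```python
import random

TOTAL_SPEAKERS = 630   # corpus size stated in [0084]
TARGET_SPEAKERS = 600  # number of target speakers stated in [0083]

# Hypothetical speaker IDs; real TIMIT speaker IDs look different.
all_speakers = [f"spk{idx:03d}" for idx in range(TOTAL_SPEAKERS)]

# Fixed seed (an assumption, not from the source) so the draw is repeatable.
rng = random.Random(0)

# Draw 600 distinct speakers without replacement.
target_speakers = rng.sample(all_speakers, TARGET_SPEAKERS)

# The 30 speakers not drawn; the source does not say how they are used.
not_selected = sorted(set(all_speakers) - set(target_speakers))
```

All voice data of each speaker in `target_speakers` would then be taken, as the paragraph describes.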

Abstract

The invention relates to a text-independent speaker identification method based on 2D-Haar audio features and suited to large speaker populations. It introduces the concept of 2D-Haar audio features and a method for computing them. Basic audio features are first assembled into an audio feature map; 2D-Haar audio features are then extracted from that map; the AdaBoost.MH algorithm is then used to select among the 2D-Haar audio features and train a speaker classifier; finally, the trained classifier is used to identify speakers. Compared with the prior art, the method effectively limits the decay in identification accuracy as the speaker population grows, and offers high identification accuracy and speed. It can be applied not only on desktop computers but also on mobile computing platforms such as cell phones and tablets.
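The feature-map and 2D-Haar steps can be illustrated with a minimal sketch. The abstract does not say which basic audio features fill the map or which Haar templates are used, so everything specific here is an assumption: the map is taken to be a frames-by-coefficients matrix, the template is a two-rectangle "left half minus right half" feature in the style used for image detection, and the function names are hypothetical. The integral image (summed-area table) is a standard way to evaluate such rectangle sums in constant time; the source does not confirm that the invention uses it.

```python
from typing import List

Matrix = List[List[float]]


def integral_image(feature_map: Matrix) -> Matrix:
    """Summed-area table with an extra zero row and column at the top/left."""
    rows, cols = len(feature_map), len(feature_map[0])
    table = [[0.0] * (cols + 1) for _ in range(rows + 1)]
    for r in range(rows):
        row_sum = 0.0
        for c in range(cols):
            row_sum += feature_map[r][c]
            table[r + 1][c + 1] = table[r][c + 1] + row_sum
    return table


def rect_sum(table: Matrix, top: int, left: int, height: int, width: int) -> float:
    """Sum over rows [top, top+height) and columns [left, left+width)."""
    bottom, right = top + height, left + width
    return (table[bottom][right] - table[top][right]
            - table[bottom][left] + table[top][left])


def haar_two_rect(table: Matrix, top: int, left: int, height: int, width: int) -> float:
    """Two-rectangle Haar-like feature: left half minus right half."""
    half = width // 2
    left_sum = rect_sum(table, top, left, height, half)
    right_sum = rect_sum(table, top, left + half, height, half)
    return left_sum - right_sum


# Hypothetical 4-frame x 4-coefficient feature map (rows = frames).
feature_map = [
    [1.0, 2.0, 3.0, 4.0],
    [1.0, 2.0, 3.0, 4.0],
    [1.0, 2.0, 3.0, 4.0],
    [1.0, 2.0, 3.0, 4.0],
]
table = integral_image(feature_map)

# Left two columns sum to 12, right two columns sum to 28.
feature_value = haar_two_rect(table, top=0, left=0, height=4, width=4)
```

In a full pipeline, many such features at different positions and scales would form the candidate pool from which a boosting algorithm such as AdaBoost.MH selects; that selection step is not shown here.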

Description

Technical field

[0001] The invention relates to a text-independent speaker identification method suitable for large speaker populations. It belongs to the technical field of biometric identification; from the perspective of technical realization, it also belongs to the fields of computer science and speech processing.

Background technique

[0002] Speaker identification is an important branch of speaker recognition (SR) technology. It uses the characteristics of each speaker's voice signal to extract speaker information from a segment of speech and then determines which of several people spoke it, which makes it a "one-of-many" pattern recognition problem. With the rapid development of modern electronic technology in recent years, demand for speaker recognition applications has grown steadily (for example forensic identification in court, voice tracking and locating of criminal suspects, and voice retrieval)...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L17/02

Inventor: 罗森林, 谢尔曼, 潘丽敏

Owner: BEIJING INSTITUTE OF TECHNOLOGY
