Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Three-particle cooperative optimization method applied to vector quantization-based speaker recognition

A technology of speaker recognition and vector quantization, which is applied in the field of three-particle collaborative optimization, can solve problems that affect the performance of speaker recognition, cannot be guaranteed, and find the global optimal codebook, etc.

Inactive Publication Date: 2010-06-16
SHENZHEN UNIV
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If the initial codebook is not well selected, it will affect the codebook performance and program operation time, thus affecting the speaker recognition performance
[0049] Although the codebook design quality of the particle pair collaborative optimization algorithm has been greatly improved compared with the traditional LBG algorithm, there is still a certain sensitivity to the selection of the initial codebook, which may fall into the local optimum, and cannot guarantee that the global optimum can be found. Ucodebook

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Three-particle cooperative optimization method applied to vector quantization-based speaker recognition
  • Three-particle cooperative optimization method applied to vector quantization-based speaker recognition
  • Three-particle cooperative optimization method applied to vector quantization-based speaker recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0085] In order to have a further understanding and understanding of the structural features of the present invention and the achieved effects, the preferred embodiments and accompanying drawings are used for a detailed description, as follows:

[0086] The technical problem to be solved by the present invention is to propose an improved vector quantization speaker codebook model optimization design method - triple-particle cooperative optimization method (Triple-Particle Cooperative Optimizer, TPCO). The present invention integrates the advantages of the particle pair collaborative optimization algorithm and the hybrid leapfrog algorithm, starts from the information dissemination in the control group, coordinates global exploration and local deep search capabilities, improves the diversity of the group, overcomes prematurity, and improves the convergence speed and solution accuracy. It can more effectively prevent particles from falling into the local optimal codebook, and mak...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a three-particle cooperative optimization method applied to speaker recognition, which is an optimization design method for a vector quantization speaker codebook model and comprises the following steps: dividing an initial population of six particles into two sub-populations, wherein each particle represents a codebook and each sub-population consists of three particles named three-particle; adopting different particle update parameters by two three-particles so as to realize global exploration and local exploration; executing updating of speed and position of PSO and operation of an LBG algorithm with an iteration number of three by the particles in each iteration; when the iterations reach mixing update times, mixing and dividing the particles into new three-particles to realize global information exchange and cooperative evolution; when a condition of the maximum iteration number of the initial population is met, selecting two particles from the two three-particles to continue searching until the maximum iteration number of the elite particles is achieved; and using the optimized one as the speaker codebook model. The three-particle cooperative optimization method applied to the speaker recognition better solves the problem that an initial codebook influences the optimization result, and obviously improves the performance of short-voice speaker recognition.

Description

technical field [0001] The invention relates to the technical field of speech recognition, and more specifically, relates to a three-particle cooperative optimization method applied to speaker recognition based on vector quantization. Background technique [0002] Speaker recognition is a biometric technology that is urgently needed for the application of information technology in existing communication networks. Judicial field (such as monitoring and identification of criminals), security field (such as airport access control system control), information service field (such as automatic information retrieval or e-commerce), etc. [0003] The speaker model is the core of high-performance speaker recognition. Since the late 1980s, speaker recognition has entered a new and thriving development due to the establishment of speaker models based on vector quantization, probability statistics models, and artificial neural networks. period. As a typical optimization problem, model...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/14
Inventor 纪震薛丽萍周家锐储颖
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products