Speaker recognition method for deliberately pretended voices

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for speaker recognition and recognition method, which is applied in the field of speech signal processing and speaker recognition, and can solve the problems of making speech identification more difficult.

Inactive Publication Date: 2015-03-25

NANJING UNIV OF POSTS & TELECOMM

View PDF4 Cites 21 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The appearance of disguised voice will make the work of voice identification more difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0010] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0011] like figure 1 , figure 2 and image 3 As shown, the research on the characteristics of disguised speech and its speaker recognition is of great significance, which provides a reference for the improvement of the actual speaker recognition technology. Speech camouflage significantly reduces the recognition rate of the speaker system, and different camouflage types have different effects on automatic speaker recognition. There are large differences in the recognition results of different speakers' disguised speech, and some speakers are easier to identify than others. Speakers have orientation when implementing camouflage strategies, and different speakers are good at or prefer different camouflage methods. Various camouflage methods change the characteristics of the speaker in the time domain, frequency domain, and cepstrum domain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a speaker recognition method for deliberately pretended voices. Firstly, a reasonable recording scheme is set up in an anechoic room without noise and reflection for eight deliberately pretended voices, namely tone raising, tone lowering, quick speaking, slow speaking, nose nipping, mouth covering, object biting (holding a pencil in the mouth) and chewing (chewing gum), then based on pitch period presorting, the Mel frequency cepstrum coefficient and a Gauss hybrid model are used for carrying out recognition under pretending of a speaker, and finally self-adaptive group adjustment is adopted to achieve high-quality speaker recognition of pretended voices. The method can be applied to voice cases that criminals cover up identities through pretended voices.

Description

technical field [0001] The invention relates to a speaker recognition method aimed at deliberately disguising speech, and belongs to the fields of speech signal processing and speaker recognition. Background technique [0002] With the development of the times, the speaker recognition technology has made great progress, and the analysis and research of the speaker's personality characteristics of speech have been paid attention to. However, the emergence of fake speech has brought unprecedented challenges to the research work of speaker recognition. Fake speech belongs to severely distorted speech, which is relative to normal speech. In a broad sense, fake speech refers to any change, distortion or deviation from normal speech, regardless of the reason, can be called fake speech. In a narrow sense, camouflage refers to deliberate camouflage, that is, the deliberate distortion of normal speech for the purpose of concealing one's identity. [0003] In voice crime cases, in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/06G10L15/02

Inventor孙林慧杨震

OwnerNANJING UNIV OF POSTS & TELECOMM

Speaker recognition method for deliberately pretended voices

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology