Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaker recognition method for deliberately pretended voices

A technology for speaker recognition and recognition method, which is applied in the field of speech signal processing and speaker recognition, and can solve the problems of making speech identification more difficult.

Inactive Publication Date: 2015-03-25
NANJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The appearance of disguised voice will make the work of voice identification more difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker recognition method for deliberately pretended voices
  • Speaker recognition method for deliberately pretended voices
  • Speaker recognition method for deliberately pretended voices

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0010] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0011] like figure 1 , figure 2 and image 3 As shown, the research on the characteristics of disguised speech and its speaker recognition is of great significance, which provides a reference for the improvement of the actual speaker recognition technology. Speech camouflage significantly reduces the recognition rate of the speaker system, and different camouflage types have different effects on automatic speaker recognition. There are large differences in the recognition results of different speakers' disguised speech, and some speakers are easier to identify than others. Speakers have orientation when implementing camouflage strategies, and different speakers are good at or prefer different camouflage methods. Various camouflage methods change the characteristics of the speaker in the time domain, frequency domain, and cepstrum domain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speaker recognition method for deliberately pretended voices. Firstly, a reasonable recording scheme is set up in an anechoic room without noise and reflection for eight deliberately pretended voices, namely tone raising, tone lowering, quick speaking, slow speaking, nose nipping, mouth covering, object biting (holding a pencil in the mouth) and chewing (chewing gum), then based on pitch period presorting, the Mel frequency cepstrum coefficient and a Gauss hybrid model are used for carrying out recognition under pretending of a speaker, and finally self-adaptive group adjustment is adopted to achieve high-quality speaker recognition of pretended voices. The method can be applied to voice cases that criminals cover up identities through pretended voices.

Description

technical field [0001] The invention relates to a speaker recognition method aimed at deliberately disguising speech, and belongs to the fields of speech signal processing and speaker recognition. Background technique [0002] With the development of the times, the speaker recognition technology has made great progress, and the analysis and research of the speaker's personality characteristics of speech have been paid attention to. However, the emergence of fake speech has brought unprecedented challenges to the research work of speaker recognition. Fake speech belongs to severely distorted speech, which is relative to normal speech. In a broad sense, fake speech refers to any change, distortion or deviation from normal speech, regardless of the reason, can be called fake speech. In a narrow sense, camouflage refers to deliberate camouflage, that is, the deliberate distortion of normal speech for the purpose of concealing one's identity. [0003] In voice crime cases, in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06G10L15/02
Inventor 孙林慧杨震
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products