Method for acquiring vocal print picture with refined time-frequency structure

A voiceprint, time-frequency technology, applied in speech analysis, instruments, etc., can solve the problems of unable to provide time-frequency structure, unable to provide voiceprint, etc.

Inactive Publication Date: 2012-06-20
JIANGNAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Speech is a non-stationary, multi-component, real-valued signal, and its WVD cannot provide a time-frequency structure that reflects the non-stationary characteristics of speech. Therefore, based on nonlinear time-frequency analysis methods, it cannot provide effective voiceprints.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for acquiring vocal print picture with refined time-frequency structure
  • Method for acquiring vocal print picture with refined time-frequency structure
  • Method for acquiring vocal print picture with refined time-frequency structure

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0072] A specific embodiment of the present invention is as follows:

[0073] Step 1: Record the voice signal, filter it if necessary, and store it in WAV format.

[0074] Step 2: Perform inherent modal decomposition on the recorded voice signal, and express the voice signal as the sum of a series of inherent modal functions.

[0075] Step 3: Find the instantaneous amplitude and instantaneous frequency according to the intrinsic mode function.

[0076] Step 4: Calculate the Hilbert spectrum of the signal.

[0077] Step 5: Draw the Hilbert spectrum as a three-dimensional graphic on the time-frequency plane to obtain a voiceprint image with a fine time-frequency structure.

[0078] In order to compare the actual effects of the present invention, the "five" of an adult male voice in English is used as an experimental signal (the speech signal comes from a publicly used speech database). The voiceprint diagram with fine time-frequency structure obtained by the method of the present inventio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a time-frequency analysis method for a voice signal, which displays the analysis result in a mode of a vocal print picture, and provides a method for acquiring the vocal print picture with a refined time-frequency structure. Due to the adoption of an adaptive signal decomposition method, the vocal print picture provides the refined time-frequency structure which reflects the non-stable characteristic of the voice signal, but because of the window effect of the conventional spectrogram, the conventional spectrogram have to use a long-window spectrogram and a short-window spectrogram to describe the non-stable characteristic of the voice signal. The vocal print picture with the refined time-frequency structure can be further applied to speaker recognition, vocal print identification and the like.

Description

Technical field [0001] The invention relates to a time-frequency analysis method of a speech signal, in particular to a method for obtaining a voiceprint image with a fine time-frequency structure. Background technique [0002] Voiceprint diagram is also called spectrogram, or spectrogram for short. It was invented by Bell Labs researchers in 1941. It uses three-dimensional or two-dimensional graphics to display the spectral characteristics of speech. The vertical axis represents the frequency, the horizontal axis represents the time, and the color depth. Indicates the amount of energy in a specific frequency band. The earliest analog spectrogram instrument converts sound waves into the intensity, wavelength, frequency and rhythm of electrical signals, and then draws these electrical signals into spectral graphics. [0003] The currently used spectrograms are divided into broadband (short window) spectrograms and narrowband (long window) spectrograms. Both of these spectrograms a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L19/00G10L19/02
Inventor 于凤芹
Owner JIANGNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products