Method for acquiring vocal print picture with refined time-frequency structure

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voiceprint, time-frequency technology, applied in speech analysis, instruments, etc., can solve the problems of unable to provide time-frequency structure, unable to provide voiceprint, etc.

Inactive Publication Date: 2012-06-20

JIANGNAN UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Speech is a non-stationary, multi-component, real-valued signal, and its WVD cannot provide a time-frequency structure that reflects the non-stationary characteristics of speech. Therefore, based on nonlinear time-frequency analysis methods, it cannot provide effective voiceprints.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

specific Embodiment

[0072] A specific embodiment of the present invention is as follows:

[0073] Step 1: Record the voice signal, filter it if necessary, and store it in WAV format.

[0074] Step 2: Perform inherent modal decomposition on the recorded voice signal, and express the voice signal as the sum of a series of inherent modal functions.

[0075] Step 3: Find the instantaneous amplitude and instantaneous frequency according to the intrinsic mode function.

[0076] Step 4: Calculate the Hilbert spectrum of the signal.

[0077] Step 5: Draw the Hilbert spectrum as a three-dimensional graphic on the time-frequency plane to obtain a voiceprint image with a fine time-frequency structure.

[0078] In order to compare the actual effects of the present invention, the "five" of an adult male voice in English is used as an experimental signal (the speech signal comes from a publicly used speech database). The voiceprint diagram with fine time-frequency structure obtained by the method of the present inventio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a time-frequency analysis method for a voice signal, which displays the analysis result in a mode of a vocal print picture, and provides a method for acquiring the vocal print picture with a refined time-frequency structure. Due to the adoption of an adaptive signal decomposition method, the vocal print picture provides the refined time-frequency structure which reflects the non-stable characteristic of the voice signal, but because of the window effect of the conventional spectrogram, the conventional spectrogram have to use a long-window spectrogram and a short-window spectrogram to describe the non-stable characteristic of the voice signal. The vocal print picture with the refined time-frequency structure can be further applied to speaker recognition, vocal print identification and the like.

Description

Technical field [0001] The invention relates to a time-frequency analysis method of a speech signal, in particular to a method for obtaining a voiceprint image with a fine time-frequency structure. Background technique [0002] Voiceprint diagram is also called spectrogram, or spectrogram for short. It was invented by Bell Labs researchers in 1941. It uses three-dimensional or two-dimensional graphics to display the spectral characteristics of speech. The vertical axis represents the frequency, the horizontal axis represents the time, and the color depth. Indicates the amount of energy in a specific frequency band. The earliest analog spectrogram instrument converts sound waves into the intensity, wavelength, frequency and rhythm of electrical signals, and then draws these electrical signals into spectral graphics. [0003] The currently used spectrograms are divided into broadband (short window) spectrograms and narrowband (long window) spectrograms. Both of these spectrograms a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L19/00G10L19/02

Inventor于凤芹

OwnerJIANGNAN UNIV

Method for acquiring vocal print picture with refined time-frequency structure

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

specific Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology