Unlock instant, AI-driven research and patent intelligence for your innovation.

Long time structure vocal print-based multi-layer filtering audio frequency search method and device

A voiceprint and long-term technology, applied in voice analysis, voice recognition, special data processing applications, etc., can solve the conflict between voiceprint stability and collision rate, achieve low index collision rate, improve speed and accuracy, The effect of strong stability

Inactive Publication Date: 2011-02-23
BEIJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the object of the present invention is to provide an audio retrieval method based on long-term structural voiceprint and multi-layer filtering, which can effectively solve the problem of conflict between voiceprint stability and collision rate. For massive audio databases, the present invention It can effectively improve the retrieval accuracy, retrieval efficiency and anti-noise performance of audio retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Long time structure vocal print-based multi-layer filtering audio frequency search method and device
  • Long time structure vocal print-based multi-layer filtering audio frequency search method and device
  • Long time structure vocal print-based multi-layer filtering audio frequency search method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0035] Such as figure 1 Shown is the device block diagram of the embodiment of the present invention, including:

[0036] For the audio data in the database (unit 101), extract features, use multiple feature points with long-term structural information to construct a voiceprint (unit 102), and then use the voiceprint to construct a database index (unit 103).

[0037] In the retrieval stage, for the input query segment (unit 104), extract feature...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a sample-based audio frequency search method, namely, a long time structure vocal print-based multi-layer filtering audio frequency search method, which can search for the complete information of the entire audio frequency through a recorded audio frequency clip. The invention discloses a novel method for generating a vocal print having long time structure information and search effect is enhanced by a two-layer filtering method. The method comprises the following steps of: extracting the vocal print characteristic of an input clip; processing by using a first layer filter; calculating result reliability; determining whether second filtering is performed or not; and realizing secondary filtering by inquiring vocal print expansion. The invention also discloses a long time structure vocal print-based multi-layer filtering audio frequency search device. Experiments indicate that the accuracy of up to 99.7 percent can be reached for an audio frequency library containing 10,000 songs when an inquired clip lasts for 5 seconds and the signal-to-noise ratio is 0 db by the embodiment of the invention.

Description

technical field [0001] The invention belongs to the field of computer technology applications, and in particular relates to a method and device for querying an audio database, in particular to a content-based sample audio retrieval method, that is, to search for the complete information of the entire audio through recorded original audio clips. Background technique [0002] With the rapid development of modern information technology, especially multimedia technology and network technology, a large amount of multimedia information can be obtained from the Internet. And various audio files have become the most frequently searched objects by users in various search engines (such as Baidu, Google, etc.). The traditional audio information retrieval technology is mainly based on text, but the traditional text-based audio information retrieval cannot meet people's needs for audio retrieval. That is to say, if the user hears a piece of very familiar audio and wants to query the inf...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/08G06F17/30
Inventor 刘刚王镪郭军
Owner BEIJING UNIV OF POSTS & TELECOMM