
Voice retrieval method and system based on audio fingerprints

An audio fingerprint retrieval technology, applicable to speech analysis, speech recognition, and digital information retrieval. It addresses the low retrieval efficiency and poor retrieval robustness of existing methods on long speech segments, and improves retrieval efficiency.

Pending Publication Date: 2020-12-04
LANZHOU UNIVERSITY OF TECHNOLOGY

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide a voice retrieval method and system based on audio fingerprints that solves the low retrieval efficiency and poor retrieval robustness of existing audio fingerprint methods on long speech segments.



Examples


Embodiment Construction

[0064] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0065] The purpose of the present invention is to provide a voice retrieval method and system based on audio fingerprints, which can improve the retrieval efficiency of long speech segments and the robustness of audio fingerprint retrieval.

[0066] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompa...


Abstract

The invention relates to a voice retrieval method and system based on audio fingerprints. The method comprises the following steps: extracting Mel frequency cepstrum coefficient (MFCC) features and linear prediction cepstrum coefficient (LPCC) features of original voice with a duration of 20 s; performing feature combination processing on the MFCC features and the LPCC features, and determining a combined feature matrix; performing column dimension reduction on the combined feature matrix based on an information entropy feature dimension reduction method, and determining a feature matrix after column dimension reduction; based on an energy-based feature dimension reduction method, performing row dimension reduction on the feature matrix after column dimension reduction, and determining a feature matrix after row dimension reduction; constructing an audio fingerprint database according to the feature matrix subjected to row dimension reduction; and performing matching retrieval on the to-be-queried voice segment and the audio fingerprints in the audio fingerprint library by using a normalized Hamming distance algorithm. According to the invention, the retrieval efficiency and retrieval precision of long voice segments and the retrieval robustness of the audio fingerprint can be improved.
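
The steps listed in the abstract can be sketched roughly as below. The text shown here does not give the formulas, so every concrete choice is an assumption made for illustration: the histogram-based entropy estimate, keeping the highest-entropy columns, collapsing blocks of frames into per-column energies, and the mean-threshold binarization (added only because Hamming matching needs bits). MFCC and LPCC extraction is assumed to have happened elsewhere, and all function names are hypothetical.

```python
import math


def combine_features(mfcc, lpcc):
    """Join per-frame MFCC and LPCC vectors into one row per frame."""
    if len(mfcc) != len(lpcc):
        raise ValueError("MFCC and LPCC must have the same number of frames")
    return [m + l for m, l in zip(mfcc, lpcc)]


def column_entropy(column, bins=8):
    """Shannon entropy (bits) of one column, estimated from a histogram.

    Assumption: equal-width bins over the column's own value range.
    """
    lo, hi = min(column), max(column)
    if hi == lo:
        return 0.0  # a constant column carries no information
    counts = [0] * bins
    for v in column:
        idx = min(int((v - lo) / (hi - lo) * bins), bins - 1)
        counts[idx] += 1
    n = len(column)
    return -sum(c / n * math.log2(c / n) for c in counts if c)


def reduce_columns(matrix, keep):
    """Keep the `keep` highest-entropy columns, in their original order."""
    cols = list(zip(*matrix))
    ranked = sorted(range(len(cols)),
                    key=lambda j: column_entropy(cols[j]), reverse=True)
    chosen = sorted(ranked[:keep])
    return [[row[j] for j in chosen] for row in matrix]


def reduce_rows(matrix, block):
    """Collapse each run of `block` frames into per-column energies.

    Assumption: energy is the sum of squared values within the block;
    trailing frames that do not fill a block are dropped.
    """
    out = []
    for start in range(0, len(matrix) - block + 1, block):
        chunk = matrix[start:start + block]
        out.append([sum(v * v for v in col) for col in zip(*chunk)])
    return out


def binarize(matrix):
    """Flatten to a bit list: 1 where a value exceeds the overall mean."""
    flat = [v for row in matrix for v in row]
    if not flat:
        raise ValueError("empty feature matrix")
    mean = sum(flat) / len(flat)
    return [1 if v > mean else 0 for v in flat]


def fingerprint(mfcc, lpcc, keep, block):
    """Run the four hypothetical stages end to end."""
    combined = combine_features(mfcc, lpcc)
    return binarize(reduce_rows(reduce_columns(combined, keep), block))
```

Calling `fingerprint(mfcc, lpcc, keep=2, block=2)` on two small frame lists returns a flat list of bits. In this sketch, the column count kept and the block length are tuning knobs whose suitable values would depend on the feature dimensions and frame rate actually used.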

Description

Technical field

[0001] The invention relates to the field of audio retrieval, in particular to a voice retrieval method and system based on audio fingerprints.

Background

[0002] With the explosive growth of digital audio on the Internet, high-speed retrieval in audio big data has become an urgent problem. Audio fingerprint retrieval uses compact fingerprint data instead of the audio itself, which can effectively improve retrieval efficiency. However, the amount of fingerprint data corresponding to audio big data is also quite large, and traditional audio fingerprint retrieval methods struggle to meet the requirements for fast and accurate retrieval in a big-data environment. Audio retrieval technology has therefore attracted wide attention from researchers.

[0003] At present, scholars have proposed many methods in audio fingerprinting, feature extraction, dimensionali...
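
To illustrate retrieval over compact fingerprints rather than raw audio, here is a minimal sketch of matching by normalized Hamming distance, the measure named in the abstract. Fingerprints are assumed to be lists of 0/1 bits. The sliding-window comparison, the `threshold` value, and the function names are hypothetical choices, not details taken from the patent.

```python
def normalized_hamming(a, b):
    """Fraction of positions at which two equal-length bit lists differ."""
    if len(a) != len(b) or not a:
        raise ValueError("fingerprints must be non-empty and equal in length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def retrieve(query, database, threshold=0.35):
    """Return (clip_id, distance) for the closest stored fingerprint.

    Assumption: the query may be shorter than a stored fingerprint, so it
    is slid across each one and the smallest distance is kept. A result
    above `threshold` (an illustrative value) is reported as no match.
    """
    best_id, best_dist = None, float("inf")
    for clip_id, stored in database.items():
        for offset in range(len(stored) - len(query) + 1):
            window = stored[offset:offset + len(query)]
            dist = normalized_hamming(query, window)
            if dist < best_dist:
                best_id, best_dist = clip_id, dist
    if best_dist > threshold:
        return None, best_dist
    return best_id, best_dist
```

A distance of 0 means the bits agree everywhere and 1 means they differ everywhere, so the threshold trades false matches against missed matches. This linear scan is only meant to show the distance measure; it says nothing about how the patented system indexes its fingerprint library.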


Application Information

IPC(8): G06F16/635, G06F16/683, G10L15/02, G10L15/08, G10L25/24
CPC: G06F16/635, G06F16/683, G10L15/02, G10L15/08, G10L25/24, Y02D10/00
Inventor: 张秋余, 许福久, 张其文, 段宏湘, 白建, 赵雪娇
Owner LANZHOU UNIVERSITY OF TECHNOLOGY