Massive digital audio fingerprint storage and retrieval method

A digital audio fingerprint technology, applied to audio data retrieval, digital data information retrieval, file systems, and related fields, that addresses the high complexity of music fingerprint retrieval and the excessive size of fingerprint databases, and achieves high-efficiency queries.

Inactive Publication Date: 2019-06-07
成都嗨翻屋科技有限公司

AI Technical Summary

Problems solved by technology

However, music fingerprint retrieval is more complex than search-engine retrieval.
The main reason is that the fingerprint library is of the same order of magnitude as the search word library, and thousands of fingerprints need to be retrieved for...


Examples

Embodiment Construction

[0038] The present invention is further described below in conjunction with the accompanying drawings:

[0039] Both Solr and Elasticsearch are built on Lucene. Elasticsearch (ES) is a distributed, scalable, real-time search and data analysis engine. Beyond full-text search, it also handles structured search, data analysis, complex language processing, geolocation, and relationships between objects.

[0040] The basic concept of ES:

[0041] Document: A document is an atomic unit for indexing and searching, and it is a container that contains one or more fields.

[0042] Term: A unit in search, representing a certain word in the text.

[0043] Shard: The data in an index is stored across multiple shards, which is comparable to horizontal partitioning of a table. Each shard is a Lucene instance and a complete search engine in its own right. ES uses sharding to achieve distribution. Shards are containers for data, documents are st...
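
The document, term, and shard concepts above can be illustrated with a small in-memory model. This is only a sketch and not how ES or Lucene is implemented: the class name ToyIndex, the whitespace-separated fingerprint tokens, the shard count, and the use of crc32 for shard routing are all assumptions made for illustration (ES applies its own routing hash).

```python
import zlib
from collections import defaultdict


class ToyIndex:
    """Toy model: each shard holds its own inverted index (term -> doc ids)."""

    def __init__(self, num_shards=3):
        # num_shards is a hypothetical value chosen for the example.
        self.num_shards = num_shards
        self.shards = [defaultdict(set) for _ in range(num_shards)]

    def shard_for(self, doc_id):
        # Stand-in routing: crc32 of the document id modulo the shard count.
        return zlib.crc32(doc_id.encode("utf-8")) % self.num_shards

    def index(self, doc):
        # A document is a container of fields; here "id" and "fingerprint".
        shard = self.shards[self.shard_for(doc["id"])]
        # Assumption: the fingerprint string is a whitespace-separated
        # sequence of tokens, each of which becomes one term.
        for term in doc["fingerprint"].split():
            shard[term].add(doc["id"])

    def search(self, query):
        # Query every shard and count how many distinct query terms
        # each document matches.
        hits = defaultdict(int)
        for term in set(query.split()):
            for shard in self.shards:
                for doc_id in shard.get(term, ()):
                    hits[doc_id] += 1
        # Best match first; ties broken by document id.
        return sorted(hits.items(), key=lambda kv: (-kv[1], kv[0]))


idx = ToyIndex()
idx.index({"id": "a", "fingerprint": "f1 f2 f3"})
idx.index({"id": "b", "fingerprint": "f3 f4"})
results = idx.search("f1 f3 f9")
```

Here the query shares two terms with document "a" and one with document "b", so "a" ranks first. A real deployment would rely on the engine's own scoring rather than a raw count of matching terms.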


Abstract

The invention discloses a method for storing and retrieving massive numbers of digital audio fingerprints, comprising two stages: digital audio fingerprint storage and digital audio fingerprint retrieval. Storage comprises the following steps: hashing each audio file to be stored to obtain a unique identifier for that file; extracting the audio fingerprints of each audio file to be stored, the fingerprints of each file forming a character string; using the hashed identifier of each audio file as the HBase Rowkey, with the fingerprint string as the value of one column and the meta information as the value of another column, and writing these values into an HBase table; and taking the hashed identifier and the fingerprint string of each audio file as two fields of the document corresponding to that file, and writing the document into ES. The method exploits the near-real-time full-text search and distributed nature of ES to support concurrent, real-time queries while keeping queries efficient.
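
The storage stage described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not name a hash algorithm (SHA-256 is assumed here), extract_fingerprint is a placeholder rather than a real audio fingerprinting algorithm, and the column names info:fingerprint and info:meta are hypothetical. The sketch only builds the two records; writing them to HBase and ES would be done with the respective client libraries.

```python
import hashlib
import json


def audio_identifier(audio_bytes):
    # Assumption: SHA-256 of the raw file bytes serves as the unique identifier.
    return hashlib.sha256(audio_bytes).hexdigest()


def extract_fingerprint(audio_bytes, frame_size=4):
    # Placeholder only: hashes fixed-size byte chunks to produce tokens.
    # A real system would derive fingerprints from audio features.
    tokens = []
    for start in range(0, len(audio_bytes), frame_size):
        chunk = audio_bytes[start:start + frame_size]
        tokens.append(hashlib.md5(chunk).hexdigest()[:8])
    # The fingerprints of one file form a single character string.
    return " ".join(tokens)


def build_records(audio_bytes, meta):
    ident = audio_identifier(audio_bytes)
    fingerprint = extract_fingerprint(audio_bytes)
    # HBase row: identifier as Rowkey, fingerprint and meta as two columns.
    hbase_row = {
        "rowkey": ident,
        "columns": {
            "info:fingerprint": fingerprint,  # hypothetical column name
            "info:meta": json.dumps(meta, sort_keys=True),  # hypothetical column name
        },
    }
    # ES document: identifier and fingerprint string as its two fields.
    es_doc = {"id": ident, "fingerprint": fingerprint}
    return hbase_row, es_doc


row, doc = build_records(b"abcdefgh", {"title": "demo"})
```

Because the same identifier is the HBase Rowkey and a field of the ES document, a fingerprint hit in ES can be resolved to the full record, including meta information, with a single Rowkey lookup in HBase.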

Description

Technical field

[0001] The invention belongs to the technical field of music identification, query, and retrieval, and in particular relates to a method and system for storing and retrieving massive numbers of digital audio fingerprints.

Background

[0002] At present, applications of music recognition include song identification by listening, query by humming, copyright monitoring of broadcast streams, in-car music recognition, and copyright identification of video background music (BGM). The core of these applications is to extract unique audio features to form fingerprints, and then compare the extracted fingerprints with the fingerprints of the music in the music library. In scenarios such as these that require high accuracy, as many fingerprints as possible must be retained for comparison. A single piece of music of ordinary duration may yield nearly 10,000 or even tens of thousands of fingerprints. When the music library reaches a large order of magnitude, the number of f...

Application Information

IPC(8): G06F16/63, G06F16/683, G06F16/13
Inventor 尹学渊, 王东明
Owner 成都嗨翻屋科技有限公司