Audio frequency copy detection method based on similarity

A detection method and similarity technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of harming the interests of content providers, reducing the accuracy and stability of searches, and impractical searches

Inactive Publication Date: 2012-05-02
FUDAN UNIV
View PDF1 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, a series of problems followed: because the Internet audio is very flexible and open in the production, distribution, playback, transmission and other links, the number of illegal audio and pirated audio content on the Internet is increasing, which seriously damages content providers and related parties. party interests, hindering the healthy and orderly development of the network audio industry
However, using this technique, the search becomes impractical due to the computational complexity of long-term (say, up to several days) suspect audio signals or many reference audio signals
Of course, one can improve the speed by rough matching, but this will inevitably reduce the correctness and stability of the search

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio frequency copy detection method based on similarity
  • Audio frequency copy detection method based on similarity
  • Audio frequency copy detection method based on similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0115] The present invention will be further described below by taking an application as an example. There are 875 audio files in the database, and they are all in PCM format, mono, and the sampling rate is 44100. The duration of each audio file is longer than one minute but not longer than six minutes. The 875 audio files are divided into 3185 reference audio signals with a length of one minute and stored in the database. In the experiment, the audio file used for testing is also in PCM format, mono, and the sampling rate is 44100Hz. The selection of experimental parameters is as follows: the LBG algorithm generates 128 class center points through a series of training sequences, and the threshold for judging similarity is selected as 0.95, because it is found in the experiment that when the threshold is selected large enough, the reference audio signal judged is similar to the audio to be tested The position is relatively accurate, and the error is only one or two seconds. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of audio frequency information processing and in particular relates to an audio frequency copy detection method based on similarity. The method comprises the following steps: firstly establishing a reference audio frequency signal database, segmenting the signals before entering the database so that the signals have equal length, wherein the reference audio frequency signals in the database are illegal or bad information; copying and detecting the audio frequency signal according to the established reference audio frequency database, namely, orderly extracting features, generating a histogram and computing the similarity; then judging whether the input audio signal has a copy containing the reference audio frequency in the database by a parallel algorithm so as to obtain the output result, namely the result of whether the audio frequency has illegal or bad information. The method provided by the invention can be used for detecting and filtering unhealthy, violent and retroactive audio on the internet and various audio frequency copy detection application systems to prevent various bad contents from spreading.

Description

technical field [0001] The invention belongs to the technical field of audio information processing, and in particular relates to an audio copy detection method. Background technique [0002] The advancement of audio compression technology and the emergence of large-capacity storage have led to the emergence of massive audio information on the Internet. These audio information are widely used in education, entertainment, news, advertising and other fields, and become an important part of people's daily life. However, a series of problems followed: because the Internet audio is very flexible and open in the production, distribution, playback, transmission and other links, the number of illegal audio and pirated audio content on the Internet is increasing, which seriously damages content providers and related parties. The interests of other parties have hindered the healthy and orderly development of the network audio industry. At the same time, Internet audio has also become...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/00G10L15/00G10L19/02G10L25/48
Inventor 肖星星卜素亮
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products