Audio similarity determination method and apparatus, and storage medium
A technique for determining audio similarity, applied in the field of communications, which addresses problems of existing solutions such as narrow applicability, inapplicability to certain audio, and the inability to extract MIDI feature files, thereby achieving the effect of improved applicability
Pending Publication Date: 2018-05-11
TENCENT TECH (SHENZHEN) CO LTD
Problems solved by technology
[0004] During research and practice of the prior art, the inventors of the present invention found that, since a MIDI feature file mainly reflects the pitch and frequency of the audio at each sampling point, for songs the MIDI features will be more obv...
Abstract
Embodiments of the present invention disclose an audio similarity determination method and apparatus, and a storage medium. The method may comprise: performing normalization processing and high-pass filtering on first audio data and second audio data respectively, determining the short-time energy distributions of the first audio data and the second audio data respectively, and calculating the similarity between the first audio data and the second audio data based on the obtained short-time energy distributions. The scheme of the embodiments of the present invention not only can effectively and accurately calculate the similarity but can also be applied to most application scenarios, improving the applicability of the scheme.
Technology Topic: Speech recognition; Energy distribution
Examples
Example Embodiment
[0041] Embodiment 1.
[0042] In this embodiment, description will be made from the perspective of an apparatus for determining audio similarity, which may specifically be integrated in a server or other equipment.
[0043] A method for determining audio similarity, comprising: acquiring first audio data and second audio data; respectively performing normalization processing and high-pass filtering on the first audio data and the second audio data to obtain first filtered data corresponding to the first audio data and second filtered data corresponding to the second audio data; determining the short-term energy distributions of the first filtered data and the second filtered data respectively, to obtain first distribution information corresponding to the first filtered data and second distribution information corresponding to the second filtered data; and calculating the similarity between the first audio data and the second audio data based on the first distribution information and the second distribution information.
[0044] As shown in Figure 1b, the specific process of the method for determining audio similarity may be as follows:
[0045] 101. Acquire first audio data and second audio data.
[0046] For example, the first audio file may be acquired, the first audio data may be extracted from the first audio file, and the second audio file may be acquired, and the second audio data may be extracted from the second audio file, and so on.
[0047] Optionally, in order to reduce interference, reduce differences between audio files caused by interference, and improve the accuracy of the calculation, the audio files can be transcoded and their parameter formats unified when extracting the audio data; that is, optionally, the step of "acquiring the first audio data and the second audio data" may include:
[0048] obtaining the first audio file, transcoding the first audio file according to a preset transcoding strategy, setting preset parameters in the transcoded first audio file according to a preset parameter setting rule, and extracting the first audio data from the first audio file after the parameters are set;
[0049] and obtaining the second audio file, transcoding the second audio file according to the preset transcoding strategy, setting preset parameters in the transcoded second audio file according to the preset parameter setting rule, and extracting the second audio data from the second audio file after the parameters are set.
[0050] The preset transcoding strategy and preset parameter setting rules can be set according to actual application requirements. For example, the audio files (including the first audio file and the second audio file) can be converted into the uncompressed wav (a sound file format) format, and the parameters can be set to: a sampling frequency of 44100 Hz, a bit rate of 96k, and a mono channel, and so on.
[0051] For example, take the case where the first audio file is the original audio file of a character's dubbing and the second audio file is a user audio file recorded by the user for that character. The original audio file and the user audio file can each be converted into the uncompressed wav format, the sampling frequency of the converted files set to 44100 Hz, the bit rate to 96k, and the channel to mono, and so on. Then, the audio data is extracted from the original audio file after the parameters are set to obtain the first audio data, and from the user audio file after the parameters are set to obtain the second audio data.
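As a concrete illustration of this transcoding step, the following Python sketch (a hypothetical helper, assuming the ffmpeg binary is installed and on the PATH) converts an input file to uncompressed 16-bit PCM wav at a 44100 Hz sampling frequency with a mono channel, mirroring the transcoding instruction given later in Embodiment 2:

    import subprocess

    def transcode_to_wav(src_path, dst_path):
        # Convert any input audio into uncompressed 16-bit PCM wav,
        # 44100 Hz sampling frequency, mono channel.
        subprocess.run([
            "ffmpeg", "-y",          # overwrite the output if it exists
            "-i", src_path,          # input audio file
            "-ar", "44100",          # sampling frequency
            "-ac", "1",              # mono channel
            "-acodec", "pcm_s16le",  # 16-bit signed little-endian PCM
            dst_path,
        ], check=True)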
[0052] 102. Perform normalization processing and high-pass filtering on the first audio data and the second audio data, respectively, to obtain first filtered data corresponding to the first audio data and second filtered data corresponding to the second audio data. For example, it can be as follows:
[0053] (1) The first audio data and the second audio data are sampled respectively to obtain the first sampling point set corresponding to the first audio data and the second sampling point set corresponding to the second audio data, specifically as follows:
[0054] A1. Sampling the first audio data to obtain a first set of sampling points.
[0055] The sampling method can be determined according to actual application requirements. For example, a signed number can be read every 16 bits as one sampling point, and the obtained sampling points added to the same set to obtain the first sampling point set.
[0056] For another example, it is also possible to perform sampling by reading a signed number every 8 bits as a sampling point, and adding the obtained sampling points to the same set to obtain the first sampling point set, and so on.
[0057] A2. Sampling the second audio data to obtain a second set of sampling points.
[0058] Similar to the sampling of the first audio data, the second audio data can be sampled in various ways. For example, a signed number can be read every 16 bits as one sampling point, and the obtained sampling points added to the same set to obtain the second sampling point set. For another example, a signed number can be read every 8 bits as one sampling point, and so on; the sampling method can be set according to actual application requirements.
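A minimal sketch of the 16-bit sampling just described, assuming the audio has already been transcoded to 16-bit signed PCM wav (the function name is hypothetical):

    import wave
    import numpy as np

    def sample_wav(path):
        # Read every 16 bits as one signed number (one sampling point)
        # and collect all sampling points in a single array (the "set").
        with wave.open(path, "rb") as wf:
            raw = wf.readframes(wf.getnframes())
        return np.frombuffer(raw, dtype=np.int16)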
[0059] It should be noted that, the steps A1 and A2 may be executed in no particular order.
[0060] (2) Normalize all sampling points in the first sampling point set and all sampling points in the second sampling point set, respectively, to obtain first processed data corresponding to the first sampling point set and second processed data corresponding to the second sampling point set; for example, the details may be as follows:
[0061] B1. Perform normalization processing on all sampling points in the first sampling point set to obtain first processed data.
[0062] For example, the absolute maximum value (also referred to as the maximum value, i.e., max-value) over all sampling points in the first sampling point set may be calculated, and then all sampling points in the first sampling point set normalized by this absolute maximum value to obtain the first processed data.
[0063] Here, normalization refers to converting the signals of these sampling points into a unified standard form. For example, since the amplitudes of the sampling points are distributed over a relatively wide range, normalization can be used to adjust the amplitudes of these sampling points into a preset interval, and so on. That is, the step of "normalizing all sampling points in the first sampling point set by the absolute maximum value to obtain the first processed data" may specifically be:
[0064] The amplitudes of all the sampling points in the first sampling point set are adjusted to be within a preset interval according to the maximum absolute value, so as to obtain the first processed data.
[0065] The preset interval can be set according to actual application requirements. For example, taking the preset interval as [-1, 1], the following formula can be used to normalize all sampling points in the first sampling point set:
[0066] x_t(i) = x(i) / max_value
[0067] where x_t(i) is the normalized amplitude of the i-th sampling point, whose range is [-1, 1], x(i) is the original amplitude of the i-th sampling point, whose range is generally [-32768, 32767], and max_value is the absolute maximum value calculated above.
[0068] After the amplitudes of all sampling points in the first sampling point set are adjusted according to the above normalization formula, the first processed data x(n) is obtained.
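A minimal sketch of this normalization, under the assumption that the samples are 16-bit signed integers and at least one sample is nonzero (function name hypothetical):

    import numpy as np

    def normalize(samples):
        # x_t(i) = x(i) / max_value: scale amplitudes into [-1, 1]
        # by the absolute maximum over all sampling points.
        max_value = np.max(np.abs(samples))
        return samples.astype(np.float64) / max_value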
[0069] B2. Perform normalization processing on all sampling points in the second sampling point set to obtain second processed data.
[0070] For example, the absolute maximum value (also referred to as the maximum value, i.e., max-value) over all sampling points in the second sampling point set may be calculated, and then all sampling points in the second sampling point set normalized by this absolute maximum value to obtain the second processed data.
[0071] As before, normalization refers to converting the signals of these sampling points into a unified standard form, for example adjusting the amplitudes of the sampling points into a preset interval. That is, the step of "normalizing all sampling points in the second sampling point set by the absolute maximum value to obtain the second processed data" may specifically be:
[0072] The amplitudes of all the sampling points in the second sampling point set are adjusted to be within the preset interval according to the maximum absolute value, so as to obtain the second processed data.
[0073] The preset interval can be set according to actual application requirements. For example, taking the preset interval as [-1, 1], the following formula can be used to normalize all sampling points in the second sampling point set:
[0074] x_t(i) = x(i) / max_value
[0075] where x_t(i) is the normalized amplitude of the i-th sampling point, whose range is [-1, 1], and x(i) is the original amplitude of the i-th sampling point, whose range is generally [-32768, 32767].
[0076] After the amplitudes of all sampling points in the second sampling point set are adjusted according to the above normalization formula, the second processed data is obtained. Since the normalization formula used here is the same as the one used for the first processed data, x(n) is also used in this step to denote the second processed data. It should be understood that the parameters in the formulas of the embodiments of the present invention are generic rather than tied to specific data; for example, x(n) here denotes whatever data is obtained by adjusting the amplitudes of all sampling points in a given sampling point set according to the above normalization formula, not only the first processed data or the second processed data. Subsequent parameters such as y(n) are treated similarly and will not be explained again.
[0077] It should be noted that, the execution of steps B1 and B2 may be performed in no particular order.
[0078] (3) Filter the first processed data and the second processed data by using a high-pass filter, respectively, to obtain first filtered data corresponding to the first audio data and second filtered data corresponding to the second audio data. For example, it can be as follows:
[0079] C1. Use a high-pass filter to filter the first processed data to obtain the first filtered data.
[0080] For example, a first-order high-pass filter can be directly used, for example, a first-order high-pass filter of 6dB/octave is used to filter the first processed data to obtain the first filtered data.
[0081] In addition, the average power spectrum of a speech signal (such as the first processed data) is affected by glottal excitation and radiation from the mouth and nose: after the speech signal is radiated from the lips, its high-frequency end is attenuated by 6 dB/octave above 800 Hz. Therefore, optionally, before filtering, the speech signal may be boosted; this boosting is called "pre-emphasis". The purpose of pre-emphasis is to strengthen the high-frequency part, weaken the low frequencies, and flatten the signal spectrum so that subsequent spectrum analysis and channel parameter analysis can be performed. That is, the step of "using a high-pass filter to filter the first processed data to obtain the first filtered data" may specifically be:
[0082] Pre-emphasis is performed on the first processed data, and a high-pass filter is used to filter the pre-emphasized first processed data to obtain first filtered data.
[0083] For example, a first-order high-pass filter, such as a 6 dB/octave first-order high-pass filter, can be used to pre-emphasize the first processed data and filter the pre-emphasized first processed data to obtain the first filtered data, expressed by the formula:
[0084] y(n)=1.0*x(n)-u*x(n-1)
[0085] Wherein, in this step, y(n) is the first filtered data, x(n) is the first processed data, and u is the pre-emphasis coefficient. The value of u can be determined according to the requirements of the actual application, and the value range of u is [0.9, 1.0], for example, it can be 0.9375, and so on.
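A minimal sketch of this pre-emphasis filter, assuming the input is the normalized sample array and taking the first output sample equal to the first input sample since it has no predecessor (an assumption; the patent does not specify the boundary handling):

    import numpy as np

    def pre_emphasis(x, u=0.9375):
        # y(n) = 1.0*x(n) - u*x(n-1), with u in [0.9, 1.0].
        y = np.empty_like(x)
        y[0] = x[0]              # no predecessor for the first sample
        y[1:] = x[1:] - u * x[:-1]
        return y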
[0086] C2. Use a high-pass filter to filter the second processed data to obtain second filtered data.
[0087]For example, a first-order high-pass filter may be directly used, for example, a first-order high-pass filter of 6dB/octave is used to filter the second processed data to obtain the second filtered data.
[0088] Optionally, in order to strengthen the high-frequency part, weaken the low frequencies, and flatten the signal spectrum so that spectrum analysis and channel parameter analysis can be performed later, the second processed data can also be pre-emphasized; that is, the step of "using a high-pass filter to filter the second processed data to obtain the second filtered data" may include:
[0089] The second processed data is pre-emphasized, and a high-pass filter is used to filter the pre-emphasized second processed data to obtain second filtered data.
[0090] For example, a first-order high-pass filter, such as a 6 dB/octave first-order high-pass filter, can be used to pre-emphasize the second processed data and filter the pre-emphasized second processed data to obtain the second filtered data, expressed by the formula:
[0091] y(n)=1.0*x(n)-u*x(n-1)
[0092] Wherein, in this step, y(n) is the second filtered data, x(n) is the second processed data, and u is the pre-emphasis coefficient. The value of u can be determined according to the requirements of the actual application, and the value range of u is [0.9, 1.0], for example, it can be 0.9375, and so on.
[0093] It should be noted that, the steps C1 and C2 may be executed in no particular order.
[0094] 103. Determine the short-term energy distributions of the first filtered data and the second filtered data respectively, and obtain first distribution information corresponding to the first filtered data and second distribution information corresponding to the second filtered data; for example, the details can be as follows:
[0095] (1) Determine the short-term energy distribution of the first filtered data to obtain first distribution information.
[0096] Optionally, since the first filtered data is very long and difficult to process at one time, the first filtered data may be processed in segments. For example, the first filtered data can be segmented, the short-term energy distribution of each segment determined separately, and the short-term energy distributions of all segments aggregated to obtain the first distribution information, and so on.
[0097] Optionally, since the segmented data has no obvious periodicity, subsequent convolution is inconvenient. Therefore, a Hamming window can be used when segmenting; in this way, the segmented data has obvious periodicity, with the data in one window representing one cycle. That is, the step of "determining the short-term energy distribution of the first filtered data to obtain the first distribution information" can specifically be as follows:
[0098] A Hamming window function is obtained, a dot-multiplication operation is performed on the first filtered data (multiplying the data with itself point by point, i.e., squaring it), and the result of the operation is convolved with the Hamming window function to obtain the first distribution information.
[0099] For the n-th frame signal y_n(m) of audio data y(n) (such as the first filtered data), the following relationship holds:
[0100] y_n(m) = w(n - m) * y(m)
[0101] where 0 ≤ m ≤ N - 1
[0102] w(n) = 1 for 0 ≤ n ≤ N - 1, and w(n) = 0 otherwise
[0103] where n = 0, 1T, 2T, ..., N is the frame length, and T is the frame shift length.
[0104] If the short-term energy of the n-th frame signal y_n(m) is denoted by e_n, then it is expressed by the formula:
[0105] e_n = sum_{m=0}^{N-1} [w(n - m) * y(m)]^2
[0106] Therefore, the short-term energy E_n of the audio data y(n) (such as the first filtered data) is:
[0107] E_n = sum_m y^2(m) * h(n - m)
[0108] Among them, h(n-m) is the Hamming window function (referred to as Hamming window).
[0109] It should be noted that after the Hamming window is applied, the data in the middle of the window is reflected while the data on both sides is attenuated and lost. Therefore, when doing the convolution, the window can only be moved by 1/3 or 1/2 of its length at a time; in this way, the data lost by the previous frame or the previous two frames is reflected in the window again, which avoids data loss.
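A minimal sketch of this short-time energy computation, assuming a 1024-point Hamming window and a hop of half the window (both hypothetical choices; the text only requires a hop of 1/3 or 1/2 of the window):

    import numpy as np

    def short_time_energy(y, win_len=1024, hop_ratio=0.5):
        # E_n = sum_m y^2(m) * h(n - m): convolve the squared signal
        # with the Hamming window, then keep one value per hop so
        # successive windows overlap and no data is lost at the edges.
        h = np.hamming(win_len)
        energy = np.convolve(y ** 2, h, mode="same")
        hop = int(win_len * hop_ratio)
        return energy[::hop]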
[0110] (2) Determine the short-term energy distribution of the second filtered data to obtain second distribution information.
[0111] Similar to the processing of the first filtered data, since the second filtered data is very long and difficult to process at one time, the second filtered data can be processed in segments. For example, the second filtered data may be segmented, the short-term energy distribution of each segment determined separately, and the short-term energy distributions of all segments aggregated to obtain the second distribution information, and so on.
[0112] Optionally, in order to make the data obtained by segmentation have obvious periodicity and facilitate subsequent convolution, a Hamming window can be used to perform segmentation, wherein the data in one window represents one cycle. That is, the step "determine the short-term energy distribution of the second filtered data, and obtain the second distribution information" can be specifically as follows:
[0113] A Hamming window function is obtained (consistent with the Hamming window function used for the first filtered data), a dot-multiplication operation is performed on the second filtered data, and the result of the operation is convolved with the Hamming window function to obtain the second distribution information, expressed by the formula:
[0114] E_n = sum_m y^2(m) * h(n - m)
[0115] where y(n) is the second filtered data and h(n-m) is the Hamming window function (referred to as the Hamming window); for the detailed analysis of this formula, refer to the analysis for the first filtered data above, which is not repeated here.
[0116] It should be noted that, similar to the processing of the first filtered data, after the Hamming window is applied, the window can only be moved by 1/3 or 1/2 of its length at a time during convolution in order to avoid data loss, so that the data lost by the previous frame or the previous two frames is reflected in the window again.
[0117] It should also be noted that, in step 103, steps (1) and (2) may be executed in no particular order.
[0118] 104. Calculate the similarity between the first audio data and the second audio data based on the first distribution information and the second distribution information.
[0119] Since the first distribution information and the second distribution information are both data matrices, the similarity between the first audio data and the second audio data can be obtained by calculating the cosine similarity between the two data matrices. Cosine similarity is a method that evaluates how similar two vectors are by calculating the cosine of the angle between them. That is, the step of "calculating the similarity between the first audio data and the second audio data based on the first distribution information and the second distribution information" may include:
[0120] The cosine similarity between the first distribution information and the second distribution information is calculated to obtain the similarity between the first audio data and the second audio data.
[0121] It should be noted that, since the lengths of the first audio data and the second audio data may differ, in order to facilitate the subsequent calculation of the cosine similarity between the first distribution information and the second distribution information, zeros may be appended to the end of the shorter of the two so that the numbers of sampling points of the first audio data and the second audio data are kept consistent.
[0122] Among them, the cosine similarity formula is as follows:
[0123] Similarity = cos(theta) = (A · B) / (||A|| * ||B||)
[0124] where A is the vector of the short-term energy distribution of the first audio data, that is, the vector of the first distribution information; B is the vector of the short-term energy distribution of the second audio data, that is, the vector of the second distribution information; and Similarity is the similarity between the first audio data and the second audio data, which in the embodiments of the present invention mainly refers to the similarity in intonation of the two pieces of audio data (ignoring the interference of timbre).
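A minimal sketch of this cosine similarity with the zero-padding described above (function name hypothetical):

    import numpy as np

    def cosine_similarity(a, b):
        # Append zeros to the shorter distribution so lengths match,
        # then compute Similarity = (A . B) / (||A|| * ||B||).
        n = max(len(a), len(b))
        a = np.pad(a, (0, n - len(a)))
        b = np.pad(b, (0, n - len(b)))
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))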
[0125] Optionally, since most recordings generally have a long silence segment at the beginning and/or end, and this silence segment is of little significance for calculating similarity, the silence segments can be removed before calculation in order to reduce the amount of computation and improve efficiency; that is, before the step of "calculating the cosine similarity between the first distribution information and the second distribution information", the method for determining audio similarity may further include:
[0126] The first effective distribution information is obtained by removing the silence segments at the beginning and the end of the first distribution information; and the second effective distribution information is obtained by removing the silence segments at the beginning and the end of the second distribution information.
[0127] At this time, the step of "calculating the cosine similarity between the first distribution information and the second distribution information" may specifically be: calculating the cosine similarity between the first effective distribution information and the second effective distribution information.
[0128] The first and last silence segments are the sampling points at the head and tail of the audio data whose energy values are lower than a preset threshold. The preset threshold can be set according to actual application requirements; for example, sampling points whose energy value is lower than 0.025 can be treated as mute, and the beginning and end of the audio scanned against this threshold to remove the leading and trailing silence segments and obtain an effective short-term energy distribution, which is not repeated here.
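A minimal sketch of this silence removal, applied to a short-term energy distribution with the example threshold of 0.025 (function name hypothetical):

    import numpy as np

    def trim_silence(energy, threshold=0.025):
        # Drop leading and trailing points whose energy is below the
        # mute threshold, keeping the effective distribution between.
        voiced = np.where(energy >= threshold)[0]
        if len(voiced) == 0:
            return energy[:0]  # the whole signal is silent
        return energy[voiced[0]:voiced[-1] + 1]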
[0129] It can be seen from the above that, in this embodiment, normalization processing and high-pass filtering can be performed on the first audio data and the second audio data respectively, their short-term energy distributions determined respectively, and the similarity between the first audio data and the second audio data calculated based on the obtained short-term energy distributions. Because the short-term energy of various kinds of audio data, such as songs or speech signals, changes noticeably over time, and short-term energy effectively reflects the magnitude of the signal amplitude, sound versus silence, and so on, the similarity of two pieces of audio data can be calculated effectively by this solution even when the audio data is a speech signal. Therefore, compared with the existing solution, this solution can not only calculate the similarity effectively and accurately but can also be applied to most application scenarios, which greatly improves its applicability.
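Putting the pieces together, the following sketch chains the hypothetical helpers defined above (sample_wav, normalize, pre_emphasis, short_time_energy, trim_silence, cosine_similarity) into the full pipeline of this embodiment; it assumes both inputs have already been transcoded to 16-bit mono wav:

    def audio_similarity(wav_a, wav_b):
        # sample -> normalize -> pre-emphasize/filter -> short-time
        # energy -> trim silence -> cosine similarity.
        distributions = []
        for path in (wav_a, wav_b):
            x = normalize(sample_wav(path))
            y = pre_emphasis(x)
            distributions.append(trim_silence(short_time_energy(y)))
        return cosine_similarity(distributions[0], distributions[1])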
Example Embodiment
[0130] Embodiment 2.
[0131] The method described in the preceding embodiment is described in further detail below by way of example.
[0132] In this embodiment, description is given taking as an example that the apparatus for determining audio similarity is integrated in a server, the first audio file is an original audio file (i.e., an original dubbing file), and the second audio file is a user audio file.
[0133] As shown in Figure 2a, the specific process of the method for determining audio similarity can be as follows:
[0134] 201. The server acquires an original audio file, extracts first audio data from the original audio file, and acquires a user audio file, and extracts second audio data from the user audio file.
[0135] For example, after obtaining the original audio file, the server can transcode the original audio file according to a preset transcoding strategy, such as converting the original audio file into the uncompressed wav format, and set the preset parameters in the transcoded original audio file according to the preset parameter setting rule, such as setting the sampling frequency of the original audio file to 44100 Hz, the bit rate to 96k, and the channel to mono, and so on. Then, the first audio data is extracted from the original audio file after the parameters are set.
[0136] Similarly, after obtaining the user audio file, the server can transcode the user audio file according to the preset transcoding strategy, for example converting the user audio file into the uncompressed wav format, and set the preset parameters in the transcoded user audio file according to the preset parameter setting rule, such as setting the sampling frequency of the user audio file to 44100 Hz, the bit rate to 96k, and the channel to mono, and so on. Then, the second audio data is extracted from the user audio file after the parameters are set.
[0137] The preset transcoding strategy and the preset parameter setting rule may be set according to actual application requirements, which will not be repeated here. For example, the transcoding instruction may be as follows:
[0138] ./ffmpeg -y -i local_file -ar 44100 -ac 1 -acodec pcm_s16le wav_file
[0139] The manner of acquiring the original audio file and the user audio file can be determined according to the requirements of the actual application scenario. For example, taking the original audio file as the original dubbing of character A in a certain game K, the original dubbing of character A can be obtained from a specified local storage or other storage device to obtain the original audio file, and the user audio file can be obtained by receiving speech recorded by the user. For example, referring to Figure 2b, the user can record by tapping "click to record" in the interface and following the line prompt in the interface, "the script has started, the hunting time has started la la la"; after receiving the user's recording, the server saves it as the user audio file.
[0140] Optionally, in order to help users dub better, a listening entry for the "original audio file" can also be provided in this interface. For example, referring to the "listen to the original sound" trigger key in Figure 2b, the user can click or slide this trigger key to listen to the original dubbing file of character A, which is not repeated here.
[0141] 202. The server normalizes the first audio data to obtain the first processed data, and then executes step 203.
[0142] For example, it can be as follows:
[0143] (1) The server samples the first audio data to obtain a first set of sampling points.
[0144] The sampling method can be determined according to actual application requirements. For example, a signed number can be read every 16 bits as one sampling point, and the obtained sampling points added to the same set to obtain the first sampling point set.
[0145] Figure 2c is a schematic diagram of the sampling result obtained by sampling certain audio data by reading a signed number every 16 bits as one sampling point.
[0146] (2) The server normalizes all the sampling points in the first sampling point set to obtain the first processed data.
[0147] For example, the absolute maximum value (also referred to as the maximum value, i.e., max-value) over all sampling points in the first sampling point set may be calculated, and then, according to this absolute maximum value, the amplitudes of all sampling points in the first sampling point set adjusted into the preset interval to obtain the first processed data.
[0148] The preset interval can be set according to actual application requirements. For example, taking the preset interval as [-1, 1], the following formula can be used to normalize all sampling points in the first sampling point set:
[0149] x_t(i) = x(i) / max_value
[0150] where x_t(i) is the normalized amplitude of the i-th sampling point, whose range is [-1, 1], and x(i) is the original amplitude of the i-th sampling point, whose range is generally [-32768, 32767].
[0151] After adjusting the amplitudes of all sampling points in the first sampling point set according to the above normalization formula, the first processed data x(n) is obtained. For example, Figure 2d is a schematic diagram of the result obtained after normalizing the sampling points in Figure 2c.
[0152] 203. The server uses a high-pass filter to filter the first processed data to obtain the first filtered data, and then executes step 204.
[0153] Since the average power spectrum of a speech signal (such as the first processed data) is affected by glottal excitation and radiation from the mouth and nose, the high-frequency end is attenuated by 6 dB/octave above 800 Hz after the speech signal is radiated from the lips. Therefore, a first-order high-pass filter (such as a 6 dB/octave first-order high-pass filter) can be used to pre-emphasize the first processed data (which weakens the low frequencies and flattens the signal spectrum for subsequent spectrum analysis and channel parameter analysis), and this high-pass filter, such as a 6 dB/octave first-order high-pass filter, used to filter the pre-emphasized first processed data to obtain the first filtered data. The formula is expressed as:
[0154] y(n)=1.0*x(n)-u*x(n-1)
[0155] Wherein, in this step, y(n) is the first filtered data, x(n) is the first processed data, and u is the pre-emphasis coefficient. The value of u can be determined according to the requirements of the actual application, and the value range of u is [0.9, 1.0], for example, it can be 0.9375, and so on.
[0156] For example, see Figure 2e, which is a schematic diagram of the effect obtained by filtering the normalized result of the sampling points in Figure 2c (i.e., the filtering result).
[0157] 204. The server determines the short-term energy distribution of the first filtered data to obtain first distribution information.
[0158] Optionally, since the first filtered data is very long and difficult to process at one time, the first filtered data may be processed in segments. For example, the server may segment the first filtered data, determine the short-term energy distribution of each segment separately, and aggregate the short-term energy distributions of all segments to obtain the first distribution information, and so on.
[0159] Optionally, since the segmented data has no obvious periodicity, subsequent convolution is inconvenient. Therefore, a Hamming window can be used when segmenting; in this way, the segmented data has obvious periodicity, with the data in one window representing one cycle. That is, the step of "the server determines the short-term energy distribution of the first filtered data and obtains the first distribution information" may specifically be as follows:
[0160] The server obtains the Hamming window function, performs a dot-multiplication operation on the first filtered data, and convolves the result of the operation with the Hamming window function to obtain the first distribution information, expressed by the formula:
[0161] E_n = sum_m y^2(m) * h(n - m)
[0162] where h(n-m) is the Hamming window function and y_n(m) is the n-th frame signal of audio data y(n) (such as the first filtered data), satisfying the relationship:
[0163] y_n(m) = w(n - m) * y(m)
[0164] where 0 ≤ m ≤ N - 1
[0165] w(n) = 1 for 0 ≤ n ≤ N - 1, and w(n) = 0 otherwise
[0166] where n = 0, 1T, 2T, ..., N is the frame length, and T is the frame shift length.
[0167] For the specific derivation process of the short-term energy distribution formula, reference may be made to the foregoing embodiments, and details are not described herein again.
[0168] The Hamming window function can be determined according to actual application requirements. For example, the following Hamming window function can be used:
[0169] h(n, a) = (1 - a) - a * cos(2 * PI * n / (M - 1)), 0 ≤ n ≤ M - 1
[0170] where h(n, a) is the value of the Hamming window with window parameter a at the n-th point (called the Hamming window function in the embodiments of the present invention), PI is pi, M is the size of the window, and a is a constant whose value can be determined according to actual application requirements; for example, a can be 0.46, and so on.
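A short numerical check of this window function with a = 0.46 (the window size M = 1024 is an assumed example); with this parameter the formula coincides with the standard Hamming window:

    import numpy as np

    M = 1024                     # window size (assumed example)
    a = 0.46                     # Hamming window parameter
    n = np.arange(M)
    h = (1 - a) - a * np.cos(2 * np.pi * n / (M - 1))
    assert np.allclose(h, np.hamming(M))  # matches numpy's Hamming window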
[0171] When a = 0.46, the effect of the Hamming window can be as shown in Figure 2f. In addition, after applying the Hamming window to the filtered data in Figure 2e, the obtained short-term energy distribution of the filtered data can be as shown in Figure 2g (if the filtered data is the first filtered data, Figure 2g shows the first distribution information; if the filtered data is the second filtered data, Figure 2g shows the second distribution information).
[0172] It should be noted that after the Hamming window is applied, the data in the middle of the window is reflected while the data on both sides is attenuated and lost. Therefore, when doing the convolution, the window can only be moved by 1/3 or 1/2 of its length at a time; in this way, the data lost by the previous frame or the previous two frames is reflected in the window again, which avoids data loss.
[0173] In addition, it should be noted that after the first distribution information corresponding to the original audio file is obtained, the first distribution information can be saved. In this way, if the similarity between another user audio file and the original audio file needs to be calculated later, the saved first distribution information can be called directly without recalculation, which reduces the occupation of computing resources and improves computing efficiency.
[0174] 205. The server performs normalization processing on the second audio data to obtain the second processed data, and then executes step 206.
[0175] For example, it can be as follows:
[0176] (1) The server samples the second audio data to obtain a second set of sampling points.
[0177] Similar to the sampling of the first audio data, the second audio data can be sampled in various ways. For example, a signed number can be read every 16 bits as one sampling point, and the obtained sampling points added to the same set to obtain the second sampling point set. For details, refer to Figure 2c and step 202, which are not repeated here.
[0178] (2) Normalize all the sampling points in the second sampling point set to obtain the second processed data.
[0179] For example, the absolute maximum value (also called the maximum value, i.e., max-value) over all sampling points in the second sampling point set can be calculated, and then, according to this absolute maximum value, the amplitudes of all sampling points in the second sampling point set adjusted into the preset interval to obtain the second processed data.
[0180] The preset interval can be set according to actual application requirements. For example, taking the preset interval as [-1, 1], the following formula can be used to normalize all sampling points in the second sampling point set:
[0181] x_t(i) = x(i) / max_value
[0182] where x_t(i) is the normalized amplitude of the i-th sampling point, whose range is [-1, 1], and x(i) is the original amplitude of the i-th sampling point, whose range is generally [-32768, 32767].
[0183] After adjusting the amplitudes of all sampling points in the second sampling point set according to the above normalization formula, the second processed data is obtained. For details, refer to Figure 2d and step 202, which are not repeated here.
[0184] It should be noted that, since the normalization processing formula used here is the same as that in step 202, in this step, x(n) is also used to represent the second processed data.
[0185] 206. The server uses a high-pass filter to filter the second processed data to obtain second filtered data.
[0186] For example, the filtering method is similar to that used for the first processed data (see step 203); that is, a first-order high-pass filter, for example a 6 dB/octave first-order high-pass filter, can be used directly to filter the second processed data to obtain the second filtered data.
[0187] Since the average power spectrum of a speech signal (such as the second processed data) is affected by glottal excitation and radiation from the mouth and nose, the high-frequency end is attenuated by 6 dB/octave above 800 Hz after the speech signal is radiated from the lips. Therefore, a first-order high-pass filter (such as a 6 dB/octave first-order high-pass filter) can be used to pre-emphasize the second processed data (which weakens the low frequencies and flattens the signal spectrum for subsequent spectrum analysis and channel parameter analysis), and this high-pass filter, such as a 6 dB/octave first-order high-pass filter, used to filter the pre-emphasized second processed data to obtain the second filtered data. The formula is expressed as:
[0188] y(n)=1.0*x(n)-u*x(n-1)
[0189] Wherein, in this step, y(n) is the second filtered data, x(n) is the second processed data, and u is the pre-emphasis coefficient. The value of u can be determined according to the actual application requirements. The value range of u is [0.9, 1.0], for example, it can be 0.9375, etc. For details, please refer to Figure 2e , and will not be repeated here.
[0190] Wherein, steps 202 and 205 may be executed in no particular order.
[0191] 207. The server determines the short-term energy distribution of the second filtered data, and obtains second distribution information.
[0192] Similar to the processing of the first filtered data, since the second filtered data is very long and difficult to process at one time, the second filtered data can be processed in segments. For example, the second filtered data may be segmented, the short-term energy distribution of each segment determined separately, and the short-term energy distributions of all segments aggregated to obtain the second distribution information, and so on.
[0193] Optionally, in order to make the data obtained by segmentation have obvious periodicity and facilitate subsequent convolution, a Hamming window can be used to perform segmentation, wherein the data in one window represents one cycle. That is, the step "the server determines the short-term energy distribution of the second filtered data, and obtains the second distribution information" can be specifically as follows:
[0194] The server obtains the Hamming window function, performs a dot product operation on the second filtered data, and convolves the result obtained by the operation with the Hamming window function to obtain second distribution information. The formula is expressed as:
[0195] E_n = sum_m y^2(m) * h(n - m)
[0196] where y(n) is the second filtered data and h(n-m) is the Hamming window function (referred to as the Hamming window); the detailed analysis of this formula is given in step 204 and is not repeated here.
[0197] It should be noted that, similar to the processing of the first filtered data, after the Hamming window is applied, the window can only be moved by 1/3 or 1/2 of its length at a time during convolution in order to avoid data loss, so that the data lost by the previous frame or the previous two frames is reflected in the window again.
[0198] 208. The server removes the first and last silence segments of the first distribution information to obtain first valid distribution information; and removes the first and last silence segments of the second distribution information to obtain second valid distribution information.
[0199] The first and last silence segments are the sampling points at the head and tail of the audio data whose energy values are lower than a preset threshold. The preset threshold can be set according to actual application requirements; for example, sampling points whose energy value is lower than 0.025 can be treated as mute, and the beginning and end of the audio scanned against this threshold to remove the leading and trailing silence segments and obtain an effective short-term energy distribution. For example, referring to Figure 2h, the part marked by the rectangular box in the figure is the silence segment, which can be removed. Removing the first and last silence segments of the first distribution information yields the first effective distribution information, for example as shown in Figure 2I; in the same way, removing the first and last silence segments of the second distribution information yields the second effective distribution information, for example as shown in Figure 2J.
[0200] 209. The server calculates the cosine similarity between the first effective distribution information and the second effective distribution information to obtain the similarity between the first audio data and the second audio data.
[0201] It should be noted that, since the lengths of the first audio data and the second audio data may differ, in order to facilitate the subsequent calculation of the cosine similarity between the first distribution information and the second distribution information, zeros may be appended to the end of the shorter of the two so that the numbers of sampling points of the first audio data and the second audio data are kept consistent.
[0202] Among them, the cosine similarity formula is as follows:
[0203] Similarity = cos(theta) = (A · B) / (||A|| * ||B||)
[0204] where A is the vector of the short-term energy distribution of the first audio data, that is, the vector of the first distribution information; B is the vector of the short-term energy distribution of the second audio data, that is, the vector of the second distribution information; and Similarity is the similarity between the first audio data and the second audio data, which in the embodiments of the present invention mainly refers to the similarity in intonation of the two pieces of audio data (ignoring the interference of timbre).
[0205] Optionally, after the similarity between the first audio data and the second audio data is obtained, further processing can be performed according to actual application requirements, for example scoring the second audio file (i.e., the user audio file) based on the similarity. Taking the game dubbing in step 201 as an example, see Figure 2k: when the user triggers "click to record" and completes the recording, the terminal can calculate the similarity between the recording (i.e., the user audio file) and the original audio file in the background. During the calculation, the terminal interface can display the calculation progress (such as 62%) and prompt information, such as "intensive calculation...", to remind the user to wait. After the similarity between the user audio file and the original audio file is obtained, the similarity can be displayed on the interface. Optionally, a corresponding score may also be calculated based on the similarity; the specific scoring standard may be determined according to the actual application scenario, which is not repeated here.
[0206] As can be seen from the above, in this embodiment the first audio data and the second audio data can be extracted from the original audio file and the user audio file respectively, the two pieces of audio data subjected to normalization processing and high-pass filtering respectively, their short-term energy distributions determined respectively, and the similarity between the first audio data and the second audio data calculated based on the obtained short-term energy distributions. Because the short-term energy of various kinds of audio data, such as songs or speech signals, changes noticeably over time, and short-term energy effectively reflects the magnitude of the signal amplitude, sound versus silence, and so on, the similarity of two pieces of audio data can be calculated effectively by this scheme even when the audio data is a speech signal. Therefore, compared with the existing scheme, this scheme can not only calculate the similarity effectively and accurately but can also be applied to most application scenarios, which greatly improves its applicability.
Example Embodiment
[0207] Embodiment 3.
[0208] In order to better implement the above method, an embodiment of the present invention further provides an apparatus for determining audio similarity, and the apparatus for determining audio similarity may specifically be integrated in a server or other equipment.
[0209] For example, as shown in Figure 3a, the apparatus for determining audio similarity may include an acquiring unit 301, a filtering unit 302, a determining unit 303, and a calculating unit 304, as follows:
[0210] (1) Acquisition unit 301;
[0211] The acquiring unit 301 is configured to acquire first audio data and second audio data.
[0212] For example, the obtaining unit 301 can be specifically configured to obtain a first audio file and extract the first audio data from the first audio file, and to obtain a second audio file and extract the second audio data from the second audio file, and so on.
[0213] Optionally, in order to reduce interference, reduce differences between audio files caused by interference, and improve the accuracy of the calculation, the audio files can be transcoded and their parameter formats unified when extracting the audio data; that is:
[0214] The obtaining unit 301 may be specifically configured to obtain the first audio file, transcode the first audio file according to a preset transcoding strategy, set preset parameters in the transcoded first audio file according to a preset parameter setting rule, and extract the first audio data from the first audio file after the parameters are set.
[0215] And the obtaining unit 301 may be specifically configured to obtain the second audio file, transcode the second audio file according to the preset transcoding strategy, set preset parameters in the transcoded second audio file according to the preset parameter setting rule, and extract the second audio data from the second audio file after the parameters are set.
[0216] The preset transcoding strategy and preset parameter setting rules can be set according to actual application requirements. For example, the audio files can be converted into the uncompressed wav format, and the parameters set to: a sampling frequency of 44100 Hz, a bit rate of 96k, and a mono channel, and so on.
[0217] (2) filtering unit 302;
[0218] The filtering unit 302 is used to perform normalization processing and high-pass filtering on the first audio data and the second audio data, respectively, to obtain the first filtered data corresponding to the first audio data and the second filtered data corresponding to the second audio data. data.
[0219] For example, the filtering unit 302 may include a sampling subunit, a normalizing subunit, and a filtering subunit, which may be specifically as follows:
[0220] The sampling subunit can be used to sample the first audio data and the second audio data respectively to obtain a first sampling point set corresponding to the first audio data and a second sampling point set corresponding to the second audio data.
[0221] For example, the sampling subunit may be specifically used to sample the first audio data to obtain the first set of sampling points, and to sample the second audio data to obtain the second set of sampling points.
[0222] The sampling method can be determined according to actual application requirements. For example, the first audio data can be sampled by reading a signed number every 16 bits as one sampling point, and the obtained sampling points added to the same set to obtain the first sampling point set; similarly, the second audio data can be sampled by reading a signed number every 16 bits as one sampling point, and the obtained sampling points added to the same set to obtain the second sampling point set, and so on. It should be noted that the sampling manner of the first audio data and that of the second audio data should be consistent.
[0223] The normalization subunit can be used to perform normalization processing on all sampling points in the first sampling point set and all sampling points in the second sampling point set, respectively, to obtain the first processed data corresponding to the first sampling point set, and the second processed data corresponding to the second sampling point set.
[0224] For example, the normalization subunit can be specifically used to calculate the absolute maximum value (max-value) over all sampling points in the first sampling point set and, according to this absolute maximum value, adjust the amplitudes of all sampling points in the first sampling point set into the preset interval to obtain the first processed data; and to calculate the absolute maximum value (max-value) over all sampling points in the second sampling point set and, according to this absolute maximum value, adjust the amplitudes of all sampling points in the second sampling point set into the preset interval to obtain the second processed data.
[0225] Wherein, the preset interval can be set according to actual application requirements, for example, the preset interval can be specifically set as [-1, 1], see the foregoing method embodiments for details, and will not be repeated here.
[0226] The filtering subunit can be used to filter the first processed data and the second processed data respectively by using a high-pass filter, so as to obtain the first filtered data corresponding to the first audio data and the second filtered data corresponding to the second audio data.
[0227] For example, the filtering subunit can be specifically used to pre-emphasize the first processed data and use a high-pass filter to filter the pre-emphasized first processed data to obtain the first filtered data; and to pre-emphasize the second processed data and use a high-pass filter to filter the pre-emphasized second processed data to obtain the second filtered data corresponding to the second audio data, and so on.
[0228] (3) determining unit 303;
[0229] The determining unit 303 is used to determine the short-term energy distribution of the first filtered data and the second filtered data respectively, and obtain first distribution information corresponding to the first filtered data and second distribution information corresponding to the second filtered data .
[0230] For example, the determining unit can be specifically used to obtain a Hamming window function; perform a dot-multiplication operation on the first filtered data and convolve the result of the operation (that is, the result obtained by performing the dot-multiplication operation on the first filtered data) with the Hamming window function to obtain the first distribution information corresponding to the first filtered data; and perform a dot-multiplication operation on the second filtered data and convolve the result of the operation (that is, the result obtained by performing the dot-multiplication operation on the second filtered data) with the Hamming window function to obtain the second distribution information corresponding to the second filtered data.
[0231] It should be noted that after the Hamming window is applied, in order to avoid data loss, the window can only be moved by 1/3 or 1/2 of its length at a time during convolution, so that the data lost by the previous frame or the previous two frames is reflected in the window again.
[0232] (4) calculation unit 304;
[0233] The calculation unit 304 is configured to calculate the similarity between the first audio data and the second audio data based on the first distribution information and the second distribution information.
[0234] Since the first distribution information and the second distribution information are both data matrices, the similarity between the first audio data and the second audio data can be obtained by calculating the cosine similarity between the two data matrices, namely:
[0235] The calculating unit 304 may be specifically configured to calculate the cosine similarity between the first distribution information and the second distribution information, and obtain the similarity between the first audio data and the second audio data.
[0236] It should be noted that, since the lengths of the first audio data and the second audio data may differ, in order to facilitate the subsequent calculation of the cosine similarity between the first distribution information and the second distribution information, zeros may be appended to the end of the shorter of the two so that the numbers of sampling points of the first audio data and the second audio data are kept consistent.
[0237] Optionally, since most recordings generally have a long silence segment at the beginning and/or end, and this silence segment is of little significance for calculating similarity, the silence segments can be removed before calculation in order to reduce the amount of computation and improve efficiency; that is, optionally, as shown in Figure 3b, the apparatus for determining audio similarity may further include an intercepting unit 305, as follows:
[0238] The intercepting unit 305 is configured to remove the first and last silence segments of the first distribution information to obtain first valid distribution information, and remove the first and last silence segments of the second distribution information to obtain second valid distribution information.
[0239] At this time, the calculation unit 304 may be specifically configured to calculate the cosine similarity between the first effective distribution information and the second effective distribution information, and obtain the similarity between the first audio data and the second audio data.
[0240] The first and last silence segments are the sampling points at the head and tail of the audio data whose energy values are lower than a preset threshold. The preset threshold can be set according to actual application requirements; for example, sampling points whose energy value is lower than 0.025 can be treated as mute, and the beginning and end of the audio scanned against this threshold to remove the leading and trailing silence segments and obtain an effective short-term energy distribution, which is not repeated here.
[0241] During specific implementation, the above units may be implemented as independent entities, or may be arbitrarily combined to be implemented as the same or several entities, see the foregoing method embodiments for details, and will not be repeated here.
[0242] As can be seen from the above, in the apparatus for determining audio similarity of this embodiment, the filtering unit 302 can perform normalization processing and high-pass filtering on the first audio data and the second audio data respectively, the determining unit 303 then determines their short-term energy distributions respectively, and the calculating unit 304 calculates the similarity between the first audio data and the second audio data based on the obtained short-term energy distributions. Because the short-term energy of various kinds of audio data, such as songs or speech signals, changes noticeably over time, and short-term energy effectively reflects the magnitude of the signal amplitude, sound versus silence, and so on, the similarity of two pieces of audio data can be calculated effectively by this scheme even when the audio data is a speech signal. Therefore, compared with the existing scheme, this scheme can not only calculate the similarity effectively and accurately but can also be applied to most application scenarios, which greatly improves its applicability.