Audio similarity determination method and apparatus, and storage medium

A technology for determining methods and similarities, applied in the field of communication, can solve problems such as inapplicability, inability to extract MIDI feature files, and narrow applicability of existing solutions, so as to achieve the effect of improving applicability

Pending Publication Date: 2018-05-11
TENCENT TECH (SHENZHEN) CO LTD
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] During the research and practice of the prior art, the inventors of the present invention found that since the MIDI feature file mainly shows the pitch and frequency of the audio at each sampling point, therefore, for the song, the MIDI feature will be more obv

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio similarity determination method and apparatus, and storage medium
  • Audio similarity determination method and apparatus, and storage medium
  • Audio similarity determination method and apparatus, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0041] Embodiment 1.

[0042] In this embodiment, description will be made from the perspective of an apparatus for determining audio similarity, which may specifically be integrated in a server or other equipment.

[0043] A method for determining audio similarity, comprising: acquiring first audio data and second audio data; respectively performing normalization processing and high-pass filtering on the first audio data and the second audio data to obtain the first audio data corresponding to the first audio data. First filtered data, second filtered data corresponding to the second audio data, and determining the short-term energy distribution of the first filtered data and the second filtered data respectively, to obtain first distribution information corresponding to the first filtered data , and second distribution information corresponding to the second filtered data; calculating the similarity between the first audio data and the second audio data based on the first di...

Example Embodiment

[0130] Embodiment two,

[0131] According to the methods implemented in the previous embodiments, the following examples will be used for further detailed description.

[0132] In this embodiment, the device for determining the audio similarity is specifically integrated in the server, the first audio file is an original audio file (ie, an original dubbing file), and the second audio file is a user audio file as an example for description.

[0133] like Figure 2a As shown, a method for determining audio similarity, the specific process can be as follows:

[0134] 201. The server acquires an original audio file, extracts first audio data from the original audio file, and acquires a user audio file, and extracts second audio data from the user audio file.

[0135]For example, after obtaining the original audio file, the server can transcode the original audio file according to a preset transcoding strategy, such as converting the original audio file into an uncompressed wav f...

Example Embodiment

[0207] Embodiment three,

[0208] In order to better implement the above method, an embodiment of the present invention further provides an apparatus for determining audio similarity, and the apparatus for determining audio similarity may specifically be integrated in a server or other equipment.

[0209] For example, as Figure 3a As shown, the apparatus for determining audio similarity may include an obtaining unit 301, a first processing unit 302, a second processing unit 303, and a calculating unit 304, as follows:

[0210] (1) Acquisition unit 301;

[0211] The acquiring unit 301 is configured to acquire first audio data and second audio data.

[0212] For example, the obtaining unit 301 can be specifically configured to obtain a first audio file, extract the first audio data from the first audio file, and obtain a second audio file, and extract the second audio data from the second audio file, and many more.

[0213] Optionally, in order to reduce interference, reduc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the present invention disclose an audio similarity determination method and apparatus, and a storage medium. The method may comprises: performing normalization processing and high-passfiltering on first audio data and second audio data respectively, determining short-time energy distribution of the first audio data and the second audio data respectively, and calculating the similarity between the first audio data and the second audio data based on the obtained short-time energy distribution. The scheme of the embodiments of the present invention not only can effectively and accurately calculate the similarity but also can be applied to most application scenarios, and the applicability of the scheme is improved.

Description

technical field [0001] The present invention relates to the field of communication technology, in particular to a method, device and storage medium for determining audio similarity. Background technique [0002] Audio data refers to digitized sound data, and audio similarity here refers to the similarity in intonation and tone of two pieces of audio data. Based on the audio similarity, people can perform some preset processing on the audio data, such as judging whether the dubbing is appropriate, whether the imitation is in place, whether the song is out of tune, and so on. [0003] In the prior art, the preset model is generally used to extract the musical instrument digital interface (MIDI, Musical Instrument Digital Interface) feature file of the audio data from the two audio files to be compared. For example, a certain algorithm can be used first Extract the MIDI feature file of the original audio file. After the user uploads a recording, extract the MIDI feature file o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/683
Inventor 徐勇
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products