Audio key point determination method, apparatus and device, and storage medium

An audio key point determination technique, applied in the field of audio data processing, that addresses the problems of missing key information and inaccurate audio key points and achieves more accurate determination of audio key points.

Pending Publication Date: 2020-07-24
BEIJING BYTEDANCE NETWORK TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] Existing audio key point extraction methods usually include an extraction method based on audio drum points and an extraction method based on audio rhythm points. Among them, the extraction method based on audio drum points will miss a lot of key...



Examples


Embodiment 1

[0031] Figure 1 is a flow chart of a method for determining audio key points provided by Embodiment 1 of the present disclosure. This embodiment is applicable to situations where the position of a key point in audio needs to be determined. The method can be executed by an audio key point determination device, which can be implemented in software and/or hardware and can be configured in a computer device. As shown in Figure 1, the method may include the following steps:

[0032] S110. Determine the position of a feature point of the target audio, where the feature point includes a drum point and a rhythm point.

[0033] In this embodiment, the audio may be a file storing sound content, wherein the sound content may be a sound wave with a frequency between 20 Hz and 20 kHz that can be heard by the human ear, and its essential content may include sound intensity and time information. In this embodiment, the target audio may be an audio signal generated after...

Embodiment 2

[0053] Figure 2 is a flow chart of an audio key point determination method provided in Embodiment 2 of the present disclosure. This embodiment can be combined with the optional solutions in one or more of the above embodiments. In this embodiment, determining the drum point position of the target audio includes:

[0054] Starting from the starting point of the target audio, sequentially calculate the audio difference of each preset audio unit in the complex frequency domain, wherein a preset audio unit is an audio segment made up of a first preset number of audio frames in the target audio;

[0055] Determine the sound intensity of each preset audio unit according to the audio difference;

[0056] Determine the drum point position based on the sound intensity of each preset audio unit.
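
The three steps above can be illustrated with a minimal Python sketch. It is not the method of the disclosure, whose details are not given in this excerpt. It assumes that the "audio difference" is the summed magnitude of the difference between the complex spectra of consecutive frames, that the intensity of a unit is the sum of its frame differences, and that a drum point is a unit whose intensity is a local peak above a threshold. The names `dft`, `unit_intensities`, `drum_point_units`, `frame_size`, `frames_per_unit`, and `threshold` are all hypothetical.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one frame (O(n^2), fine for a demo)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n // 2 + 1)
    ]


def unit_intensities(samples, frame_size=8, frames_per_unit=2):
    """Return one intensity value per unit of `frames_per_unit` frames.

    Assumption: the intensity of a unit is the sum of the complex-domain
    differences of its frames. The source does not specify this formula.
    """
    frames = [
        samples[i:i + frame_size]
        for i in range(0, len(samples) - frame_size + 1, frame_size)
    ]
    spectra = [dft(f) for f in frames]
    # Difference between each frame's complex spectrum and the previous one.
    # The first frame has no predecessor, so its difference is set to zero.
    diffs = [0.0]
    for prev, cur in zip(spectra, spectra[1:]):
        diffs.append(sum(abs(c - p) for c, p in zip(cur, prev)))
    # Sum the per-frame differences inside each complete unit.
    return [
        sum(diffs[i:i + frames_per_unit])
        for i in range(0, len(diffs) - frames_per_unit + 1, frames_per_unit)
    ]


def drum_point_units(intensities, threshold):
    """Indices of units whose intensity is a local peak above `threshold`."""
    points = []
    for i, value in enumerate(intensities):
        left = intensities[i - 1] if i > 0 else 0.0
        right = intensities[i + 1] if i + 1 < len(intensities) else 0.0
        if value > threshold and value >= left and value > right:
            points.append(i)
    return points


# Four silent frames, one frame with a single impulse, one silent frame.
samples = [0.0] * 32 + [1.0] + [0.0] * 15
intensities = unit_intensities(samples)
print(drum_point_units(intensities, threshold=1.0))
```

In this toy signal only the last unit contains a sudden spectral change, so it is the only one reported. A practical implementation would use an FFT library and overlapping, windowed frames rather than the naive transform shown here.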

[0057] And determining the rhythm point position of the target audio includes:

[0058] The target audio is input into the target convolutional neural network, and the position of t...

Embodiment 3

[0085] Figure 3 is a schematic structural diagram of an audio key point determination device provided in Embodiment 3 of the present disclosure. This embodiment is applicable to situations where the positions of key points in audio need to be determined. The device can be implemented in software and/or hardware and can be configured in a computer device. As shown in Figure 3, the device may include:

[0086] A feature point position determination module 310, configured to determine the feature point position of the target audio, where the feature points include a drum point and a rhythm point;

[0087] A sound intensity determination module 320, configured to determine the sound intensity corresponding to the feature point;

[0088] The key point determination module 330 is configured to determine key points of the target audio based on preset key point determination rules and combined with sound intensity.
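
As a rough illustration of how the three modules could fit together, the following Python sketch chains stand-ins for modules 310, 320, and 330. The preset key point determination rule is not spelled out in this excerpt, so the rule used here (keep every feature point whose intensity reaches a minimum value) is purely an assumption. The names `FeaturePoint`, `merge_feature_points`, `intensity_at`, `select_key_points`, `envelope`, `hop_s`, and `min_intensity` are also hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeaturePoint:
    time_s: float  # position in the audio, in seconds
    kind: str      # "drum" or "rhythm"


def merge_feature_points(drum_times, rhythm_times):
    """Stand-in for module 310: merge both kinds of feature point by time."""
    points = [FeaturePoint(t, "drum") for t in drum_times]
    points += [FeaturePoint(t, "rhythm") for t in rhythm_times]
    return sorted(points, key=lambda p: p.time_s)


def intensity_at(point, envelope, hop_s):
    """Stand-in for module 320: read an intensity envelope at the point.

    `envelope` holds one intensity value per `hop_s` seconds of audio.
    """
    index = min(int(point.time_s / hop_s), len(envelope) - 1)
    return envelope[index]


def select_key_points(points, envelope, hop_s, min_intensity):
    """Stand-in for module 330, using an assumed rule: keep loud points."""
    return [
        p for p in points
        if intensity_at(p, envelope, hop_s) >= min_intensity
    ]


points = merge_feature_points(drum_times=[0.5, 2.0], rhythm_times=[1.0, 3.0])
envelope = [0.1, 0.9, 0.2, 0.8]  # made-up intensities, one per second
for p in select_key_points(points, envelope, hop_s=1.0, min_intensity=0.5):
    print(p.time_s, p.kind)
```

Keeping the rule in its own function means a different preset rule can be swapped in without touching feature point detection or intensity lookup.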

[0089] An audio key point determination device provi...


Abstract

The embodiments of the invention disclose an audio key point determination method, apparatus, device, and storage medium. The method comprises: determining the feature point positions of a target audio, where the feature points comprise drum points and rhythm points; determining the sound intensity corresponding to each feature point; and determining the key points of the target audio based on a preset key point determination rule in combination with the sound intensity. By using both the drum points and the rhythm points of the target audio as feature points and combining them with the sound intensity of each feature point, the technical solution overcomes the inaccuracy that arises when key points are determined from only a single type of feature point, and determines the key points of the audio more accurately.

Description

Technical field

[0001] The embodiments of the present disclosure relate to the technical field of audio data processing, and in particular to a method, apparatus, device, and storage medium for determining audio key points.

Background

[0002] Audio is a common form of multimedia on the Internet. Extracting feature points from audio to serve as its key points is a common scenario in short video applications.

[0003] Existing audio key point extraction methods usually include an extraction method based on audio drum points and an extraction method based on audio rhythm points. The method based on drum points misses a lot of key information when the audio is very soothing, which leads to inaccurate key points. The method based on rhythm points produces very evenly spaced key points, which also makes the key points seem inaccurate or missing.

Contents of the invention

[0004] Embodi...


Application Information

IPC(8): G06F16/683, G06F16/432, G06N3/04
CPC: G06F16/683, G06F16/433, G06N3/045
Inventor: 杨旭静, 靳潇杰
Owner: BEIJING BYTEDANCE NETWORK TECH CO LTD