Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semi-supervised music main melody extraction method

An extraction method and main melody technology, which is applied in the field of semi-supervised music main melody extraction, can solve problems such as performance degradation, poor generalization, and limited performance, and achieve high generalization effects

Active Publication Date: 2020-06-23
DALIAN MARITIME UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002]The features used in the existing main melody extraction methods are divided into two categories: the first category is low-level artificial features, which are set by researchers according to the characteristics of the signal. The performance of complex music with multiple sound sources drops significantly, and its generalization is poor; the second type of features is advanced self-learning features, which are obtained by the algorithm on the basis of data sets, and its performance depends heavily on the capacity of the training set and diversity
Recently, deep learning has provided a new solution for the extraction of the main theme of music, but it takes a long time to train to obtain appropriate network parameters, and the capacity and diversity of the training set are still important factors restricting its performance.
However, existing annotated datasets for theme extraction are still lacking, which limits the performance of such methods.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised music main melody extraction method
  • Semi-supervised music main melody extraction method
  • Semi-supervised music main melody extraction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0035] Collect a certain period of music audio with a duration of 8.2 seconds, and its time domain waveform is as follows image 3 shown. This segment of audio is analyzed by CQT spectrum, and its CQT amplitude spectrum is as follows Figure 4 shown. Based on the network parameters of the trained semi-supervised extreme learning machine, the melody pitch of the audio signal is predicted, and the obtained prediction matrix is ​​as follows: Figure 5 shown. Take the maximum value of each column of the prediction matrix to obtain the rough estimation result of the melody pitch, and then perform spectral peak search in the roughly estimated 2 / 3 semitone range of each frame to obtain the final melody pitch estimation, the estimated value and True value as Figure 6 shown. It can be seen from this embodiment that the proposed method can output more accurate melody pitch estimation results.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semi-supervised music main melody extraction method, which comprises the following steps of: performing normalization, resampling and filtering preprocessing on an input audio signal to obtain an audio signal for simulating auditory characteristics of a human ear; carrying out constant Q spectrum transformation on the audio signal to obtain a variable resolution frequencyspectrum signal with logarithmically distributed frequency, aggregating adjacent multiple frames of amplitude spectrums to obtain a feature vector, constructing an input vector set of an extreme learning machine according to the feature vector, and obtaining an output vector set according to the training set; carrying out parameter training on the extreme learning machine, and carrying out melodypitch coarse estimation by utilizing an extreme learning machine network; searching for a spectrum peak within a rough estimated 2 / 3 half-tone range of the melody pitch of each frame, taking the frequency corresponding to the spectrum peak as the melody pitch of the frame to be output, and finely adjusting the melody pitch.

Description

technical field [0001] The invention belongs to the field of audio signal processing, in particular to a semi-supervised music main melody extraction method. Background technique [0002] The features used in the existing main melody extraction methods are divided into two categories: the first category is low-level artificial features, which are set by researchers based on signal characteristics, and their performance is significantly reduced when dealing with complex music with multiple sound sources. The second type of features is advanced self-learning features, which are obtained by the algorithm on the basis of the data set, and its performance depends heavily on the capacity and diversity of the training set. Recently, deep learning has provided a new solution for music main melody extraction, but it takes a long time to train to obtain appropriate network parameters, and the capacity and diversity of the training set are still important factors restricting its perfor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L19/02G10L19/032G10L19/26G10L25/30G10L25/90
CPCG10L19/265G10L19/0212G10L19/032G10L25/30G10L25/90
Inventor 张维维毕胜房少军
Owner DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products