Human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction

An extraction method and a technique of notation, applied in the field of vocal melody extraction, can solve the problems of inability to obtain lyrics and pitch, difficult to obtain lyric information, and inability to extract melody, etc.

Active Publication Date: 2020-06-23
成都潜在人工智能科技有限公司
View PDF10 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The goal of this method is to extract the main melody track from multiple audio tracks, but it cannot extract the melody from the main melody track. At the same time, it is difficult for this method to obtain lyric information containing sub-track information.
Unable to get matching libretto and pitch

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction
  • Human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction
  • Human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with accompanying drawing:

[0042] Such as figure 1 As shown, a human voice melody extraction method based on numbered musical notation recognition and fundamental frequency extraction includes the following steps:

[0043] S1: Data preprocessing, binarize the numbered notation file corresponding to the song to be processed, process the original audio file of the song into down-sampled mono audio, and separate the human voice from the down-sampled mono audio Waveform; specifically includes:

[0044] S101: Decode the original audio file of the song into wave format, and normalize it to -1~1;

[0045] S102: averaging the audio in wave format to obtain mono audio;

[0046] S103: down-sampling the monophonic audio to between 8000 and 44100;

[0047] S104: Binarize the numbered musical notation file corresponding to the song;

[0048] S105: separate the vocal waveform from the down-sampled monophonic ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction, and the system applies the method, and the method comprises the steps: carrying out the binarization of a numbered musical notation file corresponding to a to-be-processed song, processing an original audio file of the song into downsampledsingle-track audio, and separating a human voice waveform from the single-track audio; identifying notes and lyric pairs in the numbered musical notation to obtain a list of lyrics and notes; retrieving a list of lyrics and notes according to the libretto file to obtain a matching result sequence of libretto and notes; selecting a note, calculating the fundamental frequency of the note according to the separated human voice waveform, calculating the frequency of each note according to the calculated fundamental frequency and the relative relation of the notes, and converting the frequency of each note into midi pitch; and translating the matching result sequence of the row lyrics and the notes to obtain a matching result sequence of the row lyrics and the notes of which the pitches are matched with the midi pitches of the notes. The human voice melody with the pitch matched with the melody can be extracted.

Description

technical field [0001] The invention belongs to the technical field of audio processing, and in particular relates to a human voice melody extraction method and system based on numbered spectrum recognition and fundamental frequency extraction. Background technique [0002] With the development of computer technology, the main way of dissemination of music has changed from the original carrier based on tapes and CDs to the network download and click based on digital music. In order to adapt to the change of the way of transmission, music identification and retrieval technology is also applied more and more widely. In music information retrieval, the main theme of music is mainly used, and the main theme of music can be used for music analysis, music retrieval, music identification, similar music recommendation, etc. [0003] The invention patent with application number 201810537265.3 discloses a method, device, terminal and storage medium for extracting the main melody trac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/48G10L25/51
CPCG10L25/48G10L25/51G10H2210/056G10H2210/061Y02D30/70
Inventor 尹学渊刘鑫忠江天宇
Owner 成都潜在人工智能科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products