End-to-end music score note identification method based on deep learning

A deep learning and recognition method technology, applied in the field of optical score recognition, can solve the problems of unrecognizable note pitch and time value, low recognition accuracy, etc., and achieve the effect of improving generalization ability

Inactive Publication Date: 2020-02-28
BEIJING UNIV OF TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current target detection method based on deep learning cannot recognize the pitch and duration of notes, and the sequence recognition method has problems such as low recognition accuracy when processing multi-voice scores.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • End-to-end music score note identification method based on deep learning
  • End-to-end music score note identification method based on deep learning
  • End-to-end music score note identification method based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The corpus of the present invention consists of 10,000 MusicXML files, which are downloaded from the MuseScore dataset, after which a dataset containing musical notation images and corresponding labels is created from the corpus. The whole process is divided into two stages: download MusicXML from MuseScore and convert it into a vector image (svg) file; parse the svg to obtain the bounding box, duration and pitch of the symbol. These data are divided into three distinct subsets. 60% is used for training, 15% for validation and 25% for evaluating the model.

[0029] Data enhancement: For each selected score image, the entire score image is cropped into 4 images a, b, c, and d to amplify the data set, so that the total amount of data is expanded by 4 times. After that, four data enhancement methods including fuzzy, elastic transformation, color transformation and affine transformation are used to process the cropped score image and input to the neural network model.

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an end-to-end music score note identification method based on deep learning. The method is divided into three steps in total: (1) data preprocessing: downloading a corresponding data set from MuseScore, and recoding pitch and value labels; and (2) data enhancement: performing data enhancement on the recoded music score data; the invention providing four different enhancement methods; and (3) an end-to-end model: using a deep convolutional neural network model for end-to-end music score note identification, inputting the enhanced data into the model, and the output of the model being the note value and pitch. The music score note recognition model based on deep learning is provided for the printed music score, namely, the whole music score image is input into the model, the value and pitch of notes on the music score are directly output, the model is completely end-to-end, and the multi-voicemusic score image can be accurately identified.

Description

technical field [0001] The invention belongs to the field of optical music score recognition, and is an end-to-end neural network recognition method based on deep learning, which can be applied to musical note recognition. Background technique [0002] Optical Score Recognition is the musical application of Optical Character Recognition to recognize musical scores into editable or playable forms such as MIDI (for playback) and MusicXML (for page layout). Compared with other symbols in the music score, the musical note occupies a very high proportion. It is used to record the pitch and time value, and has important semantic information. Therefore, note recognition is the core and key of score recognition. The shape of musical notes is ever-changing, and its diversity and polymorphism characteristics determine that musical notes are difficult to identify. The traditional note recognition method needs to delete the staff in advance, then extract the primitive symbols, and com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V2201/07G06N3/048G06N3/045G06F18/214
Inventor 黄志清贾翔王师凯张煜森
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products