The invention discloses an electronic pronunciation-assistance system for vocal music learning. The system comprises a mouth shape and tongue position image acquisition module, an audio acquisition module, a data processing module, a pronunciation standard assessment module, a mouth shape standard assessment module, a pitch extraction module, a beat extraction module, a note pitch and note duration model construction module, a primary singing ability assessment module, a comprehensive assessment module, a pronunciation guidance module, a training scheme generation module, and a central processor. Data acquisition and assessment for the entire vocal ability detection process are performed by a computer system, giving a high degree of automation and broad coverage. In a single pass, the system detects and analyzes throat position and stability, timbre, the penetrating power of the voice, the use of overtones, and how well pronunciation is sustained during singing, and from these results it produces a targeted training scheme.
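As an illustration of what a pitch extraction module might compute, the following is a minimal sketch only. The abstract does not state which algorithm the system uses; autocorrelation is assumed here as a common textbook technique, and the function names `estimate_pitch_hz` and `hz_to_midi`, the 80 to 1000 Hz search range, and the MIDI note mapping are all hypothetical choices made for this example.

```python
import math


def estimate_pitch_hz(samples, sample_rate, fmin=80.0, fmax=1000.0):
    """Estimate the fundamental frequency of one audio frame.

    Assumed technique: pick the lag with the largest autocorrelation
    inside the search range [fmin, fmax]. Returns None when no
    positive correlation is found (for example, on silence).
    """
    n = len(samples)
    if n == 0:
        return None
    # Remove the DC offset so a constant bias does not dominate.
    mean = sum(samples) / n
    x = [s - mean for s in samples]

    min_lag = max(1, int(sample_rate / fmax))
    max_lag = min(int(sample_rate / fmin), n - 1)

    best_lag, best_score = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        score = sum(x[i] * x[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score

    if best_lag is None:
        return None
    return sample_rate / best_lag


def hz_to_midi(freq_hz):
    """Map a frequency to the nearest MIDI note number (A4 = 440 Hz = 69)."""
    return round(69 + 12 * math.log2(freq_hz / 440.0))


if __name__ == "__main__":
    # Synthetic test tone standing in for a sung note (assumption: 220 Hz).
    sr = 8000
    tone = [math.sin(2 * math.pi * 220 * i / sr) for i in range(800)]
    f0 = estimate_pitch_hz(tone, sr)
    print(f0, hz_to_midi(f0))
```

Because the lag is an integer number of samples, the estimate is quantized; a practical module would likely interpolate around the peak and smooth estimates across frames before comparing them with a note pitch model.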