An electronic musical instrument stores plural pieces of voice data (i.e., voice waveform data) indicating plural syllables (a, i, u, etc., or do, re, mi, etc.) and automatic performance data indicating a performed music piece. The automatic performance data is composed of a series of note data and information indicating voice data corresponding to the note data. The pitch indicated by the performance information from a keyboard and the pitch indicated by the performance data are compared, and in case where both pitches correspond with each other, the voice data is reproduced with a frequency corresponding to both pitches (steps S21 and S22). In case where both pitches do not correspond with each other, the voice data is reproduced with a frequency having the pitch indicated by the inputted performance information (steps S21, S23 and S24). Therefore, a user can practice playing a musical instrument with fun by getting the user to listen to voices such as lyrics, syllable names (do, re, mi, etc.) or the like.