A pronunciation training
system extracts pronunciation features from various pronunciation samples, links pronunciation features with corresponding
muscle movements and diagram representations, displays related waveforms and pronunciation processes, and mark the differences between different waveforms and different pronunciation processes for helping a user to distinguish different sounds. First, the
system collects pronunciation samples from people, categorizes these samples, analyzes them in
time domain and in
frequency domain, identifies the positions and movements of pronunciation organs, provides interfaces for experts to define pronunciation features, extracts and compares pronunciation features, and build links between pronunciation features and pronunciation processes. Then, the
system collects pronunciation samples from a user, analyzes the pronunciation samples, extracts pronunciation features from the pronunciation samples, regenerates the pronunciation process, and displays related waveforms for helping a user to enhance the user's awareness on different sounds. The system can further increase the user's awareness on how a sound relates to a pronunciation feature and the
muscle movements of a pronunciation organ by providing interfaces for a user to create different sounds by modifying the existing sounds on its
loudness, tone, duration, and pace, by modifying the features in
time domain or
frequency domain, and by modifying the
muscle movements of related pronunciation organs.