The invention belongs to the technical field of audio
processing, and relates to an extensible
audio recognition system based on man-
machine interaction and a method thereof. The extensible
audio recognition system comprises an audio acquisition device, a voice recognition module, a loading sample unit, a finite-state
machine, a classification storage characteristic sample
database and an instruction execution module. The
audio recognition method is based on high recognition rate of isolate word speed recognition to a speaker dependent, and enables the
system to store voice segments which can not be recognized into the sample
database in an
online learning mode after a process of man-
machine interaction through the assistance of a user on the premise of fully training the user, and in addition, the cost to recognition is reduced through divided module storage and loading. The core
algorithm of the invention is based on voice signals, is not limited to languages of speakers, and can support the recognition of mixed languages (for example, Chinese and English and the like). The method has lower
false recognition rate and no recognition rate, and improves the reliability and adaptability of the system through dialogue interaction and online increment training.