Extensible audio recognition method based on man-machine interaction

A technology of speech recognition and human-computer interaction, applied in the field of scalable speech recognition of human-computer interaction, can solve the problems of unreliable recognition rate and unreliable system, and achieve the effect of low unrecognized rate, reduced overhead, and real-time matching speed.

Inactive Publication Date: 2010-12-22
FUDAN UNIV
View PDF0 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Since the unreliability of the recognition rate will directly lead to the unreliability of the system, it i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extensible audio recognition method based on man-machine interaction
  • Extensible audio recognition method based on man-machine interaction
  • Extensible audio recognition method based on man-machine interaction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] figure 1 It is a structural diagram of the human-computer interaction expandable speech recognition system of the present invention, including an audio collection device, a speech recognition module, a loading sample unit, a finite state machine, a classification and storage feature sample library, and an instruction execution module.

[0057] Classification storage feature sample library is manually classified and stored in the hard disk, while the loaded sample unit is stored in the memory. During the identification process, the feature sample library only loads the samples corresponding to some instructions to the loading sample unit according to the state of the finite state machine. The speech signal collected by the audio collection device is sent to the speech recognition module, which matches the input speech segment signal with the loaded sample speech signal, outputs the recognition result—command number and similarity, and changes the finite state machine of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of audio processing, and relates to an extensible audio recognition system based on man-machine interaction and a method thereof. The extensible audio recognition system comprises an audio acquisition device, a voice recognition module, a loading sample unit, a finite-state machine, a classification storage characteristic sample database and an instruction execution module. The audio recognition method is based on high recognition rate of isolate word speed recognition to a speaker dependent, and enables the system to store voice segments which can not be recognized into the sample database in an online learning mode after a process of man-machine interaction through the assistance of a user on the premise of fully training the user, and in addition, the cost to recognition is reduced through divided module storage and loading. The core algorithm of the invention is based on voice signals, is not limited to languages of speakers, and can support the recognition of mixed languages (for example, Chinese and English and the like). The method has lower false recognition rate and no recognition rate, and improves the reliability and adaptability of the system through dialogue interaction and online increment training.

Description

technical field [0001] The invention relates to an expandable voice recognition method of human-computer interaction, which can be used in various intelligent electronic products where voice recognition is required, and the system has higher reliability through human-computer interaction, belonging to isolated word voice recognition, intelligent Robotic human-computer interaction and other technical categories. Background technique [0002] A large number of electronic devices need to interact with operators to complete specific functions. The most common is to operate the machine by buttons or remote control. After the computer was born, the software interface was operated through the mouse and keyboard. The voice interaction method is gradually adopted by various systems. It is very convenient and does not require users to use any additional equipment, and the dialogue method is more easily accepted by the majority of users. At the same time, the shortcomings of voice i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/22G10L15/06G10L15/08G10L15/07G10L17/22
Inventor 王视鎏冯瑞金城薛向阳
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products