Combined lip reading and voice recognition multimodal interface system

a multimodal interface and lip reading technology, applied in surveying and navigation, navigation instruments, instruments, etc., can solve problems such as user's attention, increased accident risk, and difficulty in operating the navigation system

Active Publication Date: 2011-03-24
HYUNDAI MOTOR CO LTD +1
View PDF24 Cites 139 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0009]The present provides, in preferred aspects, a combined lip reading and voice recognition multimodal interface system, which implements a lip reading system that effectively detects lips from a face image through a camera, suitably tracks lip movements, and suitably recognizes a voic...

Problems solved by technology

Although the use of the touch screen can minimize input errors, a user has to use his or her hands and eyes at the same time, which makes it difficult to operate the navigation system during driving, and also distracts the user's attention, thus increasing the risk of an accident.
However, this method is susceptible to audio noise, and therefore a malfunction in recognition may occur in a noisy environment.
However, at present, there has not been any consistent research on all the processes.
Its performance is susceptible to an initial position, and quick movements of lips in speech cannot be robustly tracked, thereby making it difficult to obtain stable feature values when tracking on a video.
Further, while research has been conducted on recognizer algori...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Combined lip reading and voice recognition multimodal interface system
  • Combined lip reading and voice recognition multimodal interface system
  • Combined lip reading and voice recognition multimodal interface system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027]In a first aspect, the present invention features a combined lip reading and voice recognition multimodal interface system, comprising an audio voice input unit, a voice recognition unit, a voice recognition instruction and estimated probability output unit, a lip video image input unit, a lip reading unit, a lip reading recognition instruction output unit, a voice recognition and lip reading recognition result combining unit that outputs the voice recognition instruction.

[0028]In one embodiment, the audio voice input unit obtains a sound signal input by an audio input sensor or an input audio signal transmitted from the outside by wired or wireless connection.

[0029]In another embodiment, the voice recognition unit recognizes voice from the input audio signal and calculates an estimated recognition accuracy.

[0030]In a further embodiment, the voice recognition instruction and estimated probability output unit outputs an instruction corresponding to the voice recognized by the v...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a combined lip reading and voice recognition multimodal interface system, which can issue a navigation operation instruction only by voice and lip movements, thus allowing a driver to look ahead during a navigation operation and reducing vehicle accidents related to navigation operations during driving. The combined lip reading and voice recognition multimodal interface system in accordance with the present invention includes: an audio voice input unit; a voice recognition unit; a voice recognition instruction and estimated probability output unit; a lip video image input unit; a lip reading unit; a lip reading recognition instruction output unit; and a voice recognition and lip reading recognition result combining unit that outputs the voice recognition instruction

Description

CROSS-REFERENCE TO RELATED APPLICATION[0001]This application claims under 35 U.S.C. §119(a) the benefit of Korean Patent Application No. 10-2009-0089637 filed on Sep. 22, 2009, the entire contents of which are incorporated herein by reference.BACKGROUND OF THE INVENTION[0002]The present invention relates, in general, to a combined lip reading and voice recognition multimodal interface system. More particularly, in preferred embodiments, the present invention relates to a combined lip reading and voice recognition multimodal interface system, which can suitably issue a navigation operation instruction primarily, preferably only, by voice and lip movements, thus preferably allowing a driver to look ahead during a navigation operation and suitably reducing vehicle accidents related to navigation operations during driving.[0003]Presently, with the development of automobile technology and the increasing use of vehicles in daily life, there has been increasing interest and demand for safe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/00G10L15/04G10L17/00G10L21/00G06K9/00
CPCG01C21/3602G01C21/3608G10L15/32G06K9/00335G10L15/25G06F3/011G06V40/20G06F3/017G06F3/0481
Inventor KIM, DAE HEEKIM, DAI-JINLEE, JINSHIN, JONG-JULEE, JIN-SEOK
Owner HYUNDAI MOTOR CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products