Unlock instant, AI-driven research and patent intelligence for your innovation.

Multi-modal information processing and interaction system

An interactive system, multi-modal technology, applied in electrical digital data processing, special data processing applications, digital data information retrieval and other directions, can solve the problems of rigid dialogue mechanism, simple mode fusion method, etc. The effect of natural and flexible human-computer interaction and natural multi-modal interaction

Active Publication Date: 2021-04-06
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to effectively solve the problem of simple modal fusion and rigid dialogue mechanism in the multimodal interactive system, the present invention first establishes a multimodal information fusion model, based on the D-S evidence theory, making full use of multimodal information for intent Fusion, and combine the interaction information of each mode under the intent based on the slot filling method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-modal information processing and interaction system
  • Multi-modal information processing and interaction system
  • Multi-modal information processing and interaction system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The framework of the multi-modal human-computer interaction system proposed by the present invention is as follows: figure 1 As shown, the multimodal human-computer interaction system is mainly divided into four functional modules:

[0022] (1) Multi-modal information cognition module: recognize each modal interactive information, including multiple interactive information identification modules. The present invention mainly includes a voice command recognition module and a gesture recognition module. The invention has strong expansibility, and traditional modules such as touch control and joystick can be added later.

[0023] (2) Multi-modal information fusion module: first, the D-S evidence-based method is used to fully utilize multi-modal information for intent fusion, and then the information is integrated to combine the multi-modal information into formalized instructions;

[0024] (3) Multi-modal dialog management module: a dialog management model that combines ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a multi-modal information processing and interaction system which is used for solving the problems that in a multi-modal interaction system, a modal fusion mode is simple, and a dialogue mechanism is stiff. The system comprises a multi-modal information cognition module, a multi-modal information fusion module and a multi-modal dialogue management module, the multi-modal information cognition module is used for identifying all modal interaction information of a user, and the multi-modal information fusion module performs intention fusion on all modal interaction information of the user by using a DS evidence theory to obtain a multi-modal dialogue result; a final interaction intention of the user is determined and a formalized instruction which corresponds to the final interaction intention of the user and can be identified by a machine is acquired. the multi-modal dialogue management module adopts a dialogue management model integrating a finite-state machine and an information slot filling method for a multi-modal human-computer interaction scene, and is used for controlling a dialogue process and generating a response; according to the invention, the accuracy of user interaction intention recognition is effectively improved, and natural and flexible man-machine interaction is realized.

Description

technical field [0001] The invention relates to multi-modal information fusion technology, in particular to a multi-modal interactive system that effectively utilizes multi-modal information and can realize human-computer friendly interaction. Background technique [0002] Since the late 20th century, more and more scholars have paid more and more attention to the research on multimodal human-computer interaction. Many university laboratories and scientific research institutions at home and abroad have established relevant scientific research teams, such as the School of Human-Computer Interaction at Carnegie-Mellon University, the Artificial Intelligence Research Center at Stanford University, and the Media Lab at the Massachusetts Institute of Technology. Large companies such as Google and Microsoft have also injected a lot of manpower and material resources into the research of multimodal human-computer interaction. Since multi-modal human-computer interaction has receiv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F16/332
CPCG06F16/3329G06F18/257G06F18/25
Inventor 甘明刚徐磊田宗凯陈杰陈文颉陈晨窦丽华
Owner BEIJING INSTITUTE OF TECHNOLOGYGY