Unlock instant, AI-driven research and patent intelligence for your innovation.

Information processing device, information processing method and program

A technology for information processing equipment and processing units, which is applied in character and pattern recognition, speech analysis, instruments, etc., and can solve problems such as low accuracy, insufficient robustness, and speaker recognition.

Inactive Publication Date: 2011-09-21
SONY CORP
View PDF7 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] However, in such a deterministic integration processing method using uncertain and asynchronous data input from a camera and a microphone in the system of the related art, there is a problem in that only data with insufficient robustness and low accuracy can be obtained
In this process, however, since only mouth movements are the subject to be evaluated, there is the problem that, for example, a user chewing gum will also be recognized as a speaker

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information processing device, information processing method and program
  • Information processing device, information processing method and program
  • Information processing device, information processing method and program

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Hereinafter, an information processing device, an information processing method, and a program according to embodiments of the present invention will be described in detail with reference to the accompanying drawings. Descriptions are provided in the following topics.

[0042] 1. Overview on user localization and user identification processing by particle filtering based on audio and image event detection information

[0043] 2. Regarding speaker specification processing associated with score (AVSR score) calculation processing by voice and image-based voice recognition

[0044] Furthermore, the present invention is based on the technology of Japanese Patent Application No. 2007-317711 (Japanese Unexamined Patent Application Publication No. 2009-140366 ), which is the applicant's previous application, and the outline and composition of the invention disclosed therein will be described in Subject 1 above. Hereinafter, speaker designation processing associated with score...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an information processing device, an information processing method and a program. The information processing device includes an audio-based speech recognition processing unit which is input with audio information as observation information of a real space, executes an audio-based speech recognition process, thereby generating word information that is determined to have a high probability of being spoken, an image-based speech recognition processing unit which is input with image information as observation information of the real space, analyzes mouth movements of each user included in the input image, thereby generating mouth movement information, an audio-image-combined speech recognition score calculating unit which is input with the word information and the mouth movement information, executes a score setting process in which a mouth movement close to the word information is set with a high score, thereby executing a score setting process, and an information integration processing unit which is input with the score and executes a speaker specification process.

Description

technical field [0001] The present invention relates to an information processing device, an information processing method and a program. More specifically, the present invention relates to an information processing device, an information processing method, and a program that enable information such as images and sounds to be input from an external environment and analyze the external environment based on the input information, specifically, specify a position of an object and recognize objects such as speakers. Background technique [0002] A system that performs communication or interactive processing between a human and an information processing device such as a PC or a robot is called a human-computer interaction system. In such a human-computer interaction system, an information processing device such as a PC or a robot receives image information or audio information, analyzes the received information, and recognizes a human voice or motion. [0003] When a person tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/24G06V10/80G10L15/00G10L17/00G10L17/10G10L17/14
CPCG10L15/25G10L15/32G10L15/142G06K9/00221G06K9/6288G10L2015/025G06V40/16G06V10/80G06F18/25
Inventor 泽田务
Owner SONY CORP