Speech processing device

A voice processing and voice technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as unsmooth speech, unavoidable misrecognition, and mispronunciation

Inactive Publication Date: 2006-08-16
TOSHIBA TEC KK
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, as mentioned above, the utterance of the target language after the front trigger such as the operation of the voice operation button and the utterance of the keyword requires accurate utterance as described above. Therefore, the speaker is aware of this and becomes nervous, causing the speech to be unsmooth. or high probability of being wrong
Therefore, it is difficult to avoid misrecognition due to the user's utterance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech processing device
  • Speech processing device
  • Speech processing device

Examples

Experimental program
Comparison scheme
Effect test

no. 1 approach

[0021] FIG. 1 is a block diagram showing the overall configuration of a speech processing device 1 . The block diagram shown in FIG. 1 is a functional block diagram, and various functions shown in the functional block diagram are executed by a computer (not shown). In other words, the functions shown in FIG. 1 are realized by arithmetic processing in the processor according to the program code for causing the computer to execute the functions. In this case, the processor, the storage medium storing the program code, and the like may be a firmware structure configured as an integrated circuit, or may be configured by a general-purpose computer, for example. When the processor and the storage medium storing the program code are constituted by a general-purpose computer or the like, as an example, the program code is installed in advance on an HDD or the like of the general-purpose computer. The installed program code is copied to RAM, for example, and the processor built in the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

To ensure that speech recognition is performed without being accompanied by operation of a forward trigger. The system includes a speech / non-speech discrimination section 5 which discriminates whether the sound inputted from a speech input section 3 is speech or non-speech, a keyword dictionary 10, a dictionary 13 for speech recognition, a speech recognition section 8 which performs speech recognition based on the dictionary 13 for speech recognition, a speech keyword detection section 11 which detects whether the sound judged to be the speech in the speech / non-speech discrimination section 5 is a word previously registered in the keyword dictionary 10 or not, and a recognition instruction section 9 which emits the instruction to perform the speech recognition of the sound inputted at the time the sound inputted from the speech input section 3 is detected to be the sound including a word registered in the keyword dictionary 10 to the speech recognition section 8. The speech recognition is performed using the specific utterance after the user utters a desired word as a trigger.

Description

technical field [0001] The invention relates to a voice processing device, which can perform voice recognition and speaker recognition, and is used to control various devices through voice. Background technique [0002] In general, in voice processing for voice recognition and speaker recognition, there is a problem of misrecognition caused by picking up surrounding environmental voices in addition to target voices. In order to eliminate such a disadvantage, Patent Document 1 below discloses a technique of operating a button by voice before the user utters a target language. This technology is generally referred to as push-to-talk. In addition, Patent Document 2 below discloses a technique for solving the problem by uttering a specific keyword instead of the voice operation button disclosed in Patent Document 1. This technique is to wait for a word to become a keyword and obtain information after the word is recognized, which is called a voice command (magic word) method. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/00G10L15/08G10L17/00G10L15/28G10L15/04G10L25/78
Inventor 关根直树柿野友成
Owner TOSHIBA TEC KK
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products