Speech recognition apparatus, method and electronic equipment

A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as misrecognition, and achieve the effect of improving accuracy and reducing the probability of misrecognition.

Inactive Publication Date: 2015-10-14
FUJITSU LTD
View PDF13 Cites 53 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Embodiments of the present invention provide a speech recognition device, method, and electronic equipment, which can perform keywor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition apparatus, method and electronic equipment
  • Speech recognition apparatus, method and electronic equipment
  • Speech recognition apparatus, method and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0030] figure 2 is a schematic diagram of the composition of the speech recognition device in Embodiment 1 of the present invention, such as figure 2 As shown, the speech recognition device 100 includes a recognition unit 101 , a decoding unit 102 , a calculation unit 103 and a judgment unit 104 .

[0031] Wherein, the recognition unit 101 is used to recognize the speech to obtain candidate keywords; the decoding unit 102 is used to combine semantic information to decode the speech containing the speech that recognizes the candidate keywords in the speech, so as to generate The word grid corresponding to the voice that contains the voice that recognizes the candidate keyword; the computing unit 103 calculates the confidence of the candidate keyword according to the word grid; the judging unit 104 judges whether to The candidate keyword is determined as a keyword.

[0032] It can be known from the above embodiments that by combining semantic information and further identify...

Embodiment 2

[0057] Embodiment 2 provides a voice recognition device, which has the same structure as the voice recognition device in Embodiment 1. In Embodiment 2, the working principle of the speech recognition device is described by taking decoding a speech segment as an example. In this embodiment, only the speech segment is decoded, so that the complexity of the generated word grid can be controlled and the amount of calculation can be saved. In the case of decoding all speech, a more complex word lattice will be generated, but the working principle of the speech recognition device is the same as in this embodiment.

[0058] In the embodiment of the present invention, it is assumed that the voice input into the voice recognition device 100 is "zun jing shi zhang shi chuan tong mei de, xu yao cong wo zuo qi".

[0059] The recognition unit 101 recognizes the speech, and obtains the candidate keyword "teacher", wherein, the recognized speech of "teacher" is "shi zhang";

[0060] The de...

Embodiment 3

[0073] Embodiment 3 provides an electronic device, which includes the speech recognition device described in Embodiment 1 and 2. The electronic device may have functions such as voice control, through which the voice recognition device recognizes keywords, and generates corresponding control signals according to the keywords.

[0074] Figure 8 It is a schematic block diagram of the system configuration of the electronic device 800 in the embodiment of the present invention. Such as Figure 8 As shown, the electronic device 800 may include a central processing unit 801 and a memory 802; the memory 802 is coupled to the central processing unit 801. It is worth noting that this figure is exemplary; other types of structures may also be used in addition to or instead of this structure to implement telecommunications functions or other functions.

[0075] In one embodiment, the function of the voice recognition device can be integrated into the central processing unit 801 . Wh...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a speech recognition apparatus, a method and a piece of electronic equipment. The apparatus comprises an identifying unit used for identifying a speech to acquire a candidate keyword; a decoding unit decoding speeches including the speech of the identified candidate keyword on the basis of meaning information to generate a word grid corresponding to the speeches including the speech of the identified candidate keyword; a calculating unit calculating the confidence of the candidate keyword according to the word grid; and a judging unit judging whether to determine the candidate keyword as a keyword according to the confidence. By performing keyword identification in a way to refer to the meaning information, the apparatus solves the mistaken identification problem caused by similar pronunciations.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a speech recognition device, method and electronic equipment. Background technique [0002] Keyword Recognition (KWR) is a branch of speech recognition, also known as Keyword Spotting (KWS). Words other than words and various non-speech sounds. The main difference between keyword recognition and continuous speech recognition is that continuous speech recognition requires recognition of all the content of speech, while keyword recognition only requires recognition of keywords from speech. [0003] In the prior art, keywords in speech are usually identified based on acoustic models: for example, keywords can be identified directly based on the acoustic model of speech, but this method is prone to false rejection (False Rejection, FR) and false acceptance ( False Alarm, FA); in some improved schemes, a Filler model can be built to improve the accuracy of keyword recogni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/183
Inventor 石自强刘汝杰
Owner FUJITSU LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products