Speech recognition system, speech recognition request device, speech recognition method, speech recognition program, and recording medium

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
a speech recognition and request device technology, applied in the field of speech recognition system, speech recognition method, speech recognition program, can solve the problems of reducing recognition speed and recognition precision, unable to collect all an enormous amount of vocabulary in the first place, and unable to meet the needs of speech recognition. risk, to achieve the effect of suppressing risk

Inactive Publication Date: 2012-08-23

NEC CORP

View PDF4 Cites 36 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Benefits of technology

[0025]According to this invention, it is possible to provide a speech recognition system capable of secret speech recognition which suppresses a risk that a content of a user's utterance may be leaked to the third party to a minimum level in a case where a speech recognition function is realized as a service provided via a network.

[0026]Further, according to this invention, it is possible to provide a speech recognition system capable of secret speech recognition which suppresses a risk that a content expected to be uttered by the user or special information related to a task or domain to be used for a speech recognition technology by the user may be leaked to the third party to a minimum level in a case where the speech recognition function is realized as the service provided via the network.

Problems solved by technology

Note that, there are limitations on the vocabulary and phrases that can be modeled by a single language model.

If a larger volume of vocabulary and diverse phrases are to be modeled beyond the limitations, ambiguity in hypothesis search increases, which results in a decrease in recognition speed and a deterioration in recognition precision.

Further, it is impossible to collect all an enormous amount of vocabulary in the first place.

The speech recognition is widely applicable to various purposes, but poses a problem of requiring corresponding calculation amount particularly in the above-mentioned hypothesis search processing.

The speech recognition technology has been developed by solving mutually contradictory objects to increase the recognition precision and to reduce the calculation amount, but even today, there still remains a problem, for example, that there are limitations on a vocabulary number that can be handled by a cellular telephone terminal and the like.

As described above, an original object thereof is to overcome the problem that the speech recognition is difficult to perform on the mobile terminals having poor processing performance because the calculation amount involved in the speech recognition processing is severe.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

first embodiment

[0050]Next, a mode for embodying the invention is described in detail by referring to the accompanying drawings. Note that, to clarify the description, simplification or omission would be made about descriptions which are related to inputs, control processing, display, communications, and the like all of which have less to do with this invention. Here, to facilitate an understanding of the invention, premised issues would be summed up.[0051]A content (information) to be made secret includes an uttered content (information converted into data) itself and a content that can be uttered (information related to utterance: information to be used for speech recognition).[0052]The former is caused to leak by restoring speech, and the latter is caused to leak by decrypting vocabulary information included in a language model or other such operation.[0053]The speech can be restored from an acoustic feature although incompletely.[0054]Even if the speech itself cannot be restored, one that know...

fourth embodiment

[0175]Next, a fourth embodiment is described by referring to FIG. 5. Note that, to clarify the description, descriptions of the same parts as those of other embodiments are simplified or omitted.

[0176]FIG. 5 is a block diagram illustrating a configuration of the fourth embodiment. A plurality of speech recognition servers of a speech recognition system according to the fourth embodiment each provide a speech recognition service.

[0177]The information processing device that requests for the speech recognition includes an utterance dividing unit for extracting the feature vector by performing time division on the sound (speech) input thereto. Note that, instead of the time division for the feature vector, division may be performed in units of clauses or words of the speech.

[0178]The information processing device that requests for the speech recognition (requesting server) performs the shuffling or the like on a sequence relationship between the divided items of speech data, then subjec...

fifth embodiment

[0182]Next, a fifth embodiment is described by referring to FIG. 6. Note that, to clarify the description, descriptions of the same parts as those of other embodiments are simplified or omitted.

[0183]FIG. 6 is a block diagram illustrating a configuration of a fifth embodiment. A speech recognition system according to the fifth embodiment has a mode in which the speech recognition server including the acoustic likelihood detection unit is used to generate result data on the acoustic likelihood and transfer the result data to another speech recognition server including the hypothesis search unit. Further, the speech recognition system may be configured such that a secret speech identification device instructs the speech recognition server including the acoustic likelihood detection unit to perform the transfer itself. Further, the speech recognition system may be configured such that the result data on the acoustic likelihood to be transferred is divided and transferred to the plurali...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Provided is a speech recognition system, including: a first information processing device including a speech recognition processing unit for receiving data to be used for speech recognition transmitted via a network, carrying out speech recognition processing, and returning resultant data; and a second information processing device connected to the first information processing device via the network. The second information processing device performs conversion of the data into data having a format that disables a content thereof from being perceived and also enables the speech recognition processing unit to perform the speech recognition processing. Thereafter, the second information processing device transmits the data to be used for the speech recognition by the speech recognition processing unit and constructs resultant data returned from the first information processing device into a content of a valid and perceivable recognition result.

Description

TECHNICAL FIELD[0001]This invention relates to a speech recognition system, a speech recognition method, and a speech recognition program. Specifically, this invention relates to a speech recognition system, a speech recognition method, and a speech recognition program, which disable the third party from restoring details of a recognition result regarding a content of speech to be subjected to speech recognition, details of a speech recognition dictionary, or the like.BACKGROUND ART[0002]A speech recognition technology using an information processing system is a technology for taking out language information included in input speech data. A system using the speech recognition technology can be used as a speech word processor if all the speech data are converted into text, and can be used as a speech command input device if a keyword included in the speech data is extracted.[0003]FIG. 7 illustrates an example of a related speech recognition system. The speech recognition system illus...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(United States)

IPC IPC(8): G10L19/00G10L15/28G10L15/30

CPCG10L15/22G10L15/30G06F21/32G10L15/26G10L17/00G10L15/02G10L15/187G10L2015/025

InventorNAGATOMO, KENTARO

OwnerNEC CORP

Speech recognition system, speech recognition request device, speech recognition method, speech recognition program, and recording medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Benefits of technology

Problems solved by technology

Method used

Image

Examples

first embodiment

fourth embodiment

fifth embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology