Speech translation method and device

A speech translation technology, applied in the field of speech translation, that can solve problems such as wrong or inaccurate translation results.

Active Publication Date: 2022-02-25
IFLYTEK CO LTD
Cites: 6; Cited by: 0

AI Technical Summary

Problems solved by technology

[0003] However, combining speech recognition technology and text translation technology for speech translation has the disadvantage of error accumulation: wrongly recognized words produce wrong translations. Errors in the speech recognition stage therefore accumulate in the text translation stage, resulting in inaccurate translation results.

Examples

Example 1

[0079] See Figure 1, which is a schematic flowchart of the speech translation method provided in this embodiment. The method includes the following steps:

[0080] S101: Obtain a target speech to be translated.

[0081] In this embodiment, any speech that is to be translated is defined as the target speech. This embodiment does not limit the language of the target speech; for example, the target speech can be Chinese speech or English speech. Nor does it limit the length of the target speech; for example, the target speech can be a single sentence or several sentences.

[0082] It can be understood that the target speech can be obtained by recording according to actual needs. For example, telephone conversations in people's daily life, or conference recordings, can be used as the target speech. After the target speech is obtained, this embodiment can be used to realize the t...

Example 2

[0147] In this embodiment, by translating the first translation object (i.e., the recognized text of the target speech) through step S102 of the first embodiment above, a first probability distribution corresponding to the kth word in the final translated text of the target speech can be generated. This first probability distribution can be defined as P_text(y_k), where y_k refers to the kth word in the final translated text of the target speech.

[0148] The first probability distribution P_text(y_k) can include, for the kth word y_k obtained by decoding the recognized text of the target speech, a first decoding probability for each candidate word in the vocabulary. The larger the first decoding probability of a candidate word, the more likely it is that the kth word y_k obtained by decoding the recognized text of the target speech is that candidate word.
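As a minimal sketch of what a per-position distribution such as P_text(y_k) looks like, the snippet below turns decoder scores into one probability per candidate word and picks the most likely one. The use of a softmax, the four-word vocabulary, and the score values are all assumptions made for illustration; the text above does not state how the decoding probabilities are computed.

```python
import math


def softmax(logits):
    """Convert raw decoder scores into a probability distribution."""
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical vocabulary and decoder scores for position k (illustrative only).
vocab = ["the", "weather", "is", "nice"]
logits_k = [0.5, 2.0, 0.1, -1.0]

# P_text(y_k): one decoding probability per candidate word in the vocabulary.
p_text_k = softmax(logits_k)

# The candidate with the largest decoding probability is the most likely y_k.
best_index = max(range(len(vocab)), key=lambda i: p_text_k[i])
best_word = vocab[best_index]
```

Each entry of `p_text_k` lines up with the candidate word at the same index in `vocab`, which matches the description of one decoding probability per candidate word.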

[0149] With reference to Figure 4, the network structu...

Example 3

[0173] In this embodiment, the first translated text can be obtained by translating the first translation object (i.e., the recognized text of the target speech) through step S102 of the first embodiment above.

[0174] The first translated text is a text in the target translation language, and K_1 denotes the number of single characters (or words) it contains. For example, assuming that the target speech is Chinese and the target translation language is English, that is, the target speech needs to be translated into English text, then the first translated text is an English text and K_1 denotes the number of words contained in that English text.

[0175] In an optional implementation, with the network structure shown in Figure 4, the decoding vector obtained by the text decoder can be used to generate the first translated text of the target speech, where...

Abstract

The present application discloses a speech translation method and device. The method includes: after acquiring the target speech to be translated, translating both the recognized text of the target speech and the target speech itself as translation objects to obtain the final translated text of the target speech. Compared with the prior-art method, which first recognizes the target speech to obtain the recognized text and then translates only the recognized text, the translation objects in this application are richer: they include both the recognized text of the target speech and the target speech itself. Therefore, by translating the two translation objects, a more accurate translated text of the target speech can be determined.
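One simple way to picture how two translation objects could contribute to a single output word is sketched below. It linearly interpolates a distribution derived from the recognized text with one derived from the speech itself. The interpolation rule, the weight, the name `p_speech`, the three-word vocabulary, and all probability values are assumptions for illustration only; the abstract does not specify how the two translations are combined.

```python
def fuse(p_text, p_speech, weight=0.5):
    """Linearly interpolate two distributions over the same vocabulary.

    `weight` is the share given to the text-based distribution
    (a hypothetical parameter, not taken from the source).
    """
    if len(p_text) != len(p_speech):
        raise ValueError("distributions must cover the same vocabulary")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return [weight * a + (1.0 - weight) * b for a, b in zip(p_text, p_speech)]


# Hypothetical target-language vocabulary and probabilities for one position.
vocab = ["rain", "fish", "today"]
p_text = [0.3, 0.6, 0.1]    # from the recognized text; a recognition error favors "fish"
p_speech = [0.7, 0.2, 0.1]  # from the speech itself; favors "rain"

p_final = fuse(p_text, p_speech)
chosen = vocab[max(range(len(vocab)), key=lambda i: p_final[i])]
```

In this toy case the text-only distribution would select "fish", while the fused distribution selects "rain", which illustrates why a second translation object can offset a recognition error.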

Description

Technical field

[0001] The present application relates to the technical field of speech translation, in particular to a speech translation method and device.

Background

[0002] Existing speech translation methods generally include two steps: speech recognition and text translation. First, a piece of speech is recognized as text in the same language through speech recognition technology; then the recognized text is translated into text in another language using text translation technology, thereby realizing the speech translation process.

[0003] However, combining speech recognition technology and text translation technology for speech translation has the disadvantage of error accumulation: wrongly recognized words produce wrong translations. Errors in the speech recognition stage therefore accumulate in the text translation stage, resulting in inaccurate translation results.

Summary of the invention

[0004] The main purp...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/26, G06F40/58
CPC: G10L15/26, G06F40/58
Inventor: 马志强, 刘俊华, 魏思, 胡国平
Owner: IFLYTEK CO LTD