Direct speech-to-speech translation via machine learning

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A machine learning and speech technology, applied in the field of machine learning, can solve problems such as the inability of cascading systems to learn

Pending Publication Date: 2021-01-08

GOOGLE LLC

View PDF1 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Finally, cascaded systems cannot learn to produce fluent pronunciations of words that do not require translation, such as names or other proper nouns

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0025] overview

[0026] In general, the present disclosure is directed to systems and methods for training and using machine learning models (such as, for example, sequence-to-sequence models) to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network that can directly translate speech from one language to speech in another without relying on intermediate textual representations. According to one aspect of the present disclosure, the machine learning models described herein can be trained end-to-end to learn to map acoustic feature representations (e.g., spectrograms) of speech in a first language (e.g., Spanish) directly to a second language (e.g., Spanish). Acoustic feature representation (eg, spectrogram) of speech in a language (eg, English). For example, speech in the second language may correspond to a translation of speech in the first language (eg, also ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present disclosure provides systems and methods that train and use machine-learned models such as, for example, sequence-to-sequence models, to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network which can directly translate speech from one language into speech in another language, without relying on an intermediate text representation.

Description

[0001] related application [0002] This application claims priority and benefit to US Provisional Patent Application No. 62 / 826,258, which is hereby incorporated by reference in its entirety. technical field [0003] This disclosure relates generally to machine learning. More specifically, the present disclosure relates to direct and text-free speech-to-speech translation by machine learning models, such as sequence-to-sequence models. Background technique [0004] Speech-to-speech translation (S2ST) refers to the process of translating speech in one language (eg, represented by a first speech waveform) into speech in a different language (eg, represented by a different second speech waveform). Conventional S2ST systems rely on a cascaded approach that combines multiple disparate systems to perform translation. In particular, conventional S2ST systems are often divided into three components that operate separately and sequentially: automatic speech recognition (ASR), text...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L13/033G10L13/04G10L21/003

CPCG10L13/033G10L21/003G10L13/00G06F40/47G06F40/58

Inventor贾晔Z.陈Y.吴M.约翰逊F.比亚德西R.韦斯W.马彻雷

OwnerGOOGLE LLC

Direct speech-to-speech translation via machine learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology