Direct speech-to-speech translation via machine learning

A machine learning and speech technology, applied in the field of machine learning, can solve problems such as the inability of cascading systems to learn

Pending Publication Date: 2021-01-08
GOOGLE LLC
View PDF1 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Finally, cascaded systems cannot learn to produce fluent pronunciations...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Direct speech-to-speech translation via machine learning
  • Direct speech-to-speech translation via machine learning
  • Direct speech-to-speech translation via machine learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] overview

[0026] In general, the present disclosure is directed to systems and methods for training and using machine learning models (such as, for example, sequence-to-sequence models) to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network that can directly translate speech from one language to speech in another without relying on intermediate textual representations. According to one aspect of the present disclosure, the machine learning models described herein can be trained end-to-end to learn to map acoustic feature representations (e.g., spectrograms) of speech in a first language (e.g., Spanish) directly to a second language (e.g., Spanish). Acoustic feature representation (eg, spectrogram) of speech in a language (eg, English). For example, speech in the second language may correspond to a translation of speech in the first language (eg, also ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present disclosure provides systems and methods that train and use machine-learned models such as, for example, sequence-to-sequence models, to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network which can directly translate speech from one language into speech in another language, without relying on an intermediate text representation.

Description

[0001] related application [0002] This application claims priority and benefit to US Provisional Patent Application No. 62 / 826,258, which is hereby incorporated by reference in its entirety. technical field [0003] This disclosure relates generally to machine learning. More specifically, the present disclosure relates to direct and text-free speech-to-speech translation by machine learning models, such as sequence-to-sequence models. Background technique [0004] Speech-to-speech translation (S2ST) refers to the process of translating speech in one language (eg, represented by a first speech waveform) into speech in a different language (eg, represented by a different second speech waveform). Conventional S2ST systems rely on a cascaded approach that combines multiple disparate systems to perform translation. In particular, conventional S2ST systems are often divided into three components that operate separately and sequentially: automatic speech recognition (ASR), text...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L13/033G10L13/04G10L21/003
CPCG10L13/00G10L13/033G10L21/003G06F40/47G06F40/58
Inventor 贾晔Z.陈Y.吴M.约翰逊F.比亚德西R.韦斯W.马彻雷
Owner GOOGLE LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products