Direct speech-to-speech translation via machine learning
A machine learning and speech technology, applied in the field of machine learning, can solve problems such as the inability of cascading systems to learn
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] overview
[0026] In general, the present disclosure is directed to systems and methods for training and using machine learning models (such as, for example, sequence-to-sequence models) to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network that can directly translate speech from one language to speech in another without relying on intermediate textual representations. According to one aspect of the present disclosure, the machine learning models described herein can be trained end-to-end to learn to map acoustic feature representations (e.g., spectrograms) of speech in a first language (e.g., Spanish) directly to a second language (e.g., Spanish). Acoustic feature representation (eg, spectrogram) of speech in a language (eg, English). For example, speech in the second language may correspond to a translation of speech in the first language (eg, also ...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com