Speech translation method and system
A speech translation and speech technology, applied in the field of speech translation methods and systems, can solve the problems of inability to obtain enough multi-source languages, inability to meet real-time requirements, limited practicability, etc., to increase readability, reduce training difficulty, The effect of reducing the scale
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0055] This embodiment provides a speech translation method for transcribing Hokkien speech into target language text.
[0056] The speech translation model of the present invention adopts a sequence-to-sequence architecture based on Transformer, as attached figure 1 shown.
[0057] In the encoder part on the left, the input speech features are converted into another dimensional vector (InputEmbedding) by transformation, and summed with the positional encoding (Positional Encoding) vector, and then through a series of matrix operations in the dotted line box, more Abstract features, as the output of the encoding part, the operation in the dotted box can be repeated multiple times, and then the output of the encoding part is used as part of the input of the multi-head attention layer (Multi-Head Attention) of the decoder part on the right.
[0058] In the decoder part on the right, the recognized text of the previous time step is used as input, first transformed into ...
Embodiment 2
[0099] This embodiment provides a speech translation system, including:
[0100] Speech input terminal, the user inputs speech to be recognized in a small language through components such as a microphone, the terminal extracts the speech features of the speech to be recognized, and transmits the speech features to the recognition module.
[0101] The recognition module stores a list of replaceable words and a dictionary of subwords, and is loaded with a speech translation model model, calculates the position encoding vectors for the above speech features and sums them up, transcribes the speech of the minority language into the target language text, and outputs the recognized text.
[0102] Applying this system to APPs or other smart devices on mobile phones can meet the needs of users for voice translation in small languages.
[0103] Those skilled in the art can understand that all or part of the steps in the embodiment of the above voice data detection method can ...
PUM
![No PUM](https://static-eureka.patsnap.com/ssr/23.2.0/_nuxt/noPUMSmall.5c5f49c7.png)
Abstract
Description
Claims
Application Information
![application no application](https://static-eureka.patsnap.com/ssr/23.2.0/_nuxt/application.06fe782c.png)
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com