Speech translation method and device
A speech translation and speech technology, applied in the computer field, can solve the problems of inappropriate English text, unable to express the style and characteristics of the source speaker, unable to express the style and characteristics of the source speaker, etc., so as to facilitate the understanding of semantics and context. , expressive
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0076] see figure 1 , a schematic flowchart of a speech translation method provided in an embodiment of the present application, the method includes the following steps:
[0077] S101: Acquire first voice data.
[0078] In this embodiment, the voice data that needs to be translated into text is defined as the first voice data.
[0079] This embodiment does not limit the source of the first voice data. For example, the first voice data may be the real voice or the recorded voice of the source speaker, or may be the real voice or the recorded voice. The processed special effect voice.
[0080] This embodiment also does not limit the length of the first voice data. For example, the first voice data may be a word, a sentence, or a paragraph.
[0081] S102: Generate speech recognition text by performing speech recognition on the first speech data.
[0082] After the first voice data is acquired, the first voice data is converted into voice recognition text by using a voice reco...
no. 2 example
[0089] This embodiment will focus on the specific implementation of S103 in the above-mentioned first embodiment. For other relevant parts, please refer to the introduction of the first embodiment.
[0090] see figure 2 , a schematic flowchart of a speech translation method provided in an embodiment of the present application, the method includes the following steps:
[0091] S201: Acquire first voice data.
[0092] S202: Generate speech recognition text by performing speech recognition on the first speech data.
[0093] It should be noted that S201 and S202 in this embodiment are the same as S101 and S102 in the first embodiment. For related descriptions, please refer to the first embodiment, which will not be repeated here.
[0094] S203: Use the speech recognition text as a unit recognition text, or use each text segment forming the speech recognition text as a unit recognition text respectively.
[0095] In this embodiment, the speech recognition text as a whole may be...
no. 3 example
[0131] In the current speech translation technology, the audio synthesized by the machine after translation is completely the speaking style of the speaker trained in the synthesis model. The effect of the synthesized audio has a very low correlation with the speaking style of the source speaker before translation. Sometimes, simply translating It is difficult for the incoming audio to express the style and characteristics of the source speaker.
[0132] In order to solve this defect, this embodiment provides a voice translation method, which can translate the voice data of the source speaker (that is, the first voice data in the first embodiment and the second embodiment), obtain the translated text, and Audio synthesis is performed by combining the acoustic features in the speech data, so that the synthesized audio is adapted to the speech style of the source speaker, so as to achieve a more natural and expressive speech translation. This method is suitable for real-time spo...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com