Speech synthesis method and speech synthesis device
A sound synthesis and sound technology, which is applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of users' incoordination and unpleasantness, and achieve the effect of reducing sound quality and suppressing the sense of noise
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 Embodiment approach
[0045]
[0046] FIG. 1 shows the configuration of a voice interactive interface according to the first embodiment. The interface is between digital information equipment (such as digital TV and car navigation system) and the user, and supports the operation of the user's equipment by exchanging information (dialogue) with the user through voice. This interface includes a voice recognition unit 10 , a dialog processing unit 20 and a voice synthesis unit 30 .
[0047] The voice recognition unit 10 recognizes a user's voice.
[0048] The dialogue processing part 20 sends the control signal corresponding to the recognition result by the voice recognition part 10 to the digital information device, or sends the recognition result by the voice recognition part 10 and / or the response message (text) according to the control signal from the digital information device A signal for controlling and giving emotion to the response text is sent to the voice synthesis unit 30 .
[0049] Th...
no. 2 Embodiment approach
[0100] In the first embodiment, phase shaping and high-domain phase diffusion are performed in separate steps. If these are applied, it is possible to impose some other operation on the pitch waveform temporarily shaped by phase shaping. The second embodiment is characterized in that the data storage capacity is reduced by grouping the temporarily shaped pitch waveforms into clusters.
[0101] The interface according to the second embodiment includes a speech synthesis unit 40 shown in FIG. 16 instead of the speech synthesis unit 30 shown in FIG. 1 . Other constituent elements are the same as those shown in FIG. 1 . The speech synthesis unit 40 shown in FIG.
[0102] The representative pitch waveform obtained by the device shown in FIG. 17( a ) (a device separate from the voice interactive interface) is stored in advance in the representative pitch waveform DB 42 . In the apparatus shown in FIG. 17( a ), a waveform DB 34 is provided, the output of which is connected to the ...
no. 3 Embodiment approach
[0107] The storage capacity reduction effect brought about by clustering, that is, the improvement of clustering efficiency is not only effective in shaping the pitch waveform by removing phase fluctuations, but also in normalizing the amplitude and time length. In the third embodiment, when the pitch waveform is stored, a step of normalizing the amplitude and the time length is designed. In addition, when reading the pitch waveform, the amplitude and duration are appropriately converted according to the synthesized voice.
[0108] The interface according to the third embodiment includes a speech synthesis unit 50 shown in FIG. 18( a ) instead of the speech synthesis unit 30 shown in FIG. 1 . Other constituent elements are the same as those shown in Fig. 1 . The speech synthesis unit 50 shown in FIG. 18( a ) further adds a deformation unit 51 to the constituent elements of the speech synthesis unit 40 shown in FIG. 16 . The deformation unit 51 is provided between the pitch w...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com