Apparatus and method for creating dictionary for speech synthesis
A technology for creating speech synthesis dictionaries, applied in speech synthesis, speech analysis, instruments, etc. It addresses problems such as continuous recording, the inability to confirm the sound quality of synthesized waveforms, and reduced production efficiency of speech synthesis dictionaries, and thereby improves production efficiency.
Examples
First Embodiment
[0016] The synthesized dictionary creation device of the first embodiment records the voice of a user reading a sentence aloud and uses the recorded waveform to create a speech synthesis dictionary customized to that user. Speech synthesis using the dictionary created by this device can read arbitrary sentences aloud in the user's own voice quality.
[0017] Figure 1 is a block diagram of the synthesized dictionary creation device 100 of the first embodiment. The synthesized dictionary creation device of the present embodiment is provided with: a sentence storage unit 109 that stores predetermined N (N is a natural number, N≥2) sentences; a prompting unit 110 that presents to the user a first sentence sequentially selected from the N sentences stored in the sentence storage unit 109; a recording unit 101 that records the voice of the user reading the first sentence aloud and stores the recorded waveform in association wi...
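The prompt-and-record flow described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `run_recording_session`, the `present` and `record` callables, and the choice to key each recorded waveform by its sentence index are all hypothetical and are not taken from the patent text.

```python
from typing import Callable, Dict, List


def run_recording_session(
    sentences: List[str],
    present: Callable[[str], None],
    record: Callable[[], List[float]],
) -> Dict[int, List[float]]:
    """Present each stored sentence in order and keep one recording per sentence.

    `present` and `record` are hypothetical stand-ins for the prompting unit
    and the recording unit; keying recordings by sentence index is an assumption.
    """
    if len(sentences) < 2:
        # The text requires N >= 2 stored sentences.
        raise ValueError("at least two sentences are required (N >= 2)")

    recordings: Dict[int, List[float]] = {}
    for index, sentence in enumerate(sentences):
        present(sentence)              # show the first sentence to the user
        recordings[index] = record()   # capture the user's reading of it
    return recordings
```

In a test setting, `present` could simply append to a list and `record` could return a fixed list of samples, which makes the sequential selection easy to check.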
Second Embodiment
[0083] Figure 5 is a block diagram of the synthesized dictionary creation device 500 of the second embodiment. It differs from the synthesized dictionary creation device 100 of the first embodiment in that a voice quality evaluation unit 501 evaluates the voice quality of the synthesized waveform based on the similarity between the recorded waveform stored in the recording unit 101 and the synthesized waveform generated by the speech synthesis unit 107.
[0084] Here, the first sentence corresponding to the recorded waveform stored in the recording unit 101 is used as the second sentence in the speech synthesis unit 107. The degree of similarity between the recorded waveform of the first sentence and the synthesized waveform generated from the second sentence is then calculated. Because the utterance contents of the recorded waveform and the synthesized waveform match, the similarity can be evaluated without being affected by differences in utterance content. Th...
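As a rough illustration of comparing a recorded waveform with a synthesized waveform of the same sentence, the sketch below uses cosine similarity over raw samples. The visible text does not say which similarity measure the voice quality evaluation unit 501 uses, so the measure, the function names, and the 0.8 threshold are all assumptions made for this example. A practical system would more likely compare time-aligned spectral features than raw samples.

```python
import math
from typing import Sequence


def waveform_similarity(recorded: Sequence[float], synthesized: Sequence[float]) -> float:
    """Cosine similarity of two sample sequences (an assumed measure).

    The longer sequence is truncated so both have the same length.
    Returns 0.0 when either sequence is empty or silent.
    """
    n = min(len(recorded), len(synthesized))
    if n == 0:
        return 0.0
    a = recorded[:n]
    b = synthesized[:n]
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def evaluate_voice_quality(
    recorded: Sequence[float],
    synthesized: Sequence[float],
    threshold: float = 0.8,  # hypothetical acceptance threshold
) -> bool:
    """Treat the synthesized waveform as acceptable if it is similar enough."""
    return waveform_similarity(recorded, synthesized) >= threshold
```

Both waveforms are assumed to come from the same sentence, so a low score reflects a difference in voice quality and not a difference in what was said.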
Modified Example 1
[0088] In the speech synthesis dictionary creation device of this embodiment, the user is presented with first sentences sequentially selected from the predetermined N sentences, but a plurality of first sentences may be presented to the user at once. That is, a segment including a plurality of first sentences may be presented to the user. In addition, the N sentences may be stored in the sentence storage unit 109 as segments each including a plurality of sentences.
[0089] In addition, in the speech synthesis dictionary creation device of the present embodiment, whether or not to create a speech synthesis dictionary is judged based on the variable M and the amount of data of all recorded waveforms. That is, the necessity determination unit 104 determines whether or not to create a speech synthesis dictionary based on the number of first sentences whose recording has been prope...
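The kind of decision made by the necessity determination unit 104 can be sketched as below. The visible text does not say how the variable M and the data amount are combined, so this example makes several assumptions: M is taken to be the count of recorded first sentences, both criteria must be met, and the threshold values and all names (`RecordingState`, `should_create_dictionary`) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RecordingState:
    recorded_count: int  # assumed meaning of the variable M
    total_bytes: int     # amount of data of all recorded waveforms


def should_create_dictionary(
    state: RecordingState,
    min_sentences: int = 10,       # hypothetical threshold on M
    min_bytes: int = 1_000_000,    # hypothetical threshold on data amount
) -> bool:
    """Return True when enough material has been recorded to build a dictionary.

    Requiring both criteria (logical AND) is an assumption of this sketch;
    the source only states that both quantities inform the decision.
    """
    return state.recorded_count >= min_sentences and state.total_bytes >= min_bytes
```

For example, `should_create_dictionary(RecordingState(recorded_count=9, total_bytes=2_000_000))` returns `False` under these default thresholds because fewer than ten sentences have been recorded.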