Method and system for simulating reading and pronunciation of real person
A real person, voice technology, applied in the direction of speech synthesis, speech analysis, instruments, etc., can solve the problems such as the tone and emotion are not rich enough, the pronunciation cannot be read by a real person, the pronunciation is rigid and not vivid, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0054] Such as figure 1 as shown, figure 1 It is a flow chart of a method for simulating human reading pronunciation provided in Embodiment 1 of the present application. The method includes:
[0055] S101: Input the text to be pronounced in the pre-built classification model.
[0056] Among them, the classification model is pre-built through a large number of audios of webcasters collected on the Internet and corresponding texts. In this way, after inputting the text to be pronounced, a suitable speech can be found out according to the classification model, and further pronounced.
[0057] S102: Split the text to be pronounced into multiple words, and acquire the text vector of each word in sequence.
[0058] In practical applications, the text to be pronounced may be a sentence or a paragraph. When the text to be pronounced is a sentence, you can directly split the sentence into multiple words; when the text to be pronounced is a paragraph, you first need to split the te...
Embodiment 2
[0064] On the basis of Embodiment 1, Embodiment 2 of the present application provides a method of simulating the pronunciation of a real person reading aloud, the method comprising:
[0065] Input the text to be pronounced in the pre-built classification model.
[0066] Split the text to be pronounced into multiple words, and obtain the text vector of each word in turn.
[0067] Obtain the speech vector corresponding to the word according to the text vector of the word.
[0068] Convert the speech vector to audio and play it through the player.
[0069] Such as figure 2 as shown, figure 2 It is a flow chart of constructing a classification model provided in Embodiment 2 of the present application. Specifically, the construction method of the classification model includes:
[0070] S201: Collect a training sample set.
[0071] Specifically, a large number of samples need to be taken in advance, and a sentence is generally taken as a sample. It should be noted that when...
Embodiment 3
[0095] In order to realize the method for simulating the pronunciation of human reading aloud described in the first embodiment, the third embodiment of the present application provides a system for simulating the pronunciation of reading aloud for a real person. Such as Figure 5 as shown, Figure 5 It is a schematic structural diagram of a system for simulating human reading and pronunciation provided in Embodiment 3 of the present application. The system includes: a construction unit 401, an input unit 402, a split unit 403, an acquisition unit 404 and a conversion unit 405, wherein,
[0096] The construction unit 401 is configured to pre-construct the classification model.
[0097] Wherein, the classification model is pre-built by the construction unit 401 through a large amount of audios of webcasters collected on the Internet and corresponding texts. In this way, after inputting the text to be pronounced, a suitable speech can be found out according to the classificat...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com