Text-to-voice processing method and computer readable storage medium
A text processing method applied in speech analysis, speech recognition, and speech synthesis. It addresses two problems: synthesized speech falls far short of a person reading the text aloud, and emotional coloring is difficult to simulate. The result is a more entertaining and more human-like reading experience.
Examples
Embodiment 1
[0044] Embodiment 1: This embodiment provides a text-to-speech processing method. As shown in Figure 1, the method includes:
[0045] S110. Acquire the text to be converted and a conversion control instruction matched with the text to be converted.
[0046] Specifically, the conversion control instruction is edited and configured in advance and is associated with the text to be converted. For example, both the text to be converted and the conversion control instruction carry an identification number, and the text to be converted and its matching conversion control instruction are obtained according to that identification number.
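The matching by identification number described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the class `ConversionControlInstruction`, the function `acquire_text_and_instruction`, the dictionary-based stores, and the sample identifier `T001` are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field


@dataclass
class ConversionControlInstruction:
    """Hypothetical container for a pre-edited conversion control instruction."""
    text_id: str                                  # identification number shared with the text
    directives: list = field(default_factory=list)  # e.g. ["pause"]; format is an assumption


def acquire_text_and_instruction(text_id, texts, instructions):
    """Return the text and the instruction that carry the same identification number.

    `texts` maps identification numbers to text; `instructions` maps the same
    identification numbers to ConversionControlInstruction objects.
    """
    if text_id not in texts:
        raise KeyError(f"no text with identification number {text_id!r}")
    if text_id not in instructions:
        raise KeyError(f"no conversion control instruction for {text_id!r}")
    return texts[text_id], instructions[text_id]


# Usage: both stores are keyed by the shared identification number.
texts = {"T001": "Hello there."}
instructions = {"T001": ConversionControlInstruction("T001", ["pause"])}
text, instruction = acquire_text_and_instruction("T001", texts, instructions)
```

Keying both stores by the same identifier keeps the lookup in step S110 a single step; a real system might instead store the instruction alongside the text in one record.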
[0047] In a preferred embodiment, the conversion control instruction includes at least one of a pause instruction, an accent instruction, a speech rate adjustment instruction, a sentence tone adjustment instruction, and an instruction for adding a verbal habit, that is, the conversion contro...
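One way to represent the five instruction types named in this paragraph is an enumeration plus a check for the "at least one of" condition. This is only a sketch under assumptions: the enum `InstructionType`, its member names, and the function `validate_instruction_types` are hypothetical and do not come from the source, and no parameters of the individual instructions are modeled because the source text is cut off here.

```python
from enum import Enum


class InstructionType(Enum):
    """Hypothetical labels for the instruction types listed in the paragraph."""
    PAUSE = "pause"
    ACCENT = "accent"
    SPEECH_RATE = "speech_rate_adjustment"
    SENTENCE_TONE = "sentence_tone_adjustment"
    VERBAL_HABIT = "verbal_habit_addition"


def validate_instruction_types(types):
    """Check that a conversion control instruction names at least one known type."""
    kinds = set(types)
    if not kinds:
        raise ValueError("a conversion control instruction needs at least one type")
    for kind in kinds:
        if not isinstance(kind, InstructionType):
            raise TypeError(f"unknown instruction type: {kind!r}")
    return kinds


# Usage: an instruction combining a pause and a speech rate adjustment.
selected = validate_instruction_types([InstructionType.PAUSE, InstructionType.SPEECH_RATE])
```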
Embodiment 2
[0099] Embodiment 2: This embodiment provides a text-to-speech conversion system. As shown in Figure 2, the system includes:
[0100] An acquisition module 210, configured to acquire the text to be converted and the conversion control instruction matched with the text to be converted;
[0101] A processing module 220, configured to process the text to be converted based on the conversion control instruction and preset processing rules to obtain a target voice.
[0102] In a preferred embodiment, the processing module 220 includes:
[0103] A splitting unit 221, configured to split the text to be converted by morpheme units to obtain a morpheme set;
[0104] A conversion unit 222, configured to convert each morpheme in the morpheme set to obtain a corresponding audio frame set and generate a corresponding index, where the audio frame set includes a first audio frame and a second audio frame, and the first audio frame is the audio frame corresponding to the conversio...
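The split-then-convert flow of units 221 and 222 can be sketched roughly as below. Several things here are assumptions rather than the patent's method: whitespace tokenization stands in for real morpheme segmentation, the `synthesize` callable is a placeholder for an actual speech synthesizer, and the index is assumed to map each morpheme's position to the morpheme. The distinction between the first and second audio frame is not modeled, because the source text defining it is truncated.

```python
def split_into_morphemes(text):
    """Stand-in for the splitting unit: whitespace tokens play the role of morphemes."""
    return text.split()


def convert_with_index(morphemes, synthesize):
    """Stand-in for the conversion unit.

    Converts each morpheme to an audio frame set via `synthesize` and builds an
    index from position to morpheme (the index layout is an assumption).
    """
    frame_sets = []
    index = {}
    for position, morpheme in enumerate(morphemes):
        frame_sets.append(synthesize(morpheme))
        index[position] = morpheme
    return frame_sets, index


def fake_synthesize(morpheme):
    """Placeholder synthesizer: returns one 'frame' holding the UTF-8 bytes of the morpheme."""
    return [morpheme.encode("utf-8")]


# Usage: split a sample text, then convert each morpheme and build the index.
morphemes = split_into_morphemes("good morning everyone")
frame_sets, index = convert_with_index(morphemes, fake_synthesize)
```

Keeping the index keyed by position means a later step can look up which audio frame set belongs to which morpheme, which is presumably where the conversion control instructions would be applied.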
Embodiment 3
[0130] Embodiment 3: This embodiment provides a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the following steps are implemented:
[0131] Acquiring the text to be converted and a conversion control instruction matched with the text to be converted;
[0132] Processing the text to be converted based on the conversion control instruction and preset processing rules to obtain a target voice.
[0133] In a preferred implementation of this embodiment, the following steps are also implemented when the processor executes the computer program:
[0134] Splitting the text to be converted by morpheme units to obtain a morpheme set;
[0135] Converting each morpheme in the morpheme set to obtain a corresponding audio frame set and generate a corresponding index, where the audio frame set includes a first audio frame and a second audio frame, and the first audio frame is the conver...

