Speech conversion method and device, electronic equipment and readable storage medium
A speech conversion and speech technology, applied in speech analysis, speech synthesis, instruments, etc., can solve problems such as slow computing speed, affecting speech sound quality, high system performance requirements, etc., to improve discontinuity, increase computing speed, and save computing resources Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0057] Please refer to figure 2 , figure 2 It is a flow chart of the steps of the speech conversion method provided by the preferred embodiment of the present invention. The method is applied to the electronic device 100 described above, and the steps of the voice conversion method will be described in detail below.
[0058] Step S110 : Segment the voice to be converted of the speaker to be converted into a plurality of frame units to be converted based on a preset segmentation rule.
[0059] In this embodiment, the voice range to be converted can be selected by marking. Optionally, the voice to be converted can be selected from the voices of the speakers to be converted by calling an automatic voice marking tool for marking.
[0060] After the marked speech to be converted is obtained, the speech to be converted is segmented using a preset segmentation rule, so that each segmented frame unit includes a plurality of continuous speech frames.
[0061] Step S120, extracting...
no. 2 example
[0119] Please refer to Figure 9 , Figure 9 A structural block diagram of the speech conversion device 300 provided by the preferred embodiment of the present invention. The speech conversion device 300 includes: a segmentation module 310 , an extraction module 320 , a calculation module 330 , a matching module 340 and a processing module 350 .
[0120] The segmentation module 310 is configured to segment the voice to be converted of the speaker to be converted into multiple frame units to be converted based on preset segmentation rules, wherein each frame unit to be converted includes multiple continuous voice frames.
[0121] The extraction module 320 is used to extract the Mel cepstrum feature of each frame unit to be converted.
[0122] In this embodiment, the extraction module 320 extracts the Mel cepstrum feature of the frame unit to be converted including:
[0123] Perform time-frequency domain change on the frame unit to be converted to obtain spectrum information ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


