Voice style conversion method and device, equipment and storage medium
A technology of style conversion and conversion method, which is applied in speech analysis, instruments, etc., and can solve the problems of affecting the effect of voice change and low style similarity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0028] figure 1 It is a flowchart of a speech style conversion method provided by Embodiment 1 of the present invention. This embodiment is applicable to the case of performing audio voice change with a specific style and unchanged speech content on any speech. A voice style conversion method provided in this embodiment can be performed by the voice style conversion device provided in the embodiment of the present invention, which can be implemented by software and / or hardware, and integrated into the device that executes the method , the device may be a user terminal configured with any voice-changing application.
[0029] Specifically, refer to figure 1 , the method may include the following steps:
[0030] S110. Acquire a source-style speech, a target-style speech, and an initial conversion speech.
[0031] Specifically, in order to show users various voices in various voice styles, the audio voice changing technology set in the voice changing application is usually used...
Embodiment 2
[0045] Figure 2A It is a flowchart of a speech style conversion method provided by Embodiment 2 of the present invention, Figure 2B It is a schematic diagram of the principles of various speech losses during the calculation loss optimization process in the method provided by Embodiment 2 of the present invention. This embodiment is optimized on the basis of the foregoing embodiments. Specifically, such as Figure 2A As shown, this embodiment explains in detail the specific calculation process of speech content loss and speech style loss.
[0046] optional, such as Figure 2A As shown, the following steps may be included in this embodiment:
[0047] S210. Acquire source-style speech, target-style speech and initial conversion speech.
[0048] S220. Determine the speech content features of the source-style speech, the speech style features of the target-style speech, and the speech content and speech style features of the initial converted speech.
[0049] Specifically, ...
Embodiment 3
[0068] image 3 It is a flowchart of a speech style conversion method provided by Embodiment 3 of the present invention. This embodiment is optimized on the basis of the foregoing embodiments. Specifically, this embodiment explains in detail the specific process of initially converting speech for loss optimization.
[0069] optional, such as image 3 As shown, the following steps may be included in this embodiment:
[0070] S301. Acquire a source-style speech, a target-style speech, and an initial conversion speech.
[0071] S302, according to the speech content loss between the initial conversion speech and the source style speech and the speech style loss between the initial conversion speech and the target style speech, use the gradient descent algorithm to perform corresponding gradient sub-optimization on the gradient loss of the initial conversion speech, and obtain The new gradient loss.
[0072] Optionally, when performing loss optimization on the initial converte...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com