Speech generation method, apparatus, device and computer-readable medium

A speech generation technology, applied in the computer field, that addresses two problems of manual dubbing: the time nodes of the speech segments are not guaranteed to correspond, and the voiceprint features are not guaranteed to be similar.

Active Publication Date: 2022-05-20
BEIJING BYTEDANCE NETWORK TECH CO LTD

AI Technical Summary

Problems solved by technology

[0002] Converting audio and video in a first language into audio and video in a second language takes a lot of manpower. Manual conversion cannot guarantee that the second-language audio and video correspond to the time nodes of the speech segments in the first-language audio and video, nor that the voiceprint features of the two are similar. An automatic translation and dubbing technology is therefore needed.



Detailed Description of Embodiments

[0016] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided for a more thorough and complete understanding of the disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only and are not intended to limit the protection scope of the present disclosure.

[0017] It should also be noted that, for convenience of description, only the parts relevant to the invention are shown in the drawings. Where there is no conflict, the embodiments in the present disclosure and the features in the embodiments can be combined with each other.

[0018] It should be noted that conc...


Abstract

The embodiments of the present disclosure disclose a speech generation method, apparatus, electronic device and computer-readable medium. A specific implementation of the method includes: performing speaker segmentation on the original speech to determine the start time and end time of each speech segment in the original speech, obtaining the segmented speech; determining the voiceprint feature vector corresponding to each speech segment in the original speech; converting the text corresponding to each speech segment in the original speech into target-language text, obtaining the target-language text corresponding to each speech segment; and generating the target speech based on the start time and end time of each speech segment, the voiceprint feature vector corresponding to that segment, and the target-language text corresponding to that segment. This implementation automatically converts audio and video in a first language into audio and video in a second language.
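
The four steps in the abstract can be pictured as a small data flow. The following is only a minimal sketch under stated assumptions, not the patented implementation: `Segment`, `DubbedSegment`, `plan_dubbing` and `render` are hypothetical names, the `translate` and `synthesize` callables stand in for models that this record does not describe, and clipping synthesized audio to the segment window is one possible way to keep the time nodes aligned.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Segment:
    """One speech segment found by speaker segmentation (hypothetical type)."""
    start: float                  # start time in seconds
    end: float                    # end time in seconds
    text: str                     # source-language text of the segment
    voiceprint: Sequence[float]   # voiceprint feature vector of the segment


@dataclass
class DubbedSegment:
    """A segment paired with its target-language text (hypothetical type)."""
    start: float
    end: float
    target_text: str
    voiceprint: Sequence[float]


def plan_dubbing(segments: List[Segment],
                 translate: Callable[[str], str]) -> List[DubbedSegment]:
    """Keep each segment's time window and voiceprint, translate its text."""
    plan = []
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.end <= seg.start:
            raise ValueError("segment end must be after its start")
        plan.append(DubbedSegment(seg.start, seg.end,
                                  translate(seg.text), seg.voiceprint))
    return plan


def render(plan: List[DubbedSegment],
           synthesize: Callable[[str, Sequence[float], float, int], List[float]],
           total_duration: float,
           sample_rate: int = 16000) -> List[float]:
    """Write synthesized audio for each segment at its original start time.

    Assumption: audio longer than the segment window is clipped so that the
    next segment still starts at its original time node.
    """
    out = [0.0] * int(round(total_duration * sample_rate))
    for item in plan:
        begin = int(round(item.start * sample_rate))
        limit = min(int(round(item.end * sample_rate)), len(out))
        samples = synthesize(item.target_text, item.voiceprint,
                             item.end - item.start, sample_rate)
        for i, sample in enumerate(samples[:max(0, limit - begin)]):
            out[begin + i] = sample
    return out
```

In this sketch the start time, end time and voiceprint vector travel with the translated text, so the synthesis step receives everything the abstract lists as input for generating the target speech.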

Description

Technical field

[0001] The embodiments of the present disclosure relate to the field of computer technology, and in particular to a speech generation method, apparatus, device and computer-readable medium.

Background

[0002] Converting audio and video in a first language into audio and video in a second language takes a lot of manpower. Manual conversion cannot guarantee that the second-language audio and video correspond to the time nodes of the speech segments in the first-language audio and video, nor that the voiceprint features of the two are similar. An automatic translation and dubbing technology is therefore needed.

Summary of the invention

[0003] The Summary of the Disclosure is provided to introduce, in simplified form, concepts that are described in detail in the Detailed Description that follows. It is not intended to identify the key features or essential features of the claimed technical solution, nor is it intended to...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L13/04, G10L13/08, G10L15/00, G10L15/02, G10L15/26, G10L25/30
CPC: G10L13/086, G10L15/005, G10L15/02, G10L25/30, G10L15/26, G06F40/47, G06F40/279, G10L13/033, G10L17/02, G10L17/04
Inventor: 蔡猛, 孔亚鲁
Owner: BEIJING BYTEDANCE NETWORK TECH CO LTD