Unlock instant, AI-driven research and patent intelligence for your innovation.

Audio generation method and device based on artificial intelligence, equipment and storage medium

An artificial intelligence and audio technology, applied in the field of artificial intelligence, can solve the problems of inability to achieve accurate audio synthesis, rough audio synthesis, affecting audio synthesis, etc., and achieve the effect of improving stability, improving robustness, and accurate audio generation.

Pending Publication Date: 2021-12-21
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the related art, the audio synthesis method is relatively rough. Usually, the audio data of the target object is directly extracted, and then synthesized based on the extracted embedding vector of the target object to obtain the synthesized audio data. This synthesis method cannot be realized. Accurate synthesis of audio, which affects the normal audio synthesis of the user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio generation method and device based on artificial intelligence, equipment and storage medium
  • Audio generation method and device based on artificial intelligence, equipment and storage medium
  • Audio generation method and device based on artificial intelligence, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0195] Example 1. The audio generating device is a mobile terminal application program and module

[0196] The audio generating device 555 in the embodiment of the present application can be provided as a software module designed using programming languages ​​such as software C / C++, Java, etc., embedded in various mobile terminal applications based on systems such as Android or iOS (stored in executable instructions) In the storage medium of the mobile terminal, it is executed by the processor of the mobile terminal), so as to directly use the computing resources of the mobile terminal itself to complete the relevant audio synthesis tasks, and regularly or irregularly transmit the processing results to the remote server through various network communication methods , or save it locally on the mobile terminal.

example 2

[0197] Example 2. The audio generating device is a server application program and a platform

[0198] The audio generating device 555 in the embodiment of the present application can be provided as application software or a dedicated software module in a large-scale software system designed using programming languages ​​such as C / C++ and Java, and runs on the server side (in the form of executable instructions on the server side) stored in a storage medium and run by a server-side processor), and the server uses its own computing resources to complete related audio synthesis tasks.

[0199] The embodiment of the present application can also be provided as a distributed and parallel computing platform composed of multiple servers, equipped with a customized and easy-to-interact network (Web) interface or other user interfaces (UI, User Interface), forming a user interface for individuals, The audio synthesis platform used by groups or units (for audio synthesis), etc.

example 3

[0200] Example 3. The audio generating device is a server-side application program interface (API, Application Program Interface) and a plug-in

[0201] The audio generation device 555 in the embodiment of the present application can be provided as a server-side API or plug-in for users to call to execute the artificial intelligence-based audio generation method in the embodiment of the present application, and embedded in various applications.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an audio generation method and device based on artificial intelligence, electronic equipment and a computer readable storage medium, and relates to the artificial intelligence technology. The method comprises the following steps: sampling multiple pieces of audio data of a target object to obtain reference audio data of the target object; performing audio coding processing on the reference audio data of the target object to obtain a reference embedded vector of the reference audio data; performing attention processing based on timbre on the reference embedding vector of the reference audio data to obtain a timbre embedding vector of the target object; performing text coding processing on the target text to obtain a content embedding vector of the target text; and performing synthesis processing based on the tone embedding vector of the target object and the content embedding vector of the target text to obtain audio data which conforms to the tone of the target object and corresponds to the target text. According to the invention, the stability of audio synthesis can be improved.

Description

technical field [0001] The present application relates to artificial intelligence technology, and in particular to an artificial intelligence-based audio generation method, device, electronic equipment, and computer-readable storage medium. Background technique [0002] Artificial Intelligence (AI) is a comprehensive technology of computer science. By studying the design principles and implementation methods of various intelligent machines, the machines have the functions of perception, reasoning and decision-making. Artificial intelligence technology is a comprehensive subject that involves a wide range of fields, such as natural language processing technology and machine learning / deep learning. With the development of technology, artificial intelligence technology will be applied in more fields and play an increasingly important role. increasingly important value. [0003] In the related art, the audio synthesis method is relatively rough. Usually, the feature extraction ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/126G06F16/33G06F16/683G06N3/04G06N3/08
CPCG06F40/126G06F16/3343G06F16/683G06N3/084G06N3/045G10L13/033G10L13/02G10L13/047G06N3/0464G06N3/0495G06N3/0442G06N3/0455G06N3/094
Inventor 郑艺斌李新辉苏文超卢鲤
Owner TENCENT TECH (SHENZHEN) CO LTD