Multimedia data processing method and device and readable storage medium

A multimedia data processing technology, applied in the computer field. It addresses misidentification by audio classification models, which reduces the accuracy of audio data recognition, and thereby improves recognition accuracy.

Pending Publication Date: 2022-04-12
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

However, when audio data A is recognized as a whole, noise (for example, the harmony in audio data A) often interferes, which wil



Embodiment Construction

[0084] The technical solutions in the embodiments of this application are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of this application, not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in this application, without creative effort, fall within the protection scope of this application.

[0085] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the nature of intelligence and produce a new kind of intelligent machine th...


Abstract

The invention discloses a multimedia data processing method, a device, and a readable storage medium. The method comprises: when original audio data in a multimedia file is obtained, obtaining a target audio recognition model associated with the original audio data; inputting the original audio data into a target vocal music separation model, which performs vocal music separation on the original audio data to obtain a first type of audio track associated with a first object and a second type of audio track associated with a second object; acquiring voice data of the first object from the first type of audio track and inputting it into a target voice recognition model, which performs text recognition on the voice data to obtain a text recognition result for the first object; and determining the audio type of the original audio data based on the text recognition result, and storing the audio data associated with the second object in the second type of audio track. The invention can improve the accuracy of audio data recognition.
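The steps in the abstract can be sketched as a small pipeline. This is only an illustrative sketch, not the patent's implementation: the names `process_audio`, `toy_separate`, `toy_recognize`, and `toy_classify` are hypothetical, the models are passed in as plain callables, and the toy stand-ins do no real signal processing. It also assumes one reading of the final step, namely that the second-type track is what gets stored.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical stand-in for raw audio samples.
Audio = List[float]


@dataclass
class Result:
    audio_type: str       # type determined from the text recognition result
    text: str             # text recognized from the first-type track
    stored_track: Audio   # second-type track that was stored


def process_audio(
    original: Audio,
    separate: Callable[[Audio], Tuple[Audio, Audio]],
    recognize_text: Callable[[Audio], str],
    classify_text: Callable[[str], str],
    store: Dict[str, Audio],
    key: str,
) -> Result:
    """Separate, recognize text, classify by text, then store the second track."""
    # Step 1: the separation model yields the two types of audio track.
    first_track, second_track = separate(original)
    # Step 2: text recognition runs only on the first-type track.
    text = recognize_text(first_track)
    # Step 3: the audio type is decided from the text, not from the full mix.
    audio_type = classify_text(text)
    # Step 4: keep the second-type track (assumed reading of the storing step).
    store[key] = second_track
    return Result(audio_type, text, second_track)


# Toy stand-ins so the sketch runs; real models would replace these.
def toy_separate(audio: Audio) -> Tuple[Audio, Audio]:
    # Pretend even-indexed samples are the first track, odd-indexed the second.
    return audio[::2], audio[1::2]


def toy_recognize(track: Audio) -> str:
    # Pretend any non-silent track contains recognizable words.
    return "some recognized words" if any(abs(s) > 0.0 for s in track) else ""


def toy_classify(text: str) -> str:
    # Hypothetical labels: non-empty text means the audio carries lyrics/speech.
    return "lyrical" if text else "instrumental"


if __name__ == "__main__":
    storage: Dict[str, Audio] = {}
    result = process_audio(
        [0.1, 0.9, 0.2, 0.8], toy_separate, toy_recognize, toy_classify,
        storage, "clip-1",
    )
    print(result.audio_type, storage["clip-1"])
```

Passing the models in as callables keeps the sketch independent of any particular separation or speech recognition library; the point is only the ordering, in which classification sees recognized text from the separated track rather than the whole mixed signal.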

Description

Technical field

[0001] The present application relates to the field of computer technology, and in particular to a multimedia data processing method, device, and readable storage medium.

Background

[0002] At present, in some business scenarios (for example, audio data identification scenarios), if certain audio data (for example, audio data A) needs to be classified, the audio type of the audio data can be identified first, so that audio data A can be classified and stored, or so that its audio type can be provided when audio data A is used.

[0003] For example, in the existing audio classification scheme, the complete audio data A can be input directly into an audio classification model, which then recognizes audio data A as a whole to obtain its audio type. However, when the audio data A is recognized as a whole, there will often be some noise (for example, the harmony in the audio data A) interferenc...


Application Information

IPC(8): G06F16/65, G06F16/68, G06F16/683, G06F16/75, G06F16/78, G06F16/783, G06K9/62, G06N3/04
Inventor 冯鑫
Owner TENCENT TECH (SHENZHEN) CO LTD