Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data processing method and device

A data processing and audio data technology, applied in the Internet field, can solve the problems of poor phoneme time information accuracy and low phoneme alignment accuracy.

Pending Publication Date: 2021-03-23
TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD
View PDF16 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, using the existing technology, only rough phoneme alignment results in different time intervals can be obtained, the accuracy of phoneme alignment in time is low, and the accuracy of phoneme time information is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device
  • Data processing method and device
  • Data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0077] See figure 1 , is a system architecture diagram of data processing provided by an embodiment of the present invention. The server 10b establishes a connection with the user terminal 10a through a switch and a communication bus. The base frequency extraction algorithm model and the automatic speech recognition model are stored in the database 10c. The server 10b obtains the target audio data, and extracts the fundamental frequency curve of the target audio data according to ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a data processing method and device. The method comprises the following steps that: a fundamental frequency curve of target audio data is acquired, wherein the fundamental frequency curve comprises at least one fundamental frequency effective interval, and the fundamental frequency effective interval is an interval of a fundamental frequency value of the fundamental frequency curve in a target range; voice recognition is performed on the target audio data to determine phonemes corresponding to the fundamental frequency effective intervals and time information of each phoneme; and if the phonemes corresponding to a target fundamental frequency effective interval include mute phonemes, the time information of adjacent phonemes of the mute phonemes isadjusted according to the positions of the mute phonemes in a phoneme sequence corresponding to the target fundamental frequency effective interval, so that the adjacent phonemes after time information adjustment cover the mute phonemes, wherein the target fundamental frequency effective interval is any fundamental frequency effective interval in the at least one fundamental frequency effective interval. By adopting the method and the device, the time alignment accuracy of the phonemes in the audio can be improved.

Description

technical field [0001] The present application relates to the technical field of the Internet, and in particular to a data processing method and device. Background technique [0002] In the field of music applications, the realization of business application functions such as song content analysis, song detail teaching, and singing voice synthesis requires the use of time information of phonemes in audio (songs). At present, the main way to determine phonemes and time information is to use existing technologies to obtain phonemes at different times, generally through automatic speech recognition technology (Automatic Speech Recognition, ASR), to perform phoneme recognition and alignment on input audio. However, using the existing technology, only rough phoneme alignment results in different time intervals can be obtained, the accuracy of phoneme alignment in time is low, and the accuracy of phoneme time information is poor. Contents of the invention [0003] Embodiments o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/22G10L15/26
CPCG10L15/02G10L15/22G10L15/26G10L2015/025
Inventor 徐东
Owner TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products