Audio detection method, device, electronic device and readable storage medium

An audio detection technology, applied in speech analysis, speech recognition, instruments, etc. It addresses two problems: existing methods do not consider the interaction between stress and pauses, and the accuracy of prosody detection results is not high enough. It achieves the effect of improving that accuracy.

Active Publication Date: 2020-09-04
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0003] Current audio detection methods usually detect the accent or the pause in the audio separately, without considering the interaction between the two, so the accuracy of the prosody detection result is not high enough.



Embodiment Construction

[0082] Embodiments of the present application are described in detail below, and examples of the embodiments are shown in the drawings, in which the same or similar reference numerals denote, throughout, the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the figures are exemplary, serve only to explain the present application, and are not to be construed as limiting it.

[0083] Those skilled in the art will understand that, unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the description of the present application refers to the presence of features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groups thereof. It will b...


Abstract

The present application relates to the technical field of information processing, and discloses an audio detection method, device, electronic equipment, and a readable storage medium. The audio detection method includes: receiving, from a terminal, the audio to be detected and the text corresponding to the audio; aligning the audio with the text to obtain the start and end time, in the audio, of each of the multiple phonemes corresponding to the text; extracting the phoneme feature vector of each phoneme in the audio, and obtaining the audio sequence feature of the audio based on the start and end time of each phoneme; obtaining the prosody detection result of the audio based on the phoneme feature vectors and the audio sequence feature, where the prosody detection result includes the accent feature and the pause feature of the audio; and returning the prosody detection result to the terminal, so that the terminal displays the text corresponding to the accent feature and the pause feature. The audio detection method provided by the present application can improve the accuracy of prosody detection results.
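The feature-extraction step of the abstract can be illustrated with a minimal Python sketch. The text available here does not say how the phoneme feature vectors or the audio sequence feature are computed, so everything below is an assumption made for illustration: frame-level features are mean-pooled over each aligned phoneme span, and the sequence feature is taken to be each phoneme's duration plus the silent gap before the next phoneme. The names `PhonemeSpan`, `pool_phoneme_features`, `sequence_features` and the 10 ms `frame_shift` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class PhonemeSpan:
    """One aligned phoneme with its start and end time in seconds."""
    phoneme: str
    start: float
    end: float


def pool_phoneme_features(
    frames: Sequence[Sequence[float]],
    spans: Sequence[PhonemeSpan],
    frame_shift: float = 0.01,  # assumed 10 ms hop between frames
) -> List[List[float]]:
    """Mean-pool frame-level feature vectors over each phoneme's time span."""
    pooled = []
    for span in spans:
        first = int(round(span.start / frame_shift))
        # Always cover at least one frame, even for very short phonemes.
        last = max(first + 1, int(round(span.end / frame_shift)))
        window = frames[first:last]
        if not window:
            # Span lies beyond the available frames: no features to pool.
            pooled.append([])
            continue
        dim = len(window[0])
        pooled.append(
            [sum(frame[d] for frame in window) / len(window) for d in range(dim)]
        )
    return pooled


def sequence_features(spans: Sequence[PhonemeSpan]) -> List[List[float]]:
    """Per phoneme: [duration, silent gap before the next phoneme]."""
    feats = []
    for i, span in enumerate(spans):
        duration = span.end - span.start
        if i + 1 < len(spans):
            gap = max(0.0, spans[i + 1].start - span.end)
        else:
            gap = 0.0  # no following phoneme, so no gap
        feats.append([duration, gap])
    return feats
```

In this sketch the pooled vectors and the duration/gap pairs would be passed together to a single prosody model, which is one way to let accent and pause predictions inform each other rather than being detected separately.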

Description

Technical field

[0001] The present application relates to the field of speech technology, and in particular to an audio detection method, device, electronic equipment and readable storage medium.

Background

[0002] Artificial Intelligence (AI) is a comprehensive technology of computer science. By studying the design principles and implementation methods of various intelligent machines, it gives machines the functions of perception, reasoning and decision-making. Speech prosody detection is an important application field of artificial intelligence technology. It is mainly used to detect the prosody of the user's voice data. By detecting incorrect prosody in the voice data, it can provide users with real-time feedback and correction to help them improve their language skills.

[0003] The current audio detection method usually detects the stress or pause in the audio separately, without considering the interaction between the stress a...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/18, G10L15/02, G10L25/51
CPC: G10L15/02, G10L15/1807, G10L25/51, G10L2015/025
Inventor: 林炳怀, 王丽园
Owner: TENCENT TECH (SHENZHEN) CO LTD