Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio recognition method and device

An audio recognition and audio technology, applied in the field of data processing, can solve the problems of high computational complexity of the speech recognition model, dependence on classification quality, unsuitability for high concurrency, etc., and achieve the goals of increasing generalization, saving statistical time, and precise opening time Effect

Pending Publication Date: 2021-01-26
BEIJING YUANLI WEILAI SCI & TECH CO LTD
View PDF16 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Based on the statistics of the user's speaking time, ideally this solution should have the best performance results, but this solution often has the following disadvantages: 1) The speech recognition model generally has high computational complexity and is not suitable for high-concurrency online environments use
Under this scheme, the classification quality depends on manual review, otherwise there will be more errors in the category standard; The model performance is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio recognition method and device
  • Audio recognition method and device
  • Audio recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Preferred embodiments of the present application will be described in more detail below with reference to the accompanying drawings. Although preferred embodiments of the present application are shown in the drawings, it should be understood that the present application may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this application will be thorough and complete, and will fully convey the scope of this application to those skilled in the art.

[0036] The terminology used in this application is for the purpose of describing particular embodiments only, and is not intended to limit the application. As used in this application and the appended claims, the singular forms "a", "the", and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It should also be understood that the term "and / or" as used herein refers to and includes a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an audio recognition method and device. The method comprises the following steps: obtaining an original audio, adding null data of a first duration before the head of the original audio, and adding null data of a second duration after the tail of the original audio to obtain an expanded audio; taking a third time length of the sum of the first time length and the second time length as a segmentation window, and sequentially performing windowing from the head part of the expanded audio by taking a first step length to obtain a plurality of sub-audios; conducting calculation to obtain time-frequency characteristic sequences of the sub-audios; a neural network obtaining the probability that the sub-audios belong to specific classifications according to the time-frequency characteristic sequence; and comparing the probabilities with a judgment threshold to judge whether the sub-audios are of specific classifications or not. According to the scheme provided by the invention, on the premise of ensuring the interactivity and interaction efficiency of a user, the statistical time is saved, the feedback efficiency is improved, the opening time node statistical function is added, and the precision is improved.

Description

technical field [0001] The present application relates to the technical field of data processing, in particular to an audio recognition method and device. Background technique [0002] With the development of Internet technology, online education and other similar industries are booming, and the number of online learners has increased dramatically. Teachers evaluate students' classroom participation and give feedback to improve teaching effectiveness through subjective feelings and statistics of students' speaking time. [0003] At present, there are several schemes for counting the duration of a user's opening in the prior art. [0004] Based on switch-type opening time statistics, that is, a recording button that can be turned on and off is set on the user client, and the user needs to press the button before speaking. The starting point of this program is to achieve simple and direct time statistics. However, when the audience is a group of young children, the characteri...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/24G10L25/30G10L25/51
CPCG10L25/24G10L25/30G10L25/51
Inventor 贾杨夏龙吴凡郭常圳
Owner BEIJING YUANLI WEILAI SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products