Student behavior log generation method and system based on audio scene recognition
A scene recognition and audio technology, which is applied in speech analysis, biological neural network models, data processing applications, etc., to achieve strong category discrimination capabilities, accurate recognition and classification, and enhanced semantic expression
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0038] This embodiment provides a method for generating a student behavior log based on audio scene recognition. In this embodiment, the method is applied to a server for illustration. It can be understood that the method can also be applied to a terminal, and can also be applied to a terminal including a terminal. And servers and systems, and through the interaction of terminals and servers. The server can be an independent physical server, a server cluster or a distributed system composed of multiple physical servers, or a cloud service, cloud database, cloud computing, cloud function, cloud storage, network server, cloud communication, intermediate Cloud servers for basic cloud computing services such as software services, domain name services, security services CDN, and big data and artificial intelligence platforms. The terminal may be a smart phone, a tablet computer, a notebook computer, a desktop computer, a smart speaker, a smart watch, etc., but is not limited theret...
Embodiment approach
[0043] As one or more embodiments, the training process of the educational scene recognition model includes:
[0044] Extract the spectral features of audio documents and the probability distribution of topics on audio words;
[0045] Generate a topic feature matrix based on the probability distribution of topics on audio words and the spectral features of audio documents;
[0046] Update topic feature matrix based on graph neural network;
[0047] Classify the updated topic feature matrix;
[0048] Output audio scene recognition results.
[0049] As one or more embodiments, the process of extracting the spectral features of the audio document and the probability distribution of the subject on the audio words includes:
[0050] Label each audio document of each type of educational scene with an educational scene label, and build a training set based on the audio document feature matrix and educational labels of all types of educational scenes;
[0051] Perform K-means clus...
Embodiment 2
[0067] This embodiment provides a system for generating student behavior logs based on audio scene recognition.
[0068] A student behavior log generation system based on audio scene recognition, comprising:
[0069] an acquisition module, which is configured to: acquire the recorded audio document, and divide the audio document into several audio segments;
[0070] a prediction module, which is configured to: adopt a trained educational scene recognition model based on the audio clips to obtain educational scene prediction labels of each audio clip;
[0071] The behavior log generation module is configured to: mark a start time point and an end time point for each audio clip, mark the start time point and the end time point according to each audio clip, and combine the educational scene prediction labels of each audio clip to generate a time distribution behavior log;
[0072] The process of predicting the educational scene recognition model includes: a feature generation m...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


