Deep learning sample labeling method based on online education big data

A deep learning and sample labeling technology, applied in multimedia data retrieval, metadata multimedia retrieval, special data processing applications, etc., can solve the problems of inaccurate deep learning output, inaccurate samples, and unusable pictures, etc., to improve user satisfaction speed, release boring, and improve user experience

Active Publication Date: 2022-07-15
BEIJING DASHENG ON LINE TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] The marking methods for online education files (video, audio, pictures, etc.) mainly include: first, it is a pure manual labor to realize the marking by screening the files by a single person, and the subjectivity of a single person is too strong, resulting in the marking Inaccurate; the second type, based on automated annotation, is still in the research stage, and cannot be applied to image annotation, let alone video and audio
[0003] The disadvantage of the existing technology is that if thousands of files are marked, it is necessary to repeatedly watch audio, video or pictures for screening, which consumes a lot of manpower and material resources, but is very subjective
As a result, the labeled samples are not accurate enough, and finally the output of deep learning is inaccurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning sample labeling method based on online education big data
  • Deep learning sample labeling method based on online education big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The present invention will be described in detail below with reference to the accompanying drawings and embodiments:

[0052] attached Figure 1-2 It can be seen that a deep learning sample labeling method based on online education big data,

[0053] Input M pieces of online education data to be labeled, and ensure that the input data is of the same category (such as all audio, or all video), M is a large amount of data, generally more than 100,000, so as to ensure deep learning training The best model comes out.

[0054] Store the online education data to be labeled in the database;

[0055] Display the N data that need to be labeled through the online education data labeling system; the N data come from unlabeled random data in the database;

[0056] Label the displayed N data;

[0057] When M is greater than N*impressions, randomly select N data from the remaining M-N*impressions, and when it is less than N, select all the remaining data;

[0058] When M is less...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a deep learning sample labeling method based on online education big data. M pieces of online education data to be labelled are inputted, N pieces of data are displayed, and N pieces of displayed data are labelled; when M is greater than N* number of impressions, Randomly select N data from the remaining M‑N* number of impressions, when it is less than N, select all the remaining data; when M is less than N* number of impressions, randomly select N data from M again, and set the number of impressions to 1; After the sum of all data being labeled is greater than the J multiple of M, the number of times the data is labeled is greater than K as the valid data conforming to the classification; the classification library of different types of data is obtained. The invention is based on the deep learning sample labeling method based on online education big data, which improves the identification of online education data, improves user satisfaction, improves user experience, avoids personal subjectivity, releases the dullness of single repetitive work, and also greatly improves user satisfaction. The big one is convenient for all users who mark it.

Description

technical field [0001] The invention relates to an Internet online education system, in particular to a deep learning sample labeling method based on online education big data. Background technique [0002] The main methods for labeling online education documents (videos, audios, pictures, etc.) are as follows: The first is to screen the documents by a single person to achieve labeling, which is a purely manual labor, and the subjectivity of a single person is too strong, resulting in labeling Inaccurate; the second, based on automated annotation, is still in the research stage, and it cannot be applied to image annotation, not to mention video and audio. [0003] The disadvantage of the prior art is that if thousands of files are marked, it is necessary to repeatedly watch audio, video or pictures for screening, which consumes a lot of manpower and material resources, but is very subjective. As a result, the labeled samples are not accurate enough, and finally the output o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/48G06F16/45G06K9/62
CPCG06F16/48G06F18/24
Inventor 熊利陈靖李晓清
Owner BEIJING DASHENG ON LINE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products