Data classification method, device and apparatus and storage medium
A data classification and data technology, applied in the field of data processing, can solve the problem of consuming large computing resources and time resources.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0087] see figure 1 , which shows a schematic flowchart of the data classification method provided by the embodiment of the present application, the method may include:
[0088] Step S101: Obtain data to be classified.
[0089] Wherein, the data to be classified may be, but not limited to, text, image, audio, video and other data.
[0090] Step S102: Input the data to be classified into the pre-established student classification model, and obtain the classification result output by the student classification model.
[0091] Wherein, the classification result output by the student classification model includes numerical values that can represent the probability that the data to be classified belongs to each set category.
[0092] Exemplarily, if the categories include category y1, category y2, and category y3, then the classification results output by the student classification model include l1, l2, and l3, wherein l1 can represent the possibility that the data to be classi...
no. 2 example
[0099] It can be seen from the above embodiments that the category of the data to be classified is determined based on the student classification model, and the student classification model is trained using the training data in the training set. This embodiment introduces the training process of the student classification model.
[0100] see figure 2 , showing a schematic flow chart of the training process of the student classification model, which may include:
[0101] Step S201: Obtain multiple pieces of training data from the constructed training set to form a training subset.
[0102] Wherein, the amount of training data in the training subset can be set according to actual conditions.
[0103] Step S202: Input each piece of training data in the training subset into multiple teacher classification models, and obtain classification results predicted by the multiple teacher classification models for each piece of training data in the training subset.
[0104] Assuming tha...
no. 3 example
[0122] It can be known from the above embodiments that the student classification model is trained using the training data in the constructed training set. This embodiment introduces the process of constructing the training set.
[0123] There are many ways to implement training set construction, and in a possible implementation way, the process of building training set may include:
[0124] Obtain the first data set and the second data set, wherein, each piece of data in the first data set is data marked with a category, and each piece of data in the second data set is unlabeled data; the data in the first data set and the second data set The data in the two data sets are mixed, and the training set is composed of the mixed data.
[0125] Considering that there may be some unlabeled data of poor quality in the second data set, in order to prevent these unlabeled data of poor quality from affecting the training of the student classification model, this embodiment provides anot...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com