Supercharge Your Innovation With Domain-Expert AI Agents!

Out-of-distribution data detection method and device, server and storage medium

A data detection and data technology, applied in the field of machine learning, can solve problems such as inability to detect OOD data well, and achieve the effect of improving the difference in reconstruction errors and improving accuracy

Pending Publication Date: 2021-06-15
SOUTH UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the strong generalization ability of the autoencoder, even the OOD data outside the distribution of the training data may be well reconstructed, which makes the autoencoder unable to detect the OOD data well.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Out-of-distribution data detection method and device, server and storage medium
  • Out-of-distribution data detection method and device, server and storage medium
  • Out-of-distribution data detection method and device, server and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] figure 1 It is a schematic flowchart of a method for detecting out-of-distribution data provided by Embodiment 1 of the present invention. The out-of-distribution data detection method provided by the embodiment of the present invention can be realized by an autoencoder based on memory selection. like figure 1 As shown, the out-of-distribution data detection method provided in Embodiment 1 of the present invention includes:

[0053] S110. Obtain data to be detected.

[0054] Specifically, the data to be detected is high-dimensional data, which is an efficient representation, and includes various data features.

[0055] S120. Perform encoding processing on the data to be detected to obtain low-dimensional feature data.

[0056] Specifically, the encoding process of the data to be detected is to find the low-dimensional embedded representation of the high-dimensional data, and it is expected that the low-dimensional embedded representation contains the essential chara...

Embodiment 2

[0073] figure 2 It is a schematic flowchart of a method for detecting out-of-distribution data provided by Embodiment 2 of the present invention, and this embodiment is a further refinement of the foregoing embodiments. like figure 2 As shown, the out-of-distribution data detection method provided by Embodiment 2 of the present invention includes:

[0074] S210. Obtain the data to be detected.

[0075] S220. Perform encoding processing on the data to be detected to obtain low-dimensional feature data.

[0076] S230. Calculate a first cosine similarity between the low-dimensional feature data and the preset first memory data.

[0077] Specifically, the preset first memory data includes a plurality of first characteristic data. The first cosine similarity between the low-dimensional feature data and the preset first memory data refers to the first cosine similarity between the low-dimensional feature data and each first feature data of the preset first memory data, so it i...

Embodiment 3

[0092] image 3 It is a schematic structural diagram of an out-of-distribution data detection device provided in Embodiment 3 of the present invention. The device for detecting out-of-distribution data provided by the embodiments of the present invention can be realized by an autoencoder based on memory selection. The out-of-distribution data detection device provided in this embodiment can implement the out-of-distribution data detection method provided in any embodiment of the present invention, and has the corresponding functional structure and beneficial effects of the method. For the content that is not described in detail in this embodiment, please refer to any of the present invention. Description of method embodiments.

[0093] like image 3 As shown, the out-of-distribution data detection device provided by Embodiment 3 of the present invention includes: a data-to-be-detected acquisition module 310, an encoding module 320, a first data processing module 330, a secon...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses an out-of-distribution data detection method and device, a server and a storage medium. The method comprises the steps of obtaining to-be-detected data; encoding the to-be-detected data to obtain low-dimensional feature data; processing the low-dimensional feature data based on preset first memory data to obtain first low-dimensional data; processing the low-dimensional feature data based on preset second memory data to obtain second low-dimensional data; generating data to be decoded based on the first low-dimensional data and the second low-dimensional data; decoding the to-be-decoded data to obtain reconstructed data; determining a reconstruction error of the to-be-detected data and the reconstruction data, and if the reconstruction error is greater than a preset threshold, determining that the to-be-detected data is out-of-distribution data. According to the embodiment of the invention, the reconstruction capability of the self-encoder on the OOD data is limited, and the difference between the reconstruction errors of the ID data and the OOD data is improved, so the accuracy of OOD data identification is improved.

Description

technical field [0001] The embodiments of the present invention relate to the technical field of machine learning, and in particular to a method, device, server and storage medium for detecting out-of-distribution data. Background technique [0002] In the field of machine learning, the data used to train the model is usually referred to as In-distribution (ID) data, while Out-of-distribution (OOD) data refers to data that is inconsistent with the training ID data distribution. . In the practical application of neural network models, there are sometimes OOD data in the input data, which will lead to inaccurate model predictions and limit the application of neural network modules. Therefore, OOD data detection for the input data of the machine learning model is an important means to improve the prediction accuracy of the model. [0003] At present, the detection of OOD data mainly adopts the OOD detection algorithm based on autoencoder. The autoencoder uses ID data for tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/08
CPCG06N3/084
Inventor 潘超宋丽妍姚新武晓宇胡崝
Owner SOUTH UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More