Method and device for data processing
A data processing and data sample technology, applied in the field of information processing, can solve the problem that data processing methods cannot meet the high efficiency and high precision of missing value processing at the same time, so as to reduce the time required for processing missing values, improve correctness, and improve The effect of processing speed
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0024] Embodiment 1 of the present invention provides a data processing method. The method can be executed by a data processing device, wherein the device can be implemented by hardware and / or software, and can generally be integrated into a data processing platform. figure 1 is a schematic flow chart of the data processing method provided in Embodiment 1 of the present invention, such as figure 1 As shown, the method includes:
[0025] S101. Obtain a data sample.
[0026] In this embodiment, the data sample may be an entity class data sample, and the data sample includes a first data sample and a second data sample, wherein the first data sample is a data sample including missing values, and the second data sample is not including missing values data sample.
[0027] In a specific application, the data sample can be pre-stored in the database corresponding to the data processing platform. When obtaining the data sample, the data sample can be directly called from the stora...
Embodiment 2
[0040] figure 2 It is a schematic flowchart of a data processing method provided in Embodiment 2 of the present invention. This embodiment is optimized on the basis of the above embodiments, and further, before the calculation of the similarity between the attribute values of the data samples that include missing values and the attribute values of data samples that do not include missing values, further includes: The initial contribution of each attribute of the data sample is obtained according to the attribute corresponding to the missing value, and each attribute is a related attribute of the attribute corresponding to the missing value.
[0041] Further, the attribute values of the related attribute and the attribute corresponding to the missing value are all continuous values; The similarity between them, specifically: calculate the similarity between the related attribute values of the data samples that include missing values and the related attribute value...
Embodiment 3
[0072] image 3 It is a schematic flowchart of a data processing method provided by Embodiment 3 of the present invention. This embodiment is optimized on the basis of the above embodiments. Further, the determination of the number of filling samples required to fill the missing value according to the sample number determination rule includes: according to the non-missing rate of the corresponding attribute of the missing value and the not included The number of data samples with missing values determines the first number of samples required to fill the missing values; the missing value is filled according to the contribution rate of the relevant attribute of the attribute corresponding to the missing value and the number of data samples that do not include the missing value The required second number of samples; determining the number of filling samples required to fill the missing value according to the first number of samples and the second number of samples.
[0073] Co...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com