Method for preprocessing abnormal values of e-business sales amounts based on statistical discrimination process
An outlier and preprocessing technology, which is applied in the direction of electrical digital data processing, data processing applications, special data processing applications, etc., can solve problems such as missing data values, inconsistent data values, large sales volume, etc., and save time for consulting data , The effect of improving data accuracy and shortening the collection cycle
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0022] The steps of this preprocessing method are as follows:
[0023] Step 1: Improve data mining techniques and tools;
[0024] Step 2: Preliminarily verify the basic data, find out outliers, include non-outliers in the original e-commerce database, and verify outliers again;
[0025] Step 3: classify outliers; outliers are classified as: 1) missing and noisy data; 2) false data; 3) brushing data;
[0026] Step 4: Strengthen the comparison and elimination with the false information database, reduce the missing and noisy data, and fill in zeros for the missing data;
[0027] Step 5: For the false data, use DDFAI to discriminate and verify it. If it is judged to be false information, it will be included in the false information database, and it will be deleted, and the non-false information will be included in the original e-commerce database;
[0028] Step 6: Perform verification processing on the swiping data; the verification processing method is: 1) The swiping website i...
Embodiment 2
[0032] For the abnormal value of e-commerce sales, first improve the abnormal database:
[0033] 1) Carry out outlier test on the data, if it is indeed an outlier, delete the data, and record the data information in the outlier database;
[0034] 2) When collecting data again, first compare the data to be collected with the outlier database. If the information is consistent, this piece of data will not be collected and stored;
[0035] 3) Carry out outlier test on the newly collected data. If it is detected as an outlier, delete the data, and record the data information into the outlier database to improve the outlier database; repeat the cycle to continuously improve the outlier database .
[0036] Secondly, on the basis of the complete abnormal database, the classification judgment is made:
[0037] 1) When there is data noise, that is, when there is a null value, zero-fill the data. In the later stage, developers need to further improve data mining technology and improve...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com