Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Big data prediction analysis method, system and device and storage medium

A predictive analysis, big data technology, applied in the direction of instruments, character and pattern recognition, computing models, etc., can solve the problem of not being able to fully mine the correlation and interaction of data attributes, the linear regression model does not consider the interaction of features, and the statistical algorithm is not intelligent enough and other issues to achieve the effects of easy understanding, risk avoidance, and fast learning and classification

Pending Publication Date: 2020-07-31
GUANGDONG EASTONE CENTURY TECHNOLOGY CO LTD
View PDF7 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Classical methods have some inherent defects that need to be resolved: (1) the research of classical methods is time-consuming and requires the efforts of many experts; (2) because data sets generally contain a large number of interrelated and interacting data attributes, traditional statistical algorithms Not smart enough to fully mine the correlation and interaction between data attributes, so the predicted results may be simple and limited; (3) If the data comes from a questionnaire survey, it means that the survey form is designed by the surveyor himself , so the attributes of the data in the dataset may not be sufficient for researchers to create models based on their knowledge and inferences
[0005] For machine learning methods, most models (such as neural networks) cannot be explained. For interpretable models (such as decision trees), the analysis accuracy is generally not guaranteed. In addition, extracting patterns (knowledge we need) from the model needs to be done manually. , which requires the efforts of many professionals
For example, the linear regression algorithm assumes that the predicted value y and the attribute x=(x 1 ,x 2 …x n ) is linear, the predicted value y can be calculated as y=a 0 +a 1 x 1 +a 2 x 2 +…+a n x n , where a 0 is the error term for all other factors affecting the variable y except attribute x, however the linear regression model does not consider the interaction between features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Big data prediction analysis method, system and device and storage medium
  • Big data prediction analysis method, system and device and storage medium
  • Big data prediction analysis method, system and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment 1

[0078] Specifically, this embodiment provides a data prediction analysis method for the China General Social Survey (CGSS) data set. CGSS aims to systematically monitor the relationship between social structure and quality of life in China, and this example evaluates the effectiveness of predictive analysis by exploring the relationship between personal income and other factors.

[0079] This example chooses to analyze the 2015 CGSS dataset, which contains 10968 data samples collected from 10968 individuals. In this embodiment, 45 attributes that may be related to personal income are selected from the CGSS data set.

[0080] After preprocessing the collected data, use the rule fitting algorithm to generate the corresponding rules, assuming that the following four rules are generated:

[0081] Rule 1, the correlation between personal education level and annual income is the most obvious;

[0082] Rule 2, populations with at least secondary education and living in urban areas ...

specific Embodiment 2

[0097] In the expert reasoning system for engineering equipment development, the predictive analysis method of big data is used to collect and analyze data of 8 different types of combat engineering vehicles. as followed:

[0098] A1 represents fuel consumption {more, less};

[0099] A2 represents workload {large, medium, small};

[0100] A3 represents the protective ability {strong, weak};

[0101] A4 represents the combat comprehensive performance evaluation index, with 0 and 1 representing low and high decision-making attributes, respectively.

[0102] The information table is shown in Table 1.

[0103] Form l Information Form

[0104] model A1 A2 A3 A4 1 many middle weak 0 2 many Big powerful 1 3 many Small weak 0 4 many middle powerful 1 5 many Small powerful 0 6 few middle powerful 1 7 few Small powerful 0 8 few middle weak 0

[0105] To analyze it briefly, the ...

specific Embodiment 3

[0126] We can also apply the decision tree model to the traffic field, by analyzing data about road and intersection conditions, traffic conditions, traffic load, traffic control and management, etc., to predict traffic delays and services at urban intersections Level, use the observed data on green signal ratio, saturation, traffic capacity and service level as training samples to train the decision tree model, and use the trained model to predict and analyze the service level of road intersections.

[0127] Specifically, its implementation is as follows:

[0128] P1. Collect data and get data sets; we can collect data about road and intersection conditions, traffic conditions, traffic load, traffic control and management, etc., and integrate them into a data set.

[0129] P2. Preprocess the data set to obtain the interaction between the original attributes of the data in the data set; for example, extract numerical values ​​including green signal ratio, saturation, traffic c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a big data prediction analysis method, system and device and a storage medium. The method comprises the steps of collecting data to obtain a data set; generating a corresponding rule by using a rule fitting algorithm; performing regularization processing on the original attributes of the data in the data set; generating a prediction model according to the corresponding rules and the original attributes; calculating parameters of the prediction model; obtaining a weight value set of the prediction model according to the parameters; and calculating to obtain a corresponding prediction analysis result according to the weight value set. According to the invention, a unified and effective prediction analysis method is provided for each industry analysis data set according to the characteristics of the industry data set; relationships and rules existing in data are discovered through a big data analysis method, and the future development trend of things is predicted,so that the scientificity of decision making can be improved; meanwhile, enterprises can be helped to analyze future data information, and risks are effectively avoided. The method is widely applied to the technical field of data mining.

Description

technical field [0001] The present invention relates to the technical field of data mining, in particular to a method, system, device and storage medium for predictive analysis of big data. Background technique [0002] Predictive analytics is a common method in data mining where the goal is to predict unknown values ​​using known attributes. Predictive analysis can also evaluate the influence of certain attributes in the data set on the changes of certain numerical attributes. Generally, the influence is evaluated by predicting the change trend of the target numerical attribute and determining the weighting factors of other attributes on the change trend. [0003] At present, there are mainly two types of methods for predictive analysis of industry big data, one is a classic research method that follows the combination of expert knowledge and statistics, and the other is a research method based on machine learning. [0004] For the classical approach, the variables for eac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N20/00
CPCG06N20/00G06F18/217G06F18/24323
Inventor 王永斌张忠平刘廉如傅宇曾汉毛志慧
Owner GUANGDONG EASTONE CENTURY TECHNOLOGY CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products