Unlock instant, AI-driven research and patent intelligence for your innovation.

Case risk identification method based on streaming and batch big data fusion calculation

A risk identification and big data technology, applied in the field of data processing, can solve the problems of high consumption of computing resources, obvious shortcomings in relational database performance, and long computing cycles, and achieve the effect of accurate risk identification

Pending Publication Date: 2020-07-24
ZHEJIANG BANGSUN TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1) Pure batch computing has high hardware requirements, and relational databases have obvious shortcomings in terms of reading and writing, querying, and high concurrency for massive data;
[0005] 2) When the pure batch big data calculation is performed on the characteristics of massive time-series data, the calculation of the full amount of data leads to a large consumption of computing resources, a long calculation cycle, and the results of time-series calculations cannot be reused

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Case risk identification method based on streaming and batch big data fusion calculation
  • Case risk identification method based on streaming and batch big data fusion calculation

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0038] The present invention proposes a case risk identification method based on stream and batch big data fusion calculation, the method includes the following steps:

[0039] Step 1. Define the risk scenario, and extract the data within 3 years under the scenario, and deduplicate the transaction flow data, operation flow data, and static information data rolling increments and the full amount of data, and then import them into HDFS Data platform; the risk scenario mentioned is the risk of bank operations: internal and external collusion to steal funds from customers' corporate accounts, etc.;

[0040] Step 2. Based on the historical risk behavior of the risk scenario in step 1, extract the risk features and clarify the feature processing logic; the extraction of risk features is to analyze the risk behavior characteristics of the case subject in the historical risk cases, and judge whether the operation mode and The operation process, whether there are loopholes or defects, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a case risk identification method based on streaming and batch big data fusion calculation. Firstly, based on a bank operation risk scene, corresponding data (transaction flow,operation flow and static information) is extracted and imported into a big data platform; feature extraction is carried out based on historical risk behaviors in a risk scene, and feature logic is defined; and finally, feature engineering is performed through a streaming big data calculation method and a batch big data fusion calculation method, feature combination is performed to form a rule, and finally, quick decision is performed through a Spark+Rete technology to identify suspicious risk cases. According to the method, case risk analysis is carried out on the basis of a streaming big data calculation and batch big data fusion calculation method in a massive data scene for the first time, and risk identification can be carried out quickly, flexibly and accurately.

Description

technical field [0001] The invention belongs to the technical field of data processing, and in particular relates to a case risk identification method based on stream and batch big data fusion calculation. Background technique [0002] In recent years, my country's financial operational risk cases have been frequently and frequently occurred. Many banks have successively exposed operational risk cases involving huge amounts of money. Cases related to banks' business operations have occurred frequently, and the cases in terms of operational risk have become increasingly complex. secret. Operational risk cases occur frequently, causing huge economic losses and social impact, and bringing new challenges to the stable operation of the banking industry. [0003] With decades of business derivation in banks, tens of billions of data have accumulated, how to screen suspicious risk cases in massive data and carry out effective case prevention has become an urgent problem to be solve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q40/02G06Q40/04
CPCG06Q40/04G06Q40/03
Inventor 孙斌杰王新根鲁萍黄滔陈浩席龙吴晶晶
Owner ZHEJIANG BANGSUN TECH CO LTD