Data clustering identification method and device, computer system and readable storage medium

A data clustering and identification method technology, applied in the field of communication, can solve problems such as difficulty in obtaining labeled data, misjudgment in case screening, difficulty in making an effective and reliable mature neural network, etc.

Active Publication Date: 2020-01-07
CHINA PING AN PROPERTY INSURANCE CO LTD
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] The current insurance anti-fraud scheme mainly uses black and white list rules to screen cases. However, the existing black and white list rule engines can only rely on human experience. Increased fatigue strength, resulting in increased personnel costs;
[0003] If the method of supervised learning modeling is adopted according to the rules of the black and wh...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data clustering identification method and device, computer system and readable storage medium
  • Data clustering identification method and device, computer system and readable storage medium
  • Data clustering identification method and device, computer system and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0072] see figure 1 and figure 2 , the data clustering identification method of the present embodiment, using the data clustering identification device 1, comprises the following steps:

[0073] S1: Set a case in the case database 2 as a benchmark case, and set other cases in the case database 2 except the benchmark case as a comparison case; extract the benchmark case and the comparison case from the case database 2, and judge in turn Whether there is a relationship between the benchmark case and each control case; if yes, then assign a value of 1 to the relationship between the benchmark case and the control case; The relationship value assignment of 0; According to the relationship value between the benchmark case and each control case, the one-dimensional vector of the benchmark case is made; wherein, the data information of the case includes scene pictures, report text information, scene structure information , policy information, vehicle characteristics and reporter c...

Embodiment 2

[0192] see image 3 , the data clustering identification device 1 of this embodiment includes:

[0193] A one-dimensional vector formulating module 11, used to set a case in the case database 2 as a reference case, and set other cases in the case database 2 except the reference case as comparison cases; extract the reference case from the case database 2 Cases and comparison cases, determine whether there is a relationship between the benchmark case and each comparison case in turn; if yes, then assign a value of 1 to the relationship between the benchmark case and the comparison case; The relationship value assignment between the benchmark case and the contrast case is 0; the one-dimensional vector of the benchmark case is made according to the relation value between the benchmark case and each contrast case; wherein, the data information of the case includes on-site pictures, Report text information, scene structure information, insurance policy information, vehicle charact...

Embodiment 3

[0200] In order to achieve the above object, the present invention also provides a computer system, the computer system includes a plurality of computer equipment 3, the components of the data clustering identification device 1 of the second embodiment can be dispersed in different computer equipment, the computer equipment can be Smartphones, tablet computers, laptops, desktop computers, rack servers, blade servers, tower servers, or rack servers (including independent servers, or server clusters composed of multiple servers) that execute programs, etc. The computer equipment in this embodiment at least includes but is not limited to: a memory 31 and a processor 32 that can communicate with each other through a system bus, such as Figure 4 shown. It should be pointed out that, Figure 4 Only a computer device is shown with the components - but it should be understood that implementing all of the illustrated components is not a requirement and that more or fewer components m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data clustering identification method and device, a computer system and a readable storage medium, and the method is based on artificial intelligence, and comprises the following steps: setting one case in a case library as a reference case, and setting other cases, except the reference case, in the case library as contrast cases; sequentially judging whether an association relationship exists between the reference case and each contrast case or not, and making a one-dimensional vector; sequentially obtaining one-dimensional vectors of all cases in the case library, and combining the one-dimensional vectors of all cases to obtain an adjacent matrix; calculating an adjacency matrix to obtain a dense vector; calculating a dense vector so as to cluster all cases in the case library and output a clustering result; and calculating a risk value of the case according to the insurance policy information, the automobile features and the case reporter features of the case in the clustering result. According to the method, the high-risk cases are obtained by analyzing the cases in the cluster below the clustering threshold, so that practitioners can perform key analysis on the high-risk cases to identify suspected fraudulent cases which are not compensated or reported in the high-risk cases.

Description

technical field [0001] The invention relates to the field of communication technology, in particular to a data clustering identification method, device, computer system and readable storage medium. Background technique [0002] The current insurance anti-fraud scheme mainly uses black and white list rules to screen cases. However, the existing black and white list rule engines can only rely on human experience. Increased fatigue strength, resulting in increased personnel costs; [0003] If the method of supervised learning modeling is adopted according to the rules of the black and white list, and a mature neural network is made to screen cases, a large amount of labeled data, that is, cases suspected of fraud, are required for the neural network to learn; In , it is difficult to obtain a large amount of labeled data, so this method is often difficult to implement in practice, and it is difficult to make an effective and reliable mature neural network. Contents of the inv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q40/08G06K9/62
CPCG06Q40/08G06F18/23
Inventor 张密唐文
Owner CHINA PING AN PROPERTY INSURANCE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products