Re-crime risk early warning mixed attribute data processing method, medium and equipment

A technology of attribute data and processing methods, applied in data processing applications, character and pattern recognition, instruments, etc., can solve the problems of low analysis ability, sensitive selection of initial cluster centers, and high sample dimensions, so as to reduce dimensions and stabilize calculation results. , Effectively handle the effect of analysis

Pending Publication Date: 2021-07-23
SOUTH CHINA AGRI UNIV
View PDF2 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, due to the large scale of recriminal risk early warning data and high sample dimensions, the existing models have high difficulty and low analysis ability in processing data with higher latitudes.
Moreover, the data contains both continuous attributes and categorical attributes. Currently, the cluster analysis of data in the existing technology is mainly for continuous attributes, but there are few techniques for cluster analysis of categorical attributes, and few clusters Analysis techniques can deal with continuous attributes and categorical attributes at the same time, and they are sensitive to the selection of initial cluster centers, which can easily cause the clustering results to fall into local optimum and cause the clustering effect to fluctuate greatly, and it is difficult to describe the samples and The dissimilarity between cluster centers and the differences between samples, without considering the different influences of the importance of different conditional attributes on the results, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Re-crime risk early warning mixed attribute data processing method, medium and equipment
  • Re-crime risk early warning mixed attribute data processing method, medium and equipment
  • Re-crime risk early warning mixed attribute data processing method, medium and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0076] This embodiment discloses a method for processing re-offending risk early warning mixed attribute data, which can effectively reduce the dimensions of the data, eliminate redundant attributes, and effectively carry out all special groups who have undergone prison reform Classification can effectively process this type of data, such as figure 1 shown, including steps:

[0077] Step S1, obtaining data samples to form a data set; wherein, the samples include persons who have criminal records and re-offenders and persons who have criminal records but no longer offend.

[0078] As shown in Table 1, it is assumed that the sample set includes the following samples, and the data of each sample are as follows:

[0079] Table 1

[0080] Numbering gender education level type of crime whether to reoffend no 1

male high school theft Yes no 2

Female junior high school robbery Yes no 3

male high school theft Yes ...

Embodiment 2

[0217] This embodiment discloses a recriminal risk early warning mixed attribute data processing device, including:

[0218] The obtaining module is used to obtain data samples to form a data set; wherein, the samples include persons who have criminal records and re-offenders and persons who have criminal records but no longer offend;

[0219] The preprocessing module is used to perform preliminary data preprocessing on each sample in the data set, remove redundant items and missing items in the data set, and then convert the data set into a coordinated data set;

[0220] The reduction processing module is used to perform reduction processing on each conditional attribute in the coordinated data set, delete redundant attributes in the coordinated data set, and obtain a data set after attribute reduction;

[0221] The clustering module is used for clustering the samples according to the continuous attributes and classification attributes of the samples in the data set after the...

Embodiment 3

[0224] This embodiment discloses a storage medium, including a processor and a memory for storing a program executable by the processor. When the processor executes the program stored in the memory, it realizes the re-criminal risk early warning mixed attribute data described in Embodiment 1 The processing method is as follows:

[0225] Obtain data samples to form a data set; wherein, the samples include persons who have a criminal record and re-offend and persons who have a criminal record but no longer offend;

[0226] Carry out preliminary data preprocessing on each sample in the data set, remove redundant items and missing items in the data set, and then convert the data set into a coordinated data set;

[0227] For each conditional attribute in the coordinated data set, reduction processing is performed, redundant attributes in the coordinated data set are deleted, and a data set after attribute reduction is obtained;

[0228] For the data set after attribute reduction, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a re-crime risk early warning mixed attribute data processing method, medium and equipment, and the method comprises the steps: firstly obtaining data samples to form a data set, carrying out the preliminary data preprocessing of each sample in the data set, removing redundant items and missing items in the data set, and converting the data set into a coordinated data set; performing reduction processing on each condition attribute in the coordinated dataset, deleting redundant attributes in the coordinated dataset to obtain a dataset after attribute reduction, and finally clustering the dataset after attribute reduction. According to the method, missing attributes and redundant attributes existing in the data set can be effectively removed through data preprocessing and reduction processing, so that the dimension of the data can be effectively reduced, effective processing and analysis of the early warning data are realized, and through the data obtained based on the method, the classification accuracy of re-crime risk early warning can be higher, and the classification speed is higher.

Description

technical field [0001] The invention relates to the field of data preprocessing, in particular to a method, medium and equipment for processing mixed attribute data of re-crime risk early warning. Background technique [0002] The factors that lead to recidivism among special groups who have undergone prison reform are different, such as poor growth environment, deformed outlook on life, difficulty adapting to today's society after being released from prison, etc. And different recidivism factors have different motives and the degree of harm they bring to society. Therefore, according to the characteristics of different prisoners, using unsupervised clustering to divide the target population, and then discussing the factors that lead to recidivism and the degree of harm of each type of population, can make the prediction results more accurate and reduce the prediction algorithm. Prejudice against a particular group of people. [0003] However, due to the large scale of rec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06Q50/26
CPCG06Q50/26G06F18/23G06F18/24137G06F18/214
Inventor 李康顺王梓铭陈伟林王健聪周威池
Owner SOUTH CHINA AGRI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products