Serial/incorporated case identification method

An identification method and a technology of serial merger cases, applied in the field of serial merger case identification, can solve the problems of no fixed format, short text of criminal cases, and failure to find case correlation, etc., to achieve accurate description, improve efficiency, and improve limitations.

Inactive Publication Date: 2017-01-04
WUHAN SHUWEI TECH
View PDF4 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Although traditional clustering algorithms can find clusters of arbitrary shapes and densities, their application targets are mainly for points in numerical multidimensional vector spaces; criminal case texts are short and contain a lot of important information, but they are free texts without fixed formats. It is impossible to directly discover the association between cases through traditional clustering methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Serial/incorporated case identification method
  • Serial/incorporated case identification method
  • Serial/incorporated case identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0044] The serial and parallel case recognition method provided by the embodiment includes a case preprocessing step, a case feature extraction step, and a clustering step based on feature density; first, the case description text is obtained according to the case corpus, and the case description text is segmented and part-of-speech tagged, and the stop is removed. Use preprocessing operations s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a serial / incorporated case identification method and system. The method comprises the following steps of carrying out preprocessing: carrying out word segmentation and part-of-speech tagging on the case detail description of a case, and removing stop words; extracting the important characteristics of the case from the preprocessed case detail description through a method which combines a rule with a dictionary, and converting a case text into a characteristic vector; and according to the characteristic vector, adopting a case characteristic similarity calculation method to obtain the comprehensive similarity of the cases, clustering the cases through the comprehensive similarity, and finding the dense cluster of related cases to identify the serial / incorporated cases. Through the method and the system provided by the invention, clustering is carried out on the basis of characteristic density, and limitation that a traditional clustering algorithm is applied to a numeric type vector is eliminated. A difficult point that possible serial / incorporated case clusters cannot be obtained from a case detail text library is overcome. The method and the system can be applied to case investigation to improve investigation efficiency.

Description

technical field [0001] The invention belongs to the technical field of computer natural language processing and data mining, and more specifically relates to a method for identifying serial and parallel cases. Background technique [0002] As an important method of combating serial criminal cases, serial and parallel case analysis can tap the internal connection between cases, reduce the workload of analysts, and improve the efficiency of solving cases. The criminal case text contains information such as the time of the case, the location of the case, the modus operandi and tools, etc. Using this information combined with data mining methods to mine the internal links between cases and discover the clusters of cases can reduce the workload of analysts and improve the efficiency of solving cases. efficiency. [0003] The traditional techniques for discovering dense clusters mainly use clustering methods, among which, density-based clustering methods can discover clusters of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/289G06F40/284G06F18/23
Inventor 郑胜夏明徐涛张胜周可蒋丹
Owner WUHAN SHUWEI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products