Similar case matching method based on semantic similarity

A technology of semantic similarity and matching method, applied in computer parts, instruments, electronic digital data processing and other directions, can solve the problems of inability to effectively extract keywords and manual definitions, save the process of manual definition and feature extraction, and can The effect of enhanced usability and excellent generalization performance

Inactive Publication Date: 2019-12-17
JIANGSU HONGXIN SYST INTEGRATION
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the problem that ordinary users cannot effectively extract keywords in legal documents and traditional semantic similarity algorithms require manual definition and feature extraction, the present invention proposes a case word vector model based on Word2Vec, and automatically extracts and describes cases on this basis keywords, convert multiple examples into single examples, and convert multiple keyword vectors into fisher vectors describing the case for semantic similarity calculation. The main process includes the case word vector generation process and the semantic similarity calculation method based on the case word vector

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Similar case matching method based on semantic similarity
  • Similar case matching method based on semantic similarity
  • Similar case matching method based on semantic similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The following is attached with the manual figure 1 The present invention is described in further detail.

[0018] Step 1: Establish a case database: obtain judgment documents, preprocess and store them, and form a case database;

[0019] Step 2: Generate case word vectors; collate case corpus and remove content with low information content; use Word2Vec to train the case word vector model after word segmentation, remove stop words, and low-frequency words;

[0020] Step 3: Based on the semantic similarity calculation method of the case word vector, on the basis of the case word vector model, after extracting the keywords in the case, convert multiple examples into a single example, obtain the fisher vector describing the case, and calculate the cosine distance , to judge whether it is a similar case, and obtain the similarity judgment threshold;

[0021] Step 4: Similar case matching: Preprocess and store the case description input by the user or the imported trial do...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a case matching method based on semantic similarity, and the method comprises the following steps: obtaining a case word vector through the training of a Word2Vec model, and automatically extracting features from training data, i.e., extracting the features as one part of the model, thereby neglecting the particularity of a case text. According to the method, the keywords of the case are automatically extracted on the basis of the case word vector, and through converting multiple examples into a single example, a plurality of keyword vectors are converted into fisher vectors of the cases to perform semantic similarity calculation, so that a common user can obtain similar cases only by inputting case description or judgment documents, and the usability of the methodis greatly enhanced.

Description

technical field [0001] The invention is a case matching method based on semantic similarity, which belongs to the field of legal artificial intelligence. Background technique [0002] With the advent of the information age, the amount of information that human beings come into contact with every day is increasing, gradually moving from the age of information scarcity to the age of information overload, how to obtain effective information from it is particularly important. At present, various legal databases have stored a large amount of electronic data. Since the database can only do simple case classification, it is time-consuming and laborious to query similar cases through the database. How to quickly and efficiently search for similar cases from a large number of legal cases is a work worth exploring. With the development of Internet technology, there are already some legal case retrieval technologies related to machine learning. These technologies will search similar c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F17/22G06F17/27G06K9/62G06Q50/18
CPCG06F16/3335G06Q50/18G06F18/22
Inventor 张邱鸣糜俊于志文邵一婷丁家轩胡笳
Owner JIANGSU HONGXIN SYST INTEGRATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products