Court similar case recommendation model based on word vectors and word frequencies

A word vector and word frequency technology, applied in text database query, unstructured text data retrieval, special data processing applications, etc., can solve the problems of high difficulty in the application of deep learning models, difficulty in promoting different fields, time-consuming and laborious training sets, etc. , to achieve the effects of avoiding a huge amount of calculation, improving accuracy, and reducing the amount of calculation

Pending Publication Date: 2019-12-20
HUBEI UNIV OF TECH
View PDF3 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Applying the model to the text similarity calculation in a certain field requires a large amount of labeled text in this field as a training set, and manually labeling the training set is time-consuming and laborious
Therefore, the application of deep learning models is difficult and difficult to promote in different fields

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Court similar case recommendation model based on word vectors and word frequencies
  • Court similar case recommendation model based on word vectors and word frequencies
  • Court similar case recommendation model based on word vectors and word frequencies

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] The present invention will be further described below in conjunction with accompanying drawing:

[0061] like figure 1 As shown in the case diagram of the original case judgment, the model uses the court’s case judgment as the corpus, and the screenshot of the original document of the case judgment is shown in the figure.

[0062] like figure 2 Shown is the case diagram of the preprocessing results of the original case judgment. figure 1 The original case judgment shown is the training set text obtained after preprocessing operations such as encoding method conversion, symbol removal, stop word removal, and word segmentation.

[0063] like image 3 Shown is the algorithm flow chart of the court similar case recommendation model, starting from the input of the original court case, showing all the main functional modules in the model.

[0064] like Figure 4 Shown is the TF-W2V similarity calculation model proposed by the present invention. Among the existing simil...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a court similar case recommendation model based on word vectors and word frequencies, namely a TF-W2V similarity calculation model. The judgment documents are divided into fivecase types of criminal affairs, civil affairs, execution, compensation and administrative affairs, and in order to process, store and query the judgment documents, the model extracts the key information from the submitted judgment, and finds out the judgment with the highest similarity in the same type of judgment in the document data by adopting a Word2Vc + TF-IDF text similarity algorithm to give out the similarity and recommend the judgment. According to the method, based on a word frequency and word vector method, the keywords and the word meaning information of the texts are integrated,and the similarity of the two texts is accurately calculated. The method is applied to the court judgment for similarity calculation, and the experimental results prove that the method is simple to apply, has no requirement for a labeling training set, can be applied to the texts in different fields, consumes the moderate time in calculation, is more accurate in obtained result compared with a traditional method, is closer to the expert evaluation results, and can calculate the similarity of the court texts accurately and effectively.

Description

technical field [0001] The invention belongs to the technical field of natural language processing in the field of artificial intelligence, mainly studies the text similarity calculation technology of natural language processing, relates to a court similar case recommendation model based on word vector and word frequency, and especially relates to the text similarity calculation method of court judgments . Background technique [0002] With the steady advancement of the country's legalization construction, a system of judicial openness has been gradually formed. Judgment documents, as the record carrier of judicial litigation results, have been fully disclosed on the Internet according to law. At the same time, with the development of natural language processing technology, its integration with big data has become more and more popular. Using natural language processing technology to process the big data of judgment documents and mining its potential value is the current re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33
CPCG06F16/3347G06F16/3334
Inventor 陈建峡张伟杨帆张杰程玉
Owner HUBEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products