Adjudication document annotation method and device based on machine learning algorithm

A machine learning and document technology, applied in the field of text processing, can solve problems such as low accuracy of case information extraction, incomplete extraction of legal elements, and reduced accuracy of case element extraction

Active Publication Date: 2018-07-27
江西思贤数据科技有限公司
View PDF9 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method only considers the method of using keywords to extract legal elements, but its shortcomings are very prominent: the judgment document is not structurally divided, and the repetition of keywords leads to a decrease in the accuracy of case element extraction; the general use of case keywords to replace the concept of legal elements, Lack of professionalism in extracting legal elements of judgment documents, which does not conform to realistic judicial logic
[0008] Therefore, those skilled in the art are committed to developing a method and device for labeling referee documents based on machine learning algorithms. Through the type labeling method based on natural language understanding and semantic analysis, the referee documents are marked and expressed in an intuitive way. The complete legal elements and internal logical relationship of the case are solved, thus solving the problems of incomplete extraction of legal elements of judgment documents and low accuracy of case information extraction caused by ignoring legal expertise and judicial logic in existing methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Adjudication document annotation method and device based on machine learning algorithm
  • Adjudication document annotation method and device based on machine learning algorithm
  • Adjudication document annotation method and device based on machine learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The following describes a preferred embodiment of the present invention with reference to the accompanying drawings to make its technical content clearer and easier to understand. The present invention can be embodied in many different forms of embodiments, and the protection scope of the present invention is not limited to the embodiments mentioned herein.

[0048] Such as figure 1 As shown, a method for labeling referee documents based on machine learning algorithms in a preferred embodiment of the present invention includes the following steps:

[0049] Step S101: Collect the text set of the judgment documents to be marked.

[0050] The text collection collected in step S101 is used for training and optimization of a labeling model based on a machine learning algorithm. Wherein, the texts in the text collection include multiple natural paragraphs.

[0051] Step S102: Structurally segment each text in the text set based on a preset canonical set.

[0052] In a leg...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an adjudication document annotation method and device based on a machine learning algorithm. The method comprises the following steps that: collecting the text set of adjudication documents to be annotated; carrying out structure segmentation on texts in the text set; establishing a semantic tag library; on the basis of the semantic tag library, carrying out manual annotation on the adjudication documents to be annotated; selecting parts of adjudication documents subjected to the manual annotation as a standard dataset to be handed for machine learning, and training andoptimizing a preliminary annotation model; selecting residual parts of adjudication document models subjected to the manual annotation to obtain a mature adjudication document annotation model; and after a target adjudication document to be annotated is segmented, inputting into the mature adjudication document annotation model to obtain an annotation result. Through the method, the problem in the relevant art that adjudication document legal elements are not completely extracted and case information extraction accuracy is low is solved.

Description

technical field [0001] The present invention relates to the technical field of text processing, in particular to a method and device for labeling referee documents based on machine learning algorithms. Background technique [0002] The content of judgment documents is usually relatively long, and the facts of some cases are more complicated. For judges, it is a great challenge to quickly capture the key information points of the case and clarify the judicial logic of the case from similar cases that are pushed, which is time-consuming and expensive. It is laborious and puts pressure on the judge's trial work. For adjudication documents, it is often necessary to analyze the type of adjudication documents, each paragraph, and extract the dimension information of each paragraph and other parameters, so as to summarize historical cases in a timely manner and discover laws to improve trial efficiency and achieve judicial justice. At the same time, analyzing the effective judgmen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/35G06F40/247G06F40/30
Inventor 金耀辉姜华李慧王永坤
Owner 江西思贤数据科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products