Judicial document paragraph classification method and device, computer equipment and storage medium

A judicial document paragraph classification method, applied in fields such as computing and neural network learning methods, addresses the lack of generalization ability in rule-based extraction and achieves generalization ability together with high accuracy and recall.

Pending Publication Date: 2020-07-17
深圳市华云中盛科技股份有限公司

AI Technical Summary

Problems solved by technology

However, rule extraction does not have generalization ability, and long-term manual inter



Examples


Detailed Description of Embodiments

[0044] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0045] It should be understood that, when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or collections thereof.

[0046] It should also be understood that the terminology used ...



Abstract

The invention relates to a judicial document paragraph classification method and device, computer equipment and a storage medium. The method comprises the steps of: obtaining a judicial document; performing character segmentation on the judicial document to obtain a character matrix; performing vector extraction on the character matrix to obtain sentence representation vectors; splicing the sentence representation vectors to obtain a document representation vector; inputting the document representation vector into a classification model to obtain paragraph categories; and feeding the paragraph categories back to a terminal so that the terminal can perform information extraction. The classification model is obtained by training a model composed of a bidirectional recurrent neural network and a conditional random field, using document representation vectors with category labels as sample data. Because the sentence representation vectors are classified by the trained model composed of the bidirectional recurrent neural network and the conditional random field, judicial document paragraphs are classified automatically, the method has generalization ability, and extraction accuracy and recall are high.
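The abstract names the steps but the text available here does not give their details. The following is a minimal sketch of two of them under stated assumptions: `to_char_matrix` shows one plausible way to turn sentences into a padded matrix of character indices, and `viterbi_decode` shows how a conditional random field layer could pick the best sequence of paragraph labels from per-sentence scores, such as those a bidirectional recurrent network might produce. The function names, the `PAD`/`UNK` indices, the fixed row width, and the score layout are all hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Sequence

PAD = 0  # index reserved for padding (assumption, not from the patent)
UNK = 1  # index reserved for unseen characters (assumption)


def to_char_matrix(sentences: Sequence[str], vocab: Dict[str, int],
                   width: int) -> List[List[int]]:
    """Map each sentence to a fixed-width row of character indices."""
    matrix = []
    for sentence in sentences:
        # Truncate long sentences, then pad short ones to the same width.
        row = [vocab.get(ch, UNK) for ch in sentence[:width]]
        row += [PAD] * (width - len(row))
        matrix.append(row)
    return matrix


def viterbi_decode(emissions: Sequence[Sequence[float]],
                   transitions: Sequence[Sequence[float]]) -> List[int]:
    """Return the highest-scoring label sequence for a linear-chain CRF.

    emissions[t][k]: score of label k for sentence t.
    transitions[j][k]: score of moving from label j to label k.
    """
    if not emissions:
        return []
    num_labels = len(emissions[0])
    score = list(emissions[0])
    backpointers = []
    for t in range(1, len(emissions)):
        new_score = []
        pointers = []
        for k in range(num_labels):
            # Best previous label for ending at label k on sentence t.
            best_prev = max(range(num_labels),
                            key=lambda j: score[j] + transitions[j][k])
            new_score.append(score[best_prev] + transitions[best_prev][k]
                             + emissions[t][k])
            pointers.append(best_prev)
        score = new_score
        backpointers.append(pointers)
    # Walk the backpointers from the best final label to the first sentence.
    path = [max(range(num_labels), key=lambda k: score[k])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

Decoding the whole label sequence jointly, rather than taking the top-scoring label for each sentence on its own, lets the transition scores discourage implausible orderings of paragraph categories.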

Description

Technical field

[0001] The present invention relates to a text information processing method, and more specifically to a judicial document paragraph classification method, device, computer equipment and storage medium.

Background

[0002] In the judicial field, obtaining more information from massive numbers of judicial cases has become one of the urgent needs of the big data era. Structuring judicial documents that exist as text is, however, a prerequisite for subsequent efficient processing and in-depth analysis. Because judicial documents are rigorous and standardized, their paragraph composition and writing style are usually relatively fixed. Dividing a document into paragraphs simplifies its structure, which reduces the data complexity and difficulty of subsequent information extraction and improves accuracy.

[0003] At present, the common way of classifying judicial document paragraphs...


Application Information

IPC(8): G06F40/279, G06F40/205, G06K9/62, G06N3/04, G06N3/08
CPC: G06N3/08, G06N3/045, G06F18/214, G06F18/24
Inventor 温凯雯, 吕仲琪, 顾正
Owner 深圳市华云中盛科技股份有限公司