Information extraction method for administrative penalty decision

A technology of information extraction and text information, applied in the fields of natural language processing and legal artificial intelligence, can solve the problems of low efficiency and low accuracy of information extraction, solve the dependency of similar texts, improve accuracy and efficiency, and prevent information loss Effect

Pending Publication Date: 2022-01-11
SHANDONG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the problems of low efficiency and low accuracy in the information extraction of administrative penalty decision documents in the existing judicial field, and to provide an information extraction method for administrative penalty decision documents. Module implements text feature extraction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information extraction method for administrative penalty decision
  • Information extraction method for administrative penalty decision
  • Information extraction method for administrative penalty decision

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0046] An information extraction method for an administrative penalty decision, such as figure 1 shown, including:

[0047] Step 1: Crawl from the administrative penalty document website to obtain the administrative penalty decision letters of each province; it will be used to build a data set later.

[0048] Step 2: Extract the text content of the administrative penalty decision obtained in step 1 in the html tag, construct the original data set, and obtain the .csv file.

[0049] Step 3: According to the normative rules for writing administrative punishment decisions, use regular expressions to perform data preprocessing on administrative punishment decisions to be processed, construct data sets, and obtain .csv files.

[0050] Step 4: Input the data set constructed in step 3 into the information extraction module trained with the original data set constructed in step 2, and output the information extraction results of administrative punishment documents. Information extra...

Embodiment 2

[0052] According to the information extraction method of an administrative penalty decision document described in Embodiment 1, the difference is that:

[0053] In step 2, use the strip() function in python to remove the html label and label, obtain the text content of the administrative punishment decision letter, the text content of the administrative penalty decision letter includes the decision letter number, parties, subject qualification certificate name, unified social credit code, domicile (address), legally responsible person (responsible person, operator) ), identity card number, source of the case and investigation process, case facts, proof of evidence (notification of administrative punishment, statements, defenses, hearing opinions of the parties, review and acceptance and reasons), qualitative nature of illegal acts, basis for punishment, discretionary Facts and reasons, implementation methods and deadlines of administrative penalties, remedies and deadlines. ...

Embodiment 3

[0062] According to the information extraction method of an administrative penalty decision document described in Embodiment 2, the difference is that:

[0063] In step four, the steps are as follows:

[0064] Input the data set constructed in step 3 into the pre-training language module, and according to the text characteristics of the administrative penalty decision, obtain the short text information sequence through the sliding window self-attention mechanism, including the decision document number, party, subject qualification certificate name, unified society Credit code, domicile (address), legally responsible person (responsible person, operator), ID number; through the combination of sliding window self-attention mechanism and global attention mechanism to obtain the source of the case, investigation process, case facts, evidence (administrative Notification of punishment, parties’ statement, defense, hearing opinions, review and adoption and reasons), qualitativ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an information extraction method for an administrative penalty decision. The method comprises the following steps: 1, crawling and obtaining the administrative penalty decisions of each province from an administrative penalty document network; 2, extracting the text content of the administrative penalty decision acquired in the step 1 in the html label, and constructing an original data set; 3, performing data preprocessing on the to-be-processed administrative penalty decision by utilizing a regular expression according to a normative rule written by the administrative penalty decision, and constructing a data set; and 4, inputting the data set constructed in the step 3 into an information extraction module trained by using the original data set constructed in the step 2, and outputting an administrative penalty document information extraction result. According to the method for extracting the information of the administrative penalty decision, the structured information of the decision can be accurately obtained, and understanding of the administrative penalty decision and implementation of downstream tasks such as class case retrieval, class case recommendation and judgment prediction are facilitated.

Description

technical field [0001] The invention relates to the fields of natural language processing and legal artificial intelligence, in particular to an information extraction method of an administrative penalty decision. Background technique [0002] As an important carrier of administrative penalty legal practice, the administrative penalty decision letter increases the workload and difficulty for practitioners due to its huge quantity and complicated text content. Information extraction of administrative punishment decision letters can help practitioners quickly obtain the required text information, provide a basis for downstream tasks such as similar case retrieval, similar case recommendation, judgment prediction, etc., and improve the quality and efficiency of administrative punishment judgments. [0003] The traditional information extraction work is manually entered or extracted according to the manual summary of the extraction rules. The rules formulated cannot be transplan...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/335G06F16/35G06F16/951G06F40/205G06N3/04G06N3/08G06N5/04
CPCG06F16/335G06F16/355G06F16/951G06F40/205G06N5/04G06N3/04G06N3/08G06N3/084Y02A90/10
Inventor 李玉军赵思文贲晛烨胡伟凤
Owner SHANDONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products