Judgment document dispute focus extraction method based on multi-task small sample learning

An extraction method and small-sample technology, applied in neural learning methods, text database query, unstructured text data retrieval, etc., can solve problems such as quantity imbalance, imbalance, and high data requirements, and achieve good universality , solve the small sample problem and reduce the workload

Active Publication Date: 2020-09-11
SICHUAN UNIV +1
View PDF10 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is that controversial issues are assigned to many classes, and the huge difference in the number of controversial focus issues contained in each class leads to an imbalance of classes, which leads to a decrease in the performance of text clustering ; Most of the current algorithm models have high requirements for data, and a large amount of data labeling work is required
[0008] The present invention provides a multi-task-based small-sample learning method for extracting the focus of disputes in referee documents, which solves the imbalance problem caused by the difference in the number of the above-mentioned classes and the problem that the algorithm model requires a large number of labels for the data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Judgment document dispute focus extraction method based on multi-task small sample learning
  • Judgment document dispute focus extraction method based on multi-task small sample learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] Embodiment 1: Analysis of technical problems and solutions:

[0041] In order to solve the problems of low labeling efficiency and high quality labeling data, a clustering algorithm and a topic model are used. The clustering method used is GMM.

[0042] Text clustering applies cluster analysis to text, which uses machine learning and NLP to understand and classify unstructured text data; clustering algorithms are defined as an unsupervised technique that discovers populations by quantitatively comparing multiple features Whether the individuals in belong to different groups.

[0043]After the cluster phase, the vast majority of isomorphic contentious problems are properly merged into the same cluster. But due to the semantic complexity and unformatted nature of legal texts, many controversial issues remain. In this case, the controversial issues and their clusters need to be manually deleted or merged. For each cluster, rather than manually identifying the main mess...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a judgment document dispute focus extraction method based on multi-task small sample learning. According to the method, the problems that disputed problems are distributed to multiple classes, the classes are unbalanced due to the huge difference of the number of disputed focus problems contained in each class, consequently, the performance of text clustering is reduced, most of current algorithm models have high requirements for data, and a large amount of data annotation work needs to be achieved are solved. A certain amount of labeled data is obtained after clustering, cluster labels are automatically obtained through LDA, model training, model clipping and dispute focus classification matching are carried out after data enhancement processing to extract judicialdispute focuses, and help can be better provided for lawyer judge to retrieve class cases.

Description

technical field [0001] The invention relates to text classification and matching in the field of focus of disputes in the judicial field, in particular to a method for extracting focus of disputes in adjudication documents based on multi-task small-sample learning. Background technique [0002] With the continuous progress of our country's social development, the judicial reform will be further promoted. The reform starts from the demands of the people for justice, and focuses on strengthening the supervision and restriction of power. With the rapid development of information technology, the release of online judgments has played a vital role in promoting judicial transparency. As the new litigation system gradually takes shape, Chinese courts organize debates around controversial issues. [0003] Factual questioning helps to focus fact-finding in court trials, while legal questioning helps courtroom debate organization and law application. The judgment embodies the proce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F40/211G06N3/08G06N3/04
CPCG06F16/3344G06F40/211G06N3/084G06N3/045
Inventor 不公告发明人
Owner SICHUAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products