Category prediction method and device based on theme information

A technology of subject information and prediction methods, applied in text database clustering/classification, special data processing applications, instruments, etc., can solve the problems of unpredictable low-frequency crimes and lack of semantics, so as to solve the problems of difficult prediction of crimes and avoid semantic loss Effect

Inactive Publication Date: 2019-08-23
XI AN JIAOTONG UNIV
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to overcome the shortcomings of the above-mentioned prior art, the purpose of the present invention is to provide a category prediction method and device based on topic information, which is used to describe the semantic relevance between each semantic unit in the description of the case and the whole case, and to identify important words and sentences. Solve the problems of lack of semantics and the inability to predict low-frequency crimes in the current text classification method, and improve the degree of judicial intelligence

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Category prediction method and device based on theme information
  • Category prediction method and device based on theme information
  • Category prediction method and device based on theme information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0041] Topic information is the representation of the implicit topic semantics of the text. A topic is a summary of the semantics of a text. One or more concept words are used to summarize the content expressed in the text. The topic information is the probability distribution of the text semantics on a given set of concept words, reflecting the overall semantics of the text. Topic information can effectively alleviate the semantic loss problem in the process of text semantic feature extraction, and is suitable for the acquisition of key semantics in the crime prediction process, improving the prediction accuracy of low-frequency crimes.

[0042] Therefore, the present invention proposes a crime prediction and classification method and device based on subject information. On the basis of hierarchical neural networks, the relationship between eac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a category prediction method and device based on theme information. The method is applicable to criminal name prediction of legal documents and includes, on the basis of the hierarchical neural network, describing the overall association between each vocabulary in each sentence and the case text according to the theme information of the case description text, determining theimportance of each sentence through the theme information, performing weighted summation to obtain semantic vector representation of the case description text, and inputting the semantic vector representation into a classifier to predict a criminal name corresponding to the case. According to the method, key words and sentences in the case description text are mined by using theme information, and more effective semantic vector representation of the case description text can be obtained, so that a better effect is achieved on low-frequency criminal name prediction.

Description

technical field [0001] The invention belongs to the technical field of text classification, in particular to a category prediction method and device based on subject information. Background technique [0002] In recent years, the informatization construction of courts has been continuously improved. As of December 2018, China Judgment Documents Network has collected and published more than 59 million judgment documents. The use of artificial intelligence technology to analyze, summarize and utilize these judicial big data has become a hot research topic topic. [0003] Charge prediction is to predict reasonable charges based on the description of the case in the judgment document, which can be used for sentencing and conviction before trial, and can also be used for evaluation of trial fairness after trial, which is conducive to realizing "same case with same sentence" and ensuring fairness and justice. [0004] Crime prediction is essentially a text category prediction pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F16/35G06F16/33G06N3/04
CPCG06F16/355G06F16/3344G06F40/289G06F40/30G06N3/045
Inventor 王平辉韩婷胡小雨陶敬许诺张珊
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products