Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

News and case correlation analysis method based on case element guidance and deep clustering

A correlation analysis and case technology, applied in text database clustering/classification, unstructured text data retrieval, instruments, etc., can solve problems such as lack of guidance information, reduced result accuracy, clustering divergence, etc.

Active Publication Date: 2020-10-27
KUNMING UNIV OF SCI & TECH
View PDF7 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a news and case correlation analysis method based on case element guidance and deep clustering, which is used to solve the existing clustering method for news and case correlation analysis tasks, which lacks effective guidance information and easily leads to clustering divergence , reducing the accuracy of the results and other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • News and case correlation analysis method based on case element guidance and deep clustering
  • News and case correlation analysis method based on case element guidance and deep clustering
  • News and case correlation analysis method based on case element guidance and deep clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0068] Embodiment 1: as figure 1 As shown, the correlation analysis method between news and cases based on the guidance of case elements and deep clustering includes:

[0069] Step1. Collect relevant case news documents and define relevant case elements.

[0070] The relevant case news documents collected and sorted out in Step1 are obtained by writing web crawlers to crawl relevant news texts.

[0071] The case elements defined in Step 1 are defined through the analysis of the composition of the case elements in the Chinese documents of China Judgment Documents Network, and at the same time considering the characteristics of the case-related news texts.

[0072] Specifically, a total of 5970 news texts related to 6 popular cases were crawled, as shown in Table 1. Define the three elements of "the place where the case occurred, the persons involved, and the description of the case" as the case elements, as shown in Table 2.

[0073] Table 1 Case-related news text dataset

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a news and case correlation analysis method based on case element guidance and deep clustering. The news and case correlation analysis method comprises the following steps: firstly, extracting important sentence representation texts; secondly, representing a case by using case elements to initialize a clustering center and guide a clustering search process; finally, selecting a convolutional auto-encoder to obtain text representation, using a reconstruction loss and clustering loss combined training network to enable the representation of the text to be closer to a case, unifying the text representation and clustering processes into the same frame, updating auto-encoder parameters and clustering model parameters alternately, and realizing text clustering. Aiming atthe problems that a current clustering algorithm lacks effective guidance information for news and case correlation analysis tasks, thus clustering divergence is caused, and the accuracy of a resultis reduced, the guidance effect of case elements in the clustering process and text vectorization representation is brought into full play, so the accuracy of a clustering result is effectively improved.

Description

technical field [0001] The invention relates to a news and case correlation analysis method based on case element guidance and deep clustering, and belongs to the technical field of natural language processing. Background technique [0002] The analysis of public opinion in the case field is carried out based on news texts related to a certain case. The purpose of the correlation analysis between news and cases is to judge whether the news text is related to the case, which is an important link in the analysis of news public opinion in the case field. is of great significance. The correlation analysis between news and cases can be regarded as a text clustering process, that is, news texts describing the same case are clustered under the same case cluster. [0003] At present, the relevant research on text clustering can be divided into two types based on statistics and based on deep learning. However, for the news-case correlation analysis task, due to the lack of effectiv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F16/34G06F40/126G06F40/216
CPCG06F16/35G06F16/345
Inventor 余正涛李云龙高盛祥郭军军相艳线岩团
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products