Dispute focus discovery method and device based on dispute focus entity, and terminal

A technology for discovering methods and entities, applied in the field of natural language technology processing

Active Publication Date: 2020-10-23
CHONGQING UNIV OF POSTS & TELECOMM
View PDF31 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] However, simply dividing the identification of the focus of disputes into two tasks of judicial i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dispute focus discovery method and device based on dispute focus entity, and terminal
  • Dispute focus discovery method and device based on dispute focus entity, and terminal
  • Dispute focus discovery method and device based on dispute focus entity, and terminal

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0045] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0046] Such as figure 1 As shown, a dispute focus discovery method based on a dispute focus entity includes but not limited to the following steps:

[0047] Obtain document data, preprocess the document data, and obtain the entity set. The document data includes title and document content.

[0048] First log on to the judicial website (such as the Judgment Document Network), use crawler technology to crawl the web page data, extract the ti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of natural language technology processing, in particular to a dispute focus discovery method and device based on dispute focus entities and a terminal, and the method comprises the steps of obtaining document data, and carrying out the preprocessing of the document data to obtain an entity set; deleting redundant entities from the entity set to obtain a candidateentity set; splicing each candidate entity in the candidate entity set with the title and the document content to serve as an input feature; inputting the input features into a BERT model for training, and outputting a dispute focus entity after the training is completed; and performing dispute focus judgment according to the dispute focus entity. According to the invention, the candidate entity+ '-' + title + '-' + document content is used as an input feature, so that the attention of the candidate entity is higher; the binary classification task simplifies task steps of conventional entityidentification, each entity and a document can form a sample, the number of training samples is increased, meanwhile, the precision of dispute focus entities is greatly improved, and the effect is better.

Description

technical field [0001] The present invention relates to the field of natural language technology processing, in particular to a dispute focus discovery method, device and terminal based on dispute focus entities. Background technique [0002] With the advancement of the Internet and the development of judicial procedures, judicial information has exploded. How to quickly and accurately mine key information from massive judicial texts has become one of the key issues in the judicial field. In the judicial document data, the dispute focus entities specific to the judicial field are different from those in the general field, and the extraction effect of general entity recognition technology is not ideal. [0003] Currently widely used in Internet products is the Chinese named entity recognition technology (Named Entity Recognition, referred to as NER), which is mainly to identify entities with specific meanings in documents, such as names of people, places, institutions, and p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/295G06F40/30G06F16/33G06F16/35G06K9/62G06N3/04G06N3/08
CPCG06F40/295G06F40/30G06F16/3344G06F16/35G06N3/08G06N3/047G06N3/045G06F18/2415G06F18/241
Inventor 王国胤王晓浪林智敏胡峰邓蔚李子扬黄媛黄子恒
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products