Legal field event extraction method based on pre-training model and convolutional neural network algorithm
A convolutional neural network and event extraction technology, applied in the field of legal intelligence, can solve the problems of relying on training data and high cost of data labeling, so as to improve the effect and reduce the time cost and labor cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0070] In this example, see figure 1 , a method for extracting events in the legal field based on a pre-training model and a convolutional neural network algorithm, the method comprising the following steps:
[0071] A. Data acquisition and preprocessing:
[0072] Use web crawlers to crawl public legal text corpus, perform text preprocessing on the original legal text corpus, and sequentially perform sentence segmentation, word segmentation, and denoising to obtain usable legal text corpus data;
[0073] B. Legal event template definition:
[0074] Obtain high-frequency verbs and key nouns in the legal field, perform distance-based clustering of similar words on these words, and manually define legal event types and templates with reference to relevant legal clauses based on the clustering results;
[0075] C. Large-scale legal event data annotation based on distance supervision learning:
[0076] Use the method of rules or patterns to obtain seed legal events from semi-str...
Embodiment 2
[0080] This embodiment is basically the same as Embodiment 1, especially in that:
[0081] In this embodiment, in the step A, the specific steps for obtaining available legal text corpus data are:
[0082] A1. Use crawlers to crawl public legal document data from legal document websites;
[0083] A2. Manually classify part of the obtained legal document data according to the crimes sentenced, use the neural network model RCNN to train the crime classification model of the legal document data, classify the remaining data, and obtain the legal document data classified according to the crime;
[0084] A3. Unify the punctuation marks of legal document data into Chinese format, according to include? ! The Chinese punctuation and sentence break symbols divide the document data into sentence forms to form a sentence set;
[0085] A4. Use an open source word segmentation tool to segment each sentence in the sentence set to obtain the word segmentation result;
[0086] A5. Build a ...
Embodiment 3
[0114] This embodiment is basically the same as the above-mentioned embodiment, and the special features are:
[0115] In this embodiment, a method for extracting events in the legal field based on a pre-trained model and a convolutional neural network algorithm, the steps
[0116] A. Data acquisition and preprocessing: use web crawlers to crawl public legal text corpus, and use public information from the legal document website; perform text preprocessing on the original legal text corpus, and sequentially perform sentence segmentation, word segmentation, and denoising to obtain usable legal texts text corpus data;
[0117] A1. Use crawlers to crawl public legal document data from legal document websites;
[0118] A2. Manually classify part of the obtained legal document data according to the crimes punished. On this basis, use the neural network model RCNN to train the crime classification model of the legal document data, classify the remaining data, and obtain the legal d...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



