Judicial text-oriented search sorting method and system
A sorting method and text technology, applied in the direction of digital data information retrieval, special data processing applications, instruments, etc., can solve the problems of Query and Doc length mismatch, the result is not very good, etc., to speed up the algorithm running speed, and the matching results are reliable , the effect of accurate sorting results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0055] refer to figure 1 , the present embodiment provides a judicial text-oriented search and sorting method, the steps are as follows:
[0056] Step 1: Data Preprocessing
[0057] (1) Data acquisition
[0058] Collect judicial text data such as judgment document data, mediation case data, and legal text data, and perform preprocessing such as deduplication.
[0059] (2) word segmentation processing
[0060] According to the collected judicial text data, construct a word segmentation dictionary in the judicial field, and use jieba word segmentation to process the word segmentation of judicial text data.
[0061] (3) Training word vectors with judicial text data
[0062] Most of the existing word vectors are trained with data such as encyclopedias and news, but the context in judicial texts is quite different from that of news encyclopedias, and it is easier to obtain a large number of unsupervised training samples in the judicial field. Therefore, using judicial text data...
Embodiment 2
[0114] refer to Figure 6 , in order to realize a judicial text-oriented search and sort method described in Embodiment 1, an embodiment of the present invention also provides a search and sort system for implementing the above-mentioned judicial text-oriented search and sort method, including:
[0115] The first obtaining module is used to obtain the judicial text data Doc, and carry out word segmentation processing to the judicial text data, and pre-train word vectors;
[0116] The second obtaining module is used to obtain the legal consultation question Query input by the user;
[0117] The correlation calculation module is used to calculate the matching score of the judicial text data Doc and the legal consulting question Query, construct the matching matrix of the judicial text data Doc and the legal consulting question Query, and intercept the relevant text according to the matching matrix , calculating the statistical information of word and word co-occurrence in the r...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com