High-precision semantic search system oriented to judicial field

A semantic search, high-precision technology, applied in semantic analysis, semantic tool creation, natural language data processing, etc., can solve the problems of large matching calculation, poor user experience, long retrieval time, etc., achieve high precision, accurate search, The effect of improved search accuracy

Inactive Publication Date: 2020-01-10
ENJOYOR COMPANY LIMITED
View PDF10 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage is that model learning requires a lot of pre-data preparation work, and due to the high feature dimension and dense data, it causes a large amount of matching calculations, and the query data is usually limited to millions.
Once this order of magnitude is exceeded, the retrieval time will be longer and the user experience will be poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-precision semantic search system oriented to judicial field
  • High-precision semantic search system oriented to judicial field
  • High-precision semantic search system oriented to judicial field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described below in conjunction with specific examples, but the present invention is not limited to these specific implementations. Those skilled in the art will realize that the present invention covers all alternatives, modifications and equivalents as may be included within the scope of the claims.

[0044] Explanation of technical terms

[0045] Bert algorithm: Bert algorithm is a method of pre-training language representation. It trains a general "language understanding" model on a large amount of text corpus, and then uses this model to perform various downstream sub-tasks.

[0046] jieba word segmentation: jieba is a Python-based Chinese word segmentation tool that can be used for Chinese sentence / part-of-speech segmentation, part-of-speech tagging, unregistered word recognition, and supports user dictionaries and other functions.

[0047] word2vec: The word embedding model proposed by Google in 2013 is one of the most common...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A high-precision semantic search system oriented to the judicial field comprises a data layer, a word analysis layer, a sentence analysis layer, a sentence feature layer and an application layer, andthe data layer comprises data acquisition and structuralization and is used for collecting judicial data and structuralized data; the word analysis layer comprises fine-grained word segmentation and new word discovery, and is used for realizing correct segmentation of the text into words; the sentence analysis layer is used for performing part-of-speech analysis based on the segmented words, removing interference words according to a judicial scene, further extracting keywords of sentences and establishing a key vocabulary; the sentence feature layer is used for extracting sentence features; and the application layer is used for defining correlation based on the sentence features to realize text search.

Description

technical field [0001] The invention belongs to the field of natural language processing and relates to a high-precision semantic search system oriented to the judicial field. Background technique [0002] As of February 2019, China Judgment Documents Network has published more than 56 million judgment documents. These judgment documents provide important reference materials for many legal practitioners and the general public. At the same time, massive amounts of information provide an important source of data for the research and development of artificial intelligence serving the field of smart justice and the construction of service agency databases. In the past few years, search, Management software, case handling system, auxiliary tools, legal consultation, intelligent analysis reports and other products have been launched one after another. Among them, the search engine, as an important means of managing and retrieving data, is a key technology in the field of smart j...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/332G06F16/36G06F40/211G06F40/216G06F40/289G06F40/30G06Q50/18
CPCG06Q50/18G06F16/3329G06F16/3344G06F16/36
Inventor 丁锴王开红张云云
Owner ENJOYOR COMPANY LIMITED
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products