Unlock instant, AI-driven research and patent intelligence for your innovation.

A custom relevance ranking algorithm based on lucene that supports expressions

A sorting algorithm and expression technology, applied in the computer field, can solve the problems of lack of diversity in function types, lack of flexibility in inter-field operations and custom sorting, etc., and achieve the effect of flexible custom correlation sorting algorithm

Active Publication Date: 2020-07-03
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT +1
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the problems of the lack of diversity in the types of functions supported by inter-field operations in the existing traditional big data system, and the lack of flexibility in inter-field operations and custom sorting, the present invention provides a self-defined correlation sorting based on Lucene-supported expressions algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A custom relevance ranking algorithm based on lucene that supports expressions
  • A custom relevance ranking algorithm based on lucene that supports expressions
  • A custom relevance ranking algorithm based on lucene that supports expressions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The specific implementation method of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0033] A self-defined correlation ranking algorithm based on Lucene supporting expressions in the present invention includes four parts: analyzing expressions, calculating expressions, sorting correlations and integrating results; correspondingly adopting four modules: expression parsing module, expression Calculation module, correlation ranking module and result integration module;

[0034] like figure 1 As shown, first, after the user enters a custom expression, the expression parsing module checks the validity of the expression entered by the user and converts it into a form that the system can calculate;

[0035] Specifically: the expression entered by the user is parsed by the management node, and after the parsing is completed, the management node sends the expression and the parameters (field names) in the expression to e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a user-defined relevance sorting algorithm based on Lucene and supporting an expression and belongs to the technical field of computers. According to the algorithm, an expression analysis module is used for carrying out validity check on the expression input by a user, and the expression is converted into the form capable of being calculated by a system; an expression calculation module extracts corresponding fields from a Lucene index for calculation according to parameters in the expression; a relevance sorting module sorts calculation results of the expression; finally a result integration module is used for integrating the calculation results returned by data nodes, and the sorting result of the final user-defined expression is returned to the user. Expression calculation between multiple fields is supported, sorting is carried out, the algorithm is superior to a pure document marking and sorting mechanism, the algorithm supports more function calculation, and the algorithm is suitable for a distributed big data platform.

Description

technical field [0001] The invention belongs to the field of computer technology, in particular to a self-defined correlation ranking algorithm based on Lucene-supported expressions. Background technique [0002] At present, the key technology to obtain useful information from massive information is information retrieval. The core problem of information retrieval is to predict the relevance of documents and sort documents according to the relevance. Generally speaking, the documents at the top are considered to be the most relevant; therefore, the calculation of relevance and ranking algorithms become the core of information retrieval. [0003] The sorting techniques of typical retrieval systems mainly include word frequency statistics and word position weighted sorting algorithms, Direct Hit algorithm based on user feedback, PageRank hyperlink analysis sorting algorithm and Hits sorting algorithm. These typical relevance ranking algorithms are mainly based on full-text wor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/338G06F16/33
CPCG06F16/3344G06F16/338
Inventor 苏沐冉吴震毛洪亮唐积强王秀文马秀娟徐小磊张露晨李焱余李传海李斌斌孟宪文谢铭
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT