Sorting algorithm for standard literature retrieval

A sorting algorithm for standard document retrieval, applied in the field of keyword retrieval, that addresses the problem of low retrieval accuracy.

Active Publication Date: 2019-07-16
江苏省质量和标准化研究院
AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a sorting algorithm for standard document retrieval, which

Embodiment Construction

[0026] As shown in Figure 1 and Figure 2, a sorting algorithm for standard document retrieval includes the following steps:

[0027] Step 1: Establish an index system comprising a database server, a retrieval server, and a client server. The database server and the client server are connected to the retrieval server over the Internet.

[0028] Step 2: Establish a full-text keyword library for the standard documents in the database server. The retrieval server scans each word in a standard document and records its frequency and the positions at which it occurs. The 50 most frequent words of the standard are taken as its full-text keywords and set as Tokens, and the retrieval server builds an index over these 50 Tokens. In this process, the present invention uses reverse sorting technology to split the standard files, reduce the space occupancy of...
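The keyword selection and indexing in Step 2 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names, the regex tokenizer, and the document IDs are all assumptions, and Chinese-language standards would need a proper word segmenter in place of the regex. The index layout (Token to document to positions) is one plausible reading of "reverse sorting technology" as an inverted index, which the source does not confirm.

```python
import re
from collections import Counter, defaultdict

TOP_N = 50  # the source keeps the 50 most frequent words per standard


def tokenize(text):
    """Yield (position, word) pairs; the regex is a stand-in for a real tokenizer."""
    for pos, match in enumerate(re.finditer(r"\w+", text.lower())):
        yield pos, match.group()


def extract_tokens(text, top_n=TOP_N):
    """Return {word: [positions]} for the top_n most frequent words of one document."""
    positions = defaultdict(list)
    for pos, word in tokenize(text):
        positions[word].append(pos)
    counts = Counter({word: len(plist) for word, plist in positions.items()})
    return {word: positions[word] for word, _ in counts.most_common(top_n)}


def build_inverted_index(documents, top_n=TOP_N):
    """Map each Token to {doc_id: [positions]} across all documents (hypothetical layout)."""
    index = defaultdict(dict)
    for doc_id, text in documents.items():
        for word, plist in extract_tokens(text, top_n).items():
            index[word][doc_id] = plist
    return dict(index)
```

Calling `build_inverted_index({"doc-1": "...", "doc-2": "..."})` returns a dictionary in which each Token lists the documents containing it and the word offsets at which it appears, so a query term can be resolved to candidate standards without rescanning the full text.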

Abstract

The invention discloses a sorting algorithm for standard literature retrieval and relates to the technical field of keyword retrieval. The sorting algorithm includes: influencing the boost score by setting the query configuration; wrapping the implementation of edismax and the map function; applying quantitative regularization to multiple fields such as the queried title, question records, and text; setting a scoring weight for the text relevance of each field, and assigning the fields different weight levels according to exact or fuzzy matching; and, after two rounds of data regularization, sorting in reverse order through the wrapped map function and returning the result, thereby improving accuracy in the standard literature retrieval process.
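The field-weighting idea in the abstract can be illustrated with a small sketch. Everything specific here is an assumption: the field names, the weight values, and the use of substring containment as a stand-in for fuzzy matching are hypothetical, and the sketch neither calls a search engine nor models the regularization steps or the wrapped edismax and map functions that the abstract mentions.

```python
# Hypothetical per-field weights: (exact-match weight, fuzzy-match weight).
FIELD_WEIGHTS = {
    "title": (10.0, 4.0),
    "text": (2.0, 1.0),
}


def match_level(query, value):
    """Return 'exact', 'fuzzy', or None; substring containment stands in for fuzzy matching."""
    q, v = query.strip().lower(), value.strip().lower()
    if not q:
        return None
    if q == v:
        return "exact"
    if q in v:
        return "fuzzy"
    return None


def score(query, doc):
    """Sum the weight of every field that matches, using the exact or fuzzy weight."""
    total = 0.0
    for field, (exact_w, fuzzy_w) in FIELD_WEIGHTS.items():
        level = match_level(query, doc.get(field, ""))
        if level == "exact":
            total += exact_w
        elif level == "fuzzy":
            total += fuzzy_w
    return total


def rank(query, docs):
    """Return the ids of matching documents, highest score first (descending order)."""
    scored = [(score(query, d), d["id"]) for d in docs]
    hits = [pair for pair in scored if pair[0] > 0]
    return [doc_id for _, doc_id in sorted(hits, key=lambda p: p[0], reverse=True)]
```

With these weights, an exact title match outranks any combination of fuzzy matches in other fields, which mirrors the abstract's point that exact and fuzzy hits receive different weight levels before the final descending sort.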

Description

Technical field

[0001] The invention relates to the technical field of keyword retrieval, in particular to a sorting algorithm for standard document retrieval.

Background technique

[0002] Standard electronic literature retrieval shares similarities with existing electronic literature retrieval but also has its own characteristics. Most existing electronic literature retrieval and ranking methods use statistical word frequency, semantics, word grouping, and similar techniques to score how well the search terms match the keywords of the target document, and rank the retrieval results by that score.

[0003] The method described in patent 201010182289.5, "Retrieval System Oriented to Source Document Meta-Keywords", has a certain generality, but applying it directly to standard document retrieval does not yield good recall or precision. Since standard electronic literature retrieval has precise and fuzzy query require...

Application Information

IPC(8): G06F16/31, G06F16/33
CPC: G06F16/319, G06F16/3344, Y02D10/00
Inventor: 金志刚章学周陈银龙严菁伍薇王玮健赵华李天侠谢莉
Owner: 江苏省质量和标准化研究院