Grading treatment method and system based on Lucene fragmentation structure

A scoring processing method and processor technology in the field of electrical digital data processing and special data processing applications. It addresses inconsistent scores across fragments and aims for improved performance, high availability, and more reasonable ordering of results.

Status: Inactive. Publication date: 2013-12-18
FOCUS TECH
Cites: 3. Cited by: 9.

AI Technical Summary

Problems solved by technology

[0019] To coordinate scoring among the fragments (shards) in a distributed search, this patent proposes a scoring processing method and system based on the Lucene fragmentation structure. Global information is processed over multiple requests so that every fragment shares it. This solves the problem of the same term receiving inconsistent scores in different fragments and makes the ranking of search results more reasonable.
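
The inconsistency described in [0019] can be illustrated with a minimal sketch. It assumes that the "global information" includes per-term document counts and uses the classic Lucene-style inverse document frequency, 1 + ln(numDocs / (docFreq + 1)). The excerpt does not state which statistics the patent actually shares, and the fragment names and counts here are hypothetical.

```python
import math


def idf(num_docs: int, doc_freq: int) -> float:
    """Classic Lucene-style IDF: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical statistics for one term on each fragment:
# (documents in the fragment, documents in the fragment containing the term)
fragment_stats = {"fragment_1": (1000, 10), "fragment_2": (1000, 400)}

# Each fragment scoring in isolation sees only its own counts,
# so the same term gets a different weight on each fragment.
local_idf = {name: idf(n, df) for name, (n, df) in fragment_stats.items()}

# With shared global information, every fragment uses the same weight.
total_docs = sum(n for n, _ in fragment_stats.values())
total_df = sum(df for _, df in fragment_stats.values())
global_idf = idf(total_docs, total_df)

print(local_idf, global_idf)
```

With local statistics, a document on the fragment where the term is rare would outrank an otherwise identical document on the fragment where the term is common. A shared weight removes that bias.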

Examples

Embodiment Construction

[0054] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0055] Figure 1 shows the system structure of an embodiment of the present invention. For convenience of description, this embodiment assumes that the index file data is divided between two fragmentation processors. The system includes:

[0056] A search processor 101, a global information buffer 102, a fragmentation processor 103, and a fragmentation processor 104. Fragmentation processor 103 consists of a fragment 1 search module 1031 and an index file fragment 1 data storage module 1032; fragmentation processor 104 consists of a fragment 2 search module 1041 and an index file fragment 2 data storage module 1042.
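
The component layout in [0056] could be modelled roughly as follows. This is only an illustrative sketch: the class names, fields, and the `doc_freq` helper are hypothetical, and only the reference numbers (101, 102, 103, 104, 1031, 1032, 1041, 1042) come from the text.

```python
from dataclasses import dataclass, field


@dataclass
class IndexFragmentStore:
    """Index file fragment data storage module (1032 or 1042)."""
    ref: int
    postings: dict = field(default_factory=dict)  # term -> list of doc ids


@dataclass
class FragmentSearchModule:
    """Fragment search module (1031 or 1041); it sees only its own store."""
    ref: int
    store: IndexFragmentStore

    def doc_freq(self, term: str) -> int:
        # Local document frequency of a term on this fragment.
        return len(self.store.postings.get(term, []))


@dataclass
class FragmentationProcessor:
    """Fragmentation processor (103 or 104): search module plus storage."""
    ref: int
    search: FragmentSearchModule


@dataclass
class GlobalInfoBuffer:
    """Global information buffer (102)."""
    ref: int = 102
    entries: dict = field(default_factory=dict)  # term -> global information


@dataclass
class SearchProcessor:
    """Search processor (101), holding the buffer and the fragments."""
    buffer: GlobalInfoBuffer
    fragments: list
    ref: int = 101


def make_fragment(proc_ref, search_ref, store_ref, postings):
    store = IndexFragmentStore(store_ref, postings)
    return FragmentationProcessor(proc_ref, FragmentSearchModule(search_ref, store))


# Two fragments, as assumed in the embodiment; the postings are made up.
system = SearchProcessor(
    buffer=GlobalInfoBuffer(),
    fragments=[
        make_fragment(103, 1031, 1032, {"lucene": [1, 2]}),
        make_fragment(104, 1041, 1042, {"lucene": [7]}),
    ],
)
```

Because each search module sees only its own storage module, any statistic it computes is local to that fragment. That is why a separate buffer for global information is needed.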

[0057] The search processor 101 is connected to the global information buffer 102, the fragmentation processor 103, and t...

Abstract

The invention discloses a scoring processing method and system based on a Lucene fragmentation structure. The method comprises the following steps: splitting the data of an index file into index file fragment data and distributing that data across the fragmentation processors to complete initialization; receiving the query information entered by a user through a search processor and performing word segmentation on it to form search terms; looking up each search term in turn in a global information buffer to determine whether information related to the current term exists; if no such information exists, sending the search term directly to the fragmentation processors for processing; if it exists, having the search processor obtain the term's global information from the global information buffer and then send the search term to the fragmentation processors for processing. Because the global information is processed over multiple requests, all fragments share it, which solves the problem of one term receiving different computed scores in different fragments and makes the ordering of search results more reasonable.
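
The query flow in the abstract might be sketched as below. Several points are assumptions rather than details from the patent: that the global information is a pair (total documents, total document frequency); that on a buffer miss the fragments first report local statistics, which the search processor aggregates and stores; that whitespace splitting stands in for real word segmentation; and that the score is a simplified sqrt(tf) × idf rather than Lucene's full formula. All class and method names are hypothetical.

```python
import math


class Fragment:
    """Hypothetical fragmentation processor: term -> {doc_id: term_freq}."""

    def __init__(self, num_docs, postings):
        self.num_docs = num_docs
        self.postings = postings

    def local_stats(self, term):
        # First request: report this fragment's local statistics for the term.
        return self.num_docs, len(self.postings.get(term, {}))

    def score(self, term, global_stats):
        # Later request: score local documents using shared global statistics.
        num_docs, doc_freq = global_stats
        idf = 1.0 + math.log(num_docs / (doc_freq + 1))
        docs = self.postings.get(term, {})
        return {doc: math.sqrt(tf) * idf for doc, tf in docs.items()}


class SearchProcessor:
    def __init__(self, fragments):
        self.fragments = fragments
        self.global_buffer = {}  # term -> (total docs, total doc freq)

    def tokenize(self, query):
        # Stand-in for real word segmentation.
        return query.lower().split()

    def search(self, query):
        hits = {}
        for term in self.tokenize(query):
            stats = self.global_buffer.get(term)
            if stats is None:
                # Buffer miss: ask the fragments, aggregate, and store (assumed).
                local = [f.local_stats(term) for f in self.fragments]
                stats = (sum(n for n, _ in local), sum(df for _, df in local))
                self.global_buffer[term] = stats
            # Every fragment scores with the same global statistics.
            for fragment in self.fragments:
                for doc, s in fragment.score(term, stats).items():
                    hits[doc] = hits.get(doc, 0.0) + s
        return sorted(hits.items(), key=lambda kv: kv[1], reverse=True)


fragments = [
    Fragment(1000, {"lucene": {"a1": 1, "a2": 4}}),
    Fragment(1000, {"lucene": {"b1": 1}}),
]
processor = SearchProcessor(fragments)
results = processor.search("Lucene")
```

In this sketch, documents `a1` and `b1` sit on different fragments but have the same term frequency, so they receive the same score once both fragments use the buffered global statistics.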

Description

Technical field

[0001] The invention belongs to the technical field of mass data processing, and in particular relates to a scoring processing method and system based on a Lucene fragmentation structure.

Background

[0002] With the rapid development of the Internet and the rapid growth of online information, people increasingly rely on the Internet to obtain information in their daily work and life, so the importance of quickly finding the information they need is self-evident. Traditional relational database retrieval can no longer support retrieval over such a large amount of Internet data, so full-text search has emerged as a query method for large data volumes. Full-text search tools represented by Lucene are increasingly used by Internet companies because of their high efficiency, high accuracy, and high scalability.

[0003] However, due to the ease of use of Lucene, it has certain disadvantages when proc...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventors: 陈建国, 梁峰, 姜平
Owner: FOCUS TECH