Index sharding method based on email characteristics

An index sharding technique based on email characteristics, applied in the fields of database indexing and database retrieval. It addresses problems such as overly simple, single-rule sharding, and achieves faster response and more transparent, convenient management.

Active Publication Date: 2019-05-21
彩讯科技股份有限公司

AI Technical Summary

Problems solved by technology

However, this also requires a secondary calculation (a hash), and the rules are relatively simple. For a requirement such as "an email with user ID 1, group number 1, and a creation time of January 2014 is assigned to storage A", a simple configuration file cannot handle the configuration.
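The limitation can be illustrated with a minimal Python sketch. It is not taken from the patent: the shard names, the CRC32 hash, and the fallback to the hash for non-matching mail are all assumptions made for illustration. A hash of the user ID alone cannot express the compound condition, whereas an explicit rule can.

```python
import zlib
from datetime import date

SHARDS = ["A", "B", "C"]  # hypothetical storage names


def hash_shard(user_id):
    """Baseline: pick a shard from a hash of the user ID alone."""
    return SHARDS[zlib.crc32(str(user_id).encode()) % len(SHARDS)]


def rule_shard(user_id, group_id, created):
    """Compound rule from the text: user 1, group 1, January 2014 -> storage A."""
    if user_id == 1 and group_id == 1 and (created.year, created.month) == (2014, 1):
        return "A"
    # Assumption: anything the rule does not cover falls back to the hash.
    return hash_shard(user_id)


print(rule_shard(1, 1, date(2014, 1, 15)))
```

The hard-coded condition shows why such requirements quickly outgrow a flat configuration file: every new combination of attributes needs its own branch.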

Embodiment Construction

[0023] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0024] The mailbox full-text retrieval system uses inverted index technology, which is standard in the industry, to provide fast keyword search and function-specific search of emails. The system has a modular design that covers content analysis, Chinese word segmentation, index storage optimization, distributed data storage, data backup, and other functions, giving users a fast and efficient search experience.
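As a rough illustration of the inverted index idea, the following Python sketch maps each term to the emails that contain it. It is a toy under stated assumptions, not the system's implementation: whitespace tokenization stands in for the Chinese word segmentation module, and the sample emails and function names are invented.

```python
from collections import defaultdict


def tokenize(text):
    # Whitespace tokenization stands in for the word segmentation module.
    return text.lower().split()


def build_inverted_index(emails):
    """Map each term to the sorted list of email IDs whose text contains it."""
    index = defaultdict(set)
    for mail_id, text in emails.items():
        for term in tokenize(text):
            index[term].add(mail_id)
    return {term: sorted(ids) for term, ids in index.items()}


def search(index, query):
    """Return the IDs of emails that contain every term in the query."""
    terms = tokenize(query)
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


emails = {
    1: "Quarterly report attached",
    2: "Meeting notes and report draft",
    3: "Lunch meeting tomorrow",
}
index = build_inverted_index(emails)
print(search(index, "report"))          # [1, 2]
print(search(index, "meeting report"))  # [2]
```

A lookup touches only the posting lists of the query terms rather than scanning every email, which is what makes keyword search fast.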

[0025] The system generates a searchable index file from the text of the user's emails and the content of their attachments. Through the interface provided by the 139 mailbox, the user can search sent and received emails by arbitrary keywords, covering the sender and recipient, subject, body text, attachment name, attachment content, and so on, to find the email content they care about. Moreover, the full-text retrieval system also prov...

Abstract

The invention discloses an index sharding method based on mail characteristics. The method includes:
A. sharding on the basis of policy rules and policy groups, and generating three data table structures: policies, policy groups, and machine shard information;
B. implementing different configurations as required once the policy, policy group, and shard information table structures are established;
C. starting the system service;
D. having the sharding service program expose a socket interface to external callers;
E. having the search engine back end write the indexes to the storage of each shard according to the shard information;
F. having the sharding service program serve external index sharding requests.
The method has the advantages that expansion and complex rule combination can be performed automatically, the response speed of full-text retrieval can be increased, and the management of index documents becomes more transparent and convenient.
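The three table structures and the rule lookup can be sketched in Python as follows. This is only a sketch under stated assumptions: the field names, the AND semantics within a policy group, the first-match ordering, and the host names are hypothetical and are not specified in the abstract. The socket interface of step D is omitted.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List, Optional


@dataclass
class Mail:
    user_id: int
    group_id: int
    created: date


@dataclass
class Policy:  # one row of a hypothetical "policy" table
    policy_id: int
    field: str  # mail attribute the policy tests
    matches: Callable[[object], bool]


@dataclass
class PolicyGroup:  # one row of a hypothetical "policy group" table
    group_id: int
    policy_ids: List[int]  # assumption: every listed policy must match
    shard_id: str


@dataclass
class ShardInfo:  # one row of a hypothetical "machine shard information" table
    shard_id: str
    host: str
    port: int


def resolve_shard(mail: Mail, policies: Dict[int, Policy],
                  groups: List[PolicyGroup],
                  shards: Dict[str, ShardInfo]) -> Optional[ShardInfo]:
    """Return the shard of the first policy group whose policies all match."""
    for group in groups:
        if all(policies[pid].matches(getattr(mail, policies[pid].field))
               for pid in group.policy_ids):
            return shards[group.shard_id]
    return None  # no group matched


policies = {
    1: Policy(1, "user_id", lambda v: v == 1),
    2: Policy(2, "group_id", lambda v: v == 1),
    3: Policy(3, "created", lambda d: (d.year, d.month) == (2014, 1)),
}
groups = [PolicyGroup(10, [1, 2, 3], "A"), PolicyGroup(20, [2], "B")]
shards = {
    "A": ShardInfo("A", "shard-a.example", 9001),
    "B": ShardInfo("B", "shard-b.example", 9001),
}

print(resolve_shard(Mail(1, 1, date(2014, 1, 5)), policies, groups, shards).shard_id)  # A
```

Keeping policies, groups, and shard locations in separate tables means a new rule combination or a new machine is a data change rather than a code change, which is one plausible reading of the "automatic expansion" advantage claimed above.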

Description

Technical Field

[0001] The invention relates to a method for sharding index files in full-text retrieval, and in particular to an index sharding method based on mail characteristics.

Background

[0002] In today's era of information explosion, everyone wants to obtain the information they need more conveniently and quickly. Information comes in many types. In addition to structured data such as ordinary web pages, more and more unstructured information is emerging, including reports, bills, electronic documents, website elements, pictures, faxes, scanned images, and large amounts of multimedia audio and video. This large and complex variety of data is a great inconvenience to users. As a result, full-text retrieval systems emerged and came into wide use.

[0003] In the field of mailboxes, when the mailbox capacity was relatively small in the past, the number of mails was limited, and users...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/22
CPC: G06F16/902
Inventors: 杨良志, 汪志新, 丁德平, 周广平
Owner: 彩讯科技股份有限公司