Cryptogram-based safe full-text indexing and retrieval system

A full-text indexing and retrieval system technology, applied in computer security devices, instruments, calculations, etc., can solve the problem that the plaintext full-text retrieval system does not implement access control strategies, achieve safe and efficient indexing process, enhance security, and improve retrieval efficiency Effect

Inactive Publication Date: 2009-09-02
HUAZHONG UNIV OF SCI & TECH
View PDF0 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In addition, plaintext full-text retrieval systems generally do not imple

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cryptogram-based safe full-text indexing and retrieval system
  • Cryptogram-based safe full-text indexing and retrieval system
  • Cryptogram-based safe full-text indexing and retrieval system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Such as figure 1 As shown, the present invention includes a word segmentation encryption server 100 , a ciphertext full-text index server 200 , a ciphertext full-text search server 300 , a ciphertext index library 400 and a ciphertext document library 500 .

[0026] The ciphertext index library 400 includes an inverted index of ciphertext entries and a collection of internal document objects. Wherein, the ciphertext entry inverted index is composed of a two-level index file and an inverted address file, and the address pointer in the inverted address file points to an internal document object. The internal document object is composed of document secret state permission information and secret state path pointer. The inner document object has a one-to-one correspondence with the original plaintext document.

[0027] The encrypted document repository 500 is responsible for storing and managing encrypted XML documents. Encrypted XML documents are encrypted as a whole aft...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cryptogram-based safe full-text indexing and retrieval system. In the system, a cryptogram index library comprises a cryptogram entry reverse index and an internal document object set; a cryptogram document library is responsible for storing and managing an encrypted XML document; a word segmentation encryption server carries out Chinese word segmentation on a plaintext document and encrypts the plaintext document item by item; a cryptogram full-text indexing server standardizes an original plaintext document into an XML document, encrypts and stores the XML document in the cryptogram document library, creates a corresponding internal document object in the cryptogram index library by combining document metamessage, and creates a cryptogram reverse index for the XML document through the cryptogram entry; and a cryptogram full-text retrieval server retrieves the cryptogram index library to obtain the internal document object set through user authority information and the cryptogram entry, obtains a corresponding encrypted XML document result set from the cryptogram document library according to a pointer, decrypts the corresponding encrypted XML document result set, and returns the decrypted corresponding encrypted XML document result set to a user. The Chinese word segmentation method, the safe and high-efficiency indexing structure and the retrieval mechanism of the invention based on the special requirements of cryptogram full-text indexing can realize the cryptogram full-text indexing integrated with an access control strategy. The cryptogram-based safe full-text indexing and retrieval system has the advantages of a safe and high-efficiency indexing process, no decrypted docuterms in the indexing process, a high recall ratio and a high precision ratio in a cryptogram environment, and the like.

Description

technical field [0001] The invention relates to the technical fields of information retrieval and information security, in particular to a secure full-text index and retrieval system based on ciphertext. Background technique [0002] The information retrieval technology of full-text retrieval first appeared in the 1950s. In 1959, the Legal Information Retrieval System established by the Health Law Center of the University of Pittsburgh in the United States was the first full-text retrieval system in the world. In 1973, Lexis, a large-scale full-text database mainly including laws, news, business economics, and government publications, was put into use by Meade Corporation of the United States for public inquiries, marking the birth of the field of full-text retrieval. Since the 1980s, English full-text retrieval has developed rapidly and perfected, and now it has become the mainstream of foreign text-based information retrieval. [0003] In China, the full-text retrieval t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F21/24G06F21/60
Inventor 李瑞轩宋赛辜希武文坤梅卢正鼎左翠华吴炜雷小强燕昆李雨前
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products