Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for full-text retrieval document database

A database and full-text index technology, applied in the field of retrieval, can solve the problems of inaccurate query results, increased time required for query and retrieval, and inability to meet user query requirements, and achieve the effect of improving query efficiency.

Inactive Publication Date: 2011-11-30
ZUNYI BRANCH OF CHINA MOBILE GRP GUIZHOU COMPANY
View PDF4 Cites 47 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the prior art, the documents in the document database are usually queried through the full-text search provided by the document database itself. In this way, when the number of documents is large, the full-text search will often obtain a large result set, and the user is still faced with a massive The data results cannot meet the user's query requirements
Specifically, first of all, the efficiency of full-text search is low. For example, in the application of workflow automation, as the number of official documents increases, the capacity of the document database increases day by day. Due to the increase in database capacity, the ability of the document database to process data is greatly reduced. Especially in terms of data query and retrieval, the time required for query retrieval is greatly increased, and the query efficiency is significantly reduced
Secondly, the query results are inaccurate. Because the search engine embedded in the document database has poor support for full-text retrieval, for example, documents may contain attachments in different formats. For example, a document may contain WORD attachments, PDF attachments, etc., so , when performing full-text retrieval, the document database is required to provide parsers for attachments in different formats in order to retrieve and read the content in the attachments, but the search engine embedded in the document database does not have a parser, so there are differences in documents format, it may cause users to be unable to find the desired document or return a completely irrelevant collection of documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for full-text retrieval document database
  • Method and device for full-text retrieval document database
  • Method and device for full-text retrieval document database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0036] In the prior art, the document database stores the content of specific business data in the form of documents, and it matches the content of the business data with the search keywords input by the user, which makes the query efficiency low. Therefore, in the embodiment of the present invention, consider Introduce a highly structured relational database, and implement full-text indexing of different types of document attachments through document conversion technology to provide support for parameter integrity and distributed transactions. Documents in the document database (including document attachments after document conversion) A distributed index directory is established, so that the relational database matches the distributed in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and device for full-text retrieval of a document database, comprising: querying attachments in documents and performing document conversion on the attachments obtained from the query according to a preset policy; library; establish a full-text index catalog for the generated document data sub-library, and store it in a preset relational database; receive the search keywords input by the user, match the full-text index catalog in the relational database, and obtain the matching full-text index catalog corresponding A document in a document database. By applying the invention, the query efficiency of the full-text search can be improved.

Description

technical field [0001] The invention relates to retrieval technology, in particular to a method and device for full-text retrieval of document databases. Background technique [0002] The current development direction of operators in the field of informatization support is to focus on enterprise users and strengthen the collaboration of organizations, processes, and personnel. Among them, the office automation system (OA, Office Automation) and the knowledge management system are indispensable core systems for enterprise informatization. The knowledge documents and official documents involved are generally managed by a document database and a search engine is provided. Users can input key Words are searched to obtain the required document information. [0003] Document databases belong to the category of databases, can share the same data, have physical independence and logical independence of data, separate data and programs, allow the creation of many different types of u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 徐锐陈旭毅吴青发
Owner ZUNYI BRANCH OF CHINA MOBILE GRP GUIZHOU COMPANY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products