
Index generating program and search program

An index generation and search technique applicable to text database indexing, digital data information retrieval, and unstructured text data retrieval. It addresses the difficulty of suppressing locking noise (called filter noise in the abstract) when index information is used to narrow down the targets of a character string search.

Inactive Publication Date: 2015-02-25
FUJITSU LTD
4 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

Even when sub-elements of the same higher-level element (for example, a chapter) are placed in the same block, as in a dictionary, locking noise may be difficult to suppress.



Examples


Embodiment Construction

[0040] Before the details are explained, the locking (narrowing down) of target files for a character string search using index information is described.

[0041] Figure 1A shows index information I1 based on the file group F1 to Fn to be searched. The file numbers in the top row of index information I1 correspond to the search target files F1 to Fn, respectively. In the index information, each item of character information C1 to Cm is associated with a bit sequence indicating whether that character information is present or absent in each of the files F1 to Fn.
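The relationship between character information and per-file bit sequences can be illustrated with a minimal Python sketch. It assumes that each item of character information is a single character and that a bit sequence is held as an integer bitmap; the names `build_index` and `candidate_files` are hypothetical and do not come from the patent text.

```python
from typing import Dict, List


def build_index(files: List[str]) -> Dict[str, int]:
    """Map each character to a bitmap; bit i is set when files[i] contains it."""
    index: Dict[str, int] = {}
    for file_no, text in enumerate(files):
        for ch in set(text):
            index[ch] = index.get(ch, 0) | (1 << file_no)
    return index


def candidate_files(index: Dict[str, int], query: str, n_files: int) -> List[int]:
    """AND the bitmaps of every query character to narrow down the target files."""
    bits = (1 << n_files) - 1  # start with every file as a candidate
    for ch in query:
        bits &= index.get(ch, 0)  # a character absent from the index matches no file
    return [i for i in range(n_files) if (bits >> i) & 1]


# Illustrative data only (not from the patent).
files = ["index search", "knit yarn", "search engine"]
index = build_index(files)

for query in ("search", "nice"):
    candidates = candidate_files(index, query, len(files))
    hits = [i for i in candidates if query in files[i]]
    print(query, candidates, hits)
```

For the query "nice", files 0 and 2 survive the narrowing step because each contains all four characters, yet neither contains the string itself. Candidates of this kind are the noise that the narrowing step fails to remove.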

[0042] The character information Cj included in the character information group C1 to Cm is, for example, a single character or a character string combining a plurality of characters. Alternatively, the character information Cj may be part of a binary code corresponding to the character information. The character information group C1 to Cm may be all combinations of characters assumed to be used (for exa...


Abstract

[Problem] In one aspect, to suppress filter noise when filtering the targets of a character string search over document data. [Solution] According to one embodiment, a computer switches its control process depending on whether a document file contains a document element having at least a predetermined number of sub-elements. The control process determines in which of a plurality of blocks the data in the document file is included, either for each document element in the hierarchy of the sub-elements, or for each such document element or each document element in a hierarchy higher than those document elements. According to the control process selected by the switching, the computer divides the document file into the plurality of blocks and, for each data item thereby obtained, generates index information indicating whether or not that data item includes prescribed character information.
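The switching described in the abstract can be sketched roughly as follows. This is a hedged illustration, not the patented procedure: it assumes a two-level document model (higher-level elements, each a list of sub-element texts), assumes each chosen unit becomes its own block, and assumes that the presence of an element with many sub-elements selects the sub-element granularity. The names `split_into_blocks`, `index_blocks`, and `min_subelements` are hypothetical.

```python
from typing import Dict, List

# Hypothetical model: a document is a list of higher-level elements
# (e.g. chapters), each holding the texts of its sub-elements.
Document = List[List[str]]


def split_into_blocks(doc: Document, min_subelements: int) -> List[str]:
    """Choose the block granularity from the number of sub-elements."""
    has_large_element = any(len(element) >= min_subelements for element in doc)
    if has_large_element:
        # Assumed branch: one block per sub-element.
        return [sub for element in doc for sub in element]
    # Assumed branch: one block per higher-level element.
    return [" ".join(element) for element in doc]


def index_blocks(blocks: List[str], charset: str) -> Dict[str, List[int]]:
    """For each character, record 1 or 0 per block for presence or absence."""
    return {c: [1 if c in block else 0 for block in blocks] for c in charset}


# Illustrative data only.
doc = [["ab", "cd", "ef"], ["gh"]]
fine = split_into_blocks(doc, min_subelements=3)
coarse = split_into_blocks(doc, min_subelements=4)
print(fine, index_blocks(fine, "ag"))
print(coarse, index_blocks(coarse, "ag"))
```

With the finer granularity, each block carries fewer distinct characters, so fewer blocks match all characters of a query by coincidence.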

Description

Technical field

[0001] The invention relates to retrieval technology for document data.

Background art

[0002] Various types of books, such as novels, academic books, and dictionaries, are sold as electronic books in which the information is stored electronically. When a search is performed over a plurality of document data, a known technique uses index information that indicates, for each item of character information, which of the plurality of document data contain it. For example, using pre-generated index information, control is performed so that document data containing a certain item of character information C in the search character string is treated as a target of the character string search based on that search string, while other document data is excluded from the search targets. This is because the index information shows that the above-mentioned charact...


Application Information

IPC(8): G06F17/30
CPC: G06F17/30613, G06F16/31, G06F16/185
Inventor: 片冈正弘, 村田孝宏, 大田贵文
Owner: FUJITSU LTD