Information generation program, device, method, and information retrieval program, device and method

An information generation and information retrieval technology, applied in the field of information retrieval. It addresses the problem that kana, katakana, and English characters appear with high frequency, which makes narrowing down the target items inefficient. Its effects are an optimized index information size, reduced retrieval noise, and higher retrieval speed.

Active Publication Date: 2013-02-06
FUJITSU LTD

AI Technical Summary

Problems solved by technology

[0007] However, the above-mentioned conventional technology has a problem: kana, katakana, and English characters appear frequently in each item (record), so narrowing down the target items with single-character bitmaps is inefficient.

Examples

Embodiment Construction

[0055] Embodiments of the information generation program, information retrieval program, information generation device, information retrieval device, information generation method, and information retrieval method of the present invention will be described in detail below with reference to the drawings.

[0056] [Information generation program / device / method]

[0057] First, an information generating program, an information generating device, and an information generating method will be described.

[0058]

[0059] FIG. 1 is an explanatory diagram showing an example (Part 1) of information generation by the information generation device. In FIG. 1, the object file group F is a collection of object files. Each object file is electronic data in which character strings are described. An object file is, for example, electronic data such as a dictionary or thesaurus, an electronic book, or a web page, and is described in text, HTML (HyperText Markup Language), or XML ...

Abstract

Four phases are executed: (A) counting over a target file group (F), (B) sorting in descending order of appearance frequency, (C) extracting down to the rank that has the intended appearance ratio, and (D) creating a map. (A1) First, an information generation device reads the target file group (F) and counts the appearance frequencies of basic words. (B1) When the counting of basic words in the target file group (F) is complete, the information generation device sorts the basic word appearance frequency table (101) in descending order of appearance frequency. That is, the basic words are ordered from the highest appearance frequency and ranked starting from the most frequent basic word. (C1) Next, the information generation device refers to the basic word appearance frequency table (101) sorted in (B1) and extracts basic words down to the rank that has the intended appearance ratio (Pw). (D1) Lastly, the information generation device generates a specific basic word appearance map (M1) for the extracted specific basic word group.
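The four phases above can be sketched in Python as follows. This is only an illustrative sketch, not the patented implementation. It rests on several assumptions that are not in the source: whitespace splitting stands in for basic word detection, the intended appearance ratio (Pw) is read as a cumulative share of all counted occurrences, the map is stored as one integer bitmask per word (bit i set when file i contains the word), and the function name `build_specific_word_map` is hypothetical.

```python
from collections import Counter


def build_specific_word_map(files, target_ratio):
    """Sketch of phases (A)-(D); names and data layout are assumptions."""
    # (A) Count appearance frequencies of words over the whole file group.
    # Assumption: whitespace tokens stand in for "basic words".
    counts = Counter()
    for text in files:
        counts.update(text.split())

    # (B) Sort in descending order of frequency (ties broken alphabetically).
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))

    # (C) Extract words rank by rank until the covered share of all
    # occurrences reaches target_ratio (assumed meaning of Pw).
    total = sum(counts.values())
    selected = []
    covered = 0
    for word, freq in ranked:
        if total == 0 or covered / total >= target_ratio:
            break
        selected.append(word)
        covered += freq

    # (D) Build the appearance map: word -> bitmask over files.
    appearance_map = {word: 0 for word in selected}
    for i, text in enumerate(files):
        for word in set(text.split()):
            if word in appearance_map:
                appearance_map[word] |= 1 << i

    return ranked, selected, appearance_map


files = ["the cat sat", "the dog sat", "the bird flew"]
ranked, selected, appearance_map = build_specific_word_map(files, 0.5)
```

With these three sample files, only the two most frequent words are kept, and each keeps a bitmask recording which files contain it. A lower `target_ratio` keeps fewer words and so a smaller map.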

Description

Technical field

[0001] The present invention relates to an information generation program, an information retrieval program, an information generation device, an information retrieval device, an information generation method, and an information retrieval method for generating index information indicating the presence or absence of characters or basic words, and performing retrieval using the index information.

Background art

[0002] There is a known bitmap-type full-text search technique that generates a full-text search index, called a character component table, at high speed (for example, see Patent Documents 1 to 3 below). Because the conventional bitmap-type full-text search technique performs no morphological analysis, the index can be generated at high speed, and the bitmaps can be compressed.

[0003] A general Japanese dictionary contains records for about 240,000 items, written with about 6,000 to 8,000 characters, and about 6,000 to 8,000 bitmaps for a single ...
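As a rough illustration of the single-character bitmap idea described in the background, the following Python sketch builds one bitmask per character and narrows down candidate records by ANDing the masks of the query characters. It is a simplified assumption-based sketch, not the character component table of the cited patent documents: the function names `build_char_bitmaps` and `candidates` are hypothetical, bitmaps are plain Python integers, and no compression is applied.

```python
def build_char_bitmaps(records):
    """Map each character to a bitmask; bit i is set if record i contains it."""
    bitmaps = {}
    for i, record in enumerate(records):
        for ch in set(record):
            bitmaps[ch] = bitmaps.get(ch, 0) | (1 << i)
    return bitmaps


def candidates(bitmaps, query, n_records):
    """AND the bitmaps of all query characters to narrow down the records."""
    mask = (1 << n_records) - 1  # start with every record as a candidate
    for ch in query:
        mask &= bitmaps.get(ch, 0)
    return [i for i in range(n_records) if (mask >> i) & 1]


records = ["listen", "silent", "tinsel", "apple"]
bitmaps = build_char_bitmaps(records)

# Candidates only share the query's characters; a second pass is still
# needed to confirm which records actually contain the query string.
hits = candidates(bitmaps, "silent", len(records))
confirmed = [i for i in hits if "silent" in records[i]]
```

In this sample, three records survive the bitmap step for the query "silent" but only one actually contains it. The extra candidates are the kind of retrieval noise that arises when the same characters occur in many records.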

Application Information

IPC(8): G06F17/30
CPC: G06F17/30616, G06F16/319, G06F16/22, G06F16/2228, G06F16/2237, G06F16/313
Inventor: 片冈正弘
Owner: FUJITSU LTD