Method and apparatus for document filtering capable of efficiently extracting document matching to searcher's intention using learning data

A technology of filtering equipment and filtering methods, which is applied in the direction of electrical digital data processing, special data processing applications, biological models, etc., and can solve problems such as inability to distinguish data from unnecessary data

Inactive Publication Date: 2005-06-15
RICOH KK
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] However, the retrieval results obtained by the above-mentioned conventional techniques may contain document data unnecessary for the searcher, and there is a disadvantage that it cannot clearly distinguish the data necessary for the searcher from the unnecessary data from unknown documents. necessary data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for document filtering capable of efficiently extracting document matching to searcher's intention using learning data
  • Method and apparatus for document filtering capable of efficiently extracting document matching to searcher's intention using learning data
  • Method and apparatus for document filtering capable of efficiently extracting document matching to searcher's intention using learning data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In describing the exemplary embodiments illustrated in the drawings, specific terminology will be used for the sake of clarity. However, the disclosure of this patent specification is not meant to be limited to the specific terms chosen, and it should be understood that each specific component includes all technical equivalents that work in a similar manner.

[0034] In the drawings, like reference numerals will designate like or corresponding parts throughout the several views.

[0035] figure 1 is an exemplary block diagram of a document filtering device according to an exemplary embodiment of the present invention.

[0036]The document filtering device 100 includes an information input / output unit 101 , a search term extraction unit 102 , a document sequence retrieval unit 103 , a learning data generation unit 104 , a classification parameter generation unit 105 , and a classification unit 106 . In addition, the document filtering device 100 is connected to a datab...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A document filtering apparatus includes an information input / output unit, a search word extraction unit, a first ranking search unit, a learning data unit, a classifying parameter generation unit, a second ranking search unit, and a classifying unit. The information input / output unit inputs phrasal information, and outputs search result information. The search word extraction unit extracts a search word from the phrasal information. The first ranking search unit searches a document having the search word from a database, and outputs a first ranking search result. The learning data unit prepares learning data from the first ranking search result. The classifying parameter generation unit generates a classifying parameter from the learning data. The second ranking search unit searches a document having a word corresponding to the classifying parameter from the database. The classifying unit extracts a document matching to a searcher's intention, and outputs the document as a second ranking search result.

Description

[0001] This application claims priority from Japanese Patent Application Serial No. 2003-329206 filed in the Japan Patent Office on September 19, 2003, the entire contents of which are hereby incorporated by reference. technical field [0002] The present invention relates to a method and device for document filtering, in particular to a document filtering method and device capable of effectively extracting documents matching a searcher's intention from a document database by using learning data. Background technique [0003] How to effectively retrieve documents matching the searcher's intention from the database has become a problem. In order to solve the above-mentioned problems, traditional document retrieval technology uses a combination of keywords and logical operators to perform retrieval to obtain retrieval results, and subsequent retrieval uses a new combination of keywords and logical operators to refine the retrieval results. [0004] However, a searcher needs sp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/00
CPCG06F16/337
Inventor 后藤淳之伊东秀夫
Owner RICOH KK
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products