Method and apparatus for document filtering capable of efficiently extracting document matching to searcher's intention using learning data

A document filtering device and filtering method, applied in fields such as electrical digital data processing, special data processing applications, and biological models, that addresses problems such as the inability to distinguish the data a searcher needs from unnecessary data.

Inactive Publication Date: 2005-06-15
RICOH KK

AI Technical Summary

Problems solved by technology

[0007] However, the retrieval results obtained by the above-mentioned conventional techniques may contain document data that the searcher does not need, and these techniques have the disadvantage that they cannot clearly distinguish, among unknown documents, the data the searcher needs from the data the searcher does not need.

Embodiment Construction

[0033] In describing the exemplary embodiments illustrated in the drawings, specific terminology will be used for the sake of clarity. However, the disclosure of this patent specification is not meant to be limited to the specific terms chosen, and it should be understood that each specific component includes all technical equivalents that work in a similar manner.

[0034] In the drawings, like reference numerals will designate like or corresponding parts throughout the several views.

[0035] FIG. 1 is an exemplary block diagram of a document filtering device according to an exemplary embodiment of the present invention.

[0036] The document filtering device 100 includes an information input/output unit 101, a search term extraction unit 102, a document sequence retrieval unit 103, a learning data generation unit 104, a classification parameter generation unit 105, and a classification unit 106. In addition, the document filtering device 100 is connected to a datab...

Abstract

A document filtering device includes an information input/output unit, a search term extraction unit, a first sequence retrieval unit, a learning data unit, a classification parameter generation unit, a second sequence retrieval unit, and a classification unit. The information input/output unit inputs phrase information and outputs retrieval result information. The search term extraction unit extracts search terms from the phrase information. The first sequence retrieval unit retrieves documents containing the search terms from the database and outputs first sequence retrieval results. The learning data unit prepares learning data from the first sequence retrieval results. The classification parameter generation unit generates classification parameters from the learning data. The second sequence retrieval unit retrieves documents containing words corresponding to the classification parameters from the database. The classification unit extracts documents that match the searcher's intention and outputs them as second-order search results.
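The data flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how learning data is labeled or how classification parameters are computed, so the relevance labels supplied by the caller, the simple word-weight scheme, the toy `DATABASE`, and all function names below are assumptions made only for illustration.

```python
from collections import Counter

# Hypothetical in-memory "database": document id -> text (toy data).
DATABASE = {
    "d1": "jaguar car engine speed review",
    "d2": "jaguar cat habitat jungle prey",
    "d3": "car engine repair manual",
    "d4": "jungle cat prey hunting habitat",
    "d5": "stock market report",
}

def tokenize(text):
    return text.lower().split()

def extract_search_terms(phrase):
    """Search term extraction: split the input phrase into terms."""
    return set(tokenize(phrase))

def retrieve(terms, database):
    """Retrieval: ids of documents that contain at least one of the terms."""
    return [doc_id for doc_id, text in database.items()
            if terms & set(tokenize(text))]

def make_learning_data(first_results, relevant_ids):
    """Learning data: label each first-stage hit (labels are an assumption,
    here supplied by the caller as the set of ids judged relevant)."""
    return [(doc_id, doc_id in relevant_ids) for doc_id in first_results]

def make_classification_parameters(learning_data, database):
    """Assumed scheme: +1 per relevant document containing a word,
    -1 per non-relevant document containing it."""
    weights = Counter()
    for doc_id, is_relevant in learning_data:
        for word in set(tokenize(database[doc_id])):
            weights[word] += 1 if is_relevant else -1
    return dict(weights)

def classify(candidates, weights, database, threshold=0):
    """Classification: keep documents whose summed word weight exceeds threshold."""
    kept = []
    for doc_id in candidates:
        score = sum(weights.get(w, 0) for w in set(tokenize(database[doc_id])))
        if score > threshold:
            kept.append(doc_id)
    return kept

def filter_documents(phrase, relevant_ids, database=DATABASE):
    terms = extract_search_terms(phrase)
    first = retrieve(terms, database)                      # first retrieval
    learning = make_learning_data(first, relevant_ids)
    weights = make_classification_parameters(learning, database)
    positive_words = {w for w, v in weights.items() if v > 0}
    second = retrieve(positive_words, database)            # second retrieval
    return classify(second, weights, database)

print(filter_documents("jaguar", {"d2"}))  # prints ['d2', 'd4']
```

In this toy run the query "jaguar" first retrieves d1 and d2. Marking only d2 as relevant produces positive weights for "cat", "habitat", "jungle", and "prey", so the second retrieval also finds d4, which never mentions the original search term.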

Description

[0001] This application claims priority from Japanese Patent Application Serial No. 2003-329206 filed in the Japan Patent Office on September 19, 2003, the entire contents of which are hereby incorporated by reference.

Technical field

[0002] The present invention relates to a method and device for document filtering, in particular to a document filtering method and device capable of effectively extracting documents matching a searcher's intention from a document database by using learning data.

Background technique

[0003] How to effectively retrieve documents matching the searcher's intention from the database has become a problem. In order to solve the above-mentioned problems, traditional document retrieval technology uses a combination of keywords and logical operators to perform retrieval to obtain retrieval results, and subsequent retrieval uses a new combination of keywords and logical operators to refine the retrieval results.

[0004] However, a searcher needs sp...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06N3/00
CPC: G06F16/337
Inventor: 后藤淳之, 伊东秀夫
Owner: RICOH KK