Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Grading method for information retrieval document based on viewpoint searching

An information retrieval and document technology, applied in the field of information processing, can solve problems such as poor performance and inability to well meet the needs of users' viewpoint retrieval, and achieve the effect of improving performance, good application prospects, and improving viewpoint retrieval results

Active Publication Date: 2009-01-14
TSINGHUA UNIV +1
View PDF0 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Many experimental results show that this method cannot well meet the user's opinion retrieval needs.
Even in many cases, the performance of the results after this combination of correlation and subjective and objective scores is not as good as that of the results provided to users after sorting only using correlation scores

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Grading method for information retrieval document based on viewpoint searching
  • Grading method for information retrieval document based on viewpoint searching
  • Grading method for information retrieval document based on viewpoint searching

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] It is done automatically on the computer and consists of the following steps in sequence:

[0041] Step 1 Generate sentiment word list and candidate document collection

[0042] The emotional word list includes all emotional words to be processed by the system, such as "good", "bad" and "disappointing" in Chinese, and "good" and "bad" in English. Here, the words in HowNet are automatically screened according to their attributes. If the attribute definitions of a word in HowNet include at least "good|好", "desire|良", "beautiful|美", "great|伟 ", "bad|bad", "undesired|草", "fake|假", select the word and the English descriptor corresponding to the word, and add them to the list of Chinese and English emotional words respectively.

[0043] For a query (which may contain multiple query words) input by the user, the retrieval system automatically selects all documents with any query word in the user query as a set of candidate documents. Subsequent operations will be performed wi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a scoring method of information retrieval documents based on viewpoint retrieval, which belongs to the field of information processing. The method is characterized in that: an emotion word list is firstly established; all emotion words used in a retrieval system are specified in the list; a candidate result set is generated according to an inquiry input by a user; secondly, the relativity between the documents and the inquiry of the user is calculated in the system for obtaining the relativity score of each document; according to the times of the emotion words appearing in the documents together with an inquiry word within a certain distance, the subjective and objective scores of each document in the system are calculated; the relativity score and the subjective and objective scores of one document are merged on the basis of a quadratic function (namely, multiplying), therefore, a merged final score of the document is obtained; finally, the retrieval system ranks all candidate documents in the system according to the final scores of the documents and displays the documents to the user according to scores in a sequence from the largest to the smallest. The technology has the advantages that a computer can finish the technology automatically and the retrieval results with high relativity and strong subjective opinions can be returned.

Description

technical field [0001] The invention belongs to the field of information processing, and in particular relates to an information retrieval system, specifically a method for scoring documents in the information retrieval system, and finally obtains retrieval results related to user queries and with subjective opinions. Background technique [0002] An information retrieval system is a computer system that collects information (such as webpage documents on the Internet, or digital documents in a digital library, etc.) with a certain strategy, organizes and processes the information, and provides users with retrieval services. It includes computer hardware systems And two parts of the software program running on the hardware system. Its main function is to help users quickly and efficiently obtain useful information that can meet user needs. [0003] Information retrieval systems interact with users by querying servers. On the one hand, the query server provides a page for us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 张敏马少平
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products