Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An Information Retrieval Method Based on Readability Index

An information retrieval and readability technology, applied in the field of information retrieval, can solve problems such as difficult support, unsupervised online prediction of readability, and user readability without displaying retrieval results.

Inactive Publication Date: 2019-03-22
TIANJIN UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Recently, in order to improve the accuracy of readability calculation methods, some readability methods try to use the idea of ​​machine learning to transform the readability calculation problem into classification and prediction problems, such as using Support Vector Machine (SVM) [5] , Regression [6] , Interpolation Prediction [7] etc. However, these methods are difficult to support unsupervised online prediction of readability
[0006] As of now, search engines do not display the ability to indicate the readability of search results relative to the user

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Information Retrieval Method Based on Readability Index
  • An Information Retrieval Method Based on Readability Index
  • An Information Retrieval Method Based on Readability Index

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments, and the described specific embodiments are only for explaining the present invention, and are not intended to limit the present invention.

[0036] A kind of information retrieval method based on readability index that the present invention proposes, comprises the following steps:

[0037] Step 1. When a user uses a search engine to search for a desired keyword, the search engine retrieves documents that meet the search criteria from the index;

[0038] Step 2. During the search process, the search engine sorts the documents that meet the search conditions according to their relevance to the query keyword, and at the same time calculates the text readability score, sorts the documents that meet the search conditions, the relevance and The readability score is organized into pages and returned to the user; curre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a readability indicator based information retrieval method. The method comprises: in a searching process by using a search engine, sorting documents, which meet a search condition, according to a relevance between the documents and a query keyword; and organizing the documents that meet the search condition, a relevance sorting and a readability score into a page and returning the page to a user, wherein a text readability score equals to M*(N*average Chinese stroke number+(1-N)*difficult Chinese word frequency)+(1-M)*(P*average English character number+(1-P)*difficult English word frequency), M adjusts a weight proportion of Chinese and English readability, N adjusts weight proportions between an average Chinese stroke number indicator and a difficult Chinese word frequency indicator, and P adjusts a weight proportion between an average English character number indicator and a difficult English word frequency indicator. According to the method disclosed by the present invention, the readability score of the document is returned after retrieval, so that the user can conveniently and rapidly extract a relatively readable portion from the documents with a relatively high relevance, so that retrieval efficiency is improved.

Description

technical field [0001] The invention relates to an information retrieval method, in particular to an information retrieval method based on a readability index. Background technique [0002] Information retrieval refers to the activity of obtaining information resources related to information needs from a collection of information resources. In modern society, information retrieval has become an important way for people to discover and acquire knowledge and information. For traditional information retrieval, after the user submits a series of queries to the retrieval system, the retrieval system returns a list of results for the user to select and read according to the "correlation" between the document and the query and the "importance" of the hyperlink structure. The specific process Such as figure 1 As shown in the figure, the interactive process of traditional information retrieval is shown in the figure. When a user submits a query to a search engine, the search engine...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F16/34
CPCG06F16/3334G06F16/345
Inventor 张程宋大为张鹏王博张文雅
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products