Inquiry-related multi-ranking-model integration algorithm

A sorting model and algorithm technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of not considering the difference between queries, not reflecting the loss of the sorting model, and inconvenient sorting result processing, etc., to achieve good performance, The effect of broad application prospects

Inactive Publication Date: 2011-05-04
NANKAI UNIV
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] However, neither the list-based learning-to-rank method nor the learning-to-rank method that directly optimizes the evaluation metric takes the inter-query differences into account during the modeling process.
Although the query-related ranking learning algorithm considers the query characteristics more than the tr

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Inquiry-related multi-ranking-model integration algorithm
  • Inquiry-related multi-ranking-model integration algorithm
  • Inquiry-related multi-ranking-model integration algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] 1. Data collection and preprocessing

[0016] Match text documents or web pages on the Internet to user queries to build query-document pairs. All query-document pairs are represented in the form of feature vectors. The present invention mainly adopts the following features: the first category is the basic feature, which mainly reflects the matching situation between the query and each domain of the document, such as co-occurrence word frequency and its variants, document flipping frequency and its variants, the product of co-occurrence term frequency and document flip frequency and its various variants. The second category is advanced features, which mainly include the scoring of the query-document pair by some classic retrieval models such as the probability model BM25 and the statistical language model LMIR. For web documents, it also includes the third type of hyperlink features, mainly including the scoring of the web page by various link analysis algorithms...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a brand new inquiry-related multi-ranking-model integration method. The brand new inquiry-related multi-ranking-model integration method comprises the following steps of: establishing sub-ranking models for each inquiry and inquiry-related document; performing vectoring representation on the sub-ranking models so as to convert a plurality of inquiry-related ranking models into characteristic data and integrate the plurality of ranking models; establishing a new loss function as an optimization target at an inquiry level and a sample level by using a ranking support vector machine as the sub-ranking model; adjusting weight among losses generated by different inquiries by using the loss function; and providing a multiple inquiry-related ranking support vector machine fusion algorithm. Compared with the traditional model, the inquiry-related ranking models can achieve better properties when the inquiry-related multi-ranking-model integration algorithm provided by the invention is applied to actual tasks. The multi-model fusion algorithm provided by the invention can be applied to ranking learning, can also be applied to multi-element classification, sequence labeling and the like, and has a wide application prospect in the fields of information retrieval, network search and the like.

Description

technical field [0001] The invention relates to the fields of information retrieval and machine learning, in particular to a plurality of sorting support vector machine model fusion algorithms related to queries. Background technique [0002] Learning to rank is a hot topic in the field of information retrieval and machine learning. Information retrieval refers to finding a subset of information related to a given query from a large number of document collections, and is an important means of processing massive text information. In the vast majority of current information retrieval systems, the retrieved information (such as documents, etc.) is returned to the user in a sorted manner. Therefore, the core problem of information retrieval model research is how to efficiently sort information . The purpose of ranking learning is to find a ranking that can accurately predict unknown data The decision function for label y [0003] Traditional learning-to-rank methods can b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06N1/00
Inventor 王扬黄亚楼谢茂强卢敏廖振
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products