Ordering strategy-based information filtering system

An information filtering technology based on an ordering strategy, applied in the field of information filtering. It addresses biased model optimization results, evaluation indicators that are inconsistent with the optimization objective, and restricted performance.

Status: Inactive. Publication date: 2010-04-28
HEILONGJIANG INST OF TECH +1

AI Technical Summary

Problems solved by technology

[0025] To solve the problems of existing information filtering models, such as the inconsistency between the optimization target and the evaluation index of the fil


Examples


Specific Embodiment 1

[0072] Embodiment 1: The ordering strategy-based information filtering system described in this embodiment comprises a feature weight library, a trainer, and a filter, wherein:

[0073] The feature weight library stores the features of information units and their weights;

[0074] The trainer adjusts and updates the features and their weights in the feature weight library according to the user's feedback;

[0075] The filter extracts the features of a received information unit to obtain its feature information; it also identifies the received information unit based on the features in the feature weight library and classifies it as either normal information or abnormal information;
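The three components listed above can be illustrated with a minimal sketch. It is not the patented implementation: the whitespace tokenizer `extract_features`, the additive score, the default threshold of 0, and the fixed-step update in `Trainer.feedback` are all assumptions chosen only to show how the library, trainer, and filter interact.

```python
class FeatureWeightLibrary:
    """Stores features of information units and their weights."""

    def __init__(self):
        self.weights = {}


def extract_features(text):
    # Hypothetical feature extractor: the set of lowercase whitespace tokens.
    return set(text.lower().split())


class Trainer:
    """Adjusts the weights in the library according to user feedback."""

    def __init__(self, library, rate=0.1):
        self.library = library
        self.rate = rate

    def feedback(self, text, is_abnormal):
        # Placeholder update rule (an assumption, not the patent's algorithm):
        # push each feature's weight up for abnormal units, down for normal ones.
        delta = self.rate if is_abnormal else -self.rate
        for feature in extract_features(text):
            current = self.library.weights.get(feature, 0.0)
            self.library.weights[feature] = current + delta


class Filter:
    """Scores a received unit against the library and labels it."""

    def __init__(self, library, threshold=0.0):
        self.library = library
        self.threshold = threshold

    def score(self, text):
        # Unknown features contribute a weight of 0.
        return sum(self.library.weights.get(f, 0.0) for f in extract_features(text))

    def classify(self, text):
        return "abnormal" if self.score(text) > self.threshold else "normal"
```

In this sketch the trainer and the filter share one library instance, so feedback given to the trainer immediately changes the scores the filter produces.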

[0076] In the filter, a new information unit is identified as follows:

[0077] Establish an information filtering model framework based on the ordering strategy,

[0078] Let $x_i$ denote a positive example, $x_j$ represent...

Specific Embodiment 2

[0093] Embodiment 2: This embodiment addresses the problems remaining in the ordering strategy-based information filtering system described in Embodiment 1 and provides a further improved ordering strategy-based information filtering system, specifically:

[0094] Define $\Psi(w, x_i, x_j)$ as $\Psi'(w, x_i) - \Psi'(w, x_j)$, the difference between the scores of the two categories of information units. Let $\Psi(w, x_i, x_j) = \mathrm{sgn}[\Psi'(w, x_i) - \Psi'(w, x_j)]$, where $\mathrm{sgn}(x)$ is the sign function: $\mathrm{sgn}(x) = 1$ when $x \ge 0$, and $\mathrm{sgn}(x) = -1$ otherwise.

[0095] Then Equation 2 can be rewritten as:

[0096] Formula five: h w ′ ( x ‾ ) = arg max { Σ i Σ j sgn { y ij ′ · [ Ψ ′ ...
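The sign function and the pairwise score difference from paragraph [0094] can be sketched as follows. The linear form of the scoring function `score` (standing in for Ψ′) is an assumption for illustration; the source text shown here does not give its definition.

```python
def sgn(x):
    # Sign function as defined in [0094]: 1 when x >= 0, otherwise -1.
    return 1 if x >= 0 else -1


def score(w, x):
    # Stand-in for Psi'(w, x). Assumed to be a linear score w . x.
    return sum(wk * xk for wk, xk in zip(w, x))


def pair_sign(w, x_i, x_j):
    # Psi(w, x_i, x_j) = sgn[Psi'(w, x_i) - Psi'(w, x_j)]
    return sgn(score(w, x_i) - score(w, x_j))
```

Note that under this definition a tie between the two scores yields 1, because sgn(0) = 1.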

Specific Embodiment 3

[0107] Embodiment 3: This embodiment further describes how the parameter weight vector $w$ is updated according to Formula 7 and the gradient descent method in the ordering strategy-based information filtering system described in Embodiment 2. The specific process is:

[0108] Step Q1: initialize the weight $w$ to be updated to 0;

[0109] Step Q2: for each training sample, that is, each abnormal information-normal information pair, perform steps Q3 to Q5;

[0110] Step Q3: calculate $gap = \Psi(w, x_i, x_j)$;

[0111] Step Q4: check whether the gap is smaller than the set threshold TONE. If so, execute step Q5; otherwise, return to step Q3 and obtain the gap of the next abnormal information-normal information pair;

[0112] Step Q5: calculate $\Delta w$ (the formula is not reproduced in the source), where TRAIN_RATE denotes the learning rate of the algorithm;

[0113] Step Q6: Accumulate the weight update $\Delta w$ obtained in s...
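Steps Q1 to Q6 can be sketched as the loop below. Because the update formula of step Q5 is missing from the text, the sketch substitutes the gradient of a standard pairwise logistic loss, log(1 + exp(-gap)); this choice, the linear score, the per-pair application of Δw, and the `epochs` parameter are all assumptions rather than the patent's stated method.

```python
import math


def train_tone(pairs, dim, tone=1.0, train_rate=0.1, epochs=10):
    """pairs: list of (x_i, x_j), with x_i the abnormal and x_j the normal unit."""
    w = [0.0] * dim  # Q1: initialize the weights to 0
    for _ in range(epochs):  # hypothetical number of passes over the data
        for x_i, x_j in pairs:  # Q2: visit every training pair
            # Q3: gap between the two scores, assuming a linear score w . x
            gap = sum(wk * (a - b) for wk, a, b in zip(w, x_i, x_j))
            # Q4: only pairs whose gap is below TONE trigger an update
            if gap >= tone:
                continue
            # Q5 (assumed): step along the negative gradient of
            # log(1 + exp(-gap)), scaled by the learning rate
            coeff = train_rate / (1.0 + math.exp(gap))
            # Q6 (assumed per-pair): accumulate the update into w
            for k in range(dim):
                w[k] += coeff * (x_i[k] - x_j[k])
    return w
```

The threshold acts as a margin: pairs that are already separated by at least `tone` are skipped, so training effort concentrates on pairs that are misordered or only narrowly ordered.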


Abstract

The invention provides an ordering strategy-based information filtering system in the technical field of information filtering. It addresses three problems of conventional information filtering models: the optimization objective is inconsistent with the evaluation indicator of the filtering problem, the model optimization result is biased, and performance is restricted. The system consists of a training model, a filter, and a feature weight library. The filter identifies a new information unit by the following steps: converting the information filtering problem into an ordering problem; optimizing directly for the core evaluation indicator 1-ROCA; establishing an ordering strategy-based information filtering model, which adopts an ordering logistic regression learning algorithm and combines a TONE strategy-based parameter weight updating algorithm with resampling to obtain the weight parameters and a prediction score for the new information unit; and judging the attribute of a new mail by comparing the prediction score with a predetermined threshold. The method can be applied to various information filtering and information push systems.
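As an illustration of the evaluation indicator and the final decision step, the sketch below computes 1-ROCA (the area above the ROC curve) as the fraction of abnormal-normal pairs that are ranked in the wrong order, and labels a unit by comparing its score with a threshold. It assumes that a higher score means "more likely abnormal" and that ties count as half an error; neither convention is stated in the abstract.

```python
def one_minus_roca(abnormal_scores, normal_scores):
    """Area above the ROC curve, computed by pairwise counting."""
    misordered = 0.0
    for a in abnormal_scores:
        for n in normal_scores:
            if a < n:
                misordered += 1.0  # abnormal unit ranked below a normal one
            elif a == n:
                misordered += 0.5  # tie: counted as half an error (assumption)
    return misordered / (len(abnormal_scores) * len(normal_scores))


def judge(prediction_score, threshold):
    # Assumed direction: scores above the threshold are labeled abnormal.
    return "abnormal" if prediction_score > threshold else "normal"
```

The pairwise form makes the link to the ordering formulation explicit: every misordered pair contributes to 1-ROCA, so a model that orders pairs correctly also minimizes the indicator.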

Description

Technical field

[0001] The invention relates to the technical field of information filtering.

Background technique

[0002] The rapid expansion of information on the Internet makes it difficult for users to obtain the information they are interested in promptly and continuously. At the same time, large amounts of junk information cause considerable trouble for use and management. As an effective solution to these problems, information filtering technology can (1) actively provide users with information related to their personal interests; and (2) filter out junk information (such as information concerning national security, violence, pornography, and reactionary content).

[0003] The well-known international text retrieval evaluation conference TREC (Text REtrieval Conference), jointly sponsored by the US Defense Advanced Research Projects Agency (DARPA) and the US National Institute of Standards and Technology (NIST), has done a great deal of work on information filtering. The TREC evaluation task sh...


Application Information

IPC(8): G06F17/30
Inventors: 齐浩亮, 杨沐昀, 韩咏, 李生, 运海红, 张艳艳, 黄成哲, 雷国华
Owner: HEILONGJIANG INST OF TECH