Intelligent web page classifier based on user behaviors
A web page classification and classifier technology, applied in the field of intelligent learning, that addresses two problems of existing approaches: they are not suitable for scientific literature, and they do not consider user search behavior interaction.
Examples
Embodiment Construction
[0009] In the vector space model, a text generally refers to any machine-readable record and is denoted by D (Document). A feature item T (Term) is a basic language unit that appears in the document D and can represent its content; it is mainly a word or a phrase. A text can be expressed as D(T_1, T_2, ..., T_n), where T_k is a feature item and k ∈ {1, 2, ..., n}. For a text containing n feature items, each feature item is usually given a weight to indicate its importance, that is, D = D(T_1, W_1; T_2, W_2; ...; T_n, W_n), abbreviated as D = D(W_1, W_2, ..., W_n). This is the vector representation of the text, where W_k is the weight of T_k, k ∈ {1, 2, ..., n}. In the vector space model, the content correlation Sim(D_i, D_j) between any two texts D_i and D_j is commonly represented by the cosine of the angle between their vectors, and the formula is:
[0010] Sim ( Di , Dj ...
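The cosine measure described in [0009] can be illustrated with a minimal Python sketch. It assumes the standard cosine similarity (dot product divided by the product of the vector lengths) and is not taken from the patent's own formula. The function name `cosine_similarity`, the sample weight vectors, and the handling of all-zero vectors are hypothetical choices made for illustration only.

```python
import math


def cosine_similarity(w_i, w_j):
    """Cosine of the angle between two equal-length weight vectors."""
    if len(w_i) != len(w_j):
        raise ValueError("weight vectors must have the same length")
    # Dot product of the two weight vectors.
    dot = sum(a * b for a, b in zip(w_i, w_j))
    # Euclidean length of each vector.
    norm_i = math.sqrt(sum(a * a for a in w_i))
    norm_j = math.sqrt(sum(b * b for b in w_j))
    if norm_i == 0.0 or norm_j == 0.0:
        # Assumption: a text with no weighted feature items is unrelated to any other.
        return 0.0
    return dot / (norm_i * norm_j)


# Hypothetical weights W_1..W_3 over the same three feature items T_1..T_3.
d_i = [1.0, 2.0, 0.0]
d_j = [2.0, 4.0, 0.0]  # same direction as d_i, so the cosine is close to 1
d_k = [0.0, 0.0, 3.0]  # shares no feature items with d_i, so the cosine is 0

sim_ij = cosine_similarity(d_i, d_j)
sim_ik = cosine_similarity(d_i, d_k)
```

Because the measure depends only on the angle between the vectors, two texts whose weights differ by a constant factor (such as `d_i` and `d_j`) are treated as having the same content direction, which makes the measure insensitive to document length.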
Application Information
- IPC
- G06F17/30
- Inventors
- 蔡阳波; 陈勇
