Unlock instant, AI-driven research and patent intelligence for your innovation.

Method for sorting a set of electronic documents

A technology for electronic documents and documents, applied in the fields of telecommunications and search engines, can solve the problem of not being able to guide users, and achieve the effect of simplifying implementation

Active Publication Date: 2008-09-17
FRANCE TELECOM SA
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In addition, it is impossible for this process to identify a common theme (community) or a common interest in a set of documents, and it cannot guide users to pages of interest more quickly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for sorting a set of electronic documents
  • Method for sorting a set of electronic documents
  • Method for sorting a set of electronic documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The method according to the invention is applied to a set of electronic documents, in particular a set of WEB pages, for some of which comprise one or more hypertext links to one or more other pages.

[0049] In the illustrated selected embodiment, a degree of relatedness between two documents u and v of a set of documents V is determined based on the number of hypertext links and joint reference links that exist between documents u and v.

[0050] For the determination of the number of hypertext links between two documents, "symmetric" hypertext links are considered regardless of the meaning of the hypertext links, that is, the same treatment is applied to the case where document u includes a link to document v and The case where document v includes a link to document u.

[0051] If there exists at least one other document w such that:

[0052] - there exists at least one hypertext link pointing from w to u, and

[0053] - there exists at least one hypertext link poi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention concerns a method for sorting a set of electronic documents, including the following steps: determining (S110) for each pair of documents {u, v} of the set the degree of correlation omega (u,v) between the documents u and v; determining a function X of projection between the set of documents and a sphere of the set R where d is a positive integer, the function X being such that, for at least one document u, the distance in R between two points X (u) and X (v) where v is a document for which there is a correlation between the documents u and v, is as smaller as the degree of correlation is high; performing a sorting operation (S140) on at least one part of the set of documents based on the values taken by the function X.

Description

technical field [0001] The present invention is in the field of telecommunications and in particular in the field of search engines for searching electronic documents. [0002] More precisely, the invention relates to a method of classifying a set of electronic documents. Such a set is generated, for example, by a user performing a search through a search engine on an Internet-type network, the electronic documents in this case being Web pages (short for "World Wide Web"), which are accessed locally via local storage media, or be accessed remotely via a network. Background technique [0003] Search engines utilize several techniques for rating or categorizing pages that appear from searches. Among the known techniques for exploring a set of web pages, some rely on semantics, with a page being rated as more relevant if it contains a high occurrence of the word being searched for. These techniques are sensitive to a practice known by the name "spamming," which aims to get w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F17/30873G06F16/954
Inventor 杰罗姆·高尔蒂尔
Owner FRANCE TELECOM SA