Method and device for recommending series documents

A series-document recommendation technology, applied in the field of network communication, that addresses user inconvenience and a degraded reading experience, so as to meet users' reading needs and improve the reading experience.

Inactive Publication Date: 2011-02-16
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

Users must spend time searching through search engines or classified lists to find other documents in the same series, which is inconvenient and degrades the reading experience.

Examples


Embodiment 1

[0064] In step 101 above, the document titles of uploaded documents may be obtained by crawling more than one document title from the document metadata (meta) database that stores the uploaded documents.

[0065] When crawling document titles from the document metadata database, crawling strategies including but not limited to the following can be adopted to increase the probability that the crawled titles belong to a document series:

[0066] 1) Crawl the document titles of documents uploaded by the same user.

[0067] This may further include: crawling the document titles of documents uploaded by the same user within one time interval; or crawling documents uploaded by the same user within two or more regularly spaced time intervals.

[0068] Users usually upload the documents of one series within a single time interval. Therefore, crawling documents uploaded by the same user within a time interval has a high probability of yielding a complete document series. In addition, for serialized documents...
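The same-user, time-interval crawling strategy described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the `DocMeta` record, its field names, and the `titles_by_uploader` helper are hypothetical, and the metadata database is assumed to be available as an in-memory list of records.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass
class DocMeta:
    """Hypothetical metadata record; the field names are assumptions."""
    uploader: str
    title: str
    uploaded_at: datetime


def titles_by_uploader(records, windows):
    """Group titles by uploader, keeping uploads inside any [start, end) window.

    Passing one window models "within a time interval"; passing several
    regularly spaced windows models the serialized-upload case.
    """
    grouped = defaultdict(list)
    for rec in records:
        if any(start <= rec.uploaded_at < end for start, end in windows):
            grouped[rec.uploader].append(rec.title)
    return dict(grouped)
```

Each resulting list of titles is a candidate set for the later normalization and pattern matching steps; grouping by uploader first keeps unrelated titles from different users out of the same candidate set.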

Embodiment 2

[0076] The character normalization of the document title can proceed as shown in Figure 2, and specifically includes the following steps:

[0077] Step 201: Remove characters in the document title that are irrelevant to pattern matching.

[0078] Characters irrelevant to pattern matching can be set in advance. For example, symbols other than text symbols such as Chinese characters, English letters, and digits, as well as delimiting symbols such as book-title marks and brackets, can be set as irrelevant to pattern matching.

[0079] In this way, symbols that may interfere with pattern matching, such as redundant spaces, dots, and meaningless symbols, can be removed from the document title. Symbols that are meaningful to the content of the document title can be retained. For example, "3-4" may be used to represent a serial number, where the dash is meaningful to the serial number, here you can be res...
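Step 201 can be illustrated with a short sketch. The character classes below are assumptions chosen for illustration (CJK ideographs, ASCII letters, digits, and a hyphen only when it sits between two digits); the source does not specify the exact set of retained characters, and `normalize_title` is a hypothetical name.

```python
import re

# Assumption: keep CJK ideographs, ASCII letters, digits, and hyphens for now.
_IRRELEVANT = re.compile(r"[^0-9A-Za-z\u4e00-\u9fff\-]")
# Assumption: a hyphen is meaningful only between two digits, as in "3-4".
_STRAY_HYPHEN = re.compile(r"(?<![0-9])-|-(?![0-9])")


def normalize_title(title):
    """Strip characters irrelevant to pattern matching from a document title."""
    title = _IRRELEVANT.sub("", title)      # drop spaces, dots, brackets, etc.
    title = _STRAY_HYPHEN.sub("", title)    # drop hyphens not joining digits
    return title
```

For example, `normalize_title("《Python Tutorial》 (3-4).")` keeps the serial-number dash while discarding the book-title marks, brackets, spaces, and the trailing dot.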

Embodiment 3

[0088] Figure 3 is a flowchart of the pattern matching process provided by the present invention. In the present invention, pattern matching may be carried out by regular expression matching. As shown in Figure 3, it mainly includes the following steps:

[0089] Step 301: Determine the pattern of the serial number identifier of each document title after character normalization.

[0090] Various patterns of document titles may be set in advance. The document titles after character normalization are then matched against the preset patterns to determine the matching pattern, and the ID of the determined pattern is recorded.

[0091] For example, various patterns of document titles may be pre-configured, and these patterns are set according to the serial number identifier after normalization, as shown in Table 1. It should be noted that Table 1 is only an example, and the present invention does not ...
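Step 301 can be sketched with regular expressions as below. Table 1 is not reproduced in this text, so the three patterns and their IDs are hypothetical stand-ins, and `match_pattern` is an illustrative name rather than anything defined by the source.

```python
import re

# Hypothetical pattern table: (pattern ID, compiled regular expression).
# Order matters: more specific patterns are tried first.
PATTERNS = [
    (1, re.compile(r"^(?P<prefix>.*?)(?P<num>\d+)-\d+$")),         # e.g. "Notes3-4"
    (2, re.compile(r"^(?P<prefix>.*?)第(?P<num>\d+)[章节讲集]$")),  # e.g. "第2章"
    (3, re.compile(r"^(?P<prefix>.*?)(?P<num>\d+)$")),             # trailing number
]


def match_pattern(title):
    """Return (pattern_id, common_string, serial_number), or None if no match."""
    for pattern_id, regex in PATTERNS:
        m = regex.match(title)
        if m:
            return pattern_id, m.group("prefix"), int(m.group("num"))
    return None
```

Recording the pattern ID alongside the text before the serial number gives the two values the abstract uses to decide whether titles belong to the same series.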

Abstract

The invention provides a method and a device for recommending series documents, applied to a document sharing platform. The method comprises the following steps: acquiring the document titles of uploaded documents and performing character normalization on the acquired titles; performing pattern matching on the normalized titles, and classifying documents whose titles share the same common character string and the same serial-number pattern into the same document series; and recommending to a user the documents that belong to the same document series as the document the user is currently reading. The method and the device meet the user's need to read other documents in the same series, spare the user from searching for those documents through a search engine or a classification list, improve the user's reading experience, and also meet the user's potential reading needs.
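The classification and recommendation steps of the abstract can be sketched as follows. This assumes each title has already been reduced to a tuple of document ID, common character string, pattern ID, and serial number; the function names, the tuple layout, and the choice to drop single-document groups are illustrative assumptions, not details taken from the source.

```python
from collections import defaultdict


def build_series(parsed):
    """Group documents that share a common string and a pattern ID.

    parsed: iterable of (doc_id, common_string, pattern_id, serial_number).
    Returns {(common_string, pattern_id): [doc_id, ...]} ordered by serial
    number. Assumption: a group with a single document is not a series.
    """
    groups = defaultdict(list)
    for doc_id, common, pattern_id, serial in parsed:
        groups[(common, pattern_id)].append((serial, doc_id))
    return {
        key: [doc_id for _, doc_id in sorted(members)]
        for key, members in groups.items()
        if len(members) > 1
    }


def recommend(series, current_doc_id):
    """Return the other documents in the series of the current document."""
    for doc_ids in series.values():
        if current_doc_id in doc_ids:
            return [d for d in doc_ids if d != current_doc_id]
    return []
```

Sorting by serial number means the recommendations come back in reading order, which is one reasonable way to present the rest of a series to the reader.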

Description

【Technical field】

[0001] The invention relates to the technical field of network communication, and in particular to a method and device for recommending series documents.

【Background art】

[0002] With the growing adoption of network technology, the volume of network information is increasing rapidly. Document sharing platforms make it convenient for users to upload and read shared documents, and provide search engines and classification indexes over massive collections of shared documents to help users find the documents they need.

[0003] When a user reads a document, the document sharing platform can recommend documents related to the currently read document through the established document classification. In the prior art, when recommending related documents, the first few documents with the highest correlation with the content of the currently read document are usually recommended, or the documents belonging to the same uploading user as the curren...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 杨帆, 高超
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD