
Method and device for confirming site information template corresponding to target object

A technique for determining the site information template that corresponds to a target object, applied in the Internet field. It addresses extraction that returns too much information, performs no analysis of the page, and fails to satisfy users, and it aims to improve the efficiency, scope, effectiveness, and flexibility of information acquisition.

Active Publication Date: 2013-09-04
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0003] The existing method traverses the DOM tree of a web page and collects all of its text nodes. Information extracted this way does not reflect any real analysis of the web page, and it is too voluminous and cluttered to meet users' needs well. At the same time, because the information in current web pages is rich and varied, and the page templates used by webmasters are equally diverse, the same web page information extraction template cannot be used to extract information from different websites.


Examples


Embodiment Construction

[0029] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0030] Figure 1 shows a schematic diagram of a processing device for determining a site information template corresponding to a target object, according to one aspect of the present invention. The processing device includes a reference text acquisition means 11, a training text determination means 12, and a template determination means 13. Specifically, the reference text acquisition means 11 obtains the corresponding reference text from the reference site according to the reference site template that corresponds to the target object at the reference site; the training text determination means 12 performs a matching query based on the reference text to determine one or more site training texts that match the reference text; the template determination means 13 is based on at least one site training text in the one or more site training texts, and the tar...


Abstract

The invention provides a method and device for determining the site information template corresponding to a target object. The method comprises the following steps: a. a processing device obtains the corresponding reference text from a reference site according to the reference site template that corresponds to the target object; b. a matching query is performed based on the reference text to determine one or more site training texts that match the reference text; c. the site information template corresponding to the target object at the target site is determined according to at least one of the one or more site training texts and the site-related information of the target site corresponding to the determined site training text. Compared with the prior art, the site information template is determined according to the target object, which improves the accuracy of information acquisition, widens its range, and improves its efficiency.
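Steps a to c can be illustrated with a minimal sketch. It is not the patented implementation, and several things are assumptions made only for illustration: a template is modelled as a path of (tag, class) steps from the document root, the "matching query" is reduced to an exact string comparison over an in-memory list of target-site pages, and step c is a simple majority vote over the paths at which the reference text is found. The names `node_paths`, `apply_template`, and `infer_site_template` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def node_paths(root):
    """Yield (path, text) for every element that has text.

    A path is a tuple of (tag, class) steps from the root to the element.
    This path format is an assumption of the sketch, not the patent's.
    """
    stack = [(root, ((root.tag, root.get("class")),))]
    while stack:
        node, path = stack.pop()
        text = (node.text or "").strip()
        if text:
            yield path, text
        for child in node:
            stack.append((child, path + ((child.tag, child.get("class")),)))


def apply_template(markup, template):
    """Step a: return the text found at the template path, or None."""
    for path, text in node_paths(ET.fromstring(markup)):
        if path == template:
            return text
    return None


def infer_site_template(reference_text, target_pages):
    """Steps b and c: locate matching texts, then vote on their paths."""
    votes = Counter()
    for markup in target_pages:
        for path, text in node_paths(ET.fromstring(markup)):
            # Exact equality stands in for the matching query (assumption).
            if text == reference_text:
                votes[path] += 1
    return votes.most_common(1)[0][0] if votes else None


# Hypothetical well-formed pages; real pages would need an HTML parser.
REFERENCE_PAGE = '<html><body><h1 class="title">Example Movie</h1></body></html>'
REFERENCE_TEMPLATE = (("html", None), ("body", None), ("h1", "title"))
TARGET_PAGES = [
    '<html><body><div class="main"><span class="name">Example Movie</span>'
    '<p>Review text</p></div></body></html>',
    '<html><body><div class="main"><span class="name">Another Film</span>'
    '</div></body></html>',
]

reference_text = apply_template(REFERENCE_PAGE, REFERENCE_TEMPLATE)
site_template = infer_site_template(reference_text, TARGET_PAGES)
```

Once inferred, the same path can be applied to other pages of the target site, for example `apply_template(TARGET_PAGES[1], site_template)`, to pull out the corresponding field for a different object.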

Description

Technical field

[0001] The invention relates to the technical field of the Internet, in particular to a technology for determining a site information template corresponding to a target object.

Background

[0002] The existing method for extracting web page information mainly traverses the DOM (Document Object Model) tree of the web page and extracts all text node information in it to form the text information corresponding to the web page.

[0003] The web page information extracted by this method does not reflect any real analysis of the web page, and it is too voluminous and cluttered to meet users' needs well. At the same time, because the information in current web pages is rich and varied, and the page templates used by webmasters are equally diverse, information on different websites cannot be extracted with the same web page information extraction template.

Contents of the invention

[0004] The purpose of the present invention is ...
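For contrast, the existing approach described in [0002] can be sketched in a few lines with Python's standard `html.parser` module. The sample page and the names `TextNodeCollector` and `extract_all_text` are hypothetical. The point of the sketch is only that a blind traversal returns navigation and advertising text alongside the fields a user actually wants.

```python
from html.parser import HTMLParser


class TextNodeCollector(HTMLParser):
    """Collects every non-empty text node, ignoring page structure."""

    def __init__(self):
        super().__init__()
        self.texts = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.texts.append(text)


def extract_all_text(html):
    """Return all text nodes of the page in document order."""
    collector = TextNodeCollector()
    collector.feed(html)
    collector.close()  # flush any buffered trailing text
    return collector.texts


# Hypothetical page: only the title and summary are of interest.
SAMPLE_PAGE = """
<html><body>
  <div class="nav">Home | Login</div>
  <h1 class="title">Example Movie</h1>
  <div class="ad">Buy now!</div>
  <p class="summary">A short plot summary.</p>
</body></html>
"""

all_text = extract_all_text(SAMPLE_PAGE)
```

Here `all_text` mixes the navigation bar and the advertisement with the title and summary, which is the "too much information" problem that paragraph [0003] raises.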


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
Inventors: 陈洪亮, 呼大为
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD