Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for determining webpage theme

A theme and webpage technology, applied in the field of webpage theme determination, can solve problems such as inconvenience, lost target, and loss of confidence in search engines, and achieve the effect of solving information extraction and expanding usability

Inactive Publication Date: 2014-06-04
DALIAN LINGDONG TECH DEV
View PDF2 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the popularization of the Internet and the development of information technology, a large number of information resources have been formed; with the continuous enrichment of network resources and the continuous expansion of network information, people's dependence on the network is becoming stronger and stronger, but it also gives service objects a vast number of resources. It is inconvenient to quickly find the specific resources you need in Internet resources; due to the increasingly diversified and personalized information needs of service objects, there are obvious personality differences among different service objects, although various search engines play an extremely important role. It plays an important role, but with the continuous growth and change of data in search engines, it is becoming more and more difficult to quickly find and obtain the information and services you need from a large amount of data. They will lose their goals or get some biased results. Therefore, when browsing the search results, many service objects often spend a lot of time and energy looking at some pages that have nothing to do with the services they are searching for. Loss of confidence, which leads to loss of clients;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for determining webpage theme

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The present invention provides an in-depth description of the realization of webpage topic information extraction; in this part, the webpage is divided according to the block label, and the feature weight vector is extracted from the divided semantic block and the entire webpage content, and the coarse-grained division of the webpage is constructed. DOM tree, and then extract the theme information of the web page based on the block type;

[0026] Such as figure 1 A method for determining a theme of a web page includes the following steps:

[0027] A. Determination of custom themes; all the themes in the webpage, that is, all directory areas, are listed for users to customize the theme, so as to realize the automatic information extraction after customization, and become the determination of the custom theme to be completed task; the process is a two-step process:

[0028] A1. To customize the topics you need to extract, you need to see all the topics in the webpage an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for determining a webpage theme. The method comprises the following steps that a custom theme and links for acquiring the webpage theme are determined; the links for acquiring the webpage theme comprises link context extraction and the local spread extraction link; the local spread extraction link comprises link context extraction in a theme type semantic block, and link context extraction and webpage link extraction in a catalog type and image type semantic block. According to the method for determining the webpage theme, a classification method of webpage HTML label document based on layout labels are given first, then the idea of blocks is disclosed, position numbering is conducted on classified semantic blocks, a layout label DOM tree with numbers is constructed, and at last theme determination and theme link extraction are conducted based on webpage blocks; a user can acquire theme information quickly, the availability of a Web is expanded, and the series problems of information extraction and the like can be better solved.

Description

technical field [0001] The invention relates to a network technology, in particular to a method for determining a webpage theme; Background technique [0002] With the popularization of the Internet and the development of information technology, a large number of information resources have been formed; with the continuous enrichment of network resources and the continuous expansion of network information, people's dependence on the network is becoming stronger and stronger, but it also gives service objects a vast number of resources. It is inconvenient to quickly find the specific resources you need in Internet resources; due to the increasingly diversified and personalized information needs of service objects, there are obvious personality differences among different service objects, although various search engines play an extremely important role. It plays an important role, but with the continuous growth and change of data in search engines, it is becoming more and more ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/955G06F40/14G06F40/30
Inventor 刘立堂李勇
Owner DALIAN LINGDONG TECH DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products