Method and device for extracting website names

A website domain name and website technology, applied in the Internet field, can solve the problems of high manual maintenance cost, difficulty in manually sorting out website names, limited coverage of manual collection websites, etc., and achieve the effect of full coverage and simple implementation

Inactive Publication Date: 2014-05-07
TENCENT TECH (SHENZHEN) CO LTD
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing technology usually adopts the method of manual collection to configure the website name to form a configuration table of , such as , etc.; and more and more websites Continuous establishment makes it more and more difficult to manually organize website names; the method of manual collection of website names in the prior art has the defect of very high manual maintenance costs, and the coverage of manual collection websites is also very limited

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting website names
  • Method and device for extracting website names

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0018] The present invention is to analyze the browsing record or browsing log generated by the user's online browsing of web pages when the user is offline, and automatically extract the website name of the user's online browsing website; provide an important basis for subsequent related data processing, such as the user from Sohu When the website reposts the news to Sina Weibo, it indicates that the news comes from Sohu.com, etc. The user’s online browsing of webpages includes: users can browse webpages through any browser, using mobile phones, computers and other terminals; for example, through mobile phone UC browser (a mobile browser developed by Youshi Technolog...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for extracting website names. The method comprises the steps of obtaining page headlines of websites out of browsing history, and extracting and grouping website domain names; extracting public subsegments at the heads and the tails of all the page headlines under the same website domain name; tidying all the page headlines after the public subsegment extraction, and obtaining the website names. The invention further discloses a device for extracting the website names. The method which comprises the steps of obtaining the page headlines of the websites out of the browsing history, extracting and grouping the website domain names, extracting the public subsegments at the heads and the tails of all the page headlines under the same website domain name, tidying all the page headlines after the public subsegment extraction and obtaining the website names has the advantage that the website names are automatically extracted under an off-line state, an achieving mode is simple, and the website coverage is complete.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a method and device for extracting a website name. Background technique [0002] With the rapid development of Internet technology and the gradual reduction of the threshold for individuals to establish websites, website domain names have shown explosive growth. The website name plays an indispensable and important role in displaying the source and source of the webpage, website filing and website management; at the same time, in the offline state, the website name that the user browses online is obtained by analyzing the user's browsing record, which is useful for subsequent analysis of user-related data. is of great significance. [0003] The existing technology usually adopts the method of manual collection to configure the website name to form a configuration table of <website domain name, domain name name>, such as <news.sina.com.cn, Sina News>, etc.; and m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/958
Inventor 蔡兵
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products