A Comprehensive Classification Method for Internet Websites Based on Multidimensional Features

A classification method and Internet technology, applied in the field of comprehensive classification of Internet sites based on multi-dimensional features, can solve the problems of unseen documents, technologies and products

Active Publication Date: 2020-05-26
EVERSEC BEIJING TECH
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, there are no documents, technologies and products that quantitatively measure the classification of "Internet +" industry websites in the whole country and provinces

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Comprehensive Classification Method for Internet Websites Based on Multidimensional Features
  • A Comprehensive Classification Method for Internet Websites Based on Multidimensional Features
  • A Comprehensive Classification Method for Internet Websites Based on Multidimensional Features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be described in further detail below in conjunction with the accompanying drawings, but it is not intended to limit the present invention.

[0024] In order to enable those skilled in the art to better understand the technical solutions of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0025] Before introducing the scheme of the embodiment of the present invention, first enter the following explanation to the nouns indicated in the specific embodiment of the present invention:

[0026] 1. Internet website: The Internet website referred to in this article refers to the website accessed in the form of a domain name through the HTTP protocol in the IDC computer room. Such as Baidu, Sina and so on.

[0027] 2. "Internet +" industry: The "Internet +" industry referred to in this article refers to various industries that provide Interne...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for comprehensively classifying Internet websites based on multi-dimensional features. The method includes: collecting domain name information of Internet websites to obtain the domain name characteristics of Internet websites; based on each domain name information, using crawlers to obtain Internet website title information corresponding to the domain name, Obtain the title information list of the Internet website; Based on each domain name information, adopt the crawler to obtain the Internet website homepage information corresponding to the domain name, and obtain the characteristics of the homepage of the Internet website; Based on each domain name information, Use the crawler to obtain the Internet website page link information corresponding to the domain name, Obtain the external link characteristics of the Internet website; comprehensively obtain the above-mentioned characteristics, through information association and machine learning, determine the industry attributes of the website and perform corresponding classification. The invention solves the problem that websites cannot be accurately classified in the prior art.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a method for comprehensively classifying Internet websites based on multi-dimensional features. Background technique [0002] With the rapid development of the Internet, "Internet +" has gradually become a new format in the new era. Designing a set of website classification methods that can truly reflect the "Internet +" of various industries has become an effective way to quantitatively measure the development of "Internet +" in various industries. Way. [0003] At present, there are no documents, technologies and products that quantitatively measure the classification of "Internet +" industry websites in the whole country and provinces. Contents of the invention [0004] The object of the present invention is to provide a comprehensive classification method of Internet websites based on multi-dimensional features, so as to realize the accurate classification of the type...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/958G06F16/951G06F16/953G06F16/9535G06K9/62
CPCG06F16/951G06F16/958G06F18/241
Inventor 张振涛崔渊博李金宇李湃蔡琳杨满智刘长永金红
Owner EVERSEC BEIJING TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products