Website industry classification method and server

A website industry classification technology, applied in the information technology field, that addresses the heavy manpower consumption and low execution efficiency of manual classification and thereby improves execution efficiency.

Status: Inactive. Publication Date: 2015-07-01
北龙中网(北京)科技有限责任公司
Cites: 5. Cited by: 29.

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a method and a server for classifying the industry to which a website belongs, to solve the technical problem in the prior art that manually judging the industry type of each website consumes a lot of manpower and has low execution efficiency.


Examples


Embodiment Construction

[0022] Figure 1 is a flow chart of an embodiment of the method, provided by the present invention, for classifying the industry to which a website belongs. The following steps may be executed by a server capable of obtaining relevant information about the website. As shown in Figure 1, the method includes:

[0023] S101, the server obtains the web page content information of the website to be classified;

[0024] The server uses an existing network information capture tool, such as a "web crawler" (a program or script that fetches information from the website to be classified), to obtain the web page content information of the website to be classified. The web page content information includes the content of all web pages contained in the website, including text, pictures, etc.
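Step S101 can be illustrated with a minimal sketch. It assumes the pages have already been fetched by some crawler and are available as HTML strings; the names `TextExtractor`, `extract_page_content`, and `collect_site_content` are hypothetical and do not come from the patent. It only shows one possible way to turn fetched pages into text and picture references.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and image URLs from one HTML page."""

    SKIPPED = {"script", "style"}  # content of these tags is not page text

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.image_urls = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.image_urls.append(src)

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside <script>/<style>.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def extract_page_content(html):
    """Return the text and image URLs found in one page (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return {"text": " ".join(parser.text_parts), "images": parser.image_urls}


def collect_site_content(pages):
    """pages: mapping of URL -> HTML string already fetched by a crawler."""
    return {url: extract_page_content(html) for url, html in pages.items()}
```

Fetching is deliberately left out, since the patent only says that an existing crawling tool is used and does not name one.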

[0025] S102, the server performs word segmentation processing on all the words conta...
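As a rough illustration of step S102, the sketch below splits page text into words and drops function words, leaving the notional (content) words. It is a simplified stand-in: it assumes English text and a tiny hypothetical stop-word list (`FUNCTION_WORDS`), whereas Chinese web pages would need a real word segmentation tool. The function name `notional_words` is likewise an assumption.

```python
import re

# Hypothetical, deliberately small list of function words to filter out.
FUNCTION_WORDS = frozenset({
    "a", "an", "the", "and", "or", "of", "to", "in", "on",
    "for", "with", "is", "are", "we", "our",
})


def notional_words(text):
    """Segment text into lowercase tokens and keep only notional words.

    Duplicates are kept on purpose so that later steps can count
    how often each word appears.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in FUNCTION_WORDS]
```

For example, `notional_words("We offer loans and savings accounts for the family")` yields the words `offer`, `loans`, `savings`, `accounts`, and `family`.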



Abstract

The invention provides a website industry classification method and a server. In the method, the server acquires the web page content information of a website to be classified; the server performs word segmentation on all text included in the web page content information to generate a notional word set corresponding to that information; the server matches all notional words in the set against the preset keywords corresponding to each industry category and determines how frequently the keywords of each industry category appear in the set; and the server determines the industry category of the website according to the proportion of the set accounted for by the appearances of each industry category's keywords. The method and the server effectively solve the technical problems of high labor consumption and low execution efficiency that arise in the prior art when the industry categories of websites are judged manually.
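The matching and proportion steps described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the keyword lists in `INDUSTRY_KEYWORDS` are invented, the function names are hypothetical, and the decision rule (pick the category with the highest proportion, subject to an optional `min_proportion` threshold) is one plausible reading, since the abstract does not spell out how the proportion is turned into a decision.

```python
from collections import Counter

# Hypothetical preset keywords per industry category.
INDUSTRY_KEYWORDS = {
    "finance": {"bank", "loan", "loans", "savings", "insurance"},
    "education": {"school", "course", "courses", "student", "teacher"},
}


def keyword_frequencies(words, industry_keywords):
    """Count how often each category's keywords appear among the words."""
    counts = Counter(words)
    return {
        industry: sum(counts[k] for k in keywords)
        for industry, keywords in industry_keywords.items()
    }


def keyword_proportions(words, industry_keywords):
    """Share of the notional words that are keywords of each category."""
    total = len(words)
    freqs = keyword_frequencies(words, industry_keywords)
    return {i: (f / total if total else 0.0) for i, f in freqs.items()}


def classify(words, industry_keywords, min_proportion=0.0):
    """Return the category with the highest proportion, or None.

    Assumed rule: the best category must exceed min_proportion.
    """
    props = keyword_proportions(words, industry_keywords)
    if not props:
        return None
    best = max(props, key=props.get)
    return best if props[best] > min_proportion else None
```

With the words `["bank", "loans", "savings", "family", "school"]`, three of five words are finance keywords and one is an education keyword, so `classify` returns `"finance"`. A site whose words match no keyword at all returns `None` rather than an arbitrary category.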

Description

Technical field

[0001] The invention relates to information technology, in particular to a method and a server for classifying the industry to which a website belongs.

Background

[0002] With the development of Internet technology, the number of domestic websites has increased rapidly. These websites provide various services for Internet users and span many industries, for example corporate websites that enterprises use to expand their business, and government websites that provide online government services or information inquiries. If the specific industry of each of these websites can be identified, similar websites can be found under the same industry category, which greatly helps the classification of website information and the improvement of search engine results.

[0003] In the prior art, the industry type of each website is judged manually, which not only consum...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventors: 高宁, 杨莹
Owner: 北龙中网(北京)科技有限责任公司