Website classification catalogue optimization analysis method based on log mining

A technology of classification and analysis methods, which is applied in the field of log mining and website classification optimization analysis, and can solve the problem of difficulty in comprehensively collecting user cognition.

Active Publication Date: 2015-11-25
NANJING UNIV OF SCI & TECH
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In general, the website classification system optimization method has the following problems: (1) It is difficult to comprehensively collect users' cognition about the website classification directory

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Website classification catalogue optimization analysis method based on log mining
  • Website classification catalogue optimization analysis method based on log mining
  • Website classification catalogue optimization analysis method based on log mining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The invention applies log mining to the field of optimizing website classification catalogs, and conducts research in combination with three steps of network log mining: data preprocessing, pattern discovery and pattern analysis.

[0057] Data preprocessing: Before data mining, preprocessing the data according to the mining purpose can improve the efficiency of later data mining. In order to facilitate the optimization of the classified directory of the website, the data is preprocessed into the form of a directory path.

[0058] Data pattern discovery: pattern discovery refers to the use of various data mining techniques to mine preprocessed data to find out the hidden laws or patterns. Different users have different expectations about website categories. A good website category can provide different categories of users in a personalized way. Therefore, the premise of optimization is to divide users into different categories according to their inner expectations. The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a website classification catalogue optimization analysis method based on log mining. According to the method, website log data is firstly preprocessed, wherein the log data refers to a series of webpage access data sets recorded on a server; through preprocessing, a catalogue path through which a user obtains information via a specific website is extracted from the log data; then, a method (VOB) based on the browsing path sequence is used for calculating the similarity between any two catalogue paths until a catalogue path similarity matrix is constructed; then, a divisive hierachical clustering (NHC) algorithm based on matrix transformation is used for performing clustering on the catalogue path similarity matrix, so that users corresponding to the catalogue paths are clustered into different categories; and finally, expected website classification catalogue systems of each category of users are mined out, and are subjected to comparison analysis on the original classification catalogue system. Through the steps, the website classification catalogue systems conforming to the expectation of the users can be mined out, and the quantitative decision support is provided for the website optimization.

Description

technical field [0001] The invention relates to a method for optimizing and analyzing a classified directory of a website, in particular to a method for optimizing and analyzing a classified directory based on log mining from the user's point of view. Background technique [0002] Whether the design of the website classification meets the user's expectations directly affects the user's satisfaction with the website, and then affects the user's willingness to use the website. Website category optimization is to decide whether to adjust the existing information category system of the website on the basis of evaluating the existing category categories of the website, and determine how to adjust if necessary. [0003] At present, most of the research on the optimization and analysis methods of the website classification system is based on traditional research methods such as questionnaires and telephone interviews. The insufficiency of the research methods themselves and the lim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 吴鹏张丽军李小军夏子然丁慧君高庆宁
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products