Blog hierarchy classification tree construction method based on label clustering

A technology of hierarchical classification and construction method, which is applied in the directions of instruments, computing, and electrical digital data processing, etc., can solve the problems that tag clustering cannot establish hierarchical relationships, and cannot specifically determine the specific topic of the log, achieving high efficiency and accuracy , strong practical effect

Inactive Publication Date: 2009-05-13
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF0 Cites 47 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] In order to solve the technical problems existing in the prior art that simple tag clustering cannot establish hierarchical relationships and tags cannot

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Blog hierarchy classification tree construction method based on label clustering
  • Blog hierarchy classification tree construction method based on label clustering
  • Blog hierarchy classification tree construction method based on label clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention is described below in conjunction with accompanying drawing:

[0046] see figure 1 A flow chart of a blog hierarchical classification tree construction method based on tag clustering of the present invention, combined with figure 2 A schematic diagram of the calling relationship between the algorithms of each part in a method for constructing a blog hierarchical classification tree based on tag clustering in the present invention. As shown in the figure, the blog hierarchy construction algorithm is a recursive call process. The first step of this algorithm is: initialize and input the pre-defined blog hierarchy classification tree and the adjacency matrix constructed from all tag relationship data; the second step: The tag clustering algorithm will be called to cluster the tag relationship data, thereby generating several tag clusters; the third step: using the topic generalization algorithm to extract one or more key tag words as the theme of ea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for constructing a blog hierarchical classification tree based on tag clustering. The method comprises the following steps: firstly, initializing and inputting a predefined blog hierarchical classification tree and an adjacency matrix which is constructed by tag relational data; secondly, calling a tag clustering algorithm to cluster the tag relational data so as to generate a plurality of tag clusters; thirdly, applying a theme generalization algorithm to extract one or a plurality of key tag words from each tag cluster as a theme of the cluster; fourthly, recursively calling the second step and the third step when the cluster can be further clustered; fifthly, constructing a new hierarchy and increasing a new theme node in the blog hierarchical classification tree after each recursive call is completed; sixthly, outputting a constructed blog hierarchical classification tree after the recursive terminal condition is completely fulfilled. The method is provided by aiming at searching, exploiting, browsing and other problems of blog data, can quickly organize the theme hierarchical relation of mass blog data, and has higher efficiency and accuracy.

Description

technical field [0001] The invention relates to a technology for constructing a blog topic hierarchy, in particular to a method for constructing a blog hierarchical classification tree based on tag clustering. Background technique [0002] Blog is a blog, which is a popular personal media. It carries a large amount of valuable information, and its status in the Internet is becoming more and more important, and it has become an indispensable part of people's daily life and work. However, because the information characteristics of blogs and traditional webpages are very different, how to carry out targeted retrieval and deeper mining and utilization of information in blogs has become a hot spot in the current Internet application research. [0003] In solving problems such as information retrieval, mining and browsing of blogs, it is an important link to extract topics from log content. Blog logs contain a variety of topics, and it is necessary to distinguish different topic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 叶允明王冰伟何金艳
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products