Tag-co-occurred tag clustering method

A clustering method and labeling technology, applied in the field of tag clustering where tags co-occur, can solve problems such as low accuracy of label description resources, ambiguous semantics, and chaotic organization, and achieve effective and fast clustering methods, reliable clustering results, The effect of low system resource requirements

Inactive Publication Date: 2014-12-17
WUHAN UNIV OF SCI & TECH
View PDF2 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] On the basis of the above research, the present invention is based on improving the previous tag clustering method, and solves the problems of low accuracy of tag description resources, chaotic organization, and ambiguous semantics.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tag-co-occurred tag clustering method
  • Tag-co-occurred tag clustering method
  • Tag-co-occurred tag clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The technical scheme of the present invention can adopt software technology to realize automatic flow operation. The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0049] The present invention studies the label clustering method and proposes a label clustering method of label co-occurrence. The realization of the method mainly includes two innovations: feature vector extraction and clustering with improved K-means.

[0050] The extraction of feature vectors is based on the following definitions:

[0051] 1. Define a label matrix, the matrix U nxm It is an n×m matrix, n is the number of tags, m is the number of resources, and the element u in the matrix iq Indicates the label t i Annotate resource r q The frequency of , where i takes values ​​1, 2,...,n, and q takes values ​​1, 2,...,m.

[0052] 2. Define a common annotation matrix, the matrix C n×n It is an n×n ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a tag-co-occurred tag clustering method. A tag matrix, a common tag matrix, a tag importance degree matrix and a similarity matrix are defined in order to improve clustering effectiveness; feature vectors of tags are determined by extracting tag co-occurrence information; the method that the distance between one object and another object is calculated by using geometrical distance in the traditional clustering algorithm is changed into the method by using Pearson correlation coefficient in the way that the similarity is calculated by extracting the feature vectors; the tag-co-occurred tag clustering method which is used for clustering the tags by being combined with the K-means clustering algorithm is provided. The clustering method provided by the invention is better than the other clustering methods in effect and has good effectiveness and feasibility.

Description

technical field [0001] The invention relates to the technical field of network label clustering, in particular to a label clustering method for label co-occurrence. Background technique [0002] Tags are the user's subjective understanding of information and an intermediary between objective information and subjective understanding. In social networks, information is linked together through the same label, and users are also linked with other resources and users by using labels, so that people can use labels to communicate, make friends, and other operations. As a part of the online social network, tags have been extensively studied. Websites such as Flickr, del.icio.us, Douban.com, and Youtobe have all adopted collaborative labeling and clustering of tags. There are relatively few studies. At this stage, the research on optimizing the label system mainly focuses on the label cloud and the orderly organization of labels. The research on the correlation degree between tags...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/9535G06F16/35
Inventor 李鹏王娅丹金瑜刘宇何亨
Owner WUHAN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products