Picture retrieval clustering method facing to Web2.0 label picture shared space

A web2.0, clustering method technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of labeling inconsistency, not considering labeling inconsistency, and not being able to adapt well. Achieve the effect of solving the problem of inconsistent expression, satisfying the image clustering results, and solving the query semantic ambiguity

Inactive Publication Date: 2011-11-09
ZHEJIANG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In recent years, more and more tag application systems for text and multimedia content have appeared on the Internet. For example, Del.icio.us provides joint tags for web page bookmarks. Similarly, there is CiteUlike for the field of academic paper publishing, which targets Flickr for image label sharing, and Youtube for video label sharing, etc., the success of these commercial applications also confirms that labeling is a good joint sharing approach, and label retrieval has become a common and popular method in the field of information retrieval. However, there are semantic deviations and certain limitations in tag-based retrieval. For example, different users use different tags (such as synonyms, singular and plural, etc.) The ambiguity of the query also leads to ambiguous query semantics. How to capture this inconsistency during retrieval and quickly cluster the retrieval results to effectively distinguish ambiguity has become one of the concerns of the academic and industrial circles;
[0003] At present, there are many researches on image retrieval systems using image clustering image retrieval algorithms. Clustering algorithms are basically based on a set distance measure. The choice of distance measure determines what kind of pictures are clustered together. Pull distance, Manhattan distance, etc., but these methods are not well adapted to the highly dynamic and massive Web2.0 labeling system. Flickr mines label information to cluster image retrieval results. However, this kind of Flickr The clustering application not only does not consider the inconsistency of the label itself, but is only suitable for clustering the retrieval results of a single label;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Picture retrieval clustering method facing to Web2.0 label picture shared space
  • Picture retrieval clustering method facing to Web2.0 label picture shared space
  • Picture retrieval clustering method facing to Web2.0 label picture shared space

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In the retrieval system facing the shared space of Web2.0 tagged images, the retrieval clustering method provided by the present invention can realize fast and effective retrieval of highly dynamic and massive tagged images, and solve the semantics of the tags themselves in the tag space to a certain extent Consistency issues and query semantic ambiguity issues, taking the image data set of the Flickr image sharing website as an example, the specific implementation steps are as follows:

[0036] 1) The system first establishes an inverted index of tag keywords for the obtained Flickr image data set, and performs preprocessing analysis on the tag set in the image database:

[0037] The first step is to build a label inverted index table TAIL for query expansion based on the existing vocabulary association knowledge and morphological transformation knowledge. Flickr pictures are basically English labels, and labels can be obtained through the vocabulary relationship struct...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a retrieval result clustering method facing to a Web2.0 label picture shared space, which comprises the following steps: excavating a vocabulary relationship and an associated relationship between labels; obtaining an expanded querying label set by a query label according to the vocabulary relationship between the labels; obtaining a candidate image set relevant to query by the expanded query label set; selecting front K most relevant labels according to the relevance measurement of the labels in the query label set and the candidate image set; automatically dividing the K labels into an optimal clustering result according to the association between the K labels by a clustering algorithm based on a picture division from top to bottom; and correspondingly clusteringthe candidate image set according to clustering labels. Aiming at the problem of inconformity of label expression, the effective query expansion is realized, and the image clustering method based on most relevant label set clustering solves the problem of diversity of label semanteme. Compared with a traditional method, the invention leads a user to rapidly and effectively retrieve and browse a picture in the Web2.0 label picture shared space.

Description

technical field [0001] The invention relates to technologies related to retrieval and clustering processing of massive pictures, in particular to a picture result clustering method oriented to the shared space of Web2.0 tagged pictures. Background technique [0002] In recent years, more and more tag application systems for text and multimedia content have appeared on the Internet. For example, Del.icio.us provides joint tags for web page bookmarks. Similarly, there is CiteUlike for the field of academic paper publishing, which targets Flickr for image label sharing, and Youtube for video label sharing, etc., the success of these commercial applications also confirms that labeling is a good joint sharing approach, and label retrieval has become a common and popular method in the field of information retrieval. However, semantic deviation and certain limitations generally exist in tag-based retrieval. For example, different users use different tags (such as synonyms, singular...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 李晓燕陈刚寿黎但胡天磊陈珂
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products