A distributed large-scale face clustering method and device

A clustering method and large-scale technology, applied in the field of data clustering, can solve the problems of increasing the computational complexity of data clustering, affecting the efficiency of face data clustering, and taking a long time for data clustering, so as to improve the face clustering. Efficiency, ensuring the stability of clustering results, and reducing the amount of data calculation

Active Publication Date: 2022-04-26
GUANGZHOU PCI TECH SOFTWARE DEV CO LTD +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the magnitude of face data is large, the computational complexity of data clustering increases accordingly, and a large amount of storage space is required to store the similarity value of any two face data
Its data clustering takes a relatively long time, and the increase in memory consumption will lead to insufficient system memory, further affecting the efficiency of face data clustering

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A distributed large-scale face clustering method and device
  • A distributed large-scale face clustering method and device
  • A distributed large-scale face clustering method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0050] figure 1 A flow chart of a distributed large-scale face clustering method provided in Embodiment 1 of the present application is given. The distributed large-scale face clustering method provided in this embodiment can be composed of distributed large-scale face clustering The distributed large-scale face clustering device can be implemented by means of software and / or hardware. Generally speaking, the distributed large-scale face clustering device can be computing devices such as computers and server hosts.

[0051] In the following, a distributed large-scale face clustering device is used as an example to perform a distributed large-scale face clustering method for description. refer to figure 1 , the distributed large-scale face clustering method specifically includes:

[0052] S110. Perform batch clustering on the face pictures to be clustered and summarize the clustering results, and obtain a corresponding clustering set and an unclustering set based on the clust...

Embodiment 2

[0096] On the basis of the above examples, Figure 7 It is a schematic structural diagram of a distributed large-scale face clustering device provided in Embodiment 2 of the present application. refer to Figure 7 , the distributed large-scale face clustering device provided in this embodiment specifically includes: a first clustering module 21 , a computing module 22 , a second clustering module 23 and a merging module 24 .

[0097] Wherein, the first clustering module 21 is used to perform batch clustering on the face pictures to be clustered and summarize the clustering results, and obtain corresponding clustering sets and unclustering sets based on the clustering results, the unclustering The class collection includes multiple unclustered face pictures;

[0098] Calculation module 22 is used for extracting the human face picture of setting quantity from each class of described clustering set to constitute corresponding representative class, and calculates each described ...

Embodiment 3

[0104] Embodiment 3 of the present application provides an electronic device, referring to Figure 8 , the electronic device includes: a processor 31 , a memory 32 , a communication module 33 , an input device 34 and an output device 35 . The number of processors in the electronic device may be one or more, and the number of memories in the electronic device may be one or more. The processor, memory, communication module, input device and output device of the electronic device can be connected through a bus or in other ways.

[0105] Memory 32, as a computer-readable storage medium, can be used to store software programs, computer-executable programs and modules, such as program instructions / modules corresponding to the distributed large-scale face clustering method described in any embodiment of the present application (eg , the first clustering module, the calculation module, the second clustering module and the merging module in the distributed large-scale face clustering ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the present application discloses a distributed large-scale face clustering method and device. In the technical solution provided by the embodiment of the present application, the face images to be clustered are clustered in batches and the clustering results are summarized to obtain a clustered set and an unclustered set, and settings are extracted from each class of the clustered set A number of face pictures form a representative class, and calculate the similarity distance between each unclustered face picture and each representative class to obtain a similarity distance set, and cluster the unclustered face pictures into In the most similar representative class, and determine the link relationship between each unclustered face picture and the corresponding representative class according to the set link threshold, merge each representative class based on the link relationship, and output the merged result. Using the above technical means can reduce the amount of data calculation for large-scale face data clustering, reduce memory consumption, and improve the efficiency of face clustering on the premise of ensuring the stability of clustering results.

Description

technical field [0001] The embodiments of the present application relate to the technical field of data clustering, and in particular to a distributed large-scale face clustering method and device. Background technique [0002] At present, in many business scenarios of intelligent security, it is necessary to cluster a large amount of unlabeled face data to simplify the face data labeling process or achieve other data processing purposes. The core problem of face clustering is to divide the collected N unlabeled photos into M individuals, and "M" is unknown in advance. Most of the face clustering algorithms obtain face pictures, use the existing feature extraction technology to extract the feature vector of the face picture, and cluster all the face pictures to be clustered according to the extracted feature vector and a certain clustering algorithm . When the clustering algorithm clusters the clustered face pictures, it often needs to compare the face features to obtain t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/23G06F18/22
Inventor 李逸帆丁保剑秦伟郑丁科曾明杨东泉
Owner GUANGZHOU PCI TECH SOFTWARE DEV CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products