One-shot Face Recognition Method Based on Sparse Representation of Hybrid Extended Block Dictionary

A hybrid extension and sparse representation technology, which is applied in character and pattern recognition, unstructured text data retrieval, image enhancement, etc., can solve the problems of lack of discrimination, lack of consistency, poor face recognition effect, etc., to eliminate Redundant information between pixels, reducing the atomic dimension of the dictionary, and facilitating accurate recognition

Active Publication Date: 2022-02-08
NANJING INST OF TECH
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although this type of method plays a certain role in single-sample face recognition, there are still two fatal flaws: (1) The virtual sample generated by synthesis is highly correlated with the standard sample, so it cannot be used as an independent sample. Intra-class change information is not representative
(2) When the face image is divided into blocks, it is usually assumed that the block image of the test sample and the block image of the training sample at the same position have similar face structural features. However, in practical applications, due to changes in face posture or accessories Due to the influence of occlusion, there may be a big difference between the block image of the test sample and the training sample at the same position, which leads to the above assumptions not being true, and the single-sample face recognition effect is not good
However, in practical applications, it is a daunting task to collect enough generic samples that satisfy various variations
[0007] (2) The atoms in the above dictionary are all represented by the original image, which leads to a large amount of pixel redundant information in the image-based dictionary, lack of consistency between atoms of the same type, and lack of discrimination between atoms of different types; at the same time, the dictionary Atoms are represented by converting two-dimensional images into one-dimensional column vectors, which also causes the dimension of dictionary atoms to be much larger than the number of atoms, which is prone to the "small sample" problem and cannot guarantee the optimal sparse solution in the solution space
Although methods such as SVDL and RADL treat occlusion information as sparse reconstruction errors and can overcome the above problems, such algorithms have high computational complexity and lack operability in practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • One-shot Face Recognition Method Based on Sparse Representation of Hybrid Extended Block Dictionary
  • One-shot Face Recognition Method Based on Sparse Representation of Hybrid Extended Block Dictionary
  • One-shot Face Recognition Method Based on Sparse Representation of Hybrid Extended Block Dictionary

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0089] Embodiment 1: Experiment in CAS-PEAL database:

[0090] first as figure 1 The building blocks shown, figure 1 Take B=4:

[0091] The CAS-PEAL face database contains 1040 types of people, a total of 99594 face images (including 595 males and 445 females). All images are collected in a special collection environment, covering four main changing conditions of posture, expression, decoration and illumination. Some face images have changes in background, distance and time span. The present invention selects 9031 images among them for experiments, some sample images such as image 3 As shown, the image size is 120×100 pixels.

[0092] On the CAS-PEAL database, the standard dataset of target objects, the general dataset of non-target objects, the occlusion dataset and the intra-class change dataset are designed as follows:

[0093] (1) The general dataset of non-target objects includes 180 types of people with changing lighting and 80 types of people with changing express...

Embodiment 2

[0107] Example 2: Experiment on the AR database:

[0108] Same as embodiment 1 first as figure 1 The building blocks shown, figure 1 Take B=4:

[0109] The AR face database contains 126 types of people (56 females, 70 males), with a total of more than 4,000 frontally aligned faces. Each type of person was shot in two stages, with 13 images in each stage, including 4 images of illumination changes, 3 images of expression changes, 3 images of glasses occlusion, and 3 images of scarf occlusion. The present invention selects 100 types of people for experiments, and performs cropping and normalization processing on the images, and the size after cropping is 120×100 pixels.

[0110] On the AR database, the standard data set of target objects, the general data set of non-target objects, the occlusion data set and the intra-class change data set are designed as follows:

[0111] (1) The standard sample set of the target object consists of any 30 types of people in the AR database,...

Embodiment 3

[0121] Embodiment 3: experiment in LFW database:

[0122] Same as embodiment 1 first as figure 1 The building blocks shown, figure 1 Take B=4; LFW (Labeled Faces in the Wild database) database is a real face database collected from the Internet, with a total of 13,233 face images of 5,749 categories, including illumination, expression, posture, occlusion, age, race, etc. This kind of mixed interference is more challenging for accurate face recognition. The present invention selects people who contain more than 10 images for identification, and obtains 158 types of people. In order to facilitate the experiment, 10 images of each type of person are selected here, and 1580 samples are used for the experiment. Some samples such as Figure 6 As shown, the image size is 120×100 pixels.

[0123] Since the samples in the LFW database are collected from real environments with various mixed interferences, it is impossible to construct typical occlusion datasets and intra-class vari...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a single-sample face recognition method based on the sparse representation of a hybrid extended block dictionary, which includes step (S1): constructing a general data set X of non-target objects; step (S2): constructing a standard sample set N of target objects; Step (S3): Construct the test sample set Y; Step (S4): Construct the occlusion block dictionary and intra-class difference block dictionary of non-target objects Step (S5): The mixed and complete extended block dictionary obtained according to the above steps adopts the The weighted block sparse representation classifier performs linear sparse representation on the B block images of the target object sample y to be tested, so as to perform occluded face recognition of the sample to be tested. In the present invention, the face image is divided into blocks first, and then the basic block dictionary of the target object, the occlusion block dictionary of the non-target object and the intra-class difference block dictionary are respectively constructed by using the KDA algorithm, and finally the weighted block sparse representation classifier is used to accurately classify the samples to be tested. prediction, which effectively improves the accuracy of single-sample face recognition.

Description

technical field [0001] The invention relates to the technical field of single-sample face recognition in which only one or a small number of standard samples are identified in human-computer interaction, and in particular to a single-sample face recognition method based on the sparse representation of a hybrid extended block dictionary. Background technique [0002] In recent years, due to the rapid development of technologies such as artificial intelligence, computer vision, and Internet of Things communications, face recognition technology has been widely used in real life, such as smart home appliances, smart retail, and smart monitoring. However, in some application scenarios, due to the limitation of storage space and personal privacy protection, some face recognition systems only include one or a small number of frontal standard images of each person (that is, those that are not disturbed by external factors such as illumination, expression, and occlusion). Frontal ima...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06V40/16G06V10/26G06K9/62G06T7/11G06F16/36
CPCG06T7/11G06F16/374G06T2207/20021G06T2207/30201G06V40/171G06V10/267G06F18/2132
Inventor 童莹马杲东曹雪虹陈瑞赵小燕
Owner NANJING INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products