One-shot Face Recognition Method Based on Sparse Representation of Hybrid Extended Block Dictionary

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A hybrid extension and sparse representation technology, which is applied in character and pattern recognition, unstructured text data retrieval, image enhancement, etc., can solve the problems of lack of discrimination, lack of consistency, poor face recognition effect, etc., to eliminate Redundant information between pixels, reducing the atomic dimension of the dictionary, and facilitating accurate recognition

Active Publication Date: 2022-02-08

NANJING INST OF TECH

View PDF10 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Although this type of method plays a certain role in single-sample face recognition, there are still two fatal flaws: (1) The virtual sample generated by synthesis is highly correlated with the standard sample, so it cannot be used as an independent sample. Intra-class change information is not representative

(2) When the face image is divided into blocks, it is usually assumed that the block image of the test sample and the block image of the training sample at the same position have similar face structural features. However, in practical applications, due to changes in face posture or accessories Due to the influence of occlusion, there may be a big difference between the block image of the test sample and the training sample at the same position, which leads to the above assumptions not being true, and the single-sample face recognition effect is not good

However, in practical applications, it is a daunting task to collect enough generic samples that satisfy various variations

[0007] (2) The atoms in the above dictionary are all represented by the original image, which leads to a large amount of pixel redundant information in the image-based dictionary, lack of consistency between atoms of the same type, and lack of discrimination between atoms of different types; at the same time, the dictionary Atoms are represented by converting two-dimensional images into one-dimensional column vectors, which also causes the dimension of dictionary atoms to be much larger than the number of atoms, which is prone to the "small sample" problem and cannot guarantee the optimal sparse solution in the solution space

Although methods such as SVDL and RADL treat occlusion information as sparse reconstruction errors and can overcome the above problems, such algorithms have high computational complexity and lack operability in practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0089] Embodiment 1: Experiment in CAS-PEAL database:

[0090] first as figure 1 The building blocks shown, figure 1 Take B=4:

[0091] The CAS-PEAL face database contains 1040 types of people, a total of 99594 face images (including 595 males and 445 females). All images are collected in a special collection environment, covering four main changing conditions of posture, expression, decoration and illumination. Some face images have changes in background, distance and time span. The present invention selects 9031 images among them for experiments, some sample images such as image 3 As shown, the image size is 120×100 pixels.

[0092] On the CAS-PEAL database, the standard dataset of target objects, the general dataset of non-target objects, the occlusion dataset and the intra-class change dataset are designed as follows:

[0093] (1) The general dataset of non-target objects includes 180 types of people with changing lighting and 80 types of people with changing express...

Embodiment 2

[0107] Example 2: Experiment on the AR database:

[0108] Same as embodiment 1 first as figure 1 The building blocks shown, figure 1 Take B=4:

[0109] The AR face database contains 126 types of people (56 females, 70 males), with a total of more than 4,000 frontally aligned faces. Each type of person was shot in two stages, with 13 images in each stage, including 4 images of illumination changes, 3 images of expression changes, 3 images of glasses occlusion, and 3 images of scarf occlusion. The present invention selects 100 types of people for experiments, and performs cropping and normalization processing on the images, and the size after cropping is 120×100 pixels.

[0110] On the AR database, the standard data set of target objects, the general data set of non-target objects, the occlusion data set and the intra-class change data set are designed as follows:

[0111] (1) The standard sample set of the target object consists of any 30 types of people in the AR database,...

Embodiment 3

[0121] Embodiment 3: experiment in LFW database:

[0122] Same as embodiment 1 first as figure 1 The building blocks shown, figure 1 Take B=4; LFW (Labeled Faces in the Wild database) database is a real face database collected from the Internet, with a total of 13,233 face images of 5,749 categories, including illumination, expression, posture, occlusion, age, race, etc. This kind of mixed interference is more challenging for accurate face recognition. The present invention selects people who contain more than 10 images for identification, and obtains 158 types of people. In order to facilitate the experiment, 10 images of each type of person are selected here, and 1580 samples are used for the experiment. Some samples such as Figure 6 As shown, the image size is 120×100 pixels.

[0123] Since the samples in the LFW database are collected from real environments with various mixed interferences, it is impossible to construct typical occlusion datasets and intra-class vari...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a single-sample face recognition method based on the sparse representation of a hybrid extended block dictionary, which includes step (S1): constructing a general data set X of non-target objects; step (S2): constructing a standard sample set N of target objects; Step (S3): Construct the test sample set Y; Step (S4): Construct the occlusion block dictionary and intra-class difference block dictionary of non-target objects Step (S5): The mixed and complete extended block dictionary obtained according to the above steps adopts the The weighted block sparse representation classifier performs linear sparse representation on the B block images of the target object sample y to be tested, so as to perform occluded face recognition of the sample to be tested. In the present invention, the face image is divided into blocks first, and then the basic block dictionary of the target object, the occlusion block dictionary of the non-target object and the intra-class difference block dictionary are respectively constructed by using the KDA algorithm, and finally the weighted block sparse representation classifier is used to accurately classify the samples to be tested. prediction, which effectively improves the accuracy of single-sample face recognition.

Description

technical field [0001] The invention relates to the technical field of single-sample face recognition in which only one or a small number of standard samples are identified in human-computer interaction, and in particular to a single-sample face recognition method based on the sparse representation of a hybrid extended block dictionary. Background technique [0002] In recent years, due to the rapid development of technologies such as artificial intelligence, computer vision, and Internet of Things communications, face recognition technology has been widely used in real life, such as smart home appliances, smart retail, and smart monitoring. However, in some application scenarios, due to the limitation of storage space and personal privacy protection, some face recognition systems only include one or a small number of frontal standard images of each person (that is, those that are not disturbed by external factors such as illumination, expression, and occlusion). Frontal ima...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06V40/16G06V10/26G06K9/62G06T7/11G06F16/36

CPCG06T7/11G06F16/374G06T2207/20021G06T2207/30201G06V40/171G06V10/267G06F18/2132

Inventor 童莹马杲东曹雪虹陈瑞赵小燕

Owner NANJING INST OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

One-shot Face Recognition Method Based on Sparse Representation of Hybrid Extended Block Dictionary

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology