The invention discloses a multi-scale feature fused multi-supervision face in-vivo detection method, which comprises the following steps of: acquiring an image
data set, and preprocessing the
data set; gradient texture features are extracted through central difference
convolution, and codes are fused; extracting multi-scale
discriminant features through a group
receptive field branch, and splicing and fusing the multi-scale
discriminant features with a gradient texture
branch; fusing the two features, inputting the fused features into a residual structure, carrying out deep
semantic learning and coding, and inputting a result into a
depth map generator and a
mask generator to obtain a feature map; a
depth map is used for supervision, and a binary
mask is used for auxiliary supervision; and fusing output results of the
depth map generator and the
mask generator, calculating a
prediction score, and realizing end-to-end
living body detection. According to the invention, the performance and generalization ability of the network can be improved, and the method has the advantages of small parameter quantity and end-to-end detection; compared with an existing mainstream
living body detection
algorithm, the method is higher in detection precision and better in robustness.