The invention discloses a face multi-area fusion expression recognition method based on depth learning, which comprises the following steps of detecting a face position with a detection model; obtaining the coordinates of the key points by using the key point model; aligning the eyes according to the key points of the eyes, then aligning the face according to the coordinates of the key points of the whole face, and clipping the face region by affine transformation; cutting the eye and mouth areas of the image to a certain proportion; dividing the convolution neural network into one backbone network and two branch networks; carrying out the feature fusion in the last convolution layer, and finally obtaining the expression classification results by the classifier. The method of the inventionutilizes the priori information, besides the whole face, the eyes and mouth regions are also used as the input of the network, and the network can learn the whole semantic features of facial expressions and the local features of facial expressions through model fusion, so that the method simplifies the difficulty of facial expression recognition, reduces the external noise, and has strong robustness, high accuracy, low complexity of the algorithm and so on.