The invention discloses a fashion compatibility analysis method and system based on deep multi-modal feature fusion, and the method comprises the steps: processing text data through a sample feature extraction network, a visual feature extraction network based on Resnet-18, and a text feature extraction network based on one-hot coding; after features are extracted, the features are fused, a visual feature and text feature fusion network based on an attention mechanism, extracted visual features and text features are fused, a visual feature self-attention network based on the attention mechanism is adopted, and feature expression of a visual mode is enhanced; mapping the fusion features into a multi-modal vector space by using a feature representation network based on multi-layer mapping; and finally, a fusion feature compatibility calculation network is used, a fusion feature positive pair distance is shortened in a multi-modal vector space, and a negative pair distance is expanded. According to the method and the device, the fashion single items can be reasonably matched, and the accuracy of the fashion single item matching result is improved.