The invention relates to an image feature extraction and similarity measurement method used for three-dimensional city model retrieval. Features extracted through most image and three-dimensional model retrieval methods lack or ignore description of model details, and accordingly, the three-dimensional model retrieval precision is not high. The invention provides a three-dimensional city model retrieval frame based on images. Firstly, retrieval targets on the images are obtained through division, meanwhile, a light field is used for conducting two-dimensional exchanging on three-dimensional city models, features of query targets and features of the retrieval model images are extracted, finally, the similarity between the features is measured through the similarity distance, and three-dimensional city model retrieval is realized. The image feature extraction and similarity measurement method has the advantages that the three-layer frame for image feature extraction and similarity measurement is provided, multiple layers of multi-scale convolutional neural network models with spatial constraints are designed in the frame, and the distinguishable features with invariable displacement, scales and deformation are obtained; a novel similarity measurement method is provided, and similarity matching between the targets is better realized. Compared with an existing method, the efficiency and the precision of the method in three-dimensional city model retrieval are greatly improved.