The invention provides a superpixel-based target reconstruction method. The method consists of three parts: structure-from-motion (SFM) estimation of the projection matrix, multi-view stereo matching on superpixels, and synthesis and fusion of depth maps. The process includes six major steps: step 1, reading an image sequence and using a structure-from-motion method to estimate the camera projection matrix; step 2, performing superpixel segmentation on image pairs; step 3, calculating candidate depth values for each superpixel; step 4, using a Markov random field (MRF) model to select the optimal depth value of each superpixel; step 5, building a multi-scale superpixel framework; and step 6, performing depth map fusion and surface meshing. The method overcomes the insufficient precision of superpixel-based stereo matching while retaining its advantages: high robustness to noise and brightness deviations, accurate target contour information, and low computational complexity. Better reconstruction results can be achieved for both textured and textureless regions, so the method is highly general and has broad application prospects.
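As a non-authoritative illustration of step 4, the following minimal Python sketch selects one depth per superpixel by minimizing an MRF energy made of a per-superpixel matching cost and a smoothness penalty between adjacent superpixels. The abstract does not specify the energy terms or the inference algorithm, so the absolute-difference smoothness term, the use of iterated conditional modes (ICM), and all names (select_depths_icm, unary, candidates, neighbors, smooth_weight) are assumptions made for illustration only.

```python
def select_depths_icm(unary, candidates, neighbors, smooth_weight=1.0, max_iters=20):
    """Pick one candidate depth per superpixel with iterated conditional modes.

    unary[i][k]      -- assumed matching cost of superpixel i at candidate k
    candidates[i][k] -- candidate depth value k of superpixel i (step 3 output)
    neighbors[i]     -- indices of superpixels adjacent to superpixel i
    smooth_weight    -- hypothetical weight of the smoothness term
    """
    n = len(unary)
    # Start from the candidate with the lowest matching cost for each superpixel.
    labels = [min(range(len(unary[i])), key=unary[i].__getitem__) for i in range(n)]

    for _ in range(max_iters):
        changed = False
        for i in range(n):
            def energy(k, i=i):
                depth = candidates[i][k]
                # Assumed pairwise term: absolute depth difference to each neighbor.
                pairwise = sum(abs(depth - candidates[j][labels[j]]) for j in neighbors[i])
                return unary[i][k] + smooth_weight * pairwise

            best = min(range(len(unary[i])), key=energy)
            if best != labels[i]:
                labels[i] = best
                changed = True
        if not changed:  # No label changed in a full sweep: local minimum reached.
            break

    return [candidates[i][labels[i]] for i in range(n)]


if __name__ == "__main__":
    # Three superpixels in a chain (0 - 1 - 2), two candidate depths each.
    candidates = [[1.0, 5.0], [1.0, 5.0], [1.0, 5.0]]
    unary = [[0.0, 1.0], [0.5, 0.4], [0.0, 1.0]]  # superpixel 1 is ambiguous
    neighbors = [[1], [0, 2], [1]]
    print(select_depths_icm(unary, candidates, neighbors, smooth_weight=0.2))
    # prints [1.0, 1.0, 1.0]
```

In this toy example the middle superpixel slightly prefers depth 5.0 on matching cost alone, but the smoothness term pulls it to the depth of its neighbors. ICM only finds a local minimum; graph cuts or belief propagation are common alternatives for the same kind of energy.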