The invention relates to the technical field of exercise fitness equipment, in particular to an interactive fitness system and method based on virtual scenes. The system comprises fitness equipment, aKinect sensor, a large-screen displayer, a computer host, a loudspeaker and a fitness management system. Fitness users can select different instruments to be combined with the virtual scenes, and byintroducing action evaluation and correction and feedback of multiple aspects of vision, hearing and tactile sense, fitness actions are corrected and standardized in time, so that the effect of quickly developing standard fitness is achieved. The fitness actions of the users are evaluated in real time, action trigger thresholds or program types are self-adaptively changed according to completion effects so as to adjust the training difficulty coefficient, and meanwhile, the training difficulty can be adjusted by adjusting the training frequency and training time and selecting different instruments. When the system detects that the users complete a certain fitness action completely correctly, the number of training is automatically increased by one until the users start a next training program after the users complete training in a targeted number set by the training program, and then the training rhythm is actively controlled.