The invention discloses a video composition method, a terminal and a computer readable storage medium, wherein the terminal can realize compositing contents of a plurality of video clips into one video clip and thereby performing playing or sharing, and so on; for each to-be-composited video, the terminal respectively extracts a target object in each video frame of each video; and then, the targetobjects of the video frames, of which video frame numbers (representing positions of the video frames in the video) are corresponding, among the videos are composited into a new video frame, and allthe obtained new video frames are combined orderly to form a composited video. The invention also discloses the terminal and the computer readable storage medium, by implementation of the technical scheme, the contents in multiple video clips are composited in one video clip, processing modes of the terminal for the video are enriched, and show forms of the video contents are also enriched, thus,composited pictures can be more living and interesting, and particularly, user experience of recording and sharing the video can be improved.