The invention relates to a method for video
semantic mining, comprising the steps of: firstly performing a Chinese continuous
speech recognition, a video target recognition and a video
character recognition on a to-be-processed video; then performing a
Chinese word division and a part-of-speech tagging on the recognition result, reserving nouns and verbs as the peaks of a
graph model, wherein a side weight between the peaks is set to be the Chinese semantic distance of words represented by the two peaks; and finally mining the
semantic information of the video according to a dense subgraph finding
algorithm. The
semantic mining of the video is realized by the fusion of the three recognition results of the Chinese continuous
speech recognition, the video target recognition and the video
character recognition; the video is represented to be a
graph model, wherein the peaks are the words in the video, and the side weight is set to be the semantic distance between the two peaks; the video
semantic mining algorithm is further transformed to be the dense subgraph finding
algorithm of the
graph model. The method and the device in the invention solve the problems of high error rate of a single recognition result and incapability of efficiently fusing a plurality of recognition results in the process of the Chinese continuous
speech recognition, the video target recognition and the video
character recognition, as well as solve the problem of the video structured expression and the problem of the video semantic mining algorithm realization. The method and the device in the invention can be used for performing automatic marking, classification and semantic mining on batches of videos.