Summarization of audio and/or visual data

A technology of video data and audio data, applied in the direction of electric digital data processing, special data processing applications, digital data information retrieval, etc., can solve problems such as expensive creation and maintenance, slow access, and inability to find names or roles
CN101137986AInactive Publication Date: 2008-03-05KONINKLIJKE PHILIPS ELECTRONICS NV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
KONINKLIJKE PHILIPS ELECTRONICS NV
Publication Date
2008-03-05
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
Patent Text Reader

Abstract

Summarization of audio and / or visual data based on clustering of object type features is disclosed. Summaries of video, audio and / or audiovisual data may be provided without any need of knowledge about the true identity of the objects that are present in the data. In one embodiment of the invention are video summaries of movies provided. The summarization comprising the steps of inputting audio and / or visual data, locating an object in a frame of the data, such as locating a face of an actor, extracting type features of the located object in the frame. The extraction of type features is done for a plurality of frames and similar type features are grouped together in individual clusters, each cluster being linked to an identity of the object. After the processing of the video content, the largest clusters correspond to the most important persons in the video.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The present invention relates to summarization of audio and / or video data, and in particular to summarization of audio and / or video data based on grouping of type characteristics of objects present in the audio and / or video data. Background technique

[0002] Automatic summarization of audio and / or video data aims to efficiently represent audio and / or video data for easier browsing, searching and, more generally, content management. Automatically generated summaries can support users in searching and navigating in large data documents, for example, in order to make more efficient decisions when acquiring, moving, deleting, etc. content.

[0003] For example, the automatic generation of video previews and video summaries requires the positioning of video segments with main actors or characters. Current systems use facial and voice recognition technology to identify people appearing on video.

[0004] Patent Publication No. US2003 / 0123712 discloses a m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More