Entity identification method and device for video scene, electronic equipment and medium
A technology for entity recognition in video scenes, applied in the computer field. It addresses low recognition accuracy and the inability to meet different business needs, and aims to provide higher accuracy and stronger versatility.
Examples
Embodiment 1
[0068] Figure 1 is a schematic flowchart of the video scene entity recognition method provided in Embodiment 1 of the present application. This embodiment applies to identifying the entities included in a video to be processed. The method can be executed by the video scene entity recognition device provided in Embodiment 3 of the present application, and the device can be implemented in software and/or hardware. As shown in Figure 1, the method may include:
[0069] S101. Acquire a target video to be processed and at least one target modality.
[0070] The format of the target video includes, but is not limited to, AVI, FLV, RMVB and WMV, and the target video is uploaded to the server by the user through the client. The target modality reflects which kind of information in the target video the user wants entity recognition to be performed on. For example, if the user wants to perform entity recognition on the text information of the target video, t...
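As an illustration of step S101, the following is a minimal sketch of how a server might validate an incoming request. It is not taken from the application: `RecognitionRequest`, `build_request` and `KNOWN_FORMATS` are hypothetical names, the three modality values are the ones named in step S203 of Embodiment 2, and treating the four listed formats as a strict allow-list is an assumption made only to keep the example small (the description says the list is not exhaustive).

```python
from dataclasses import dataclass
from enum import Enum
from pathlib import Path
from typing import FrozenSet, Iterable


class Modality(Enum):
    """Modalities named in step S203 of Embodiment 2."""
    TEXT = "text"
    VISUAL = "visual"
    AUDIO = "audio"


# Formats listed in paragraph [0070]. Using them as an allow-list is an
# assumption of this sketch; the description calls the list non-exhaustive.
KNOWN_FORMATS = frozenset({".avi", ".flv", ".rmvb", ".wmv"})


@dataclass(frozen=True)
class RecognitionRequest:
    """Hypothetical container for the inputs acquired in S101."""
    video_path: str
    modalities: FrozenSet[Modality]


def build_request(video_path: str, modalities: Iterable[str]) -> RecognitionRequest:
    """Check the video format and require at least one target modality."""
    suffix = Path(video_path).suffix.lower()
    if suffix not in KNOWN_FORMATS:
        raise ValueError(f"unsupported video format: {suffix!r}")
    # Modality(...) raises ValueError for names outside the enum.
    parsed = frozenset(Modality(name.strip().lower()) for name in modalities)
    if not parsed:
        raise ValueError("at least one target modality is required")
    return RecognitionRequest(video_path, parsed)
```

A call such as `build_request("clip.avi", ["text", "audio"])` would yield a request carrying the text and audio modalities, while an empty modality list is rejected.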
Embodiment 2
[0106] Figure 2 is a schematic flowchart of the video scene entity recognition method provided in Embodiment 2 of the present application. This embodiment provides a specific implementation of Embodiment 1 above. As shown in Figure 2, the method may include:
[0107] S201. Acquire a target video to be processed and at least one target modality.
[0108] S202. Extract at least one target modality feature of the target video.
[0109] S203. If the target modality includes a text modality, execute S204; if the target modality includes a visual modality, execute S208; if the target modality includes an audio modality, execute S209.
[0110] S204. Invoke the video domain classification algorithm provided by the server to determine the target domain to which the target video belongs.
[0111] The target domain reflects the category to which the content of the target video belongs, such as film and television, games, or sports. The video domain classifi...
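The branching in step S203 can be sketched as a small lookup. This is an illustrative reading of the flow only: the step labels S204, S208 and S209 come from the text above, while `NEXT_STEP`, `next_steps` and the fixed text, visual, audio ordering of the result are assumptions of this sketch.

```python
from typing import Iterable, List

# Branch targets as stated in S203: text -> S204, visual -> S208, audio -> S209.
NEXT_STEP = {"text": "S204", "visual": "S208", "audio": "S209"}


def next_steps(target_modalities: Iterable[str]) -> List[str]:
    """Return the step to execute for each requested modality.

    Several modalities may be requested at once, so several branches
    can be returned. The output order follows NEXT_STEP, which is a
    choice made for this sketch rather than something the text specifies.
    """
    requested = set(target_modalities)
    unknown = requested - set(NEXT_STEP)
    if unknown:
        raise ValueError(f"unknown target modality: {sorted(unknown)}")
    return [step for name, step in NEXT_STEP.items() if name in requested]
```

For a request covering both the text and audio modalities, the function returns the text branch (S204) and the audio branch (S209).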
Embodiment 3
[0167] Figure 3 is a schematic structural diagram of the video scene entity recognition device 300 provided in Embodiment 3 of the present application. The device can execute the video scene entity recognition method provided in any embodiment of the present application, and has the corresponding functional modules and beneficial effects. As shown in Figure 3, the device may include:
[0168] A target modality acquisition module 301, configured to acquire a target video to be processed and at least one target modality;
[0169] A target modality feature extraction module 302, configured to extract at least one target modality feature of the target video;
[0170] A target entity recognition algorithm determination module 303, configured to determine the target entity recognition algorithm to be used from at least two candidate entity recognition algorithms provided by the server according to the at least one target modality; wherein the at least two candidate entity recognition algori...
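The role of module 303 can be sketched as a selection over a registry of candidates. The application text available here does not say how candidates are organized, so keying them by modality name is an assumption of this sketch, and `EntityRecognizer` and `select_algorithms` are hypothetical names rather than anything defined in the application.

```python
from typing import Callable, Dict, Iterable, List

# Hypothetical signature: a recognizer takes extracted modality features
# and returns the entity names it found.
EntityRecognizer = Callable[[object], List[str]]


def select_algorithms(
    target_modalities: Iterable[str],
    candidates: Dict[str, EntityRecognizer],
) -> Dict[str, EntityRecognizer]:
    """Pick one recognizer per requested modality from the candidates.

    Assumes the server registers candidates under modality names; the
    source only states that selection depends on the target modality.
    """
    if len(candidates) < 2:
        raise ValueError("at least two candidate algorithms are expected")
    requested = list(target_modalities)
    missing = [name for name in requested if name not in candidates]
    if missing:
        raise KeyError(f"no candidate algorithm for: {missing}")
    return {name: candidates[name] for name in requested}
```

Keeping the selection separate from feature extraction (module 302) mirrors the module split in the device description: new candidates can be registered without touching the extraction code.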