Entity identification method and device for video scene, electronic equipment and medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for entity recognition and video scenes, applied in the computer field, can solve problems such as inability to meet different business needs and low recognition accuracy, and achieve the effects of high accuracy, strong versatility, and improved accuracy

Active Publication Date: 2019-12-03

BEIJING BAIDU NETCOM SCI & TECH CO LTD

View PDF7 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The embodiment of the present application provides a video scene entity recognition method, device, electronic equipment and medium, which can solve the problem of low accuracy of entity recognition in existing methods and cannot meet different business needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0068] figure 1 It is a schematic flowchart of a video scene entity recognition method provided in Embodiment 1 of the present application. This embodiment is applicable to the situation of identifying entities included in the video to be processed. The method can be executed by the video scene entity recognition device provided in Embodiment 3 of the present application, and the device can be implemented by software and / or hardware. Such as figure 1 As shown, the method may include:

[0069] S101. Acquire a target video to be processed and at least one target modality.

[0070] Wherein, the format of the target video includes but not limited to AVI, FLV, RMVB and WMV formats, and the target video is uploaded to the server by the user through the client. The target mode reflects what kind of information the user wants to perform entity recognition on the target video. For example, if the user wants to perform entity recognition on the text information of the target video, t...

Embodiment 2

[0106] figure 2 It is a schematic flowchart of a video scene entity recognition method provided in Embodiment 2 of the present application. This embodiment provides a specific implementation manner for the first embodiment above, such as figure 2 As shown, the method may include:

[0107] S201. Acquire a target video to be processed and at least one target modality.

[0108] S202. Extract at least one target modality feature of the target video.

[0109] S203. If the target modality includes a text modality, execute S204; if the target modality includes a visual modality, execute S208; if the target modality includes an audio modality, execute S209.

[0110] S204. Invoke the video domain classification algorithm provided by the server to determine the target domain to which the target video belongs.

[0111] Wherein, the target field reflects the category to which the target video content belongs, such as film and television, games, and sports. The video domain classifi...

Embodiment 3

[0167] image 3 It is a schematic structural diagram of a video scene entity recognition device 300 provided in Embodiment 3 of the present application, which can execute a video scene entity recognition method provided in any embodiment of the present application, and has corresponding functional modules and beneficial Effect. Such as image 3As shown, the device may include:

[0168] A target modality acquisition module 301, configured to acquire a target video to be processed and at least one target modality;

[0169] Target modality feature extraction module 302, for extracting at least one target modality feature of the target video;

[0170] The target entity recognition algorithm determination module 303 is configured to determine the target entity recognition algorithm to be used from at least two candidate entity recognition algorithms provided by the server according to the at least one target modality; wherein, the at least two candidate entity recognition algori...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an entity identification method and device for a video scene, electronic equipment and a medium, and relates to the field of artificial intelligence. The method according to the invention comprises the steps of obtaining a to-be-processed target video and at least one target mode; extracting at least one target modal feature of the target video; determining a target entityidentification algorithm to be used from at least two candidate entity identification algorithms provided by the server according to the at least one target mode; wherein the at least two candidate entity identification algorithms are deployed in the server; and calling a target entity identification algorithm to perform entity identification on the at least one target modal feature to obtain a target entity included in the target video. According to the method and the device, identification of different modal entities of the target video is realized, the identification result accuracy is high, different service requirements can be met, and the universality is high.

Description

technical field [0001] The embodiments of the present application relate to computer technology, especially artificial intelligence technology, and specifically design methods, devices, electronic equipment and media for entity recognition of video scenes. Background technique [0002] With the development of information technology and the increasing popularity of various video apps, video will become the most important way of information dissemination, widely used in all aspects of interpersonal communication, social life, and industrial production. Faced with massive video content, manual processing alone cannot be completed. Therefore, it is urgent to realize intelligent understanding of video content through computer technology, and then automatically and intelligently classify and label videos. [0003] The traditional method is to perform entity recognition on the target video in a unimodal manner, such as performing entity recognition through pure video text features ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06F16/783

CPCG06F16/7837

Inventor王述任可欣冯知凡张扬朱勇

OwnerBEIJING BAIDU NETCOM SCI & TECH CO LTD

Entity identification method and device for video scene, electronic equipment and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology