Video description method, system and device

A video description method and video technology, applied in the field of automation.

Active Publication Date: 2019-07-16
HUAWEI TECH CO LTD +1

AI Technical Summary

Problems solved by technology

Further, the embodiment of the present invention addresses the problem of modeling long-range visual and textual dependencies by using a visual memory, a text memory and an attribute memory.

Embodiment Construction

[0033] Embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.

[0034] It should be understood that when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or collections thereof.

[0035] It should also be understood that the terminology used in the description of the present invention is for the purpose of describing particular embodiments only and is not intended to be limiting of the present invention. As used in this specification and the appended claims, the singular forms "a", "an" and "the" are intended to include plural referents unless the context clearly dictates otherwise.

[0036] It should also be further understood...

Abstract

The embodiment of the invention provides a video description method, system and device. The method comprises: extracting, through a video encoder based on a convolutional neural network, the visual feature representation of the video frame at the current moment in a to-be-described video; writing that visual feature representation into the visual memory at the current moment; reading attribute information from the attribute memory at the current moment according to the visual memory and the text memory at the current moment; and generating a predicted word, using a text decoder based on a long short-term memory network, according to the word at the previous moment and the attribute information read at the current moment. By adopting this multimodal description method, the embodiment can improve the flexibility of video description.
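
The per-moment loop described in the abstract can be sketched in a few lines of Python. This is only an illustrative toy under stated assumptions, not the patented implementation: `encode_frame` stands in for the CNN encoder, `decode_word` stands in for the LSTM decoder, and the way the visual and text memories are combined into a query (mean visual slot minus mean text slot, followed by a softmax over dot products) is a hypothetical choice, since the excerpt does not specify the read mechanism. All function names and the toy vectors are invented for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def mean_vector(vectors, dim):
    """Element-wise mean of a list of vectors; zeros if the list is empty."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def encode_frame(frame):
    """Placeholder for the CNN video encoder (assumption): L2-normalise the frame."""
    norm = math.sqrt(sum(x * x for x in frame)) or 1.0
    return [x / norm for x in frame]

def read_attributes(visual_memory, text_memory, attribute_memory, dim):
    """Soft read from the attribute memory.

    Hypothetical query: what has been seen (visual) minus what has
    already been said (text). Returns one weight per attribute.
    """
    v = mean_vector(visual_memory, dim)
    t = mean_vector(text_memory, dim)
    query = [a - b for a, b in zip(v, t)]
    names = list(attribute_memory)
    scores = [sum(q * k for q, k in zip(query, attribute_memory[n])) for n in names]
    return dict(zip(names, softmax(scores)))

def decode_word(previous_word, attribute_weights):
    """Placeholder for the LSTM text decoder (assumption): pick the
    highest-weighted attribute that does not repeat the previous word."""
    ranked = sorted(attribute_weights, key=attribute_weights.get, reverse=True)
    for word in ranked:
        if word != previous_word:
            return word
    return previous_word

def describe(frames, attribute_memory, dim):
    """Run the encode, write, read, decode loop once per frame."""
    visual_memory, text_memory, words = [], [], []
    previous_word = "<start>"
    for frame in frames:
        visual_memory.append(encode_frame(frame))        # write visual memory
        weights = read_attributes(visual_memory, text_memory,
                                  attribute_memory, dim)  # read attribute memory
        word = decode_word(previous_word, weights)        # predict next word
        text_memory.append(attribute_memory[word])        # write text memory
        words.append(word)
        previous_word = word
    return words

# Toy data: three attributes with one-hot keys, three frames.
attributes = {"dog": [1.0, 0.0, 0.0], "runs": [0.0, 1.0, 0.0], "grass": [0.0, 0.0, 1.0]}
frames = [[2.0, 0.0, 0.0], [0.0, 3.0, 0.0], [0.0, 0.0, 1.0]]
print(describe(frames, attributes, dim=3))  # ['dog', 'runs', 'grass']
```

Because every earlier frame and every earlier word stays in its memory, each read can draw on the whole history rather than only the most recent step, which is the intuition behind using memories for long-range dependencies. A real system would replace the placeholders with learned networks and learned read/write operations.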

Description

Technical field

[0001] The present application relates to the field of automation technology, in particular to a video description method, system and device.

Background

[0002] The automatic description of video content is a major challenge in computer vision and machine learning and has a wide range of applications, for example describing movie content for blind users, video retrieval, and human-computer interaction. To describe video content automatically, an algorithm needs a comprehensive understanding of the video content, a powerful language model, and the ability to accurately map elements in the video to the language space.

[0003] However, most current video description methods rely on fixed sentence templates to describe video information, so the output descriptions are stiff.

Contents of the invention

[0004] The present applic...

Application Information

IPC(8): G06F16/738, G06K9/00, G06K9/62, G06N3/04
CPC: G06V20/47, G06F16/738, G06N3/045, G06F18/214
Inventor: 蔡海军, 陈院林, 王亮, 王威
Owner: HUAWEI TECH CO LTD