Unlock instant, AI-driven research and patent intelligence for your innovation.

Continuous sign language recognition method based on space-time residual network and time sequence convolution network

A convolutional network and recognition method technology, applied in the field of continuous sign language recognition based on spatio-temporal residual network and time-series convolutional network, can solve the problem of insufficient short-term spatio-temporal feature extraction of two-dimensional convolutional neural network, large amount of calculation, and output words Issues such as lack of correlation

Active Publication Date: 2021-02-05
HEBEI UNIV OF TECH +1
View PDF6 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a continuous sign language recognition method based on spatio-temporal residual network and temporal convolutional network. Shortcomings and insufficient extraction of short-term spatio-temporal features using two-dimensional convolutional neural networks; time-series convolutional networks enhance the time correlation between block-level features, which can solve the problem of long-term dependence in RNN networks to a certain extent, and then To a certain extent, it solves the problem of lack of correlation between output words caused by the inherent conditional independence assumption of CTC

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Continuous sign language recognition method based on space-time residual network and time sequence convolution network
  • Continuous sign language recognition method based on space-time residual network and time sequence convolution network
  • Continuous sign language recognition method based on space-time residual network and time sequence convolution network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0079] In this embodiment, based on the spatiotemporal residual network and the temporal convolutional network, the specific steps are as follows:

[0080] The first step is to input the video V, perform preprocessing, and obtain the hypergraph sequence I:

[0081] Input video V=(v 1 ,...,v i ,…v T ), where T is the frame number of the input video V, which are the first frame, ..., the i-th frame, ..., the T-th frame of the original sign language image sequence, and the size of each frame of the input video V is scaled to 224× 224 pixels, and each pixel value of the video V is normalized to (0, 1), and then the 5 adjacent frames of the continuous sign language video are concatenated, and the video sequence after such preprocessing is recorded as super graph sequence Wherein N=T / 5, t=1,..., N, t is the hypergraph serial number after the concatenation of a group of tth 5 frames, and the dimension of hypergraph sequence I is (N, 15, 224, 225), The hypergraph sequence I is e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a continuous sign language recognition method based on a space-time residual network and a time sequence convolutional network. According to the method, the space-time residual network is adopted, so that the defect of large calculated amount caused by completely using a three-dimensional convolutional neural network and the defect of insufficient short-term space-time feature extraction caused by completely using a two-dimensional convolutional neural network are overcome; time correlation between block-level features is enhanced through the sequential convolutional network, the problem of long-term dependence in the RNN can be solved to a certain extent, and then the problem of correlation loss between output words caused by CTC inherent conditional independencehypothesis is solved to a certain extent.

Description

technical field [0001] The technical solution of the present invention relates to the fields of deep learning image processing and pattern recognition, specifically a continuous sign language recognition method based on a spatiotemporal residual network and a temporal convolutional network. Background technique [0002] Continuous sign language recognition aims to recognize each isolated word sign language in continuous sign language videos. Specifically, it is to use computers to use deep learning-related methods to extract sign language features in continuous sign language videos, and then perform end-to-end recognition. Sign language recognition involves multiple research fields such as video acquisition and processing, computer vision, human-computer interaction, pattern recognition, and natural language processing. [0003] The sign language used by the deaf-mute consists of a large number of gestures, body movements, and facial expressions. One of the difficulties in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/04
CPCG06V40/28G06N3/045G06F18/213G06F18/214G06F18/2415
Inventor 于明高阳薛翠红贾静丽王书韵刘月豪阎刚
Owner HEBEI UNIV OF TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More