Unlock instant, AI-driven research and patent intelligence for your innovation.

Continuous sign language recognition method based on spatiotemporal residual network and temporal convolutional network

A technology of convolutional network and recognition method, applied in the field of continuous sign language recognition based on spatiotemporal residual network and time series convolutional network, can solve the problem of insufficient short-term spatiotemporal feature extraction of two-dimensional convolutional neural network, large amount of calculation, output word To avoid problems such as lack of correlation, to avoid the huge amount of parameters, reduce the amount of parameters, and improve the depth

Active Publication Date: 2022-08-05
HEBEI UNIV OF TECH +1
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a continuous sign language recognition method based on spatio-temporal residual network and temporal convolutional network. Shortcomings and insufficient extraction of short-term spatio-temporal features using two-dimensional convolutional neural networks; time-series convolutional networks enhance the time correlation between block-level features, which can solve the problem of long-term dependence in RNN networks to a certain extent, and then To a certain extent, it solves the problem of lack of correlation between output words caused by the inherent conditional independence assumption of CTC

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Continuous sign language recognition method based on spatiotemporal residual network and temporal convolutional network
  • Continuous sign language recognition method based on spatiotemporal residual network and temporal convolutional network
  • Continuous sign language recognition method based on spatiotemporal residual network and temporal convolutional network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0079] Based on the spatiotemporal residual network and the time series convolutional network in this embodiment, the specific steps are as follows:

[0080] The first step is to input the video V, perform preprocessing, and obtain the hypergraph sequence I:

[0081] Input video V=(v 1 ,...,v i , ... v T ), where T is the frame sequence number of the input video V, which are the first frame, ..., the i-th frame, ..., the T-th frame of the original sign language image sequence, respectively. The size of each frame of the input video V is scaled to 224× 224 pixels, and normalize each pixel value of video V to (0, 1), then concatenate 5 adjacent frames of continuous sign language video, and record the video sequence after preprocessing as super sequence of graphs where N=T / 5, t=1,...,N, t is the serial number of the t-th 5-frame concatenated hypergraph, and the dimension of the hypergraph sequence I is (N, 15, 224, 225), The hypergraph sequence I is expressed as the followi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention is a continuous sign language recognition method based on a spatiotemporal residual network and a time-series convolutional network. The method adopts a spatiotemporal residual network to overcome the disadvantage of using a three-dimensional convolutional neural network with a large amount of computation and the short-term use of a two-dimensional convolutional neural network. The defect of insufficient spatiotemporal feature extraction; the time-series convolutional network enhances the temporal correlation between block-level features, which can solve the long-term dependence problem in the RNN network to a certain extent, and then solve the inherent conditional independence of CTC to a certain extent. Hypothesis brings the problem of missing correlation between output words.

Description

technical field [0001] The technical solution of the present invention relates to the field of deep learning image processing and pattern recognition, and specifically relates to a continuous sign language recognition method based on a spatiotemporal residual network and a time series convolutional network. Background technique [0002] Continuous sign language recognition aims to identify each isolated word sign in a continuous sign language video. Specifically, it uses a computer to use deep learning related methods to extract sign language features in continuous sign language videos, and then perform end-to-end recognition. Sign language recognition involves many research fields such as video acquisition and processing, computer vision, human-computer interaction, pattern recognition, and natural language processing. [0003] The sign language used by the deaf-mute is composed of a large number of gestures, body movements and facial expressions. A difficult problem in co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06V40/20G06V10/774G06V10/764G06V10/77G06V10/82G06N3/04
CPCG06V40/28G06N3/045G06F18/213G06F18/214G06F18/2415
Inventor 于明高阳薛翠红贾静丽王书韵刘月豪阎刚
Owner HEBEI UNIV OF TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More