A sign language video translation method based on the fusion of a time-domain convolutional network and a recurrent neural network

Convolutional network and neural network technology, applied in natural language translation, special data processing applications, instruments, etc., that addresses problems such as the difficulty of learning the correspondence between actions and word semantics, the difficulty of learning from videos, and the resulting loss of recognition accuracy.

Active Publication Date: 2019-01-11

HEFEI UNIV OF TECH

Cites: 12. Cited by: 24.

Problems solved by technology

The drawback of these features is that they are extracted from each video frame in isolation, ignoring the correlation and change information between frames in a continuous video, which degrades subsequent recognition.

[0005] Commonly used sequence-learning models include support vector machines, dynamic time warping, hidden Markov models, and other traditional models. These models are better suited to recognizing and translating isolated sign language words. They struggle to learn from continuous videos containing multiple gestures and to learn the semantic correspondence between actions and words, so they cannot effectively translate continuous sign language sentences.
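To illustrate why a traditional model such as dynamic time warping fits isolated words, the following is a minimal sketch of the textbook DTW distance. It matches one whole sequence against one whole template, so it tolerates a slower signer but assumes the input has already been segmented into a single word. The one-dimensional "features" and the two template words are invented for illustration and do not come from the patent.

```python
def dtw_distance(a, b):
    """Textbook dynamic time warping distance between two 1-D sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] holds the best alignment cost of a[:i] against b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # stretch b
                                 cost[i][j - 1],      # stretch a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]

# Hypothetical per-word templates (toy 1-D trajectories).
templates = {"hello": [0, 1, 2, 1, 0], "thanks": [0, -1, -2, -1, 0]}

# The same "hello" gesture signed at half speed.
slow_hello = [0, 0, 1, 1, 2, 2, 1, 1, 0, 0]

best = min(templates, key=lambda w: dtw_distance(slow_hello, templates[w]))
print(best)  # hello
```

Classifying a continuous sentence this way would first require finding the word boundaries, which is the limitation paragraph [0005] points to.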

Embodiment Construction

[0051] In this example, as shown in Figure 1, a sign language video translation method based on the fusion of a time-domain convolutional network and a recurrent neural network fully extracts the spatial and temporal features of the sign language video, effectively learns the features of highly discriminative key actions, and avoids interference during model learning from factors such as the signer's body shape, signing speed, and signing habits. Its steps include:

[0052] First, the original sign language video is preprocessed to extract sign language video features. Next, two different network structures (a time-domain convolutional network, TCN, and a bidirectional recurrent neural network, BGRU) are used in parallel to encode the continuous sign language video features and to output the word generation probabilities for each clip. Then the intermediate-layer outputs of the two sub-networks are concatenated and fed into the fusion network (FL) to learn and generate a word s...
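The data flow in [0052] can be sketched at the level of array shapes. The NumPy code below is an illustrative sketch only, not the patented implementation: the layer counts, sizes, random weights, and the single-hidden-layer form of the fusion network are all assumptions, and the clip features are random stand-ins for the preprocessed video features.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def temporal_conv(x, w, b):
    """1-D convolution along time with zero 'same' padding, then ReLU.
    x: (T, D_in), w: (K, D_in, D_out), b: (D_out,) -> (T, D_out)."""
    k = w.shape[0]
    xp = np.pad(x, ((k // 2, k // 2), (0, 0)))
    out = np.stack([np.tensordot(xp[t:t + k], w, axes=([0, 1], [0, 1]))
                    for t in range(x.shape[0])])
    return np.maximum(out + b, 0.0)

def gru_params(d_in, h):
    """Random GRU weights: W* map the input, U* map the previous state."""
    return {n: rng.normal(0, 0.1, (d_in if n[0] == "W" else h, h))
            for n in ("Wz", "Wr", "Wh", "Uz", "Ur", "Uh")}

def gru_pass(x, p, reverse=False):
    """Single-direction GRU over x: (T, D_in) -> (T, H)."""
    T = x.shape[0]
    h = np.zeros(p["Uz"].shape[0])
    out = np.zeros((T, h.size))
    for t in (range(T - 1, -1, -1) if reverse else range(T)):
        z = sigmoid(x[t] @ p["Wz"] + h @ p["Uz"])            # update gate
        r = sigmoid(x[t] @ p["Wr"] + h @ p["Ur"])            # reset gate
        cand = np.tanh(x[t] @ p["Wh"] + (r * h) @ p["Uh"])   # candidate state
        h = (1.0 - z) * h + z * cand
        out[t] = h
    return out

# Illustrative sizes: clips, feature dim, hidden dim, word-list size.
T, D, H, V = 8, 16, 12, 5
clips = rng.normal(size=(T, D))  # stand-in for extracted clip features

# Branch 1: time-domain convolutional network (one layer in this sketch).
tcn_hidden = temporal_conv(clips, rng.normal(0, 0.1, (3, D, H)), np.zeros(H))
tcn_probs = softmax(tcn_hidden @ rng.normal(0, 0.1, (H, V)))

# Branch 2: bidirectional GRU (forward and backward states concatenated).
bgru_hidden = np.concatenate(
    [gru_pass(clips, gru_params(D, H)),
     gru_pass(clips, gru_params(D, H), reverse=True)], axis=1)
bgru_probs = softmax(bgru_hidden @ rng.normal(0, 0.1, (2 * H, V)))

# Fusion: concatenate the intermediate outputs and pass them through FL
# (assumed here to be one tanh hidden layer plus a softmax output).
fused_in = np.concatenate([tcn_hidden, bgru_hidden], axis=1)
fl_hidden = np.tanh(fused_in @ rng.normal(0, 0.1, (3 * H, H)))
fl_probs = softmax(fl_hidden @ rng.normal(0, 0.1, (H, V)))

print(tcn_probs.shape, bgru_probs.shape, fl_probs.shape)
```

Each of the three outputs is a per-clip probability distribution over the word list. In a trained system the weights would be learned jointly rather than drawn at random.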

Abstract

The invention discloses a continuous sign language video translation method based on the fusion of a time-domain convolutional network and a recurrent neural network. The method comprises the following steps: feature extraction from the sign language video and construction of a word list; processing by the time-domain convolutional network (TCN); processing by the bidirectional recurrent neural network (BGRU); mapping of features to words; processing by the fusion network (FL); optimization of the parameters of the network model that fuses the TCN and the BGRU; and fusion and decoding of the word coding vectors. By using network structures with different perspectives, so that different networks learn and analyze different manifestations of the data, the invention can effectively overcome the communication barrier caused by inaccurate sign language translation, further improve the accuracy of sign language translation, and increase its robustness.
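The abstract does not spell out how the word coding vectors are fused and decoded. As one possible reading, the sketch below averages the per-clip word distributions from the three branches, picks the most probable word for each clip, and merges consecutive repeats. The averaging rule, the repeat merging, the word list, and all probabilities are assumptions made for illustration, not details taken from the patent.

```python
# Hypothetical word list built from the training sentences.
vocab = ["I", "love", "sign", "language"]

def fuse_and_decode(branch_probs, vocab):
    """Average per-clip distributions across branches, take the best word
    per clip, and merge consecutive repeats into a single word."""
    n_clips = len(branch_probs[0])
    words, prev = [], None
    for t in range(n_clips):
        avg = [sum(b[t][v] for b in branch_probs) / len(branch_probs)
               for v in range(len(vocab))]
        best = max(range(len(vocab)), key=avg.__getitem__)
        if best != prev:
            words.append(vocab[best])
        prev = best
    return words

# Toy per-clip distributions for four clips from each branch.
tcn = [[0.7, 0.1, 0.1, 0.1], [0.6, 0.2, 0.1, 0.1],
       [0.1, 0.6, 0.2, 0.1], [0.1, 0.1, 0.2, 0.6]]
bgru = [[0.6, 0.2, 0.1, 0.1], [0.3, 0.5, 0.1, 0.1],
        [0.1, 0.7, 0.1, 0.1], [0.1, 0.1, 0.3, 0.5]]
fl = [[0.8, 0.1, 0.05, 0.05], [0.6, 0.2, 0.1, 0.1],
      [0.2, 0.6, 0.1, 0.1], [0.05, 0.05, 0.2, 0.7]]

sentence = fuse_and_decode([tcn, bgru, fl], vocab)
print(sentence)  # ['I', 'love', 'language']
```

On the second clip the BGRU branch alone would pick "love", but the averaged distribution still favors "I". This is the kind of disagreement between perspectives that combining several networks is meant to smooth out.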

Description

Technical field

[0001] The invention belongs to the technical field of computer vision and involves technologies such as pattern recognition, natural language processing, and artificial intelligence. Specifically, it relates to a continuous sign language video translation method based on the fusion of a time-domain convolutional network and a recurrent neural network.

Background

[0002] Sign language is a means of communication between hearing people and deaf people. It usually consists of a series of meaningful movements made up of the signer's body movements, joint movements, and facial expressions. However, there are often communication barriers between sign language users and hearing people who have not learned sign language. How to capture information such as a signer's gestures and convert it into a form that hearing people can understand has therefore gradually attracted attention. A good sign language translation system...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G06K9/00, G06F17/28

CPC: G06F40/58, G06V40/28

Inventor: 郭丹, 王硕, 汪萌

Owner: HEFEI UNIV OF TECH
