Gastroscope video part identification network structure based on Transformer
Patent Information
- Authority / Receiving Office: CN · China
- Current Assignee / Owner: ZHONGSHAN HOSPITAL FUDAN UNIV
- Publication Date: 2021-07-27
Description
Technical Field
[0001] The invention relates to video recognition technology, and in particular to a Transformer-based network structure for recognizing anatomical parts in gastroscope video.
Background Art
[0002] At present, existing approaches to gastroscope video recognition are mostly based on fully convolutional network models that classify single-frame images, such as the DenseNet and EfficientNet series of models. These methods use convolutional layers to extract features and then use the extracted features to obtain a classification result for a single video frame. However, images of the stomach and digestive tract share many common features, and it is difficult to learn the temporal characteristics of video data and the global characteristics of digestive tract organs from a single frame. These methods are therefore weak at judging the overall category of a gastroscope video. This leads to poor classification accuracy for gastroscop...