Multi-scale target detection method and system for smoothly transmitting semantic information
A multi-scale target detection technique that smoothly transmits semantic information, applied in the field of computer vision. It addresses the insufficient robustness and weak generalization of existing detectors and their limited applicability, while reducing the number of parameters and improving operating efficiency.
Examples
Embodiment 1
[0106] Step 1, image data preparation.
[0107] The dataset used here for training, validation and testing is the public dataset Pascal VOC 2007. It covers 21 classes (20 foreground classes and 1 background class). The images are RGB with three channels of 8 bits each, so every image has a bit depth of 24. Objects in real-world scenes are often subject to disturbances that make them harder to detect than the samples in the dataset. A model trained only on such simple data is difficult to apply to complex and varied real-world scenes; that is, its generalization ability is weak. To overcome this problem, offline data augmentation is applied before training. The augmentations are flipping, rotation, cropping, scaling, shifting, edge filling, color space conversion, noise injection, blurring, and random erasing, each applied so that the label information stays valid, in order to strengthen the generalization ability of the model.
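As an illustration of label-preserving augmentation, the following is a minimal sketch of three of the operations named above (flipping, noise, random erasing). It is not the implementation used in the patent. The function names, the parameters `sigma` and `max_frac`, and the box convention (continuous `[xmin, ymin, xmax, ymax]` coordinates in pixels) are assumptions made for this example.

```python
import numpy as np


def hflip(image, boxes):
    """Flip an HxWx3 image left-right and mirror its boxes.

    Assumption: boxes is an (N, 4) float array of [xmin, ymin, xmax, ymax]
    in continuous pixel coordinates, so a point x maps to W - x.
    """
    width = image.shape[1]
    flipped = image[:, ::-1, :].copy()
    out = boxes.copy()
    out[:, 0] = width - boxes[:, 2]  # new xmin comes from the old xmax
    out[:, 2] = width - boxes[:, 0]  # new xmax comes from the old xmin
    return flipped, out


def add_gaussian_noise(image, rng, sigma=8.0):
    """Add zero-mean Gaussian noise; boxes are unaffected by this operation."""
    noisy = image.astype(np.float32) + rng.normal(0.0, sigma, image.shape)
    return np.clip(noisy, 0, 255).astype(np.uint8)


def random_erase(image, rng, max_frac=0.3):
    """Overwrite one random rectangle with random pixels; boxes are unaffected."""
    h, w, c = image.shape
    eh = int(rng.integers(1, max(2, int(h * max_frac))))
    ew = int(rng.integers(1, max(2, int(w * max_frac))))
    top = int(rng.integers(0, h - eh + 1))
    left = int(rng.integers(0, w - ew + 1))
    out = image.copy()
    out[top:top + eh, left:left + ew] = rng.integers(
        0, 256, (eh, ew, c), dtype=np.uint8
    )
    return out
```

Only the geometric operation has to touch the labels: the flip mirrors the x-coordinates of each box, while noise and erasing change pixel values and leave the boxes as they are. Rotation, cropping, scaling and shifting would need analogous box transforms.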
[0108] Step 2. Network t...