521 results about "Image description" patented technology

An image description is a textual, audio or graphical content portraying the image in a representation intelligible by the addressees. The description should be comprehensive as well as perceptible by the target audience.

Image intelligent mode recognition and searching method

The invention proposes an intelligent image pattern recognition and search method. The method establishes an image sample training set database and combines basic text search engine technology with basic image content query technology, so that a web crawler can search the Internet for images and parse URL information, fetching each image URL and its related information into a local primary database. The images are pre-processed (preliminary filtering, decompression, and pre-classification); color, texture, and shape features are then extracted from them to obtain the corresponding feature vector sets. The images are saved, together with their URL information, into a basic image database and indexed. Feature vector similarity is computed between the images in the basic image database and the sample training sets, and the classified images are saved into an image classification database. Finally, keywords or an image description entered by the user are converted into an index vector, its similarity to the image feature vectors in the image classification database is computed, and the retrieval results are returned to the user.
Owner:SHANGHAI XINSHENG ELECTRONICS TECH
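
The final retrieval step of the abstract above (compare a query vector against stored image feature vectors and return ranked results) can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes a coarse RGB color histogram as the only feature and cosine similarity as the similarity measure, neither of which the abstract specifies. The function names and example URLs are hypothetical.

```python
import math

def color_histogram(pixels, bins=4):
    """Quantize each RGB channel into `bins` levels and return a normalized histogram."""
    hist = [0.0] * (bins ** 3)
    for r, g, b in pixels:
        # c * bins // 256 maps a 0..255 channel value to a bin index 0..bins-1
        idx = (r * bins // 256) * bins * bins + (g * bins // 256) * bins + (b * bins // 256)
        hist[idx] += 1.0
    total = sum(hist) or 1.0
    return [h / total for h in hist]

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def rank_images(query_vec, index):
    """Return (url, score) pairs sorted by descending similarity to the query vector."""
    scored = [(url, cosine_similarity(query_vec, vec)) for url, vec in index.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Hypothetical toy "classification database": URL -> feature vector
red_image = [(250, 10, 10)] * 8
blue_image = [(10, 10, 250)] * 8
index = {
    "http://example.com/red.jpg": color_histogram(red_image),
    "http://example.com/blue.jpg": color_histogram(blue_image),
}

# A query image that is mostly red should rank the red image first
query = color_histogram([(250, 10, 10)] * 6 + [(10, 10, 250)] * 2)
results = rank_images(query, index)
```

A production system would combine several feature types (the abstract names color, texture, and shape) and would use an index structure rather than a linear scan.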

Image description generation method based on depth LSTM network

The invention relates to an image description generation method based on a deep LSTM network, comprising the following steps: (1) extracting the CNN features of each image in an image description dataset, and obtaining the embedding vectors of the words in the reference sentence that describes the image; (2) building a two-layer LSTM network and connecting it in series with the CNN to form a multimodal LSTM model; (3) training the multimodal LSTM model by joint training; (4) gradually increasing the number of LSTM layers in the multimodal LSTM model, retraining each time a layer is added, to finally obtain an image description model with gradual multi-objective optimization and multilayer probability fusion; and (5) fusing the probability scores output by the branches of the multilayer LSTM network in that model, and outputting the word with the maximum probability through joint decision. Compared with the prior art, the method offers multiple layers, improved expressive ability, effective updating, and high accuracy.
Owner:TONGJI UNIV
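
Step (5) of the abstract above, fusing per-branch probability scores and emitting the most probable word, might look like the sketch below. The abstract does not state the fusion rule, so a weighted average (uniform by default) is assumed here; the function name, vocabulary, and probabilities are hypothetical.

```python
def fuse_and_pick(branch_probs, vocab, weights=None):
    """Fuse per-branch word distributions by weighted averaging and pick the argmax word.

    branch_probs: list of probability lists, one per LSTM branch, each aligned with vocab.
    weights: optional per-branch weights (assumed to sum to 1); uniform if omitted.
    """
    n = len(branch_probs)
    if weights is None:
        weights = [1.0 / n] * n
    fused = [0.0] * len(vocab)
    for w, probs in zip(weights, branch_probs):
        for i, p in enumerate(probs):
            fused[i] += w * p
    best = max(range(len(vocab)), key=fused.__getitem__)
    return vocab[best], fused

# Hypothetical outputs of three LSTM branches for the next word
vocab = ["dog", "cat", "ball"]
branch_probs = [
    [0.5, 0.4, 0.1],  # branch after layer 1
    [0.2, 0.7, 0.1],  # branch after layer 2
    [0.3, 0.6, 0.1],  # branch after layer 3
]
word, fused = fuse_and_pick(branch_probs, vocab)
print(word)
```

Here the first branch alone would choose "dog", but the fused distribution favors "cat", which is the kind of disagreement a joint decision is meant to resolve.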

Generation method of image description from structured text

The invention discloses a method for generating an image description from a structured text. The method comprises the steps of: downloading pictures from the Internet to form a picture training set; conducting morphological analysis on the descriptions corresponding to the pictures in the training set to form the structured text; using an existing neural network model to extract convolutional neural network features of the pictures in the training set, and using <picture features, structured text> pairs as inputs to build a multi-task recognition model; using the structured text extracted from the training set and the corresponding descriptions as inputs of a recurrent neural network, and training it to obtain the parameters of the recurrent neural network model; inputting the convolutional neural network features of an image to be described, and obtaining a predicted structured text through the multi-task recognition model; and inputting the predicted structured text into the recurrent neural network model to obtain the image description. Compared with the prior art, the method produces image descriptions of better quality, accuracy, and sentence variety, and can be effectively applied to image retrieval.
Owner:哈尔滨米兜科技有限公司

Deep learning model-based image Chinese description method

The invention discloses a deep learning model-based image Chinese description method and belongs to the field of computer vision and natural language processing. The method comprises the steps of: preparing an ImageNet image data set and an AI Challenger image Chinese description data set; pre-training a DCNN on the ImageNet image data set to obtain a pre-trained DCNN model; performing image feature extraction and image feature mapping on the AI Challenger image Chinese description data set, and passing the image features to a GRU (gated recurrent unit) recurrent neural network; constructing a word coding matrix for the AI Challenger image annotation set in the AI Challenger image Chinese description data set; extracting word embedding features with an NNLM to complete the text feature mapping; taking the GRU recurrent neural network as the language generation model to complete the image description model; and generating a Chinese description sentence. The method fills the gap in Chinese image description, automatically generates Chinese descriptions of images, improves the accuracy of the description content, and lays a foundation for the development of Chinese NLP and computer vision.
Owner:HARBIN UNIV OF SCI & TECH
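
For readers unfamiliar with the GRU that the abstract above uses as its language generation model, one recurrence step can be sketched with the standard GRU equations. The abstract gives no dimensions, weights, or gate convention, so the parameter names (`Wz`, `Uz`, and so on), the dictionary layout, and the update convention `h' = (1 - z) * h + z * candidate` are assumptions for illustration only.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]

def gru_step(x, h, p):
    """One GRU step: x is the input vector, h the previous hidden state, p the parameters."""
    # Update gate: how much of the candidate state replaces the old state
    z = [sigmoid(a + b + c)
         for a, b, c in zip(matvec(p["Wz"], x), matvec(p["Uz"], h), p["bz"])]
    # Reset gate: how much of the old state feeds the candidate
    r = [sigmoid(a + b + c)
         for a, b, c in zip(matvec(p["Wr"], x), matvec(p["Ur"], h), p["br"])]
    reset_h = [ri * hi for ri, hi in zip(r, h)]
    # Candidate hidden state
    cand = [math.tanh(a + b + c)
            for a, b, c in zip(matvec(p["Wh"], x), matvec(p["Uh"], reset_h), p["bh"])]
    # Blend old state and candidate
    return [(1.0 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]

def zero_params(input_dim, hidden_dim):
    """Hypothetical helper: all-zero parameters, useful only for checking shapes."""
    def zeros(rows, cols):
        return [[0.0] * cols for _ in range(rows)]
    p = {}
    for gate in ("z", "r", "h"):
        p["W" + gate] = zeros(hidden_dim, input_dim)
        p["U" + gate] = zeros(hidden_dim, hidden_dim)
        p["b" + gate] = [0.0] * hidden_dim
    return p

# With all-zero parameters the update gate is 0.5 and the candidate is 0,
# so each hidden unit is simply halved.
new_h = gru_step([1.0, 2.0, 3.0], [1.0, -2.0], zero_params(3, 2))
```

In a captioning model of the kind described, `x` would be a word embedding (or the mapped image feature at the first step) and the hidden state would feed a softmax over the vocabulary.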

An image description information generation method and device and an electronic device

The invention discloses an image description information generation method and device and an electronic device. The method comprises the steps of: obtaining a target image to be processed; inputting the target image into a target image description information generation network, which is a generation network obtained through adversarial training on a plurality of sample images and used for generating image description information, wherein the adversarial training alternates between an initialized image description information generation network matching the target image description information generation network and an initialized judgment network, and the judgment network is used for judging the output of the image description information generation network; and generating, according to the output of the target image description information generation network, target image description information that describes the target image. The method and device solve the technical problem that image description information generation methods in the related art produce poor-quality output.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Method and system for strengthening navigation performance based on image capture and recognition technology

The invention discloses a method and a system for strengthening navigation performance based on image capture and recognition technology. The method comprises the following steps: A. inputting destination information, and obtaining user location information and route information; B. capturing images of the real scene in real time with image capture equipment and displaying them; C. identifying the features of the captured images, obtaining image description parameters, and using those parameters to search for a match in a preset database; D. after a successful match, obtaining the coordinates of the captured image in the virtual map stored in the database, deriving the specific route between the user and the captured scene from those coordinates and the user location information, converting the route into guidance information, and displaying it on the real-scene image; E. repeating steps B, C, and D until navigation is finished. The invention changes the conventional mode of traditional navigation systems, and users can precisely choose the direction and route according to the guidance markers.
Owner:佛山电视台南海分台
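
Step D of the abstract above, turning the matched image's map coordinates and the user's location into guidance, can be sketched as below. The abstract does not describe the coordinate system or the guidance format, so this assumes planar map coordinates in meters, a heading measured in degrees clockwise from the +y ("north") axis, and a hypothetical three-way instruction set.

```python
import math

def guidance(user_xy, heading_deg, landmark_xy, straight_tolerance_deg=20.0):
    """Return (instruction, distance) from the user's position to a matched landmark."""
    dx = landmark_xy[0] - user_xy[0]
    dy = landmark_xy[1] - user_xy[1]
    distance = math.hypot(dx, dy)
    # Bearing to the landmark, clockwise from the +y axis, in [0, 360)
    bearing = math.degrees(math.atan2(dx, dy)) % 360.0
    # Signed turn angle relative to the current heading, in [-180, 180)
    relative = (bearing - heading_deg + 180.0) % 360.0 - 180.0
    if abs(relative) <= straight_tolerance_deg:
        instruction = "go straight"
    elif relative > 0:
        instruction = "turn right"
    else:
        instruction = "turn left"
    return instruction, distance

# User at the origin facing north; the matched landmark is 100 m due east
print(guidance((0.0, 0.0), 0.0, (100.0, 0.0)))
```

The returned instruction is what a display layer would overlay on the live camera image; repeating the call as new frames are matched corresponds to the loop in step E.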

Personnel behavior identification implementation system and method based on image segmentation and semantic extraction

The invention relates to a personnel behavior identification and detection system and method based on image segmentation and semantic feature extraction. The system comprises an image acquisition unit, a personnel behavior detection host computer, a user query unit, and an output interface unit. In the method, the personnel behavior detection host computer identifies personnel behaviors in the image data acquired by the image acquisition unit through image segmentation and image semantic feature extraction, and generates personnel behavior presentation information. The host computer maps the low-level features of an image to high-level semantics through a support vector machine, establishing a mapping between the image and its description, so that the content of the picture can be understood through digital image processing and analysis, the behaviors of personnel in the scene can be detected intelligently, and the identification accuracy of personnel behaviors in the image can be greatly improved. The system has a simple structure, and the method is simple and convenient to implement, low in application cost, and wide in application range.
Owner:THE THIRD RES INST OF MIN OF PUBLIC SECURITY
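
The mapping from low-level image features to a semantic label through a support vector machine, as described in the abstract above, can be illustrated with a toy linear SVM. This is a sketch under several assumptions not found in the source: two-dimensional feature vectors, two behavior classes with the hypothetical labels "running" and "standing", and training by sub-gradient descent on the regularized hinge loss instead of whatever solver and kernel the patent uses.

```python
def train_linear_svm(samples, labels, lr=0.1, lam=0.01, epochs=20):
    """Train a linear SVM (labels +1 / -1) by sub-gradient descent on the hinge loss."""
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            if margin < 1.0:
                # Sample violates the margin: move the hyperplane toward classifying it
                w = [wi - lr * (lam * wi - y * xi) for wi, xi in zip(w, x)]
                b += lr * y
            else:
                # Margin satisfied: apply only the regularization shrinkage
                w = [wi - lr * lam * wi for wi in w]
    return w, b

def predict(w, b, x):
    """Return +1 or -1 depending on which side of the hyperplane x falls."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1

# Hypothetical low-level features (for example motion energy and aspect ratio, rescaled)
samples = [(2.0, 2.0), (3.0, 3.0), (-2.0, -2.0), (-3.0, -1.0)]
labels = [1, 1, -1, -1]
semantics = {1: "running", -1: "standing"}  # hypothetical high-level labels

w, b = train_linear_svm(samples, labels)
print(semantics[predict(w, b, (1.0, 1.0))])
```

A real system would use many more feature dimensions and behavior classes, typically via a multi-class scheme built from several binary classifiers.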