Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

1158 results about "Text detection" patented technology

PDF document table extraction method, device and equipment and computer readable storage medium

The invention relates to the technical field of artificial intelligence, and discloses a PDF document table extraction method and device, equipment and a computer readable storage medium. The method comprises the steps of obtaining a to-be-identified PDF document, and processing the to-be-identified PDF document; preprocessing the processed PDF document, inputting the preprocessed PDF document into a convolutional neural network, outputting a feature map, inputting the feature map into an RPN region candidate network, and determining a table region; carrying out preprocessing and feature extraction on the table area based on the OCR character recognition technology, obtaining a feature picture, carrying out character detection on the feature picture, determining a text area, carrying out character recognition on the text area, determining text informatio, wherein the text information comprises text position information and text content information; and determining structure informationof the table according to the text coordinate information, dividing each cell of the table based on the structure information, and filling each corresponding cell of the table with a text corresponding to the text content information. According to the method and the device, the accuracy of PDF document table extraction is improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Text detection and recognition method combining character level classification and character string level classification

The invention discloses a text detection and recognition method combining character level and character string level classification. According to the method, pixel sets possibly belonging to the same character are extracted from images to form alternate characters; alternate characters which do not meet the geometric feature statistic rule are filtered out; a character level classifier based on the character rotation and dimension invariance features is adopted for classifying the alternate characters, and the probability that the alternate characters are certain characters is determined; the characters are combined two by two, and initial character strings are formed; the similarity between every two character strings is calculated, two character strings with the highest similarity are combined into new character strings until no character strings capable of being combined exist; a character level classifier based on the character string structure feathers is adopted for classifying the character strings to determine the character strings with semanteme; and the probability that character strings to be recognized are certain characters is utilized for recognizing the character strings, and the semantic text is obtained. The text detection and recognition method has the advantages that the text detection and recognition process is used as a whole, the interaction of the text detection and the recognition is utilized for improving the result precision, and simplicity and high efficiency are realized.
Owner:HUAZHONG UNIV OF SCI & TECH

Scene text detection method based on end-to-end full convolutional neural network

The present invention discloses a scene text detection method based on an end-to-end full convolutional neural network, which is used for the problem of finding a multi-directional text position in animage of a natural scene. The method specifically comprises the following steps: obtaining a plurality of image data sets for training scene text detection, and defining an algorithm target; carryingout feature learning on the image by using a full convolution feature extraction network; predicting an affine transformation matrix in an instance level for each sample point on the feature map, andcarrying out feature expression on the text according to the predicted affine transformation deformation sampling grid; classifying feature vectors of a candidate text, and carrying out coordinate regression and affine transformation regression to jointly optimize the model; using the learning framework to detect the precise position of the text; and carrying out non-maximum suppression on the bounding box set output by the network to obtain a final text detection result. The method disclosed by the present invention is used for scene text detection of real image data, and has a better effectand robustness for multi-directional, multi-scale, multi-lingual, shape distortion and other complicated situations.
Owner:ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products