Unlock instant, AI-driven research and patent intelligence for your innovation.

Text extraction method and device

A text extraction and text technology, applied in instruments, computing, electrical and digital data processing, etc., can solve the problems of high cost of OCR recognition technology model and loss of effective information, save the consumption of computing resources, improve accuracy, and meet the needs of users. effect of demand

Pending Publication Date: 2022-08-05
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the text content sequence is restored in the face of complex PDF typesetting, it may lead to the loss of effective information after conversion; and the OCR recognition technology model is expensive, and the content sequence reconstruction model depends on high-quality data annotation, so it is urgently needed. An effective solution to the above problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text extraction method and device
  • Text extraction method and device
  • Text extraction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057]In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application. However, the present application can be implemented in many other ways different from those described herein, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0058] The terminology used in one or more embodiments of the present application is for the purpose of describing a particular embodiment only, and is not intended to limit the one or more embodiments of the present application. As used in one or more embodiments of this application and the appended claims, the singular forms "a," "the," and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It will also be understood that the term "and / or" as used in one ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text extraction method and device. The text extraction method comprises the steps of obtaining a to-be-processed text; analyzing the to-be-processed text to obtain at least one text element containing word units and coordinate information corresponding to each text element; calculating element distances among the text elements according to the coordinate information; and performing iterative aggregation on each text element according to the element distance, and generating a target text corresponding to the to-be-processed text according to an iterative aggregation result. When text content extraction is carried out, consumption of computing resources can be saved, the content extraction accuracy can be improved, and the use requirements of downstream services are better met.

Description

technical field [0001] The present application relates to the field of artificial intelligence of computer technology, in particular to a text extraction method. The present application also relates to a text extraction apparatus, a computing device, and a computer-readable storage medium. Background technique [0002] Artificial intelligence (AI) refers to the ability of an engineered (i.e. designed and manufactured) system to perceive its environment and to acquire, process, apply and represent knowledge. The AI ​​deep learning framework implements the encapsulation of algorithms. With the development of artificial intelligence, various deep learning frameworks continue to emerge; TensorFlow, PyTorch and other general-purpose deep learning frameworks are used in natural language processing, computer vision, speech processing and other fields, as well as machine translation, smart finance, smart medical, Autonomous driving and other industries. It is a deep learning fram...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06V30/148G06F40/205G06F40/109
CPCG06V30/153G06F40/109G06F40/205
Inventor 胡声雷李长亮
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More