Retrieval method based on layout information

A layout and conditional technology, applied in the field of retrieval, can solve problems such as inability to meet comprehensive, efficient and accurate retrieval requirements, inaccurate retrieval conditions, and inaccurate retrieval, so as to achieve improved retrieval efficiency, strong retrieval pertinence, and accurate retrieval results Effect

Inactive Publication Date: 2015-03-25
天津书生软件技术有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the application, it will be found that because the input retrieval conditions are not precise enough, the existing retrieval methods will retrieve a large amount of useless text in addition to the required text
Users need to manually filter the retrieved text, so the retrieval is not precise enough
Moreover, electronic documents store not only text, but also rich graphics, images, and even media information, and the existing retrieval methods only stop at text retrieval.
At present, there are a few graphics and image retrieval methods, which can only retrieve whether images are included and locate them, but cannot perform targeted retrieval according to the retrieval conditions set by users
[0004] It can be seen that the existing character-based electronic document retrieval methods cannot meet the comprehensive, efficient and accurate retrieval requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Retrieval method based on layout information
  • Retrieval method based on layout information
  • Retrieval method based on layout information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0039] In this embodiment, the text is used as the retrieval object, and the text retrieval is performed according to the text layout information.

[0040] The retrieval of text layout description information is mainly based on the text font, font size, color, filling method, outline method, and font special effects as retrieval conditions. in,

[0041] 1. Font, which can be the specific name of the text font. It can also be the classification name of the font used in the text, such as Fangsong. The classification name of the classified fonts is the fonts of imitation Song, and it also includes the specific names of text fonts such as Fangzheng imitation Song, Chinese imitation Song, and Wenxing imitation Song. The corresponding retrieval rule is to search the text fonts in the electronic document according to the fonts set by the user.

[0042] 2. Font size, which can be the specific font size of the text, or a range of font sizes, or a description of the size of the font....

Embodiment 2

[0076] For electronic documents, texts, graphics and images not only have their own layout information, but also have common layout information. Public layout information applies to all objects contained in electronic documents. Public layout information can be combined with text, graphics, and image layout information as a retrieval condition, or it can be used alone as a retrieval condition. See Table 1, the public forum information used as retrieval criteria mainly includes several types:

[0077]

[0078] Table 1

[0079] In this embodiment, graphics are used as retrieval objects, and graphics retrieval is performed according to graphics layout information and public layout information. The difference from Embodiment 1 is that the specific content of the graphic layout information is different from the specific content of the text layout information, and graphics are stored in the form of graphics drawing commands in electronic documents. When the graphics need to be ...

Embodiment 3

[0101] In the above two embodiments, the retrieval conditions are all exact retrieval conditions, and this embodiment describes an implementation method based on page information retrieval where the retrieval conditions are inexact retrieval conditions.

[0102] In this embodiment, images are used as retrieval objects, and image retrieval is performed according to image layout information. The specific content of the image layout information used as the retrieval condition is shown in Table 3. The image layout information in Table 3 can be used alone or in combination as retrieval conditions.

[0103] See Table 3, image layout description information includes the following:

[0104]

[0105] table 3

[0106] In this embodiment, the search condition is set to be a black-and-white image with the largest display shape being an ellipse.

[0107] Figure 4 It is a flowchart of a method for implementing image retrieval based on layout information in Embodiment 3 of the presen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

PropertyMeasurementUnit
Lengthaaaaaaaaaa
Login to view more

Abstract

The invention discloses a retrieval method based on layout information. The retrieval method comprises the steps that retrieval conditions are set, wherein the retrieval conditions comprise the layout information; according to the retrieval conditions, a retrieval result is obtained from an electronic document needing to be retrieved. By the adoption of the retrieval method based on the layout information, a comprehensive, efficient and accurate electronic document retrieval way is provided, the text retrieval efficiency can be improved, graphics and images are retrieved in a targeted mode, and the retrievable object range is enlarged.

Description

technical field [0001] The invention relates to retrieval technology, in particular to a retrieval method based on layout information. Background technique [0002] With the promotion and application of computer technology, the use of electronic documents to store information is gradually replacing traditional information storage methods. Electronic paper is a type of electronic document. Electronic paper technology can replace the traditional way of saving paper information, and it can store information such as text, graphics and images in electronic format. This provides convenience for browsing and processing information on electronic paper with the help of computer technology. [0003] At present, the retrieval methods for electronic paper are mainly text-based retrieval based on text character matching. In application, it will be found that because the input retrieval conditions are not precise enough, the existing retrieval methods will retrieve a large amount of us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/903
Inventor 王东临
Owner 天津书生软件技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products