
Drawing text reading method and system based on cluster analysis

A cluster-analysis technique for drawings, applied in the fields of instruments, computing, and character and pattern recognition. It addresses problems such as inconvenient translation, improving accuracy and reducing clustering errors.

Active Publication Date: 2020-02-21
成都优译信息技术股份有限公司

AI Technical Summary

Problems solved by technology

[0003] The technical problem to be solved by the present invention is that, when drawing text is extracted, continuous content is parsed into positions that are far apart, which makes it inconvenient to extract and translate the drawing text. The purpose is to provide a drawing text reading method and system based on cluster analysis that solves this problem.



Examples


Embodiment 1

[0037] As shown in Figure 1, the drawing text reading method based on cluster analysis of the present invention includes the following steps: S1: classify the text boxes on the drawing according to their angles; S2: extract the coordinate feature values of the text boxes of the same angle type; S3: perform cluster analysis on the text boxes of the same angle type, so that text boxes with similar coordinate feature values are clustered into the same class, and sort the text boxes according to the clustering results; S4: output the text according to the angle type of the text boxes.

[0038] When this embodiment is implemented, the text boxes on the drawing are first classified according to their angles into multiple angle types, such as the 0°, 90°, 180° and 270° that often appear in CAD drawings. Then the coordinate feature values of the text boxes of the same angle type are extracted; this coordinate feature v...
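Step S1 can be illustrated with a minimal Python sketch. The `TextBox` fields, the `angle_type` helper, and the choice to snap each rotation to the nearest multiple of 90° are assumptions made for illustration; the source only says that boxes are divided into angle types such as 0°, 90°, 180° and 270°.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class TextBox:
    text: str
    x: float      # x of the box's reference point (hypothetical field)
    y: float      # y of the box's reference point (hypothetical field)
    angle: float  # rotation in degrees


def angle_type(angle: float, step: int = 90) -> int:
    """Snap a rotation to the nearest multiple of `step`, in [0, 360)."""
    # Snapping is an assumption; the source does not say how angles are binned.
    return int(round(angle / step)) * step % 360


def classify_by_angle(boxes):
    """S1: group text boxes by angle type, keeping file order inside a group."""
    groups = defaultdict(list)
    for box in boxes:
        groups[angle_type(box.angle)].append(box)
    return dict(groups)


# Made-up sample data in "file order".
boxes = [
    TextBox("PUMP", 10, 50, 0.0),
    TextBox("SECTION A-A", 80, 5, 90.0),
    TextBox("HOUSING", 12, 48, 0.3),
    TextBox("NOTE 1", 5, 5, -90.0),
]
groups = classify_by_angle(boxes)
for a in sorted(groups):
    print(a, [b.text for b in groups[a]])
```

Each group can then be passed separately to the feature extraction and clustering steps, so that boxes with different rotations are never clustered together.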

Embodiment 2

[0040] In this embodiment, on the basis of Embodiment 1, the coordinate feature value is the coordinate value of the upper-left corner, lower-left corner, upper-right corner, lower-right corner, or center point of the text box.

[0041] When this embodiment is implemented, the coordinate feature value is the coordinate value of the upper-left corner, lower-left corner, upper-right corner, lower-right corner, or center point of the text box. The coordinate feature value is a coordinate value that identifies the unique position of each text box, and any of these five coordinate values can express that unique position, which effectively improves the clustering accuracy of the present invention.
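The five candidate feature values can be sketched as follows. The `BoundingBox` type, the `feature_value` function, and the convention that y increases upward (as is common in CAD coordinates) are assumptions for illustration; the source does not specify a data layout, and rotated boxes are treated here by their axis-aligned extent only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    # Axis-aligned extent of a text box; y is assumed to increase upward.
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def feature_value(box: BoundingBox, anchor: str = "upper_left"):
    """S2: return one (x, y) point that identifies the box's position."""
    points = {
        "upper_left": (box.x_min, box.y_max),
        "lower_left": (box.x_min, box.y_min),
        "upper_right": (box.x_max, box.y_max),
        "lower_right": (box.x_max, box.y_min),
        "center": ((box.x_min + box.x_max) / 2, (box.y_min + box.y_max) / 2),
    }
    if anchor not in points:
        raise ValueError(f"unknown anchor: {anchor!r}")
    return points[anchor]


box = BoundingBox(0, 0, 4, 2)
print(feature_value(box, "center"))
print(feature_value(box, "upper_left"))
```

Whichever anchor is chosen, it would presumably need to be the same for every box in a group, since the clustering step compares these points with each other.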

Embodiment 3

[0043] In this embodiment, on the basis of Embodiment 1, the clustering uses the OPTICS algorithm. The OPTICS algorithm determines the relative distance between text boxes by reading the coordinate feature values of the text boxes in the ordered text box group. The criterion for coordinate feature values being similar is that the relative distance is less than or equal to a threshold.

[0044] When this embodiment is implemented, the text boxes on a drawing follow some regularity but are not fully regular. The inventors found through creative work that this irregularity makes it difficult to determine clustering parameters accurately, whereas with the OPTICS algorithm the clustering results change little when the clustering parameters vary within a reasonable range. In the OPTICS algorithm applied by the present invention, the relative distance between text boxes is determined by reading the coordinate feature values of the text boxes in the ordere...
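The idea of ordering boxes and then grouping those whose relative distance is within a threshold can be sketched with a textbook OPTICS in plain Python. This is not the patent's implementation: the function names, the `eps` and `min_pts` parameters, the Euclidean distance, and the sample points are all assumptions, and "relative distance" is read here as the OPTICS reachability distance.

```python
import heapq
import math


def optics(points, eps=math.inf, min_pts=2):
    """Plain O(n^2) OPTICS; returns (order, reachability, core distance)."""
    n = len(points)
    reach = [math.inf] * n
    core = [math.inf] * n
    done = [False] * n
    order = []

    def visit(i, seeds):
        done[i] = True
        order.append(i)
        # Distances to all other points within eps, nearest first.
        near = sorted(
            (d, j)
            for j in range(n)
            if j != i and (d := math.dist(points[i], points[j])) <= eps
        )
        # The core distance counts the point itself (distance 0).
        dists = [0.0] + [d for d, _ in near]
        if len(dists) < min_pts:
            return  # not a core point: it cannot extend a cluster
        core[i] = dists[min_pts - 1]
        for d, j in near:
            new_reach = max(core[i], d)
            if not done[j] and new_reach < reach[j]:
                reach[j] = new_reach
                heapq.heappush(seeds, (new_reach, j))

    for start in range(n):
        if done[start]:
            continue
        seeds = []
        visit(start, seeds)
        while seeds:
            _, j = heapq.heappop(seeds)
            if not done[j]:  # skip stale heap entries
                visit(j, seeds)
    return order, reach, core


def extract_clusters(order, reach, core, threshold):
    """Cut the ordering where reachability exceeds threshold; -1 is noise."""
    labels = [-1] * len(order)
    current = -1
    for i in order:
        if reach[i] <= threshold:
            labels[i] = current
        elif core[i] <= threshold:
            current += 1
            labels[i] = current
    return labels


# Made-up feature values: two nearby groups and one isolated box.
points = [(0, 0), (0, 1), (0, 2), (10, 10), (10, 11), (50, 50)]
order, reach, core = optics(points, eps=5.0, min_pts=2)
labels = extract_clusters(order, reach, core, threshold=1.5)
print(order, labels)
```

On this toy input, thresholds of 1.5 and 3.0 produce the same labels, which is consistent with the source's remark that the results change little when the parameters vary within a reasonable range. The ordering only needs to be computed once; different thresholds reuse it.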


Abstract

The invention discloses a drawing text reading method based on cluster analysis, comprising the following steps: S1: classify the text boxes on the drawing according to their angles; S2: extract the coordinate feature values of the text boxes of the same angle type; S3: perform cluster analysis on the text boxes of the same angle type, so that text boxes with similar coordinate feature values are clustered into the same class, and sort the text boxes according to the clustering results; S4: output the text according to the angle type of the text boxes. The invention also discloses a drawing text reading system based on cluster analysis that uses this method. By clustering the text boxes, the method and system group text boxes with similar content into the same class before outputting the text, which prevents continuous content from being parsed into positions that are far apart and makes it more convenient to extract and translate drawing text.

Description

Technical field

[0001] The invention relates to the field of text typesetting, in particular to a drawing text reading method and system based on cluster analysis.

Background

[0002] With the strengthening of international cooperation, both Chinese and foreign companies need to communicate through drawing documents when establishing and developing international engineering and scientific research projects. Because of multiple revisions, manual typesetting, and similar causes, a sentence on a drawing that should be complete is often split into multiple text boxes that are manually placed close to each other. However, because of how the drawing file itself is structured (text boxes are written to the file sequentially in the order they were created), text boxes that appear adjacent on the drawing are not necessarily adjacent in the saved content, and some may even be far apart. It is ve...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06K9/00, G06K9/20, G06K9/62
CPC: G06V30/40, G06V10/225, G06V30/10, G06F18/2321
Inventor: 席斌, 李明, 王兴强, 彭成超
Owner: 成都优译信息技术股份有限公司