Incomplete English word identification method

An English word recognition technology, applied in the field of information processing

Active Publication Date: 2018-06-15
KUNMING UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide an incomplete English word recognition method that addresses the limitations and shortcomings of the prior art, to solve the phenomena such as l

Examples

Embodiment 1

[0038] Embodiment 1: An incomplete English word recognition method, comprising the following steps:

[0039] Step 0: Extract English word features and build an English word feature database. Map each English word into a lattice of 16×N pixels and divide the lattice, from top to bottom and from left to right, into 2N small matrices of 8×1 pixels. Record the number of pixels occupied by the word in the j-th 8×1 small matrix as p_j, j ∈ [1, 2N]. Collect all p_j, j ∈ [1, 2N], to generate the English word feature vector {p_1, p_2, ..., p_{2N}}, and store every English word together with its feature vector in the database to form the English word feature database P: {P_1, P_2, ..., P_M};

[0040] Step 1: Using modern scanning technology and letter shape features, extract the image of the incomplete English word X to be detected from paper or another carrier, and convert the image to 16:N X The ...
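
The feature extraction described in Step 0 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patent's implementation: it assumes the lattice is a binary matrix with 16 rows and N columns (1 marks a pixel occupied by the word), that each 8×1 small matrix is 8 pixels tall and 1 pixel wide, and that the blocks are ordered column by column from left to right with the top block first. The function name `word_features` is hypothetical.

```python
from typing import List


def word_features(lattice: List[List[int]]) -> List[int]:
    """Return the feature vector {p_1, ..., p_2N} of a 16xN binary lattice.

    Assumption: each column is split into a top block (rows 0-7) and a
    bottom block (rows 8-15), and p_j is the number of occupied pixels
    in the j-th block.
    """
    if len(lattice) != 16:
        raise ValueError("lattice must have exactly 16 rows")
    n = len(lattice[0])
    if any(len(row) != n for row in lattice):
        raise ValueError("all rows must have the same width")

    features = []
    for col in range(n):          # left to right
        for top in (0, 8):        # top block, then bottom block
            count = sum(lattice[r][col] for r in range(top, top + 8))
            features.append(count)
    return features


# Toy lattice with N = 2 (hypothetical data, not a real word shape).
example = [[0, 0] for _ in range(16)]
for r in range(8):
    example[r][0] = 1             # column 0: top block fully occupied
for r in range(6, 10):
    example[r][1] = 1             # column 1: two pixels in each block

print(word_features(example))     # prints [8, 0, 2, 2]
```

A lattice of width N always yields a vector of length 2N, which is why the length of the feature vector can later serve as a screening key against the database.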

Abstract

The invention relates to an incomplete English word identification method and belongs to the technical field of information processing. The method comprises the steps of: mapping English words into dot matrix form, extracting word features, and establishing an English word feature database; converting any incomplete English word to be detected into an image by means of modern scanning technology and the shape features of the English word, performing grayscale and binary processing on the image, extracting the features of the English word, and generating a feature vector; screening out a target English word set from the database according to the length of the feature vector; applying a zero padding or cutting operation to the English words in the target English word set and calculating their cosine theorem-based word shape similarity and Euclidean distance-based word shape similarity; and finally, through a similarity fusion algorithm and a similarity threshold judgment, obtaining the set of words similar to the incomplete English word to be detected.
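
The similarity stage summarized above can be sketched in Python as follows. This is a hedged sketch only: the excerpt does not give the fusion formula, the mapping from Euclidean distance to a similarity, the threshold value, or the length-based screening rule. The sketch therefore assumes a weighted average with weight `alpha`, the mapping 1 / (1 + d), and a threshold of 0.8, and it omits screening. All function and parameter names are hypothetical.

```python
import math
from typing import Dict, List


def fit_length(vec: List[int], length: int) -> List[int]:
    """Zero-pad or cut a feature vector to the requested length."""
    return (vec + [0] * length)[:length]


def cosine_similarity(a: List[int], b: List[int]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def euclidean_similarity(a: List[int], b: List[int]) -> float:
    """Assumed mapping from Euclidean distance d to 1 / (1 + d)."""
    dist = math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return 1.0 / (1.0 + dist)


def similar_words(query: List[int],
                  database: Dict[str, List[int]],
                  alpha: float = 0.5,
                  threshold: float = 0.8) -> Dict[str, float]:
    """Return the words whose fused similarity reaches the threshold.

    Assumption: fusion is alpha * cosine + (1 - alpha) * Euclidean.
    """
    results = {}
    for word, feats in database.items():
        cand = fit_length(feats, len(query))
        score = (alpha * cosine_similarity(query, cand)
                 + (1 - alpha) * euclidean_similarity(query, cand))
        if score >= threshold:
            results[word] = score
    return results


# Toy feature database (hypothetical vectors, not real word shapes).
db = {"a": [8, 0, 2, 2], "b": [0, 8, 0, 0, 5, 5], "c": [8, 0, 2]}
print(sorted(similar_words([8, 0, 2, 2], db)))  # prints ['a']
```

Padding or cutting every candidate to the query's length keeps both similarity measures well defined when the erased part of a word changes the width of its lattice.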

Description

Technical field

[0001] The invention relates to a method for identifying incomplete English words, and belongs to the technical field of information processing.

Background

[0002] In the investigation of cultural relics and the identification of important documents, parts of some English words may have been erased for various reasons. Correctly identifying these incomplete English words is of great significance for the study of modern history and the verification of celebrity quotations.

[0003] At present, the recognition of incomplete English words relies mainly on a person's familiarity with English vocabulary and manual lookup in English dictionaries, followed by reasoning from contextual information. However, because English words are so numerous, this work is time-consuming and cumbersome. If the second edition of the Oxford Dictionary is used as the basis, there are 171,476 English words in total. Even if the incomplete English words can be screened based on...

Application Information

IPC(8): G06K9/00, G06K9/46, G06K9/34, G06K9/62
CPC: G06V30/40, G06V30/153, G06V10/40, G06V30/293, G06F18/25
Inventors: 彭艺, 尹玉梅
Owner: KUNMING UNIV OF SCI & TECH