Method and device for identifying garbled text in a page

A method and device for identifying garbled text in a page, applied in the field of Web page processing. It addresses the long operation time and error-proneness of existing garbled text recognition, which reduce the efficiency and reliability of that recognition. The method is simple to operate and has a high accuracy rate, which improves the efficiency and reliability of identification.

Active Publication Date: 2019-05-28
ALIBABA GRP HLDG LTD

AI Technical Summary

Problems solved by technology

[0003] However, the existing garbled text recognition operation takes a long time and is prone to errors, which reduces the efficiency and reliability of garbled text recognition.

Embodiment Construction

[0041] To make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the drawings in the embodiments. Obviously, the described embodiments are some of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative effort fall within the protection scope of this application.

[0042] It can be understood that the pages involved in this application may be web pages (Web Pages) written based on HyperText Markup Language (HTML), and may also be referred to as Web pages.

[0043] It should be noted that the terminals involved in the embodiments of the present application may include, but are not limited to, mobile phones, personal digital as...

Abstract

The invention provides a method and a device for identifying garbled text in a page. The method includes: obtaining a first coding format of a first text to be identified in the page; converting the first text into a second text in a second coding format, according to the correspondence between characters of the second coding format and characters of other coding formats; converting the second text into a third text, according to the correspondence between characters of the second coding format and characters of the first coding format; and determining whether the first text contains garbled text according to the third text and the first text. The method and device require no operator involvement in the identification process, are simple to operate, and have a high accuracy rate, thereby improving the efficiency and reliability of garbled text identification.
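
The round trip described in the abstract can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the function name `has_garbled_text`, the choice of UTF-8 as the second coding format, and the use of Python's built-in codecs as the character correspondence tables are all hypothetical. The first coding format is assumed to be supplied by the caller, for example from the charset declared by the page.

```python
def has_garbled_text(first_text: bytes, first_encoding: str,
                     second_encoding: str = "utf-8") -> bool:
    """Return True if first_text does not survive a round trip
    first_encoding -> second_encoding -> first_encoding.

    Hypothetical helper: the name, the default second encoding and the
    use of Python codecs as the mapping tables are assumptions.
    """
    # Convert the first text into the second coding format.
    # Bytes that are invalid in first_encoding become U+FFFD.
    second_text = (first_text
                   .decode(first_encoding, errors="replace")
                   .encode(second_encoding, errors="replace"))

    # Convert the second text back into the first coding format.
    third_text = (second_text
                  .decode(second_encoding, errors="replace")
                  .encode(first_encoding, errors="replace"))

    # Compare the third text with the first text: any difference means
    # some characters could not be mapped, so the text is flagged.
    return third_text != first_text


if __name__ == "__main__":
    clean = "你好".encode("gbk")   # valid GBK bytes
    broken = b"\x81"               # lone GBK lead byte, not a full character
    print(has_garbled_text(clean, "gbk"))
    print(has_garbled_text(broken, "gbk"))
```

A check of this kind only flags text whose bytes are not valid in the declared first coding format. Text that was decoded with the wrong but byte-compatible encoding can still pass the round trip, so the sketch should be read as a lower bound on what the described method detects.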

Description

Technical field

[0001] This application relates to World Wide Web (Web) page processing technology, and in particular to a method and device for identifying garbled text in a page.

Background

[0002] A World Wide Web (Web) page may include display blocks, each consisting of one or more HyperText Markup Language (HTML) tags, called page elements, for example, text, labels, hyperlinks, buttons, input boxes, drop-down boxes, etc. For reasons such as the way the Web page is parsed, the text in the Web page may appear garbled. In the prior art, an operator needs to check Web pages one by one to find out whether the text in each page is garbled.

[0003] However, the conventional identification of garbled text takes a long time and is prone to errors, which reduces the efficiency and reliability of garbled text identification.

Summary of the invention

[0004] Various aspects of the present application provide a method and device for recognizing ga...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/22
Inventor: 丁世远
Owner: ALIBABA GRP HLDG LTD