Unlock instant, AI-driven research and patent intelligence for your innovation.

Method for recovering scrambled information of webpage text

A webpage and text technology, which is applied in the field of restoring webpage text scrambling information, can solve problems such as increasing work difficulty and workload, unclear pictures, and affecting the accuracy of image recognition results, so as to reduce work difficulty and workload Effect

Inactive Publication Date: 2020-12-29
SICHUAN CHANGHONG ELECTRIC CO LTD
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The patent uses pixel points to generate pictures, which is likely to cause pixel interference, which makes the generated pictures unclear and affects the accuracy of image recognition results; in addition, in terms of mapping relationship, the patent uses a series of neural network training, which increases the difficulty and workload of the work

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for recovering scrambled information of webpage text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the purpose, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0024] like figure 1 As shown, a method for recovering webpage text scrambling information includes:

[0025] Step 1. Obtain the encrypted data in the source code of the webpage and the custom font file corresponding to the encrypted data;

[0026] Step 2, converting the custom font file into an XML file, which includes the coordinates needed to generate the font image;

[0027] Step 3, generating font images according to the coordinates;

[0028] Step 4. Using image recognition technology to identify the text in the font picture; the text includes: Chinese characters, letters and numbers. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of networks, in particular to a method for recovering webpage text scrambling information in order to solve the problem that a picture generated by encrypted font information is not clear, and the method comprises the following steps: 1, acquiring encrypted data in a webpage source code and a custom font file corresponding to the encrypted data; 2, converting the custom font file into an XML file, wherein the XML file comprises coordinates required for generating a font picture; 3, generating a font picture according to the coordinates; 4, identifying characters in the font picture by adopting an image identification technology; 5, establishing a mapping relationship between the encrypted data and the characters identified in the step 4; 6, replacing the encrypted data in the webpage with the identified characters according to the mapping relationship. Pictures generated in the mode are clearer, operation is simpler in the mapping process, and the workload is smaller.

Description

technical field [0001] The invention relates to the field of network technology, in particular to a method for recovering web page text scrambling information. Background technique [0002] The core problem to be solved in the crawler development process is to break through the anti-crawling technology of data sources. Common anti-crawling technologies include IP access restrictions, user login verification, dynamic loading of front-end data, request headers or browser signatures, etc. Anti-crawling technology has been around for a long time, and there are many existing solutions. However, font anti-crawling, as a new anti-crawling technology, has gradually become popular on large data source websites. This kind of anti-crawling website is mainly for some valuable The data is encoded and encrypted to establish a mapping relationship with the font file, and the data in the font file is displayed in the html source code through encoding and encryption, and the encrypted data i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/957G06F16/958G06F16/84
CPCG06F16/84G06F16/957G06F16/958
Inventor 李昌瀚
Owner SICHUAN CHANGHONG ELECTRIC CO LTD