Method and system for cross-label processing of html rich text data with format

A text data and label technology, applied in the field of web and Internet, can solve problems such as insecurity, attack code, and irregular content

Active Publication Date: 2014-01-22
FOCUS TECH
View PDF3 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) Unsafe factors, such as attack codes, etc.
[0005] (2) The submitted content is not standardized, which affects the quality of the website content
Generally, this method can only be used to handle one kind of business, and its versatility is not strong. However, there are many situations in Internet processing, and the business is complicated, so the processing method needs to have certain versatility.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for cross-label processing of html rich text data with format
  • Method and system for cross-label processing of html rich text data with format
  • Method and system for cross-label processing of html rich text data with format

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The present invention proposes a cross-label processing method and system for formatted html rich text data, which eliminates illegal and prohibited unsafe factors such as attack codes, irregular writing, and sensitive words in the text. While optimizing the display, it improves the security of the website and the quality of the content.

[0062] refer to figure 1 As shown, the system structure of the embodiment of the present invention includes

[0063] This system is composed of Parse analyzer 11 , node analyzer 12 , plain text processor 13 , similarity matching processor 14 , and text converter 15 . Parse analyzer 11, node analyzer 12, plain text processor 13, fuzzy matching processor 14, and text converter 15 are connected in sequence.

[0064] The Parse analyzer 11 is used to analyze the received html rich text data and convert it into a DOM-like tree structure object, and the tree structure object can be used on the server.

[0065] The node analyzer 12 is use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and system for cross-label processing of html rich text data with a format. The received html rich text data with the format are converted into a plain text without the format; service processing is conducted, so that the illegal and illicit unsafe factors such as attack codes, nonstandard writing and sensitive words in the text are eliminated; the processed plain text is restored to the prior format according to the method of best similarity matching effect and converted into correct and safe html rich text data with the format, the html rich text data are displayed on a website, and the safety of the website and the quality of the content are improved.

Description

technical field [0001] The invention is applied in the field of web and Internet, and in particular relates to a cross-label processing method and system for formatted html rich text data. Background technique [0002] The Internet needs to process a large amount of data submitted by users. In most cases, the data submitted by users is composed of plain text, but in many cases, the information submitted by users is rich text data with format tags. For example, blogs have rich and diverse The information in the form of presentation, the independent website building services used by users, and e-commerce sellers all have diversified display functions. [0003] Because users can edit customized html rich text data in many fields, and these rich text data all contain formatting tags. However, the html rich text data submitted by users contains many unreliable factors, which will generate some negative factors, affect the content of the website, and the quality and safety of web...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/88
Inventor 邢世康
Owner FOCUS TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products