Web page coding language automatic identification method and device for embedded type browser

An embedded browser and web page coding technology, which is applied in the field of communication, can solve problems such as re-parsing web pages, displaying garbled characters, wasting resources, etc., and achieves the effect of eliminating the possibility of displaying garbled characters

Inactive Publication Date: 2008-01-09
ZTE CORP
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But it requires user participation, which is not convenient enough. In addition, it also needs to re-parse the webpage, which is a waste of resources
The implementation of automatic decoding for the latter scheme varies greatly. Due to the limited resources of the embedded system, the most common method is to build a default language. Once it cannot be recognized, use this language to decode. This method often causes garbled characters to be displayed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web page coding language automatic identification method and device for embedded type browser
  • Web page coding language automatic identification method and device for embedded type browser
  • Web page coding language automatic identification method and device for embedded type browser

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The specific implementation manners of the present invention will be described in detail below with reference to the accompanying drawings.

[0022] The object of the present invention is to provide a language coding automatic identification and analysis technology based on error statistics and trial and error. A general browser has major software modules such as a protocol stack, a web page text parser, a page layout, and a user interface. The main function of this technology is completed by the web page text parser, which requires the cooperation of the protocol stack.

[0023] The technology described in the present invention includes the following components:

[0024] Protocol stack: The protocol stack used for embedded browsers, mainly refers to the hypertext transfer protocol protocol and the wireless hypertext transfer protocol protocol.

[0025] Multilingual codec module: the operation window for operator operators, who can use this platform to initiate DM-rel...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The method includes steps: (1) obtaining data of partial web pages, and head of protocol from protocol stack of embedded type browser; (2) parsing data of web page, and head of protocol in order to obtain metadata of codes of the specified web page; (3) using metadata obtained from data of web page and metadata obtained from head of protocol to determine codes to be used for parsing text at first time based on priority; (4) parsing current data block based on the adopted codes, and accounting errors occurred in parsing procedure, and carrying out parsing procedure again by selecting codes to be used in condition when error occurs. The invention discloses method for automatic recognizing coding language for web page and parsing method with higher efficiency and success ratio when embedded type browser is in limited memory and computing power.

Description

technical field [0001] The invention relates to the field of communications, in particular to a method and device for automatic identification of web page coding languages ​​used in embedded browsers. Background technique [0002] Embedded browsers come from browsers used in desktop personal computers (PCs), and are mostly used in embedded devices such as set-top boxes, information appliances, and mobile information terminals. [0003] Different from browsers on personal computers, embedded browsers can obtain resources such as display area size, processor computing power, memory size, cache size, fonts, and language files are very limited, and the content that needs to be processed is almost the same as It is the same on a personal computer, so there is a big difference from a personal computer browser in network connection mode, content analysis, and page layout. Especially on mobile information terminals, in addition to supporting the traditional Internet, the embedded b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30H04L29/06
Inventor 谢曼
Owner ZTE CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products