Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

41 results about "UTF-8" patented technology

UTF-8 is a variable width character encoding capable of encoding all 1,112,064 valid code points in Unicode using one to four 8-bit bytes. The encoding is defined by the Unicode Standard, and was originally designed by Ken Thompson and Rob Pike. The name is derived from Unicode (or Universal Coded Character Set) Transformation Format – 8-bit.

Method, apparatus and server for processing data packet

The present invention provides a method, an apparatus and a server for processing a data packet. The method comprises the steps of: generating XMPP (Extensible Messaging and Presence Protocol)-based message data to be transmitted, wherein the XMPP-based message data comprises a message header and a message body; performing self-defined packaging for the message header to obtain a packaged message header; converting the message body into a message body conforming to a preset protocol format; packaging the packaged message header and the message body conforming to the preset protocol format to obtain a new message packet; and transmitting the new message packet. According to the method, the apparatus and the server of the present invention, a communication mode of data protocol packets is adopted, an XMPP data packet is packaged by a protocol header, integrity of data is ensured through length field of the protocol header; in addition, communication quality of different types of data packets is defined, so that communication quality modes of different types of data packets can be ensured; and an original packet body is converted into UTF-8 byte stream for processing, thereby facilitating compression with a compression algorithm.
Owner:CHINA MOBILE GRP GUANGDONG CO LTD

Voice recognition character string processing comparison method based on Pinyin

The invention relates to a voice recognition character string processing comparison method based on Pinyin. For application of an existing voice recognition technology to certain special occasions ofperson name recognition, equipment name recognition and the like, errors are generated easily due to incorrect comparison. The method is "secondary processing" based on a general Chinese character recognition algorithm; and recognized Chinese character strings are converted into Pinyin strings, and then the Pinyin strings are compared with target Pinyin strings. The method comprises the followingsteps of 1, performing Pinyin coding: performing coding on all Chinese character Pinyin, wherein the coding is similar to coding of unicode; and enumerating all Chinese character Pinyin combinations;2, performing code conversion: converting the character strings, with coding modes of GBK, Unicode, UTF-8 and the like, for expressing Chinese characters converted into the Pinyin strings; and 3, performing polyphone processing: enumerating polyphones of all family names; performing special processing; and distributing the same Pinyin codes. According to the method, accurate recognition can be rapidly realized, so that misjudgment is avoided.
Owner:深圳市艾塔文化科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products