Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for converting simplified and traditional Chinese characters

A conversion system, simplified and traditional technology, applied in the direction of electronic digital data processing, special data processing applications, instruments, etc., can solve the problems of inaccurate word segmentation, inability to translate customary differences, and achieve the effect of accurate word segmentation results

Active Publication Date: 2016-08-31
SAMSUNG ELECTRONICS CHINA R&D CENT +1
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] This application provides a method and system for mutual conversion between simplified and traditional Chinese characters to solve the problem of inaccurate word segmentation caused by only using the forward maximum matching algorithm for word segmentation in the prior art, and the inability to solve the problem of inaccurate word segmentation between simplified and traditional Chinese characters. The problem of converting the different terms brought about by the translation habits of loanwords

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for converting simplified and traditional Chinese characters
  • Method and system for converting simplified and traditional Chinese characters
  • Method and system for converting simplified and traditional Chinese characters

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to solve the problem of inaccurate word segmentation caused by only using the forward maximum matching algorithm for word segmentation in the prior art, and the problem of being unable to convert different terms caused by the translation habits of loanwords between simplified and traditional users. The following embodiments of the present application provide a system for mutual conversion between simplified Chinese characters and traditional Chinese characters, and a method for the system to realize mutual conversion between simplified Chinese characters and traditional Chinese characters. In the system and method of the following embodiments of the present application, the mapping relationship between Simplified Chinese characters and Traditional Chinese characters, or the mapping relationship between Traditional Chinese characters and Simplified Chinese characters, which is preset by the user can be received and stored in the user-defined mapping dictionary , ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for converting simplified Chinese characters into traditional Chinese characters. The system for converting the simplified Chinese characters into the traditional Chinese characters comprises a dictionary module, an input module, a conversion module and an output module, the dictionary module is used for storing a mapping dictionary and a simplified and traditional character mapping dictionary which are defined by a user, the input module is used for inputting simplified character strings to be converted, the conversion module is used for carrying out sentence division and word division on the simplified character strings in sequence and converting obtained simplified words into traditional words, and the output module is used for combining all the traditional words to form traditional character strings for output. In the word division process, a bidirectional maximum matching algorithm with a forward maximum matching algorithm and a backward maximum matching algorithm combined is adopted, the forward weight of a forward word division result and the backward weight of a backward word division result are calculated, the larger weight is used as the finial word division result, and when the weights are equal, the backward word division result is used as the finial word division result. The method and system achieve the conversion on different expression ways of the same object, and the word division result is more accurate.

Description

technical field [0001] The present application relates to the technical field of language processing, in particular to a method and system for mutual conversion between simplified and traditional Chinese characters. Background technique [0002] With the development of digitization and informatization, communication has become more and more important, and communication through electronic files has become an important means for people to communicate with each other. Due to historical reasons, some of the original traditional Chinese characters have been simplified to form the simplified Chinese characters currently used in mainland China. Thereby resulting in the objective reality that Chinese characters exist in both simplified and traditional writing forms. For example, in mainland China and Singapore, although traditional Chinese characters are occasionally used, simplified Chinese characters are used in most cases; while in Taiwan, Hong Kong and Macau, the original tradi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/28
Inventor 邹良辉胡志坤李远友韩忠海
Owner SAMSUNG ELECTRONICS CHINA R&D CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products