Method and device for processing text data

A text data and processing method technology, applied in the field of recognition processing, can solve the problems of less research on Chinese character text conversion, achieve the effect of improving rationality and realizing intelligent conversion

Active Publication Date: 2013-01-02
讯飞医疗科技股份有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in the prior art, in natural language texts, researchers have mainly done a lot of research on how to convert characters such as Arabic numerals and symbols in the text into standard texts, while converting Chinese text into characters such as numbers and symbols less research

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for processing text data
  • Method and device for processing text data
  • Method and device for processing text data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0054] Taking the application of this case in the speech recognition system as an example, the speech signal is detected and sent to the continuous speech recognition device to obtain the recognition result. Since continuous speech recognition is currently based on model recognition, speech signals are first mapped to consonants or other phoneme-related models, and then converted into Chinese and English characters according to the language model. Therefore, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for processing a text data. The method comprises the following steps: acquiring an initial input result of the text data, wherein the data which is related to numeric characters in the initial input result exists in text manner, and then according to a preset matching rule, converting the data which is related to the numeric characters and exists in text manner into a corresponding numeric character format. By using the method, Chinese character representation of the text related to the number in the text data is converted into the numeric character format, and the rationality of processing text data is increased.

Description

technical field [0001] The present invention relates to the technical field of recognition processing, and more specifically, to a text data processing method and device. Background technique [0002] In natural language texts, such as Chinese texts, there are a considerable number of special symbol strings such as English characters, numeric characters, and symbolic characters. For example: through the statistics of the 1 million-word People's Daily corpus, it is found that more than 70% of the sentences contain special character strings, and the total number of characters in special character strings exceeds 6%, which shows that special character strings are extensive and abundant in natural language texts exist. [0003] The role of special symbol strings in sentences is very obvious. For example, the introduction of Arabic numerals has greatly improved the efficiency of people's acquisition of quantitative information through visual channels. In the field of continuous...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/22G06F17/30
Inventor 陈志刚何婷婷胡国平王智国胡郁刘庆峰
Owner 讯飞医疗科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products