Text processing method and device

A text processing method and device, applied in the technical field of text processing, that address the low efficiency and low accuracy of Chinese key phrase extraction.

Pending Publication Date: 2020-10-30
BEIJING DIDI INFINITY TECH & DEV

AI Technical Summary

Problems solved by technology

[0003] At present, in the prior art, the extraction of Chinese key phrases based on Chinese text suffers from problems such as character encoding and language habits, resulting in low efficiency and low accuracy of Chinese key phrase extraction.

Examples

Embodiment 1

[0090] Figure 1 is a flow chart of a text processing method provided in Embodiment 1 of the present application. As shown in Figure 1, the text processing method provided by this embodiment includes the following steps:

[0091] S101: Perform word segmentation processing on the acquired Chinese text to obtain a plurality of Chinese words.

[0092] In a specific implementation, first obtain the Chinese text from which Chinese key phrases are to be extracted, then perform word segmentation on the Chinese text with a word segmentation tool (such as jieba) to obtain multiple Chinese words.

[0093] Here, the Chinese text is formatted line by line, with one sentence per line. This makes it convenient for the word segmentation tool to segment the Chinese text.
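
Step S101 can be sketched as follows, assuming one sentence per line as described in [0093]. The source names jieba as one possible tool. To keep the example self-contained, a toy forward-maximum-matching segmenter over a hypothetical vocabulary (`TOY_VOCAB`) stands in for it. This is an illustrative substitute, not the segmentation method of the application, and the sample sentences are invented. With jieba installed, `jieba.lcut` could be passed as `segmenter` instead.

```python
from typing import Callable, Iterable, List

# Hypothetical toy vocabulary, for illustration only.
TOY_VOCAB = {"感谢", "您", "的", "耐心", "等待"}


def fmm_segment(sentence: str, vocab: Iterable[str] = TOY_VOCAB) -> List[str]:
    """Forward maximum matching: a stand-in for a real tool such as jieba."""
    vocab = set(vocab)
    max_len = max((len(w) for w in vocab), default=1)
    words, i = [], 0
    while i < len(sentence):
        # Try the longest candidate first; fall back to a single character.
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            piece = sentence[i:i + size]
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def segment_text(
    text: str, segmenter: Callable[[str], List[str]] = fmm_segment
) -> List[List[str]]:
    """Segment line-formatted text: one sentence per line, one word list per sentence."""
    return [segmenter(line.strip()) for line in text.splitlines() if line.strip()]


# Hypothetical two-line input; blank lines are skipped.
sentences = segment_text("感谢您的耐心等待\n感谢您的等待\n")
```

Passing the segmenter as a parameter keeps the line handling independent of whichever word segmentation tool is actually used.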

[0094] In one example, a piece of Chinese text is "Thank you for your patience". After word segmentation by word s...

Embodiment 2

[0121] Figure 2 is a flow chart of another text processing method provided in Embodiment 2 of the present application. As shown in Figure 2, the text processing method provided by this embodiment includes the following steps:

[0122] S201: Perform word segmentation processing on the acquired Chinese text to obtain multiple Chinese words.

[0123] In a specific implementation, first obtain the Chinese text from which Chinese key phrases are to be extracted, then perform word segmentation on the Chinese text with a word segmentation tool to obtain multiple Chinese words.

[0124] S202: Find the English letter sequence corresponding to each Chinese word in the preset Chinese-English mapping table.

[0125] In a specific implementation, since current algorithms for extracting key phrases from English text are relatively mature, converting Chinese text into English text and using English text ...
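
One way to picture the lookup in S202 is sketched below. It builds a hypothetical Chinese-English mapping table that assigns each distinct Chinese word a unique sequence of lowercase letters, then converts a segmented sentence into English text. The source does not say how the preset table is constructed, so the base-26 encoding and the helper names `index_to_letters`, `build_mapping`, and `to_english` are assumptions made purely for illustration.

```python
from typing import Dict, List


def index_to_letters(index: int, width: int = 3) -> str:
    """Encode a non-negative integer as a fixed-width lowercase letter sequence (base 26)."""
    if not 0 <= index < 26 ** width:
        raise ValueError("index does not fit in the requested width")
    letters = []
    for _ in range(width):
        index, rem = divmod(index, 26)
        letters.append(chr(ord("a") + rem))
    return "".join(reversed(letters))


def build_mapping(words: List[str]) -> Dict[str, str]:
    """Assign each distinct word a letter sequence, in order of first appearance."""
    mapping: Dict[str, str] = {}
    for word in words:
        if word not in mapping:
            mapping[word] = index_to_letters(len(mapping))
    return mapping


def to_english(sentence: List[str], mapping: Dict[str, str]) -> str:
    """Replace each Chinese word with its letter sequence, separated by spaces."""
    return " ".join(mapping[word] for word in sentence)


# Hypothetical segmented input (one sentence).
segmented = [["感谢", "您", "的", "耐心", "等待"]]
table = build_mapping([w for s in segmented for w in s])
english = to_english(segmented[0], table)
```

Because every word receives a distinct letter sequence, the table can be inverted later to map English key phrases back to Chinese.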

Embodiment 3

[0166] Based on the same inventive concept, Embodiment 3 of this application also provides a text processing device corresponding to the text processing methods provided in Embodiment 1 and Embodiment 2. Since the principle by which the device solves the problem is similar to that of the text processing methods in Embodiment 1 and Embodiment 2 above, the implementation of the device can refer to the implementation of the method, and repeated details are omitted.

[0167] Figure 3 is the first functional block diagram of a text processing device 300 provided in Embodiment 3 of the present application; Figure 4 is the second functional block diagram of the text processing device 300 provided in Embodiment 3; Figure 5 is a schematic structural diagram of an extraction module 340 in a text processi...

Abstract

The invention relates to the technical field of text processing, in particular to a text processing method and device. The text processing method includes the steps of: performing word segmentation processing on the obtained Chinese text to obtain multiple Chinese words; searching a preset Chinese-English mapping table for the English letter sequence corresponding to each Chinese word, so as to convert the Chinese text into a corresponding English text; extracting a plurality of English key phrases from the English text; and converting each English key phrase into a Chinese key phrase according to the Chinese-English mapping table. Compared with prior-art methods that extract Chinese key phrases based only on the Chinese text, the text processing method converts the Chinese text into English text through a preset rule, applies a mature English key phrase extraction algorithm to the English text, and then converts the English key phrases into Chinese key phrases, which can improve the accuracy and efficiency of Chinese key phrase extraction.
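
The last two steps of the abstract (extracting English key phrases, then converting them back to Chinese) might look like the sketch below. The abstract only says that a mature English key phrase extraction algorithm is used, without naming it, so a simple most-frequent-n-gram counter stands in here as an assumption. The mapping table values, the sample input, and the function names are likewise hypothetical.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical preset Chinese-English mapping table (illustrative values).
ZH_TO_EN: Dict[str, str] = {"文本": "aaa", "处理": "aab", "方法": "aac", "装置": "aae"}
EN_TO_ZH: Dict[str, str] = {en: zh for zh, en in ZH_TO_EN.items()}


def extract_english_phrases(lines: List[str], n: int = 2, top_k: int = 1) -> List[str]:
    """Stand-in extractor: return the top_k most frequent n-grams of tokens."""
    counts: Counter = Counter()
    for line in lines:
        tokens = line.split()
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return [phrase for phrase, _ in counts.most_common(top_k)]


def to_chinese(phrase: str) -> str:
    """Map each letter sequence back to its Chinese word and join without spaces."""
    return "".join(EN_TO_ZH[token] for token in phrase.split())


# Already-segmented hypothetical input, one sentence per list.
segmented = [["文本", "处理", "方法"], ["文本", "处理", "装置"]]
english_lines = [" ".join(ZH_TO_EN[w] for w in s) for s in segmented]
key_phrases = [to_chinese(p) for p in extract_english_phrases(english_lines)]
```

Any extractor that takes space-separated English text and returns phrases could replace `extract_english_phrases`; the back-conversion only relies on the mapping table being invertible.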

Description

Technical field

[0001] The present application relates to the technical field of text processing, and in particular to a text processing method and device.

Background

[0002] Automatic key phrase extraction is an important technology in text information processing. It is a key technology that must be applied in text understanding tasks such as automatic text summarization, automatic text classification, subject extraction, and patent retrieval analysis.

[0003] At present, in the prior art, the extraction of Chinese key phrases based on Chinese text suffers from problems such as character encoding and language habits, resulting in low efficiency and low accuracy of Chinese key phrase extraction.

Summary of the invention

[0004] In view of this, the purpose of the embodiments of the present application is to provide a text processing method and device that can improve the accuracy and efficiency of Chinese key phrase extraction.

[0005] It mainly includ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/33, G06F40/20, G06F40/289
Inventor: 王旭陈坦访王伟玮李奘
Owner: BEIJING DIDI INFINITY TECH & DEV