Text processing method, device and system

A text processing and text technology, applied in the fields of electronic digital data processing, special data processing applications, natural language data processing, etc., can solve the problem of low text classification accuracy, and achieve the effect of improving the accuracy.

Pending Publication Date: 2021-03-26
ALIBABA GRP HLDG LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The embodiment of the present application provides a text processing method, device, and system to at least solve the problem of using the default word vector as the word vector for all words that cannot be found in the word vector table when classifying text in the related art , leading to technical problems with low text classification accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method, device and system
  • Text processing method, device and system
  • Text processing method, device and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0039] According to an embodiment of the present application, an embodiment of a text processing method is also provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, and, although A logical order is shown in the flowcharts, but in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0040] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computing device, or a similar computing device. figure 1 A hardware structural block diagram of a computing device (or mobile device) for implementing the text processing method is shown. Such as figure 1 As shown, computing device 10 (or mobile device 10) may include one or more (shown as 102a, 102b, ..., 102n in the figure) processor 102 (processor 102 may include but not limited to ...

Embodiment 2

[0084] According to the embodiment of the present application, a text processing method is also provided, such as Figure 9 As shown, the method includes:

[0085] Step S402, acquiring the words to be recognized in the text to be processed.

[0086] In step S402, the text processing system can obtain the text to be processed, and obtain the word to be recognized from the text to be processed, wherein, the text to be processed can be the text that the user actively inputs to the text processing system, or the text processing system can obtain the word from the text processing system. The text read at the preset position, preferably, in this application, the text to be processed is Chinese text.

[0087] In an optional embodiment, after obtaining the text to be processed, the text processing system preprocesses the text to be processed to obtain the processed text to be processed, and then performs word segmentation processing on the processed text to be processed to obtain mul...

Embodiment 3

[0103] According to the embodiment of the present application, a text processing method is also provided, such as Figure 12 As shown, the method includes:

[0104] Step S1202, acquiring the words to be recognized in the text to be processed.

[0105] In step 1202, the text processing system can obtain the text to be processed, and obtain the words to be recognized from the text to be processed, wherein, the text to be processed can be the text that the user actively inputs to the text processing system, or the text processing system can obtain The text read at the preset position. Preferably, in this application, the text to be processed is a Chinese text. For example, in a product recommendation scenario, the text processing system can obtain the text to be processed from the search term or search statement entered by the user on the e-commerce platform, for example, in Figure 8 In the schematic diagram shown, the user enters "summer women's clothing social group" in the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text processing method, device and system. The method comprises the steps of obtaining a to-be-recognized word in a to-be-processed text; when it is detected that no word vector corresponding to the to-be-recognized word exists in the preset vector table, obtaining candidate words associated with the position of the to-be-recognized word from the to-be-processed text; combining the candidate words with the words to be recognized to obtain combined words; and querying a target word vector corresponding to the combined word from a preset vector table, and performing classification processing on the to-be-processed text based on the target word vector. The technical problem of low text classification accuracy caused by adoption of default word vectors as word vectorsof all words of which the word vectors cannot be queried in a word vector table during text classification processing in related technologies is solved.

Description

technical field [0001] The present application relates to the field of text processing, in particular, to a text processing method, device and system. Background technique [0002] With the development of science and technology, people get more and more information. In order to enable people to get more valuable information from a large amount of information, it is usually necessary to classify the information. Among them, text information usually requires Do text classification. [0003] In the process of text classification of text, especially Chinese text classification of Chinese text, a deep learning model is usually used for text classification. Each word in the sentence corresponding to the text can be represented by a K-dimensional vector, and the sentence can be represented It is a matrix of N*K, which is used as the input of the deep learning model, where N is the number of words in the sentence. [0004] In addition, word vectors are usually obtained through dat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06F40/289
CPCG06F16/355
Inventor 郝琪温伟高李东柯力
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products