Text processing method, device, storage medium and electronic device

A text processing and text technology, applied in the computer field, can solve problems such as the inability to distinguish polysemy or polysemy, and the inability to accurately identify text semantics, so as to achieve the effect of solving polysemy and improving accuracy

Active Publication Date: 2020-09-15
TENCENT TECH (SHENZHEN) CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing word vector methods all encode a given word into a unique vector for representation. The same word or the same word with different semantics will be represented as the same vector, and it cannot distinguish polysemy or one word in the text. polysemy, therefore, it is impossible to accurately identify text semantics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method, device, storage medium and electronic device
  • Text processing method, device, storage medium and electronic device
  • Text processing method, device, storage medium and electronic device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] In order to make the purpose, technical solution and advantages of the application clearer, the application will be further described in detail below in conjunction with the accompanying drawings. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0057] It should be noted that the terms "comprising" and "having" and their variants involved in the documents of this application are intended to cover non-exclusive inclusion, for example, a process, method, system, product that includes a series of steps or units Or the apparatus is not necessarily limited to those steps or units explicitly listed, but may include other steps or units not explicitly listed or inherent to the process, method, product or apparatus.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application provides a text processing method, device, storage medium and electronic equipment, which belong to the field of computer technology and relate to artificial intelligence and natural language processing technology. The application determines the word vector corresponding to the target word according to the related vectors of the target word and other target words in the context of the target word; for the same word with different semantics, it can be expressed as different word vectors according to different contexts, so it can be solved A question of polysemy. Then, the word vector is constructed based on the word vector. Since the same word with different semantics corresponds to different word vectors, the word vectors corresponding to the same word with different semantics will be obtained from different word vectors, and the word vectors corresponding to the same word with different semantics are also different. It can solve the problem of polysemy of a word, more accurately vectorize the text, improve the accuracy of text semantic recognition, and help predict the semantics of new words.

Description

technical field [0001] The present application relates to the field of computer technology, and more specifically, to a text processing method, device, storage medium and electronic equipment. Background technique [0002] In the process of human-computer interaction, in order to enable electronic devices to understand human natural language and serve users better, it is necessary to digitize natural language. Text vectorization refers to the process of converting natural language text into digital vectors that can be understood by machines. It is the main way for natural language processing and machine understanding of natural language. [0003] Word vectorization is to convert words in natural language into word vectors, which is an important step in the process of text vectorization. Existing word vectorization methods include one-hot representation and distributed representation. The one-hot encoding method assigns each word a vector by creating a vocabulary and sequen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/205G06F40/284G06F40/289G06F16/35G06N3/04G06N3/08
CPCG06F16/355G06N3/08G06N3/044G06N3/045
Inventor 刘志煌
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products