Supercharge Your Innovation With Domain-Expert AI Agents!

Text representation method and device and computer equipment

A text and representation technology, applied in computing, neural learning methods, instruments, etc., can solve the problems of incomplete expression of sentence meaning, inaccurate representation of current context information, and large ambiguity

Pending Publication Date: 2021-02-12
虎博网络技术(上海)有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Moreover, the Chinese word segmentation results of the word segmentation tool are not completely accurate, so analyzing a word as a semantic unit of a sentence cannot fully express the meaning of the sentence
Other models represent individual characters as a semantic unit, but a single character in Chinese is ambiguous and cannot accurately represent the current contextual information
There are also some models that directly combine Chinese characters and word information to represent, but for a word, some characters can express the semantics of the entire word, while some characters will not affect the semantics of the word, and cannot accurately represent current context information
[0004] It can be seen that the existing natural language model cannot reflect the importance of word-to-word and word-to-word for the combination of words and characters, and cannot accurately and perfectly represent the semantics of sentences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text representation method and device and computer equipment
  • Text representation method and device and computer equipment
  • Text representation method and device and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0066] The present invention provides a text representation method, and the text representation method can be used in the fields of natural language processing (Natural Language Processing, NLP for short) such as sentiment analysis, tag recognition, and translation. Such as figure 1 with figure 2 As shown, the steps of the text representation method provided by the present invention include:

[0067] Step S101, receiving input sentence text.

[0068] Step S102, performing a word segmentation operation on the sentence text to obtain a first word vector matrix and a first word vector matrix.

[0069] This embodiment provides a text representation method, which is used to represent the semantic features of the input sentence text. Specifically, a sentence text is input first, and the sentence text may be a text composed of multiple words.

[0070] After receiving the input sentence text, the word segmentation operation is performed on the sentence text, and a word vector mat...

Embodiment 2

[0116] Corresponding to the above text representation method, see Figure 5 , the embodiment of the present invention also provides a text representation apparatus 500 . Such as Figure 5 As shown, the text representation device 500 includes:

[0117] Input module 501, is used for receiving the sentence text of input;

[0118] The word segmentation module 502 performs a word segmentation operation on the sentence text to obtain the first word vector matrix and the first word vector matrix;

[0119] The combination module 503 dynamically combines the first word vector matrix and the first word vector matrix through an attention mechanism to obtain a word information background matrix and a word information background matrix;

[0120] The first splicing module 504 performs matrix splicing of the first word vector matrix and the word information background matrix to obtain a second word vector matrix, and performs matrix splicing of the first word vector matrix and the word in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a text representation method and device and computer equipment. The text characterization method comprises the steps of performing word segmentation on an input sentence text to obtain a first word vector matrix and a first character vector matrix; dynamically combining the first word vector matrix and the first character vector matrix through an attentionmechanism to obtain a word information background matrix and a character information background matrix; performing matrix splicing on the first word vector matrix and the word information backgroundmatrix to obtain a second word vector matrix, and performing matrix splicing on the first word vector matrix and the word information background matrix to obtain a second word vector matrix; and thencarrying out deep encoding on the second word vector matrix and the second character vector matrix to obtain a word matrix layer and a character matrix layer, and carrying out matrix splicing to obtain a complete encoding layer of the sentence text as a representation result corresponding to the sentence text. According to the text characterization method, sentence semantic information can be expressed more completely and accurately.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a text representation method, device and computer equipment. Background technique [0002] Using natural language to communicate with computers has been pursued by people for a long time. With the rapid development of deep learning technology, more and more deep learning technologies have been applied to various fields of natural language processing, and a variety of natural language models based on deep learning have also been proposed. How to better represent natural language models has become very important issue. [0003] Some models analyze a word obtained by a word segmentation tool as a semantic unit of a sentence when modeling a sentence in a text. For Chinese, a word is directly used as a semantic representation, ignoring the information between characters within a word. Moreover, the Chinese word segmentation results of the word segmentation tool are not co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/35G06F40/289G06F40/284G06N3/04G06N3/08
CPCG06F40/35G06F40/289G06F40/284G06N3/084G06N3/045G06N3/044Y02D10/00
Inventor 蔡薇黄帅尹亚维
Owner 虎博网络技术(上海)有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More