Check patentability & draft patents in minutes with Patsnap Eureka AI!

Text representation method and device, computer equipment and storage medium

A text representation and text technology, applied in the field of information processing, can solve the problems of insufficient comprehensiveness and poor effect

Pending Publication Date: 2022-05-20
广州欢聊网络科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0014] The effect is poor for words with low word frequency;
[0015] Measuring the importance of a word simply by "word frequency" is not comprehensive enough, and sometimes important words may not appear many times;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text representation method and device, computer equipment and storage medium
  • Text representation method and device, computer equipment and storage medium
  • Text representation method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0077] figure 1 A flow chart of a text representation method provided in Embodiment 1 of the present invention, the method can be executed by a text representation device, the text representation device can be implemented by software and / or hardware, and can be configured in a computer device, for example, a server , personal computers, and so on. The text representation method specifically includes the following steps:

[0078] Step 101. Obtain the behavior sequence data and voice text data of the anchor publishing voice.

[0079] The behavior of a voice released by the anchor is more or less related to the behavior of other voices released before and after the behavior.

[0080] Sound texts can be stored in the database as data such as sound titles, sound descriptions, custom tags (first-level tags, second-level tags, etc.). Of course, in practical applications, the above audio text may also contain other information or be replaced by other information, etc. The specific ...

Embodiment 2

[0121] figure 2 It is a schematic structural diagram of a text display device provided in Embodiment 2 of the present invention, and the text display device may specifically include the following modules:

[0122] The acquisition module 201 is used to acquire the behavior sequence data and voice text data of the anchor's voice release.

[0123] The session data extraction module 202 is configured to extract session data based on behavior sequence data.

[0124] The word segmentation module 203 is configured to perform word segmentation processing on the audio text, and determine the target word segmentation of the audio text according to the obtained word segmentation.

[0125] The word vector generation module 204 is used to input the target word segmentation into the word vector generation model to obtain the word vector of the target word segmentation; the word vector generation model is obtained by training a preset model, and the training of the word vector generation m...

Embodiment 3

[0134] image 3 It is a schematic structural diagram of a computer device provided by Embodiment 3 of the present invention. image 3 A block diagram of an exemplary computer device 12 suitable for implementing embodiments of the invention is shown. image 3 The computer device 12 shown is only an example, and should not impose any limitation on the functions and scope of use of the embodiments of the present invention.

[0135] Such as image 3 As shown, computer device 12 takes the form of a general-purpose computing device. Components of computer device 12 may include, but are not limited to: one or more processors or processing units 16 , system memory 28 , bus 18 connecting various system components including system memory 28 and processing unit 16 .

[0136] Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a text representation method and device, computer equipment and a storage medium. In the embodiment of the invention, the method constructs the context based on the sound release behavior of the anchor, obtains the vector representation of the words in the sound text by using the word vector generation model, and obtains the text representation related to the classification task in combination with the different importance and correlation of the different words in the classification task to each category. The words more related to the classification task can be more strongly expressed, and the text expression of the sound is remarkably improved.

Description

technical field [0001] The present invention relates to the technical field of information processing, in particular to a text representation method, device, computer equipment and storage medium. Background technique [0002] Natural Language Processing (NLP) is an important direction in the field of computer science and artificial intelligence. It studies various theories and methods that can realize effective communication between humans and computers using natural language. Natural language processing is mainly used in machine translation, automatic summarization, opinion extraction, text classification, question answering, text semantic comparison, speech recognition, Chinese OCR, etc. [0003] Text representation methods occupy an important position in the field of NLP. The text representation method refers to the vectorization method of text. Representing text as a vector containing semantic information is helpful for applications such as classification, retrieval, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/216G06F16/35
CPCG06F40/289G06F40/216G06F16/35
Inventor 谭又伟丁宁
Owner 广州欢聊网络科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More