Supercharge Your Innovation With Domain-Expert AI Agents!

Method and device for representing text

A text representation and text technology, applied in the field of information processing, can solve the problems of low accuracy, low accuracy of text processing, and no consideration of sentence correlation, etc., to achieve the effect of improving accuracy

Active Publication Date: 2015-07-15
新浪技术(中国)有限公司
View PDF6 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In the text representation method provided by the above-mentioned prior art, when selecting feature words, the semantics of the feature words in the sentence are not considered, and the correlation between sentences is not considered, but the frequency of mechanical extraction from the text is greater than the preset value. In addition, since the feature words in the text vector are words in the text, independent words may have multiple meanings and cannot accurately express the connotation of the text. Therefore, the accuracy of the text vector to express the text is low , correspondingly, the accuracy of text processing is lower

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for representing text
  • Method and device for representing text
  • Method and device for representing text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the purpose, technical solution and advantages of the present invention clearer, the technical solution of the present invention will be clearly and completely described below in conjunction with specific embodiments of the present invention and corresponding drawings. Apparently, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0025] see figure 1 , is a schematic flowchart of a text representation method provided by an embodiment of the present invention, including:

[0026] S101: Determine each word constituting the current text.

[0027] In the embodiment of the present invention, the current text is the text obtained by the server and needs to be expressed in text. The text can be ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a device for representing a text, which are used for increasing the accuracy of text representation so as to increase the accuracy of text processing. The method comprises the steps of determining all words of a current text, determining word vectors of all the words, clustering all the word vectors, determining feature words and the weight of the feature words in the current text from all the words according to a clustering result, and determining the text vector of the current text according to the word vectors and the weight of the feature words. In such a way, the semanteme of the words in sentences and a correlation between the sentences are considered in the process of determining the feature words through clustering, the connotation of the text can be accurately represented by the determined word vectors of the feature words, thus the accuracy of the text representation is increased, and the accuracy of the text processing is further increased.

Description

technical field [0001] The invention relates to information processing technology, in particular to a text representation method and device. Background technique [0002] In the field of information processing technology, text processing is often involved. Text processing refers to the processing of text retrieval, text classification, and text analysis on the text content after text representation. Among them, text representation refers to turning the original text content into the internal representation structure of the computer. The internal representation structure is a computer program. Analyzable structure, for example, words, phrases, etc. in the text content can be used to form a computer-analyzable vector structure. [0003] The higher the accuracy of text representation, the more accurately the connotation of the current text can be expressed, the better the effect of text processing and the higher the efficiency, on the contrary, the lower the accuracy of text r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F17/30
Inventor 刘洋
Owner 新浪技术(中国)有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More