Text information representation method and system, computer equipment and storage medium

A text information and characterization technology, which is applied in computing, instruments, electrical digital data processing, etc., can solve the problems of inapplicability and inaccurate representation of text information, and achieve the effect of accurate characterization

Pending Publication Date: 2020-05-05
CHINA PING AN LIFE INSURANCE CO LTD
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In view of this, the embodiments of the present invention provide a text information characterization method, system, computer equipment, and storage medium to solve the problem that the text information characterization based on word vectors in the prior art is not accurate enough and is not suitable for article information in the field of information flow recommendation. The problem of representation of textual information of the class

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text information representation method and system, computer equipment and storage medium
  • Text information representation method and system, computer equipment and storage medium
  • Text information representation method and system, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field of the invention.

[0044] The appearances of the phrase "an embodiment" in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It is understood explicitly and implicitly by those skilled in the art that the embodiments described herein can be combined with other embodiments.

[0045] An embodiment of the present invention provides a text information representation method, such as figure 1...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of artificial intelligence, the invention relates to a text information characterization method and a system, computer equipment and a storage medium, and the methodcomprises the steps: obtaining a to-be-analyzed corpus, carrying out the word segmentation preprocessing of the to-be-analyzed corpus, generating corresponding word vectors based on the obtained segmented words, enabling the to-be-analyzed corpus to be text information, and enabling the text information to comprise at least one statement; obtaining word vectors of segmented words contained in each statement in the to-be-analyzed corpus to obtain a word vector group of each statement, and sequentially inputting the word vectors in the word vector group into the initial sentence vector algorithm model according to a sequence to generate initial sentence vectors of the corresponding statements; and inputting the initial sentence vector into a pre-trained sentence vector model to obtain a final sentence vector of each sentence, the final sentence vector being used for representing text information, and the pre-trained sentence vector model being generated based on a context relationship of the sentences. According to the method provided by the invention, the influence caused by different semantics of words in different sentences can be avoided, and the representation of the text information is more accurate.

Description

technical field [0001] The embodiments of the present invention belong to the technical field of artificial intelligence, and in particular relate to a text information representation method, system, computer equipment, and storage medium. Background technique [0002] In the field of natural language processing, text information representation is the basis for solving text processing problems. In the prior art, word vector summation based on Word2Vec is generally used as the text information representation method, but the semantics of the same word in different sentences and different contexts are different, so the representation of text information based on word vectors is inaccurate, and it is not suitable for the representation of text information such as article information in the field of information flow recommendation. Contents of the invention [0003] In view of this, the embodiments of the present invention provide a text information characterization method, sys...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/289G06F40/30
CPCY02D10/00
Inventor 侯晓龙
Owner CHINA PING AN LIFE INSURANCE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products