Text association relationship discrimination method and storage medium

A technology of association relationship and discrimination method, which is applied in the field of machine learning, can solve the problem of low accuracy of judging the association, and achieve the effect of high accuracy of correlation judgment, little influence of text length, and strong semantic understanding ability

Active Publication Date: 2020-05-01
FUJIAN YIRONG INFORMATION TECH +6
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] For this reason, it is necessary to provide a method for discriminating text associations to solve the problem of low accuracy in judging the associations of texts in the prior art;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text association relationship discrimination method and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] In order to explain in detail the technical content, structural features, achieved goals and effects of the technical solution, the following will be described in detail in conjunction with specific embodiments and accompanying drawings.

[0020] see figure 1 , a text association relationship discrimination method, comprising the following steps,

[0021] S100 preprocesses the input corpus to obtain a preprocessed text,

[0022] Among them, corpus preprocessing includes: cleaning up meaningless characters such as blanks at the beginning and end of the corpus, newlines, etc.; splicing multi-line corpus into one line; converting traditional characters in the corpus into simplified characters, etc. This preprocessing step is used to enhance the accuracy of the corpus Recognizability, while reducing the impact of formatting issues on the consistency of text expression.

[0023] The preprocessed text here may also be a collection of text fields after segmentation or sectio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text association relationship discrimination method and a storage medium. The method comprises the following steps, preprocessing input corpus; to acquire a preprocessed text, judging whether a text field in the preprocessed text is an ultra-short text or not, rewriting the ultra-short text according to an industry knowledge graph to obtain a rewritten text, inputting the rewritten text into a BERT model for training and analysis, and obtaining an association relationship judgment result output by the model. The invention can be combined with an ultra-short text rewriting technology to judge the short text association relationship. Compared with an existing scheme, the invention has the advantages of being high in semantic comprehension capacity, small in text length influence and high in correlation judgment accuracy, and in conclusion, the problem that in the prior art, industry text correlation judgment is still not accurate enough is solved.

Description

technical field [0001] The invention relates to the field of machine learning, in particular to an intelligent discrimination method for text association relations. Background technique [0002] With the rise of the era of big data, short text has become an increasingly common type of text data, such as article summaries, news titles, official document titles, Weibo, WeChat, customer reviews, etc. There are rich differences between these data Semantic association, how to establish an efficient association relationship mining model, and mine potential semantic information from a large number of short text data sets has become the cornerstone of many applications. Through the relationship mining model, we can obtain the potential relationship between a large number of short texts, which can be applied to many text mining tasks such as search, interest analysis, content recommendation, topic detection, text classification, and knowledge map construction. [0003] Power grid co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/36G06F16/33G06F16/383G06F40/289G06Q50/06
CPCG06F16/367G06F16/3344G06F16/383G06Q50/06Y02D10/00
Inventor 张垚庄莉梁懿苏江文王秋琳彭放李君婷丁勇王端瑞尚颖刘瑞芳
Owner FUJIAN YIRONG INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products