Method and system for identifying cross-tag risk based on text mining

A text mining and risk technology, applied in text database indexing, text database query, unstructured text data retrieval, etc., can solve problems such as inaccurate and fast positioning, and achieve the effect of reducing workload

Pending Publication Date: 2020-06-05
CHINA SOUTHERN POWER GRID COMPANY +1
View PDF2 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the expressions of feature words are often different, these methods cannot accurately and quickly locate risk points and problems in bidding documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for identifying cross-tag risk based on text mining
  • Method and system for identifying cross-tag risk based on text mining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The concept, specific structure and technical effects of the present disclosure will be clearly and completely described below in conjunction with the embodiments and drawings, so as to fully understand the purpose, scheme and effect of the present disclosure. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other.

[0046] Such as figure 1 Shown is a flow chart of a method for identifying the risk of collusion based on text mining according to the present disclosure, combined below figure 1 A method according to an embodiment of the present disclosure will be described.

[0047] This disclosure proposes a method for identifying the risk of collusion based on text mining, which specifically includes the following steps:

[0048] S100: Read bidding text data;

[0049] S200: Preprocessing the bidding text data to obtain the first bidding text data;

[0050] S300:...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a system for identifying a cross-tag risk based on text mining. The method includes: carrying out word segmentation after preprocessing; converting the words intostructured bidding and tendering text data according to the label; extracting a subject term of a clause text in each label in the bidding and tendering text data and selecting the subject term with the highest word frequency as the subject term; performing similarity comparison on the subject term and the subject term of the term text in each label in all the bidding text data in the knowledge base to obtain a contrast ratio, and when the contrast ratio is greater than a preset similarity threshold, marking the bidding text data as abnormal. Abnormal bidding and tendering information can be conveniently and automatically detected, bidding and tendering abnormal points can be quickly positioned, new knowledge can be intelligently and autonomously learned, risk points and bidding and tendering problems can be accurately and quickly positioned, the workload of bidding and tendering examination is greatly reduced, and risks in bidding and tendering are displayed in time.

Description

technical field [0001] The present disclosure relates to the fields of text data processing and natural language processing, and in particular to a method and system for identifying risks of collusion based on text mining. Background technique [0002] When checking the text of bidding documents (bidding technical documents), there are many repetitive structured texts that need to be checked repeatedly. If it is manually checked, it is easy to make mistakes and has high repetition, and many problems are very Obscure; and the records of bidding text generally exist in the form of unstructured text, so it is not friendly to automated text processing and it is difficult to accurately process data; [0003] The current risk detection methods for bidding texts usually use preset bidding type templates to help quickly locate problems in bidding texts, manually extract characteristic words, and use characteristic words to complete bidding through preset rules. Rapid detection of b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/31G06F16/33
CPCG06F16/313G06F16/3335
Inventor 王淼金昌铉程俊春马博朱宇龙赵永国刘森黎晚晴张君梁惠欣
Owner CHINA SOUTHERN POWER GRID COMPANY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products