Company short name identification method and system based on text rules

A method and system for recognizing company abbreviations (short names) based on text rules. It addresses the low recognition accuracy that arises because the denotation of an abbreviation varies between contexts and abbreviations take many forms of expression, and it improves both the recall rate and the overall recognition effect.

Active Publication Date: 2017-12-01
GUANGZHOU WANLONG SECURITIES CONSULTING CO LTD

AI Technical Summary

Problems solved by technology

Chinese company abbreviations are generally hard to identify because the denotation of an abbreviation differs across fields and scenarios, names change frequently, there are no strict rules to follow, and the forms of expression vary. These factors readily degrade recognition, resulting in lower recognition accuracy.



Examples


Specific embodiment

[0057] S01. Load the full name of the company to be identified;

[0058] S02. Using the full name of the company to be identified, load the announcement texts published for that listed company;

[0059] S03. For each listed-company announcement text (denoted Article_1), extract the sentences and paragraphs of Article_1 in which the full name appears (denoted Sect_1);

[0060] S04. Extract abbreviations from Sect_1 using Chinese word segmentation and context rule features;

[0061] S05. In Article_1, extract a text block in the form of a table (denoted as table_1), and perform abbreviation extraction based on table features for table_1;

[0062] S06. Determine whether the abbreviation to be detected is valid, if so, end the identification process; otherwise, continue the identification process;

[0063] S07. Using the full name of the company to be identified, search Baidu web pages according to the preset search rules (fo...
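Steps S03 and S04 can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the source does not list its context rules, so the patterns in `CONTEXT_RULES` (phrases such as "以下简称", "hereinafter referred to as") are assumptions, as are the function names and the sample company name. The Chinese word segmentation part of S04 is omitted; only the rule-based part is shown.

```python
import re

# Hypothetical context rules (NOT taken from the patent text): each pattern
# captures a quoted short name that follows an introducing phrase.
CONTEXT_RULES = [
    re.compile(r'(?:以下)?简称[::]?\s*[“"「]([^”"」]{2,20})[”"」]'),
    re.compile(r'hereinafter(?: referred to as)?\s*[“"]([^”"]{2,40})[”"]',
               re.IGNORECASE),
]


def split_sentences(article):
    """Split text on Chinese/Western sentence terminators and newlines."""
    parts = re.split(r'[。!?!?;;\n]+', article)
    return [p.strip() for p in parts if p.strip()]


def extract_sect(article, full_name):
    """S03: keep only the sentences in which the full name appears."""
    return [s for s in split_sentences(article) if full_name in s]


def extract_candidates(sentences):
    """S04 (rule part only): collect short names matched by the context rules."""
    found = []
    for sentence in sentences:
        for rule in CONTEXT_RULES:
            for match in rule.finditer(sentence):
                name = match.group(1).strip()
                if name not in found:  # keep first-seen order, no duplicates
                    found.append(name)
    return found


if __name__ == "__main__":
    # Illustrative input; the company name and sentence are made up for the demo.
    full_name = "中国银行股份有限公司"
    article_1 = "中国银行股份有限公司(以下简称“中行”)发布公告。本公告与其他公司无关。"
    sect_1 = extract_sect(article_1, full_name)
    print(extract_candidates(sect_1))
```

Restricting the rules to sentences that contain the full name (Sect_1) keeps a short name introduced for some other company in the same announcement from being picked up.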


Abstract

The invention discloses a company short name identification method and system based on text rules. The method comprises: extracting short names from the full name of the company to be identified and the corresponding listed-company announcement texts, thereby obtaining candidate short names; analyzing the validity of the candidate short names; and identifying short names through web search based on the full name of the company to be identified. The system comprises an announcement text analysis unit and a search and analysis unit. By combining announcement text mining with web search rules, the method and system maintain the accuracy of Chinese company short name identification while greatly improving the recall rate, which effectively improves the identification effect. The method and system can be widely applied in the field of identification.
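The two-stage flow described in the abstract (announcement candidates first, web search when none of them is valid) can be sketched as below. The validity test is an assumed heuristic, not the patent's: a candidate is accepted if it is shorter than the full name, is not merely a legal-form suffix, and its characters occur in order in the full name. Real short names do not always satisfy this, so treat it as a simplification. `LEGAL_SUFFIXES`, the function names, and the `search_fallback` callable are all hypothetical, and applying the same validity check to search results is also an assumption.

```python
# Assumed list of legal-form suffixes that should never count as a short name.
LEGAL_SUFFIXES = ("股份有限公司", "有限责任公司", "有限公司")


def is_subsequence(short, full):
    """True if the characters of `short` occur in `full` in the same order."""
    remaining = iter(full)
    return all(ch in remaining for ch in short)


def is_valid_short_name(short, full_name):
    """Assumed validity heuristic (the source does not specify its criteria)."""
    if not (2 <= len(short) < len(full_name)):
        return False
    if short in LEGAL_SUFFIXES:
        return False
    return is_subsequence(short, full_name)


def identify_short_names(full_name, announcement_candidates, search_fallback):
    """Return valid announcement candidates; otherwise fall back to search.

    `search_fallback` is a caller-supplied function mapping a full name to
    candidate short names (for example, a wrapper around a web search).
    It is only called when no announcement candidate is valid.
    """
    valid = [c for c in announcement_candidates
             if is_valid_short_name(c, full_name)]
    if valid:
        return valid
    return [c for c in search_fallback(full_name)
            if is_valid_short_name(c, full_name)]


if __name__ == "__main__":
    full_name = "中国银行股份有限公司"  # illustrative name only
    stub_search = lambda name: ["中国银行"]  # stand-in for a real search step
    print(identify_short_names(full_name, ["中行"], stub_search))
    print(identify_short_names(full_name, ["工行"], stub_search))
```

Passing the search step in as a callable keeps the sketch free of network access and lets the fallback be replaced or mocked independently of the announcement analysis.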

Description

Technical field

[0001] The present invention relates to the field of recognition processing, in particular to a company abbreviation recognition method and system based on text rules.

Background

[0002] Because the naming rules for Chinese company names are loose, the names are used casually and often appear in abbreviated form; for example, "Bank of China Co., Ltd." often appears as "Bank of China". This makes company names difficult to recognize and to use.

[0003] At present, there is no abbreviation identification method with a relatively high recall rate on the market. Chinese company abbreviations are generally hard to identify because the denotation of an abbreviation differs across fields and scenarios, names change frequently, there are no strict rules to follow, and the forms of expression vary. The effect can easily aff...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30
CPC: G06F16/313, G06F40/295
Inventor: 吴远辉
Owner: GUANGZHOU WANLONG SECURITIES CONSULTING CO LTD