Enterprise entity name analysis and identification system

A technology of enterprise name and entity name, applied in special data processing applications, instruments, biological neural network models, etc., can solve the problems of time-consuming and labor-intensive feature templates, poor generality, and time-consuming and labor-intensive prediction process.

Inactive Publication Date: 2016-09-28
成都数联铭品科技有限公司
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

To use conditional random fields, you first need to design and construct feature templates based on the characteristics of the entity to be recognized. Feature templates include first-order words or multi-order phrases with a specified window size context, word prefixes, suffixes, part-of-speech tags and other status features; feature templates The construction is very time-consuming and labor-intensive, and the recognition results are highly dependent on the feat

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Enterprise entity name analysis and identification system
  • Enterprise entity name analysis and identification system
  • Enterprise entity name analysis and identification system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] The discovery process of the new company name in this system is as follows: For example, the following news text was obtained on the Internet: "On March 15, XXXX, the fifth meeting of the seventh board of directors of the company reviewed and approved the "About the Company and its Wholly-owned Subsidiaries" Proposal on the Company’s Investment and Establishment of Subsidiaries”, the six wholly-owned subsidiaries to be established by the company are ABCD Medical Investment Management Co., Ltd., ABCD Pharmaceutical E-Commerce Co., Ltd., ABCD Investment Fund Management Co., Ltd., ABCD New Energy Co., Ltd., and ABCD Fundamentals Co., Ltd. Facility Investment Co., Ltd., ABCD Investment Co., Ltd. Investment amount: the total investment amount is equivalent to about 630 million yuan." After word segmentation, we get: "XXXX year / March / 15th / announcement / , / company / seventh / session / The board of directors / fifth / time / meeting / , / considered / approved / " / about / company / and / wholly-owned / su...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of natural language processing, in particular to an enterprise entity name analysis and identification system. The system comprises a bidirectional recurrent neural network module; the system uses an enterprise name labeling training sample stored in an existing enterprise name database to train a bidirectional recurrent neural network; the bidirectional recurrent neural network identifies enterprise names in to-be-processed texts and extracts names which do not belong to existing enterprise names; the system enables a classification and judgment result of to-be-identified natural language sequences to depend on context information and enables the preparation rate of extraction and judgment to be higher by performing characteristic automatic learning on basic elements of the texts, such as characters, words, punctuation marks and the like, and applying the bidirectional RNN (Recurrent Neural Network); and the system discovers new enterprise entity names through existing data characteristics and has important application values in the field of big data analysis, particularly in the field of data analysis taking enterprises as analysis main bodies.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to an enterprise entity name analysis and recognition system. Background technique [0002] With the rapid development of the Internet, a large amount of public web data has been generated, which has also spurred various emerging industries based on big data technology, such as Internet medical care, Internet education, corporate or personal credit investigation, etc. The rise of these Internet industries is inseparable from the analysis of a large amount of information and data, and the value of information analysis lies in its accuracy and sensitivity. Sensitive analysis requires timely and rapid discovery of new information; however, most of the data obtained directly from web pages are very Structured, in order to use these data, data cleaning has become the place where companies spend the most time and energy. In data cleaning, the extraction of specific information,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06N3/08
CPCG06F40/295G06N3/08
Inventor 刘世林何宏靖
Owner 成都数联铭品科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products