A method for extracting enterprise name keywords
A technology of enterprise name and extraction method, which is applied in the field of data processing, can solve the problems of large investment and increased difficulty, and achieve the effect of high coverage
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0036] see figure 1 , the invention discloses a method for extracting enterprise name keywords, comprising the following steps:
[0037] S1. Build a basic hot word library related to the name of the enterprise, and tag the hot words in the basic hot word library to define the tag categories of the hot words. The basic hot thesaurus is built by the following methods:
[0038] S11. Prepare enterprise name data in advance. In this embodiment, the enterprise name data is collected by a web crawler, and the enterprise name data contains more than 40 million enterprise names.
[0039] S12. Perform Chinese word segmentation processing on the enterprise name data. The Chinese word segmentation processing is to utilize IKAnalyzer word segmentation device, word segmentation device, Ansj word segmentation device or Stanford word segmentation device to carry out Chinese word segmentation processing, certainly also can adopt other word segmentation device, the present invention does not...
example 1
[0060] 1. In step S2, the user inputs "Xiamen Meiya Shangding Information Technology Co., Ltd.", and the word segmentation result is:
[0061] {Xiamen, Xiamen City, Meiya, Yashang, Information Technology Co., Ltd., Information, Technology Co., Ltd., Technology Co., Ltd., Technology, Co., Ltd., Co., Ltd.}
[0062] 2. In step S3, the obtained array arrs_a (that is, the word segmentation matched with the hot thesaurus) is:
[0063] {Xiamen, Xiamen City, Information Technology Co., Ltd., Information, Technology Co., Ltd., Technology Co., Ltd., Technology, Co., Ltd.}
[0064] 3. In step S4, the sorted array arrs_a is:
[0065] {Information Technology Co., Ltd., Technology Co., Ltd., Technology Co., Ltd., Xiamen City, Company, Technology, Information, Xiamen}
[0066] 4. In step S5, the blank operation process is as follows:
[0067]
[0068] The final result is: Meiya Shang Ding.
[0069] 5. In step S6, it is determined that the length of "Meiya Shangding" is greater than 2,...
example 2
[0071] 1. The user enters "Xiamen Beichen Shanchuan Culture Communication Co., Ltd.", and executes steps S2-S6. The company name is all replaced with blanks, and the result is "", and executes step S7.
[0072] 2. The execution process of step S7 is:
[0073]
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com