Mechanism feature vocabulary extension system and method for public opinion crawling

A technology of vocabulary expansion and feature words, applied in transmission systems, digital transmission systems, instruments, etc., can solve the problem of insufficient comprehensiveness in obtaining public opinion data, achieve the effect of reducing the amount of useless information, improving the quality of public opinion information, and improving quality

Pending Publication Date: 2020-02-28
中科天玑数据科技股份有限公司 +1
View PDF11 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide a system and method for expanding organization characteristic vocabulary for public opinion crawling, so as to solve the problem that public opinion data is not comprehensive enough due to incomplete organization characteristic words

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mechanism feature vocabulary extension system and method for public opinion crawling
  • Mechanism feature vocabulary extension system and method for public opinion crawling
  • Mechanism feature vocabulary extension system and method for public opinion crawling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0065] Specifically, such as figure 1 As shown, the present embodiment provides a system for expanding the institutional characteristic vocabulary for public opinion crawling, including:

[0066] Data acquisition module: used to collect data;

[0067] Feature word cleaning and processing module: used for preliminary screening of feature words;

[0068] Feature word statistical analysis module: used to further filter feature words through correlation analysis, and finally generate extended feature words.

[0069] Using the above scheme, useless feature words are screened out and analyzed to generate extended feature words and comprehensively and quickly collect relevant public opinion information. data, improve retrieval efficiency and quality, and reduce memory usage.

[0070] In a preferred implementation manner of this embodiment, the data collection module includes:

[0071] The candidate feature word unit is used to collect intellectual property information, investment...

Embodiment 2

[0088] Such as figure 2 As shown, this embodiment provides a method for expanding organization characteristic vocabulary for public opinion crawling, including the following steps:

[0089] Data collection;

[0090] Preliminary screening of feature words;

[0091] Through correlation analysis, feature words are further screened, and finally extended feature words are generated.

[0092] Using the above scheme, useless feature words are screened out and analyzed to generate extended feature words and comprehensively and quickly collect relevant public opinion information. data, improve retrieval efficiency and quality, and reduce memory usage.

[0093] In a preferred implementation of this embodiment, the data collection includes the following steps:

[0094] Collect intellectual property information, investment information or product information, and organize them as candidate feature words;

[0095] Using the above scheme, it is possible to search all kinds of informati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a mechanism feature vocabulary extension system for public opinion crawling. The mechanism feature vocabulary extension system comprises a data acquisition module for acquiringdata; a feature word cleaning and processing module, used for preliminarily screening feature words; and a feature word statistical analysis module, used for further screening the feature words through relevancy analysis and finally generating expansion feature words. On the other hand, the invention provides a mechanism feature vocabulary extension method for public opinion crawling. By the adoption of the scheme, useless feature words are screened out and analyzed, expanded feature words are generated, related public opinion information is comprehensively and rapidly collected, on one hand,missing inspection is effectively avoided, on the other hand, useless data added to the useless feature words is reduced, retrieval efficiency and quality are improved, and memory occupation is reduced.

Description

Technical field: [0001] The present invention relates to the field of natural language processing, in particular to a system and method for expanding institutional characteristic vocabulary for public opinion crawling. Background technique: [0002] With the rapid development of the Internet, the Internet has become an important and fast platform for people to obtain information and participate in exchanges. Public opinion has been endowed with more meanings, and the importance of public opinion has become increasingly prominent, whether for enterprises or regulatory agencies. For enterprises, improving the analysis ability of network public opinion under the new situation, grasping the dynamics of public opinion in a timely and accurate manner, and scientifically guiding network public opinion will help improve corporate reputation and prevent corporate risks. For regulatory agencies, monitoring corporate network public opinion can assist in understanding corporate operati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284H04L12/24
CPCH04L41/147
Inventor 刘少杰贺敏杜慧孙庆王秀文董琳郭富民杜漫余智华
Owner 中科天玑数据科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products