
A hot word analysis and statistics system and method

A hot word statistics system and technique, applied in the field of data processing. It addresses the poor accuracy of existing word frequency statistics and the lack of overall statistics over data resources, and it ensures statistical accuracy while remaining highly scalable.

Active Publication Date: 2018-05-04
迪爱斯信息技术股份有限公司
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0006] However, Lucene's word frequency statistics are used to calculate the scores of hit results, not as overall statistics of the data resources. Solr's word frequency statistics are used to implement the auto-completion function. Although they do serve as overall statistics of the data resources, a target word that occurs several times across multiple fields (i.e. "domains") of an information unit is counted only once for that unit. These are coarse-grained statistics, and their accuracy is poor.
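
The two counting granularities described above can be contrasted with a minimal sketch. It does not call Lucene or Solr; the record layout, the field names and both helper functions are hypothetical, chosen only to show the difference between counting a word once per information unit and counting every occurrence in every field.

```python
from collections import Counter

# Hypothetical information units: each maps a field (domain) name to its tokens.
records = [
    {"title": ["theft", "report"], "body": ["theft", "of", "bicycle", "theft"]},
    {"title": ["traffic", "accident"], "body": ["bicycle", "accident"]},
]

def coarse_counts(units):
    """Coarse-grained: count each word at most once per information unit."""
    counts = Counter()
    for unit in units:
        seen = set()
        for tokens in unit.values():
            seen.update(tokens)
        counts.update(seen)
    return counts

def fine_counts(units):
    """Fine-grained: count every occurrence of a word in every field."""
    counts = Counter()
    for unit in units:
        for tokens in unit.values():
            counts.update(tokens)
    return counts

coarse = coarse_counts(records)
fine = fine_counts(records)
print("theft:", coarse["theft"], "vs", fine["theft"])
print("accident:", coarse["accident"], "vs", fine["accident"])
```

In this toy data "theft" appears three times, all inside the first record, so the coarse count understates it, while the fine-grained count reflects every occurrence.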



Examples


Embodiment Construction

[0055] To illustrate the embodiments of the present invention and the technical solutions of the prior art more clearly, specific implementations of the present invention are described below with reference to the accompanying drawings. The drawings described below show only some embodiments of the present invention; those skilled in the art can derive other drawings and other implementations from them.

[0056] Figure 1 shows the hot word analysis and statistics system provided by the present invention. As can be seen from the figure, the system includes an analysis topic module 10, a focus vocabulary module 20, a word segmentation service module 30, an index service module 40, a word element statistics module 50 and a hot word analysis module 60, wherein the word segmentation service module 30 is connected with the analysis topic module 10 and the focus ...


Abstract

The present invention provides a hot word analysis and statistics system and method. The system includes: an analysis topic module, used to determine the analysis data sources, define the analysis topics and define the data type of each field (domain); a focus vocabulary module, used to form the focus vocabulary sequence; a word segmentation service module, used to extract the data in the corresponding fields and segment it according to the focus vocabulary sequence to generate a sequence of word elements; an index service module, used to record the index position of each word element in its field together with the data corresponding to that word element, and to generate a word element index file; a word element statistics module, used to count the occurrences of each word element; and a hot word analysis module, used to feed the generated hot words, their related information and their word frequencies back to the user. The invention achieves accurate hot word statistics and indexed storage of hot word associations, meets the need to analyze hot words by category, and improves the accuracy of hot word statistics.
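
The module pipeline in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: segmentation is replaced by plain substring matching against the focus vocabulary, the index is an in-memory dictionary rather than an index file, and the names `segment`, `build_index` and `hot_words`, the sample records and the threshold value are all hypothetical. The threshold follows the description's definition of a hot word as a word whose frequency reaches a certain threshold.

```python
from collections import defaultdict

def segment(text, vocabulary):
    """Stand-in for word segmentation: find every occurrence of a focus word.

    Returns (word, character offset) pairs ordered by offset.
    """
    hits = []
    for word in vocabulary:
        start = text.find(word)
        while start != -1:
            hits.append((word, start))
            start = text.find(word, start + 1)
    return sorted(hits, key=lambda hit: hit[1])

def build_index(records, fields, vocabulary):
    """Map each word element to the (record id, field, offset) of every occurrence."""
    index = defaultdict(list)
    for record_id, record in records.items():
        for field in fields:
            for word, offset in segment(record.get(field, ""), vocabulary):
                index[word].append((record_id, field, offset))
    return index

def hot_words(index, threshold):
    """Return (word, frequency) pairs whose frequency reaches the threshold."""
    counts = {word: len(postings) for word, postings in index.items()}
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(word, n) for word, n in ranked if n >= threshold]

# Hypothetical analysis topic: two text fields per record.
records = {
    1: {"title": "bicycle theft", "body": "theft of a bicycle near the station"},
    2: {"title": "traffic accident", "body": "bicycle accident, no theft involved"},
}
fields = ["title", "body"]
vocabulary = ["theft", "bicycle", "accident"]  # focus vocabulary sequence

index = build_index(records, fields, vocabulary)
result = hot_words(index, threshold=3)
print(result)
```

Because each posting keeps the record id, field and offset, a reported hot word can be traced back to the data it came from, which is the association the index service module is described as storing.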

Description

Technical field

[0001] The present invention relates to the technical field of data processing, in particular to a hot word analysis and statistics system and method.

Background

[0002] As the level of informatization continues to rise, people's demand for data is no longer limited to simple data acquisition and retrieval. More attention is paid to using the collected information to discover and solve hidden problems. For example, in the field of public security technology, decision-makers are increasingly interested in generating hot words from the collected information through technical means such as text analysis and text mining, and then solving business problems with the statistical analysis results of those hot words.

[0003] Hot words are popular words. Simply put, when the word frequency of a word reaches a certain threshold, it is called a hot word. As a lexical phenomenon, the popularity of hot words reflects the issues and thing...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/27, G06F17/30
Inventor: 陈春东, 杜渂, 刘亮亮, 雷霆, 索涛, 王聚全, 喻小林, 汪朝辉, 戴贞清, 陈同增, 童金陵, 张嘉成
Owner: 迪爱斯信息技术股份有限公司