Hot word analysis and statistic system and method

A hot word analysis and statistics system and method, applied in the field of data processing. It addresses the lack of overall statistics on data resources and the poor accuracy of existing word frequency statistics, with the aim of avoiding statistical inaccuracy, ensuring accurate counts, and improving management and service capabilities.

Active Publication Date: 2015-12-30
迪爱斯信息技术股份有限公司
Cites: 5. Cited by: 8.

AI Technical Summary

Problems solved by technology

[0006] However, Lucene's word frequency statistics are used to calculate the scores of hit results, not as overall statistics of data resources. Solr's word frequency statistics are used to implement the auto-completion function. Although they do serve as overall statistics of data resources, a target word that occurs several times across multiple fields (i.e., "domains") of an information unit is counted only once for that unit. This is coarse-grained counting, and the resulting statistics are inaccurate.
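The difference between the coarse-grained counting described above and per-occurrence counting can be illustrated with a small sketch. It does not call Lucene or Solr; the sample records, the field names, and the functions `coarse_count` and `fine_count` are all hypothetical and exist only to show the two counting rules side by side.

```python
import re

# Hypothetical information units; the field names are illustrative only.
records = [
    {"title": "theft report",
     "body": "theft of a bicycle, second theft this week"},
    {"title": "traffic accident",
     "body": "minor accident, no theft involved"},
]


def tokens(text):
    """Split text into lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def coarse_count(records, word):
    """Count each information unit at most once, however often the word occurs."""
    return sum(
        1
        for record in records
        if any(word in tokens(value) for value in record.values())
    )


def fine_count(records, word):
    """Count every occurrence of the word in every field of every unit."""
    return sum(
        tokens(value).count(word)
        for record in records
        for value in record.values()
    )


print("coarse:", coarse_count(records, "theft"))
print("fine:", fine_count(records, "theft"))
```

With this sample data the coarse rule counts two information units, while the per-occurrence rule counts four occurrences of "theft".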

Examples

Embodiment Construction

[0055] To illustrate the embodiments of the present invention and the technical solutions of the prior art more clearly, specific implementations of the present invention are described below with reference to the accompanying drawings. The drawings described below show only some embodiments of the present invention; those skilled in the art can derive other drawings and other implementations from them.

[0056] Figure 1 shows the hot word analysis and statistics system provided by the present invention. As the figure shows, the system includes a subject analysis module 10, a focus vocabulary module 20, a word segmentation service module 30, an index service module 40, a lexical unit statistics module 50, and a hot word analysis module 60, wherein the word segmentation service module 30 is connected with the subject analysis module 10 and the focus ...

Abstract

The invention provides a hot word analysis and statistics system and method. The system comprises a subject analysis module, a focus vocabulary module, a word segmentation service module, an index service module, a lexical unit statistics module, and a hot word analysis module. The subject analysis module determines the data source to be analyzed, defines an analysis subject, and defines the data type of each field. The focus vocabulary module forms a focus vocabulary sequence. The word segmentation service module extracts data information from the corresponding fields and segments it according to the focus vocabulary sequence, generating a lexical unit sequence. The index service module records the index position of each lexical unit within its field, together with the data information corresponding to that lexical unit, generating a lexical unit index file. The lexical unit statistics module counts the occurrences of each lexical unit. The hot word analysis module returns the generated hot word information and the word frequency of each hot word to the user. The system and method achieve accurate hot word statistics and index storage of hot word information, meet the need to analyze hot words by type, and improve the accuracy of hot word statistics.
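The flow through the segmentation, index, and statistics modules can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the greedy longest-match scan, the in-memory index layout, the sample records, and every name (`segment`, `build_index`, `count_units`) are hypothetical choices made for the example.

```python
from collections import Counter, defaultdict


def segment(text, focus_vocabulary):
    """Greedy longest-match scan; yields (lexical_unit, start_offset) pairs."""
    terms = sorted(focus_vocabulary, key=len, reverse=True)
    i = 0
    while i < len(text):
        for term in terms:
            if text.startswith(term, i):
                yield term, i
                i += len(term)
                break
        else:
            # No focus term starts here; move on by one character.
            i += 1


def build_index(records, fields, focus_vocabulary):
    """Map each lexical unit to its postings: (record_id, field, offset)."""
    index = defaultdict(list)
    for record_id, record in records.items():
        for field in fields:
            text = record.get(field, "")
            for unit, offset in segment(text, focus_vocabulary):
                index[unit].append((record_id, field, offset))
    return index


def count_units(index):
    """Count every recorded occurrence of every lexical unit."""
    return Counter({unit: len(postings) for unit, postings in index.items()})


# Hypothetical data source with two text fields per record.
records = {
    1: {"title": "burglary on Main Street",
        "detail": "burglary reported; suspected fraud"},
    2: {"title": "phone fraud",
        "detail": "fraud call, then a second fraud call"},
}
focus_vocabulary = ["burglary", "fraud", "phone fraud"]

index = build_index(records, ["title", "detail"], focus_vocabulary)
counts = count_units(index)
print(counts.most_common())
```

Because each posting keeps the record, the field, and the offset, every occurrence is counted separately and can be traced back to its source text. Trying longer terms first means "phone fraud" is recorded as one unit, not as a bare "fraud".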

Description

Technical field

[0001] The present invention relates to the technical field of data processing, and in particular to a hot word analysis and statistics system and method.

Background

[0002] As the level of informatization continues to rise, people's demand for data is no longer limited to simple data acquisition and retrieval. More attention is now paid to using the collected information to discover and solve hidden problems. For example, in the field of public security technology, decision-makers increasingly rely on technical means such as text analysis and mining to generate hot words from the collected information, and then solve business problems using the statistical analysis of those hot words.

[0003] Hot words are popular words. Simply put, when the word frequency of a word reaches a certain threshold, it is called a hot word. As a lexical phenomenon, the popularity of hot words reflects the issues and thing...
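The threshold definition of a hot word can be written down in a few lines. The sketch below is illustrative only: the source does not state a threshold value, so the threshold of 5, the sample frequencies, and the function name `hot_words` are all assumptions.

```python
from collections import Counter


def hot_words(word_counts, threshold):
    """Return (word, frequency) pairs with frequency >= threshold, highest first."""
    ranked = Counter(word_counts).most_common()
    return [(word, freq) for word, freq in ranked if freq >= threshold]


# Hypothetical word frequencies and threshold.
word_counts = {"fraud": 12, "burglary": 7, "lost property": 2}
print(hot_words(word_counts, threshold=5))
```

Here "reaches" is read as "greater than or equal to", so a word whose frequency equals the threshold qualifies as a hot word.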

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30
Inventors: 陈春东, 杜渂, 刘亮亮, 雷霆, 索涛, 王聚全, 喻小林, 汪朝辉, 戴贞清, 陈同增, 童金陵, 张嘉成
Owner: 迪爱斯信息技术股份有限公司