Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

81 results about "Item set mining" patented technology

Method of item-all-weighted positive or negative association model mining between text terms and mining system applied to method

The invention discloses a method of item-all-weighted positive or negative association model mining between text terms and a mining system applied to the method. The method comprises the following steps of preprocessing by using a Chinese text preprocessing module to establish a text database and a feature word item library; mining item-all-weighted feature word candidate item sets from the text database by utilizing a feature word frequent item set and negative item set mining implementation module, calculating a weight dimension ratio, and cutting out uninteresting item sets by adopting a multi-interestingness threshold value pruning strategy to obtain an interesting item-all-weighted feature work frequent item set and negative item set model; mining an effective item-all-weighted positive or negative association rule model from frequent item sets and negative item sets by utilizing an item-all-weighted positive or negative association rule mining implementation module between terms, and outputting the mined positive or negative association rule model to a user by utilizing an item-all-weighted association model result display module between terms. By applying the method and the system, unnecessary frequent item sets, negative item sets and association rule models can be greatly reduced, Chinese feature word association rule mining efficiency is improved and a high-quality association model between Chinese terms is obtained.
Owner:GUANGXI UNIVERSITY OF FINANCE AND ECONOMICS

Abnormal group identification method and device

The embodiment of the invention provides an abnormal group identification method and device. The method comprises the steps of obtaining a feature value of each user to be analyzed in a plurality of users to be analyzed; Determining a high-frequency characteristic value and a low-frequency characteristic value in the characteristic values of the users to be analyzed; Mining a maximum frequent itemset according to the high-frequency characteristic value of each to-be-analyzed user and a preset frequent item set mining strategy, and obtaining a low-frequency maximum frequent characteristic value in the maximum frequent item set; Constructing a target bipartite graph according to the low-frequency maximum frequent characteristic value and the low-frequency characteristic value in the characteristic values of the users to be analyzed, and defining the weight of an edge in the target bipartite graph; And determining an abnormal group in the to-be-analyzed user according to the weight of the edge in the target bipartite graph and a clustering result of a plurality of to-be-analyzed users obtained by performing graph clustering on the target bipartite graph. According to the embodiment of the invention, the accuracy of abnormal group recognition is improved, and the steps are simple and easy to execute.
Owner:ADVANCED NEW TECH CO LTD

Multidimensional analysis visual representation method of centralized monitoring mass data of transformer substation

The invention discloses a multidimensional analysis visual representation method of the centralized monitoring mass data of a transformer substation. The multidimensional analysis visual representation method comprises the following steps: S1) obtaining the mass warning data of the transformation substation, carrying out each-dimensionality analysis statistical processing to the mass warning data, carrying out frequent item set mining to an analysis statistical result to obtain a frequent item set, i.e. equipment closely related to a frequency reporting signal; and S2) paying attention to the equipment which satisfies frequent item set conditions, and carrying out visual representation on a signal distribution diagram and a theme window. The invention can help a dispatcher to macroscopically master the general operation situation of a whole power grid, discloses possibly potential problems and weak links, and points out the potential risks of the power grid from multiple angles, such as single types, frequent warning, equipment types, power plant and power station gathering so as to alleviate monitoring pressure and improve the control ability of the dispatcher to the whole power grid.
Owner:STATE GRID CORP OF CHINA +3

On-line classroom discussion short text real-time grouping method and system based on text clustering

The invention discloses an on-line classroom discussion short text real-time grouping method and system based on text clustering. The method comprises the steps of conducting word-splitting preprocessing and stop-word preprocessing on text data; obtaining all text item keywords, counting all the text item keywords and storing the text item keywords into a keyword table keyTable; conducting frequent item set mining on a preprocessed text set, filtering all sub-item quasi-frequent item sets and conducting coarse cluster classification in combination with a keyword table definition quasi-frequentitem set similarity calculation rule; mapping points, the closest to the cluster center, of all clusters to the text set, calculating TF-IDF values of text word sets in all the clusters and iteratingthe center of mass to be optimal according to the distance; pushing the obtained K clusters in real time in group. Through the combination of the keyword table definition quasi-frequent item set similarity calculation rule, the clustering accuracy of an on-line discussion short text is effectively improved; through a quasi-frequent item set filtering strategy, the clustering efficiency is effectively improved, and a clustering method is accelerated; the text information content discussed on an on-line classroom is automatically classified into multiple themes, and the text content is groupedaccording to the themes.
Owner:SOUTH CHINA UNIV OF TECH

Data processing method based on frequent item set mining

The present invention provides a data processing method based on frequent item set mining, which comprises the following steps of: acquiring a plurality of items of historical data tables and extracting data tables with value fields; acquiring time sequence data tables and non-time sequence data tables from the data tables with the value fields; carrying out segmentation on the time sequence data tables and carrying out cleaning on the non-time sequence data tables to obtain initial shopping basket data; merging a plurality of data tables in the initial shopping basket data to obtain merged shopping basket data; and respectively carrying out frequent item set mining on the initial shopping basket data and the merged shopping basket data to obtain a frequent item result with a designated support degree. According to the data processing method based on frequent item set mining, historical data is subjected to frequent item set mining to obtain the frequent item set support degree of each historical data table, frequent data in a random dimension can be inquired and the data processing method is convenient for an analyst to acquire the data; and meanwhile, the time sequence data is segmented, which is convenient for the analyst to inquire related data according to a time tag.
Owner:ANHUI XINHUABO INFORMATION TECH

Power distribution network line fault rule mining method, system and medium of frequent item set

PendingCN110244184AReduce the scope of patrol operation and maintenanceImprove operational reliabilityFault locationInformation repositoryMultiple attribute
The embodiment of the invention discloses a power distribution network line fault rule mining method, system and medium of a frequent item set. The method comprises the following steps: collecting a power distribution network line fault tripping record, and generating a mark event under a power distribution line fault and reason scene; generating a power distribution network line information base containing multiple attribute information according to the mark event; based on an apriori algorithm, performing fault frequent item set mining under different fault reason scenes, and constructing the frequent item set covering different frequent attribute numbers; and performing power distribution line fault tripping strong-related rule screening based on the power distribution line fault information base according to the frequent candidate set. Through the technical scheme disclosed by the embodiment of the invention, the existing power distribution fault sample data is integrated with the frequent item set data mining method, thereby mining a fault association rule, reducing the power distribution network patrolling operation and maintenance range, and solving the problem that the patrolling and overhauling charge of the whole region line of the existing power distribution network and the equipment is huge.
Owner:JIANGSU ELECTRIC POWER CO +3

Method and device for automatically extracting relation between knowledge graph events in economic field

The invention provides a method and device for automatically extracting relation beteween knowledge graph events in an economic field , a storage medium and a processor. The method comprises the steps: obtaining original data, wherein the original data is data in the field of economy and finance; according to the event knowledge graph infrastructure and the original data, constructing an event knowledge graph, wherein the event knowledge graph comprises theme events; sorting the theme events according to time; grouping the sorted theme events into a plurality of transactions according to a preset time window, wherein at least one transaction comprises the theme events; based on the affairs, mining association rules by adopting a frequent item set mining algorithm; and determining an association relation of the mined association rules by adopting a machine learning algorithm, wherein the association relation is an event relation. According to the method, the association rules are minedthrough a frequent item set mining algorithm, then the association relation of the mined association rules is determined through a machine learning algorithm, and compared with a judgment method in the prior art, the method can achieve rapid judgment of the event relation.
Owner:智慧神州(北京)科技有限公司

Recent data stream frequent item set mining method based on CPU+MIC (Central Processing Unit+ Many Integrated Core) cooperative computing

The invention discloses a recent data stream frequent item set mining method based on CPU+MIC (Central Processing Unit+ Many Integrated Core) cooperative computing. The method is implemented through a CPU and an MIC coprocessor. The method is based on the CPU+MIC cooperative computing. The method comprises the following steps: setting an initial population, wherein individuals in the population are a series of frequent item sets to be selected; implementing a searching process based on crossover, variation and selection operations of a genetic algorithm; and traversing multiple generations of processing to obtain final frequent item sets. Compared with the prior art, the recent data stream frequent item set mining method based on the CPU+MIC cooperative computing has the advantages of reasonable design, easiness and convenience in operation, high mining efficiency and the like. According to the method, data streams in a nested window are processed through the genetic algorithm based on the CPU+MIC cooperative computing, so that a frequent mode in the data streams in the nested window can be rapidly and accurately mined. The method can be applied to decision making in industries of water conservation runoff analysis, power load, financial information and the like.
Owner:LANGCHAO ELECTRONIC INFORMATION IND CO LTD

Fine-granularity method for generating API substitution rules based on frequent item set mining

ActiveCN104820587AReduce inaccuracyReduce false positive (False Positive) problemSpecific program execution arrangementsGranularityRule of replacement
The present invention discloses a fine-granularity method for generating API substitution rules based on frequent item set mining. The method is characterized by deducing the substitution rules of old and new versions of API in application by using a frequent item set mining algorithm according to the change of the dispatching of the earlier and latest versions of APIs in a class library. When services are extracted from each matched method pair, original codes of the matched method pair are compared by an LCS algorithm, a plurality of matched code segment pairs are generated by taking a point in which the number of same code lines is greater than the Range of the specified threshold as a division point, then the services are generated by change of the dispatching relation of each pair of code segments. Compared against the solution of generating the change of the dispatching relation by taking the method as a unit, the context information of the dispatching method is retained to a certain extent; the accuracy of the services is improved; according to the method, frequency item sets are generated by applying the frequent item set mining algorithm to the generated service sets, then association rules are generated, such that more various API substitution rules can be generated.
Owner:NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products