Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

153results about How to "Improve clustering efficiency" patented technology

Clustering method based on mobile object spatiotemporal information trajectory subsections

The invention discloses a clustering method based on mobile object spatiotemporal information trajectory subsections. The clustering method based on mobile object spatiotemporal information trajectory subsections comprises the steps that the three attributes of time, speed and direction are introduced, and a similarity calculation formula of the time, speed and direction is provided for analyzing an internal structure and an external structure of a mobile object trajectory; firstly, according to the space density of the trajectory, the trajectory is divided into a plurality of trajectory subsections, then the similarities of the trajectory subsections are judged by calculating differences of the trajectory subjections on the space, time, speed and direction, finally, trajectory subsections in a non-significant cluster are deleted or are merged into adjacent significant clusters on the basis of a first cluster result, and therefore an overall moving rule is displayed on the clustering spatial form. According to the clustering method based on the mobile object spatiotemporal information trajectory subsections, the clustering result is improved, higher application value is provided, a space quadtree is adopted to conduct indexing on the trajectory subsections, clustering efficiency is greatly improved under the environment of a large-scale trajectory number set, and trajectories can be effectively clustered.
Owner:胡宝清

Power system load data identification and recovery method

The invention discloses a power system load data identification and recovery method. Firstly, according to user historical load data, the number of clusters and initial cluster centers of sample data are determined on the basis of the hill climbing method; secondly, the final cluster center and the characteristic curve of the historical load data are obtained on the basis of the fuzzy C-means clustering algorithm; thirdly, each kind of load characteristic curve is processed, and the feasible region interval where normal data of the load curve is located is obtained; fourthly, according to correlation coefficients with the load characteristic curves, the category to which a to-be-tested load curve belongs is determined; finally, on the basis of the feasible region interval and the to-be-tested load curve whose category is judged, bad data of to-be-tested load data is identified and corrected. According to the method, the fuzzy C-means algorithm serves as the basis, the hill climbing function method is used, the number of clusters and the initial cluster centers are determined at the same time to improve clustering efficiency, and the initial cluster center determination problem and identification effect judgment randomness problem of bad data are solved.
Owner:TIANJIN UNIV

Modeling method for parallel smart case recommendation model

The invention relates to a modeling method for a parallel smart case recommendation model. The method comprises the following steps of obtaining existing patient cases from an electronic case database, carrying out denoising, clustering and word segmentation on the patient cases, and establishing a patient case corpus database; defining that TFIDFi, j shows the importance degree of a word or an expression in a case of the patient case corpus database, establishing an LSI vector space model according to the TFIDFi, j, and moreover, establishing a BOW word bag model according to all words and expressions in the patient case corpus database; calculating history case vectors and to-be-processed case vectors in the patient case corpus database through utilization of the LSI vector space model and the BOW word bag model; calculating cosine similarity among the history patient cases and storing the cosine similarity; and calculating the cosine similarity between the to-be-processed case vectors and the history patient case vectors, and searching similar cases of to-be-processed cases according to the cosine similarity. The model established through adoption of the method provided by the invention is high in accuracy and low in error. A recommendation result is high in quality.
Owner:QINGDAO ACADEMY OF INTELLIGENT IND

Cluster partitioning processing method and cluster partitioning processing device for virus files

The invention discloses a cluster partitioning processing method and a cluster partitioning processing device for virus files. The method comprises the following steps of: (A) statically analyzing binary data of virus files to be partitioned, and analyzing portable executable (PE) structure data of the virus files from the binary data; and (B) comparing the PE structure data of the virus files to be partitioned, and partitioning the virus files with the PE structure data according with appointed similarity into the same category. The device comprises a first data analyzing module and a first cluster partitioning module, wherein the first data analyzing module is used for statically analyzing the binary data of the virus files to be partitioned and extracting the PE structure data of the virus files from the binary data; and the first cluster partitioning module is used for comparing the PE structure data of the virus files to be partitioned and partitioning the virus files with the PE structure data according with the appointed similarity into the same category. By the cluster partitioning processing method and the cluster partitioning processing device for the virus files, the cluster partitioning efficiency of the virus files of a computer can be improved, resource consumption is reduced, and the virus catching risk caused by the virus files which run dynamically is eliminated.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Time-sensitive self-adaptive on-line subtopic detecting method and system

The invention relates to a time-sensitive self-adaptive on-line subtopic detecting method and system. The method comprises the following steps: (1) vectorizing each document in a document flow; (2) carrying out incremental clustering on the documents and adjusting the central weight of subtopics according to the document weight decaying with time; (3) combining the subtopics or deleting the meaningless subtopics when the quantity of the subtopics generated by clustering or the weight ratio of a certain subtopic meets the threshold conditions or the subtopics meet the long tail detection conditions; and (4) generating a summary of the new subtopics, outputting and displaying according to the weight of each new subtopic and the internal document distribution thereof. The system comprises a document representing module, an incremental clustering module, a new subtopic discovering module and a summary generating module. According to the method and the system which are disclosed by the invention, the historical document weight decays with time and the dynamic update on the quantity and the content of the subtopics is carried out on the basis of the threshold judgment and the long tail detection, so that the subtopic detecting efficiency can be effectively increased.
Owner:INST OF INFORMATION ENG CAS +1

Farmland wireless self-organizing network topology density correlation path selecting and optimizing method

The invention relates to a farmland wireless self-organizing network topology density correlation path selecting and optimizing method. The method includes the steps that firstly, node energy is calculated, voting is carried out, and cluster heads are determined; secondly, nodes within a preset threshold value are received by the cluster heads and added, and clustering operation is carried out; thirdly, nodes corresponding to which clustering operation is not completed become intra-cluster sub-nodes according to a pre-established method; fourthly, energy advantage nodes are selected from the intra-cluster sub-nodes to serve as fake cluster heads; fifthly, reverse scattering and networking are carried out according to the cluster heads and the fake cluster heads; sixthly, environment information collecting and reporting are carried out on all nodes through a networking route; seventhly, whether a cluster head arrives at an energy approximating target or not is judged; eighthly, if a cluster head arrives at the energy approximating target, the next round is marked to carry out cluster head election, collection of the current round ends, and otherwise collection of the round ends. By means of the method, clustering efficiency is improved, cluster head selection frequency is lowered, and overall network energy efficiency is improved.
Owner:北京市农林科学院信息技术研究中心

Grid-based spatial multi-scale fast clustering method

The invention discloses a grid-based spatial multi-scale fast clustering method. The method includes the following steps that: S1, a data scale is selected, the size of grids is determined, gridding is performed on sample data, and the density values of the grids is put into statistics; S2, an initial density threshold is specified, all grids satisfying the threshold condition are reserved, and apreliminary density matrix is obtained; S3, a filter template is specified according to an observation scale, and convolution operation is performed on a global grid space; S4, a connected region is generated through neighborhood search so as to be adopted as a preliminary clustering result, integration operation is performed on the grids, the grid space is mapped onto an original point set, and an original point set clustering result is obtained; S5, the observation scale is adjusted, a transformed new filter is adopted to perform operation in the S3 and S4 on a result matrix again, a clustering result of the next observation scale is obtained; and S6, the data scale is changed, the S1 to S5 are repeated, clustering results under different data scales are obtained. The algorithm of the invention has the advantages of low complexity, high clustering efficiency and high precision, and can meet the requirements of real-time multi-scale clustering and visual analysis of massive point sets.
Owner:WUHAN UNIV

Professional field-oriented on-line theme detection method

ActiveCN107066555ASolve the difficulty of satisfying the user's need for more professional informationSolve needsCharacter and pattern recognitionSpecial data processing applicationsState of artAlgorithm
The invention discloses a professional field-oriented on-line theme detection method. The method comprises the following steps: obtaining a text vector matrix of a preprocessed text set, and extracting a dictionary from the text set; modeling the text vector matrix; calculating a mixed weight p (thetak|d) from a text d to a theme thetak and a frequency p (w|thetak) that a feature word appears in each theme thetak; obtaining the similarity between two texts di and dj, defining a theme model-based theme distance between the texts into a relative entropy distance of a text vector, and calculating a similarity matrix; compressing the text set, thus obtaining a new text sample sect; calculating a similarity matrix of the new text sample set, and selecting a deviation parameter p according to the similarity matrix; combining clustering results, thus generating a new clustering result; calculating distances between all texts in the original text set and compressed classified texts, and performing classification; outputting a text set theme and a final clustering result. Compared with the prior art, the professional field-oriented on-line theme detection method disclosed by the invention has the advantage that by the adoption of an optimal clustering algorithm, the accuracy and the efficiency of the clustering effect are improved.
Owner:TIANJIN UNIV

Stochastic gradient Bayesian SAR image segmentation method based on sketch structure

The invention discloses a stochastic gradient Bayesian SAR image segmentation method based on a sketch structure, mainly used for solving the problem that SAR image segmentation in the prior art is inaccurate. The stochastic gradient Bayesian SAR image segmentation method comprises the following implementation steps of: (1), sketching an SAR image to obtain a sketch image of the SAR image; (2), according to an area chart of the SAR image, and dividing a pixel subspace of the SAR image; (3), performing hybrid aggregation structured surface feature pixel subspace segmentation through a method based on a stochastic gradient variational Bayesian network model; (4), performing independent target segmentation based on the sketch line aggregation feature; (5), performing line target segmentation based on a visual semantic rule; (6), performing segmentation of a pixel subspace in a homogeneous area by adopting a polynomial-based logistic regression prior model; and (7), combining segmentation results to obtain a segmentation result of the SAR image. By means of the stochastic gradient Bayesian SAR image segmentation method based on the sketch structure disclosed by the invention, the good segmentation effect of the SAR image is obtained; and the stochastic gradient Bayesian SAR image segmentation method can be used for semantic segmentation of the SAR image.
Owner:XIDIAN UNIV

Travel trajectory clustering method, apparatus and device

Embodiments of the invention disclose a travel trajectory clustering method, apparatus and device. The calculation amount of travel trajectory clustering is reduced and the travel trajectory clustering efficiency is improved. The method comprises the steps of obtaining multiple travel trajectories of a user, wherein each travel trajectory comprises a starting point, an ending point and a middle point located between the starting point and the ending point; by utilizing the starting points and/or the ending points of the travel trajectories, clustering the travel trajectories to obtain a first travel trajectory set, wherein the first travel trajectory set comprises the travel trajectories with the matched starting points and/or ending points, and the number of the travel trajectories in the first travel trajectory set is greater than or equal to a first threshold; and by utilizing the middle points in the travel trajectories, clustering the travel trajectories in the first travel trajectory set to obtain a second travel trajectory set, wherein the second travel trajectory set comprises the travel trajectories with the matched starting points and middle points, and/or, the travel trajectories with the matched ending points and middle points.
Owner:NEUSOFT CORP

Air conditioner reliability influence factor-based regional clustering method

The invention discloses an air conditioner reliability influence factor-based regional clustering method. The method includes the following steps that: a system analyzes the regional differences of working environment factors and user use habit factors that influence the reliability of air conditioners and extracts working environment reliability key influence factors and user use habit reliability key influence factors; an air conditioner reliability influence factor-based regional clustering analysis comprehensive evaluation model is constructed; judgment criterion of air conditioner startup refrigeration and heating are formulated, the average consumption tendency indexes of the air conditioners are accurately quantified; and a weighted Ward clustering algorithm is adopted to carry out clustering analysis in the aspects of the working environment influence factors and the user use habit factor influence factors, so that an working environment influence factor clustering analysis result and a user use habit factor influence factor clustering analysis result are obtained, a secondary clustering method is adopted to integrate the working environment influence factor clustering analysis result and the user use habit factor influence factor clustering analysis result, so that final regional distribution can be obtained. With the air conditioner reliability influence factor-based regional clustering method of the invention adopted, the use reliability difference of air conditioners distributed in different areas can be minimum, and more scientific and more precise regional classification results can be obtained.
Owner:NANCHANG HANGKONG UNIVERSITY

Hybrid clustering algorithm based on adaptive cellular inheritance and optimal fuzzy C-means

The invention discloses a hybrid clustering algorithm based on adaptive cellular inheritance and optimal fuzzy C-means. The algorithm comprises the following steps that: utilizing Arnold Cat mapping to generate an initial population, and constructing a fitness function on the basis of the clustering criteria of the C-means; decoding individuals in the population to obtain a corresponding clustering center, distributing a degree of membership, and calculating a fitness value and the entropy of the population; carrying out state evolution on each individual, carrying out selection, dynamic intersection and a combined variation operation based on the entropy; automatically determining the fusion opportunity of the fuzzy C-means, and utilizing an implementation criteria to carry out a fuzzy C-means iterative operation; and judging whether an end condition is achieved or not, and outputting a final clustering result if the end condition is met. By use of the algorithm, the characteristics of the high global search capability of an adaptive cellular genetic algorithm and the high local search capability of a fuzzy C-means algorithm are further utilized. Compared with the prior art, the algorithm is characterized in that higher clustering efficiency and accuracy can be obtained.
Owner:NANCHANG HANGKONG UNIVERSITY

Method for establishing electrical power system clustering load model

The invention discloses a method for establishing an electrical power system clustering load model. Timing sequence load curves are firstly sequenced and load duration time curves are obtained, then according to contribution degree of a load level to adequacy index, the load duration curves are divided into three subareas, namely, a high contribution degree, a medium contribution degree and a lower contribution degree, if the subarea is the high contribution degree area, a hierarchical clustering algorithm is adopted to select a clustering center initial value for the high contribution degree area, if the subarea is the medium contribution degree area, a mean value-standard deviation method is adopted to select a clustering center initial value for the medium contribution degree area, and if the subarea is the lower contribution degree area, a clustering center initial value is confirmed according to experience or is confirmed in a random mode for the lower contribution degree area. Improved efficiency index is defined, improved efficiency is regarded as a convergence condition, and then a clustering number in a K-mean clustering algorithm is confirmed. The clustering load model obtained through the above method has high computational accuracy and rapid convergence properties when used in power system adequacy evaluation.
Owner:HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products