Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

245results about How to "Improve clustering accuracy" patented technology

Intrusion detecting method based on semi-supervised neural network

The invention discloses an intrusion detecting method based on a semi-supervised neural network, belonging to the field of network information security. The intrusion detecting method comprises the following steps of: 1) using a training set A to initialize an Oth layer of neurons of a GHSOM (Growing Hierarchical Self-Organizing Map) neural network, and calculating a QE0; 2) expanding an SOM (Self Organized Mapping) from the Oth layer of the neurons, and setting a layer identification Layer of the SOM as 1; 3) initializing each SOM expanded in a Layerth layer and training each SOM by the following steps of: adjusting weights of a winning neuron and other neurons in adjacent domains, updating a winning vector set and calculating a main label, a main label rate and an information entropy etyi of the winning neuron; and 4) calculating a qei of each neuron in the SOM and a sub network MQE (Message Queue Element), if MQE is more than QEf*mu1, inserting one row or column of the neurons in the SOM, and if QEi is more than QE0*mu2 or etyi is more than etyf*mu3, generating a layer of a new sub network on the neuron, and adding the new sub network into a sub network array of a (Layer+1)th layer. The detection accuracy of a GHSOM algorithm is improved by using the method.
Owner:PEKING UNIV

Load prediction method based on dynamic time warping and long-short time memory

The invention discloses a load prediction method based on dynamic time warping and long-short time memory, and the method comprises the following steps: S1, obtaining the basic data required for short-term load prediction of a user from a power system; S2, carrying out the clustering of users with similar power utilization behaviors through employing a dynamic time warping method according to thehistorical load data of the user; S3, performing pooling processing on the user data of the same category; S4, selecting training data, preprocessing the training data and using the preprocessed training data as input; and S5, constructing a short-term load prediction method based on the deep long-term and short-term memory recurrent neural network, and verifying the effectiveness. According to the method, the users with similar electricity consumption behaviors are clustered according to the characteristic of large cardinal number of the to-be-predicted users, so that the prediction efficiency is improved. Meanwhile, through pooling processing on the data in the same category, the diversity of the training data is increased, the short-term load prediction precision is improved, and certain engineering application significance is achieved.
Owner:NANJING UNIV OF POSTS & TELECOMM

Web image clustering method based on image and text relevant mining

The invention discloses a web image clustering method based on image and text relevant mining, which comprises the following steps of: (1) extracting images and associated texts thereof in Google image searching results according to the query; (2) extracting nouns in the associated texts to form a vocabulary list; (3) calculating the visibility of words in the vocabulary list; the visibility and a TF-IDF method are integrated for calculating the relative association between the words and the images; (4) calculating the theme degree of association between any two words in the vocabulary list; (5) a complex map is used for modeling the relative association; (6) a complex map clustering arithmetic is applied for clustering the images. The method combines the visibility of the words and the TF-IDF method to define the relative association between the words and the images and breakthroughs the restriction that the TF-IDF as a text processing text can not directly measure the relation between the words and the images; by modeling the relative association between the words and the images and between the words by the complex map, a web image clustering frame is provided so that the image searching results are classified according to the theme, thus be convenient for searching by users.
Owner:ZHEJIANG UNIV

Clustering method for network behavior habits based on K-means and LDA (Latent Dirichlet Allocation) two-way authentication

The invention discloses a clustering method for network behavior habits based on K-means and LDA (Latent Dirichlet Allocation) two-way authentication. According to the clustering method, webpage properties, keywords and frequency in internet browsing records of persons are utilized to combine with a K-means algorithm, an LDA document topic extracting model and an annealing algorithm. The clustering method comprises the following steps: firstly, performing K-means algorithm clustering and LDA document topic extracting model generation on a staff-label-frequency set and a person browsing record-person-keyword set; secondly, storing and calculating an intermediate result, and then performing K-means and LDA two-way authentication by using the annealing algorithm; calculating a global best topic-classification label sequence, and optimizing a network behavior habit clustering result by taking the global best topic-classification label sequence as a reference. By means of the K-means and LDA two-way authentication, the sensitivity to person-classification labels is improved; by using the annealing algorithm, the optimizing efficiency of the clustering result can be improved, and further the clustering accuracy is improved.
Owner:HUAIYIN INSTITUTE OF TECHNOLOGY

System and method for clustering gene expression data based on manifold learning

InactiveCN102184349AAccurately discover co-regulatory relationshipsDiscovery of co-regulatory relationshipsSpecial data processing applicationsVisual spaceCluster algorithm
The invention discloses a method for clustering gene expression data based on manifold learning, and the method provided by the invention comprises the following steps: acquiring a gene expression data matrix A through an acquisition system, and preprocessing the gene expression data matrix A by using a local linear smoothing algorithm; introducing the preprocessed data matrix A, and constructing a weighted neighborhood figure G in a three-dimensional space; taking the shortest path between two points as the approximate geodesic distance between two points; calculating a two-dimensional embedded coordinate by using an MDS (minimum discernible signal), and mapping the three-dimensional data matrix A to a two-dimensional visual space; and carrying out clustering on the two-dimensional visual space subjected to mapping by using a k-mean clustering algorithm so as to obtain the clustering result. The clustering method has the characteristics of low calculating cost, capability of eliminating high-order redundancies, suitability for pattern classification tasks, and the like; and by using the method disclosed by the invention, the current states of cells, the effectiveness of medicaments to malignant cells, and the like can be discriminated effectively according to the clustering result. The invention also provides a system for clustering gene expression data based on manifold learning.
Owner:HOHAI UNIV

Hybrid filling method for incomplete data

The invention discloses a hybrid filling method for incomplete data. The hybrid filling method comprises the following steps: (1) performing special value filling pre-processing on a missing data value in a data set; (2) extracting data attribute significant characteristics by utilizing a stack type automatic coding machine; (3) performing incremental clustering on the filled data set based on the extracted characteristics; (4) performing attribute value weighted filling on a data missing object by utilizing attribute values, corresponding to front k% objects which are most similar with the data missing object, in the obtained each clustering result; and judging difference between all missing data filling values of this time and a last filling value, and iteratively updating (2) to (4) until filling value convergence conditions are met. According to the embodiment of the invention, local similarity characteristics of data in the data set, the data clustering precision, in-class data filling accuracy and algorithm practical application non-supervision and timeliness are considered to construct an algorithm of firstly clustering the incomplete data and then filling the incomplete data, and the filling result precision and the filling algorithm speed are ensured through ideas of utilizing special value filling, adopting the stack type automatic coding machine, performing incremental clustering, performing weighted filing on in-class front k% complete data objects, and the like.
Owner:DALIAN UNIV OF TECH

Selection method of K-means initial clustering centers for taxi trajectory data

The invention relates to a selection method of K-means initial clustering centers for taxi trajectory data. The method comprises a step of extracting the road network of city traffic from an electronic map, a step of carrying out preprocessing on collected taxi trajectory data and obtaining sample data suitable for analysis by screening, a step of matching the taxi trajectory data and the road network to obtain a distribution map with taxi data points in a preset analysis range, a step of using the spot detection method in the image recognition technology to identify the main intensive region of taxi trajectory data points as the initial clustering centers of K-means, and a step of outputting the initial clustering centers of K-means. According to the method, through using the spot detection method to determine the position and number of the initial clustering centers of K-means, the defects of fuzziness, subjectivity and initial center random selection of selecting a K value in a traditional K-means method are overcome, for mass car networking data, the clustering speed of the K-Means method is speeded up, the clustering of the taxi trajectory data is realized, and the method has a certain reference value and an actual economic benefit.
Owner:FUZHOU UNIVERSITY

Professional field-oriented on-line theme detection method

ActiveCN107066555ASolve the difficulty of satisfying the user's need for more professional informationSolve needsCharacter and pattern recognitionSpecial data processing applicationsState of artAlgorithm
The invention discloses a professional field-oriented on-line theme detection method. The method comprises the following steps: obtaining a text vector matrix of a preprocessed text set, and extracting a dictionary from the text set; modeling the text vector matrix; calculating a mixed weight p (thetak|d) from a text d to a theme thetak and a frequency p (w|thetak) that a feature word appears in each theme thetak; obtaining the similarity between two texts di and dj, defining a theme model-based theme distance between the texts into a relative entropy distance of a text vector, and calculating a similarity matrix; compressing the text set, thus obtaining a new text sample sect; calculating a similarity matrix of the new text sample set, and selecting a deviation parameter p according to the similarity matrix; combining clustering results, thus generating a new clustering result; calculating distances between all texts in the original text set and compressed classified texts, and performing classification; outputting a text set theme and a final clustering result. Compared with the prior art, the professional field-oriented on-line theme detection method disclosed by the invention has the advantage that by the adoption of an optimal clustering algorithm, the accuracy and the efficiency of the clustering effect are improved.
Owner:TIANJIN UNIV

Overall reconstruction design method of plane line position of existing railway

The invention discloses an overall reconstruction design method of a plane line position of an existing railway. The method comprises the steps that the line element types of testing points are identified based on the tangent azimuth change rate of each testing point, and initial clustering of the testing points is conducted; based on the number of the testing points in each line element point group, the initial clustered line element point groups are adjusted; based on the crossing point position of straight lines at two ends of each circular curved segment, the line element point groups arefurther adjusted; iterative computation is conducted, easement curve line element testing points in a linear element point group and a circular curve line element point group are gradually identifiedand adjusted, so that the number of the testing points in the line element point groups stable, final clustering of three kinds of line element testing points is achieved, and fitting of local line positions is conducted; all the local line positions are connected and form an initial overall fitting line position, the fitting line position is optimized, and the final plane reconstruction scheme ofthe existing railway is obtained. The overall reconstruction design method of the plane line position of the existing railway can precisely identify different types of line element testing points, and can optimize the fitting line position from an overall prospective and achieve rapid overall reconstruction of the plane line position of the existing railway.
Owner:CENT SOUTH UNIV

Local similarity preserving-based hyperspectral image extreme learning machine clustering method

The invention discloses a local similarity preserving-based hyperspectral image extreme learning machine clustering method. The method comprises the steps of organizing a hyperspectral pixel matrix; calculating a linear random response of a hidden layer neuron; calculating a nonlinear activation value of the hidden layer neuron; performing three-dimensional reconstruction of hidden layer feature data; performing spatial guided filtering; performing two-dimensional reconstruction of the filtered hidden layer feature data; building local similarity preserving regular terms and an optimization model; and calculating local similarity preserving projection features, and performing K-means clustering to obtain a final clustering tag. Based on a conventional extreme learning machine, hyperspectral image spatial information of local neighborhoods is integrated through the guided filtering, and spectral local similarity of a hyperspectrum is fully utilized; projection with local preservabilityis calculated through model optimization; spatial spectral joint information is extracted; the clustering precision is improved; the calculation complexity is lowered; and the method can be widely applied to the hyperspectral unsupervised classification in the fields of territorial resources, mineral survey and precision agriculture.
Owner:NANJING UNIV OF SCI & TECH

Self-coding neural network processing method and device, computer equipment and storage medium

The invention discloses a self-encoding neural network processing method and device, computer equipment and a storage medium, and the method comprises the steps: converting a text sample into a sampleword vector, inputting the sample word vector into a convolutional neural network model, and carrying out the preliminary feature extraction of the sample word vector, and obtaining a preliminary hidden feature of the sample; inputting the preliminary implicit features of the sample into a plurality of self-coding neural networks, training the self-coding neural networks to obtain a plurality ofself-coding neural network models, and inputting the preliminary implicit features of the sample into the self-coding neural network models for feature extraction to obtain sample implicit features output by the self-coding neural network models; clustering the extracted feature samples with the hidden features of the samples to obtain a clustering result; determining whether to reconstruct the self-coding neural network according to the clustering result; and if determining that the self-coding neural network needs to be reconstructed, constructing a target self-coding neural networ accordingto the contour coefficient, and acquiring the self-coding neural network with the high clustering accuracy.
Owner:PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products