Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

706 results about "Feature engineering" patented technology

Feature engineering is the process of using domain knowledge of the data to create features that make machine learning algorithms work. Feature engineering is fundamental to the application of machine learning, and is both difficult and expensive. The need for manual feature engineering can be obviated by automated feature learning.

High potential user buying intention prediction method based on big data user behavior analysis

The invention provides a high potential user buying intention prediction method based on big data user behavior analysis. The high potential user buying intention prediction method comprises the following steps: 101 data preprocessing: the historical behavior data set of the e-commerce user is preprocessed; 102 sample defining and marking: samples are constructed with the interacted user product pairs to act as the keywords according to the historical consumption behavior of the user; 103 division of a training set and a test set: the historical data are divided into the training set and the test set by using a time window division method; 104 feature construction: feature engineering construction of the historical behavior data of the user is performed; and 105 algorithm design and implementation: feature selection of the feature group and unbalanced data processing of the data set are performed and then the final result of two-layer model iterative learning algorithm prediction is put forward. The prediction model is established on the basis of the historical behavior data of the e-commerce user of the time span of 45 days so that whether the user places an order of the commodityin the candidate commodity set P in the following 5 days can be predicted.
Owner:上海普瑾特信息技术服务股份有限公司

Similarity analysis method and system for patients suffering from cardio-cerebral vascular diseases

The invention provides a similarity analysis method and system for patients suffering from cardio-cerebral vascular diseases. The method comprises the following steps of 1 problem definition, wherein problem definition for the patients suffering from the cardio-cerebral vascular diseases is conduced; 2 data collection, wherein health care data of the patients suffering from the cardio-cerebral vascular diseases is collected; 3 data preprocessing, wherein data integration, data cleaning, missing value processing, feature deleting and abnormal point removing are included; 4 feature engineering, wherein feature construction, feature selection and feature processing are included; 5 patient clustering modeling; 6 diagnosis and treatment scheme recommendation. Accordingly, an effective similarity analysis model for the patients suffering from the cardio-cerebral vascular diseases is built, a clinician can obtain the similar populations of a give patient through the patient features and then recommends a personalized treatment plan to achieve the purpose of accurate medical treatment, population grouping based on similarity analysis can be well conducted on the patients suffering from the cardio-cerebral vascular diseases in the Chinese population, and pointed personalized rehabilitation therapy is conducted on different risk populations as early as possible.
Owner:中电科数字科技(集团)有限公司

Text classification method and device, equipment and medium

The invention discloses a text classification method and device, equipment and a medium, and relates to the technical field of natural language processing. According to the specific implementation scheme, to-be-classified texts are obtained; the word sequence of the text to be classified is input into a word vector coding model to determine a word vector sequence of the word sequence; the entity sequence of the text to be classified is input into an entity vector model to determine an entity vector sequence corresponding to the entity sequence; wherein the entity vector model determines an entity vector based on an entity vector encoding model, and the entity vector encoding model is formed by text training based on an entity knowledge graph database; and classification identification is performed on the to-be-classified text according to the word vector sequence and the entity vector sequence. According to the embodiment of the invention, the construction of feature engineering and training samples is avoided, and the construction difficulty of a text classification model is reduced; text classification is comprehensively carried out through the word vector sequence and the entityvector sequence, the semantic sensitivity of the text classification model is improved, and then the accuracy of the classification result of the to-be-classified text is improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Global average pooling convolutional neural network-based Chinese emotion tendency classification method

ActiveCN108614875AWith automatic feature extractionEnhanced automatic feature extractionNeural architecturesSpecial data processing applicationsFeature extractionClassification methods
The invention provides a global average pooling convolutional neural network-based Chinese emotion tendency classification method, which is a technology for analyzing a Chinese text collected from a network by utilizing a computer. The method comprises the steps of building a global average pooling convolutional neural network-based Chinese emotion tendency classification model which extracts semantic emotion features by utilizing three channel transformation convolution layers; performing pooling calculation on the features extracted by the convolution layers by a global average pooling layerto obtain confidence values corresponding to output types; and outputting emotion classification tags by Softmax. According to the method, model parameters are set for performing multi-time training,and the model with the highest classification accuracy is selected for Chinese emotion tendency classification; and, complex feature engineering in conventional emotion analysis is avoided, the semantic emotion feature extraction capability of the model is enhanced, the model over-fitting is effectively avoided, and the emotion tendency classification performance of the model is improved.
Owner:BEIJING UNIV OF POSTS & TELECOMM

Bearing fault prediction method and device based on equal division

The invention discloses a bearing fault prediction method and device based on equal division. The bearing fault prediction method comprises the following steps that one-dimensional or multi-dimensional vibration signals of a bearing are detected, and accordingly, sample signals are obtained according to the one-dimensional or multi-dimensional vibration signals; the sample signals are equally divided so as to obtain equally-divided time sequence segments; and the equally-divided time sequence segments are input into a fault prediction model according to the collecting time, and the predictionresult of each time sequence segment is obtained; and according to an attention mechanism, the weight is distributed to the finally-output contribution sizes for the hidden state of the model at eachtime, so that the fault prediction result of the bearing is generated after the contribution sizes are subjected to weighted summation. According to the bearing fault prediction method based on equaldivision, complicated feature engineering is omitted, an end-to-end fault diagnosis system is achieved, the bearing fault prediction method is further suitable for multi-channel sensing scenarios, theprediction accuracy and time efficiency of the prediction model are effectively improved, applicability is high, and the bearing fault prediction method is simple and easy to implement.
Owner:BEIJING JIAOTONG UNIV +1

Online public opinion text information sentiment polarity classification processing system and method

The invention belongs to the technical field of computer science, and discloses an online public opinion text information emotion polarity classification processing system and method, the online public opinion text emotion polarity is widely applied to a public opinion monitoring system, however, a feature engineering extraction module of a traditional machine learning method is large in text information loss, and the accuracy of a classification model is not high enough. The method comprises the steps of preprocessing data; the method comprises the following steps of: constructing a word vector in a way of pre-training a model fin-tuning through BERT; the BERT model calculates the correlation between the characters in the sentence and each of the other characters; the constructed word vector can better solve the problems of'one-word polysemy 'and'synonym' of Chinese; the loss of word vector representation is greatly reduced; in the classification model, firstly Bi-LSTM is used for effectively learning context information, then Attention is used for capturing main semantic information and effectively filtering valuable public opinion information, finally softmax classification is used, and the performance of an obtained public opinion text emotion polarity classification result is better than that of a current mainstream algorithm.
Owner:XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products