Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

56results about How to "Solve data sparsity" patented technology

Chinese domain term recognition method based on mutual information and conditional random field model

The invention discloses a Chinese domain term recognition method based on mutual information and a conditional random field model. The Chinese domain term recognition method includes the following steps: (1) gathering domain text corpus and marking all the punctuations, spaces, numbers, ASSCII (American Standard Code for Information Interchange) characters and characters except Chinese characters in the corpus; (2) setting character strings and computing the mutual information values of the character strings, (3) computing the left comentropy and the right comentropy of every character string, (4) defining character string evaluation function, setting evaluation function threshold, computing the evaluation function values of every character string, determining that every character string is a word, comparing in sequence the evaluation function value of the former character with the evaluation function value of the latter character in the character string and segmenting character meaning character strings one by one, (5) utilizing conditional random fields to train a conditional random field model and recognizing domain terms with the conditional random field model. When the Chinese domain term recognition method is used to recognize terms, the data sparsity of legitimate terms is overcome, the amount of calculation of conditional random fields is reduced, and the accuracy of the Chinese domain term recognition is improved.
Owner:SHANGHAI UNIV

Personalized data searching method and device

The application relates to a personalized data searching method and device. The device comprises the following steps: searching a data object according to a query word in a search request of a current user; determining a first behavior characteristic of a historic user in the search access process utilizing the query word according to a historic behavior log, and generating an intention vector of the query word; counting a second behavior characteristic of each user group to the data object according to a user attribute, and generating a preference vector of the user group; computing the similarity of the intention vector of the query word and the preference vector of each user group; using the user group corresponding to the condition that the similarity is greater than a set threshold value as a reference group for determining the intention preference of the current user; adjusting the sorting of the data objects searched by the current user through the query word through adoption of the historic behavior characteristics of the reference group. Thus the performance of a search platform is improved and promoted, the accuracy of a search result output to the user is improved, and the result, which is the most reasonable and the best for the search intention, is output for the user.
Owner:ALIBABA GRP HLDG LTD

Cross-domain recommendation method based on multi-view knowledge representation

PendingCN112541132AImprove recommendation performanceSolve data sparsity and cold startDigital data information retrievalMachine learningData miningData science
The invention provides a cross-domain recommendation method based on multi-view knowledge representation. The method comprises the steps of integrating different projects in a heterogeneous graph formaccording to similar attributes of the projects in different fields to form a plurality of views, taking the views as inputs of a graph attention network respectively, and obtaining initial knowledgerepresentation of the projects under the views through the graph attention network; taking the initial knowledge representation of the project under each view as the input of a multi-head attention network, obtaining and integrating project representation vectors with user preferences under different views through the multi-head attention network, and obtaining the final representation of the project with the user preferences; and recommending a corresponding project in the target domain to the user according to the final representation of the project with the user preference and the information of the target domain. The multi-view multi-head attention network learning method is set among multiple fields, project knowledge representation is fully learned, cross-field recommendation is carried out, and therefore the recommendation effect of the target field is improved.
Owner:BEIJING JIAOTONG UNIV

Estimation method and system of vehicle traveling overhead

The invention discloses an estimation method of vehicle traveling overhead. The method comprises the steps that received traffic data and received map data are differentiated to corresponding space-time segmentation road segments respectively so that the space-time segmentation road segments with data and the space-time segmentation road segments without data can be formed, and characteristic values corresponding to the space-time segmentation road segments with data and the space-time segmentation road segments without data are extracted respectively; all the space-time segmentation road segments with data and all the space-time segmentation road segments without data are differentiated to different clusters on the basis of the characteristic values, so that all the space-time segmentation road segments in the same cluster have similar characteristics; the average value of vehicle traveling speeds of all the data in any space-time segmentation road segment with data is calculated to serve as a vehicle traveling overhead estimated value of the space-time segmentation road segment with data; the average value of vehicle traveling overhead estimated values of all the space-time segmentation road segments with data in the cluster where any space-time segmentation road segment without data is located is calculated to serve as a vehicle traveling overhead estimated value of the space-time segmentation road segment without data. The invention further discloses an estimation system of vehicle traveling overhead.
Owner:GUANGZHOU HKUST FOK YING TUNG RES INST

Day-ahead transaction strategy method and system based on market supply and demand and regional meteorological prediction

The invention provides a day-ahead transaction strategy method and system based on market supply and demand and regional meteorological prediction, and the method comprises the steps: obtaining regional prediction meteorological data and station prediction meteorological data, and obtaining market supply and demand prediction data and historical power transaction data; carrying out data preprocessing; modeling the preprocessed data by adopting a multi-task learning method in deep learning to obtain a prediction model for electric power transaction market pre-judgment; inputting regional prediction meteorological data, station prediction meteorological data and market supply and demand prediction data of the D day, and predicting market pre-judgment information of power transaction of the Dday through the prediction model; and substituting the historical medium and long term average price, the electric field installed capacity, the short term prediction of the station and the market pre-judgment information into the optimization model, and solving the day-ahead 96-point power declaration and the expected strategy income. The method and system can adapt to scenes of multiple tasks,strong correlation among the tasks can be fully utilized, the prediction accuracy is improved, and the generalization ability of the model is greatly enhanced.
Owner:国能日新科技股份有限公司

Method and device of identifying target terminal

The invention discloses a method of identifying a target terminal. The method includes the following steps: extracting data from a data source, pre-processing the data according to preset strategies to remove abnormal first data, and keeping normal second data used for obtaining an analysis data set; performing data verification and/or data conversion on the second data and then obtaining an analysis data set; obtaining the analysis data set, and extracting characteristic vectors of users from the analysis data set on the basis of communication characteristics of the users, wherein the characteristic vectors of the users is used for representing the communication characteristics of the users; dividing all the users into first users and second users on the basis of the characteristic vectors of the users, obtaining cluster results corresponding to all the first users on the basis of data of the first users, and taking the cluster results as first cluster results; performing clustering on the basis of the characteristic vectors of the second users and the first cluster results, obtaining cluster results corresponding to all the users, and taking the cluster results as second clusterresults; and identifying a target terminal on the basis of the second cluster results. The invention also discloses a device of identifying a target terminal.
Owner:CHINA MOBILE GRP HEILONGJIANG CO LTD +1

Text detection and correction method based on Pinyin similarity and language model

The invention discloses a text detection and correction method based on Pinyin similarity and a language model. The method comprises the steps: collecting a large number of correct instruction text statements to serve as training statements; selecting words of a professional field from the training statements, and constructing a custom dictionary; carrying out word segmentation on the training statements by utilizing a HanLP language processing toolkit and a custom dictionary; counting the occurrence frequency of each word and each word combination in the word segmentation result in all the training statements, and constructing a Bi-Gram language model; converting the to-be-corrected statement into corresponding to-be-corrected pinyin, and converting words of the custom dictionary into corresponding dictionary pinyin; and correcting the to-be-corrected statement according to the pinyin similarity between the to-be-corrected pinyin and the dictionary pinyin in combination with the sentence rationality of the to-be-corrected statement to obtain a corrected statement. Through word pinyin similarity calculation and sentence rationality analysis, semantic information and contexts of sentences are considered, wrong words in the sentences can be detected, and the correction accuracy is improved.
Owner:THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP

Chinese medical question classification system for deep encyclopedia learning

According to the Chinese medical question classification system based on deep encyclopedia learning, by using a semantic structure of Chinese search encyclopedia in combination with a deep learning method, a method for constructing a feature vector more efficiently and accurately is provided, which comprises: using a semantic association degree efficient convergence method based on the semantic structure of the Chinese search encyclopedia for constructing a network inquiry question feature vector; based on the features of the medical questions, improving a semantic association degree algorithm, solving the defect that the speed is low when feature vectors are constructed, and expanding feature words by extracting Chinese search encyclopedia word links; on the basis of a distributed Chinese word vector space of a CB-CBS language model, achieving efficient dimensionality reduction of network inquiry question feature vectors, avoiding the problem of data sparseness, greatly improving the inquiry classification efficiency; and using the CB-CBS model in combination with Chinese search encyclopedia and deep learning to construct distributed medical question word vectors, constructing a professional medical question corpus, and improving the accuracy of the word association degree and the medical question classification efficiency remarkably.
Owner:李蕊男

Road condition determination method and device, medium and electronic equipment

ActiveCN112509332AGuarantee calculation accuracy and calculation efficiencyImprove calculation accuracyDetection of traffic movementElectronic mapTransport engineering
The invention provides a road condition determination method and device, a medium and electronic equipment. The method comprises the following steps: determining a shunting path of a traffic marking road section and at least two associated road sections, and obtaining vehicle driving information; determining at least two pieces of marking road condition information according to the vehicle drivinginformation, and dividing the vehicle driving information according to the at least two pieces of marking road condition information to obtain at least two road condition sets; calculating the vehicle driving information according to the at least two road condition sets to obtain at least two sets of driving information; determining at least two pieces of associated road condition information andat least two pieces of associated driving information of the at least two associated road sections; and according to the at least two pieces of associated road condition information, the at least twopieces of associated driving information, the at least two pieces of marking road condition information and the at least two pieces of set driving information, determining shunting road condition information of the shunting path in the at least two pieces of marking road condition information. The calculation accuracy of the road condition information is improved, and the service experience of using the electronic map by a user is improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD

English lexical analysis method and system for neural network machine translation

The invention provides an English lexical analysis method for neural network machine translation. The English lexical analysis method comprises the following steps: performing English word segmentation on a to-be-processed English text; carrying out query screening on each word obtained after English word segmentation by utilizing a special vocabulary; reserving result information of the queried words in a lexical analysis result; performing rule processing on the words which are not queried; carrying out word architecture restoration on the words meeting the rule processing conditions, and directly storing the words which do not meet the rule processing conditions in a lexical analysis result; and outputting a lexical analysis result, and adding the lexical analysis result into machine deep learning training. The invention further provides an English lexical analysis system for neural network machine translation. According to the English lexical analysis method for neural network machine translation, the problems that the machine learning efficiency is reduced and the translation quality is poor due to the fact that training corpus data of neural network machine translation is sparse can be solved.
Owner:北京中献电子技术开发有限公司

Chinese domain term recognition method based on mutual information and conditional random field model

The invention discloses a Chinese domain term recognition method based on mutual information and a conditional random field model. The Chinese domain term recognition method includes the following steps: (1) gathering domain text corpus and marking all the punctuations, spaces, numbers, ASSCII (American Standard Code for Information Interchange) characters and characters except Chinese characters in the corpus; (2) setting character strings and computing the mutual information values of the character strings, (3) computing the left comentropy and the right comentropy of every character string, (4) defining character string evaluation function, setting evaluation function threshold, computing the evaluation function values of every character string, determining that every character string is a word, comparing in sequence the evaluation function value of the former character with the evaluation function value of the latter character in the character string and segmenting character meaning character strings one by one, (5) utilizing conditional random fields to train a conditional random field model and recognizing domain terms with the conditional random field model. When the Chinese domain term recognition method is used to recognize terms, the data sparsity of legitimate terms is overcome, the amount of calculation of conditional random fields is reduced, and the accuracy of the Chinese domain term recognition is improved.
Owner:SHANGHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products