Patents

Literature

Patsnap Eureka AI that helps you search prior art, draft patents, and assess FTO risks, powered by patent and scientific literature data.

56 results about "Semantic mining" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

Automatic standardized filing method based on text semantic mining

InactiveCN104572849AFree from readingShorten the timeSpecial data processing applicationsInformatizationSemantics

The invention relates to an automatic standardized filing method based on text semantic mining. The automatic standardized filing method is characterized by comprising the steps: crawling files from a website, and carrying out information extraction, key word extraction and automatic abstract generation on the crawled file and a local file by utilizing text semantics, and finally storing into an informatization system. For the information extraction, a rule set is established by adopting a knowledge engineering method, information points are automatically extracted from the file to form structural data; for the key word extraction, a key word is extracted according to a position and semantics of a word in a text to generate a key word index; for the automatic abstract generation, a content contained by the abstract is firstly set, corresponding information is extracted from the text, the similarity of sentences is calculated, and the texts including the key information in the original file are extracted. By adopting the automatic standardized filing method, business personnel do not need to read a great amount of files, time and labor are saved, and convenience in inquiry and application can be realized.

Automatic standardized filing method based on text semantic mining

Automatic standardized filing method based on text semantic mining

Automatic standardized filing method based on text semantic mining

Owner:MERIT DATA CO LTD

Multi-strategy fused method and device for identifying named entity

ActiveCN107330011ASemantic analysisSpecial data processing applicationsEntity typeData mining

The invention discloses a multi-strategy fused method and device for identifying a named entity. A first identification result is obtained by utilizing a first identifying model to identify the named entity in obtained linguistic data, the first identifying model can update and extend a corpus in the method, accordingly the newly produced named entity in the linguistic data can be identified, thus a first identifying result has higher accuracy rate, the named entity in the linguistic data is identified by utilizing a multi-identifying-model fused method to obtain a second identifying result, the first identifying result and the second identifying result are fused to obtain a third identifying result, then a semantic mining system is utilized to conduct role assignment on the named entity, the named entity having a role is output, accordingly it is achieved that the named entity is reliably identified under the situations that data is massive, the entity type is diversified and new words emerge in endlessly, and role assignment is conducted on the identified named entity.

Multi-strategy fused method and device for identifying named entity

Multi-strategy fused method and device for identifying named entity

Multi-strategy fused method and device for identifying named entity

Owner:ZHONGKE DINGFU BEIJING TECH DEV

In-text advertisement releasing method and system based on deep semantic mining

ActiveCN103853824APrecise positioningAdvertisementsSemantic analysisWeb siteText categorization

The invention discloses an in-text advertisement releasing method and an in-text advertisement releasing system based on deep semantic mining. The method comprises the following steps that an advertisement demand body is built; webpage contents are captured and received, webpage irrelevant to business information is removed according to the advertisement body and the text classification algorithm, the webpage types are judged, and keywords and key sentences are extracted; according to the linguistic rule, the deep semantic mining is carried out on the extracted key sentences, sentences, phases or words with commercial properties and with demands, emotion and attitude are discovered and extracted, and in addition, the advertisement marking is carried out; precise advertisement is embedded in an advertisement marking region through a generating type system, and when users browse the kind of webpage, the advertisement is shown in the specific region. The method and the system provided by the invention have the advantages that the advertisement relevant to the user reading content context demands can be issued in the webpage text contents, which page is suitable for in-text advertisement releasing in the website and the placing region and the advertisement words of the advertisement in the page can be analyzed, and the technical problems in the prior art can be solved.

In-text advertisement releasing method and system based on deep semantic mining

In-text advertisement releasing method and system based on deep semantic mining

In-text advertisement releasing method and system based on deep semantic mining

Owner:沈之锐

Web portrait search method for fusing text semantic and vision content

InactiveCN101388022AImprove retrieval performanceImprove accuracySpecial data processing applicationsFace detectionQuery string

The invention relates to a Web portrait retrieval method fused with text semantics and visual content, which comprises steps of submitting 'a query string'to a commercial search engine server to realize functions of connecting and downloading based on a HTTP protocol, downloading picture output of the commercial picture search engine and relevant websites to be a local image library, and simultaneously, extracting key tags of original websites to form XML files for post text processing, further, utilizing the AdaBoost face detection technique, mining the high-level semantics of vector models to webpage scripts containing pictures, comparing via using experienced weights and a method of dynamically weighting based on PLSA, dynamically combining visual analysis results of image characteristics and text analysis results of image characteristics via a regulating factor, obtaining rank values of relevancy of images and query, reordering the image output list of the search engine and feeding it back to users. The method has higher precision rate which is greatly increased after fusion of the characteristics.

Web portrait search method for fusing text semantic and vision content

Web portrait search method for fusing text semantic and vision content

Web portrait search method for fusing text semantic and vision content

Owner:BEIJING JIAOTONG UNIV

An ES-based electronic medical record retrieval method

ActiveCN109299239AImprove accuracyImprove recallDigital data information retrievalSemantic analysisMedical recordMedical terminology

The invention discloses an ES-based electronic medical record retrieval method, which relates to the technical field of medical data retrieval. This method introduces semantic analysis model into electronic medical record analysis, including the extraction of subject words and the calculation of semantic similarity, taking advantage of their advantages in text semantic mining, This paper providesthe algorithm support for the latent semantic mining of text information in electronic medical record retrieval by establishing the general medical semantic database (negative word, synonym, ambiguousword), It realizes the high accuracy and recall rate of information retrieval, better adapts to the medical terminology compared with the common natural language is often more complex and constantlychanging, and medical abbreviations, synonyms and polysemous words more characteristics. It meets the scientific research needs of multi-dimensional combined retrieval and the needs of full-text retrieval of related literatures based on latent semantic search. Realize intelligent full-text retrieval with semantic extension and semantic connotation extension in real sense.

An ES-based electronic medical record retrieval method

An ES-based electronic medical record retrieval method

An ES-based electronic medical record retrieval method

Owner:弘扬软件股份有限公司

System and method for human and machine voice interaction based on Java Map

ActiveCN104571485AEfficient use ofEasy to implementInput/output for user-computer interactionSpeech recognitionSpoken languageUser input

The invention provides a system and a method for human and machine voice interaction based on Java Map. The system comprises a voice identifying module, a spoken language comprehending module, a dialogue management module, a language generating module and a voice synthesizing module, wherein the voice identifying module is used for receiving voice information inputted by a user, and identifying the voice information into text data, the spoken language comprehending module is used for performing semantic mining on the text data, and converting the text data into a type which can be identified by a machine, the semantic mining is used for integrating contextual information inputted by the user according to a storage and utilization strategy of context key semantic elements of the Java Map, and extracting semantic key elements of the identified text, the dialogue management module is used for controlling the dialogue process of the human and machine interaction, the language generating module is used for integrating fragmentary answers to obtain a fluent text which meets the logic language expression type of people, and the voice synthesizing module is used for converting the generated answer text into voice information, and broadcasting the voice information to the user.

System and method for human and machine voice interaction based on Java Map

System and method for human and machine voice interaction based on Java Map

Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI +1

Machine learning-based commodity comment data sentiment analysis method

InactiveCN108062304AMarket predictionsSemantic analysisLearning basedSemantic network

The invention discloses a machine learning-based commodity comment data sentiment analysis method. The method comprises the steps of collecting and extracting commodity comment data; preprocessing thedata, wherein the preprocessing comprises text duplicate removal, mechanical compression word removal and short sentence deletion; based on a Jieba word segmentation method, performing text word segmentation on the preprocessed data; building a sentiment analysis model: generating word vectors based on NNLM (Neural Network Language Model) training, and establishing a semantic network; and based on an LDA topic model, performing semantic mining, and generating a topic in an unsupervised manner. The unsupervised sentiment analysis method is realized; and a result shows that the sentiment analysis mode can effectively analyze comment sentiments of users.

Machine learning-based commodity comment data sentiment analysis method

Machine learning-based commodity comment data sentiment analysis method

Machine learning-based commodity comment data sentiment analysis method

Owner:BEIJING UNIV OF TECH

Distribution network earth fault analysis method based on deep learning

ActiveCN109324266ARealize automatic identificationReasonable arrangementFault location by conductor typesIncremental learningTransient state

The invention discloses a distribution network earth fault type identification device and method based on deep learning. Semantic mining is carried out on fault recording data by using a deep learningtechnology, a fault identification model with self-learning capacity is constructed by using incremental learning and deep reinforcement learning technologies, automatic identification for distribution network earth fault type is realized, the restriction that only route selection positioning is carried out in traditional earth fault analysis is broken through, and more abundant fault processingdecision-making information is provided; a positioning method based on a transient state zero-sequence current similarity principle is used, and compared with a traditional experience analysis accident processing method, the method has higher accuracy and wider adaptation; fault type and fault location result are synthesized, the decision basis is reasonable, the method is of more pertinency for solutions, and an accident processing plan can be arranged more reasonably.

Distribution network earth fault analysis method based on deep learning

Distribution network earth fault analysis method based on deep learning

Distribution network earth fault analysis method based on deep learning

Owner:WUHAN NARI LIABILITY OF STATE GRID ELECTRIC POWER RES INST +1

Automobile public praise semantic sentiment analysis system

ActiveCN110543547AEvaluation Quick LearnText database queryingSpecial data processing applicationsAnalysis dataMinutiae

The invention discloses an automobile public praise semantic sentiment analysis system. The system comprises a real-time monitoring module, a semantic mining module and a system display module, the real-time monitoring module is used for acquiring automobile user comment data in real time in a whole network according to a crawler technology; the semantic mining module is used for mining theme keywords related to the automobile from the comment data and performing emotional tendency analysis on the theme keywords to obtain emotional analysis data; the system display module performs statisticalanalysis on the whole vehicle model, key attributes and product competitiveness details according to the emotion analysis data to form a visual chart; according to the invention, related comment corpora of the user are analyzed by using an artificial intelligence algorithm; the technical problem that in the prior art, no system specially analyzes the evaluation of the automobile by the user, and the evaluation of the automobile by the user cannot be quickly known is solved, the evaluation of the automobile by the user can be quickly known, the system can be in a favorable position of knowing the user in competition, and the system can be widely applied to the automobile industry.

Automobile public praise semantic sentiment analysis system

Automobile public praise semantic sentiment analysis system

Automobile public praise semantic sentiment analysis system

Owner:广东数鼎科技有限公司

Method for marking marketing calls by means of semantic mining algorithm and system for governing marketing calls

ActiveCN108153727AAccurate classificationAccurate interceptionNatural language data processingData miningData miningMultiple layer

The invention discloses a method for marking marketing calls by means of a semantic mining algorithm and a system for governing the marketing calls. The method for marking the marketing calls by meansof a semantic mining algorithm method for marking the marketing calls by means of the semantic mining algorithm particularly comprises the steps that S1, labels of calls are divided into different types; S2, a dictionary base is built, wherein the dictionary base comprises word vectors corresponding to the labels; S3, the labels, belonging to the same type, in the dictionary base are extracted, and a layer of training sample is formed; S4, training is conducted by means of multiple layers of training samples, and a classification model is obtained; S5, according to the classification model, the types to which the word vectors in the dictionary base belong are marked. The invention further discloses the system for governing the marketing calls by means of the semantic mining algorithm. According to the method for marking the marketing calls by means of the semantic mining algorithm and the system for governing the marketing calls, the marketing calls can be accurately classified, a user can independently select the types of the incoming calls, and the purpose of precisely intercepting the marketing calls is achieved.

Method for marking marketing calls by means of semantic mining algorithm and system for governing marketing calls

Method for marking marketing calls by means of semantic mining algorithm and system for governing marketing calls

Method for marking marketing calls by means of semantic mining algorithm and system for governing marketing calls

Owner:ZHEJIANG PONSHINE INFORMATION TECH CO LTD

Online semantic excavation system of Chinese polysemic words and based on uniform resource locator (URL)

InactiveCN103488741AEffectively obtain online semantic classification resultsWeb data indexingSpecial data processing applicationsWeb page categorizationClassification methods

The invention discloses an online semantic excavation system of Chinese polysemic words and based on a uniform resource locator (URL). The system utilizes a webpage classification method based on the URL and can conduct semantic excavation on the Chinese polysemic words online. The process includes first constructing a URL classifier through an online URL classification catalogue; then classifying searching results (including webpage URLs and abstracts) of the polysemic words returned by a search engine by means of the URL classifier to obtain initial semantic classification results of the polysemic words; finally clustering the initial semantic classification results according to the webpage abstracts to obtain semantic excavation results of the polysemic words. The semantic excavation system has ideal accuracy and recall rate and is highly applicable to semantic excavation of network popular words.

Online semantic excavation system of Chinese polysemic words and based on uniform resource locator (URL)

Online semantic excavation system of Chinese polysemic words and based on uniform resource locator (URL)

Online semantic excavation system of Chinese polysemic words and based on uniform resource locator (URL)

Owner:EAST CHINA NORMAL UNIV

Compound word processing method and device used for semantic mining and equipment thereof

ActiveCN107894979AReduce the cost of trainingGood effectSemantic analysisSpecial data processing applicationsAlgorithmGranularity

The invention puts forward a compound word processing method and device used for semantic mining and equipment thereof. The method comprises the following steps: determining M segmented words of eachstatement in a training corpus; selecting N segmented words to generate N-dimension compound words according to the order of appearance of the M segmented words, wherein M is larger than or equal to 2and N is larger than or equal to 2 but smaller than or equal to M; putting character strings of N-dimension compound words into K-time hash operations, searching a pre-established random hash dictionary space to obtain positions only corresponding to hash operations each time and generating K-dimension word vectors of the N-dimension compound words according to floating point numbers of K positions corresponding to the K-time hash operation results, wherein K is an integer larger than 1; and screening out N-dimension target compound words meeting the pre-set condition according to K-dimensionword vectors of all the N-dimension compound words and inputting the N-dimension target compound words into a word bag model for semantic mining. Therefore, semantic features with more and larger granularities are introduced into the word bag model so that the effect of the word bag model is further improved.

Compound word processing method and device used for semantic mining and equipment thereof

Compound word processing method and device used for semantic mining and equipment thereof

Compound word processing method and device used for semantic mining and equipment thereof

Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Method for geographic semantic mining based on text big data

InactiveCN108932322AImprove understandingSpecial data processing applicationsGeolocationSemantics

The invention discloses a geographic semantic mining method based on text big data. The method comprises: using data crawling to obtain text data with geographic position labels, then assigning a geographic semantic topic to the selected part of the text data, and preprocessing the text data to generate a word vector, and then obtaining the geographic semantic topics of all the texts by a machinelearning method, and finally outputting all the geographic semantic topics in the form of vectors. The method speculates geographical semantics of a region according to the text data of the region, and provides theoretical support and assumptions for further urban planning, commercial location selection, trip planning and the like. A result of the method also contributes to strengthening people'sunderstanding of a certain region, and provides assistance for people's travel or play planning.

Method for geographic semantic mining based on text big data

Method for geographic semantic mining based on text big data

Method for geographic semantic mining based on text big data

Owner:PEKING UNIV

Comprehensive urban geographic semantic mining method based on multivariate big data

ActiveCN110442715AUnderstand comprehensivelyMeet different types of query needsCharacter and pattern recognitionResourcesUrban densityResting time

The invention discloses a comprehensive urban geographic semantic mining method based on multivariate big data. Social text data is a data source capable of best reflecting the cognition of people onurban area functions, so that the functions of urban areas are extracted by utilizing the social text data; based on the bus route data, automatic calculation is performed to obtain the relative rankof the traffic convenience of each region of the city without depending on artificially formulated rules; the urban population density distribution is analyzed from two macroscopic perspectives of thepopulation density index of the working time period of the workday and the population density index of the rest time period. According to the method, the comprehensive urban geographic semantics aredescribed from four different indexes including urban area functions, urban traffic convenience distribution, building functions and population density indexes; and in combination with the informationmined by the four indexes, different types of query requirements of different types of users can be met, and people can be better helped to comprehensively understand cities.

Comprehensive urban geographic semantic mining method based on multivariate big data

Comprehensive urban geographic semantic mining method based on multivariate big data

Comprehensive urban geographic semantic mining method based on multivariate big data

Owner:PEKING UNIV

Method for constructing spatial semantic database based on BIM and GIS data integration

PendingCN112527944AAvoid missingAvoid problems such as semantic mapping errorsVisual data miningStructured data browsingRelational tableData interface

The invention discloses a method for constructing a spatial semantic database based on BIM and GIS data integration. The method comprises the following steps: analyzing spatial data of BIM and GIS models; establishing mapping rules of object model related attribute description fields of IOT equipment and BIM and GIS model object attribute description fields, and constructing a relational database;extracting spatial semantic information, establishing a spatial semantic mapping relation table, and constructing a graph database; constructing a time sequence database according to the time sequence data transmitted by the IOT equipment; storing spatial semantic information in a graph database, storing relational data and static business data in a relational database, storing Internet-of-thingsdata in a time sequence database, and reading data interfaces of all the databases through a visualization engine. According to the method, a data storage standard of a BIM + GIS model is established, and fusion application is realized, so that a digital twin system application mode and a technical framework integrated with key technologies such as ontology, semantic mining, Internet-of-things, big data analysis and the like are constructed.

Method for constructing spatial semantic database based on BIM and GIS data integration

Method for constructing spatial semantic database based on BIM and GIS data integration

Owner:华建数创(上海)科技有限公司

N-gram-based semantic mining method for increment of topic model

InactiveCN102243625ASpecial data processing applicationsData streamAlgorithm

The invention discloses an N-gram-based semantic mining method for increment of a topic model. The N-gram-based semantic mining method comprises the steps of: (1) expanding an Author-Conference topic model, wherein a word space is expanded from Unigram to N-gram; (2) calculating a prior probability parameter of a current model according to linear weighting of posterior probability in a prior trained model with respect to current input data; (3) calculating a posterior probability value of the current model to the current data by adopting a Gibbs sampling method; and (4) repeating the steps (2) and (3) for training the model in an increment manner with respect to newly input data streams. According to the invention, the N-gram is introduced into the topic model and the property of the topic model for modeling scientific and technical literature is improved according to the semantic features contained by the N-gram; and the topic distribution of historic data is recorded by adopting asymmetric prior probability so as to train the model in an increment manner, and the efficiency of the method is increased.

N-gram-based semantic mining method for increment of topic model

N-gram-based semantic mining method for increment of topic model

N-gram-based semantic mining method for increment of topic model

Owner:BEIHANG UNIV

Binocular image rapid target detection method based on double-flow convolutional neural network

ActiveCN110110793AMitigation challengesApplication efficiency is fast and efficientCharacter and pattern recognitionSemantic informationSemantic mining

The invention discloses a binocular image rapid target detection method based on a double-flow convolutional neural network, and the method comprises the steps: carrying out the calibration of a binocular camera, and obtaining calibration parameters; correcting a training image according to the calibration parameters, and training an implicit deep semantic mining network for implicitly learning deep semantic information on the binocular image and training a multi-modal feature hybrid detection network; combining the features output by the implicit deep semantic mining network with the featuresof the multi-modal feature hybrid detection network in a channel series connection manner to form a double-flow convolutional neural network, and training the double-flow convolutional neural networkby using the training image; and obtaining a test image through the binocular camera, correcting the test image, and inputting the corrected image into the double-flow convolutional neural network for target detection to obtain a target detection result. The complementarity of RGB and deep semantic information can be comprehensively utilized, and the method has the advantages of being high in efficiency and more accurate in target detection result.

Binocular image rapid target detection method based on double-flow convolutional neural network

Binocular image rapid target detection method based on double-flow convolutional neural network

Binocular image rapid target detection method based on double-flow convolutional neural network

Owner:SUN YAT SEN UNIV

Text classification method and device, electronic equipment and storage medium

PendingCN113392210AImprove classification accuracyReduce model parametersSemantic analysisSpecial data processing applicationsText categorizationClassification methods

The invention relates to the field of computers, in particular to the technical field of artificial intelligence, and discloses a text classification method and device, electronic equipment and a storage medium; the method comprises the steps: obtaining to-be-recognized text information, inputting the text information into a trained first text classification model, and obtaining a target word vector matrix; performing semantic mining processing on each target word vector to obtain a corresponding semantic feature, and finally obtaining a target prediction classification result based on each semantic feature, wherein the first text classification model is obtained by performing parameter adjustment based on a first loss value and a second loss value, the first loss value is an error value between the prediction classification result and the actual classification result, and the second loss value is an error value between the two prediction classification results. According to the invention, two loss values are used for adjusting parameters of the first text classification model, so that a predicted classification result of the first text classification model approaches to an actual classification result and another predicted classification result, and the classification accuracy of the model is further improved.

Text classification method and device, electronic equipment and storage medium

Text classification method and device, electronic equipment and storage medium

Text classification method and device, electronic equipment and storage medium

Owner:TENCENT TECH (SHENZHEN) CO LTD

Depression auxiliary detection method based on open question and answer text

InactiveCN110532387ASpecial data processing applicationsText database clustering/classificationSpeech identificationQuestions and answers

The invention discloses a depression auxiliary detection method based on an open question and answer text, which is mainly characterized in that the depression auxiliary detection method is a text analysis method suitable for dialogue between a depression patient and a doctor, a model is retrained along with the increase of dialogue data to achieve a self-learning improvement effect, and the depression auxiliary detection method specifically comprises the following steps: text collection: extracting a dialogue text by using a voice recognition technology; text preprocessing: performing text preprocessing according to data features and model design; training a text semantic model, and performing semantic mining on the preprocessed text; predicting a text model and classifying features output by the semantic mining model. The model is self-learned, and the model can be self-iterated and updated along with the increase of the number of recognized samples.

Depression auxiliary detection method based on open question and answer text

Depression auxiliary detection method based on open question and answer text

Depression auxiliary detection method based on open question and answer text

Owner:成都中科云集信息技术有限公司

Agricultural non-point source pollution multi-source heterogeneous big data association method and big data supervision platform adopting same

PendingCN110196886AQuick annotationEfficient associationData processing applicationsRelational databasesSupport vector machinePoint source pollution

The invention relates to an agricultural non-point source pollution multi-source heterogeneous big data association method based on attribute classification. Compared with the prior art, the defect that efficient association is difficult to perform according to the data attributes is overcome. The method comprises the following steps of judging whether multi-source heterogeneous big data belongs to the quantitative data or the qualitative data; classifying the quantitative data by adopting a support vector machine, metric learning and other methods; obtaining the quantitative characteristics of the qualitative data by adopting a text semantic mining method, and classifying the qualitative data by adopting a support vector machine, a metric learning method and the like; encoding the classified result to realize the association of multi-source heterogeneous big data. The invention further provides an agricultural non-point source pollution big data supervision platform. According to thepresent invention, the attributes of the agricultural non-point source pollution multi-source heterogeneous big data are used as the classification bases, different processing methods are adopted forquantitative and qualitative data, the classification of the agricultural non-point source pollution multi-source heterogeneous big data is achieved, and the correlation is conducted by means of the generated tree structure soil pollution attribute codes.

Agricultural non-point source pollution multi-source heterogeneous big data association method and big data supervision platform adopting same

Agricultural non-point source pollution multi-source heterogeneous big data association method and big data supervision platform adopting same

Agricultural non-point source pollution multi-source heterogeneous big data association method and big data supervision platform adopting same

Owner:ANHUI UNIVERSITY

Financial risk prediction method and device based on text pre-training and multi-task learning

PendingCN113743111APoor resolutionImprove loading speedFinanceSemantic analysisRisk classificationNetwork model

The invention relates to a financial risk prediction method and device based on text pre-training and multi-task learning. The method comprises the steps of obtaining a to-be-processed text; inputting the to-be-processed text into a first neural network model to determine whether the content of the to-be-processed text includes financial risks according to the processing flow of the risk identification task; under the condition that the content of the to-be-processed text comprises the financial risk, determining the risk type of the financial risk according to the processing flow of the risk classification task by utilizing a first neural network model; and determining a risk subject matched with the risk type by using the first neural network model according to a processing flow of the risk subject identification task. According to the method and the device, the problem of poor model performance caused by lack of deep semantic mining is solved through a pre-training language model technology, and the technical problems of limited data volume and poor model performance caused by incapability of information sharing among tasks are solved by adopting multi-task processing.

Financial risk prediction method and device based on text pre-training and multi-task learning

Financial risk prediction method and device based on text pre-training and multi-task learning

Financial risk prediction method and device based on text pre-training and multi-task learning

Owner:NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT

A taxpayer tax registration address information clustering method based on a K-means algorithm model

InactiveCN109271462AIncrease the difficultyEfficient processing analysisDatabase modelsVector space modelMachine learning

The invention provides a taxpayer tax registration address information clustering method based on a K-means algorithm model, and belongs to the technical field of computers. First, the registration address is subjected to semantic mining of natural language, including the expansion of thesaurus and the operation of word segmentation. For the result of address segmentation, a Vector Space Model (VSM) is utilized to transform the text vector, and then the K-means algorithm is adopted to convert to the text vector address for clustering, an unsupervised mode is adopted to select the appropriate number K of clusters, and the structure is specified based on a clustering result and according to the need.

A taxpayer tax registration address information clustering method based on a K-means algorithm model

A taxpayer tax registration address information clustering method based on a K-means algorithm model

A taxpayer tax registration address information clustering method based on a K-means algorithm model

Owner:HEBEI AISINO TECH CO LTD

Distribution network first-aid repair work order processing method and device

PendingCN112036582AImprove the efficiency of emergency repair workActive recognition rate improvementData processing applicationsSemantic analysisAlgorithmIndustrial engineering

The embodiment of the invention provides a distribution network first-aid repair work order processing method and device, and the method comprises the steps: importing to-be-processed distribution network first-aid repair work order data and a pre-established expression of a hidden Markov model into a Viterbi algorithm for iterative calculation, and obtaining a calculated state sequence corresponding to a distribution network first-aid repair work order; carrying out convolution operation on the obtained state sequence, classifying and sorting operation results according to different power failure reasons, and obtaining and counting specific reasons used for representing power grid faults in the distribution network repair work order. According to the method, the distribution network first-aid repair work order is used as the recognition model for matching, semantic mining and learning are carried out on the work, and adaptive work order processing is realized, so that the data processing efficiency is improved, the reliability is improved, the active recognition rate of the work order is further improved, and the distribution network repair work efficiency is enhanced.

Distribution network first-aid repair work order processing method and device

Distribution network first-aid repair work order processing method and device

Distribution network first-aid repair work order processing method and device

Owner:STATE GRID ZHEJIANG ELECTRIC POWER +2

Data mining method for big data processing

InactiveCN104484409AComprehensive resultsThe result is accurateSpecial data processing applicationsAnalysis dataTopological graph

The invention relates to the technical field of internet, in particular to a data mining method for big data processing, which is comprehensive in mining result and high in data processing speed. The data mining method comprises the following steps of acquiring a retrieval demand of a user, processing the retrieval demand to be consistent data, then performing correlation matching on the consistent data and pre-stored purchased words, acquiring at least one correlation data source between the pre-stored purchased words and the retrieval demand data, building a topological graph of network, and mining in a database based on the topological graph of network. Compared with the prior art, the simple word matching or semantic mining way is abandoned, and the mining for potential key information is carried out on the data to be analyzed through starting with the correlation topological network, so that the data mining method for big data processing has the remarkable advantages that the acquired results are more comprehensive and more accurate, and the like.

Data mining method for big data processing

Owner:WUHU LERUISI INFORMATION CONSULTING

Dynamic expanding knowledge graph inference method oriented to China Mobile intelligent customer service

ActiveCN108256077ARealize new semantic mining of multi-round dialogue scenesMeet the needs of multiple rounds of dialogueSpecial data processing applicationsSemanticsSmart technology

The invention relates to a dynamic expanding knowledge graph inference method oriented to China Mobile intelligent customer service, and belongs to the technical field of artificial intelligence. Themethod comprises the steps of constructing a ternary knowledge graph representing learning; executing a knowledge graph Top-k query technology under a limiting technology; conducting single-business multi-round dialogue scene knowledge inference; conducting cross-business multi-round dialogue scene new-semantics mining. By means of the dynamic expanding knowledge graph inference method oriented tothe China Mobile intelligent customer service, natural language information with effective response interaction is provided for a user, and the requirements for multi-round dialogues with the intelligent customer service are met.

Dynamic expanding knowledge graph inference method oriented to China Mobile intelligent customer service

Dynamic expanding knowledge graph inference method oriented to China Mobile intelligent customer service

Dynamic expanding knowledge graph inference method oriented to China Mobile intelligent customer service

Owner:CHONGQING UNIV OF POSTS & TELECOMM

Pseudo-correlation feedback extended query method based on question-answering system

ActiveCN112307182AImprove sorting effectImprove accuracySemantic analysisCharacter and pattern recognitionSearch intentLexical item

The invention discloses a pseudo-correlation feedback extended query method based on a question-answering system. According to the pseudo-correlation feedback extended query method based on the question and answer system, mature semantic mining modules, such as attention mechanisms, in the question and answer system are used for reference, so that the model can truly understand the search intention of a user, and extended lexical items are selected according to interactive semantic information of query and documents. Compared with a traditional model, due to the fact that semantic interactioncharacteristics are added, the effect of selecting the extended lexical items is remarkably improved. In addition, a neural network based on a paired loss function is further added to understand statistical characteristics of lexical items, and word frequency, inverse document frequency and the like are used for correcting the possible semantic drift problem of the semantic model. Practice provesthat compared with a traditional pseudo-correlation feedback algorithm, the method has higher sorting accuracy and better robustness, and can be applied to various search scenes.

Pseudo-correlation feedback extended query method based on question-answering system

Pseudo-correlation feedback extended query method based on question-answering system

Pseudo-correlation feedback extended query method based on question-answering system

Owner:SHANGHAI JIAO TONG UNIV

Content semantic mining method for non-structured big data stream

ActiveCN105740329AMaster content semanticsMastering Emotional TendencySpecial data processing applicationsText database clustering/classificationNode clusteringSmall-world network

The invention discloses a content semantic mining method for a non-structured big data stream. The method comprises the steps of S1: extracting a text link, a label attribute and a semantic tendency keyword in the big data stream, and correspondingly defining text nodes, label nodes and content nodes; S2: constructing a text node set containing the text nodes and a label node set containing the label nodes, calculating and outputting weight values from the text nodes to the label nodes and weight values from any label node to other all label nodes; S3: according to the text node set, the label node set, the weight values from the text nodes to the label nodes and the weight values from any label node to other all label nodes, performing semantic classification on the content nodes and constructing different content node classification sets; and S4: according to the text node set and the content node classification sets, performing weighted small-world network clustering calculation on the text nodes to obtain a text node cluster set.

Content semantic mining method for non-structured big data stream

Content semantic mining method for non-structured big data stream

Content semantic mining method for non-structured big data stream

Owner:ZHEJIANG WANLI UNIV

Urban functional area identification process based on space-time semantic mining

PendingCN113627864AImprove the extraction effectSemantic analysisCharacter and pattern recognitionFunctional identificationData mining

The invention discloses an urban functional area identification process based on space-time semantic mining, which comprises documents, words, basic functional units, space-time data, a topic model, document topic distribution and unit function distribution, and is characterized in that firstly, hidden functions of an area are tried to discover through the topic model, compared with a text theme mining, the basic function units are equivalent to the documents in a corpus, space-time data in the basic function unit is similar to words in the document, unit function distribution obtained after passing through the topic model is equivalent to document topic distribution, and the used city space-time data is representative Sina microblog position sign-in data. Each piece of sign-in data comprises user information, space coordinates of sign-in positions, publishing time, publishing texts and the like. Dynamic activity modes of people can be reflected from different angles, meanwhile, POIs in a research area are obtained from a Baidu map, and function recognition of the area is achieved.

Urban functional area identification process based on space-time semantic mining

Urban functional area identification process based on space-time semantic mining

Urban functional area identification process based on space-time semantic mining

Owner:武汉市中城事大数据有限责任公司

Phrase semantic mining method driven by directed graph meaning guiding model

PendingCN111291573AMeet the needs of phrase semantic miningImplement miningSemantic analysisData miningTheoretical computer scienceGraph model

According to the phrase semantic mining method driven by the directed graph meaning guiding model, logic structure representation of a typical ontology language Word-Net is achieved through a Sem-Graph data model, and on this basis, modeling work of the Word-Net ontology language is achieved. The method comprises the following steps: performing semantic-level phrase structure mining on natural language text data based on a Sem-Graph model; modeling statements in a natural language by using a semantic-oriented semantic model, realizing statement-level semantic graph data structure description;defining a phrase semantic structure on a semantic graph; and realizing mining of frequent phrase semantics by using a frequent sub-graph mining algorithm. According to the method and the system, thedocuments can be properly described and accurately summarized, so that each minimum processing unit has independent and relatively complete semantic features, high-quality phrases related to the fieldcan be mined from a large amount of text data, and the increasing phrase semantic mining requirements are fully met.

Phrase semantic mining method driven by directed graph meaning guiding model

Phrase semantic mining method driven by directed graph meaning guiding model

Phrase semantic mining method driven by directed graph meaning guiding model

Owner:高小翎

User portrait method applied to field of network security

PendingCN110674288AComprehensive descriptionAccurate descriptionText database clustering/classificationAnomaly detectionEngineering

The invention discloses a user portrait drawing method applied to the field of network security, which is used for drawing a user portrait in combination with natural attributes, operation behavior characteristics and data use habits of a user. On the basis of a traditional user portrait method based on statistics and rules, a machine learning method, such as semantic mining, time sequence fitting, clustering and correlation analysis, is added, a user behavior model is deeply mined and analyzed, and more accurate and effective anomaly detection capability is provided.

User portrait method applied to field of network security

User portrait method applied to field of network security

User portrait method applied to field of network security

Owner:蓝盾信息安全技术有限公司

Popular searches

Knowledge engineering Documentation Automatic summarization Data science Rule sets Index term Information extraction Computer science Named entity Linguistic rule