Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

359 results about "Word meaning" patented technology

Information processing method and device for realizing intelligent question answering

The invention relates to the technical field of man-machine interaction, and discloses an information processing method and device for realizing intelligent question answering. The information processing method comprises the following steps of: carrying out sentence segmentation on question text information to obtain a user question; and searching a standard question most similar to the user question and corresponding answer information from a QA library on the basis of a question similarity. Compared with the existing keyword retrieval-based question answering method, the method disclosed by the invention does not need to require the users to have keyword decomposition ability, is automatic in the whole process and is capable of greatly enhancing the user experience and improving the search effect and the pertinence and effectiveness of answers. Meanwhile, through fusing natural language understanding technologies such as sentence model analysis, lexical analysis and lexical meaning extension, and carrying out comprehensive calculation on multi-dimensional similarity, the method is capable of improving the correctness of a final sentence similarity in a Chinese automatic question answering process, and enabling a Chinese intelligent question answering system to be possible.
Owner:JIANGMEN POWER SUPPLY BUREAU OF GUANGDONG POWER GRID

Multi-subject extracting method based on semantic categories

The invention provides a multi-subject extracting method based on semantic categories. The multi-subject extracting method based on the semantic categories comprises the following steps that firstly, a document is preprocessed according to a traditional method and a vector composed of feature words is obtained preliminarily; secondly, synonyms are merged by the utilization of the corresponding relation between word meanings and concepts of 'HowNet', polysemic word disambiguation is carried out according to the correlation between the semantic categories and the context, and a concept vector model is constructed to represent the document; then the concept vector model is converted to be a semantic category model according to the one-to-one corresponding relation between the concepts and the semantic categories; the concept similarity is calculated by the utilization of the related semantic information in the concepts in 'HowNet' and then the semantic similarity is obtained; the semantic categories are clustered by improving the K-means algorithm according to the method of presetting seeds, and a plurality of subject semantic category clusters are formed; finally, a plurality of sub-subject word sets are obtained in a reverse mode according to the corresponding relations between the semantic categories and the concepts and between the concepts and words. The method considers the semantic information, overcomes the defect that the sensibility to the initial center by the K-means algorithm and time-and-space cost are not stable, and improves the quality of extracted subjects.
Owner:HOHAI UNIV

CRFs (conditional random fields) and SVM (support vector machine) based method for extracting fine-granularity sentiment elements in product reviews

The invention discloses a CRFs (conditional random fields) and SVM (support vector machine) based method for extracting fine-granularity sentiment elements in product reviews. The method comprises the steps as follows: a, a CRFs model is adopted, review language characteristics are taken as sequences, then position labelling is performed on review languages according to the sequences, corresponding rules are adopted to perform stratified filtering on error labels, and extraction for sentiment subjects and sentiment words is finished; and b, an SVM model is adopted to perform sentiment orientation analysis on word pairs according to the extracted sentiment subjects and sentiment words as well as introduced sentence structure features. According to the invention, the sentiment subjects and the sentiment words in review sentences are extracted together, further, sentiment classification accuracy in the sentiment orientation analysis is improved, so that the sentiment element extraction and sentiment judgment are improved, and F value is up to 76.3%; due to introduction of word meaning codes, the generalization ability and the robustness of a system are improved by virtue of the word meaning codes, and the accurate rate and recall rate of review result analysis are greatly improved.
Owner:青岛类认知人工智能有限公司

Court similar case recommendation model based on word vectors and word frequencies

PendingCN110597949AThe similarity calculation results are goodAvoid Natural DisadvantagesText database queryingSpecial data processing applicationsRecommendation modelComputational model
The invention discloses a court similar case recommendation model based on word vectors and word frequencies, namely a TF-W2V similarity calculation model. The judgment documents are divided into fivecase types of criminal affairs, civil affairs, execution, compensation and administrative affairs, and in order to process, store and query the judgment documents, the model extracts the key information from the submitted judgment, and finds out the judgment with the highest similarity in the same type of judgment in the document data by adopting a Word2Vc + TF-IDF text similarity algorithm to give out the similarity and recommend the judgment. According to the method, based on a word frequency and word vector method, the keywords and the word meaning information of the texts are integrated,and the similarity of the two texts is accurately calculated. The method is applied to the court judgment for similarity calculation, and the experimental results prove that the method is simple to apply, has no requirement for a labeling training set, can be applied to the texts in different fields, consumes the moderate time in calculation, is more accurate in obtained result compared with a traditional method, is closer to the expert evaluation results, and can calculate the similarity of the court texts accurately and effectively.
Owner:HUBEI UNIV OF TECH

Word segmentation algorithm-based log parsing method and word segmentation algorithm-based log parsing system

The invention relates to the technical field of log audit and safety management, and aims at providing a word segmentation algorithm-based log parsing method and a word segmentation algorithm-based log parsing system. The word segmentation algorithm-based log parsing method comprises the following steps: performing segmentation on a log, performing word sense analysis on segmentation results, performing word sense filtration on obtained segmentation results with word sense tagging, performing feature extraction on the obtained filtered segmentation results with the word sense tagging, performing feature matching on obtained word sense order feature codes, and performing semantic parsing on obtained semantic parsing rules; the word segmentation algorithm-based log parsing system comprises a segmentation module, a word sense analysis module, a word sense filtration module, a word order feature extraction module, a feature matching module and a semantic parsing module. According to the word segmentation algorithm-based log parsing method and the word segmentation algorithm-based log parsing system disclosed by the invention, the difficulty and complexity of log parsing are greatly reduced, and therefore the efficiency of performing parsing rule development on the log is increased; the word segmentation algorithm-based log parsing method and the word segmentation algorithm-based log parsing system can be better adapted to certain changes of a log format.
Owner:HANGZHOU ANHENG INFORMATION TECH CO LTD

Relation extraction method based on Bi-LSTM input information enhancement

The invention provides a relation extraction method based on Bi-LSTM input information enhancement and belongs to the field of artificial intelligence natural language processing of computers. The method comprises the steps that by applying a strategy annotation dataset of an indeterminate label, a redundancy encoding technology is used for conducting character-level encoding on each word to generate a word form encoding vector; the word form encoding vector and a word embedding vector are spliced to generate a word vector used for capturing word form and word meaning information; Bi-LSTM of input information enhancement is used as a model encoding layer, the word vector is input to an encoding layer, and the encoding vector is output; the encoding vector is input into a decoding layer, and a decoding vector is obtained; by applying three layers of NN, an entity label, a relation type and entity number information are extracted from the decoding vector; finally, the gradient is calculated, the weight is updated, and a model is trained through a maximum target function. By means of the relation extraction method, the robustness of the system is improved, interference information caused by non-entity words is reduced, and the accuracy rate and recall rate of relation extraction are effectively increased.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Answer determination method and device for intelligent questions and answers

The embodiment of the invention provides an answer determination method and device for intelligent questions and answers. The method comprises the steps of taking an association statement set containing the current questions and the original dialogue context related to the current questions as a basic analysis object; introducing the context feature vectors of the associated statement set and performing answer prediction based on the context feature vector so as to distinguishing the word meaning of the same word in a specific context without needing to predefine various context scenes; meanwhile, using a trained context understanding model for carrying out feature vector interaction processing on the selected candidate answers and all associated statements in the associated statement set;based on the obtained multiple vector interaction matrixes, and identifying the question and answer matching degree of the candidate answers and the current question, so that the matching degree of the associated statement set and the candidate answers in the corresponding context can be accurately understood, the accuracy of selecting the target answers from the candidate answers is improved, and further the answer recommendation accuracy for the user consulting questions is improved, and the use experience of the intelligent question and answer user is improved.
Owner:ADVANCED NEW TECH CO LTD

Word multi-prototype vector representation and word sense disambiguation method based on CRP clustering

The invention discloses a word multi-prototype vector representation and word sense disambiguation method based on CRP clustering, which comprises the following steps: the text in the massive text corpus is purified and pretreated to obtain plain text, CRP algorithm is used to cluster the context window representation of target polysemous word in the text corpus set. The target polysemous words inthe text corpus set are marked according to the clustering classification, and the polysemous words are trained on the marked text corpus set to obtain the multi-prototype vector representation of the polysemous words; 2, the target short text is preprocessed to obtain a short text word sequence, a target polysemous word in a word sequence is identifued, the contextual window of the target polysemous words is used to represent the similarity between the centroids of clusters corresponding to the words in the text corpus, and the word vector corresponding to the maximum similarity clusters isused as the word vector representation of the specific meaning of the polysemous words in the context to disambiguate the meanings of the polysemous words. The invention solves the problem of polysemyexpression in word expression and the problem of ambiguity identification in word meaning expression.
Owner:NORTH CHINA UNIV OF WATER RESOURCES & ELECTRIC POWER

Searchable encryption method and system for supporting fuzzy retrieval of word morphology and word meaning in cloud environments

ActiveCN108712366AFuzzy retrieval implementationSearch Encryption SimplifiedKey distribution for secure communicationSpecial data processing applicationsHash functionCiphertext
The invention belongs to the technical field of information security, and discloses a searchable encryption method and system for supporting fuzzy retrieval of word morphology and word meaning in cloud environments. The method includes establishing a security index by adopting a Bloom filter structure to support file dynamic update; accomplishing word morphology fuzzy matching by adopting a locally sensitive Hash function; performing word meaning expansion on keywords by using a natural language toolbox, constructing a word meaning fuzzy set, and inserting the word meaning fuzzy set into the established security index so that support word meaning keyword retrieval can be converted into accurate matching retrieval; performing inner product operation to realize the ordering of retrieval results by establishing query vectors and index vectors; and using the Hash function and a security knn algorithm to make the method and the system effectively resist selection keyword attacks so that thesearching of cryptograph on a cloud server under semi-trusted conditions. Cryptograph searching functions can be achieved; and the method and the system can support fuzzy retrieval of word morphologyand word meaning, multi-keyword retrieval, and dynamic adding and deleting files.
Owner:XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products