Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

238 results about "Word order" patented technology

In linguistics, word order typology is the study of the order of the syntactic constituents of a language, and how different languages employ different orders. Correlations between orders found in different syntactic sub-domains are also of interest.

Intelligent Chinese request-answering system based on concept

The invention discloses a Chinese question answering system based on concept, which mainly comprises a data server, a question pre-treatment module, a candidate question set extracting module and a question sentence similarity calculation module. The invention aims at providing a question answering system which is based on concept, can carry out synonym expansion of keywords which are processed by question sentences which are input by the user, understand question sentences better, carry out searching and improve the recall ratio of the question answering system. Furthermore, the system has a Chinese sentence similarity calculation method based on concept from three aspects: word form, word order and word length, and improves searching precision ratio. Meanwhile, the system adopts a high-efficiency retrieval technology to realize rapid extraction of candidate question set, calculates question sentence similarity, sorts question set quickly and returns the sorted questions and answers to the user. The question answering system of the invention gives more precise understanding in concept to the question sentences input by the user and searches the accurate answers. Experiments show that the question answering system of the invention achieves high recall ratio and precision ratio.
Owner:HUAZHONG UNIV OF SCI & TECH

Short text clustering method based on deep semantic feature learning

The invention discloses a short text clustering method based on deep semantic feature learning. The method includes the steps that dimensionality reduction representation is performed on original features under the restraint of local information preservation through traditional feature dimensionality reduction, binarization is performed on an obtained low-dimension actual value vector, and error back propagation is performed with the binarized vector being supervisory information of a convolutional neural network structure to train a model; non-supervision training is performed on a term vector through an outer large-scale corpus, vectorization representation is performed on all words in text according to the word order, and the vectorized words serve as implicit semantic features of initial input feature learning text of the convolutional neural network structure; after deep semantic feature representation is obtained, a traditional K-means algorithm is adopted for performing clustering on the text. By means of the method, extra natural language processing and other specialized knowledge are not needed, design is easy, deep semantic features can be learnt, besides, the learnt semantic features have unbiasedness, and good clustering performance can be achieved more effectively.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Text analysis method and text analyzer

The invention discloses a text analysis method and a text analyzer. The method comprises the following steps of: performing splitting processing on an acquired text by utilizing characters as a unit, and performing characteristic tagging on characters obtained by splitting according to preset character characteristics so as to form tagged word strings; performing word segmentation processing on the tagged word strings according to pre-constructed word segmentation models so as to obtain word segmentation results containing word orders; performing merging processing on the word orders contained in the word segmentation results, and performing characteristic tagging on words obtained by merging according to the preset character characteristics so as to obtain tagged word strings; performing part-of-speech tagging on the tagged word strings according to pre-constructed part-of-speech tagging models so as to obtain part-of-speech tagging results; and if confirming that the part-of-speech tagging results contain part-of-speech tags of entity words, merging the entity words containing the part-of-speech tags in the part-of-speech tagging results according to same adjacent rules, so as to obtain a text analysis result. By applying the text analysis method and the text analyzer, the entity word text analysis accuracy rate can be improved.
Owner:新浪技术(中国)有限公司

Word segmentation algorithm-based log parsing method and word segmentation algorithm-based log parsing system

The invention relates to the technical field of log audit and safety management, and aims at providing a word segmentation algorithm-based log parsing method and a word segmentation algorithm-based log parsing system. The word segmentation algorithm-based log parsing method comprises the following steps: performing segmentation on a log, performing word sense analysis on segmentation results, performing word sense filtration on obtained segmentation results with word sense tagging, performing feature extraction on the obtained filtered segmentation results with the word sense tagging, performing feature matching on obtained word sense order feature codes, and performing semantic parsing on obtained semantic parsing rules; the word segmentation algorithm-based log parsing system comprises a segmentation module, a word sense analysis module, a word sense filtration module, a word order feature extraction module, a feature matching module and a semantic parsing module. According to the word segmentation algorithm-based log parsing method and the word segmentation algorithm-based log parsing system disclosed by the invention, the difficulty and complexity of log parsing are greatly reduced, and therefore the efficiency of performing parsing rule development on the log is increased; the word segmentation algorithm-based log parsing method and the word segmentation algorithm-based log parsing system can be better adapted to certain changes of a log format.
Owner:HANGZHOU ANHENG INFORMATION TECH CO LTD

Construction method of hybrid neural network model for dialogue generation

The invention discloses a construction method of hybrid neural network model for dialogue generation. The construction method of hybrid neural network model for dialogue generation includes the steps: acquiring a data set in a mode of dialogue statement pairs, and constructing a glossary; generating a word embedded table; initializing the convolution neural network with special structure, generating a vocabulary recommending table corresponding to the input statement, determining whether real output is provided, and if so, training the parameters of the convolution neural network in the step; initializing the recurrent neural network with special structure, using the last step to output, generating a vocabulary identity list with word order, determining whether real output is provided, and if so, training the parameters of the recurrent neural network in the step; after the training result satisfies the set index, saving the glossary and the word embedded table, and saving the parameters of the convolution neural network and the recurrent neural network, thus completing construction of the whole model. The construction method of hybrid neural network model for dialogue generation solves the problems that a current neural network dialogue model is slow in the training speed, low in the accuracy and general in statement generation because the glossary is too long.
Owner:NANJING UNIV

Integrated retrieval method for multi-language information retrieval

An integrated retrieval method for multi-language information retrieval relates to multi-language information retrieval method, solving the problems of source language information loss caused by the multi-language information retrieval of the existing separation mode, a lot of noise and low accuracy of retrieval result, specifically comprising the following steps: step one, translating the source language inquiring key word input by the user into the key word of target language; step two, dividing the key word of target language into three relation modes according to the word order of each word, the decoration and collocation relation of each word, and word distance of each word that are precision matching mode, common display mode and independent mode; step three, obtaining the condition probability of precision matching mode, condition probability of common display mode and condition probability of independent mode in the inquiring file D; step four, calculating the file generating inquiring probability in the inquiring file D; step five, calculating the similarity of source language inquiring key word and inquiring file character vector; step six, calculating the condition probability of multi-language information retrieval; step seven, returning the retrieval result. The method is suitable for cross language information retrieval.
Owner:哈尔滨工业大学高新技术开发总公司

A drug entity relationship extraction method and system based on an attention mechanism neural network

The invention relates to a drug entity relationship extraction method and system based on an attention mechanism neural network. The method comprises the following steps: (1) analyzing the text content of a pharmaceutical document, using sentences as basic units for sentence segmentation, and performing vectorization representation on each word in the sentences; (2) inputting a vectorized representation result into a recurrent neural network, extracting association characteristics of words in the sentences according to a front-back bidirectional word sequence through the recurrent neural network, and identifying all medicine entities; (3) obtaining inter-word importance weights in the sentences through an attention mechanism neural network, and combining the inter-word importance weights with the output in the step (2); And (4) inputting a result obtained in the step (3) into a convolutional neural network, and predicting a category relation between every two medicated entity words through the convolutional neural network. According to the classification method for increasing the attention mechanism concerned entity class information weight, the influence caused by wrong dependencyanalysis results in long sentences can be reduced, and the accuracy of extracting the pharmacochemical entity relationship is improved.
Owner:PEKING UNIV

Intelligent question and answer method and system based on pet knowledge graph

PendingCN110209787AFill in the lack of intelligent question and answerNatural language data processingSpecial data processing applicationsEntity linkingSequence graph
The invention discloses an intelligent question answering method and system based on a pet knowledge graph, and the method comprises the steps: constructing a named entity dictionary, abstracting questions, and facilitating the classification of the questions. A method of combining word2vec with Levenshtein Distance is provided to realize entity linking, and experiments show that the method is effective. Texts are trained by constructing a text classifier based on Naive Bayes, and the improved TF-IDF naive Bayes classification algorithm is provided, the distribution situation of feature wordsin a text set and the category distribution situation are considered, and the improved TF-IDF effectively improves the text classification effect. Through the result of the text classifier, the intention of the natural language question is determined, and the natural language question is matched with the corresponding word sequence graph. The word order graph is converted into a similar SQL querystatement of the OrientDB, and querying is performed in a graph database storing the knowledge graph. Finally, the constructed intelligent question and answer system based on the knowledge graph is displayed in an example, and experiments show that the system has a relatively high application value in question and answer application in the field of pets.
Owner:袁琦

Language teaching method

InactiveUS7104798B2Teaching apparatusNatural language processingLanguage pedagogy
A method of teaching students a language utilizing a coded medium through verbal and nonverbal communication. The students unconsciously learn the structure of the language through color, sound, shape / texture, a verb puzzle piece, gestures and grammar stories. The method includes the presentation of a new linguistic structure to the students to elicit linguistic responses from the students. The students are encouraged to respond verbally. A student is then directed to display the linguistic structure using the coded medium. The student response is then reviewed and corrected to ensure that all students use the correct gesture referencing time when addressing the puzzle piece. The teacher moves the coded medium corresponding to the correct punctuation to teach word order, syntax, cohesion and other linguistic features. A second student is directed to respond to the first student verbally and by displaying the linguistic structure with the coded medium. The students are directed to write the linguistic structure and draw pictures of the coded medium corresponding to the linguistic structure. These activities are repeated until quick and skillful responses are delivered automatically. Grammar stories are role played and reinforce the language program. The grammar stories, manipulation of the coded medium and the verb puzzle pieces and suffix word used in the method put language in a time and space relationship. Thus students unconsciously learn the structure of the language through the coded medium, gestures and grammar stories.
Owner:SPAVENTA VIRGINIA

An electric power customer service work order sentiment quantitative analysis method based on Word2Vec

ActiveCN109670167ASimplify groomingReduce online consultation timeSemantic analysisCharacter and pattern recognitionPart of speechAlgorithm
The invention discloses an electric power customer service work order sentiment quantitative analysis method based on Word2Vec, and relates to an electric power customer service work order analysis method. A traditional sentiment analysis method cannot effectively discriminate the sentiment intensity. The method of the invention comprises the steps of combining the power customer service work order text features; classifying and sorting the historical electric power customer service work orders and the unsatisfied work orders, cleaning data, combing based on the Baidu word bank to form an initialized multivariate emotion word bank; carrying out the work order text word segmentation by adopting a reverse maximum matching algorithm; based on the Word2Vec neural network, constructing the positive words, negative words, degree adverbs and a word vector of a word order fused with customer appeal semantics; performing the machine learning training through the historical customer service workorder to generate a learning model fusing appeal emotion, expanding a part-of-speech corpus based on the part-of-speech affinity-consanguinity relationship in the model, performing emotion quantization calculation by adopting a similarity word sequence matrix quantization algorithm, and completing customer service work order emotion quantization analysis, thereby effectively distinguishing emotion intensity differences, and determining an emergency degree.
Owner:STATE GRID ZHEJIANG ELECTRIC POWER +2

Statistical method and statistical system of text similarity

The invention discloses a statistical method of text similarity. The statistical method comprises the following steps: obtaining a first text and a second text need to distinguish similarity; respectively dividing the first text and the second text into a plurality of text segments according to a first dividing scale, calculating a proportion of quantity of the same text segments in the first text and the second text to total text segment quantity of the first text under the first dividing scale; deleting the same text segments from the first text and the second text, respectively obtaining a first remaining text and a second remaining text; respectively dividing the first remaining text and the second remaining text into a plurality of text segments according to a second dividing scale, calculating a proportion of quantity of the same text segments in the first remaining text and the second remaining text to total text segment quantity of the first remaining text under the second dividing scale; and calculating the comprehensive text similarity of the first text and the second text. The statistical method of the text similarity can accurately reflect the similarity degree between texts in which the orders of words and sentences are disorganized by men and detect the similar text in which the word order, the sentence order and the section order are disorganized on purpose.
Owner:南方电网互联网服务有限公司

Address resolution method and device

The embodiments of the invention provide an address resolution method and device, and relates to the technical field of information processing. The method comprises the steps of segmenting an address to be matched to generate a first segmentation result, and segmenting each standard library address to generate a second segmentation result; calculating the semantic score of third segmentation in the first segmentation result to generate a first semantic vector, and calculating the semantic score of the third segmentation in the second segmentation result to generate a second semantic vector; generating a first word order vector according to the third segmentation and the first segmentation result, generating a second word order vector according to the third segmentation and the second segmentation result, calculating a semantic similarity according to the first semantic vector and the second semantic vector, and calculating a word order similarity according to the first word order vector and the second word order vector; and selecting a standard library address matched with the address to be matched from the plurality of standard library addresses according to the semantic similarity and the word order similarity. The address resolution method and device are simple, fast and high in working efficiency.
Owner:上海博辕信息技术服务有限公司

Preprocessing module of multi-language intelligent preprocessing real-time statistical machine translation system

The invention discloses a preprocessing module of a multi-language intelligent preprocessing real-time statistical machine translation system. The preprocessing module comprises a text preprocessing module and an automatic speech recognition result preprocessing module, wherein the text preprocessing module is used for carrying out word normalized operation, class recognition labeling and chuck and word order adjustment on languages input in a text manner; and the automatic speech recognition result preprocessing module is used for carrying out word normalized operation and punctuation prediction on the languages. The preprocessing module disclosed by the invention is capable of carrying out basic operations such as word normalized operation, class recognition labeling and chuck and word order adjustment on to-be-translated text languages, so that convenience is brought to the translation carried out on the to-be-translated language texts by a subsequent translation module; or the preprocessing module disclosed by the invention is capable of carrying out work normalized operation on speech languages or carrying out preprocessing such as prediction and the like on the punctuations in speech flows, so that convenience is brought to the translation carried out by a subsequent machine translation module. The preprocessing module has the effect of carrying out labelling and preferential translation on small-probability words, so as to improve the correctness of translating the small-probability words.
Owner:唐亮
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products