627 results about "Participle" patented technology

A participle (PTCP) is a form of a verb that is used in a sentence to modify a noun, noun phrase, verb, or verb phrase, and plays a role similar to an adjective or adverb. It is one of the types of nonfinite verb forms. Its name comes from the Latin participium, a calque of Greek μετοχή (metokhḗ) "partaking" or "sharing"; it is so named because the Ancient Greek and Latin participles "share" some of the categories of the adjective or noun (gender, number, case) and some of those of the verb (tense and voice).

Text classification method of Chinese web page based on stream clustering

Inactive | CN101727500A | Wide coverage | Word segmentation method is simple and easy | Special data processing applications | Feature vector | The Internet
The invention relates to a text classification method for Chinese web pages based on stream clustering, belonging to the technical field of Internet data mining. The method comprises the following steps: acquiring a web page in real time; removing unprocessed format labels of the web page and analyzing the characteristic information of the web page text; segmenting the text content into n-gram participles to form a plurality of word strings; computing the weight value of each word string; extracting the word strings with high weight values and using them, together with their weight values, as characteristic vectors; computing the similarity between the characteristic vectors and characteristic information and each known class; computing the total similarity and either classifying the text into a known class or establishing a new class; judging, according to the number of characteristic items of the known class, whether the known class should be divided into two subclasses; and storing the processed text records and the information of the known class. The method fully mines the effective information of web page texts according to their characteristics and is incremental, fast, effective and practical.
Owner:TSINGHUA UNIV
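
The steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the relative-frequency weight, the cosine similarity, the threshold value, and all function names (`char_ngrams`, `top_features`, `assign`) are assumptions made for illustration, since the abstract does not specify them.

```python
from collections import Counter
import math

def char_ngrams(text, n=2):
    """Slide a window of n characters over the text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]

def top_features(text, n=2, k=5):
    """Keep the k most frequent n-grams, weighted by relative frequency
    (an assumed weight; the abstract does not give a formula)."""
    grams = char_ngrams(text, n)
    counts = Counter(grams)
    total = len(grams) or 1
    return {g: c / total for g, c in counts.most_common(k)}

def cosine(a, b):
    """Cosine similarity between two sparse feature vectors (dicts)."""
    dot = sum(w * b.get(g, 0.0) for g, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def assign(features, classes, threshold=0.3):
    """Assign to the most similar known class, or establish a new class
    when no known class reaches the (hypothetical) threshold."""
    best, best_sim = None, 0.0
    for name, centroid in classes.items():
        sim = cosine(features, centroid)
        if sim > best_sim:
            best, best_sim = name, sim
    if best is None or best_sim < threshold:
        best = f"class_{len(classes)}"
        classes[best] = dict(features)
    return best
```

Calling `assign` repeatedly on incoming pages grows the `classes` dictionary incrementally, which mirrors the "known class or new class" decision described above; the subclass-splitting step is omitted.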

SVM based micro-blog emotion classification method fusing various kinds of emotion resources

The invention discloses an SVM-based micro-blog emotion classification method fusing various kinds of emotion resources. The method includes the following steps: constructing the relevant dictionaries, including an emotion dictionary, a negation dictionary, and a degree adverb dictionary; preprocessing the different corpora, performing word segmentation and part-of-speech tagging on them, and performing sentence structure analysis; comparing the segmented words against the positive and negative dictionaries to acquire the initial word polarity, comparing the words preceding emotion words against the degree adverb dictionary and the negation dictionary to acquire the modifier weight, and multiplying the initial word polarity by the modifier weight to acquire the emotion score of each micro-blog; extracting features such as nouns, verbs, adjectives, positive and negative emotion words, degree adverb weights, emotion scores, negation words and specific symbols from part-of-speech features, emotion features, sentence pattern features, and semantic features; and inputting the extracted features into LibSVM for model training so as to acquire a trained model. The method achieves 5-grade emotion classification of micro-blogs and can accurately and comprehensively capture the emotional tendency of netizens.
Owner:NANJING UNIV OF SCI & TECH
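
The scoring step described in this abstract (initial polarity multiplied by a modifier weight taken from the preceding word) can be sketched as follows. The dictionary entries, the weight values, and the function name `emotion_score` are hypothetical; the feature extraction and LibSVM training steps are not shown.

```python
# Hypothetical toy dictionaries; a real system would load much larger ones.
POLARITY = {"good": 1.0, "happy": 1.0, "bad": -1.0, "sad": -1.0}
NEGATION = {"not", "never"}
DEGREE = {"very": 2.0, "slightly": 0.5}

def emotion_score(tokens):
    """Sum polarity * modifier weight over all emotion words.

    The modifier weight is looked up from the word directly before the
    emotion word: -1 for a negation word, the degree value for a degree
    adverb, and 1 otherwise (an assumed scheme).
    """
    score = 0.0
    for i, tok in enumerate(tokens):
        polarity = POLARITY.get(tok)
        if polarity is None:
            continue  # not an emotion word
        weight = 1.0
        if i > 0:
            prev = tokens[i - 1]
            if prev in NEGATION:
                weight = -1.0
            elif prev in DEGREE:
                weight = DEGREE[prev]
        score += polarity * weight
    return score
```

In this sketch the score would be one feature among several; the abstract feeds it to the classifier together with part-of-speech and sentence-pattern features.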

Data classification method and device

The invention relates to the field of data processing and discloses a commodity classification method and device, which are used for increasing the execution efficiency of a commodity classification flow. The method comprises the following steps: acquiring relevant data of the commodities to be classified and extracting the commodity titles from the data; segmenting each commodity title into participles and determining the weight of each participle, wherein the weight of a participle represents its historical occurrence rate; for each commodity, selecting the participles whose weight values satisfy a preset condition to constitute a participle sequence; and comparing the participle sequences selected for the commodities and combining the relevant data of commodities having the same participle sequence. With the method and device, the quantity of commodity data that needs to be processed is greatly reduced, commodity classification can be realized quickly and accurately, the execution efficiency of the commodity classification flow is increased effectively, the management complexity of the commodity data is lowered, and the operating load of the system is lowered.
Owner:ALIBABA GRP HLDG LTD
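
A minimal sketch of the grouping idea in this abstract follows. It assumes that the "historical occurrence rate" is the share of titles containing a participle, that the "preset condition" is a minimum rate, and that titles can be split on whitespace (Chinese titles would need a real word segmenter). The function name and parameter are hypothetical.

```python
from collections import Counter, defaultdict

def group_by_participle_sequence(titles, min_rate=0.5):
    """Group commodities whose titles reduce to the same participle sequence.

    titles: dict mapping a commodity id to its title string.
    min_rate: assumed form of the preset condition (minimum occurrence rate).
    """
    tokenized = {item: title.lower().split() for item, title in titles.items()}
    # Weight of a participle = fraction of titles in which it occurs.
    doc_freq = Counter(tok for toks in tokenized.values() for tok in set(toks))
    n = len(tokenized)
    groups = defaultdict(list)
    for item, toks in tokenized.items():
        seq = tuple(t for t in toks if doc_freq[t] / n >= min_rate)
        groups[seq].append(item)
    return dict(groups)
```

Commodities that land in the same group can then be merged and processed once, which is where the reduction in data volume would come from.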

Semantization service generation system and method based on graph mining technique

Active | CN103631882A | Shield dependencies | Close to and meet the needs | Web data indexing | Special data processing applications | System integration | Web service
The invention provides a semantization service generation system and method based on a graph mining technique. The system is built on a traditional server and combines multiple techniques, including natural language word segmentation, graph mining, clustering, semantization analysis, service procedure generation and service execution. After a user's application requirements, given as natural language or a text description, are collected and analyzed, key words are extracted, the service requirements are analyzed, usable services and their combination modes are mined from a pre-built Web service tree graph, and finally, in an integrated system operating environment, the services are executed automatically and the execution results are fed back. The system and method operate directly on the user's natural-language or text requirements, emphasize semantization features, integrate the service execution environment, and return service results directly; they match users' habits and requirements, help widen the user base, support automatic operation and maintenance of the system, and are suitable for distributed execution environments.
Owner:BEIJING UNIV OF POSTS & TELECOMM

Named entity identification method, device, medium and equipment

Embodiments of the present application disclose a named entity recognition method, device, equipment, and medium. The method includes: obtaining a text to be recognized; performing word segmentation on the text to obtain a participle sequence; inputting the participle sequence into a named entity recognition model and obtaining, from the model output, the named entity attribute identifier corresponding to each participle; and determining the named entities in the text according to the attribute identifier corresponding to each participle. The named entity recognition model used in this method is based on a feedforward neural network with a simple structure and few parameters, which makes the model easy to maintain and update. In addition, the model determines the attribute identifier of each participle from multi-dimensional participle features that fully and comprehensively express the participle's semantic information, which ensures the accuracy of named entity recognition. The present application also provides a method and apparatus for training the named entity recognition model.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Method and device for translating natural languages into commands and navigation application of method and device

The invention discloses a method and device for translating natural language into commands and a navigation application of the method and device, and belongs to the technical field of speech recognition. The method includes the steps of entering statements and marking the instruction classification of each statement; conducting word segmentation on the statements; calculating first probabilities of all segmentation words, and storing the segmentation words, the first probabilities and the order of the segmentation words in the statements into a first data sheet; calculating second probabilities of all segmentation words, and storing the segmentation words, the second probabilities and the instruction classifications into a second data sheet; calculating the first matching degrees between similar studying statements and a conjecturing statement, and judging that the studying statement with the highest first matching degree is the most similar to the conjecturing statement; calculating the second matching degrees between all similar command classifications and the conjecturing statement, and judging that the command classification with the highest second matching degree is the command classification of the conjecturing statement. By means of the method, the device and the navigation application, natural language can be translated into machine-readable commands more accurately and rapidly, with good expandability.
Owner:SHANGHAI XIUYUAN NETWORK TECH

An intelligent operation and maintenance statement similarity matching method based on natural language processing

The invention discloses an intelligent operation and maintenance statement similarity matching method based on natural language processing technology. The method mainly comprises two parts: data processing during knowledge base construction, and sentence similarity matching based on deep learning. Compared with the prior art, the method has the following advantages: (1) the operation and maintenance management knowledge is segmented into words using a domain-specific word library and an HMM-based new-word discovery model, so that the accuracy of text word segmentation is improved and a more complete text word library is established; (2) word vectors are trained through a deep learning method, so that the "curse of dimensionality" associated with word vector representations can be avoided, the information in vocabulary contexts can be fully mined, and the relations between words can be obtained; and (3) because the sentence vectors are built with weights, the importance of each word can be measured, the sentence vectors carry richer information through the combination of word vectors, and the accuracy of matching on these sentence vectors is guaranteed through a cosine similarity matching algorithm.
Owner:华融融通(北京)科技有限公司
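
Point (3) of this abstract, weighted sentence vectors compared by cosine similarity, can be sketched as below. The weighted-average construction, the default weight of 1.0, and the names `sentence_vector` and `cosine_similarity` are assumptions for illustration; the abstract does not state how the weights are obtained or combined.

```python
import math

def sentence_vector(tokens, word_vectors, weights):
    """Weighted average of the word vectors of the tokens in a sentence.

    word_vectors: non-empty dict mapping a word to a list of floats.
    weights: dict mapping a word to its importance (1.0 if missing).
    """
    dim = len(next(iter(word_vectors.values())))
    total = [0.0] * dim
    weight_sum = 0.0
    for tok in tokens:
        if tok in word_vectors:  # skip out-of-vocabulary words
            w = weights.get(tok, 1.0)
            weight_sum += w
            for i, x in enumerate(word_vectors[tok]):
                total[i] += w * x
    return [x / weight_sum for x in total] if weight_sum else total

def cosine_similarity(a, b):
    """Cosine of the angle between two dense vectors; 0.0 for a zero vector."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
```

A query sentence would be matched by computing its vector and returning the knowledge base entry with the highest cosine similarity.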

Document similarity calculation method and near-duplicate document detection method and device

The invention relates to a document similarity calculation method and a near-duplicate document detection method and device. The calculation method comprises the following steps: performing word segmentation on the two documents to be detected to obtain a participle set for each document; calculating the edit similarity of all participle pairs across the two participle sets, wherein the two participles in each pair come from the two sets respectively; establishing edges between the participle pairs whose edit similarity meets a given requirement to obtain a weighted bi-graph, wherein the edit similarity of a participle pair is the weight of the corresponding edge; calculating the maximum weighted matching value of the weighted bi-graph; and calculating the similarity between the documents to be detected from the maximum weighted matching value. With the document similarity calculation method and the near-duplicate document detection method and device provided by the invention, high accuracy is achieved, near-duplicate texts whose participle sets contain editing errors can be identified effectively, the accuracy of near-duplicate document detection is increased, the computational complexity is lowered, and the computational efficiency is improved.
Owner:HUAWEI TECH CO LTD +1
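
The matching step in this abstract can be sketched in Python as follows. Several choices are assumptions rather than the patented method: `difflib.SequenceMatcher.ratio` stands in for the edit similarity, the maximum weighted matching is found by brute force over permutations (only practical for very small participle sets; a real system would use an algorithm such as the Hungarian method), and the final normalization by the larger set size is hypothetical.

```python
from difflib import SequenceMatcher
from itertools import permutations

def edit_similarity(a, b):
    """Stand-in edit similarity in [0, 1] based on matching blocks."""
    return SequenceMatcher(None, a, b).ratio()

def max_weight_matching(left, right, min_sim=0.6):
    """Maximum total weight of a matching in the bipartite graph whose
    edges are the pairs with similarity >= min_sim (brute force)."""
    if len(left) > len(right):
        left, right = right, left
    best = 0.0
    for perm in permutations(right, len(left)):
        total = 0.0
        for a, b in zip(left, perm):
            s = edit_similarity(a, b)
            if s >= min_sim:  # pairs below the threshold have no edge
                total += s
        best = max(best, total)
    return best

def document_similarity(tokens_a, tokens_b, min_sim=0.6):
    """Matching value normalized by the larger participle set (assumed)."""
    a, b = sorted(set(tokens_a)), sorted(set(tokens_b))
    if not a or not b:
        return 0.0
    return max_weight_matching(a, b, min_sim) / max(len(a), len(b))
```

Because edges carry edit similarity rather than exact equality, two documents that differ only by small spelling variations in some participles still receive a high score.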

Related resource address push method and device based on video retrieval

The invention discloses a related resource address push method and device based on video retrieval. The method comprises the steps of: obtaining the characteristic text information of first video resource data when a request to load or play the first video resource data is received; mapping the characteristic text information to one or more first participles; searching for related second participles whose co-occurrence rate with the one or more first participles is higher than a preset threshold value, wherein the co-occurrence rate is the probability that the one or more first participles and the second participles appear together in the same video resource data; obtaining the network link addresses of second video resource data matched with the one or more first participles and the related second participles; and pushing the network link addresses of the second video resource data. The method and device make it possible to mine high-quality resources deep in a video database and improve the efficiency of mining those resources. In addition, the index table can be enlarged continuously as video content accumulates on the Internet, which helps improve the recall rate.
Owner:BEIJING QIHOO TECH CO LTD
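
The co-occurrence lookup in this abstract can be sketched as below. Here the co-occurrence rate of a second participle with a first participle is taken to be the fraction of videos containing the first participle that also contain the second; this conditional-probability reading, the threshold value, and the function names are assumptions.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_rates(video_tokens):
    """Map (first, second) to the share of videos containing `first`
    that also contain `second`.

    video_tokens: list of participle lists, one list per video.
    """
    single = Counter()
    pair = Counter()
    for toks in video_tokens:
        uniq = sorted(set(toks))
        single.update(uniq)
        pair.update(combinations(uniq, 2))
    rates = {}
    for (a, b), c in pair.items():
        rates[(a, b)] = c / single[a]
        rates[(b, a)] = c / single[b]
    return rates

def related_participles(first, rates, threshold=0.5):
    """Second participles whose co-occurrence rate with `first` exceeds
    the preset threshold."""
    return sorted(b for (a, b), r in rates.items() if a == first and r > threshold)
```

The first participle together with its related second participles would then be used as the query against the video index to find addresses to push.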

Voice recognition text error correction method in specific field

The invention relates to a voice recognition text error correction method for a specific field, wherein the method comprises the following steps: first, compiling statistics over correct in-domain corpora to obtain character-level and word-level language models and a pinyin language model; then, receiving a text sequence to be corrected and splitting it into clauses if it contains more than one sentence; determining suspected wrong words using the character, word and pinyin language models; determining a candidate word list for the suspected wrong words according to the language model vocabulary and a dictionary of easily confused pronunciations; and finally, substituting the candidate words into the original text sequence, and selecting and outputting the most reasonable sentence based on a combination of macroscopic and microscopic scores. Basic units of different granularities and dimensions, such as characters, words, pinyin, and initials and finals, are selected to construct the language models, which reduces the interference of word segmentation errors caused by wrong characters; isolated character errors are handled with the word language model, and consecutive recognition errors caused by pronunciation deviation are identified with the pinyin language model; and the candidate sentences obtained after replacing the wrong words are comprehensively evaluated by the macroscopic and microscopic scores, which measure the fluency of the replaced sentences.
Owner:网经科技(苏州)有限公司

Model complementary Chinese rhythm interruption recognition system and method

The invention discloses a model complementary Chinese rhythm interruption recognition system and method. The method includes a first step of inputting Chinese phonetic symbols, Chinese texts and the segmentation boundary of every Chinese character in the Chinese phonetic symbols through a first input module; a second step of performing word segmentation and part-of-speech tagging on the input Chinese texts through a participle and part-of-speech tagging module, and obtaining the lexical feature and the grammatical feature of every Chinese character in the Chinese texts through a first lexical and grammatical feature calculation module; a third step in which a first acoustic feature calculation module uses a fundamental frequency extraction and sound intensity calculation module to perform fundamental frequency extraction and sound intensity calculation on the input Chinese phonetic symbols, obtaining the acoustic feature of every Chinese character in the Chinese texts; and a fourth step of loading trained combined complementary models, identifying and judging the rhythm interruption type of every Chinese character from the acoustic, lexical and grammatical features of the input Chinese characters, and outputting the Chinese texts tagged with the rhythm interruption types.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI