Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

38 results about "Dependency grammar" patented technology

Dependency grammar (DG) is a class of modern grammatical theories that are all based on the dependency relation (as opposed to the relation of phrase structure) and that can be traced back primarily to the work of Lucien Tesnière. Dependency is the notion that linguistic units, e.g. words, are connected to each other by directed links. The (finite) verb is taken to be the structural center of clause structure. All other syntactic units (words) are either directly or indirectly connected to the verb in terms of the directed links, which are called dependencies. DGs are distinct from phrase structure grammars, since DGs lack phrasal nodes, although they acknowledge phrases. Structure is determined by the relation between a word (a head) and its dependents. Dependency structures are flatter than phrase structures in part because they lack a finite verb phrase constituent, and they are thus well suited for the analysis of languages with free word order, such as Czech, Slovak, and Warlpiri.

Dependency semantic-based Chinese unsupervised open entity relationship extraction method

The invention relates to a dependency semantic-based Chinese unsupervised open entity relationship extraction method. The method comprises the following steps of preprocessing an input text: performing Chinese word segmentation, part-of-speech tagging and dependency grammar analysis on the input text; performing named entity identification on the input text; arbitrarily selecting two entities from identified entities to form candidate entity pairs; searching for a dependency path between two entities in the candidate entity pairs; and analyzing whether a syntactic structure mapped by the dependency path is matched with a normal form of a dependency semantic normal form set or not, if yes, extracting words or phrases from the residual part of the input text according to the matched normal form to serve as relational words, forming a relational triple by the extracted relational words and the candidate entity pairs, and if not, performing normal form matching of a next group of the candidate entity pairs; and outputting the relational triple. Compared with the prior art, the method has the advantages that the calculation complexity is low; the extraction efficiency is high; distance position limitation is overcome; a simple sentence also can be extracted and the like.
Owner:TONGJI UNIV

Multi-granularity semantic chunk based entity attribute and attribute value extracting method

The invention relates to a multi-granularity semantic chunk based entity attribute and attribute value extracting method, and belongs to the technical field of Web mining and information extraction. The method comprises the following steps that a corpus set is constructed and free text extraction is performed; a corpus is subjected to word segmentation, part-of-speech tagging and phrase recognition; the corpus is subjected to semantic role labeling; the corpus is subjected to dependency grammar analysis; the corpus is subjected to semantic dependency analysis; candidate entities, attributes and attribute value triads based on three granularities of words, phrases and semantic roles are extracted; the candidate entities, attributes and attribute value triads are corrected and subjected to error classification by means of a trained classifier. Compared with the prior art, the entities, attributes and attribute value triads based on three granularities of words, phrases and semantic roles are automatically extracted from a free text, the entity attribute and attribute value extraction accuracy and efficiency are improved, and the wide application prospect is achieved in the fields of theme detection, information retrieval, automatic abstracting, question and answer systems and the like.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Entity relationship recognition method and apparatus

The present invention relates to an entity relationship recognition method and apparatus. The method comprises obtaining a statement sequence from a target text in a corpus, and performing named entity recognition and dependency grammar marker on the statement sequence to obtain a marked text sentence; matching and retrieving the marked text sentence on basis of an entity relationship seed to obtain a training example; replacing the entity relationship seed word in the training example with predetermined identification, processing the training example after replacement combined with the named entity recognition and the dependency grammar marker, and generating a candidate rule; fuzzifying the candidate rule to obtain fuzzy rules; determining whether the fuzzy rules comprise a new rule; and retrieving the corpus according to the fuzzy rules to obtain a seed set when the fuzzy rules comprise the new rule, and using the obtained seed set as an entity relationship recognition result. Manual participation can be effectively reduced, dependence on the calibrated corpus is reduced, a new entity relationship can be found timely, and the entity relationship recognition method and apparatus are self-adaptive to entity relationship mining in different fields.
Owner:LETV HLDG BEIJING CO LTD +1

Medical data collection and analysis method and system

The invention relates to a medical data collection and analysis method, comprising the following steps of 1, uploading original data to a data platform; 2, converting the original data to data in an RDF (Resource Description Framework) format by using a semantic annotation algorithm based on a conditional random field in combination with a dependency grammar; 3, associating RDF data of a same patient in the data processed in step 2 through a data mining algorithm, and storing the RDF data into an Hbase database based on a distributed file system; 4, analyzing data in the Hbase database by using a statistical method and a machine learning method to obtain analysis conclusions; 5, organizing and classifying the analysis conclusions to construct a therapeutic scheme knowledge base. According to the medical data collection and analysis method and a system, the whole clinical diagnosis and treatment data of the patient are collected pertinently, a large number of data are analyzed to clinically diagnose in assistance, forecast disease and analyze the patient; the medical data collection and analysis method and the system can clinically help doctors to make an effective, exact and individualized therapeutic scheme.
Owner:XIEHE HOSPITAL ATTACHED TO TONGJI MEDICAL COLLEGE HUAZHONG SCI & TECH UNIV

User comment attribute extraction method based on bi-directional dependency syntactic tree representation

The invention discloses a user comment attribute extraction method based on bi-directional dependency syntactic tree representation. The method comprises the steps that 1) given user comment text is preprocessed, and a dependency syntactic tree is generated; 2) a bi-directional dependency syntactic tree representation network is established, and dependency characteristics among words are extracted; 3) the dependency characteristics are input into a bi-directional LSTM nerve network, sequence characteristics among the words are extracted on the basis of the dependency characteristics, and accordingly the dependency characteristics are effectively combined with the sequence characteristics; 4) the combined characteristics are coded by using a linear chain condition random field; 5) a Viterbialgorithm is used for conducting decoding to obtain comment attributes of all text. According to the user comment attribute extraction method, the aim is effectively achieved that in user comment attribute extraction tasks, the dependency characteristics of text syntax are extracted and efficiently combined with the sequence characteristics to achieve end-to-end training; the condition random field is used for coding the combined characteristics, the Viterbi algorithm is used for decoding the combined characteristics, and the good effect can be achieved in the user comment attribute extraction tasks.
Owner:SOUTHWEST JIAOTONG UNIV

Address knowledge processing method and device based on graphs

The invention relates to an address knowledge processing method and an address knowledge processing device based on graph. The method comprises the following steps: (10) syncopating an address text into an address word sequence; (20) performing part-of-speech tagging on each address word in the address word sequence according to a predefined part-of-speech tagging set which reflects features of address words; (30) performing dependency grammar analysis on the tagged address word sequence according to a predefined address word dependency rule, and using physical address words as nodes, so as to obtain a side which reflects the dependency among the physical address words; (40) comparing with the original content of an address knowledge base, and inputting the newly added nodes or side into the address knowledge base. The invention further provides the address knowledge processing device based on the graphs. According to the address knowledge processing method and the address knowledge processing device based on the graphs, address information can be organized according to the inherent logic among addresses, so as to form the address knowledge base; the address query precision can be increased by utilizing the address knowledge base; a reasoning function based on address knowledge can be supported.
Owner:SHENZHEN AUDAQUE DATA TECH

Construction method and system of incremental-translation-oriented structured language model

InactiveCN102945231ATest set perplexity dropsPerplexity downSpecial data processing applicationsDiscriminantAlgorithm
The invention discloses a construction method and a construction system of an incremental-translation-oriented structured language model. The method comprises the following steps: step 1, performing dependency grammar analysis on incrementally generated translation segments to obtain dependency tree segment assembly; step 2, extracting a discriminant feature instance on the dependency tree segment assembly, and calculating a feature score of the discriminant feature instance by a discriminant dependency grammar model; step 3, performing pruning on the dependency tree segment assembly according to the feature score, taking a maximal value of the feature score as the score of the structured language model, reserving the segment having the highest score in the structured language model, and acquiring the optimized dependency tree segment assembly; and step 4, splicing the next translation segment onto the dependency tree segment assembly through a shift-specification operation, repeating the step 1, the step 2 and the step 3 until finishing the translation, and generating the complete dependency tree. According to the construction method and the construction system of the incremental-translation-oriented structured language model, the grammar information and the long-distance dependency information can be merged into the language model, the effective optimization algorithm is proposed for dynamic calculation of the structured language model in a decoding process, and the translation quality is improved.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Word sense disambiguation method fusing sentence local context with document domain information

The invention relates to a word sense disambiguation method fusing a sentence local context with document domain information, and belongs to the technical field of natural language processing. The word sense disambiguation method comprises the steps of: 1, carrying out dependency grammar analysis on a sentence where an ambiguous word is positioned, and obtaining sentence local context related words with a direct dependency relationship with the ambiguous word; 2, carrying out dependency grammar analysis on a domain document set, collecting all dependency tuples which the domain document set contains, and constructing a dependency tuple library; 3, carrying out statistic analysis on the dependency tuple library, and finding a group of domain related words with the closest relationship with the ambiguous word; 4, according to a dependency distribution similarity of the domain related words and word sense relevance between the domain related words and the local context, determining disambiguation weights of the domain related words; 5, merging the sentence local context related words with the domain related words, and constructing a related word set; and 6, according to weighted accumulation relevance of each word sense of the ambiguous word and the related word set, judging a correct word sense. According to the method disclosed by the invention, adaptability of a word sense disambiguation system on a specific domain can be improved, and disambiguation accuracy can be improved.
Owner:山东经伟晟睿数据技术有限公司

Emotion analysis method and device and electronic equipment

The invention discloses an emotion analysis method and device and an electronic device. The method comprises the steps of determining a to-be-analyzed sentence in a to-be-analyzed text; performing subject matching on each to-be-analyzed sentence based on a preset subject information base, wherein the preset subject information base comprises multiple pieces of subject information; when the targetsubject is matched in the to-be-analyzed sentence, determining the weighting coefficient of each word in the to-be-analyzed sentence to the target subject by utilizing a subject emotion self-attentionmechanism, and forming the subject emotion self-attention mechanism by combining dependency grammar modeling; determining emotion words in the to-be-analyzed sentence and polarity of the emotion words; utilizing the emotion words, the polarity of the emotion words and the weighting coefficient to determine the emotion value of the to-be-analyzed sentence to the target subject; and combining the emotion values of all the to-be-analyzed sentences matched with the target subject in the to-be-analyzed text, and determining the emotion value of the to-be-analyzed text for the target subject. According to the invention, the emotional tendency of the target subject in the text can be accurately determined.
Owner:北京百分点科技集团股份有限公司

Bayesian word sense disambiguation method based on mass pseudo-data

The invention particularly relates to a new bayesian word sense disambiguation method based on mass pseudo-data. The problems that a current word sense disambiguation method is poor in disambiguation effect and capable of wasting time and labor when disambiguation knowledge is obtained are solved. The new bayesian word sense disambiguation method includes the steps that through a dependency grammar analyzer, training examples containing ambiguous words in a training corpus base are subjected to syntactic analysis, and tuples with the dependence relationship with the ambiguous words are collected; then through a machine translation system, example sentences containing the tuples in a machine translation corpus base are searched. The steps are repeatedly carried out in a mode, the searched example sentences are added into a pseudo-training corpus base, and then through the training corpus base and the pseudo-training corpus base, a bayesian disambiguation model is trained; word meanings of the ambiguous words are decided through the disambiguation model, and on the basis of a small amount of manually-annotated corpuses, the data sparsity problem of word sense disambiguation can be effectively solved, the accuracy of word sense disambiguation is increased, and the new bayesian word sense disambiguation method has broad development prospects.
Owner:SHANXI UNIV

Method for mapping Chinese problems on basis of LDA (latent Dirichlet allocation)

The invention discloses a method for mapping Chinese problems on the basis of LDA (latent Dirichlet allocation). The method includes classifying document libraries by the aid of LDA theme models; classifying word characteristics for the problems by the aid of Softmax regression models; assigning high weights to notional words according to difference of categories of the word characteristics, assigning low weights to functional words according to the difference of the categories of the word characteristics and allowing the weights of different word characteristics of the notional words to be different from one another; finding dependency relations of terms in sentences by means of syntactic analysis on the basis of dependency grammar; assigning different weights according to the difference of components of the terms in the sentences; multiplying two portions to obtain a weight of each word in each problem; establishing relationships by the aid of weighted distribution of terms in the problems and distribution of themes and lexical terms in documents according to Bayesian rules. The method has the advantages that the documents are classified on the basis of the LDA theme models, the different weights are distributed on the reference of the word characteristics of the lexical terms in the interrogative sentences and the components in the sentences, accordingly, effects of important lexical terms can be improved during classification, and the Chinese problem mapping accuracy can be improved.
Owner:识因智能科技(北京)有限公司

A Semantic Role Labeling and Semantic Extraction Method for Unrestricted Path Natural Language

The invention relates to a semantic role tagging and semantic extracting method of an unrestricted path natural language. The method comprises the steps that firstly Chinese path natural language linguistic data under an unrestricted condition is collected and a Chinese path natural language linguistic database is built; secondly, automatic tagging of the path natural language linguistic data is achieved by using the semantic role tagging method based on chunk analysis and dependency grammar analysis; finally, according to a semantic role tagging result, path unit division is sequentially conducted, and navigational semantic information of path units is extracted. Path natural language semantic role tagging is conducted by using the semantic role tagging method based on chunk analysis and dependency grammar analysis, according to the extracted semantic role tagging result, path unit division is conducted, and finally the semantic information of the path units is extracted. By means of the method, path unit division can be sequentially and accurately conducted, the path semantic information can be accurately extracted, and accordingly the method can provide guidance for smooth implementation of asking for directions and navigation with a robot.
Owner:NORTH CHINA ELECTRIC POWER UNIV (BAODING)

A User Comment Attribute Extraction Method Based on Bidirectional Dependency Syntax Tree Representation

The invention discloses a user comment attribute extraction method based on bi-directional dependency syntactic tree representation. The method comprises the steps that 1) given user comment text is preprocessed, and a dependency syntactic tree is generated; 2) a bi-directional dependency syntactic tree representation network is established, and dependency characteristics among words are extracted; 3) the dependency characteristics are input into a bi-directional LSTM nerve network, sequence characteristics among the words are extracted on the basis of the dependency characteristics, and accordingly the dependency characteristics are effectively combined with the sequence characteristics; 4) the combined characteristics are coded by using a linear chain condition random field; 5) a Viterbialgorithm is used for conducting decoding to obtain comment attributes of all text. According to the user comment attribute extraction method, the aim is effectively achieved that in user comment attribute extraction tasks, the dependency characteristics of text syntax are extracted and efficiently combined with the sequence characteristics to achieve end-to-end training; the condition random field is used for coding the combined characteristics, the Viterbi algorithm is used for decoding the combined characteristics, and the good effect can be achieved in the user comment attribute extraction tasks.
Owner:SOUTHWEST JIAOTONG UNIV

Construction method and system of incremental-translation-oriented structured language model

InactiveCN102945231BTest set perplexity dropsPerplexity downSpecial data processing applicationsDiscriminantAlgorithm
The invention discloses a construction method and a construction system of an incremental-translation-oriented structured language model. The method comprises the following steps: step 1, performing dependency grammar analysis on incrementally generated translation segments to obtain dependency tree segment assembly; step 2, extracting a discriminant feature instance on the dependency tree segment assembly, and calculating a feature score of the discriminant feature instance by a discriminant dependency grammar model; step 3, performing pruning on the dependency tree segment assembly according to the feature score, taking a maximal value of the feature score as the score of the structured language model, reserving the segment having the highest score in the structured language model, and acquiring the optimized dependency tree segment assembly; and step 4, splicing the next translation segment onto the dependency tree segment assembly through a shift-specification operation, repeating the step 1, the step 2 and the step 3 until finishing the translation, and generating the complete dependency tree. According to the construction method and the construction system of the incremental-translation-oriented structured language model, the grammar information and the long-distance dependency information can be merged into the language model, the effective optimization algorithm is proposed for dynamic calculation of the structured language model in a decoding process, and the translation quality is improved.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Extraction Method of Entity Attributes and Attribute Values ​​Based on Multi-granularity Semantic Blocks

The invention relates to a multi-granularity semantic chunk based entity attribute and attribute value extracting method, and belongs to the technical field of Web mining and information extraction. The method comprises the following steps that a corpus set is constructed and free text extraction is performed; a corpus is subjected to word segmentation, part-of-speech tagging and phrase recognition; the corpus is subjected to semantic role labeling; the corpus is subjected to dependency grammar analysis; the corpus is subjected to semantic dependency analysis; candidate entities, attributes and attribute value triads based on three granularities of words, phrases and semantic roles are extracted; the candidate entities, attributes and attribute value triads are corrected and subjected to error classification by means of a trained classifier. Compared with the prior art, the entities, attributes and attribute value triads based on three granularities of words, phrases and semantic roles are automatically extracted from a free text, the entity attribute and attribute value extraction accuracy and efficiency are improved, and the wide application prospect is achieved in the fields of theme detection, information retrieval, automatic abstracting, question and answer systems and the like.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

A Word Sense Disambiguation Method Fused with Sentence Local Context and Document Domain Information

The invention relates to a word sense disambiguation method fusing a sentence local context with document domain information, and belongs to the technical field of natural language processing. The word sense disambiguation method comprises the steps of: 1, carrying out dependency grammar analysis on a sentence where an ambiguous word is positioned, and obtaining sentence local context related words with a direct dependency relationship with the ambiguous word; 2, carrying out dependency grammar analysis on a domain document set, collecting all dependency tuples which the domain document set contains, and constructing a dependency tuple library; 3, carrying out statistic analysis on the dependency tuple library, and finding a group of domain related words with the closest relationship with the ambiguous word; 4, according to a dependency distribution similarity of the domain related words and word sense relevance between the domain related words and the local context, determining disambiguation weights of the domain related words; 5, merging the sentence local context related words with the domain related words, and constructing a related word set; and 6, according to weighted accumulation relevance of each word sense of the ambiguous word and the related word set, judging a correct word sense. According to the method disclosed by the invention, adaptability of a word sense disambiguation system on a specific domain can be improved, and disambiguation accuracy can be improved.
Owner:山东经伟晟睿数据技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products