Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

64 results about "Lexical set" patented technology

A lexical set is a group of words that all fall under a single category based on some shared phonological feature.

Comment analysis method based on word vectors and syntactic features and visual interactive interface

The invention provides a comment analysis method based on word vectors and syntactic characteristics in the field of data analysis. The comment analysis method comprises the steps of obtaining commodity page comment data of an e-commerce website; preprocessing the acquired target data set; extracting a appendix lexical set provided by Hownet and NTU to form a basic emotion dictionary; carrying outword vector training on the obtained preprocessed data set through a Word2Vec tool; establishing a probability transfer matrix by using the semantic similarity matrix; carrying out core sentence rule-based processing on the obtained commodity comment text; carrying out preprocessing on the obtained text without the redundancy; performing part-of-speech extraction (commodity attributes, negative words, degree words and sentiment words) evaluation matching on the obtained dependency relationship pairs; combining the evaluation matching pair with an emotion dictionary, subjecting evaluation objects to appraisal value calculation and quality sorting, and finally, realizing the evaluation objects through a visual interaction interface, so that accurate, real-time, automatic and convenient processing and analysis on commodity comment data are realized, and the method can be used in an e-commerce platform.
Owner:NANJING UNIV OF POSTS & TELECOMM

Limited domain-oriented knowledge graph updating method and system

The invention provides a limited domain-oriented knowledge graph updating method and system, and the method comprises the steps: inputting limited domain question and answer corpora, extracting candidate entities of sentences in the corpora through word segmentation, screening common functional words in a word segmentation result through a word frequency dictionary, and obtaining a candidate entity set; constructing an inverted index dictionary according to the limited domain knowledge graph to obtain a similar vocabulary set of each candidate entity; training the candidate entities and the corresponding similar vocabulary sets into word vectors, and calculating cosine similarity so as to judge the types of the candidate entities; obtaining the relationship between every two candidate entities in the candidate entity set by using the trained Bert text classification model; and updating the candidate entity type and the relationship between the candidate entities into the knowledge graph according to the judgment. The knowledge graph updating method provided by the invention is higher in efficiency, can recognize the newly appearing entity type according to the existing entities inthe graph, and effectively improves the knowledge graph updating speed and accuracy.
Owner:HUAZHONG NORMAL UNIV

Text keyword recognition method and device, computer equipment and readable storage medium

The invention relates to the technical field of intelligent decision making of artificial intelligence, and discloses a text keyword recognition method. The method comprises the steps of obtaining text information, and performing word segmentation on the text information to obtain a vocabulary set; calculating the word frequency of each vocabulary in the vocabulary set, splitting the vocabulary set to obtain a sub-vocabulary set and an association relationship among the vocabularies in the sub-vocabulary set, and obtaining a total vocabulary table with characteristic values according to the word frequency of each vocabulary in the sub-vocabulary set and the association relationship among the vocabularies; and arranging the vocabularies in the total vocabulary table according to the characteristic values, and setting the vocabularies of which the characteristic values exceed a preset characteristic threshold value as keywords. The invention also relates to a blockchain technology, and information can be stored in the blockchain node. The key degree of the vocabulary is evaluated from two dimensions of the word frequency of each vocabulary in the vocabulary set and the degree of dependence of any vocabulary in the vocabulary set by other vocabularies, so that the accuracy of obtaining the keyword capable of reflecting the core meaning of the text information is improved.
Owner:ONE CONNECT SMART TECH CO LTD SHENZHEN

Construction method and device of user knowledge concept network and evaluation method of user knowledge

The invention discloses a construction method and device of a user knowledge concept network and an evaluation method of user knowledge.The construction method of the user knowledge concept network comprises the steps that firstly, each text contained in a text set containing m independent texts is preprocessed, and then each vocabulary of corpus serves as a concept subject term; all sentences and vocabularies are traversed, vocabularies appearing together with the concept subject terms in the same sentence are included into vocabulary sets corresponding to the concept subject terms, then vocabulary element screening is conducted on each vocabulary set, and a concept library is constructed; the field division is performed on concepts contained in the concept library by adopting a hierarchical clustering method; then, according to the matching condition of vocabularies contained in the user text data and a concept library, concepts contained in the user text data are obtained; and finally, a user knowledge concept network is constructed according to the concepts contained in the user text data and the divided concept fields. According to the method, the accuracy and objectivity of evaluation can be improved.
Owner:武汉渔见晚科技有限责任公司

Text information processing method and system

The invention provides a text information processing method and system, and the method comprises the steps: carrying out the word segmentation of a to-be-approved text, and obtaining a vocabulary setcomprising a plurality of vocabularies; extracting features of each vocabulary in the vocabulary set to obtain a vocabulary feature set; inputting the vocabulary feature set into a preset classification model for vocabulary classification, and determining whether the to-be-approved text contains sensitive words or not; if the sensitive words are contained, outputting text information used for indicating that the to-be-approved text does not pass the approval; and if the sensitive words are not included, outputting text information used for indicating that the to-be-approved text passes approval. According to the scheme, vocabulary classification is carried out on the to-be-approved text by utilizing the pre-trained classification model, and whether the to-be-approved text contains the sensitive words or not is determined. And outputting text information used for indicating whether the approval text passes approval or not according to the determination result without manual approval, sothat manpower and approval cost are saved, and approval speed and approval efficiency are improved.
Owner:BANK OF CHINA

Element extraction method and device, electronic equipment and storage medium

The invention provides an element extraction method and device, electronic equipment and a storage medium. The method comprises the steps of obtaining a to-be-extracted text and a vocabulary set of the to-be-extracted text; based on a matching result between character strings corresponding to every two characters in the to-be-extracted text and the vocabulary set, the relevancy between every two characters is determined, and the character strings are obtained by being intercepted from the to-be-extracted text with the two corresponding characters as starting points and ending points; coding each character in the to-be-extracted text on the basis of the relevancy between every two characters to obtain an element boundary feature of each character; and determining an element extraction result of the to-be-extracted text based on the element boundary features of the characters. According to the element extraction method and device, the electronic equipment and the storage medium provided by the invention, the matched vocabularies and the original sentences do not need to be spliced, and the original input length is not changed, so that the coding efficiency is improved. In addition, compared with an existing vocabulary splicing method, the storage space is saved.
Owner:IFLYTEK (SUZHOU) TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products