Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

35 results about "Name disambiguation" patented technology

Name disambiguation method and apparatus

The invention provides a name disambiguation method and apparatus. The method comprises the following steps: preprocessing full-text information of names to be disambiguated so as to extract semantic features of the full-text information; according to the semantic features, generating semantic fingerprints of the full-text information of the names to be disambiguated, including mail fingerprints, coauthor fingerprints, mechanism fingerprints and text fingerprints; through comparing the full-text information of the names to be disambiguated with semantic fingerprints having same-name full-text information as the names to be disambiguated in a preset semantic fingerprint database, determining similarity between the full-text information of the names to be disambiguated and the semantic fingerprints having the same-name full-text information as the names to be disambiguated in the preset semantic fingerprint database; and according to the semantic fingerprint similarity, determining a name group after disambiguation which the semantic fingerprints of the full-text information of the names to be disambiguated belongs to. By using such a method, while name disambiguation accuracy is ensured, the name disambiguation speed is improved, and increment name disambiguation is supported.
Owner:INST OF SCI & TECHN INFORMATION OF CHINA

Disambiguation processing method, system and device for cross-enterprise personnel name duplication in industrial and commercial registration information, processor and storage medium thereof

The invention relates to a disambiguation processing method for a cross-enterprise personnel name duplication phenomenon in industrial and commercial registration information, and the method comprises the steps: carrying out the data collection and filtering processing according to the industrial and commercial registration information, and obtaining an industrial and commercial information personnel list; sampling the obtained business information personnel list to obtain part of personnel information data and corresponding enterprise registration information; grouping the obtained data by constructing an undirected graph model, and calculating the similarity between every two nodes in each sub-graph generated by the undirected graph model; and according to the training vector and the prediction vector, constructing a similarity vector to train a logic regression model, and carrying out similarity weighting processing to obtain a name disambiguation result. The invention also relates to a corresponding system, device, processor and storage medium. By adopting the method, the system, the device, the processor and the storage medium, the enterprise names can be automatically disambiguated, and a certain support is provided for enterprise association relationship analysis.
Owner:上海睿翎法律咨询服务有限公司

Name disambiguation method and system based on LightGBM classification and representation learning

The invention provides a LightGBM classification and representation learning-based name disambiguation method and a LightGBM classification and representation learning-based name disambiguation system for scientific literature data and aiming at author homonymy phenomena in literatures. According to the supervised learning part, meta-information features of papers in a training set and associated information features among the papers are extracted by utilizing feature engineering, a positive example and negative example sample pair data set is constructed through sampling and serves as input of a LightGBM dichotomy model, and model output serves as the probability that the two papers belong to the same author. The representation learning part refers to a word2vec text semantic representation method and a meta-path-based relation network representation method to capture semantic information of papers and relation characteristics between the papers. And finally, based on the output of the supervision model and the representation learning model, cluster division is performed on the to-be-disambiguated paper set by using a hierarchical clustering algorithm to realize homonymy disambiguation. According to the method, high expandability and stability can be achieved on the premise that the accuracy rate and the recall rate are not lost, parallel calculation can be completely achieved, and the execution efficiency is improved.
Owner:COMP NETWORK INFORMATION CENT CHINESE ACADEMY OF SCI

Disambiguation method and device for thesis author and computer equipment

The invention relates to the artificial intelligence technology, and discloses a paper author disambiguation method comprising the following steps: respectively forming author names involved in all papers in a database into name trees according to preset rules; obtaining association relationship heterogeneous networks corresponding to all papers in a database; obtaining paper semantic representations respectively corresponding to all papers in the database; constructing a similar matrix based on the name tree, the association relationship heterogeneous network and the paper semantic representation; clustering the similar matrixes to obtain paper clustering groups corresponding to all papers in a database; judging whether the paper clustering group corresponding to the author to be disambiguated belongs to a paper clustering group corresponding to a specified author or or not; and if not, judging that the author to be disambiguated is different from the specified author. According to the method and device, the author names are preprocessed to construct the name tree, then clustering errors caused by different expression modes of name writing are eliminated according to the name tree, it is guaranteed that the names of the same author are divided into the same group as much as possible, and the name disambiguation accuracy is improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Part-of-speech tagging-based internet news related place name identification method and system

The invention discloses a part-of-speech tagging-based internet news related place name identification method and a part-of-speech tagging-based internet news related place name identification system,and belongs to the technical field of natural language processing. According to the part-of-speech tagging-based internet news related place name identification method, the method comprises the stepsof supplementing the context information of news by utilizing the overall reporting region of a news media column; assisting a place name disambiguation program to correctly judge a place name, converting news contents into a pure noun phrase sequence by utilizing part-of-speech tagging, carrying out place name identification on the noun phrase sequence, carrying out place name subtraction on place name identification results twice, eliminating inaccurate place names, and finally carrying out weighted summary on the two place name subtraction results to confirm the place name. The part-of-speech tagging-based internet news related place name identification method is popular and easy to understand. The implementation process of the method is simple. The problem that news related place names are low in extraction accuracy can be effectively solved. The part-of-speech tagging-based internet news related place name identification method has good application and popularization value.
Owner:INSPUR SOFTWARE CO LTD

Method and device for name disambiguation

The invention provides a name disambiguation method and apparatus. The method comprises the following steps: preprocessing full-text information of names to be disambiguated so as to extract semantic features of the full-text information; according to the semantic features, generating semantic fingerprints of the full-text information of the names to be disambiguated, including mail fingerprints, coauthor fingerprints, mechanism fingerprints and text fingerprints; through comparing the full-text information of the names to be disambiguated with semantic fingerprints having same-name full-text information as the names to be disambiguated in a preset semantic fingerprint database, determining similarity between the full-text information of the names to be disambiguated and the semantic fingerprints having the same-name full-text information as the names to be disambiguated in the preset semantic fingerprint database; and according to the semantic fingerprint similarity, determining a name group after disambiguation which the semantic fingerprints of the full-text information of the names to be disambiguated belongs to. By using such a method, while name disambiguation accuracy is ensured, the name disambiguation speed is improved, and increment name disambiguation is supported.
Owner:INST OF SCI & TECHN INFORMATION OF CHINA

Chinese and English literature author name fusion disambiguation method

The invention belongs to the technical field of name disambiguation, and particularly relates to a Chinese and English literature author name disambiguation method. According to the method, Chinese author name disambiguation and English author name disambiguation are carried out based on semantic fingerprints, author cooperation network similarity, author reference network similarity and the like, and disambiguation of Chinese authors and name pinyin in English literatures is completed according to a Chinese disambiguation result and an English disambiguation result. According to the method, whether authors of different literatures are the same person or not can be accurately distinguished, the same author in Chinese and English can be well recognized, the author needing to be found can be quickly positioned, the accuracy rate is high, and retrieval work can be conveniently carried out; the calculation of the similarity of the scientific research duration of the authors is introduced, so that disambiguation of Chinese and English names of the Chinese authors can be well assisted, the age range of the authors can be determined, other authors with the same name not in the range can be filtered out, and the disambiguation accuracy is improved.
Owner:中科大数据研究院

Natural person name disambiguation method and device based on enterprise association relationship and medium

The invention discloses a natural person name disambiguation method and device based on an enterprise association relationship and a medium, and the method comprises the steps: obtaining basic training data, taking an enterprise and a person as two node types, and constructing an enterprise-person basic heterogeneous graph; according to a preset splitting rule, splitting partial personnel nodes with a plurality of edges in the basic heterogeneous graph to obtain a derivative graph; according to the derivative graph, training a preset heterogeneous graph neural network model to obtain a node vector representation model; and adding to-be-merged personnel as personnel nodes into the basic heterogeneous graph, and judging whether the to-be-merged personnel nodes and other homonymous nodes in the basic heterogeneous graph need to be merged or not according to the node vector representation model. Compared with the prior art, the method has the advantages that the enterprise data is graphical through the enterprise association relationship, and then the ambiguity elimination processing is performed on the homonymous nodes through the trained graph neural network model, so that the accuracy of the ambiguity elimination result is greatly improved.
Owner:SUZHOU LANGDONG NET TEC CO LTD

Name disambiguation method, device, electronic device, and computer-readable storage medium

The embodiment of the present application relates to the field of information retrieval technology, and discloses a name disambiguation method, device, electronic equipment, and computer-readable storage medium, wherein the name disambiguation method includes: according to the word sparse distributed representation generated in advance based on the training corpus SDR, determine the document information of at least two documents in at least two language categories to be disambiguated, one document corresponds to one language category; then, based on the pre-built document author classification model for at least two language categories, According to the document information of each document in at least two language types, classify each document according to the author of the document to obtain the first author category corresponding to each document, and the document author classification model of one language type corresponds to the processing Documents of corresponding language categories; Next, the first author categories under each language category are merged, so as to disambiguate the names of the document authors of each document in each language category.
Owner:INST OF SCI & TECHN INFORMATION OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products