Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

134 results about "Patent literature" patented technology

Patent literature similarity measurement method based on ontology

The invention relates to a patent literature similarity measurement method based on ontology, and relates to the technical field of natural language information processing for the ontology. The method comprises the following steps: extracting a core technical scheme according to the structural features, the position features and the keyword features of patent literatures; constructing a model for the relation between thematic terms of patent classes; constructing a field dictionary according to the model for the relation between the thematic terms of the patent classes and segmenting terms and removing stop terms for the core technical scheme; extracting keywords and weight by combining the relation between the thematic terms to TF-IDF as TextRank term initial weight; training a FastText model, and generating a term vector; and calculating an EMD distance to obtain a semantic distance according to keywords, term weight and term vector. Compared with the prior art, the patent literature similarity measurement method based on the ontology solves the problem that the similarity is low due to the fact that the structural features, the field features, the term relation features and the semantics approximate expression of the patent literature are not fully considered.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Intelligent retrieval method and device for calculating patent literature similarity based on word frequency and semantics, electronic equipment and storage medium thereof

The invention provides an intelligent retrieval method and device for calculating patent literature similarity based on word frequency and semanteme, electronic equipment and a storage medium of the electronic equipment. Bag-of-words statistics and word vector calculation are conducted on all literatures in a patent database, and corresponding bag-of-words data and word distance data are obtained;the method comprises the following steps: establishing a model, inputting contents or examination question numbers, acquiring titles, abstracts, claims and specifications of patents to be examined from question bank data and carrying out various combinations, performing rough selection and fine selection respectively according to a bag-of-words algorithm and a semantic algorithm, performing textsimilarity analysis on selected data, and performing fusion sorting on analysis results to obtain comprehensive similarity. Through duplicate checking and screening, a suspicious answer set of the to-be-checked patent is given. According to the method, the retrieval speed is increased, two rounds of screening are adopted, the first round of roughing aims at rapidly narrowing the comparison range,and the second round of fine selection aims at improving the accuracy; manpower and time can be effectively saved, a patent reviewer is helped to reduce the related patent review range, and review efficiency is improved.
Owner:北京知呱呱科技服务有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products