Unlock instant, AI-driven research and patent intelligence for your innovation.

Methods, systems, and storage media for automatically identifying relevant chemical compounds in patent documents

A technology for patent documents and identification systems, used in document management systems, patent retrieval, word processing, etc.

Pending Publication Date: 2021-03-23
ELSEVIER +1
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, these compounds may only be available through patent literature for a period of time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods, systems, and storage media for automatically identifying relevant chemical compounds in patent documents
  • Methods, systems, and storage media for automatically identifying relevant chemical compounds in patent documents
  • Methods, systems, and storage media for automatically identifying relevant chemical compounds in patent documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present disclosure generally relates to a system that automatically extracts compounds from patent documents and determines the relevance of the compounds to the patent documents. The method described herein involves a training device, in particular, configured to extract patent documents from a database, normalize the patent documents and feed the patent documents to a machine learning system (referred to in the text as a chemical entity recognition system) , allowing machine learning systems to be trained to automatically identify compounds in normalized patent documents and determine whether those compounds are related to related patent documents.

[0024] Patent data contained in patent documents can be obtained from various patent databases, including but not limited to those provided by various patent offices such as the European Patent Office (EPO), the United States Patent and Trademark Office (USPTO), the World Intellectual Property Organization (WIPO), the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Methods, systems, and non-transitory media for training a chemical entity recognition system to extract chemical compounds from a patent document and determine a relevance of the chemical compounds tothe patent document are disclosed. A method includes obtaining patent documents from patent databases, normalizing each patent document into a unified format, and generating a chemical patent corpus.The chemical patent corpus includes chemical entities, each having relevancy annotations that indicate a relevance to the patent document from which the chemical entity is extracted. The method further includes providing the chemical patent corpus to the chemical entity recognition system, which tags the one or more chemical entities in a corresponding normalized patent document, extracts additional chemical entities, assigns a confidence score to each additional chemical entity, and labels each additional chemical entity as relevant or irrelevant to an associated patent document based on information contained in the chemical patent corpus.

Description

[0001] Cross References to Related Applications [0002] This application claims priority to U.S. Provisional Patent Application No. 62 / 639,656, entitled "Automatic Identification of Related Compounds in Patents," filed March 7, 2018, the entire contents of which are hereby incorporated by reference. technical field [0003] The present disclosure relates to methods, systems, and storage media for automatically identifying compounds in patent documents, and more particularly, to methods for training chemical entity recognition systems to automatically extract compounds from patent documents and to correlate compounds with respect to corresponding patent documents. A method, system and storage medium for classifying sex. Background technique [0004] Chemistry-related publications may include patent applications and scientific journal articles. In a commercial R&D program, the first public disclosure of a new compound may occur during the patent application process. Sometim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/12G06F40/20G06F16/93
CPCG06F16/93G06F40/20G06F40/149G06F2216/11
Inventor 沙贝尔·A·阿肯迪辛纳克·雷伊马库斯·施沃雷尔海克·纳优加布里埃尔·伊尔曼马蒂亚斯·伊默尔克劳迪娅·鲍巴斯
Owner ELSEVIER