Chemical information extraction method and device, equipment and storage medium
A chemical information extraction technology, applied in the field of chemical information, that addresses the problems of manual information copying being time-consuming and error-prone and of data sets being difficult to maintain.
Examples
Embodiment 1
[0067] Figure 1 is a flow chart of a chemical information extraction method provided by Embodiment 1 of the present invention. This embodiment is applicable to extracting structured data from chemical industry literature whose chemical information is held as unstructured or semi-structured data. The method can be performed by the chemical information extraction device provided by an embodiment of the present invention; the device can be implemented in software and/or hardware and integrated into the computer equipment provided by an embodiment of the present invention. As shown in Figure 1, the method specifically includes the following steps:
[0068] S101. Obtain chemical documents.
[0069] For example, in some embodiments of the present invention, documents and materials related to chemical components and their reactions can be collected from across the Internet. The document format may include a Word document, an RTF document, an Excel document, an HTML webpage, a PDF document...
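As an illustration of step S101, the collected files could first be grouped by the formats listed above so that each group can be handed to a suitable parser. The following is a minimal sketch under stated assumptions: the suffix table, `detect_format`, and `group_by_format` are hypothetical names introduced here and do not come from the patent text.

```python
from pathlib import Path

# Hypothetical mapping from file suffix to the document formats named in [0069].
FORMAT_BY_SUFFIX = {
    ".doc": "word",
    ".docx": "word",
    ".rtf": "rtf",
    ".xls": "excel",
    ".xlsx": "excel",
    ".htm": "html",
    ".html": "html",
    ".pdf": "pdf",
}


def detect_format(filename: str) -> str:
    """Return the assumed format name for a file, or "unknown"."""
    suffix = Path(filename).suffix.lower()
    return FORMAT_BY_SUFFIX.get(suffix, "unknown")


def group_by_format(filenames):
    """Group file names by detected format, preserving input order."""
    groups = {}
    for name in filenames:
        groups.setdefault(detect_format(name), []).append(name)
    return groups
```

Suffix-based detection is only a first pass; a real collector would likely also inspect file contents, since extensions found on the web can be missing or wrong.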
Embodiment 2
[0101] Figure 2 is a schematic structural diagram of a chemical information extraction device provided in Embodiment 2 of the present invention. As shown in Figure 2, the chemical information extraction device includes:
[0102] A chemical document acquisition module 201, configured to acquire chemical documents;
[0103] A separation module 202, configured to separate images and texts from the chemical documents;
[0104] A tag extraction module 203, configured to extract, from the image, a chemical structure and the label that annotates the chemical structure;
[0105] A mapping relationship establishment module 204, configured to establish a mapping relationship between the chemical structure and the label to obtain first storage information;
[0106] An association relationship extraction module 205, configured to extract chemical entities and association relationships between chemical entities from the text to obtain second storage information;
[0107] A storage mo...
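The data flow through modules 202 to 205 can be sketched roughly as follows. This is an illustrative toy, not the patented implementation: all class and function names are hypothetical, images are assumed to already carry a recognized structure string and label, the mapping direction (label to structure) is an assumption, and relation extraction is replaced by simple sentence-level co-occurrence. The storage module is not modeled because its description is truncated above.

```python
from dataclasses import dataclass, field


@dataclass
class ChemicalDocument:
    """Hypothetical container for an acquired document (module 201)."""
    text: str
    images: list = field(default_factory=list)  # each image is a dict stub


def separate(doc):
    """Module 202: separate images and text from the document."""
    return doc.images, doc.text


def extract_structure_and_label(image):
    """Module 203 (stub): assume recognition results are already attached."""
    return image["structure"], image["label"]


def build_mapping(images):
    """Module 204: map each label to its structure (first storage information)."""
    mapping = {}
    for image in images:
        structure, label = extract_structure_and_label(image)
        mapping[label] = structure
    return mapping


def extract_relations(text, known_entities):
    """Module 205 (toy): link the first two known entities found in a sentence."""
    relations = []
    for sentence in text.split("."):
        found = [e for e in known_entities if e in sentence]
        if len(found) >= 2:
            relations.append((found[0], "co-occurs-with", found[1]))
    return relations


def run_pipeline(doc, known_entities):
    """Run modules 202-205 and return (first, second) storage information."""
    images, text = separate(doc)
    first = build_mapping(images)
    second = extract_relations(text, known_entities)
    return first, second
```

A possible usage: build a `ChemicalDocument` with one image stub such as `{"structure": "c1ccccc1", "label": "1a"}` and call `run_pipeline(doc, ["Benzene", "chlorine"])` to obtain the label-to-structure mapping and the list of entity relations.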
Embodiment 3
[0130] Embodiment 3 of the present invention provides a computer device. Figure 3 is a schematic structural diagram of the computer device provided by Embodiment 3 of the present invention. As shown in Figure 3, the computer device includes a processor 301, a memory 302, a communication module 303, an input device 304, and an output device 305. The computer device may have one or more processors 301; Figure 3 takes one processor 301 as an example. The processor 301, memory 302, communication module 303, input device 304, and output device 305 can be connected by a bus or by other means; Figure 3 takes connection via a bus as an example. These components may be integrated on the control board of the computer device.
[0131] The memory 302, as a computer-readable storage medium, can be used to store software programs, computer-executable programs an...