Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Poison-target literature knowledge mining method and system based on network crawling

A technology of knowledge mining and web crawler, applied in the Internet field, can solve problems affecting the research and development of poisons and the work efficiency of scientific researchers, difficult knowledge mining, and staying at the level of static content search and matching, etc.

Pending Publication Date: 2020-10-20
ACADEMY OF MILITARY MEDICAL SCI
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing literature retrieval system for poisons and targets is based on keywords input by users, such as poison names, target names, or a combination of both, to perform information retrieval and fuzzy matching in the background literature database, and find documents with high similarity and return them to users. The search method still stays at the level of static content search and matching, it is difficult to obtain the hidden knowledge in the literature, and it is even more difficult to carry out knowledge mining from the massive biomedical literature, which seriously affects the work efficiency of poison research and scientific research workers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Poison-target literature knowledge mining method and system based on network crawling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0043] The present invention proposes a poison-target literature knowledge mining method based on web crawling, including:

[0044] Acquiring toxicant and target data information and processing to create comprehensive data sets;

[0045] Develop web crawler tools;

[0046] Based on the comprehensive data set, use the web crawler tool to crawl and process the poison and target document text information to establish a document text database;

[0047] Based on the literature text database, using natural language processing technology to determine the potential relationship between poison and target, forming a knowledge base of poison-target relationship;

[0048] The literature text database and the poison-target relations...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a poison-target literature knowledge mining method and system based on network crawling. The poison-target literature knowledge mining method comprises the steps of: acquiring poison and target data information, and processing the poison and target data information, so as to establish a comprehensive data set; developing a web crawler tool; crawling poison and target literature text information by utilizing the web crawler tool based on the comprehensive data set, and processing the poison and target literature text information to establish a literature text database; based on the literature text database, determining a poison-target potential action relationship by utilizing a natural language processing technology to form a poison-target relationship knowledge base; and performing poison-target literature knowledge mining by utilizing the literature text database and the poison-target relationship knowledge base. The poison-target literature knowledge mining method and the system based on network crawling are high in efficiency, good in accuracy and high in intelligent degree.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a poison-target document knowledge mining method and system based on web crawling. Background technique [0002] With the rapid development of toxicology and molecular biology and other disciplines, a large number of data sets related to poisons and targets have emerged on the Internet, but at present, the resource storage of these data sets is scattered, the format is heterogeneous, and different poisons and target data sets exist. With a large amount of redundant information, poison names and aliases are confused, and there is a lack of unified naming conventions. Although the emergence of these data sets provides a reference for the vast number of toxicology researchers, due to the lack of uniform data standards, the lack of necessary data filtering and quality control mechanisms, resulting in low efficiency of manual retrieval and query, too much redundant information an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/951G06F16/33
CPCG06F16/951G06F16/334Y02D10/00
Inventor 周文霞韩露张永祥肖智勇黄晏刘港高圣乔罗丹
Owner ACADEMY OF MILITARY MEDICAL SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products