Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and system for discriminating and eliminating false positive results

A technology of false positives and result sets, applied in the field of knowledge mining of literature databases, can solve problems such as unreasonable selection, lack of accuracy in the usage of specialized words, lack of uniformity, and systematic defects in algorithms, achieving universal applicability and high efficiency. False positive screening rate and error correction rate, and the effect of reducing false positive rate

Active Publication Date: 2016-02-24
SHANGHAI INST OF BIOLOGICAL SCI CHINESE ACAD OF SCI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the process of knowledge mining and discovery of databases, false positive knowledge mining results are unavoidable. The reasons for this phenomenon mainly come from the following three aspects: ①The quality of raw data collection in the database is low; ②In the text mining tool dictionary In the process of compiling, the accuracy and uniformity of the usage of professional words is not enough; ③In the process of information integration and knowledge mining, the selection of computer algorithms and mining methods and approaches is unreasonable or the algorithm itself has systematic defects
[0004] At present, although the global research on database knowledge mining and discovery is at a relatively hot stage, the research on database knowledge mining false positives is still in the exploratory stage. The relevant patents and published publications in this research field There are not many related research reports
Moreover, a comprehensive method for eliminating false positive mining results in the process of document knowledge mining has not been found so far. Therefore, it is urgent to develop a new comprehensive method and approach for screening and eliminating false positive results of document knowledge mining.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for discriminating and eliminating false positive results
  • Method and system for discriminating and eliminating false positive results
  • Method and system for discriminating and eliminating false positive results

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The above-mentioned features and advantages of the present invention can be better understood after reading the detailed description of the embodiments of the present disclosure in conjunction with the following drawings. In the drawings, components are not necessarily drawn to scale, and components with similar related properties or characteristics may have the same or similar reference numerals.

[0033] figure 1 The flow chart of a preferred embodiment of the method for screening and eliminating false positive results of the present invention is shown, figure 2 At the same time, its realization principle is shown. Please combine figure 1 and figure 2 , the implementation steps of the method in this embodiment are described in detail as follows.

[0034] Step S1: Optimizing the target literature database.

[0035] The optimization process performed on the target bibliographic database includes freezing partial words. exist figure 2 In this method, based on t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a system for discriminating and eliminating false positive results, so that the false positive rate in a literature database knowledge mining process can be efficiently reduced. The technical scheme is that the method comprises: performing optimization processing on a target literature database and a controlled vocabulary; performing classification on false positive results and establishing a discovery mode; generating a false positive result set; extracting the false positive results and original data from the false positive result set, and performing classification; performing grouping on the false positive results of all types, extracting data in mining results of all types, performing training on the data by using an algorithm library to obtain a proper elimination algorithm, performing checking by using residual grouping data, and if the checking is passed, selecting the elimination algorithm, otherwise, modifying the algorithm library and performing re-training until the proper elimination algorithm is found; based on the found elimination algorithm, constructing a false positive elimination dictionary and a corresponding false positive elimination logic algorithm library; and traversing the whole false positive result set, eliminating all the false positive results, feeding back information of eliminating the false positive results to the target literature database, and finally correcting all false positive mining results in the target literature database.

Description

technical field [0001] The invention relates to the technology in the knowledge mining of the document database, in particular to a method and a system for screening and eliminating false positive results in the knowledge mining of the document database. Background technique [0002] Document knowledge mining refers to the process of extracting, integrating and discovering useful information and knowledge points from documents. Through document knowledge mining, a large number of documents can be quickly processed and knowledge in a specific field can be obtained. Document knowledge mining involves data mining, text mining, natural language Processing and information integration and other research areas. Taking life science literature knowledge mining as an example to illustrate, the main content of life science literature knowledge mining research is divided into five parts: information retrieval, entity recognition, information extraction, text mining and information integ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 陈恒赵衍陈成材张永娟
Owner SHANGHAI INST OF BIOLOGICAL SCI CHINESE ACAD OF SCI