Unlock instant, AI-driven research and patent intelligence for your innovation.

Information extraction method and device

A kind of information extraction, part of the technology, applied in the field of text processing, can solve problems such as inability to extract, achieve the effect of facilitating data query and positioning, and enhancing the effect of data structuring

Active Publication Date: 2021-11-09
北京嘉和海森健康科技有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the technical problem in the prior art that more words cannot be extracted from word segmentation results obtained according to the longest matching principle, an embodiment of the present application provides an information extraction method and device

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information extraction method and device
  • Information extraction method and device
  • Information extraction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0049] In traditional information extraction methods, text is often segmented based on a preset lexicon to obtain useful information, such as words representing diseases, symptoms, operations, etc., and then segmented based on the longest matching principle, that is, according to the Segment the longest word matched in the database to obtain the word segmentation r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the present application discloses an information extraction method, which performs word segmentation on the preset text according to the preset thesaurus, obtains the first word segmentation result, extracts a plurality of undetermined words included in the first word segmentation result, and extracts a plurality of undetermined words from the plurality of undetermined words Determine the undetermined words without inclusion relationship as the information extraction result of the first word segmentation result. Due to the use of two word segmentations, not only the longer first word segmentation results can be extracted, but also the shorter information about the first word segmentation results without inclusion relationship can be further extracted from the longer first word segmentation results The extraction results, such as extracting words representing information such as parts and diseases from the complete words representing the operation name, on the one hand increase the amount of information extracted, on the other hand, through the information of the first word segmentation result and the first word segmentation result The structure level setting of the extraction results enhances the data structure effect and facilitates data query and positioning. The embodiment of the present application also discloses an information extraction device.

Description

technical field [0001] The present application relates to the field of text processing, in particular to an information extraction method and device. Background technique [0002] Electronic Medical Record (EMR) is also called computerized medical record system or computer-based patient record. It is a digital medical service work record of medical personnel in medical institutions for clinical diagnosis and treatment of outpatients and inpatients, guiding intervention, and using information systems to generate text, symbols, charts, data, and graphics. The development of electronic medical records provides convenience for doctors to understand patient information and clinical research in real time. However, at present, there are both structured data and unstructured data in electronic medical records, and some important information mostly exists in unstructured data, such as chief complaints, current medical history, past history, etc. in electronic medical records. There...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/289
CPCG06F40/289
Inventor 李重勋王利叶胡可云陈联忠
Owner 北京嘉和海森健康科技有限公司