A full-text search matching engine based on an ICD9/10 word segmentation thesaurus

An indexing and search engine technology, applied to full-text retrieval and information retrieval based on a medical word segmentation thesaurus. It addresses problems such as a low recognition rate, a low accuracy rate, and difficulty in further processing recognized words. It optimizes the scoring and ranking process and achieves a high hit rate and relevance, improving efficiency and accuracy.

Active Publication Date: 2022-06-21
百洋智能科技集团股份有限公司
8 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

This process is prone to mapping errors.
[0005] On the other hand, although computers can be used to retrieve related ICD codes, the lack of a professional, business-specific word segmentation thesaurus leads to problems such as a low recognition rate, a low hit rate, a low accuracy rate, and difficulty in further processing the recognized words.
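
To illustrate why a domain thesaurus matters for recognition, the following is a minimal sketch of lexicon-driven segmentation. It uses greedy forward maximum matching, a common dictionary-based technique chosen here for illustration; the source does not state which segmentation algorithm the engine uses. The function name `segment` and the entries in `LEXICON` are hypothetical.

```python
def segment(text, lexicon):
    """Greedy forward maximum matching over whitespace-separated words.

    Multi-word phrases found in the lexicon are kept as single tokens;
    everything else falls back to one word per token.
    """
    words = text.lower().split()
    # Longest lexicon entry, measured in words (1 if the lexicon is empty).
    max_len = max((len(term.split()) for term in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(words):
        # Try the longest candidate phrase first, then shrink the window.
        for n in range(min(max_len, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + n])
            if n == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += n
                break
    return tokens


# Hypothetical lexicon entries, for illustration only.
LEXICON = {"type 2 diabetes mellitus", "acute myocardial infarction"}

print(segment("Type 2 diabetes mellitus with acute myocardial infarction", LEXICON))
```

Without the lexicon, the same input would be split into eight unrelated words, which is the kind of low recognition rate described above. With it, the two disease names survive as whole terms that can be scored and matched as units.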

Embodiment Construction

[0027] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments in the present invention, all other embodiments obtained by those of ordinary skill in the art fall within the protection scope of the present invention.

[0028] As shown in Figure 1, the modules of the full-text search matching engine based on the ICD9/10 word segmentation thesaurus proposed by the present invention, and the relationships between them, are as follows:

[0029] Data collection module. This module provides external data interfaces or data dump services and stores historical ICD-related data in the engine, which provides the basis for subsequent data segmentation and synonym tagging.

[0030] Data analysis modu...

Abstract

The invention provides a general matching solution tailored to business characteristics. An ICD9/10 disease classification and surgical operation lexicon, built up through natural language processing methods such as keyword extraction and part-of-speech tagging, matches text in ICD9/10-related fields to a high degree. In addition, a customized Elasticsearch analyzer is configured to improve the hit rate and accuracy of full-text indexing for this business.
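
As a rough sketch of what a customized Elasticsearch analyzer might look like, the following builds an index definition that combines a built-in tokenizer with a synonym token filter fed from thesaurus entries. The abstract does not give the actual configuration, so everything here is an assumption: the names `build_index_body`, `icd_synonyms`, and `icd_analyzer`, the field names, the `standard` tokenizer, and the sample synonym rules. Text in other languages, Chinese in particular, would likely need a different tokenizer.

```python
import json


def build_index_body(synonym_rules):
    """Return an index definition (settings + mappings) as a plain dict.

    synonym_rules: iterable of strings in the comma-separated synonym
    format, e.g. "heart attack, myocardial infarction".
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    # Synonym token filter populated from the thesaurus.
                    "icd_synonyms": {
                        "type": "synonym",
                        "synonyms": list(synonym_rules),
                    }
                },
                "analyzer": {
                    # Custom analyzer: tokenize, lowercase, expand synonyms.
                    "icd_analyzer": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "icd_synonyms"],
                    }
                },
            }
        },
        "mappings": {
            "properties": {
                # Hypothetical fields: exact-match code, analyzed name.
                "code": {"type": "keyword"},
                "name": {"type": "text", "analyzer": "icd_analyzer"},
            }
        },
    }


# Hypothetical synonym rules, for illustration only.
RULES = ["heart attack, myocardial infarction", "t2dm, type 2 diabetes mellitus"]

print(json.dumps(build_index_body(RULES), indent=2))
```

The printed JSON has the shape of a create-index request body. Keeping the synonym rules as a separate input means the thesaurus can grow without touching the analyzer definition itself.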

Description

Technical field

[0001] The present application relates to the field of information retrieval, and in particular to full-text retrieval based on a word segmentation thesaurus for medical information.

Background technique

[0002] With the advancement of medical informatization, hospitals have built medical information systems such as HIS (Hospital Information System) and EMR (Electronic Medical Records).

[0003] A large amount of unstructured data exists in these medical information systems. Coding standards are inconsistent, doctors' habitual wording differs, and information systems from different vendors follow inconsistent designs. For disease classification, ICD (International Classification of Diseases) codes have been established internationally to make sharing and processing easier.

[0004] Because data formats and data content differ between medical institutions, each institution usually applies its own customized processing even when the same codes are used. ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G16H10/60, G06F16/242, G06F40/247, G06F40/289
CPC: G16H10/60, G06F16/242, G06F40/247, G06F40/289
Inventor: 谭明智, 周宗霞, 李翔, 王凤阳
Owner: 百洋智能科技集团股份有限公司