
Full-text retrieval matching engine based on ICD9/10 word segmentation lexicon

An engine and indexing technology applied to full-text retrieval over a medical-information word segmentation thesaurus, in the field of information retrieval. It addresses low recognition rates, low accuracy, and the difficulty of further processing recognized vocabulary. Its stated effects are a high hit rate and relevance, an optimized scoring and sorting process, and improved efficiency and accuracy.

Active Publication Date: 2020-11-06
百洋智能科技集团股份有限公司

AI Technical Summary

Problems solved by technology

This process is prone to mapping errors.
[0005] On the other hand, although computers can be used to retrieve related ICD codes, the lack of a professional, domain-specific word segmentation thesaurus leads to a low recognition rate, a low hit rate, low accuracy, and difficulty in further processing the recognized words.
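
To illustrate why a domain lexicon affects recognition, here is a minimal sketch of dictionary-based segmentation using forward maximum matching. This is a well-known generic technique chosen for illustration, not the method the patent describes. The `segment` function and the lexicon entries are hypothetical.

```python
def segment(text, lexicon):
    """Greedy forward maximum matching: at each position, take the
    longest lexicon entry that starts there; otherwise emit one character."""
    max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        match = None
        # Try the longest candidate first, then shrink.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in lexicon:
                match = candidate
                break
        if match is None:
            match = text[i]  # unknown character becomes a single-character token
        tokens.append(match)
        i += len(match)
    return tokens


# Hypothetical lexicons (assumptions for illustration only).
GENERAL = {"糖尿病"}                      # "diabetes"
DOMAIN = GENERAL | {"2型糖尿病", "肾病"}   # "type 2 diabetes", "nephropathy"

text = "2型糖尿病肾病"  # "type 2 diabetic nephropathy"
general_tokens = segment(text, GENERAL)
domain_tokens = segment(text, DOMAIN)
print(general_tokens)
print(domain_tokens)
```

With only the general lexicon, the clinical term breaks into fragments that are hard to match against ICD names. With the domain entries added, the same text yields two meaningful terms.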



Embodiment Construction

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention belong to the protection scope of the present invention.

[0028] As shown in Figure 1, the modules of the full-text retrieval matching engine based on the ICD9/10 word segmentation thesaurus proposed by the present invention, and the relationships among them, are as follows:

[0029] Data collection module. This module provides an external data interface or data transfer service and stores historical ICD-related data in the engine, providing the basis for subsequent word segmentation and synonym tagging.
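
The role of the data collection module can be sketched roughly as follows. This is a minimal in-memory illustration under stated assumptions: the `IcdRecord` and `DataCollector` names, the record fields, and the normalization rules are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class IcdRecord:
    """One historical ICD-related entry (hypothetical shape)."""
    code: str    # for example "E11.2"
    name: str    # diagnosis or procedure text as written at the source
    source: str  # originating system, for example "HIS" or "EMR"


class DataCollector:
    """Accepts records from external systems and stores them by ICD code."""

    def __init__(self) -> None:
        self._by_code: Dict[str, List[IcdRecord]] = {}

    def ingest(self, records: Iterable[IcdRecord]) -> int:
        """Store valid, non-duplicate records; return how many were added."""
        added = 0
        for record in records:
            code = record.code.strip().upper()
            name = record.name.strip()
            if not code or not name:
                continue  # skip incomplete rows
            cleaned = IcdRecord(code, name, record.source)
            bucket = self._by_code.setdefault(code, [])
            if cleaned not in bucket:
                bucket.append(cleaned)
                added += 1
        return added

    def names_for(self, code: str) -> List[str]:
        """Return every stored wording for a code, in insertion order."""
        return [r.name for r in self._by_code.get(code.strip().upper(), [])]


collector = DataCollector()
added = collector.ingest([
    IcdRecord("e11.2", "Type 2 diabetes with kidney complications", "HIS"),
    IcdRecord("E11.2", "Type 2 diabetic nephropathy", "EMR"),
    IcdRecord("E11.2", "Type 2 diabetic nephropathy", "EMR"),  # duplicate
    IcdRecord("", "missing code", "EMR"),                      # invalid
])
print(added, collector.names_for("E11.2"))
```

Grouping the different wordings recorded for the same code gives later stages material for segmentation and synonym tagging.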

[0030] Data analysis module. The input of this part is the ...


Abstract

The invention provides a general-purpose matching solution tailored to the characteristics of the business domain. It uses ICD9/10 disease classification and surgical operation lexicons, built up through natural language processing techniques such as keyword extraction and part-of-speech tagging, so that text matching in ICD9/10-related fields is highly accurate. In addition, a customized Elasticsearch analyzer is configured, improving the hit rate and accuracy of full-text indexing for the service.
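
As a rough idea of what a customized Elasticsearch analyzer might look like, the sketch below builds an index-creation body with a custom analyzer and a synonym filter. The abstract does not give the actual configuration, so everything here is an assumption: the names `icd_analyzer` and `icd_synonyms`, the field names, the choice of the `standard` tokenizer, and the sample synonym rule are all hypothetical. The code only constructs and prints the JSON body and does not contact a cluster.

```python
import json


def build_index_body(synonym_rules):
    """Build an Elasticsearch index-creation body with a custom analyzer.

    synonym_rules uses the Solr-style format accepted by the synonym
    token filter, for example "heart attack, myocardial infarction".
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    # Hypothetical filter name; rules would come from the lexicon.
                    "icd_synonyms": {
                        "type": "synonym",
                        "synonyms": list(synonym_rules),
                    }
                },
                "analyzer": {
                    # Hypothetical analyzer name; tokenizer choice is an assumption.
                    "icd_analyzer": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "icd_synonyms"],
                    }
                },
            }
        },
        "mappings": {
            "properties": {
                "code": {"type": "keyword"},
                "name": {"type": "text", "analyzer": "icd_analyzer"},
            }
        },
    }


body = build_index_body(["heart attack, myocardial infarction"])
print(json.dumps(body, indent=2))
```

A deployment for Chinese clinical text would likely swap in a tokenizer that can load the domain lexicon. Which one the patent uses is not stated in this excerpt.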

Description

Technical field

[0001] This application relates to the field of information retrieval, in particular to full-text retrieval based on a medical-information word segmentation thesaurus.

Background

[0002] With the advancement of medical informatization, hospitals have built medical information systems such as HIS (Hospital Information System) and EMR (Electronic Medical Records).

[0003] A large amount of unstructured data exists in medical information systems, owing to non-uniform coding standards, differences in doctors' habitual wording, and inconsistent design across vendors' information systems. For the classification of diseases, ICD (International Classification of Diseases) codes have been established internationally to make sharing and processing easier.

[0004] Although medical institutions adopt the same codes, each usually applies its own personalized handling to data format and data content. Therefore, under no...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G16H10/60, G06F16/242, G06F40/247, G06F40/289
CPC: G16H10/60, G06F16/242, G06F40/247, G06F40/289
Inventor: 谭明智, 周宗霞, 李翔, 王凤阳
Owner: 百洋智能科技集团股份有限公司