Supercharge Your Innovation With Domain-Expert AI Agents!

Method and apparatus for natural language processing of medical text in chinese

A medical and Chinese technology, applied in the field of natural language processing framework, can solve problems such as different language elements and the framework cannot be directly converted into Chinese

Pending Publication Date: 2022-02-11
TENCENT AMERICA LLC
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Second, although there are some existing medical text processing frameworks expressed in English, such as the Unified Medical Language System (UMLS) and the Tenth Revision of the International Statistical Classification of Diseases and Related Health Problems (ICD-10), these frameworks cannot be directly translated into Chinese , because many language elements differ significantly from

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for natural language processing of medical text in chinese
  • Method and apparatus for natural language processing of medical text in chinese
  • Method and apparatus for natural language processing of medical text in chinese

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] In the medical domain, a large number of documents are based on and use free or unstructured text as their representation. However, applying AI techniques in the medical field may require processing, structuring, and understanding of medically relevant entities. Embodiments of the present disclosure relate to a natural language processing (NLP) framework 100 for understanding medical content expressed in Chinese, such as medical text data 104 . The NLP framework 100 may include a deep attention-based named entity recognition (NER) model 101 for identifying medically relevant entities and their categories in unstructured medical text data 104 together with a Chinese medical dictionary. The multidimensional entity understanding framework 102 can be used to structure free-text content by determining a series of attributes that describe the corresponding core medical entities. Additionally, the medical knowledge graph 103 may be used to perform medical entity normalization...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method for processing unstructured Chinese-language medical text includes identifying a medical entity in the unstructured Chinese-language medical text using an attention-based named-entity recognition (NER) model, structuring the identified medical entity using a multiple-dimensional entity understanding framework, normalizing the structured medical entity using a medical knowledge graph, and outputting the normalized medical entity.

Description

[0001] Cross References to Related Applications [0002] This application claims priority to U.S. Patent Application Serial No. 16 / 395,439, filed April 26, 2019, in the United States Patent and Trademark Office, which is hereby incorporated by reference in its entirety. technical field [0003] The present disclosure relates to a natural language processing (NLP) framework for processing and understanding medical-related content expressed in Chinese. Background technique [0004] In recent years, electronic health record (EHR) systems and electronic medical record (EMR) systems have been increasingly adopted in hospitals around the world. EHR systems can collect a wide range of medical data, including structured and unstructured data, text and images. More specifically, most text-based clinical data are still collected and stored in the form of unstructured natural language. Although great efforts have been made in structuring and formalizing medical content, only a small ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/08
CPCG06F16/313G06F40/205G06F40/295G06N5/022G16H50/70G16H15/00G16H50/20G06N7/01G06N3/044G06N5/02
Inventor 杨涛涂旻李亚亮谢于晟张尚卿王堃杜楠范伟
Owner TENCENT AMERICA LLC
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More