Method for structured processing of Chinese pathological text

A text structuring and processing method, applied in unstructured text data retrieval, electronic digital data processing, special data processing applications, etc., that addresses the difficulty of obtaining useful information from unstructured text and achieves high accuracy.

Inactive Publication Date: 2015-09-09
DONGHUA UNIV +1
5 Cites, 19 Cited by

AI Technical Summary

Problems solved by technology

Existing natural language processing and data mining techniques, such as word segmentation, clustering, classification, and information extraction, can extract useful information from unstructured data to a large extent. Doing so efficiently and accurately for unstructured Chinese pathological texts, however, remains a major challenge.

Examples

Embodiment Construction

[0027] The present invention is further illustrated below with reference to specific embodiments. These embodiments serve only to illustrate the invention and are not intended to limit its scope. After reading the teachings of the present invention, those skilled in the art may make various changes or modifications to it, and such equivalent forms also fall within the scope defined by the appended claims of the present application.

[0028] Embodiments of the present invention relate to a method for structured processing of Chinese pathological text, comprising the following steps: (1) extracting the template information corresponding to each sample from the hierarchical structure of pathological report text data samples and indicators; (2) extracting the template information, including short sentence ...
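As a rough illustration of the short sentence segmentation named in step (2), the sketch below splits a report on punctuation. The delimiter set and the function name `split_short_sentences` are assumptions made for this example; the text above does not state which marks bound a short sentence.

```python
import re

# Assumed delimiter set (Chinese and ASCII clause punctuation plus newlines).
# The source does not specify the delimiters; adjust to the actual reports.
_DELIMS = re.compile(r"[，。；！？,;!?\n]+")


def split_short_sentences(report):
    """Split a report into non-empty, whitespace-trimmed short sentences."""
    return [part.strip() for part in _DELIMS.split(report) if part.strip()]
```

The ASCII period is deliberately left out of the delimiter set so that decimal measurements such as "3.5cm" are not cut in two.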

Abstract

The present invention relates to a method for structured processing of Chinese pathological text. The method comprises the following steps: extracting the template information corresponding to each sample from the hierarchical structure of pathological report text data samples and indicators, where extracting the template information comprises short sentence segmentation and indicator name extraction; classifying the short sentences; and, for each sample, combining the classification result clusters and the short sentence clusters to calculate the TF value, IDF value, and C-value of each indicator name in the indicator name list over the short sentence corpus, selecting the indicator names whose TF value, IDF value, and C-value satisfy a threshold, and using the selected indicator names as components of the final template. The invention thereby allows unstructured Chinese pathological text to be structured.
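The scoring and screening step can be sketched as follows. The abstract does not give the formulas, so this example assumes common textbook definitions: TF as the raw occurrence count over the short sentences, IDF as log(N / df) with each short sentence treated as a document, and C-value in the form usually attributed to Frantzi et al., with name length measured in characters. The function names and the use of minimum thresholds are likewise assumptions.

```python
import math


def score_indicator_names(names, sentences):
    """Return {name: (tf, idf, c_value)} over a list of short sentences.

    Assumes `names` contains no duplicates.
    """
    n = len(sentences)
    # TF (assumed): total number of occurrences across all short sentences.
    freq = {a: sum(s.count(a) for s in sentences) for a in names}
    scores = {}
    for a in names:
        # IDF (assumed): log(N / df), each short sentence counted as a document.
        df = sum(1 for s in sentences if a in s)
        idf = math.log(n / df) if df else 0.0
        # Longer candidate names that contain `a` as a substring.
        nests = [b for b in names if b != a and a in b]
        adjusted = freq[a]
        if nests:
            # Discount occurrences that belong to the longer names.
            adjusted -= sum(freq[b] for b in nests) / len(nests)
        c_value = math.log2(len(a)) * adjusted
        scores[a] = (freq[a], idf, c_value)
    return scores


def screen(scores, min_tf, min_idf, min_c):
    """Keep names whose three scores all reach the (assumed) minimums."""
    return [a for a, (tf, idf, c) in scores.items()
            if tf >= min_tf and idf >= min_idf and c >= min_c]
```

With these definitions a single-character name always has a C-value of zero, because log2(1) = 0. Whether the patent applies the thresholds as minimums, maximums, or ranges is not stated in the abstract, so `screen` should be read as one possible interpretation.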

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, in particular to a method for structured processing of Chinese pathological texts.

Background

[0002] With the rapid development of the Internet and the gradual informatization of the domestic medical industry, the amount of information in the medical industry is growing exponentially. At the same time, as living standards improve, people's concern is shifting from basic access to medical care to the efficiency and accuracy of medical diagnosis. The large volume of unstructured pathological data accumulated under traditional medical practice, which continues to grow in real time every day, is not only difficult to store and manage but also cannot meet the needs of today's pathological analysis.

[0003] How to obtain useful information from unstructured pathological data has become one of the cu...

Application Information

IPC(8): G06F17/30
CPC: G06F16/35, G06F16/36
Inventor: 陈德华, 冯洁莹, 朱立峰, 乐嘉锦, 刘茜茜, 薛瑞东
Owner: DONGHUA UNIV