Medical ancient Chinese sentence segmentation method based on Bayesian statistics learning

A Bayesian statistical learning technique for segmenting sentences in ancient Chinese medical texts. It is intended to address the lack of corpora, high processing cost, and poor processing performance in this field.

Active Publication Date: 2017-12-19
CHENGDU UNIV OF INFORMATION TECH

AI Technical Summary

Problems solved by technology

At present, relatively mature modern Chinese processing techniques are being applied to the still-immature processing of Chinese medical texts in China. Whether these methods perform as well on ancient Chinese medical texts as they do on modern texts remains to be verified.
Moreover, because processing conventions are inconsistent and corpora for the required processing tasks are lacking, the techniques applied so far process the classics inefficiently.
[0005] To sum up, the problem


Examples

Embodiment Construction

[0051] To make the objectives, technical solution, and advantages of the present invention clearer, the invention is described in further detail below with reference to the examples. The specific embodiments described here serve only to explain the present invention and do not limit it.

[0052] At present, relatively mature modern Chinese processing techniques are being applied to the still-immature processing of Chinese medical texts in China. Whether these methods perform as well on ancient Chinese medical texts as they do on modern texts remains to be verified.

[0053] The application principle of the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0054] In the medical ancient Chinese sentence segmentation method based on Bayesian statistical learning provided by the embodiment of the present invention, in...

Abstract

The invention belongs to the field of language processing and discloses a medical ancient Chinese sentence segmentation method based on Bayesian statistical learning. In this method, two-tuples and three-tuples are added as feature attributes, or one-tuple, two-tuple, and three-tuple feature attributes are combined in various ways. Sentence identification with a naive Bayesian method then yields multiple groups of experimental results, from which the best model is selected, thereby accomplishing the ancient Chinese sentence segmentation task. The method is tailored to the actual text being processed. With this experimental approach, the F-values of the various features can be improved by at least 25% over the prior art. The rules for sentence identification in medical ancient Chinese texts are systematically analyzed and summarized, the processing method can be applied in practical traditional Chinese medicine, and a sentence identification corpus for medical ancient Chinese texts is established, so that the research results can play a greater role.
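
The abstract names the ingredients (one-, two-, and three-tuple features, a naive Bayesian classifier, and F-values) but this excerpt does not give the implementation. The following is a minimal sketch only, under several assumptions that are not from the patent: tuples are read as character n-grams around each position, the classifier decides for every character whether a sentence break follows it, Laplace smoothing is used, and the names `features`, `NaiveBayesSegmenter`, and `f_measure` are hypothetical. The training strings are made-up toy data, not corpus text.

```python
import math
from collections import Counter


def features(text, i):
    """Assumed feature set: one-, two-, and three-tuples around character i."""
    pad = "^" + text + "$"          # boundary markers (an assumption)
    j = i + 1                        # index of text[i] inside pad
    return [
        "uni=" + pad[j],             # the character itself
        "bi_left=" + pad[j - 1:j + 1],   # previous + current character
        "bi_right=" + pad[j:j + 2],      # current + next character
        "tri=" + pad[j - 1:j + 2],       # previous + current + next
    ]


class NaiveBayesSegmenter:
    """Naive Bayes over the features above; label True = break after char."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # Laplace smoothing constant
        self.class_counts = Counter()
        self.feat_counts = {True: Counter(), False: Counter()}
        self.vocab = set()

    def fit(self, sentences):
        """Train on a list of non-empty, already segmented sentences."""
        text = "".join(sentences)
        breaks, pos = set(), 0
        for s in sentences:
            pos += len(s)
            breaks.add(pos - 1)                  # last character of each sentence
        for i in range(len(text)):
            label = i in breaks
            self.class_counts[label] += 1
            for f in features(text, i):
                self.feat_counts[label][f] += 1
                self.vocab.add(f)
        return self

    def _log_score(self, feats, label):
        total = sum(self.class_counts.values())
        # Smoothed class prior, so a class unseen in training never gives log(0).
        score = math.log((self.class_counts[label] + 1) / (total + 2))
        denom = (sum(self.feat_counts[label].values())
                 + self.alpha * len(self.vocab))
        for f in feats:
            score += math.log((self.feat_counts[label][f] + self.alpha) / denom)
        return score

    def segment(self, text):
        """Split unpunctuated text wherever 'break' outscores 'no break'."""
        out, start = [], 0
        for i in range(len(text)):
            feats = features(text, i)
            if self._log_score(feats, True) > self._log_score(feats, False):
                out.append(text[start:i + 1])
                start = i + 1
        if start < len(text):
            out.append(text[start:])
        return out


def f_measure(predicted, gold):
    """Precision, recall, and F over sentence-end offsets.

    The final offset always matches, which slightly flatters the score.
    """
    def ends(sents):
        pos, out = 0, set()
        for s in sents:
            pos += len(s)
            out.add(pos)
        return out

    p, g = ends(predicted), ends(gold)
    tp = len(p & g)
    precision = tp / len(p) if p else 0.0
    recall = tp / len(g) if g else 0.0
    f = (2 * precision * recall / (precision + recall)
         if precision + recall else 0.0)
    return precision, recall, f


# Toy usage with made-up strings (not real corpus data).
model = NaiveBayesSegmenter().fit(["甲乙也", "丙丁也", "甲丁也"])
predicted = model.segment("甲乙也丙丁也")
print(predicted, f_measure(predicted, ["甲乙也", "丙丁也"]))
```

Comparing feature combinations, as the abstract describes, would amount to editing the list returned by `features`, retraining, and keeping the variant with the highest F-value on held-out text.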

Description

Technical field

[0001] The invention belongs to the field of language processing, and in particular relates to a method for segmenting medical ancient Chinese sentences based on Bayesian statistical learning.

Background

[0002] Natural language processing technology is strongly language-dependent. Abroad, relatively mature language processing technology has been applied to the processing of medical information and patient case histories, helping doctors extract key information from large volumes of medical data, transform it into an effective knowledge system, and apply it to related work. In China, major medical institutions in provinces and cities across the country are also working intensively on the modern intelligent processing of big data in the medical field.

[0003] A large number of medical Chinese ancient books are collected in libraries all over the country and major scientific research ins...


Application Information

IPC(8): G06F17/27, G06F17/30
CPC: G06F16/35, G06F40/211, G06F40/284, G06F40/289
Inventors: 王亚强, 刘胤, 唐聃, 舒红平
Owner: CHENGDU UNIV OF INFORMATION TECH