Unlock instant, AI-driven research and patent intelligence for your innovation.

Medicine word segmentation searching method and system thereof

A search method and word segmentation technology, which is applied in drug search and Internet fields, and can solve problems such as fuzzy, unsatisfactory precise search, lost word search, and increased server processing pressure.

Active Publication Date: 2020-09-18
壹药网科技(上海)股份有限公司
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] It can be seen that the word segmentation results of such drugs are too scattered and vague. When the user enters the drug name, only the content containing these words can be searched, resulting in a large number of weakly relevant content being recalled. The demand for word search forces users to change keywords for multiple searches, which affects the experience and increases the processing pressure on the server

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medicine word segmentation searching method and system thereof
  • Medicine word segmentation searching method and system thereof
  • Medicine word segmentation searching method and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In the following description, many technical details are proposed in order to enable readers to better understand the application. However, those skilled in the art can understand that the technical solutions claimed in this application can be realized even without these technical details and various changes and modifications based on the following implementation modes.

[0050] In order to make the purpose, technical solution and advantages of the present application clearer, the implementation manner of the present application will be further described in detail below in conjunction with the accompanying drawings.

[0051] The first embodiment of the present application relates to a drug word segmentation search method, the process of which is as follows Figure 1-2 As shown, the method includes the following steps:

[0052] Steps 110-120: Establish a drug dictionary in advance based on existing drug data, and set a rule dictionary, wherein the rule dictionary includ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The application relates to the technical field of the Internet, and discloses a medicine word segmentation searching method and a system thereof. The method comprises the following steps of: establishing a medicine dictionary in advance according to the existing medicine data, and setting a rule dictionary; performing multi-path word segmentation on an input search character string according to the medicine dictionary to obtain a multi-path word segmentation result, wherein if the number of single words of at least one group of continuous single words in the coarsest granularity path of the multi-path word segmentation result is within a preset range, word segmentation is performed on the search character string according to feature words in the rule dictionary; and performing medicine search by using the multi-path word segmentation result. The segmentation efficiency and accuracy of the new words and the unmarked words in the pharmaceutical industry are higher, and the cost of manualmarking can be reduced.

Description

technical field [0001] This application relates to the technical field of the Internet, in particular to the technical field of drug search. Background technique [0002] At present, drug search through the Internet has become more and more common. The current mainstream word segmentation methods in the industry are mainly expanding and extending around the three directions of dictionary-based, statistics-based, and understanding-based. Although these conventional methods basically meet the needs of modern Chinese, The word segmentation of everyday language, but due to the particularity of drug search, medical vocabulary has the characteristics of many remote words, vague meanings, and vague semantics, which makes the existing model unable to meet the word segmentation needs of the pharmaceutical industry. [0003] For example, the common drug name: Wangao Irbesartan Hydrochlorothiazide Dispersible Tablets, the results obtained by the native model of many tokenizers (such as...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G16H50/70G16H70/40G06F40/242G06F40/247G06F40/284
CPCG16H50/70G16H70/40G06F40/242G06F40/247G06F40/284
Inventor 卓建飞胡茂华王新岐
Owner 壹药网科技(上海)股份有限公司