Terminology standardization method, system and corresponding equipment and storage medium

A terminology standardization technique, applied in character and pattern recognition, natural language data processing, instruments, etc. It addresses the difficulty of choosing a similarity threshold, where a high threshold hurts recall and a low threshold hurts accuracy, so as to improve matching accuracy while preserving recall.

Active Publication Date: 2021-04-06
望海康信(北京)科技股份公司

AI Technical Summary

Problems solved by technology

However, the similarity computed by traditional algorithms is a single scalar value, which is insufficient for terminology standardization in some specialized fields, and the beginning or end of a term is often neglected.
Some terms are highly similar to one another and cannot be distinguished by a threshold.
If the threshold is too high, recall suffers; if the threshold is too low, accuracy suffers, so a good compromise is hard to find.


Examples

Detailed Description of the Embodiments

[0035] Embodiments and examples of the present invention will be described in detail below with reference to the drawings.

[0036] The scope of applicability of the present invention will become apparent from the detailed description given below. It should be understood, however, that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are given for purposes of illustration only.

[0037] In some domains, such as the medical field, certain terms are highly similar to one another. For example, the standard medical service terms pulmonary valve replacement, aortic valve replacement, cardiac valve replacement surgery, and aortic valvuloplasty are all highly similar to each other. Traditional similarity algorithms have a hard time telling them apart, so a term may not be matched to the correct standard term during standardization. The present invention addresses this problem.
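The threshold problem can be illustrated on the four terms named above. The following is a minimal sketch, not the method of the application: token-level Jaccard similarity stands in for a "traditional" scalar similarity, and the query string and both thresholds are hypothetical values chosen for illustration.

```python
# Scalar similarity with a single threshold, using the standard terms from [0037].
# Assumption: token-level Jaccard similarity is used as a stand-in for a
# traditional scalar similarity algorithm; the source does not name one.

STANDARD_TERMS = [
    "pulmonary valve replacement",
    "aortic valve replacement",
    "cardiac valve replacement surgery",
    "aortic valvuloplasty",
]


def jaccard(a: str, b: str) -> float:
    """Size of the shared token set divided by the size of the combined token set."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def matches(query: str, threshold: float) -> list[str]:
    """Return every standard term whose scalar similarity reaches the threshold."""
    return [term for term in STANDARD_TERMS if jaccard(query, term) >= threshold]


query = "aortic valve replacement surgery"  # hypothetical non-standard input term

# A low threshold lets through more than one candidate (accuracy suffers).
print(matches(query, 0.5))

# A high threshold rejects every candidate (recall suffers).
print(matches(query, 0.8))
```

With these inputs the lower threshold returns two candidate terms and the higher threshold returns none, which is the trade-off the paragraph describes.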

[0038] figure 1 A flow ...

Abstract

The present application discloses a term standardization method, system, corresponding device and storage medium. The method includes: performing word segmentation, part-of-speech tagging and entity recognition on each standard term, wherein the entity recognition result includes a word type label; generating a first reference text space vector for each standard term; constructing a vector search model from the first reference text space vectors; performing word segmentation, part-of-speech tagging and entity recognition on the term to be standardized; generating a text space vector to be standardized; retrieving from the vector search model the M reference text space vectors with the highest similarity; calculating the similarity in each word type label dimension; calculating the total similarity; and taking the standard term corresponding to the reference text space vector with the highest total similarity as the standard term for the term to be standardized. The invention both preserves recall and improves matching accuracy.
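The steps listed in the abstract can be sketched end to end as follows. This is an illustrative sketch under stated assumptions, not the implementation of the application: a hand-written lexicon stands in for word segmentation, part-of-speech tagging and entity recognition; the word type labels (`site`, `procedure`, `other`), their weights, the per-label bag-of-words vectors, the brute-force cosine search and the weighted-sum total are all hypothetical choices.

```python
import math
from collections import Counter

# Hypothetical word type labels and weights (not taken from the source).
WEIGHTS = {"site": 0.5, "procedure": 0.4, "other": 0.1}

# Hand-written lexicon standing in for segmentation, tagging and entity recognition.
LEXICON = {
    "pulmonary": "site", "aortic": "site", "cardiac": "site", "valve": "site",
    "replacement": "procedure", "valvuloplasty": "procedure",
}

STANDARD_TERMS = [
    "pulmonary valve replacement",
    "aortic valve replacement",
    "cardiac valve replacement surgery",
    "aortic valvuloplasty",
]


def vectorize(term: str) -> dict[str, Counter]:
    """Build one bag-of-words vector per word type label."""
    vec = {label: Counter() for label in WEIGHTS}
    for word in term.lower().split():
        vec[LEXICON.get(word, "other")][word] += 1
    return vec


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity; two empty dimensions agree (1.0), one empty does not (0.0)."""
    if not a and not b:
        return 1.0
    if not a or not b:
        return 0.0
    dot = sum(a[k] * b[k] for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def flatten(vec: dict[str, Counter]) -> Counter:
    """Concatenate the label dimensions into one vector for the coarse search."""
    return Counter({(label, w): n for label, c in vec.items() for w, n in c.items()})


# "Vector search model": a brute-force index over the reference vectors.
INDEX = [(term, vectorize(term)) for term in STANDARD_TERMS]


def standardize(term: str, m: int = 3) -> str:
    query = vectorize(term)
    flat_query = flatten(query)
    # Step 1: retrieve the M reference vectors with the highest overall similarity.
    top_m = sorted(INDEX, key=lambda item: cosine(flat_query, flatten(item[1])),
                   reverse=True)[:m]

    # Step 2: per-label similarities, combined into a weighted total.
    def total(ref: dict[str, Counter]) -> float:
        return sum(w * cosine(query[label], ref[label]) for label, w in WEIGHTS.items())

    # Step 3: the standard term with the highest total similarity wins.
    return max(top_m, key=lambda item: total(item[1]))[0]


print(standardize("replacement of the aortic valve"))
```

In this sketch the coarse search keeps recall by returning several candidates without a hard threshold, while the per-label comparison separates candidates that differ only in one dimension, such as the anatomical site. A production system would presumably replace the lexicon with trained taggers and the brute-force search with a dedicated vector index.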

Description

Technical field

[0001] The present application relates to the field of electronic digital data processing, and in particular to a term standardization method, system, corresponding equipment and storage medium.

Background art

[0002] In many industries, for historical and regional reasons, each organization's data uses its own set of terminology. This inconsistency severely restricts the development of informatization. As technology has developed, the relevant state departments have successively issued terminology norms and standards for various fields. However, mapping existing terms to the national standards is laborious: without good technical means it can only be done manually, which is very time-consuming and labor-intensive. The method currently popular in the industry is to use computer algorithms for standardized mapping. By calculating the similarity between the original term and the standard term, when the s...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/289, G06F40/295, G06F40/247, G06K9/62
CPC: G06F40/289, G06F40/295, G06F40/247, G06F18/22
Inventor: 张俊锋, 程煜华, 黄俊杰, 侯丹丹, 翟文丽
Owner: 望海康信(北京)科技股份公司