Patent term extraction method

A patent term extraction method, applied in the field of text processing, that addresses the limited generalizability of existing term recognition approaches and achieves better extraction results, with improved accuracy and recall.

Pending Publication Date: 2021-05-11
BEIJING INFORMATION SCI & TECH UNIV
Cites: 0; Cited by: 12

AI Technical Summary

Problems solved by technology

Although statistical and rule-based methods have achieved some results, entity recognition for Chinese terms in specialized fields (such as new energy) still relies on manually defined features and domain expertise. Because these methods are tied to the conditions of a specific field, they cannot be generalized and widely applied.



Embodiment Construction

[0038] To make the purpose, technical solution, and advantages of the present application clearer, the application is further described below with reference to the accompanying drawings and specific embodiments. The specific embodiments described here are used only to explain the present application, not to limit it. All other embodiments that persons of ordinary skill in the art obtain from the embodiments in this application without creative effort fall within the scope of protection of this application.

[0039] Those skilled in the art can understand that, unless otherwise defined, all terms (including technical terms and scientific terms) used herein have the same meanings as commonly understood by those of ordinary skill in the art to which this application belongs. It should also be understood that terms, such as those defined in commonly used dictionaries, should be understood to have me...


Abstract

The invention discloses a patent term extraction method comprising the following steps: a BERT pre-trained language model layer converts each character of a labeled character-level corpus into a word vector; the word vectors are fed into a BiLSTM layer for semantic encoding, which automatically extracts sentence features; and a CRF layer decodes and outputs the predicted label sequence with the maximum probability, giving the label type of each character, from which the entities in the sequence are extracted and classified. By using BERT to vectorize patent text from a specialized field, the method provided by the embodiment of the invention effectively improves the accuracy of the term extraction results. Its extraction performance is better than that of current mainstream deep learning term extraction models; accuracy, recall, and F1 value are markedly improved on specialized-field patent text; and long-sequence terms with many characters can be identified accurately and quickly.
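The final two steps of the abstract (CRF decoding of the maximum-probability label sequence, then extracting entities from the per-character labels) can be illustrated with a minimal sketch. It is not the patented implementation: the BERT and BiLSTM layers are omitted, and the emission scores, transition scores, the B-TERM/I-TERM/O label set, and the function names `viterbi_decode` and `extract_entities` are all assumptions made up for illustration.

```python
from typing import Dict, List, Tuple

LABELS = ["O", "B-TERM", "I-TERM"]  # assumed BIO label set, not from the patent


def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[Tuple[str, str], float],
    labels: List[str],
) -> List[str]:
    """Return the highest-scoring label path (the CRF decoding step).

    emissions[t][y] is the score of label y for character t; in the described
    pipeline these would come from the BiLSTM layer. transitions[(p, y)] is the
    score of moving from label p to label y (missing pairs score 0.0).
    """
    best = [{y: emissions[0][y] for y in labels}]  # best path score ending in y
    back: List[Dict[str, str]] = []                # back-pointers per position
    for t in range(1, len(emissions)):
        scores, pointers = {}, {}
        for cur in labels:
            prev = max(labels, key=lambda p: best[-1][p] + transitions.get((p, cur), 0.0))
            scores[cur] = best[-1][prev] + transitions.get((prev, cur), 0.0) + emissions[t][cur]
            pointers[cur] = prev
        best.append(scores)
        back.append(pointers)
    path = [max(labels, key=lambda y: best[-1][y])]
    for pointers in reversed(back):  # follow back-pointers from the last position
        path.append(pointers[path[-1]])
    path.reverse()
    return path


def extract_entities(chars: List[str], tags: List[str]) -> List[Tuple[str, str, int]]:
    """Collect (text, type, start_index) spans from character-level BIO tags."""
    entities = []
    start, kind = None, None
    for i, tag in enumerate(tags + ["O"]):  # trailing sentinel flushes the last span
        continues = tag.startswith("I-") and kind == tag[2:]
        if start is not None and not continues:
            entities.append(("".join(chars[start:i]), kind, start))
            start, kind = None, None
        if tag.startswith("B-"):
            start, kind = i, tag[2:]
    return entities


# Hand-written scores for the five characters of "新型锂电池" (illustrative only).
chars = list("新型锂电池")
emissions = [
    {"O": 2.0, "B-TERM": 0.5, "I-TERM": 0.1},
    {"O": 1.5, "B-TERM": 0.4, "I-TERM": 0.3},
    {"O": 0.2, "B-TERM": 1.0, "I-TERM": 1.2},
    {"O": 0.1, "B-TERM": 0.3, "I-TERM": 2.0},
    {"O": 0.3, "B-TERM": 0.1, "I-TERM": 1.8},
]
transitions = {
    ("O", "I-TERM"): -10.0,      # an I- tag should not directly follow O
    ("B-TERM", "I-TERM"): 1.0,
    ("I-TERM", "I-TERM"): 0.5,
}

decoded = viterbi_decode(emissions, transitions, LABELS)
terms = extract_entities(chars, decoded)
print(decoded, terms)
```

In this toy input the third character scores slightly higher as I-TERM than as B-TERM on emissions alone, so a per-character argmax would produce an invalid sequence. The transition scores are what let the decoder choose a well-formed path, which is the usual motivation for placing a CRF layer after the BiLSTM.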

Description

Technical field

[0001] This application relates to the technical field of text processing, in particular to a patent term extraction method.

Background art

[0002] The automatic extraction of technical terms plays an important role in domain machine translation and domain knowledge graphs. The average annual growth rate of patent applications remains stable, so how to translate patent texts more quickly and accurately has become a problem worthy of attention. There are currently two main ways to translate patent documents. One is translation by professionals in the field. However, high-quality translation is a demanding and time-consuming task that places high requirements on human translators, and translators who meet these requirements are in short supply. Moreover, because of the novelty, reliability and authority of patent documents, translators need to use the domain terminology list to grasp the accurate translation of domain terms in ...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F16/35, G06F40/295, G06F40/30, G06N3/04, G06N3/067
CPC: G06F16/35, G06F40/30, G06F40/295, G06N3/0675, G06N3/044, G06N3/045
Inventor: 吕学强, 游新冬, 董志安
Owner BEIJING INFORMATION SCI & TECH UNIV