
Element extraction method and device, electronic equipment and storage medium

An element extraction technique, applied in the field of natural language processing, that addresses the increased input length, reduced coding efficiency, and large storage footprint of existing methods, thereby saving storage space and improving coding efficiency.

Pending Publication Date: 2022-03-25
IFLYTEK (SUZHOU) TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] The present invention provides an element extraction method, device, electronic equipment and storage medium to remedy a defect of prior-art element extraction methods, in which splicing several words onto the original sentence increases the input length, reduces coding efficiency, and occupies a large amount of storage space.



Examples


Embodiment Construction

[0049] To make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are described clearly and completely below in conjunction with the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative effort fall within the protection scope of the present invention.

[0050] In recent years, with the growth of computing power and advances in neural network theory, the application of deep learning to feature extraction has made great progress, and different variants of model encoders (CNN, RNN, Transformer, etc.) have successively been shown to be effective on the task of Named Entity Recognition (NER).

[0051]The representative model of fu...


Abstract

The invention provides an element extraction method and device, electronic equipment, and a storage medium. The method comprises: obtaining a text to be extracted and a vocabulary set for that text; determining the relevancy between every two characters in the text based on the result of matching the character string corresponding to those two characters against the vocabulary set, where the character string is the substring of the text that takes the two characters as its starting point and ending point; encoding each character in the text based on the relevancy between every two characters to obtain an element boundary feature for each character; and determining an element extraction result for the text based on the element boundary features of the characters. With the provided method, device, electronic equipment, and storage medium, the matched vocabulary does not need to be spliced onto the original sentence and the original input length is unchanged, which improves coding efficiency. Compared with existing vocabulary splicing methods, storage space is also saved.
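The pairwise matching step can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `relevancy_matrix`, the binary 0/1 relevancy values, the restriction to pairs with the start character before the end character, and the sample text and vocabulary are all assumptions made for illustration. The abstract does not specify how relevancy is scored or how it feeds the encoder.

```python
from typing import List, Set


def relevancy_matrix(text: str, vocabulary: Set[str]) -> List[List[int]]:
    """Build an n x n matrix over the characters of `text`.

    Entry [i][j] (i < j) is 1 when the substring that starts at character i
    and ends at character j appears in `vocabulary`, otherwise 0.
    Binary scoring is an assumption; the source only says relevancy is
    derived from the matching result.
    """
    n = len(text)
    rel = [[0] * n for _ in range(n)]
    for start in range(n):
        for end in range(start + 1, n):
            # Substring with the two characters as start and end points.
            if text[start:end + 1] in vocabulary:
                rel[start][end] = 1
    return rel


# Hypothetical input: a character sequence and a small vocabulary set.
sample_text = "newyorkcity"
sample_vocab = {"new", "york", "city", "newyork"}

rel = relevancy_matrix(sample_text, sample_vocab)

# The matrix is always len(text) x len(text): matched words are recorded as
# character-pair relations, so nothing is appended to the input sequence.
matched_pairs = [
    (i, j)
    for i, row in enumerate(rel)
    for j, value in enumerate(row)
    if value
]
print(matched_pairs)
```

In this sketch the vocabulary only affects which cells of the matrix are set, not the length of the sequence passed to the encoder, which is the property the abstract credits for the efficiency gain.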

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, in particular to an element extraction method, device, electronic equipment and storage medium.

Background technique

[0002] Element extraction mainly consists of extracting structured information from unstructured text, and it is a very important subfield of natural language processing. Introducing lexical information into feature extraction has attracted growing attention from researchers; in professional fields with insufficient corpora in particular, the role of domain vocabulary is even more significant.

[0003] At present, element extraction methods that integrate lexical information usually need to concatenate several words onto the original sentence, which increases the input length and reduces coding efficiency. In addition, the size of the word embedding layer (Embedding) of the existing feature extraction model is directly propor...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/31, G06F40/126, G06F40/284, G06F40/30, G06F16/903
CPC: G06F16/313, G06F40/126, G06F40/284, G06F40/30, G06F16/90344
Inventor: 刘加新, 李直旭, 郑新, 陈志刚, 陈耀玲, 李明洹, 支洪平
Owner: IFLYTEK (SUZHOU) TECH CO LTD