Signature information extraction method and device

A technology of signature information and extraction method, which is applied in the computer field to achieve the effect of reducing negative impact and precise extraction

Active Publication Date: 2019-03-12
BEIJING KNOWNSEC INFORMATION TECH
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In order to overcome the above-mentioned deficiencies in the prior art, the purpose of this application...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Signature information extraction method and device
  • Signature information extraction method and device
  • Signature information extraction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. The components of the embodiments of the application generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations.

[0056] Accordingly, the following detailed description of the embodiments of the present application provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the present application. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative work, all belong to the scope of protection of this application.

[0057]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the present application provides a signature information extraction method and device, which can extract the signature information of a rule very quickly and conveniently by extracting the structured information in each statement separately by using a regular expression. Extracting unstructured information uses machine learning classification model and character granularity sequence annotation, which can solve the limitation of traditional way of using mail template alignment to get extracted information. In the implementation process, the TF-IDF word frequency feature and tagged sequence feature are extracted, and the extracted TF-IDF word frequency feature and annotation sequence feature are inputted into address binary classification model and character granularity sequence annotation model respectively, and the name information and address information in each sentence are obtained. Thus, by extracting the TF-IDF word frequency features, the address information canbe identified completely, and the tagged sequence features are used to greatly reduce the negative impact of wrong word segmentation on the identification of name information, so as to accurately extract the mail signature information.

Description

technical field [0001] This application relates to the field of computer technology, in particular, to a method and device for extracting signature information. Background technique [0002] The traditional email signature extraction method generally adopts email template comparison, but it has great limitations. It is generally only suitable for email signature extraction in standard format. If the email to be extracted does not match the standard template, the extraction result accuracy can be greatly affected. Another method is to segment the full text of the email, and extract the name and other entity information in the email according to the characteristics of each word and its context. However, this method is greatly affected by word segmentation tools, and there are often names extracted after word segmentation. Entity information is partially lost, or incomplete, or there are redundant words, which will also have a great impact on the accuracy of the extraction res...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/216G06F40/295G06F18/24Y02D10/00
Inventor 邹晶岳永鹏
Owner BEIJING KNOWNSEC INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products