Meta-content analysis and annotation of email and other electronic documents

a technology of metadata analysis and annotation, applied in the field of electronic documents, can solve problems such as being lost in the noise of surrounding tex

Inactive Publication Date: 2007-02-13
SAP AMERICA
View PDF57 Cites 125 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0014]combining said meta-content index with said header and said body to provide an enhanced document; and

Problems solved by technology

Some lexical items within an email document contain greater inherent semantic weight (e.g. dates, email addresses, names of people, names of organizations), but receive no special markings and can thereby become lost in the noise of the surrounding text.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Meta-content analysis and annotation of email and other electronic documents
  • Meta-content analysis and annotation of email and other electronic documents
  • Meta-content analysis and annotation of email and other electronic documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031]In general, the invention comprises a system to perform meta-content analysis and annotation of an outbound email, or other electronic documents. A preferred embodiment of the system includes the following components:[0032]1. A system to separate the body of an email from the email header.[0033]2. A system to open attachments within the email.[0034]3. A system to extract the email send date from the email header.[0035]4. A system to perform named entity meta-content extraction across the email body and attachments, and optionally the email header.[0036]5. A system to normalize the email send date, instances of dates with the email body and attachments, and instances of currency within the email body and attachments to canonical (conforming to a general rule or acceptable procedure) representations.[0037]6. A system to color-code dates, appearing both in the meta-content index and in the email body, to indicate temporal proximity to the email send date.[0038]7. A system for dis...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Meta-content analysis and annotation upon the body of email documents, and other electronic documents, and to create a displayable index of these instances of meta-content, which is sorted and annotated by type are provided. In addition, the electronic document is enhanced by providing links for the semantic foci to external documents containing related information. An electronic document adapted for delivery to one or more recipients, the electronic document including a header and a body, is processed by:performing meta-content extraction of semantic foci within said header and said body, the semantic foci comprising a plurality of type of information including one or more of email addresses, URLs, dates, currency values, organization names, names of people, names of places, and phone numbers;creating a meta-content index the document based upon said extracted semantic foci;arranging the meta-index according to said plurality of types;combining said meta-content index with said header and said body to provide an enhanced document; andsending said enhanced document to said one or more recipients via a communication network.The process includes converting the electronic mail document to a markup language format, and wherein said meta-content index comprises one or more objects expressed in said markup language adapted for presentation with body in said enhanced document.

Description

BACKGROUND OF THE INVENTION[0001]1. Field of the Invention[0002]The present invention relates to electronic documents intended for transmission to recipients, such as email documents produced and transmitted by electronic mail services. The invention also relates to the creation, extraction, presentation and other actions related to meta-content for such documents.[0003]2. Description of Related Art[0004]The present format typical email documents is analogous to that of the isolated page of text before the advent of hypertext linking. With the exception of sender-specified attachments and in-line universal resource locators (URL's), email documents are, for the most part, static text. Semantic foci within the text of email documents are often difficult to identify for use by recipients.[0005]Meta-content has been used for analyzing electronic documents in a variety of settings. Such meta-content may be descriptive matter, extrapolation, summary and / or interpolation of content that e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(United States)
IPC IPC(8): G06N3/00G06Q10/10H04L12/58
CPCG06Q10/107H04L51/063H04L12/583
Inventor MEYER, DAVIDBERNSTEIN, STEVEN MILLER
Owner SAP AMERICA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products