Template-based structured document classification and extraction
A structured document and document classification technology, applied in structured data retrieval, calculation model, database model, etc., can solve the impractical problem of reverse engineering of data extraction template
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0024] figure 1 The figure shows an example environment in which a corpus of structured documents 100 can be clustered into clusters 132 1-m , and wherein clusters containing structured documents can be analyzed to generate data extraction templates 134 1-m . As used herein, "structured documents" may refer to B2C communications such as emails, text messages (eg SMS, MMS), instant messages and any other that are typically (but not always) automatically generated eg using templates B2C communication. Additionally, in some implementations, structured documents may include other types of documents, such as letters (e.g., in Portable Document Format (“PDF”) and / or word processing formats), invoices, bills, receipts, invitations (e.g. Invitations received via social networking applications) or other structured documents that may not be considered communications and / or attachments to other communications (eg, email). In various implementations, structured documents may be struct...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


