Method and apparatus for sociological data mining

a sociological data and mining technology, applied in the field of electronic documents, can solve the problems of large number of documents being returned, data retrieval techniques suffering, and the precision and usability of knowledge management and search technology has not kept pa

Inactive Publication Date: 2006-11-09
SUNRISE SERIES 54 OF ALLIED SECURITY TRUST I
View PDF11 Cites 254 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But the precision and usability of knowledge management and search technology has not kept pace.
These data retrieval techniques suffer from two fundamental flaws.
Firstly, they often result in either vast numbers of documents being returned, or, if

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for sociological data mining
  • Method and apparatus for sociological data mining
  • Method and apparatus for sociological data mining

Examples

Experimental program
Comparison scheme
Effect test

example 1

An Email Containing Only the Following Text

[0460] Hey, gang. I just got back from Comdex and saw some of the new specialized speech recognition headsets that Acme was demonstrating. I think they'd be perfect for our automated transcription project. Do you think there's money left in the discretionary budget for these? If there is, who do I need to talk to about getting a purchase order?

example 2

A Long Document Formalizing a Company's Sexual Harassment Policy Could be Tagged as “Initiator,+,−,−”

[0461] Such a textblock would very likely be written with few or no first or second person pronouns, indicating formality and the intention of reaching a broad, nonspecific audience. Simple counts of the number of paragraphs and average number of sentences per paragraph suffice to identify the structure of most such documents 505 as formal. Finally, it can be assumed that any such policy statement will contain a relatively high count of named entities (company names, responsible parties, and the like) that would indicate its key role as an introductory or summary document 505 in an information flow.

[0462] The existence of tag sequences permits an additional query type for information retrieval. Specifically, locating documents that affirm or deny a previous statement or request, but are lacking in content themselves, are made available to users of the system by means of this set of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A processing system for retrieving interrelated documents is described. The system comprises a document repository for storing a plurality of documents, a metadata repository for storing a plurality of metadata elements to represent relations between the documents, and a sociological analysis engine to identify relationships between the documents using the metadata elements from the metadata repository.

Description

FIELD OF THE INVENTION [0001] The present invention relates to electronic documents, and more particularly to a method for retrieving a document or (more typically) a group of documents that satisfies a user-defined criterion or set of criteria. Additionally, the invention relates to the detection of patterns among these documents, and the various actors operating upon them. BACKGROUND [0002] The volume of electronic information in both personal and corporate data stores is increasing rapidly. Examples of such stores include electronic mail (e-mail) messages, word-processed and text documents, contact management tools, and calendars. But the precision and usability of knowledge management and search technology has not kept pace. The vast majority of searches performed today are still keyword searches or fielded searches. A keyword search involves entering a list of words, which are likely to be contained within the body of the document for which the user is searching. A fielded sear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F7/00G06Q10/00
CPCG06F17/30716Y10S707/99935G06Q10/10G06F17/30722G06F16/38G06F16/34
Inventor CHARNOCK, ELIZABETHROBERTS, STEVEN L.HOLSINGER, DAVID J.
Owner SUNRISE SERIES 54 OF ALLIED SECURITY TRUST I
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products