Enron Emails 2011-04-02 (Zipped HTML) [CALO] [CRYPTOME]

seeders: 1
leechers: 0
Added on July 18, 2015 by Cryptomein Other > Unsorted
Torrent verified.



Enron Emails 2011-04-02 (Zipped HTML) [CALO] [CRYPTOME] (Size: 422.93 MB)
 enron_mail_20110402.tar.gz422.93 MB


Description

This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation.

The email dataset was later purchased by Leslie Kaelbling at MIT, and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., recipient is specified in some parse-able format like "Doe, John" or "Mary K. Smith") and to no_address@enron.com when no recipient was specified.

Sharing Widget


Download torrent
422.93 MB
seeders:1
leechers:0
Enron Emails 2011-04-02 (Zipped HTML) [CALO] [CRYPTOME]