I've developed a simple script that allows one to export CEF events directly from archive files.
Brief disclaimer - this tool is officially not supported and not maintained. I'm providing this script here in the hopes someone finds it useful. If you make any improvements, please feel free to share them back with the community. What follows is the README from the tarball, as it has some good examples and instructions.
This is a simple utility that exports CEF records from a Logger archive file. It prints them to stdout by design, allowing the user to redirect them to a file or pipe them into something else (grep, awk, whatever) for further manipulation.
Written in python (targetting 2.6.x) and using only the standard libraries that should be available on all RHEL installations, this should be fairly self contained.
$ ./lacat -h
Usage: lacat [options] path_to_dat path_to_meta
Extracts cef events from Logger Archive files to stdout
THIS SOFTWARE IS NOT SUPPORTED. USE AT YOUR OWN RISK.
Why is it called lacat?
Because "Logger_Archive_cat" was too long to type.
-h, --help show this help message and exit
-j, --json export as json instead of raw cef
-f FILTER, --filter=FILTER
specify a key=val to filter records by. multiple -s
The usage is hopefully quite straightforward and the implementation fast enough. I'm still optimizing it a bit to squeeze a bit more performance so check back here for revisions.
Place the file lacat in your path and make the file executable:
chmod x lacat
Export raw CEF and capture in the file outfile.cef
./lacat ArcSight_Data_1_0504403158265495556.dat ArcSight_Metadata_1_504403158265495556.csv > outfile.cef
Export all CEF records, one per line in JSON format, and capture in outfile.json
./lacat -j ArcSight_Data_1_0504403158265495556.dat ArcSight_Metadata_1_504403158265495556.csv > outfile.json
Filter results by limiting output to destination IP 10.0.0.1
./lacat -f dst=10.0.0.1 ArcSight_Data_1_0504403158265495556.dat ArcSight_Metadata_1_504403158265495556.csv
Filter results by limiting output to destination IP 10.0.0.1 and UDP events only.
./lacat -f dst=10.0.0.1 -s proto=UDP ArcSight_Data_1_0504403158265495556.dat ArcSight_Metadata_1_504403158265495556.csv
Multiple -s options can be specified to create an AND condition. You can always specify -j to get each record output in JSON for ease of parsing with other languages.