Primary Lines in this Script
Line 7
- Store a vector myvector in the database totalwinnings that keeps track of the maximum value of the bet column and aggregates the values in the winnings column for each unique value of the user column.
Line 11
- Tell Essentia to look for data on your local datastore.
Line 13
- Create a new rule to take any files in your home directory with ‘onlinecasino’ in their name and put them in the casino category. Also tell Essentia not to look for a date in the filenames.
Line 17
- Pipe all files in the category casino to the aq_pp command.
- In the aq_pp command, tell the preprocessor to take data from stdin, ignoring errors and skipping the first line (the header).
- Then define the incoming data’s columns, skipping the second and fifth columns (time and country), and import the data to the vector in the totalwinnings database so the attributes listed there can be applied.
Line 19
- Export the modified and aggregated data from the database and save the results to a csv file.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 | ess cluster set local
ess purge local
ess udbd stop
ess server reset
ess drop database totalwinnings
ess create database totalwinnings
ess create vector myvector s,pkey:user i,+max:bet f,+add:winnings
ess udbd start
ess select local
ess category add casino "$HOME/EssentiaPublic/*onlinecasino*" --dateregex none
ess summary
ess stream casino "*" "*" "aq_pp -f,+1,eok - -d s:user X i:bet f:winnings X -udb -imp totalwinnings:myvector" --debug
ess exec "aq_udb -exp totalwinnings:myvector -o totalwinnings.csv" --debug
|