At Booking.com, we have a constant flow of events coming from various applications and internal subsystems (1.5 Billions events per day). This critical data needs to be stored for real-time, medium and long term analysis.
Our events are schema-less, making it difficult to use standard analysis tools.This presentation will explain how we built a storage and analysis solution based on Riak.
First, the talk will briefly show real events examples, and how we serialize and aggregate them.
Then, Riak configuration and data modeling will be detailed, including how data are sent to Riak, and read out of it.
Next will be a section that demonstrates Riak flexibility solutions via 2 real examples: how we cut in half the cluster internal network usage, and how we used post-commit hook to perform real-time data crunching on the cluster nodes.
Finally, the talk will present our solution using Yokozuna to build a Time Series Database on top of Riak, for near real-time ad hoc analysis of a portion of our data flow.
About Damien Krotkine
Damien Krotkine is a software engineer at Booking.com (world's leading online hotel and accommodation reservations company). He currently works on the events subsystem, where he helps gathering, storing, managing and analyzing big quantities of data in real-time. Previously, he has been working in various fields like Linux Distribution, e-commerce, online real-time advertising. He's an active member of the Perl community, maintaining some NoSQL related modules ( Redis driver, Riak client, Bloomd client ... )
Follow Damien @damsieboy