Via LANYRD

In this brief video, Jonathan Seidman and Ramesh Venkataramiah discuss the use of Hadoob, Hive for doing analytics on 500 GB /day data that gets generated via Orbitz.

A website like Orbitz generates millions of searches each day. Storing and processing the ever-growing volumes of data generated by all of those searches becomes prohibitive though traditional systems such as relational databases. This presentation details how Orbitz is using new tools such as Hadoop and Hive to meet these challenges. We’ll discuss how Hadoop and Hive are being used to analyze search data in order to optimize the products shown to users and detect trends in search keywords. This includes such tasks as using Hadoop to extract and transform data, and using Hive to perform statistical analysis on that data.

Can a similar setup be used to analyze high frequency trading data ? Looks like a possibility.