The following are the learnings from the podcast.

  • Data lake dream
  • Make data usable
  • Building an API that helps the user consume data
  • Don’t build any data without making it available for any organisation
  • Cloudera espouses the data lake
  • Need to get the search strategy right
  • Hadoop is similar to Linux
  • Free Linux: quick iteration, scale up, go mobile
  • Following Spark

Connecting dots

  • Spatio-temporal patterns
  • Used to study sports