About Me

I have been around the data and analytics space for about 20 years and have done Product Management, Professional Services and Pre-sales. I have started putting a few things together that I find interesting

  1. I keep hearing about ACID in the context of analytics and put something together on a way of looking at it in the context of Druid
  2. Analysing transactional data using Druid – Joining large tables at query time is expensive. I have put together a pipeline using Flink, Kafka and Druid so that the joins can be in Flink
  3. Student-t distribution with tuple sketch. Use a student t distribution to draw conclusions from a smaller sample
  4. Use Druid as feature store for model training