Clickstream Denormalization


Client challenges

Project scope

Our client, a large American retailer, wanted to run analytics on its clickstream data both in real time and on a daily basis.

Solution

Analytics Foundation

Datametica built a foundation layer for analytics on the clickstream data. The data is ingested in two ways: a daily batch cycle and a real-time feed. It is then cleansed, and a single denormalized layer is built by joining it with various lookup tables and files. All analytics run against this denormalized layer.
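The cleanse-then-join step can be illustrated with a minimal sketch. This is not the project's actual code, which used Pig, Hive and Java UDFs; it is plain Python for readability. The field names (`session_id`, `page_id`, `product_id`, `ts`), the lookup tables `PRODUCTS` and `PAGES`, and the functions `cleanse` and `denormalize` are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Optional

# Hypothetical lookup tables; a real pipeline would load these from files or tables.
PRODUCTS: Dict[str, dict] = {
    "p1": {"product_name": "Kettle", "category": "Kitchen"},
}
PAGES: Dict[str, dict] = {
    "pg_home": {"page_type": "home"},
    "pg_pdp": {"page_type": "product_detail"},
}


def cleanse(event: dict) -> Optional[dict]:
    """Return a normalized copy of the event, or None if required fields are missing."""
    session_id = (event.get("session_id") or "").strip()
    page_id = (event.get("page_id") or "").strip().lower()
    if not session_id or not page_id:
        return None  # drop records that cannot be attributed to a session or page
    product_id = (event.get("product_id") or "").strip().lower() or None
    return {
        "session_id": session_id,
        "page_id": page_id,
        "product_id": product_id,
        "ts": event.get("ts"),
    }


def denormalize(events: Iterable[dict],
                products: Dict[str, dict],
                pages: Dict[str, dict]) -> List[dict]:
    """Cleanse each event and widen it with columns from the lookup tables."""
    rows = []
    for raw in events:
        event = cleanse(raw)
        if event is None:
            continue
        row = dict(event)
        # Left-join semantics: keep the event even when a lookup key is unknown.
        row.update(pages.get(event["page_id"], {"page_type": None}))
        row.update(products.get(event["product_id"],
                                {"product_name": None, "category": None}))
        rows.append(row)
    return rows
```

Under these assumptions, the same function could serve both ingestion paths: the daily batch passes a full day's events, while the real-time path passes events as they arrive.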

Benefits

Real-time summaries

Building quick summaries in real time

Recommendations

Sending recommendations to customers to increase e-commerce sales

Tools / Technologies
  • Sqoop
  • Pig
  • Hive
  • Java to write UDFs