Header Distribution (Photograph-derived manufacture)

advancedanalytics

Client

Internet-based image publishing service

The client is a leading Internet-based image publishing service who wanted to find out the percentage distribution of all image metadata headers over a time dimension. These attributes are dynamic in nature as they vary with different makes and models.

Solution

Dynamic Hadoop model

Datametica built a solution to model the dynamism on Hadoop. The data is ingested into Hadoop every hour and analytics is carried out on these data sets to answer metadata header distribution.

Benefits

automatedworkloadmigrator
Segregation of details
support
Serves as base for Analytics
diamond
Cleansed Data
magic
Tools / Technologies Used
  • Pig
  • Hive
  • Shell
  • Scheduler - UC4
Top