Oracle long running job Offload to Hadoop (Online, Sports Goods)


Client challenges

Project Scope

Our client is a full-line sporting goods retailer offering a broad assortment of brand name sporting goods equipment, apparel, and footwear in a specialty store environment. The client wanted to offload long-running Oracle jobs to Hadoop. In Oracle, jobs were running for over six hours; the target was to run them in less than an hour.


Jobs Offload to Hadoop

Datametica helped the client in the following ways:

  • Sourced data from multiple data stores to Hadoop (AS400, Oracle and other files)
  • Eliminated the additional long-running ETL jobs
  • Enabled the job to run in 30 minutes on a small Hadoop cluster
  • Provided data feed to downstream systems like SAS


 Time planning

Business users have the data eight hours earlier than with the legacy ETL/Oracle solution; this helps them to plan the business cycle an entire day earlier and be ready to deploy their plans at start of the next business day

Better costs

More cost effective solution than Oracle



Offload to Hadoop made the data available on an analytics platform, enabling analytics to be performed on the same data

Tools / Technologies
  • Sqoop
  • Hive
  • Pig
  • HBase