Data lake Implementation

moderndataplatform

Client:

Leading insurance provider in the USA

Our client is the most significant insurance provider in the USA with over decades of industrial recognition.

Project scope:

The client was looking at reviving their Data Architecture by establishing a well-governed, full-fledged Big Data platform on Cloud for increased agility, lower cost and faster decision making.

Following are some of the critical needs of the client

  • The data lake platform should have functions that include integrated security authorization to the client’s LDAP / AD Authentication
  • Integrate and store data from multiple data sources
  • The data lake platform should be accessible by the client’s analytic toolsets: R, SAS and Tableau

                                                                        Our Solution:

Datametica built a single platform on Google Cloud which facilitates faster processing, Analytics and AI capabilities for the business use cases

Following are the processes involved :

  • Ingest data from multiple sources into the incoming layer
  • Perform Data Quality checks on the ingested data
  • Apply business logic on clean data for further implementation of analytics Use case
  • Expose data to Big query for further analysis
  • Restrict unauthorized access to data using IAM groups
  • Automation of jobs using JAMS

Google Cloud Products we used:

GCP products we used

Benefits:

costsavings
Scalability

Easily scalable to accommodate data growth, all types of computing requirements and variety of workloads

 

operational
Efficient & Accurate

The new model was able to churn the output in an efficient and accurate manner

diamond
Single point of truth

Simple data model and single point of truth, acting as consumption point for different applications

Top