847-505-9933 | +91 20 66446300 info@datametica.com

Community

Category Archive for: "Community"
In Blog Posted

Big Data 101

Author : Parampreet Singh INTRODUCTION Big Data is the ocean of information we swim in every day – vast zettabytes of data flowing from our computers, mobile devices, and machine sensors. With the right solutions, organizations can dive into all that data and gain valuable insights that were previously unimaginable.
Read More
In Blog Posted

Apache Ranger Authorization On HDFS

Author: Arjun More. Apache Ranger Authorization On HDFS Authorization, a function of specifying access rights to resources related to information security. Once the user is successfully authenticated, Authorization shall tell us what any given user can or cannot do inside Hadoop cluster. In HDFS this is primarily governed by file permissions. HDFS file permissions are […]
Read More
In Blog Posted

Metadata and its challenges

Author: Dhirendra Sinha – “A product enthusiast”. Metadata and its challenges   Before jumping onto why we require metadata and the importance of metadata for its Business Users, let us take a step back and explore what exactly is “Metadata”. The term metadata was coined in 1969 by Jack E. Myers for his Metamodel product […]
Read More
In Blog Posted

Format Preservation Encryption

By Hemanth Meka - Senior Associate, Big Data at DataMetica Introduction: FPE is a special type of encryption. While generating the cipher text there is a lot of interest in preserving the type and length so that the cipher text also looks like original text. For example consider phone number, because the phone number is […]
Read More
In Blog Posted

Big Data and the Rise of the Enterprise Data Hub

Dr. Phil Shelley, President, DataMetica Solutions Incorporated and former CIO and recently CTO of Sears Holdings. Dr. Shelley has many years’ experience in CIO/CTO and business leadership roles, now working as part of the DataMetica team to bringing Hadoop and new data architectures to other large enterprises. After decades, where the Enterprise Data Warehouse (EDW) […]
Read More
In Blog Posted

Value of the cloud for Big Data

Dr. Phil Shelley, President, DataMetica Solutions Incorporated and former CIO and recently CTO of Sears Holdings. Dr. Shelley has many years experience in CIO/CTO and business leadership roles, now working as part of the DataMetica team to bringing Hadoop and new data architectures to other large enterprises. After decades, where the Enterprise Data Warehouse (EDW) […]
Read More
In Blog Posted

Installation of Latest version of Cloudera CDH5.4.0

Installation of Latest version of Cloudera CDH 5.4.0 Cloudera was the first, and is currently, the leading provider and supporter of Apache Hadoop for the enterprise. Cloudera offers software for business critical data challenges including storage, access, management, analysis, security and search. There are two ways to install CDH(Cloudera’s Hadoop cluster) – first one is […]
Read More
In Blog Posted

Integration of Spark Streaming with Flume

Flume is a distributed and reliable service which is used for efficiently collecting and moving large amount of streaming event data. Spark Streaming brings Spark’s language-integrated API to stream processing; enabling us to process live data streaming jobs in a manner similar to batch processing jobs. Data can be ingested from various input sources like […]
Read More
In Blog Posted

Migration of Ambari server

Ambari is Hortonworks Hadoop cluster management framework. It helps monitor Hadoop ecosystem services. This article shows the procedure to migrate the Ambari server in a production cluster. Currently, you might have your Ambari server setup using default DB, which is PostgresSQL. Before starting the migration, the Ambari server has to be down, which will make […]
Read More
page 1 of 3
Contact Us

We're not around right now. But you can send us an email and we'll get back to you, asap.