Is this a UI-based tool?
Yes. Pelican is a zero-coding, web UI-based tool that makes interaction between the user and the software seamless.
Can Pelican be used as an enterprise tool in my organization?
Yes. Pelican is built for enterprise use and offers the following capabilities:
- Sample data encryption (AES 256)
- User authentication
  - Customer managed – using LDAP/AD integration (individual user & group)
  - Product managed – using SMTP integration
- SSL compatibility
- Multiple deployment options
- Programmatic interfaces (APIs)
- Code migration utility
- Backup solutions and utilities to eliminate Pelican metadata loss
- Detailed logging framework
- Access and Role management module
- SDLC adoption and vulnerability testing using market leading scanning tools
Do I need to know coding to use Pelican?
Datametica’s Pelican is a state-of-the-art data validation technology that requires ZERO coding. Even a non-technical person can run the validations, understand the reports, and perform bug triage as per the tool’s recommendations.
What types of validation does Pelican perform?
Datametica’s Pelican performs several types of validation: Data, Metadata, and Scope.
- Data
  - Cell-level comparison between two tables (see the sketch after this list). Two modes available:
    - Litmus – quick check
    - Full – comprehensive check
  - Identification of all columns with mismatching cells in a single execution
  - Samples of mismatches
  - Mismatch data reconciliation between two tables. Parameters:
    - Mismatched rows
    - Missing / extra rows
    - Duplicate rows
- Metadata
  - Column positional ordering
  - Data type
- Scope
  - Full table
  - Incremental data
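To make the two data-validation modes concrete, here is a minimal, illustrative sketch of how a quick Litmus-style check differs from a full cell-level comparison. This is not Pelican's implementation; the hashing scheme and key handling are simplifying assumptions.

```python
# Illustrative only: contrasting a quick "Litmus"-style check with a full
# cell-level comparison. Pelican's actual algorithms are proprietary.
import hashlib

def row_hash(row):
    """Hash a row's cells so values can be compared without copying data."""
    joined = "|".join("" if v is None else str(v) for v in row)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()

def litmus_check(source_rows, target_rows):
    """Quick check: compare row counts and an order-insensitive hash digest."""
    if len(source_rows) != len(target_rows):
        return False
    return sorted(map(row_hash, source_rows)) == sorted(map(row_hash, target_rows))

def full_check(source_rows, target_rows, key_index=0):
    """Comprehensive check: report missing, extra, and mismatched rows by key."""
    src = {r[key_index]: row_hash(r) for r in source_rows}
    tgt = {r[key_index]: row_hash(r) for r in target_rows}
    missing = src.keys() - tgt.keys()          # rows only in the source
    extra = tgt.keys() - src.keys()            # rows only in the target
    mismatched = {k for k in src.keys() & tgt.keys() if src[k] != tgt[k]}
    return missing, extra, mismatched
```

The trade-off shown above mirrors the list: the quick check answers "do the tables agree at all?" cheaply, while the full check pinpoints exactly which rows disagree.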
How do you make sure that Pelican is safe to get into my company's network?
Pelican product development follows an extensive SDLC process. This includes testing all integration points to ensure the highest standards of security compliance, e.g., vulnerability scans, metadata and data security, network security, and user security.
Pelican is an enterprise application deployed within the customer's network for data validation. It is used only by employees and partners who have secured access to that network; Pelican does not need access to any outside network. Pelican access is controlled by the ground delivery team, which is covered by an NDA with the customer.
What are the infrastructural requirements if I want to use Pelican?
The recommended baseline infrastructure for running Pelican effortlessly is a Docker host or GKE node equivalent to machine type n1-standard-32 or higher.
Will Pelican impact my source and target data platforms while performing validation?
Pelican runs simple SQL queries on both the source and target platforms. Usually, the Pelican user is allocated dedicated capacity and resources so that existing workloads are not impacted.
How does Pelican handle different data types?
Pelican has an inbuilt intelligent algorithm that identifies data type differences between the two platforms and uses options like expressions and user inputs to ensure they are handled correctly for validation.
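Conceptually, handling a type difference means casting both sides to a common, comparable form before validating. The sketch below illustrates the idea with a hypothetical mapping table; the specific type pairs and cast expressions are illustrative assumptions, not Pelican's actual rules.

```python
# Illustrative sketch of normalizing cross-platform data types before
# comparison. The mappings are hypothetical examples only.
TYPE_CAST_EXPRESSIONS = {
    # (source_type, target_type): SQL expression applied to the source column
    ("TIMESTAMP(0)", "TIMESTAMP"): "CAST({col} AS TIMESTAMP)",
    ("DECIMAL(18,0)", "NUMERIC"):  "CAST({col} AS NUMERIC)",
    ("CHAR", "STRING"):            "TRIM({col})",
}

def normalize_column(col, source_type, target_type):
    """Return a SQL expression that renders a source column comparable
    with the target platform's type (identity cast if no rule applies)."""
    expr = TYPE_CAST_EXPRESSIONS.get((source_type, target_type), "{col}")
    return expr.format(col=col)

print(normalize_column("order_ts", "TIMESTAMP(0)", "TIMESTAMP"))
# -> CAST(order_ts AS TIMESTAMP)
```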
Can I interact programmatically with Pelican?
Yes, Pelican offers the ability to interact programmatically with the help of APIs. The APIs can be used to configure, manage, and process table validations within Pelican from the customer's enterprise scheduler.
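For instance, a scheduler job might submit a validation and poll for its result over HTTP. The base URL, endpoint paths, payload fields, and auth scheme below are illustrative assumptions, not documented Pelican APIs.

```python
# Hypothetical example of driving table validations from an enterprise
# scheduler via REST. All endpoints and field names are assumptions.
import requests

PELICAN_URL = "https://pelican.example.internal/api/v1"   # assumed base URL
HEADERS = {"Authorization": "Bearer <token>"}             # assumed auth scheme

# Submit a table validation job (assumed endpoint and payload shape).
resp = requests.post(
    f"{PELICAN_URL}/validations",
    headers=HEADERS,
    json={"source_table": "edw.orders", "target_table": "bq.orders",
          "mode": "full"},
    timeout=30,
)
resp.raise_for_status()
job_id = resp.json()["id"]

# Poll for completion from the scheduler (assumed status endpoint).
status = requests.get(f"{PELICAN_URL}/validations/{job_id}",
                      headers=HEADERS, timeout=30).json()["status"]
print(job_id, status)
```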
What are the operating costs of Pelican?
The operating costs consist of:
- Cost of the Pelican machine
- Select-query costs on the source and target DB platforms
What support is available if I buy a licensed product?
Licensed customers receive the following support:
- Demo and best practices sessions
- Tool deployment assistance
- Licensing
- Upgrades
- Bugs and Enhancements support
- Critical and High Vulnerability Mitigation
- Dedicated SME support (based on licensing model)
- Tool support for issues like service outages, performance issues, unknown errors, etc.
How can Pelican ensure data security during a data validation check?
Pelican offers high data security during data validation because it does not move the actual data from either the source or the target over the network for comparison. It uses hashing mechanisms that enable validation without moving or copying the existing data, i.e., zero data movement. This saves valuable storage space and network bandwidth and reduces data-leakage risk.
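The sketch below illustrates the zero-data-movement idea: each platform hashes its own rows with native SQL, and only the small hash results cross the network. The table names, column lists, and hash expressions are illustrative assumptions; Pelican's real hashing mechanism is internal to the product, and in practice both sides must canonicalize values identically for hashes to match.

```python
# Conceptual sketch: hash rows on each platform, compare only the hashes.

# Run natively on the source warehouse (Hive-style SQL, illustrative):
SOURCE_SQL = """
SELECT order_id,
       md5(concat_ws('|', CAST(customer_id AS STRING),
                          CAST(amount AS STRING),
                          CAST(order_ts AS STRING))) AS row_hash
FROM orders
"""

# Run natively on the target warehouse (BigQuery-style SQL, illustrative):
TARGET_SQL = """
SELECT order_id,
       TO_HEX(MD5(ARRAY_TO_STRING([CAST(customer_id AS STRING),
                                    CAST(amount AS STRING),
                                    CAST(order_ts AS STRING)], '|'))) AS row_hash
FROM dataset.orders
"""

def compare_hashes(source_hashes, target_hashes):
    """Compare {key: hash} maps fetched from each platform; only hashes,
    never raw data, travel over the network for this comparison."""
    return {k for k in source_hashes.keys() & target_hashes.keys()
            if source_hashes[k] != target_hashes[k]}
```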
What source and target databases does Pelican support?
As mentioned on our Pelican product page, the following are the source and target pairs that are supported by Pelican:
| Source | Target |
|---|---|
| Teradata | BigQuery |
| Netezza | BigQuery |
| Hive | BigQuery |
| Oracle | BigQuery |
| BigQuery | BigQuery |
| DB2 | BigQuery |
| Oracle | Hive |
| Teradata | Hive |
| Netezza | Hive |
| Hive | Hive |
| BigQuery | Hive |
| Vertica | BigQuery |
| Teradata | Snowflake |
| Teradata | Synapse |
| Netezza | Snowflake |
| Netezza | Synapse |
| MS SQL Server | Hive |
| MS SQL Server | BigQuery |
| Greenplum | Redshift |
| Hive | Delta Lake |
| Hive | Synapse |
| Snowflake | BigQuery |
| Teradata | Delta Lake |
| Redshift | BigQuery |
| MS SQL Server | MS SQL Server |
| Oracle | PostgreSQL |
Where would the Pelican tool be deployed – cloud or on-prem?
Pelican offers various deployment options, both in the cloud and on-premises, depending on the source and target data warehouses.
What is the frequency of standard software updates – monthly, quarterly, half-yearly, or annual?
Standard software updates typically cover the following areas:
- Functionality
- Security
- Performance
- Technical Debt
- Bug fixes
- Documentation
- Miscellaneous
About Datametica
Datametica is a global leader in data warehouse migration and modernization to the cloud. We empower businesses by migrating their data, workloads, ETL, and analytics to the cloud using automation. Through its automated products – Eagle (data warehouse assessment and migration planning), Raven (automated workload conversion), and Pelican (automated data validation) – Datametica automates and accelerates data migration to the cloud, removing anxiety from the migration process and making it faster, more accurate, less risky, and more cost-competitive.
We specialize in transforming legacy Teradata, Oracle, Hadoop, Netezza, Vertica, and Greenplum platforms, along with ETL tools such as Informatica, DataStage, Ab Initio, and others, to cloud-based data warehousing, with further capabilities in data engineering, advanced analytics solutions, data management, data lake implementation, and cloud optimization.