Cloud Foundation #

phData’s Cloud Foundation tool supports an automated, infrastructure-as-code approach to deploying and supporting data products. By automating infrastructure provisioning, rather than spinning up compute, storage, and networking manually, you can deploy faster and in a more consistent, repeatable fashion.

Cloud Foundation provides a ready path to get there. Rather than writing your infrastructure-as-code entirely from scratch, you can draw from our library of production-ready AWS CloudFormation “Gold Templates” (ready-to-go infrastructure patterns developed by phData and deployed successfully into production by our customers), along with build scripts that support a streamlined, pull-request-based CI/CD pipeline.

phData helps you get started: first by creating a source code repository, then by running a bootstrapping script to configure your Cloud Foundation instance, and finally by providing the Gold Templates and sample configuration parameters to provision your infrastructure.
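To make the relationship between a template and its configuration parameters concrete, here is a minimal sketch in Python. The template is a hypothetical single-bucket example, not one of phData's Gold Templates, and the names `TEMPLATE`, `SAMPLE_PARAMETERS`, and `to_cfn_parameters` are assumptions made for illustration. It checks a set of sample values against the parameters the template declares and converts them into the `ParameterKey`/`ParameterValue` list form that CloudFormation accepts.

```python
import json

# Hypothetical minimal template for illustration only;
# this is NOT one of phData's Gold Templates.
TEMPLATE = {
    "AWSTemplateFormatVersion": "2010-09-09",
    "Description": "Illustrative raw-data bucket for a data lake",
    "Parameters": {
        "Environment": {"Type": "String", "AllowedValues": ["dev", "prod"]},
        "Project": {"Type": "String"},
    },
    "Resources": {
        "RawDataBucket": {
            "Type": "AWS::S3::Bucket",
            "Properties": {
                "BucketName": {"Fn::Sub": "${Project}-${Environment}-raw"},
                "Tags": [
                    {"Key": "environment", "Value": {"Ref": "Environment"}},
                    {"Key": "project", "Value": {"Ref": "Project"}},
                ],
            },
        }
    },
}

# Hypothetical sample configuration for a development environment.
SAMPLE_PARAMETERS = {"Environment": "dev", "Project": "analytics"}


def to_cfn_parameters(template, values):
    """Validate values against the template's declared parameters and
    return them as a list of ParameterKey/ParameterValue dictionaries."""
    declared = template.get("Parameters", {})

    # Reject values the template does not declare.
    unknown = sorted(set(values) - set(declared))
    if unknown:
        raise ValueError(f"parameters not declared in template: {unknown}")

    # Parameters without a Default must be supplied explicitly.
    missing = sorted(
        name
        for name, spec in declared.items()
        if "Default" not in spec and name not in values
    )
    if missing:
        raise ValueError(f"required parameters not supplied: {missing}")

    return [
        {"ParameterKey": key, "ParameterValue": str(values[key])}
        for key in sorted(values)
    ]


if __name__ == "__main__":
    print(json.dumps(to_cfn_parameters(TEMPLATE, SAMPLE_PARAMETERS), indent=2))
```

Keeping the template fixed and varying only the parameter values is what lets the same pattern be provisioned consistently in each environment.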

Our Cloud Foundation offerings include:

  • Infrastructure-as-Code Library: Prebuilt CloudFormation templates for AWS EMR, Airflow, AWS Redshift, AWS DMS, Snowflake, Databricks, Cloudera Hadoop, and more. The templates include best practices tailored to data-focused workloads such as machine learning pipelines, cloud-native data warehouses, IoT, and data lakes.

  • AWS Cloud Foundation CI/CD Library: A library for automated testing and execution of AWS CloudFormation templates, along with promotion of changes from development to production environments.
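The kind of decision a CI/CD library like this has to make can be sketched in a few lines of Python. This is an illustrative model only, not the Cloud Foundation implementation: the environment order, the `Action` class, and the functions `plan_deployment` and `promotion_target` are all assumptions. The sketch decides, for each stack touched by a change, whether it would be a new stack or an update to an existing one (mirroring the `CREATE` and `UPDATE` change set types in CloudFormation), and whether a tested change may move on to the next environment.

```python
from dataclasses import dataclass

# Assumed promotion order; a real pipeline may define more stages.
ENVIRONMENTS = ["dev", "prod"]


@dataclass(frozen=True)
class Action:
    stack: str
    environment: str
    change_set_type: str  # "CREATE" for a new stack, "UPDATE" for an existing one


def plan_deployment(changed_stacks, deployed, environment):
    """Return one Action per changed stack for the given environment.

    `deployed` maps an environment name to the set of stack names that
    already exist there.
    """
    existing = deployed.get(environment, set())
    return [
        Action(stack, environment, "UPDATE" if stack in existing else "CREATE")
        for stack in sorted(changed_stacks)
    ]


def promotion_target(environment, tests_passed):
    """Return the next environment to promote to, or None if the change
    failed testing or is already in the last environment."""
    if not tests_passed:
        return None
    position = ENVIRONMENTS.index(environment)
    if position + 1 < len(ENVIRONMENTS):
        return ENVIRONMENTS[position + 1]
    return None


if __name__ == "__main__":
    deployed = {"dev": {"airflow"}, "prod": set()}
    for action in plan_deployment(["redshift", "airflow"], deployed, "dev"):
        print(action)
    print("promote to:", promotion_target("dev", tests_passed=True))
```

Separating the plan from its execution means the plan can be reviewed in the pull request before anything is applied.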

Why use Cloud Foundation #

  • Execute faster: Accelerate cloud projects with our ready-to-go AWS CloudFormation templates rather than building your own from scratch.

  • Optimize for ML and data analytics: phData Cloud Foundation is dedicated to machine learning and data analytics, with prebuilt stacks for a range of analytical tools, including AWS EMR, Airflow, AWS Redshift, AWS DMS, Snowflake, Databricks, Cloudera Hadoop, and more.

  • Improve quality: Spend less developer time troubleshooting errors and performance issues by using time-tested infrastructure templates and configuration patterns (many of which have been used in production for years).

  • Automate deployment: Cloud Foundation includes automated, pull-request-based testing and deployment of new stacks and of change sets to existing stacks. Because standard Git tooling handles cloud deployments, the developer experience is simpler and more efficient.

  • Improve auditability and governance: Automating deployments provides a clearer picture of who did what and when, and of which changes are most responsible for driving up costs. Cloud Foundation also gives you the control to enforce governance through standardized tags, naming schemes, and approval processes.
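As an illustration of what enforcing standardized tags and naming schemes can look like in an automated pipeline, the following Python sketch checks a stack against a policy before deployment. The required tags, the naming pattern, and the function `check_stack` are hypothetical examples chosen for this sketch; they are not the policy that Cloud Foundation ships with.

```python
import re

# Hypothetical governance policy, for illustration only.
REQUIRED_TAGS = {"owner", "cost-center", "environment"}
# Assumed naming scheme: lowercase words joined by hyphens, ending in the
# environment name, for example "analytics-emr-dev".
NAME_PATTERN = re.compile(r"^[a-z][a-z0-9]*(-[a-z0-9]+)*-(dev|prod)$")


def check_stack(name, tags):
    """Return a list of policy violations; an empty list means compliant."""
    violations = []
    if not NAME_PATTERN.fullmatch(name):
        violations.append(f"stack name {name!r} does not match the naming scheme")
    missing = sorted(REQUIRED_TAGS - set(tags))
    if missing:
        violations.append("missing required tags: " + ", ".join(missing))
    return violations


if __name__ == "__main__":
    compliant_tags = {"owner": "data-eng", "cost-center": "1234", "environment": "dev"}
    print(check_stack("analytics-emr-dev", compliant_tags))
    print(check_stack("Analytics_EMR", {"owner": "data-eng"}))
```

Running a check like this on every pull request turns the policy into a gate: a change with violations fails before it reaches an approval step.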