Terraform DataForge Quick Start - AWS Default Setup

Follow each of the steps in this guide in order to use the Terraform DataForge Quick Start tool to create all the necessary resources and infrastructure for a new DataForge Workspace.

Google Chrome is the supported browser for DataForge. Using any unsupported browsers may cause issues loading pages.

Databricks

  1. Create a Databricks Account. You can sign up for a free trial at https://www.databricks.com/try-databricks#account

Github

  1. Create a Github Account if you do not already have one and follow the signup instructions to confirm your account. The Free account will work for this quick start: https://github.com/signup
  2. Once you're signed into Github, open the DataForgeLabs Terraform Module Examples repo
  3. Click the Fork option near the top-right and click the Create Fork button to fork the repo into your own account.

Terraform

  1. Sign up for a Terraform Account if you do not already have one and confirm the new account through the email confirmation that is sent: https://app.terraform.io/public/signup/account
  2. Once you are signed in to Terraform, create a new Organization. The organization can be given any name you like as long as it follows the Terraform creation guidelines listed.
  3. Create a new Workspace in Terraform:
    • Choose the Version Control Workflow as you will need to sync with your Github account
    • Select Github and Github.com for the version control provider.
  4. In the window popup that appears, select the Authorize Terraform Cloud button. If the popup window does not appear, you may need to adjust your browser settings to allow popups.
  5. On the next popup that appears, select the button to Install. If you are seeing a Terraform page that has a spinning icon and shows Github App Installation, look for the popup mentioned on your windows.
  6. You should now be on a screen to choose your repository.  Select the repository that you forked from the DataForge Terraform Module Examples repo in the Github steps.
  7. Expand the "Additional Options" section and enter "aws" into the Terraform Working Directory, then select the Create option at the bottom of the page

  8. Terraform will show a Configure Terraform Variables page, automatically listing the variables needed.  Enter the value of each variable from your AWS or Databricks accounts. Use the Inputs Guide to read about where to find the value of each variable. For environment_prefix, use only alphanumeric characters and dashes (underscores will cause failures). After entering the values, select the Save Variables button.
  9. Select the Start Run button, optionally give the run a name like "dataforge quickstart", and leave the Run Type as "Plan and apply (standard)".  Select the Start button.

  10. When the Plan stage is complete, you will see a green checkmark and the message "Plan Finished". Scroll to the bottom of the page and select the Apply button to finish letting Terraform stand up all the resources in your Cloud environment. 
  11. When the Apply stage in Terraform is complete, you should see a green checkmark and the message "Apply Complete". Copy the Databricks "workspace_url" and the "instance_profile_arn" to be used in the next and final section of steps. These two values can also be found in the Overview -> Outputs section if needed.

Databricks

  1. Open the new Databricks Workspace URL that was copied from the previous step in Terraform.
  2. Select your initial drop-down in the top-right corner and select Settings. Databricks Example here
  3. Select Security on the sub-menu and click the "Manage" option next to Instance Profiles.
  4. Click the "Add instance profile" option and paste the instance_profile_arn value copied from Terraform into the "Instance Profile ARN" text box and click the Add button.

You are now finished using the DataForge Terraform Quick Start and all of the necessary resources and infrastructure have been created for you to easily request a new DataForge Workspace. A new Databricks Workspace should exist in your Databricks account for you to use. You will need a Databricks Personal Access Token to enter into the DataForge Workspace Request.

Please return to the New DataForge Workspace Creation form to finish your setup.

If issues arise or additional help is needed, please open a support request with the DataForge team and one of our members will assist you with getting the Quick Start working.

Updated

Was this article helpful?

0 out of 0 found this helpful