A Guide for Using Cloud Integration to Load Data into Amazon Redshift: Establishing Connections

Amazon Redshift, one of the fast-rising stars in the AWS ecosystem has taken the data warehousing world by storm ever since it was introduced almost two years ago. Amazon Redshift operates completely in the cloud, and allows you to provision nodes on-demand. This model allows you to overcome many of the pains associated with traditional data warehousing techniques, such as provisioning extra server hardware, sizing and preparing databases for loading or extensive SQL scripting.

However, when loading data into Redshift, you may find it challenging to do so in a timely manner. To reduce the time taken to load this data, you may have to spend a tremendous amount of time writing SQL optimization queries which takes away the value proposition of using Redshift in the first place.

Informatica Cloud helps you load this data quickly into Redshift in just a few minutes. To start using Informatica Cloud, you’ll need to establish connections from Redshift and your other data source first. Here are a few easy steps to help you get started with establishing connections from a relational database such as MySQL as well as Redshift into Informatica Cloud:

  1. Login into your Informatica Cloud account, go to Configure -> Connections, click “New”, and select “MySQL” for “Type”
  2. Select your Secure Agent and fill in the rest of the database details:
    Amazon Redshift Step 2
  3. Test your connection and then click ‘OK’ to save and exit
    Amazon Redshift
  4. Now, login to your AWS account and go to Redshift service page
  5. Go to your cluster configuration page  and make a note of the cluster and cluster database properties: Number of Nodes, Endpoint, Port, Database Name, JDBC URL. You also will need:
    • The Redshift database user name and password (which is  different from your AWS account)
    • AWS account Access Key
    • AWS account Secret Key
  6. Exit the AWS console.
  7. Now, back in your Informatica Cloud account, go to Configure -> Connections and click “New”.
  8. Select “AWS Redshift (Informatica)” for “Type” and fill in the rest of the details from the information you have from above
    Redshift step 3
  9. Test the connection and then click ‘OK’ to save and exit
    Redshift Step 4

As you can see, establishing connections was extremely easy and can be done in less than 5 minutes. To learn how customers such as UBM used Informatica Cloud to deliver next-generation customer insights with Amazon Redshift, please join us on September 16 for a webinar where we’ll have product experts from Amazon and UBM explaining how your company can benefit from cloud data warehousing for petabyte-scale analytics using Amazon Redshift.

Comments

  • Doesn’t working. The Connection test in a new Redshift connection creation is always Loading… 🙁