Tag Archives: S3

Hadoop Extends Data Architectures: Part 3 In Hadoop Series

The list and diversity of NoSQL, “NewSQL”, cloud, grid, and other data architecture options seem to grow every year.

The Harry Potter books and movies were a particularly popular inspiration for project names. For example, at LinkedIn, to empower features such as “People You May Know” and “Jobs You May Be Interested In”, LinkedIn uses Hadoop together with an Azkaban batch workflow scheduler and Voldemort key-value store. We’ll see if the Twilight series has a similar impact on project names.

(more…)

FacebookTwitterLinkedInEmailPrintShare
Posted in Big Data | Tagged , , , , , , , , , , , , , | Leave a comment

Informatica Announces First Cloud Archiving Service Optimized For Databases

This is my first blog for Perspectives and I wanted to talk about one of last week’s announcements -Informatica introduced the first ever cloud archiving service optimized for databases.   Since then, I’ve had a number of questions from customers and analysts about what exactly makes it optimized?   Certainly, a number of vendors have the ability to land data in the cloud, so how is this different?  Let me capture the highlights:

  • The data is massively compressed before transmission-close to 60:1. That can take the upload time down from days or weeks to hours or minutes. No need to constantly mail DVDs to Amazon to get data on S3. (more…)
FacebookTwitterLinkedInEmailPrintShare
Posted in Application ILM, Cloud Computing, Data Integration | Tagged , , | Leave a comment