Absolutely nobody disputes the potential value of big data. It provides an economic way to ask new analytics questions that we were never able to ask before. And that is possible because we are able to combine new, large, and widely disparate data sets in ways that were never economically possible before. The challenge people are now facing is that it getting harder and harder to show business value.
In this third and final part of this blog post, I provide more technical details on the changes we made to Parquet and Spark. This post will be of interest to a software developer. If a reader is familiar with Parquet and Spark with sufficient technical details, the reader may appreciate this content more, though... more
Text search is an essential operation in many applications dealing with semi-structured big data. One such application, which many of us know about, deals with program logs, which not only contains data for troubleshooting, but also often, other information helpful for understanding how the application operates under normal conditions, for diagnosing performance characteristics, snapshot of internal... more