Tag Archives: Hadoop Summit
Hadoop Enriches Data Science: Part 2 Of Hadoop Series
Enterprises use Hadoop in data-science applications that improve operational efficiency, grow revenues or reduce risk. Many of these data-intensive applications use Hadoop for log analysis, data mining, machine learning or image processing.
Commercial, open source or internally developed data-science applications have to tackle a lot of semi-structured, unstructured or raw data. They benefit from Hadoop’s combination of storage and processing in each data node spread across a cluster of cost-effective commodity hardware. Hadoop’s lack of fixed-schema works particularly well for answering ad-hoc queries and exploratory “what if” scenarios.
Posted in Big Data, Operational Efficiency
Tagged customer, data mining, data-science, DNA, Facebook, Hadoop, Hadoop Summit, HBase, HIVE, mashup, MySQL, ODF, Operational Efficiency, security, StumbleUpon, Teradata, twitter, Yahoo!, Zettaset
Leave a comment

