distributed - .:: Data Sains Lab ::.

Apache Hadoop: What is that & how to install and use it? (Part 1)

3rd June 2019 by agungw132

Next: How to install a standalone Hadoop. Part 1: Understanding Apache Hadoop as a Big Data Distributed Processing & Storage Cluster. In the last post, I discussed the occasions on which we prefer a distributed approach, such as Hadoop and Spark, over a monolithic one. In this article, I will discuss Apache Hadoop in more detail. This …

Tableau, the “de facto” distributed visualization platform for big data

3rd June 2019 / 2nd June 2019 by agungw132

I had not checked Tableau for a long time, and the application has since incorporated new connectors for recent technologies, e.g., Google BigQuery and Spark SQL. It surely positions itself as the ‘de facto’ distributed visualization platform for big data. I wonder when Tableau will deliver its desktop version for the Linux platform. Microsoft Excel, Text File, Microsoft Access, JSON File …

Be cautious about including legacy resources as part of the big data system

1st June 2019 by agungw132

Very often, organizations insist on including legacy resources (e.g., applications, data storage) in the big data system. On the one hand, this can accelerate and ease the implementation of a big data use case; on the other, it creates a bottleneck in the workflow that becomes problematic in the long term. If the monolithic applications …
