June 4, 2019 Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2) How to use GBQ Note that the data should be located in Google Cloud, whether in Google Cloud Storage, Google Drive, Cloud Bigtable, or…
June 3, 2019 Apache Hadoop: What is that & how to install and use it? (Part 2) Part 2: How to install a standalone Hadoop Now, we are going to install a standalone Hadoop. The easiest way is to use VM…
June 3, 2019 Apache Hadoop: What is that & how to install and use it? (Part 1) Part 1: Understanding Apache Hadoop as a Big Data Distributed Processing & Storage Cluster In the last…
June 2, 2019 High Performance Computing (HPC) TU Delft High performance computing (HPC): What is it and what is it used for? I am so grateful to have access to rich resources of…
June 2, 2019 Big Data & AI landscape 2018 As described by AgileEngine (https://agileengine.com/megatrends-in-big-data/), there are four megatrends in big data in 2019, namely: 1) From “big data” to “just data”, because most organizations…
June 2, 2019 Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 1) Google BigQuery (GBQ) as a serverless service from Google Serverless is one of the big data solutions to watch in 2018…
June 2, 2019 Apache Zeppelin, a polyglot data science tools The use of polyglot applications for big data exploration will become more important in the future. It allows us to run multiple interpreters in a…
June 2, 2019 Repository of public datasets For anyone who is looking for datasets for his/her project.
June 2, 2019 Standardized patterns for improving the data quality of big data Abstract: Data seldom create value by themselves. They need to be linked and combined from multiple sources, which can often come with variable data…
June 2, 2019 Arista, a Linux-based networking devices For years, the networking industry has been dominated by Cisco and its operating system, IOS. IOS, IMHO, is not designed to be customized by the…