April 12, 2021 Jupyter Matlab kernel Buat teman2 di tanah air yang belum beruntung utk mengakses matlab di kampus/sekolahnya (karena lisensinya yang mahal), bisa akses gratis matlab kernel (lisensi TUD)… Share this:TweetWhatsAppPrintEmail
January 25, 2020 Home Lab I like to learn something new everyday, whether it is related to my PhD research (big data value creation) or not. I have investigated… Share this:TweetWhatsAppPrintEmail
January 24, 2020 Benchmark Python’s Dataframe: Pandas vs. Datatable vs. PySpark SQL Setup Machine: 16-thread Xeon 2.6 GHz, 32 GB RAM, NVME PCIx16 System: Ubuntu 16.04, Spark 2.4.4, Python 3.7.4, Pandas 0.25.1, Datatable 0.10.1 Data: 100… Share this:TweetWhatsAppPrintEmail
June 4, 2019 Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2) How to use GBQ Note that the data should be located in Google cloud, whether in Google Cloud Storage, Google Drive, Cloud BigTable, or… Share this:TweetWhatsAppPrintEmail
June 3, 2019 Apache Hadoop: What is that & how to install and use it? (Part 2) Part 2: How to install a standalone Hadoop Now, we are going to install a standalone Hadoop. The easiest way is to use VM… Share this:TweetWhatsAppPrintEmail
June 3, 2019 Apache Hadoop: What is that & how to install and use it? (Part 1) Next: How to install a standalone Hadoop Part 1: Understanding Apache Hadoop as a Big Data Distributed Processing & Storage Cluster In the last… Share this:TweetWhatsAppPrintEmail
June 2, 2019 High Performance Computing (HPC) TU Delft High performance computing (HPC): What is it and what is it used for? I am so grateful to have access to wealthy resources of… Share this:TweetWhatsAppPrintEmail
June 2, 2019 Big Data & AI landscape 2018 As described by AgileEngine (https://agileengine.com/megatrends-in-big-data/), there are four megatrends in big data in 2019, i.e.:1) From “big data” to “just data” because most organizations… Share this:TweetWhatsAppPrintEmail
June 2, 2019 Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 1) Next: Part 2 Google Big Query (GBQ) as a serverless service from Google Serverless is one of big data solution to watch in 2018… Share this:TweetWhatsAppPrintEmail
June 2, 2019 Apache Zeppelin, a polyglot data science tools Use of polyglot application for big data exploratory will be more important in the future. It allows us to run multiple interpreters in a… Share this:TweetWhatsAppPrintEmail