Big Data & AI landscape 2018

As described by AgileEngine (https://agileengine.com/megatrends-in-big-data/), there are four megatrends in big data in 2019, i.e.:1) From “big data” to “just data” because most organizations currently already embrace big data. 2) machine learning is the new engine after many organizations suffer creating value from big data. 3) everyone to the cloud because of the following benefits: … Read moreBig Data & AI landscape 2018

Big Data Application & eTOM

Which department in an enterprise is concerned with a certain big data objective? The most compelling answer is to combine the eTOM framework with big data use cases. To understand how the big data applications are implemented in practices, we need to pinpoint the applications on the business processes in an organization. For such purpose, we … Read moreBig Data Application & eTOM

Movie review: Anon

How privacy is no longer an issue in the future A very good movie I watched this week: Anon (https://en.wikipedia.org/wiki/Anon_(film) ), especially the dialog at the end of the movie. It taught me a lot the philosophy of privacy.  Anon: you invaded my privacy, it is nothing. I tried to get my privacy back, it … Read moreMovie review: Anon

Google G-Suite: Unlimited of everything

It is unbelievable, if you have a company, then you can have unlimited of everything from Google (email, storage, apps, etc.), only $10/user/month! …wars on “data collection”…

How China to become AI global leader

Kai-Fu Lee mentions there are four stages that China took to become global AI leader:1) Copied from U.S.2) Inspired by U.S., then leapfrogged3) Chinese innovations4) Entering the era of Copy-from-china (become “data collectors” in other countries) It is interesting to see how U.S. reacts recently by blockade its trade with China.

Big Data Expo NL

I was so lucky visiting the annual Big Data Expo @ Utrecht, the Netherlands on 19-20 September 2018 (https://www.bigdata-expo.nl/en) Here are some presentations I could capture:1. HOW TABLEAU ENABLED ABN AMRO TO VISUALIZE BIG DATA EFFICIENTLY (https://www.dropbox.com/sh/piazgsw9esw8dsl/AAAjjdjNNQFO0AbDRGyVns6ya?dl=0)2. HOW TO LET AN ELEPHANT DANCE; IMPLEMENTATION OF A DATA LAKE IN NEAR REAL-TIME AT A DUTCH INSURER … Read moreBig Data Expo NL

Dataiku: flexible data science tools

In the previous post, the flexibility given by data science tools greatly reduces the performance, i.e., the execution speed. Fortunately, Dataiku, a data science tool, provides multiple ways to aggregate big data: 1) using the built-in building blocks; 2) using a custom R script with the built-in I/O blocks; or3) using an independent custom R … Read moreDataiku: flexible data science tools

Why do we need a smart home?

This is why a smart home is needed: to understand your home better and communicate with pros on their own language — My dialog with the dishwasher’s company (DC): Me: Hi, I’d like to inform you about my dishwasher’s problem. DC: Sure, please give me your address (by address, they know what type of the … Read moreWhy do we need a smart home?

CPU vs. GPU

Inspired by the benchmark from Matt Dowle (https://h2oai.github.io/db-benchmark/), I compared his benchmark with GPU (Detail: https://lnkd.in/e7iHg7N). For processing big data, GPU K20 2 GB is slightly better than 20 cores CPU Xeon 2.6 GHz 125.8 GB RAM, even much better in some tests 🙂 Of course, the performance comes with a price. Thanks to Omnisci … Read moreCPU vs. GPU

Syncsort DMX-h & IBM SPSS Modeler

Two other popular data processing platform in the IT world are explored, i.e., DMX-h and SPSS Modeler. 1) DMX-h I was an extensive user of this beast software in 2009-2012. It is an amazing ETL platform, I used to process terabytes of chunked files which was completed in a short time (compared to a relational … Read moreSyncsort DMX-h & IBM SPSS Modeler