.:: Data Sains Lab ::.

The power of data analytics

  • About
Skip to content

Category: data_engineer

June 1, 2019

InfluxDB compression

I’m always amazed at how people improve data storing technique, e.g., Influxdb, a time-series NoSQL database, that not only responds to a query very…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Mailbox checker

As an online shopper, I need to regularly check my mailbox, 5 floors separated from home. It is really annoying to check whether a…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Syncsort DMX-h & IBM SPSS Modeler

Two other popular data processing platform in the IT world are explored, i.e., DMX-h and SPSS Modeler. 1) DMX-h I was an extensive user…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

My first experiment with Zigbee

Instead of buying an expensive and proprietary Zigbee gateway, I bought a Zigbee sniffer that could be “changed” to be a universal gateway (https://lnkd.in/dj9HwBF)….

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Zwave+ vs. WiFi-based IoT devices

There are at least 4 competing IoT connectivity technologies that have already numerous rolled-out products in the market, i.e., RF-433 MHz, WiFi (ESP8266-based devices),…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Why we need a big data platform such as Hadoop & Spark?

On the last post, I mentioned that aggregating & sorting 100 million rows dataset (~ 2.4 GB) using monolithic approach takes 4 seconds to…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Be cautious to include legacy resources as part of the big data system

Very often, many organizations insist to involve legacy resources (e.g., applications, data storage) into the big data system. On one hand, it could accelerate…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email

Posts navigation

Previous 1 … 3 4

Recent Posts

  • Jupyter Matlab kernel
  • Home Lab
  • Benchmark Python’s Dataframe: Pandas vs. Datatable vs. PySpark SQL
  • Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2)
  • Apache Hadoop: What is that & how to install and use it? (Part 2)

Recent Comments

  • Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 1) - .:: Data Sains Lab ::. on Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2)
  • Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2) - .:: Data Sains Lab ::. on Be cautious to include legacy resources as part of the big data system
  • Apache Hadoop: What is that & how to install and use it? (Part 1) - .:: Data Sains Lab ::. on Apache Hadoop: What is that & how to install and use it? (Part 2)
  • Apache Hadoop: What is that & how to install and use it? (Part 1) - .:: Data Sains Lab ::. on Why we need a big data platform such as Hadoop & Spark?
  • Google BigQuery, a serverless batch query of big datasets - .:: Data Sains Lab ::. on Be cautious to include legacy resources as part of the big data system

Archives

  • April 2021
  • January 2020
  • June 2019

Category

  • architect
  • big_data_developer
  • data_analyst
  • data_engineer
  • data_scientist
  • others
  • project_manager
  • Uncategorised

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
Copyright .:: Data Sains Lab ::.. All rights reserved. | Powered by WordPress & Writers Blogily Theme
loading Cancel
Post was not sent - check your email addresses!
Email check failed, please try again
Sorry, your blog cannot share posts by email.