.:: Data Sains Lab ::.

The power of data analytics


Category: data_engineer

January 25, 2020

Home Lab

I like to learn something new every day, whether or not it is related to my PhD research (big data value creation). I have investigated…

January 24, 2020

Benchmark Python’s Dataframe: Pandas vs. Datatable vs. PySpark SQL

Setup

Machine: 16-thread Xeon 2.6 GHz, 32 GB RAM, NVMe (PCIe x16)
System: Ubuntu 16.04, Spark 2.4.4, Python 3.7.4, Pandas 0.25.1, Datatable 0.10.1
Data: 100…

June 3, 2019

Apache Hadoop: What is that & how to install and use it? (Part 2)

Part 2: How to install a standalone Hadoop

Now, we are going to install a standalone Hadoop. The easiest way is to use VM…

June 3, 2019

Apache Hadoop: What is that & how to install and use it? (Part 1)

Next: How to install a standalone Hadoop

Part 1: Understanding Apache Hadoop as a Big Data Distributed Processing & Storage Cluster

In the last…

June 2, 2019

High Performance Computing (HPC) TU Delft

High performance computing (HPC): What is it and what is it used for?

I am so grateful to have access to rich resources of…

June 2, 2019

Repository of public datasets

For anyone who is looking for datasets for their project.

June 2, 2019

Standardized patterns for improving the data quality of big data

Abstract: Data seldom create value by themselves. They need to be linked and combined from multiple sources, which can often come with variable data…

June 2, 2019

Arista: Linux-based networking devices

For years, the networking industry has been dominated by Cisco and its operating system, IOS. IOS, IMHO, is not designed to be customized by the…

June 2, 2019

Containers on ARM SBCs

Most single-board computers (SBCs) today are powered by ARM. Containerization on SBCs like Raspberry Pi or Orange Pi brings so much flexibility in a…

June 2, 2019

Saving electricity by suspending idle servers

Electricity is a (very) expensive resource in Europe. By putting the servers into sleep/suspend mode (while idle), I can save 80% of the power…


