.:: Data Sains Lab ::.

The power of data analytics

  • About
Skip to content

Tag: spark

June 1, 2019

Tensorframes: Tensorflow + Spark

Combining data-intensive best solution (apache spark) and compute-intensive best approach (Tensorflow with GPU) results in Tensorframes. The speedup is remarkable. Hopefully, I could get…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Why we need a big data platform such as Hadoop & Spark?

On the last post, I mentioned that aggregating & sorting 100 million rows dataset (~ 2.4 GB) using monolithic approach takes 4 seconds to…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email
June 1, 2019

Be cautious to include legacy resources as part of the big data system

Very often, many organizations insist to involve legacy resources (e.g., applications, data storage) into the big data system. On one hand, it could accelerate…

Share this:

  • Tweet
  • WhatsApp
  • Print
  • Email

Recent Posts

  • Jupyter Matlab kernel
  • Home Lab
  • Benchmark Python’s Dataframe: Pandas vs. Datatable vs. PySpark SQL
  • Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2)
  • Apache Hadoop: What is that & how to install and use it? (Part 2)

Recent Comments

  • Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 1) - .:: Data Sains Lab ::. on Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2)
  • Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 2) - .:: Data Sains Lab ::. on Be cautious to include legacy resources as part of the big data system
  • Apache Hadoop: What is that & how to install and use it? (Part 1) - .:: Data Sains Lab ::. on Apache Hadoop: What is that & how to install and use it? (Part 2)
  • Apache Hadoop: What is that & how to install and use it? (Part 1) - .:: Data Sains Lab ::. on Why we need a big data platform such as Hadoop & Spark?
  • Google BigQuery, a serverless batch query of big datasets - .:: Data Sains Lab ::. on Be cautious to include legacy resources as part of the big data system

Archives

  • April 2021
  • January 2020
  • June 2019

Category

  • architect
  • big_data_developer
  • data_analyst
  • data_engineer
  • data_scientist
  • others
  • project_manager
  • Uncategorised

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
Copyright .:: Data Sains Lab ::.. All rights reserved. | Powered by WordPress & Writers Blogily Theme
loading Cancel
Post was not sent - check your email addresses!
Email check failed, please try again
Sorry, your blog cannot share posts by email.