🚀 Reflections from JupyterCon 2025

📌 Problem Statement: 1. Although the Jupyter ecosystem is growing rapidly, many practitioners still struggle with the reproducibility, collaboration, and accessibility of data workflows. 2. Notebooks are often used for exploration, but they are poorly suited to documentation, deployment, and sound engineering practice. 3. Awareness of inclusivity and diversity in the data community remains important and still needs to grow. 🛠️ Methodology / Solution / …

💾 Have an External SSD That's Been Sitting in a Drawer for a Long Time? Careful, Its Data Can Vanish on Its Own! 😱

We often assume an SSD is the safest, most durable storage medium for backing up photos or important documents. It turns out that if an SSD is left unpowered for too long, it can develop “amnesia”. XDA Developers covers this phenomenon. Here is the breakdown: 1. ⚠️ Problem Statement * Data Rot: Unlike a hard disk drive (HDD), which uses magnetism, or a CD, which …

🧠 Google Won't Be Outdone! Gemini 2.0 “Overthinking” Is Now Available in AI Studio! 🚀

While we are all buzzing about Claude Opus 4.5, Google has quietly shipped a major update in AI Studio. Their latest tweet confirms the arrival of a new experimental model ready to challenge OpenAI o1. Here is the breakdown: 1. ⚠️ Problem Statement * Shallow Reasoning: Ordinary “Flash” or “Pro” models often answer too quickly without thinking deeply (System 1 thinking), so they are often wrong on …

🤖 NVIDIA's Boss Is Angry? “You're Crazy If You Don't Use AI!” (A Must-Read for Employees in 2025!) 😑

Jensen Huang (CEO of NVIDIA) just gave a stern “pep talk” to his 36,000 employees. It turns out that even at the world's largest AI chip maker, some managers are still hesitant to use AI. Jensen's reaction? “Are you insane?” Here is the summary: 1. ⚠️ Problem Statement * Internal Pushback: Jensen came across reports that some managers at NVIDIA were forbidding …

🕸️ Don't Start Your Knowledge Graph Project with a “Perfect Ontology”!

Building a Knowledge Graph (KG) looks sexy: connecting all of a company's data into one smart network. But the reality? Many KG initiatives fail outright, or “die before they develop”. This article from The Ontologist dissects the reasons for those failures and how to avoid them. Let's summarize! 👇 🛑 1. The Problem: The biggest mistake of novice data engineers or ontologists …

Google BigQuery, a serverless Datawarehouse-as-a-Service to batch query huge datasets (Part 1)

Next: Part 2. Google BigQuery (GBQ) as a serverless service from Google. Serverless is one of the big data solutions to watch in 2018, according to Computerworld UK (/compwuk). Google BigQuery (GBQ) is an example of an enterprise-grade serverless service (either Function-as-a-Service, FaaS, or Datawarehouse-as-a-Service) offered by Google Cloud Platform. GBQ was first launched as …

Apache Zeppelin, a polyglot data science tool

Polyglot applications for big data exploration will become more important in the future. They allow us to run multiple interpreters in a single notebook and transfer variables among multiple kernels. Using multiple programming languages unleashes the strengths of each interpreter.

Standardized patterns for improving the data quality of big data

Abstract: Data seldom create value by themselves. They need to be linked and combined from multiple sources, which can often come with variable data quality. The task of improving data quality is a recurring challenge. In this paper, we use a case study of a large telecom company to develop a generic process pattern model …

Tableau, the “de facto” distributed visualization platform for big data

I have not checked Tableau in a long time; the application has since incorporated new connectors for recent technologies, e.g., Google BigQuery and Spark SQL. It surely positions itself as the ‘de facto’ distributed visualization platform for big data. I wonder when Tableau will deliver its desktop version for Linux. Microsoft Excel, Text File, Microsoft Access, JSON File, …

Spin up Oracle database in minutes using Docker

Oracle Database remains one of the most wanted skills, perhaps even today. As far as I know, compared to rivals such as MySQL and Postgres, its installation requires substantial effort. The application itself also consumes a lot of memory and storage. Today, Oracle is available as a ready-to-use container. In only minutes, an Oracle instance could be …