Writing
Recent Writing
Data materialization isn't an orchestration problem, it's a convergence problem, and we need a new system to handle it.
(Part of the "Is the orchestrator dead or alive?" symposium on Data People, Etc.)
Despite its ubiquity and ease of access, CSV is a wretched way to exchange data. The time has long passed to retire CSV and replace it with something better.
When does it make sense for an object to be in Amazon S3's Intelligent-Tiering ("S3-IT") storage class? The answer, unfortunately, is "it depends". (Published on the Duckbill Group blog.)
The AWS Data Ecosystem Grand Tour
In late 2019 and early 2020, I wrote a series of articles offering a whirlwind tour of the portions of AWS's vast service ecosystem dedicated to data management.