Writing
Recent Writing
Workflow schedulers are a bad fit for data materialization, and something better is badly needed. This article examines a potential solution that will feel familiar, its benefits, and the challenges facing its implementors.
Despite its ubiquity and ease of access, CSV is a wretched way to exchange data. The time has long passed to retire CSV and replace it with something better.
When does it make sense for an object to be in Amazon S3’s Intelligent-Tiering ("S3-IT") storage class? The answer, unfortunately, is "it depends". (Published on the Duckbill Group blog.)
The AWS Data Ecosystem Grand Tour
In late 2019 and early 2020, I wrote a series of articles offering a whirlwind tour of the portions of AWS's vast service ecosystem dedicated to data management.