Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.


Native Extensions via Stable C ABI, Live Query Dashboard, and 2-5x faster Parquet Reads on Nested Types

Row-wise, async, generator, and batch UDFs in Daft: one decorator, zero boilerplate, local or distributed

Daft Observability Roadmap: metrics, OTEL integration, real-time dashboards, and DataFrame APIs for debugging and monitoring distributed pipelines.

Another killer release featuring arrow-rs migration, Apache OpenDAL support, Flight shuffle, better metrics, and Tencent Cloud COS integration.

Daft v0.7.3 adds distributed observability with df.metrics via OTEL, nightly builds, and native Lance vector search.

Distributed Random Access for Audio, Video, Documents, and Code

Formalizing role definitions for contributors and maintainers

.png&w=3840&q=100)
Running model-driven data pipelines reliably at production scale

Chris Kellogg on his decision to join Eventual

A deep dive into Daft’s distributed execution engine, Flotilla, for multimodal data pipelines

I Got Tired of Tuning Batch Sizes, So I Made Them Tune Themselves