Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

Daft Observability Roadmap: metrics, OTEL integration, real-time dashboards, and DataFrame APIs for debugging and monitoring distributed pipelines.

Another killer release featuring arrow-rs migration, Apache OpenDAL support, Flight shuffle, better metrics, and Tencent Cloud COS integration.

Daft v0.7.3 adds distributed observability with df.metrics via OTEL, nightly builds, and native Lance vector search.

Distributed Random Access for Audio, Video, Documents, and Code


A deep dive into Daft’s distributed execution engine, Flotilla, for multimodal data pipelines

I Got Tired of Tuning Batch Sizes, So I Made Them Tune Themselves

Our engineering team's best practices for working with AI coding agents.

Sourcetable CTO Andy Grosser discusses their data infrastructure choices and why reliability and scale drove their architecture decisions.

How Teraflop AI processed 7 million court documents and 40 million pages spanning 365 years of U.S. caselaw for under a dollar using Daft.

Leveraging ablation for contrastive image understanding evaluation in Daft

A new inference backend that maximizes batch inference throughput.