Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.


A deep dive into Daft’s distributed execution engine, Flotilla, for multimodal data pipelines

I Got Tired of Tuning Batch Sizes, So I Made Them Tune Themselves

Our engineering team's best practices for working with AI coding agents.

Sourcetable CTO Andy Grosser discusses their data infrastructure choices and why reliability and scale drove their architecture decisions.

How Teraflop AI processed 7 million court documents and 40 million pages spanning 365 years of U.S. caselaw for under a dollar using Daft.

Leveraging ablation for contrastive image understanding evaluation in Daft

A new inference backend that maximizes batch inference throughput.

Spark, Ray Data, and Daft

Daft's new distributed engine

The Swordfish Engine

Using Daft’s observability tools to uncover performance pitfalls