Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

Our engineering team's best practices for working with AI coding agents.

Sourcetable CTO Andy Grosser discusses their data infrastructure choices and why reliability and scale drove their architecture decisions.

How Teraflop AI processed 7 million court documents and 40 million pages spanning 365 years of U.S. caselaw for under a dollar using Daft.

Leveraging ablation for contrastive image understanding evaluation in Daft

A systems engineer’s view of the new AI stack

A new inference backend that maximizes batch inference throughput.

Spark, Ray Data, and Daft

Daft's new distributed engine

The Swordfish Engine

Using Daft’s observability tools to uncover performance pitfalls

A deep dive into GPU optimizations for production-scale multimodal data processing

OCR, Spatial Analysis & GPU Embeddings with Python