Welcome to the Daft blog

Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

Product Engineering Announcements Team Company Thought Leadership Case Studies Tutorials Video

Multimodal Structured Outputs: Evaluating VLM Image Understanding at Scale

Engineering

December 2, 2025

Multimodal Structured Outputs: Evaluating VLM Image Understanding at Scale

Leveraging ablation for contrastive image understanding evaluation in Daft

Processing 99% of U.S. Caselaw for Under $1 in the Common Pile

Engineering

Case Studies

December 2, 2025

Processing 99% of U.S. Caselaw for Under $1 in the Common Pile

How Teraflop AI processed 7 million court documents and 40 million pages spanning 365 years of U.S. caselaw for under a dollar using Daft.

Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale

Engineering

November 4, 2025

Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale

Learn how Dynamic Prefix Bucketing reduces LLM batch inference time, improves throughput, and unlocks faster multimodal processing at scale.

Benchmarks for Multimodal AI: Spark, Ray Data, and Daft

Engineering

October 1, 2025

Benchmarks for Multimodal AI: Spark, Ray Data, and Daft

Multimodal AI workloads break traditional data engines. Daft ran 2-7x faster than Ray Data and 4-18x faster than Spark while finishing jobs reliably across audio, video, document, and image workloads.

Introducing Flotilla: Simplifying Multimodal Data Processing at Scale

Announcements

Engineering

October 1, 2025

Introducing Flotilla: Simplifying Multimodal Data Processing at Scale

Flotilla, Daft's new distributed engine, processes terabytes of multimodal data in a single query up to 18x faster than Spark and Ray Data, while running efficiently, reliably, and without manual tuning.