Welcome to the Daft blog

Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

GPU Inference with @daft.cls
Product
March 23, 2026

Run GPU models on millions of rows without OOM. Real patterns from ByteDance, Essential AI, and more.

Daft OSS: New Governance Model
Announcements
February 10, 2026

Today, we're introducing updates to the Daft OSS governance model that define new roles for contributors and maintainers, with expanded permissions.

Tuning Daft's Distributed UDFs: Lessons from ByteDance
Engineering
February 6, 2026

Learn from the ByteDance Volcengine LAS Team how to optimize Daft UDFs on Ray. Discover the formula to evenly distribute data across actors.

Announcing Early Access to Daft Cloud
Product
January 20, 2026

Early access to Daft Cloud for running model-driven AI pipelines reliably at production scale. Built on Daft OSS for continuous, resilient execution.

Why I joined Eventual
Team
January 14, 2026

Chris Kelloggs shares why he joined Eventual to build open-source distributed systems for large-scale AI and multimodal data workloads.

Introducing Dynamic Batching: Auto-Tuning for Daft Pipelines
Engineering
January 12, 2026

Manually tuning batch sizes is hard, so I implemented dynamic batching and never have to deal with it again.

Daft 2025 Year in Review - Minor Releases, Major Evolution
Company
January 5, 2026

In 2025, we shipped 56 releases and introduced features that changed how teams run multimodal AI pipelines at scale.

Knowledge curation (not search) is the AI big data problem
Thought Leadership
December 24, 2025

Google was Information Retrieval. Wikipedia is Knowledge Curation.

How We Use AI Coding Agents
Engineering
December 15, 2025

Our engineering team's best practices for working with AI coding agents.

How Sourcetable Built the World's First AI Spreadsheet with Daft
Engineering
Case Studies
December 11, 2025

Sourcetable CTO Andy Grosser discusses their data infrastructure choices and why reliability and scale drove their architecture decisions.

Get updates, contribute code, or say hi.
GitHub Discussions Forums
The Distributed Data Community Slack