Welcome to the Daft blog

Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

GPU Inference with @daft.cls
Product
March 23, 2026

GPU Inference with @daft.cls

Run GPU models on millions of rows without OOM. Real patterns from ByteDance, Essential AI, and more.

Processing 300K Images Without OOM
Engineering
August 6, 2025

Processing 300K Images Without OOM

A Streaming Solution

Multimodal Data Processing Goes Global
Announcements
July 28, 2025

Multimodal Data Processing Goes Global

Daft Community is expanding to China, bridging the gap between English documentation and Chinese innovation cycles, in partnership with Bytedance Team

Eventual Raises $30M to Build the Future of Data Processing
Announcements
June 24, 2025

Eventual Raises $30M to Build the Future of Data Processing

We've raised $30M to build generational technology for simple, reliable, and performant data processing across all modalities and regardless of scale.

We cloned over 15,000 repos to find the best developers
Engineering
April 22, 2025

We cloned over 15,000 repos to find the best developers

An adventure in AI and data engineering to analyze developers across Github

High-Performance File System Support With DeepSeek 3FS
Engineering
March 18, 2025

High-Performance File System Support With DeepSeek 3FS

Learn how Daft integrates with DeepSeek SmallPond 3FS to deliver faster file access and efficient data handling for modern workloads.

From v0.2 to v0.3: Harder, Better, Faster, Stronger
Announcements
November 4, 2024

From v0.2 to v0.3: Harder, Better, Faster, Stronger

Join us on the journey from Daft v0.2 to v0.3! Daft v0.3 was released last month, marking the first minor version increment in almost 10 months.

Introducing Daft-SQL for High-Performance Data Exploration
Engineering
October 23, 2024

Introducing Daft-SQL for High-Performance Data Exploration

A SQL API enabling users to interact with their data in a new but familiar way. Learn how Daft-SQL brings fast, scalable querying to multimodal workloads, helping teams explore large datasets efficiently with a distributed engine.

Reading Delta Lake with Daft
Engineering
April 10, 2024

Reading Delta Lake with Daft

Discover how Daft reads Delta Lake tables efficiently, giving teams fast access to large datasets and seamless integration into data workflows.

Adversarial file reading: from 10,000 small CSVs to massive Parquet files
Engineering
March 6, 2024

Adversarial file reading: from 10,000 small CSVs to massive Parquet files

Learn how adversarial file reading speeds up data ingestion at scale, enabling fast conversion from thousands of CSVs into efficient Parquet files.

PreviousPage 5 of 6Next
Get updates, contribute code, or say hi.
Daft Engineering Blog
Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.
Github Discussions Forums
join
GitHub logo
The Distributed Data Community Slack
join
Slack logo