Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

Run GPU models on millions of rows without OOM. Real patterns from ByteDance, Essential AI, and more.

Today, we're introducing updates to the Daft OSS governance model, defining new roles with expanded permissions for contributors and maintainers.

Learn from the ByteDance Volcengine LAS Team how to optimize Daft UDFs on Ray. Discover the formula to evenly distribute data across actors.

Early access to Daft Cloud for running model-driven AI pipelines reliably at production scale. Built on Daft OSS for continuous, resilient execution.

Chris Kelloggs shares why he joined Eventual to build open-source, distributed systems for large-scale AI and multimodal data workloads.

Manually tuning batch sizes is hard. So I implemented dynamic batching and never have to deal with it again.

In 2025, we shipped 56 releases and introduced features that changed how teams run multimodal AI pipelines at scale.

Google was Information Retrieval. Wikipedia is Knowledge Curation.

Our engineering team's best practices for working with AI coding agents.

Sourcetable CTO Andy Grosser discusses their data infrastructure choices and why reliability and scale drove their architecture decisions.