Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

Robotics is hitting a data wall. The architecture debate gets much of the attention, but the data constraint is more fundamental.

Robotics is hitting a data wall. The architecture debate gets much of the attention, but the data constraint is more fundamental.

A new dashboard, per-operator memory attribution, and OTel endpoints for your existing collector. Everything you need to see what Daft is doing with your query.

Daft v0.7.14 rewrites the Parquet reader on arrow-rs for up to 17x faster remote reads, ships streaming distributed limits, and adds native UUIDv7 generation.

Three Daft releases cut May 12–15. Bidirectional streaming ASOF joins, Arrow PyCapsule, Iceberg idempotent commits and table properties, Spark month arithmetic.

How we built, broke, and re-built our ASOF joins — 5.5x faster, half the memory of pandas, and scaled to a distributed cluster.

daft.VideoFile decodes only the frames you need. Keyframes, time-sampled, or windowed seek, built for robotics datasets, dashcams, and moderation queues.

Jim Fan argues robotics will follow the exact LLM playbook - and VLAs are already being replaced by World Action Models.

Physical AI has become a real trend, but is there something real here or is it just hype?

Daft now supports native extensions via Apache Arrow's C Data Interface. daft-h3 is the first community extension — 9 Rust-native H3 geospatial functions, 3–16x faster than Python UDFs.