Join us as we explore innovative ways to handle multimodal datasets, optimize performance, and simplify your data workflows.

Daft v0.7.15 ships with try_cast for safe type conversion, Flight shuffle LZ4 compression, UUIDv7 timestamp extraction, and PostgreSQL support.

How Daft rebuilt distributed shuffle around Arrow Flight, local disk, and streaming reads to handle multi-terabyte workloads.

A new dashboard, per-operator memory attribution, and OTel endpoints for your existing collector. Everything you need to see what Daft is doing with your query.

Daft v0.7.14 rewrites the Parquet reader on arrow-rs for up to 17x faster remote reads, ships streaming distributed limits, and adds native UUIDv7 generation.

Three Daft releases cut May 12–15. Bidirectional streaming ASOF joins, Arrow PyCapsule, Iceberg idempotent commits and table properties, Spark month arithmetic.

daft.VideoFile decodes only the frames you need. Keyframes, time-sampled, or windowed seek, built for robotics datasets, dashcams, and moderation queues.

Daft now supports native extensions via Apache Arrow's C Data Interface. daft-h3 is the first community extension — 9 Rust-native H3 geospatial functions, 3–16x faster than Python UDFs.

30 contributors shipped Daft v0.7.10 — the most participation in any Daft release to date. The result: 41 new features and functions across distributed joins, duplicate detection, temporal arithmetic,

How to transcribe thousands of audio files with Whisper using daft.AudioFile — handling resampling, silence splitting, and worker-resident model loading without the boilerplate.