First off, I've worked with infinite data retention on trading and insurance so definitely not…

Mar 29, 2024

First off, I've worked with infinite data retention on trading and insurance so definitely not simple use cases and definitely not theoretical. Second, data didn't sit around and was routinely reprocessed to rebuild state during releases if the topology or schema changed. Third, the main reason why you would want to have ALL your data in Kafka is that if Kafka is your source of truth why copy the data around and complicate your architecture if you don't need to? As for streaming, Kafka is so much more. Granted it is not as good at reading as a traditional database, no indexes, no ad hoc or push down queries etc... but for simple queries getting the data from a local state store is faster and simpler than from a networked external db and many times this is enough.

Written by Tom Kaszuba

Responses (1)