jackdaw

https://github.com/FundingCircle/jackdaw
thom 2020-03-10T09:22:53.042300Z

Any gotchas implementing custom data stores? Is it a widely accepted architecture to have some tables that just grow monotonically forever?

thom 2020-03-10T09:25:55.045200Z

Basically we do updates or retractions potentially months later and so far Kafka Streams seems completely magic compared to other platforms for keeping downstream stuff up to date, as long as we feed it storage and don’t mind new instances taking a while to drink the data in. But we’d perhaps feel more comfortable if it wasn’t just backed by RocksDB.

gklijs 2020-03-10T12:43:49.045400Z

You could also use ksqlDB, which basically lets you have the RocksDB store live in another service, and has an API to get data from there. It does solve the restart cost, but you need another piece. And with the other piece you might use a more traditional database as well. You also need to take into account how you want to restore the data in case something happens. Relying just on replication in Kafka might be too risky.

2020-03-10T17:21:15.045600Z

Also consider what happens if your data is subject to "right to be forgotten" type rules.