onyx

FYI: an alternative Onyx chat is at <https://gitter.im/onyx-platform/onyx>; the log can be found at <https://clojurians-log.clojureverse.org/onyx/index.html>

antique 2017-12-16T18:38:25.000088Z

I’m learning Onyx, and it looks like the only way to get windowing to work properly is to let it write checkpoints to S3. Is this correct?

lucasbradstreet 2017-12-16T18:52:43.000090Z

Yes, though if your windows are small and you’re using a test/ephemeral ZK server, you can enable this setting: http://www.onyxplatform.org/docs/cheat-sheet/latest/#peer-config/:onyx.peer/storage.zk.insanely-allow-windowing-QMARK

lucasbradstreet 2017-12-16T18:52:53.000093Z

But note that it will fail when you write out large windows.
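
For reference, a minimal sketch of how that flag might sit in a peer config. The `:onyx.peer/storage.zk.insanely-allow-windowing?` key is the one from the cheat-sheet link above; pairing it with `:onyx.peer/storage :zookeeper` is an assumption about how checkpoint storage is selected, and `test-peer-config` is a hypothetical name. All other required peer-config keys are omitted.

```clojure
;; Sketch only: intended for tests with small windows and an ephemeral ZK server.
(def test-peer-config
  {;; Assumption: store checkpoints in ZooKeeper instead of S3.
   :onyx.peer/storage :zookeeper
   ;; Setting from the cheat sheet; writing large windows will still fail.
   :onyx.peer/storage.zk.insanely-allow-windowing? true})
```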

antique 2017-12-16T19:05:19.000006Z

Is that documented anywhere? I may have missed it, but that’s a pretty big dependency, and unfortunately I can’t use S3 (I doubt I can even get IT to open the firewall for it). How hard would it be to write another storage backend?

lucasbradstreet 2017-12-16T19:06:53.000027Z

Others have done it for Google Cloud’s S3 equivalent, and it worked out OK. If you’re going to run it on-prem, building an HDFS plugin would probably be the best bet.

lucasbradstreet 2017-12-16T19:09:41.000062Z

You basically have to implement this: https://github.com/onyx-platform/onyx/blob/0.12.x/src/onyx/storage/s3.clj

lucasbradstreet 2017-12-16T19:09:59.000032Z

Google Cloud Storage was pretty much a straight port: https://github.com/tenaciousjzh/onyx/blob/0.11.x/src/onyx/storage/gcs.clj

antique 2017-12-16T19:15:47.000019Z

Thanks!

Travis 2017-12-16T19:30:26.000020Z

Yeah, our Google Cloud Storage port has worked out so far. Hopefully we can get that in soon.

lucasbradstreet 2017-12-16T19:35:28.000013Z

🙂 Is there anything that might be useful on our end to separate it out into a plugin? I guess the main thing would be the information model and any schema checks?

lucasbradstreet 2017-12-16T19:35:47.000047Z

Then we could just separate out the gcs.clj file into its own project, with the GCS dependencies.

Travis 2017-12-16T20:05:33.000019Z

I think currently we just added the stuff to Onyx core, where the S3 schemas and info models are. I'm sure we can pull it out.

lucasbradstreet 2017-12-16T20:10:56.000061Z

Yeah, I saw. It looks like it should be easy to pull out. I mostly want to keep the schema checks and maybe add a nice error if the dep hasn’t been included.