Not sure how useful it will be, but it's at least a nice demonstration of a full Clojure+Clojurescript data science tool: we've loaded the CORD19 database into our Medical Search platform DOC Search and made it freely available (http://covid19.doctorevidence.com) (or https://search.doctorevidence.com/ with user/pass: covid19 / covid19) feel free to play around or provide much needed feedback ๐
I gave a talk about the platform at the Dutch Clojure Days last year https://www.youtube.com/watch?v=EM61rn9Gxl4 for a little bit more background on what the platform tries to achieve ๐
Thanks for sharing. I guess no code is public though
Unfortunately not, we're looking into open sourcing some libraries from it, but the whole stack itself is maybe a bit much (and that takes a bit more corporate convincing probably ๐ )
This is awesome! What did you use for the visualizations?
Thanks! Just vanilla d3 ๐
Anyone have experience with Apache Commons Math? https://commons.apache.org/proper/commons-math/
Or JavaScript's "stdlib"? https://github.com/stdlib-js/stdlib
When it'll be the data-science meetups of scicloj ?
I wrapped certain parts into fastmath
library.
Hackaton tomorrow for example.
See this link: https://scicloj.github.io/posts/2020-03-17-covid-19-hackathon-planning/
๐ thanks!
How'd that go? Any issues?
We recently removed our commons-math wrapped functions from our math library. A lot of the code in commons-math3 is pretty gross and doesn't handle edge cases very well.
I see! That's helpful, thanks. I'd love to hear more about that if you feel like sharing.
Most of our code uses generative testing. We found that many of apache's functions exhibit unwanted and inconsistent behavior when numbers get large or very small, and passed Infinity or NaN. As it turned out, we weren't really using much of commons-math because we'd often prefer to write the function in Clojure (both for speed and known, consistent behavior).
I actually didn't find any corner cases, maybe I didn't use too much. Mostly I rely on optimization, randomness, partly statistics and distributions.
Do you gen test? It finds just about everything.
I had issues with empirical and enumerated distributions only.
No, I don't. I rely on tests done by lib authors.
Curious what you've found.
I donโt recall the specifics. https://github.com/Provisdom/math/commit/b59140d76501b7e5e56ca425ad444be900b304b5 was the main commit of us removing it. Not sure if thatโd give any insight though. Itโs been more of a death by a thousand cuts for us with apache-math. Iโd wager if you start adding in gen tests to your existing code, youโd quickly hit these sort of issues.