Clojurians Log v2

Clojure programming

Channels

# 100-days-of-code # aatree # admin-announcements # adventofcode # ai # alda # aleph # all-the-channels # announcements # arachne # architecture # asami # atlanta-clojurians # atom-editor # autochrome-github # avi # aws # aws-lambda # babashka # babashka-sci-dev # bangalore-clj # beginners # berlin # biff # bigdata # bitcoin # boot # boot-dev # boulder-clojurians # braid-chat # braveandtrue # brevis # bristol-clojurians # business # calva # capetown # carry # cbus # cestmeetup # chestnut # chlorine-clover # cider # circleci # clara # clj-commons # cljdoc # cljfx # clj-http # clj-kondo # clj-on-windows # cljs-dev # cljs-experience # cljsfiddle # cljsjs # cljsrn # cljtogether # clojars # clojure # clojure-android # clojure-argentina # clojure-art # clojure-austin # clojure-australia # clojure-austria # clojure-bangladesh # clojure-bay-area # clojure-beijing # clojure-belgium # clojure-berlin # clojure-boston # clojure-brasil # clojurebridge # clojurebridge-ams # clojure-canada # clojure-chennai # clojure-chicago # clojure-china # clojure-colombia # clojure-conj # clojurecup # clojure-czech # clojured # clojure-denmark # clojure-denver # clojure-derby # clojuredesign-podcast # clojure-dev # clojure-dusseldorf # clojure-ecuador # clojure-egypt # clojure-estonia # clojure-europe # clojure-filipino # clojure-finland # clojure-france # clojure-gamedev # clojure-germany # clojure-greece # clojure-guangzhou # clojure-hamburg # clojure-hk # clojure-houston # clojure-hungary # clojure-india # clojureindia # clojure-indonesia # clojure-ireland # clojure-israel # clojure-italy # clojure-japan # clojure-kc # clojure-korea # clojure-losangeles # clojure-madison # clojure-mexico # clojure-miami # clojure-mk # clojure-mke # clojure-morsels # clojure-my # clojure-new-zealand # clojure-nl # clojure-nlp # clojure-norway # clojure-poland # clojure-portugal # clojure-provo # clojure-quebec # clojureremote # clojure-romania # clojure-russia # clojure-sanfrancisco # clojurescript # clojurescript-ios # clojure-sdn # clojure-seattle # clojure-serbia # clojure-sg # clojure-shanghai # clojure-spain # clojure-spec # clojuresque # clojure-survey # clojure-sweden # clojure-switzerland # clojure-taiwan # clojure-turkiye # clojure-uk # clojure-ukraine # clojureverse-ops # clojurewerkz # clojurewest # clojurex # clojure-za # clojurian-chat-app # clojutre # cloverage # cloxp # clr # code-art # code-reviews # community-development # component # conf-proposals # conjure # consulting # contributions-welcome # copenhagen-clojurians # core-async # core-logic # core-matrix # core-typed # cryogen # crypto # css # cursive # cz-clojure # d2q # datacrypt # datahike # datalevin # datalog # data-oriented-programming # data-science # datascript # datavis # dato # datomic # defnpodcast # deps-new # depstar # devcards # devops # dirac # docker # docs # domino-clj # duct # dunaj # eastwood # editors # emacs # error-message-catalog # etaoin # ethereum # euroclojure # events # exercism # expound # figwheel # figwheel-main # flambo # fulcro # funcool # functionalprogramming # funimage # garden # ghostwheel # girouette # gis # google-cloud # gorilla # graalvm # graalvm-mobile # graclj # graphql # gratitude # gsoc # hammock-driven-dev # helix # heroku # hispano # holy-lambda # honeysql # hoplon # hugsql # humor # hypercrud # hyperfiddle # immutant # improve-getting-started # incanter # indycljs # inf-clojure # instaparse # integrant # interceptors # interop # introduce-yourself # iot # iotivity # ipfs # jackdaw # jaunt # java # javascript # javelin # jobs # jobs-discuss # jobs-rus # joker # jukebox # juxt # jvm # kaocha # keechma # kekkonen # keyboards # klipse # kosmos # lambdaisland # ldnclj # ldnproclodo # lein-figwheel # leiningen # liberator # liquid # livestream # local-first-clojure # london-clojurians # lsp # luminus # lumo # mail # malli # mathematics # meander # melbourne # membrane # mental-health # microservices # mid-cities-meetup # midje # minecraft # minimallist # missionary # monads # mount # music # new-channels # new-clojure # nextjournal # nginx # nrepl # numerical-computing # nyc # observability # off-topic # om # om-next # onyx # other-languages # other-lisps # overtone # pamela # parinfer # pathom # pedestal # perun # philosophy # phzr # planck # plastic # play-clj # podcasts # polylith # portal # portkey # portland-or # powderkeg # practicalli # precept # prelude # programming-beginners # project-updates # proletarian # proton # protorepl # pulsar # pure-frame # qa # qlkit # quil # random # rdf # react # reactive # reading-clojure # reagent # reclojure # re-frame # reitit # releases # remote-jobs # respo # rethinkdb # reveal # rewrite-clj # ring # ring-swagger # robots # rum # schema # sci # sfcljs # shadow-cljs # _silence # sim-testing # sioux-falls # slack-help # sneer # sneer-br # spacemacs # specmonstah # specter # speculative # spirituality-ethics # sql # startup-in-a-month # sydney # test200 # test-check # testing # thejaloniki # timbre # tmp-json-parsing # tools-build # tools-deps # trading # tree-sitter # uncomplicate # unrepl # untangled # utah-clojurians # videos # vim # vrac # vscode # wasm # web-security # windows # xtdb # yada # yleinen

Apps

powderkeg

cgrand 2017-03-09T10:35:03.273019Z

@viesti can’t imagine how this Iterable/Iterator regression was unavoidable. Breaking stuff for fun

cgrand 2017-03-09T10:35:10.273602Z

(and employment)

viesti 2017-03-09T10:45:19.326469Z

resisting to say something about Scala in general :)

viesti 2017-03-09T10:58:20.398010Z

another thing that I haven't made clear to myself is Spark runtime version vs version linked into the app

cgrand 2017-03-09T10:58:43.400119Z

viesti 2017-03-09T10:59:33.404672Z

can app linked with 2.1.0 run in a cluster running 1.5.0 for example

cgrand 2017-03-09T11:00:19.408906Z

depends on when classes are resolved

cgrand 2017-03-09T11:00:32.410351Z

Working on that right now

cgrand 2017-03-09T11:02:13.419686Z

(defmacro ^:private compile-cond [&amp; choices]
	(let [x (Object.)
	      expr
	      (reduce (fn [_ [test expr]]
                  (when (eval test) (reduced expr))))
	        x (partition 2 choices)]
	  (when (= x expr)
	    (throw (ex-info "No valid choice." {:form &amp;form})))))

cgrand 2017-03-09T11:03:07.424596Z

with that you could ship an app oblivious to Spark version as long as powderkeg is not aot compiled

cgrand 2017-03-09T11:03:28.426565Z

else keg would hardcode the spark used during aot

cgrand 2017-03-09T11:45:25.637761Z

https://github.com/HCADatalab/powderkeg/commit/8c3d7f27423f1649d14dada96978b949d64506d3

cgrand 2017-03-09T11:47:08.646166Z

Having to ship powderkeg 2.10 and 2.11 is no fun, neither is asking user to add chill

cgrand 2017-03-09T11:47:13.646456Z

any idea?

viesti 2017-03-09T12:00:36.712704Z

Scala binary compatibility :picard-facepalm:

viesti 2017-03-09T12:02:12.721500Z

flambo seems to support only 2.10

viesti 2017-03-09T12:03:32.728833Z

http://spark.apache.org/downloads.html says: Note: Starting version 2.0, Spark is built with Scala 2.11 by default. Scala 2.10 users should download the Spark source package and build with Scala 2.10 support.

cgrand 2017-03-09T12:07:58.750101Z

raaaaah

cgrand 2017-03-09T12:08:43.753767Z

can we detect scala version at runtime?

viesti 2017-03-09T12:08:54.754524Z

flambo seems to have 0.8.0 for spark 2.x and 0.7.2 for spark 1.x

viesti 2017-03-09T12:08:55.754604Z

https://github.com/yieldbot/flambo/commit/8edda47f85cbb84f4c798d57c9918ab59235b98b

viesti 2017-03-09T12:08:56.754667Z

😄

viesti 2017-03-09T12:09:18.756428Z

guessing that thy just dropped with 0.7.2 🙂

viesti 2017-03-09T12:09:36.757836Z

hmm

cgrand 2017-03-09T12:09:41.758218Z

(I’m thinking about shading chill twice and using the right one)

viesti 2017-03-09T12:10:47.763312Z

hmm, is it even possible to load chill conditionally?

viesti 2017-03-09T12:16:30.790899Z

user=&gt; (import 'scala.util.Properties)
scala.util.Properties
user=&gt; (scala.util.Properties/versionString)
"version 2.11.8”

viesti 2017-03-09T12:17:03.793520Z

found from http://www.scala-lang.org/old/node/7532

viesti 2017-03-09T12:18:02.798617Z

@cgrand it seems to be the way to detect Scala runtime version: http://stackoverflow.com/a/6968014

viesti 2017-03-09T12:19:31.806060Z

on current powderkeg:

user=&gt; (scala.util.Properties/versionString)
"version 2.10.4”

cgrand 2017-03-09T12:21:58.818087Z

in fact chill is a dep of spark itslef so I can remove it

viesti 2017-03-09T12:23:08.823974Z

ah, neat, was already thinking of classloader magic http://stackoverflow.com/questions/11759414/java-how-to-load-different-versions-of-the-same-class

viesti 2017-03-09T12:24:15.829292Z

this would be quite neat actually, to as a user be able to select spark version

viesti 2017-03-09T12:24:59.832962Z

going to make a snack for the kids now

cgrand 2017-03-09T12:25:34.836033Z

I’m 1h ahead so lunch is long due 🙂

viesti 2017-03-09T12:39:51.910329Z

apropo, saw this related to DataSet/DataFrame http://spark.apache.org/docs/latest/sql-programming-guide.html#creating-datasets

cgrand 2017-03-09T13:22:55.149248Z

yeah that’s what I used

viesti 2017-03-09T13:25:57.168956Z

have to learn to read more carefully 🙂

cgrand 2017-03-09T13:27:55.181259Z

I’m not happy with the fact that I lose schema metadata too often and can’t always reconstruct a spec for the resulting dataset

viesti 2017-03-09T14:29:25.658258Z

yup, but seems promising for taking over DataFrames/Dataset/MLlib 🙂

cgrand 2017-03-09T15:12:37.083935Z

I have quickly looked at Travis CI documentation and it can spawn containers

viesti 2017-03-09T15:30:47.271223Z

haven't used Travis myself, but have heard good things about it

viesti 2017-03-09T15:34:35.311099Z

https://docs.travis-ci.com/user/docker/ and https://circleci.com/docs/1.0/docker/ look similar at start :) (enabling docker service)

viesti 2017-03-09T15:35:22.318892Z

Circleci autodetects clojure projects and runs lein test, travis might do same

viesti 2017-03-09T15:36:11.327242Z

is it better to run tests against a container than in local mode?

viesti 2017-03-09T15:36:54.334693Z

answering to myself, could test 1.x and 2.x Cluster that way

cgrand 2017-03-09T15:43:13.399972Z

yes it’s definitely better because local mode share the VM and most classloaders so it hides bugs

cgrand 2017-03-09T15:44:14.411630Z

PoCing transducers on spark took me one day (never touched spark before) in local mode

cgrand 2017-03-09T15:44:55.419960Z

Everything else was figuring out how to have it run on a cluster.

viesti 2017-03-09T15:47:41.450964Z

yup

viesti 2017-03-09T15:48:33.459991Z

hmm so we could use this https://github.com/gettyimages/docker-spark

cgrand 2017-03-09T16:00:16.590906Z

@powderkeg I merged spark2 and spark1.5 code in https://github.com/HCADatalab/powderkeg/tree/spark2, I had local networking issues today which prevented me from testing. Please try on your own

cgore 2017-03-09T18:44:16.319637Z

I’m getting the following on the spark2 branch, with Spark 2.1.0 running: CompilerException java.lang.ClassNotFoundException: com.twitter.chill.java.RegexSerializer, compiling:(carbonite/serializer.clj:1:1)

cgore 2017-03-09T18:50:12.378019Z

And a bit worse after a lein clean

cgrand 2017-03-09T19:04:07.519669Z

lein with-profile +spark2 repl

cgore 2017-03-09T20:14:02.210027Z

oops

cgore 2017-03-09T20:14:08.211001Z

yeah, that helps 😄

cgore 2017-03-09T20:14:21.213010Z

Now I get this error, further along:

cgrand 2017-03-09T20:16:50.235674Z

Ok it looks like I botched the macro…

cgrand 2017-03-09T21:29:52.900800Z

@cgore indeed https://github.com/HCADatalab/powderkeg/commit/ae0998755abd25a2362a5e08709d2d209beaee58

cgore 2017-03-09T23:08:45.717096Z

@gene

cgore 2017-03-09T23:36:05.886698Z

That looks like it’s working for me now.