Clojurians Log v2

Clojure programming

Channels

# 100-days-of-code # aatree # admin-announcements # adventofcode # ai # alda # aleph # all-the-channels # announcements # arachne # architecture # asami # atlanta-clojurians # atom-editor # autochrome-github # avi # aws # aws-lambda # babashka # babashka-sci-dev # bangalore-clj # beginners # berlin # biff # bigdata # bitcoin # boot # boot-dev # boulder-clojurians # braid-chat # braveandtrue # brevis # bristol-clojurians # business # calva # capetown # carry # cbus # cestmeetup # chestnut # chlorine-clover # cider # circleci # clara # clj-commons # cljdoc # cljfx # clj-http # clj-kondo # clj-on-windows # cljs-dev # cljs-experience # cljsfiddle # cljsjs # cljsrn # cljtogether # clojars # clojure # clojure-android # clojure-argentina # clojure-art # clojure-austin # clojure-australia # clojure-austria # clojure-bangladesh # clojure-bay-area # clojure-beijing # clojure-belgium # clojure-berlin # clojure-boston # clojure-brasil # clojurebridge # clojurebridge-ams # clojure-canada # clojure-chennai # clojure-chicago # clojure-china # clojure-colombia # clojure-conj # clojurecup # clojure-czech # clojured # clojure-denmark # clojure-denver # clojure-derby # clojuredesign-podcast # clojure-dev # clojure-dusseldorf # clojure-ecuador # clojure-egypt # clojure-estonia # clojure-europe # clojure-filipino # clojure-finland # clojure-france # clojure-gamedev # clojure-germany # clojure-greece # clojure-guangzhou # clojure-hamburg # clojure-hk # clojure-houston # clojure-hungary # clojure-india # clojureindia # clojure-indonesia # clojure-ireland # clojure-israel # clojure-italy # clojure-japan # clojure-kc # clojure-korea # clojure-losangeles # clojure-madison # clojure-mexico # clojure-miami # clojure-mk # clojure-mke # clojure-morsels # clojure-my # clojure-new-zealand # clojure-nl # clojure-nlp # clojure-norway # clojure-poland # clojure-portugal # clojure-provo # clojure-quebec # clojureremote # clojure-romania # clojure-russia # clojure-sanfrancisco # clojurescript # clojurescript-ios # clojure-sdn # clojure-seattle # clojure-serbia # clojure-sg # clojure-shanghai # clojure-spain # clojure-spec # clojuresque # clojure-survey # clojure-sweden # clojure-switzerland # clojure-taiwan # clojure-turkiye # clojure-uk # clojure-ukraine # clojureverse-ops # clojurewerkz # clojurewest # clojurex # clojure-za # clojurian-chat-app # clojutre # cloverage # cloxp # clr # code-art # code-reviews # community-development # component # conf-proposals # conjure # consulting # contributions-welcome # copenhagen-clojurians # core-async # core-logic # core-matrix # core-typed # cryogen # crypto # css # cursive # cz-clojure # d2q # datacrypt # datahike # datalevin # datalog # data-oriented-programming # data-science # datascript # datavis # dato # datomic # defnpodcast # deps-new # depstar # devcards # devops # dirac # docker # docs # domino-clj # duct # dunaj # eastwood # editors # emacs # error-message-catalog # etaoin # ethereum # euroclojure # events # exercism # expound # figwheel # figwheel-main # flambo # fulcro # funcool # functionalprogramming # funimage # garden # ghostwheel # girouette # gis # google-cloud # gorilla # graalvm # graalvm-mobile # graclj # graphql # gratitude # gsoc # hammock-driven-dev # helix # heroku # hispano # holy-lambda # honeysql # hoplon # hugsql # humor # hypercrud # hyperfiddle # immutant # improve-getting-started # incanter # indycljs # inf-clojure # instaparse # integrant # interceptors # interop # introduce-yourself # iot # iotivity # ipfs # jackdaw # jaunt # java # javascript # javelin # jobs # jobs-discuss # jobs-rus # joker # jukebox # juxt # jvm # kaocha # keechma # kekkonen # keyboards # klipse # kosmos # lambdaisland # ldnclj # ldnproclodo # lein-figwheel # leiningen # liberator # liquid # livestream # local-first-clojure # london-clojurians # lsp # luminus # lumo # mail # malli # mathematics # meander # melbourne # membrane # mental-health # microservices # mid-cities-meetup # midje # minecraft # minimallist # missionary # monads # mount # music # new-channels # new-clojure # nextjournal # nginx # nrepl # numerical-computing # nyc # observability # off-topic # om # om-next # onyx # other-languages # other-lisps # overtone # pamela # parinfer # pathom # pedestal # perun # philosophy # phzr # planck # plastic # play-clj # podcasts # polylith # portal # portkey # portland-or # powderkeg # practicalli # precept # prelude # programming-beginners # project-updates # proletarian # proton # protorepl # pulsar # pure-frame # qa # qlkit # quil # random # rdf # react # reactive # reading-clojure # reagent # reclojure # re-frame # reitit # releases # remote-jobs # respo # rethinkdb # reveal # rewrite-clj # ring # ring-swagger # robots # rum # schema # sci # sfcljs # shadow-cljs # _silence # sim-testing # sioux-falls # slack-help # sneer # sneer-br # spacemacs # specmonstah # specter # speculative # spirituality-ethics # sql # startup-in-a-month # sydney # test200 # test-check # testing # thejaloniki # timbre # tmp-json-parsing # tools-build # tools-deps # trading # tree-sitter # uncomplicate # unrepl # untangled # utah-clojurians # videos # vim # vrac # vscode # wasm # web-security # windows # xtdb # yada # yleinen

Apps

clojure

New to Clojure? Try the #beginners channel. Official docs: https://clojure.org/ Searchable message archives: https://clojurians-log.clojureverse.org/

p-himik 2020-09-27T03:41:48.210200Z

Huh, so "Cartesian product of functions" != "Cartesian product of collections of functions and arguments". TIL

(defn X [f1 f2]
  (fn [x y]
    [(f1 x) (f2 y)]))

2020-09-27T15:42:36.214900Z

If I have a websocket component that has an internal clojure.core.async/mult to distribute all incoming messages, does it make sense to have that component extend the clojure.core.async/Mult protocol? Is that the intended use of the protocol? or is it better to just make my own tap functions in my own namespace

vemv 2020-09-27T15:58:07.215500Z

Given N sets, how do I find out the largest possible set (or sets) that is a subset of all N sets?

(magic [#{1 1} #{1 2} #{1 3}]) ;; =&gt; #{1}

Brute-force solutions wouldn't cut it for the size of my inputs

p-himik 2020-09-27T16:08:07.215700Z

clojure.set/intersection.

vemv 2020-09-27T16:13:44.217Z

That's not what I want, as the inputs can be disparate and most likely the intersection will be #{} 99% of the times So I should reword: of all N sets -> of the most sets as possible

p-himik 2020-09-27T16:15:55.217600Z

I still don't understand. What would your desired function return for [#{1 2} #{3 4}], for example?

p-himik 2020-09-27T16:17:58.218100Z

Or better yet, can you describe the algorithm? Because I cannot parse that sentence with "->". :)

chucklehead 2020-09-27T16:22:44.223800Z

something like the union of the results of intersecting of each subset with the union of all subsets?

👀 1

p-himik 2020-09-27T16:24:22.225400Z

Uhm, that would be just the union of the input, no?

vemv 2020-09-27T16:24:31.225700Z

> What would your desired function return for [#{1 2} #{3 4}], for example? that's a pretty degenerate case (for my needs) since it's very small and there's nothing in common. I don't care if it returns #{} or something else I'm trying to find the largest possible subsets given N sets of size ~~1000 to~~ 100000 each, which are presumed to have quite a lot in common. There will be many possible answers, and what a 'good' anwer is can be tweaked. e.g. maybe for me a set with 300 elements that is a subset of 40 other sets (but not a subset of 20 other sets), is interesting enough

2020-09-27T16:25:55.229200Z

Combine the sets into a bag having all elements. Then use frequencies on the bag to find the largest count?

👍 1

p-himik 2020-09-27T16:27:45.230700Z

What would that return for [#{1 2} #{1 2} #{3} #{3}]? The frequencies are the same, yet there's no #{1 2 3} subset.

chucklehead 2020-09-27T16:29:20.230800Z

yeah, oops, I guess you could intersect each subset with union of all the other subsets?

p-himik 2020-09-27T16:30:58.232200Z

I don't know. I think the task is not defined clearly enough for us to come up with any robust solution.

p-himik 2020-09-27T16:31:40.233Z

But it feels like a task of turning a 2D metric into a 1D one, and that always requires some particular approach to dimensionality reduction.

vemv 2020-09-27T16:37:21.236800Z

#{1 2} seems a good answer, because its size is large, even if it's not a subset of all inputs. Ultimately I'm not seeking an unequivocal mathematical function, but something that gives good answers to humans for the desired size range (1000-100000). at sizes 1-10 it's probably easy to get lost because the differences are too small to be evident. for example, for an input of 100 sets sized 10000 each: * an answer of a set sized 5000 which is a subset of 80 sets is very valuable * an answer of a set sized 9999 which is a subset of 1 set is not very valuable * an answer of a set sized 2 which is a subset of 80 sets is not very valuable

vemv 2020-09-27T16:39:07.238200Z

...maybe my problem is too specific, sorry for that. at first I thought it could be a commonly solved problem.

vemv 2020-09-27T16:41:52.238300Z

yes, counting seems a good first step :) I hope it doesn't lead to an exhaustive search

p-himik 2020-09-27T16:44:41.240Z

The main issue is that you have 2 numbers (subset size and the amount of its supersets) that you have to reduce to just one ("score" of a particular subset). It's a dimensionality reduction problem, and there never is a cookie-cutter solution. You will have to somehow write that function that takes two numbers and outputs just one.

vemv 2020-09-27T16:47:18.240100Z

thanks! dimensionality reduction seems a good read

p-himik 2020-09-27T16:48:15.240300Z

And a deep rabbit hole. :)

p-himik 2020-09-27T16:49:17.240600Z

This is a quick informative read: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006907