clojure-europe

For people in Europe... or elsewhere... UGT https://indieweb.org/Universal_Greeting_Time
dharrigan 2020-10-28T06:12:48.444500Z

Good Morning!

javahippie 2020-10-28T06:20:08.444700Z

✌️

synthomat 2020-10-28T06:33:16.445100Z

Good morning :spock-hand:

slipset 2020-10-28T07:05:18.445400Z

Good morning

2020-10-28T07:12:49.445600Z

Morning

dominicm 2020-10-28T08:27:17.445800Z

Morning :)

thomas 2020-10-28T08:44:46.446Z

morning

javahippie 2020-10-28T09:15:33.446800Z

What algorithm would you choose to compare post addresses? My current project is using Jaro-Winkler, and it’s horrible 😕

javahippie 2020-10-28T09:16:57.447800Z

I believe every approach that uses a string distance is a bad one for structured addresses. Leave out a ZIP code, and the distance explodes. Have similar words with different meanings, and you get a false match

javahippie 2020-10-28T09:17:30.448Z

Normalizing the data is, of course, not possible

2020-10-28T09:27:45.448800Z

equivalence of eszetts and double s is a personal favourite

2020-10-28T09:28:07.449200Z

similarly for umlauts

javahippie 2020-10-28T09:30:48.451200Z

Yeah. Or “P.O. Box”, “Post Box”, “P.O”… “Street”, “Str.”, … Just blindly applying a function to a string won’t do any good, you’d need to intelligently tokenize and normalize. Oh, and all of the addresses are global, of course. Germany, China, the U.S.A. and Chile alone don’t have comparable address formats in general
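To make the "tokenize and normalize" idea concrete, here is a minimal Python sketch, not anything from javahippie's project. It assumes only the variants mentioned in the chat (eszett vs. double s, umlauts, P.O. box spellings, "Str." vs. "Street"). The names `normalize_tokens`, `UMLAUTS`, `PO_BOX` and `SYNONYMS` are made up for illustration, and a real table would have to be maintained per country and language.

```python
import re
import unicodedata

# Hypothetical transliteration table for German umlauts.
UMLAUTS = str.maketrans({"ä": "ae", "ö": "oe", "ü": "ue"})

# Hypothetical pattern for the P.O. box spellings mentioned above.
PO_BOX = re.compile(r"\b(?:p\.?o\.?(?:\s*box)?|post\s*box)(?=\s|$)")

# Hypothetical, deliberately tiny synonym table; a real one would be per country.
SYNONYMS = {"str": "street"}


def normalize_tokens(address):
    """Return a list of canonical tokens for a free-text address line."""
    text = unicodedata.normalize("NFC", address)  # compose combining accents
    text = text.casefold()                 # lower-cases and maps "ß" to "ss"
    text = text.translate(UMLAUTS)         # "ü" becomes "ue", and so on
    text = PO_BOX.sub(" pobox ", text)     # collapse P.O. box variants into one token
    tokens = re.findall(r"[^\W_]+", text)  # split on punctuation and whitespace
    return [SYNONYMS.get(token, token) for token in tokens]
```

With this, `normalize_tokens("Müllerstraße 1")` and `normalize_tokens("Muellerstrasse 1")` produce the same tokens, so any later comparison sees them as equal. It does nothing for suffixed forms such as "Hauptstr." or for non-Latin scripts, which is part of why the problem is hard.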

javahippie 2020-10-28T09:33:08.452Z

The main issue is that every time the software matches something incorrectly, somebody creates an issue I have to investigate 🙈

😱 1
synthomat 2020-10-28T09:33:29.452400Z

that algorithm sounds like a case for a new SaaS business

javahippie 2020-10-28T09:34:35.452600Z

:thinking_face:

dominicm 2020-10-28T09:39:18.453400Z

Bucks and Buckinghamshire is a fun one too

RAMart 2020-10-28T09:39:36.453600Z

> that algorithm sounds like a case for a new SaaS business

Good luck with the GDPR compliance...

javahippie 2020-10-28T09:41:35.455Z

The issue is that the customer cannot understand that “I want to find a business partner with a similar address in my database of 3.000.000 addresses” is not an easy problem to solve.

synthomat 2020-10-28T09:42:28.455800Z

what’s wrong with gdpr? as far as I understood it’s only about addresses and not names

synthomat 2020-10-28T09:46:29.456100Z

@javahippie “similar address” is very broad 🙂 does it need to go by street?

javahippie 2020-10-28T09:48:45.457800Z

It’s “Company, Street/P.O Box, Zip Code, City, Country”. But it can be anything somebody in an office enters. Things that also appear sometimes: District, building, floor, office number…
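One way around the "leave out a ZIP code and the distance explodes" problem is to score field by field and skip fields that are missing on either side. The Python sketch below assumes the address has already been split into the five fields listed above, which the chat makes clear is not guaranteed. It swaps in `difflib.SequenceMatcher` from the standard library as the similarity measure, not Jaro-Winkler. `FIELDS` and `field_similarity` are hypothetical names.

```python
from difflib import SequenceMatcher

# Assumed field names, taken from the format described in the chat.
FIELDS = ("company", "street", "zip", "city", "country")


def field_similarity(a, b):
    """Average per-field similarity over the fields present in both records."""
    scores = []
    for field in FIELDS:
        left, right = a.get(field), b.get(field)
        if not left or not right:
            continue  # a missing field is skipped, not counted as a mismatch
        matcher = SequenceMatcher(None, left.casefold(), right.casefold())
        scores.append(matcher.ratio())
    # No comparable field at all: treat as no evidence of a match.
    return sum(scores) / len(scores) if scores else 0.0
```

Two records that differ only in that one of them lacks a ZIP code score 1.0 here, whereas a distance computed over the concatenated string would penalize the missing characters. Skipping fields also means sparse records match more easily, so a real pipeline would probably want a minimum number of compared fields.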

javahippie 2020-10-28T09:49:10.458200Z

Never said that to a customer but... I believe they need an AI 😄

RAMart 2020-10-28T09:51:05.458300Z

Good luck debugging the AI when the customer reports the next "wrong match". Hm... I use this "good luck" phrase too often. 🙈

RAMart 2020-10-28T09:52:44.458500Z

The address sent to your SaaS could be anything, including names, persons and the like.

ordnungswidrig 2020-10-28T12:20:00.459100Z

insert universal greeting

borkdude 2020-10-28T12:24:02.459900Z

Howdy! Made this script to detect code using some spec pattern: https://gist.github.com/borkdude/a391146ad81a06c28fb97ccdc1f64d44 I'm considering building this out into a library.

borkdude 2020-10-28T12:24:44.460600Z

Not sure if spec would be the way to go or malli. I guess that's a typical 2020 Clojure problem. As of now, it would be spec, but in the future malli might be more flexible

slipset 2020-10-28T12:26:54.461500Z

@borkdude while you're here. I was thinking about something we've probably discussed before: clj-find-usages

slipset 2020-10-28T12:27:39.462500Z

Which would be something I could invoke (a bit like clj-kondo) from emacs which would statically analyze my project and find the usages of some symbol.

slipset 2020-10-28T12:28:01.463100Z

Basically, my problem is that the find-usages in Cider is not Working(TM)

borkdude 2020-10-28T12:28:02.463300Z

I think some plugins already do this. It's possible using clj-kondo's analysis output.

borkdude 2020-10-28T12:30:33.463900Z

@slipset One example is https://github.com/didibus/anakondo which provides completions, but could in theory also jump to definition. Maybe it can be extended with usages as well

borkdude 2020-10-28T12:30:58.464500Z

I see it's on their roadmap. Maybe you could help @didibus

borkdude 2020-10-28T12:31:50.465200Z

But for the spec tool I'm considering, I possibly want to support patterns using fully qualified symbols so alias usage will match on that as well

borkdude 2020-10-28T12:33:37.465900Z

e.g. (require '[foo :as f]) (f/dude) and searching with foo/dude will give you the match for (f/dude)
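The alias matching described here can be sketched in a few lines of Python. This is only an illustration of the idea, not how grasp or clj-kondo implement it: it assumes the alias map (`{"f": "foo"}`) and the list of symbols have already been extracted from the source, and `qualify` and `find_usages` are hypothetical names.

```python
def qualify(symbol, aliases):
    """Expand an aliased symbol such as "f/dude" to "foo/dude"."""
    namespace, sep, name = symbol.partition("/")
    if not sep:
        return symbol  # unqualified symbol, nothing to expand
    # Unknown prefixes are assumed to be full namespace names already.
    return f"{aliases.get(namespace, namespace)}/{name}"


def find_usages(target, symbols, aliases):
    """Return the symbols that resolve to the fully qualified target."""
    return [s for s in symbols if qualify(s, aliases) == target]
```

Searching for `foo/dude` among `["f/dude", "g/dude", "foo/dude", "dude"]` with the alias map `{"f": "foo"}` returns `["f/dude", "foo/dude"]`.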

raymcdermott 2020-10-28T19:23:45.467700Z

For matching addresses -> https://github.com/openvenues/libpostal

😲 1
raymcdermott 2020-10-28T19:24:39.468700Z

there are also higher level language bindings though it seems like the Java lib hasn't been touched for a while

javahippie 2020-10-28T19:45:54.469400Z

That’s nice! Will suggest this, it should be possible to wire it into the pipeline. Thanks!

borkdude 2020-10-28T20:44:17.469700Z

Made a thing today: https://github.com/borkdude/grasp

2020-10-28T20:52:12.470200Z

as I saw on the #announcements woah

2020-10-28T20:52:26.470600Z

I think we all know kung fu now

2020-10-28T20:57:26.470800Z

and congrats 🙂

dominicm 2020-10-28T21:21:16.471400Z

Mailing lists: a cool way to receive updates or kinda lame? Thinking for software projects.

2020-10-28T21:43:45.471900Z

mailing lists are cool

2020-10-28T21:43:54.472100Z

everyone has email