data-science

Data science, data analysis, and machine learning in Clojure. See https://scicloj.github.io/pages/chat_streams/ for additional discussions.
2021-03-29T09:55:08.017900Z

I have another question about tech.ml.dataset: it seems that an Excel document with more than one tab/sheet causes the exception "Multiple (2) datasets found in file". Is there a way to configure it to select a specific sheet?

2021-03-29T09:55:55.018Z

I looked in the docs, but couldn't find anything about this. @chris441 Do you have any ideas on how to solve multiple tabs/datasets?

chrisn 2021-03-29T13:15:21.018300Z

@david043 - Yes - the ->dataset pathways are set to produce a single dataset, and at this point I do not have an option to select one out of potentially many, which is an interesting idea. In either https://techascent.github.io/tech.ml.dataset/tech.v3.libs.poi.html or https://techascent.github.io/tech.ml.dataset/tech.v3.libs.fastexcel.html you can find methods that will return a sequence of datasets from an xlsx or xls file.

2021-03-29T14:07:55.018500Z

Thanks! I did a quick experiment with workbook->datasets and I think it will be perfect 😄
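
For anyone following along, a rough sketch of what such an experiment might look like. It assumes an .xlsx file, the tech.v3.libs.fastexcel namespace linked above (which needs the fastexcel reader dependency on the classpath), and that each returned dataset is named after its sheet. The helper `sheet-by-name`, the file name, and the sheet name are hypothetical, made up for illustration.

```clojure
(require '[tech.v3.dataset :as ds]
         '[tech.v3.libs.fastexcel :as xlsx])

;; Hypothetical helper: read every sheet of the workbook as a dataset,
;; then keep the first one whose dataset name matches the sheet name.
;; Assumption: datasets returned by workbook->datasets carry the sheet
;; name as their dataset name.
(defn sheet-by-name
  [path sheet-name]
  (->> (xlsx/workbook->datasets path)
       (filter #(= sheet-name (ds/dataset-name %)))
       first))

;; Usage with placeholder file and sheet names; returns nil if no sheet matches.
(comment
  (sheet-by-name "report.xlsx" "Sheet2"))
```

For .xls files, the tech.v3.libs.poi namespace should work the same way in place of fastexcel, given the Apache POI dependency.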

jsa-aerial 2021-03-29T14:12:42.018700Z

@david043 also, generally, DS hangs out in Zulip. For TMD in particular, try https://clojurians.zulipchat.com/#narrow/stream/151924-data-science/topic/tech.2Eml.2Edataset

2021-03-29T16:16:58.019800Z

I’ll check it out! Wasn’t aware of that platform 😄