Good news: 44,583 usable responses!

Bad news: Responses alone use 4,592,255 cells on Google Sheets, which is 92% of the 5m cell limit, before I even start writing formulae to count everything!

Good news: I've split it into one Sheet per question, and the data is now processed!

Bad news: This means I will definitely have to (*gulp*) learn new software and maybe even upgrade my computer in order to handle next year's responses, assuming they increase at a similar rate.

Good news: Maybe tomorrow I can start writing up the worldwide report for 2021??

Please note, this is very optimistic, I may not manage it because I have a LOT of fatigue today. On the other hand, it might be a pretty effective way to keep me resting on the sofa. 🤔

I wish I could somehow crowdfund "Cassian learns a new software" - no amount of money will make my brain do the thing, you know?

On that note! Statisticians, can you help out?

I'm going to need some software that can handle a LOT of calculations on a fairly average computer, involving probably 50,000 survey responses. The most important factor is user-friendliness.

Any recommendations?

Please use PostgreSQL @gendercensus !
It'll do 50,000 calculations instantly an average computer, or at max 10 secs on a toaster, and save you lots of headaches in the future!

@foxxy Sounds good so far. Can it be used without any coding experience at all?

@gendercensus @foxxy Not particularly. Though depending on how you wanna do it @foxxy and I might be able to assist.

@doxxy @foxxy That's promising! Thank you, I will make a note to look into it when it's not late o'clock!

@gendercensus @doxxy @foxxy I too would be willing to extend assistance there!

One thing I'd point out here, philosophically, is that Excel is a program designed to trick you into thinking you're not "coding" when you use it. SQL is much more nakedly 'code', but it's very much the same type of code as Excel formulas; more of your current skills will transfer than you might think.

@Nentuaby @gendercensus @doxxy @foxxy SQL for a statistics problem is very much a "if I have a hammer, everything looks like a nail."

Vanilla SQL doesn't do much in terms of visualization, summary statistics, or hypothesis testing. And since smartsurvey delivers results in CSV and SPSS, there's little reason to go into SQL for this sort of thing.

Follow

@cbrachyrhynchos @Nentuaby @gendercensus @foxxy This is probably why r lang and julia was mentioned. Lots of tools there for easy data visualization.

Sign in to participate in the conversation
Critter Camp - Mastodon

This server is part of a collection of services that are mostly federated and run by a couple of furries.