A datasette of mybinder.org launches

I wanted to explore the data about launches on mybinder.org that we collect in our analytics. I also wanted to learn about datasette. As a result there is now https://binderlytics.herokuapp.com/binder-launches/binder

It contains the last 90days of data, about 1.6million rows 270days of data, about 4.4million rows.

Please share interesting things you find, queries you use, things that look a bit weird. Here are some I used:


The notebook used to create the DB is https://github.com/betatim/binder-datasette/blob/fcbf0fc9d468fc46aadde8c7cf762964ac589c1a/create-db.ipynb. I will probably re-run it with more than the last 90 days. Maybe even all ~500 days of data. Resulting in a dataset of about 7-8million rows!

6 Likes

Now with ~4.4 million launches or 270 days of history.

This is fabulous. Thank you @betatim.

1 Like

Graph of launches by provider per day (warning: takes a while for the graph to load). Not suprisingly it’s dominated by GitHub, I couldn’t find a way to display log10(count(provider)). You can easily distinguish weekday vs weekend though.


Least popular GitHub repos (only 1 launch ever)

1 Like

The chart is super cool. Even super cooler is that the link contains all the info so it just works when I click it :slight_smile:

Based on your “less popular repos”: a list of “somewhat popular repos” (more than 5 but lass than 50)