Would a "The Littlest Binder" be useful?

yuvipanda · May 9, 2019, 6:12pm

Continuing the discussion from PyCon 2019 and mybinder.org:

For a similar use case, I’ve been thinking of building ‘The Littlest Binder’ (maybe as a plugin to The Littlest JupyterHub). Note it says ‘Binder’ and not ‘BinderHub’ and that is intentional!

The idea is that you can set up a machine that’ll serve ephemeral sessions (similar to BinderHub) to anyone who joins, but only for the one repository you set up. So if you have a single repository you wanna use for your workshop / talk, you can set up one VM with “The Littlest Binder” pointing to your repo, and give that link out to folks. When your workshop is done, you can tear it down. The links would no longer work (so you should also give them mybinder.org links!), but it gives you a lot more control and power than now.

Would something like this be useful?

betatim · May 9, 2019, 6:40pm

We should figure out how to combine it with “Creating a new Binder-at-home tool”, or where there is/isn’t overlap.

minrk · May 9, 2019, 7:47pm

I think this is useful in principle. To some degree, this is what tmpnb was already - given an image, serve anonymous instances. Add a call to repo2docker to build the image and you are there.
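A minimal sketch of that "call to repo2docker" step, assuming the `jupyter-repo2docker` command-line tool is installed and Docker is running. The function names and the image name `workshop-image` are hypothetical, and the URL is the placeholder used elsewhere in this thread; serving the resulting image anonymously is left to tmpnb or a JupyterHub.

```python
import subprocess


def repo2docker_build_command(repo_url, image_name, ref=None):
    """Return the repo2docker command line that builds (but does not run) an image."""
    cmd = ["jupyter-repo2docker", "--no-run", "--image-name", image_name]
    if ref is not None:
        # Pin the build to a branch, tag, or commit.
        cmd += ["--ref", ref]
    cmd.append(repo_url)
    return cmd


def build_image(repo_url, image_name, ref=None):
    """Run the build; needs repo2docker installed and a reachable Docker daemon."""
    subprocess.run(repo2docker_build_command(repo_url, image_name, ref), check=True)


if __name__ == "__main__":
    # Only prints the command, so this runs even without Docker.
    print(" ".join(repo2docker_build_command(
        "https://githost.com/myworkshop.git", "workshop-image")))
```

Calling `build_image(...)` with the same arguments would perform the actual build; the image name can then be handed to whatever serves the sessions.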

choldgraf · May 10, 2019, 1:11am

A quick clarification question - how would this be different from a tool that would run a bunch of system commands (assuming you’re on a linux machine perhaps) when given a repo that adhered to the repo2docker spec?

I think something like “repo2docker but without needing docker” would be pretty useful! Paired with a little bit of documentation, that would basically be “the littlest binderhub”, no?

betatim · May 10, 2019, 5:37am

I think what Yuvi is proposing is some syntactic sugar and pre-configured configuration that sets up a JupyterHub with NullAuthenticator and as part of the setup(?) runs repo2docker https://githost.com/myworkshop.git. A tmpnb cookie cutter?

Creating a version of repo2docker that doesn’t use any kind of container would be amazing, but I think impossible if you don’t allow VMs. A repo2container that is like r2d but doesn’t use the docker toolchain could be possible and exciting (no need for root), just a lot of work.

minrk · May 10, 2019, 7:20am

I assume it would be tmpauthenticator and DockerSpawner to match the tmpnb behavior, since that would be simplest. I figured it would not be replicating repo2docker without docker.

I think this would also be a great exercise for the hooks @yuvipanda put into tljh to see if they are sufficient for this case.

mathematicalmichael · May 11, 2019, 1:43am

yas please! definitely would make use of this

sgibson91 · May 15, 2019, 3:16pm

Hi, wanted to put my two cents in on this thread as I’ve just had a meeting with @KirstieJane and we may have a use case!

We have a case study that we’d like to include in the Turing Way book which way exceeds the computational requirements mybinder.org provides (see this issue), so to get around this I built a bespoke BinderHub on a cluster with bigger VMs. While we want anyone who comes along to the Turing Way book to be able to run this case study on this new BinderHub, we don’t want them to be able to run any repo they like.

The littlest binder that just hosts one repo would be ideal! Or is there a way that I can lock down my current Hub to only build one repo from the config?

Much appreciated!

KirstieJane · May 15, 2019, 6:55pm

+1 to what @sgibson91 said

choldgraf · May 15, 2019, 8:15pm

Y’all should check out a (I think undocumented) feature of BinderHub to ban repositories. This uses a regex, so I think you should be able to do something like “match anything that is not the repository you want to allow”.

See here for where this is configured in mybinder.org:

https://github.com/jupyterhub/mybinder.org-deploy/blob/cbc66897db5f12883770192ab02a3dfa0cb1040e/mybinder/values.yaml#L45


      - 443 # https
      - 9418 # git
      - 873 # rsync
      - 1094 # xroot
    cidr: 0.0.0.0/0


config:
  GitHubRepoProvider:
    # Add banned repositories to the list below
    # They should be strings that will match "^<org-name>/<repo-name>.*"
    banned_specs:
      # e.g. '^org/repo.*'
      - ^ines/spacy-binder.*
      - ^soft4voip/rak.*
  BinderHub:
    use_registry: true
    build_image: jupyter/repo2docker:25ebca04
    per_repo_quota: 100
    about_message: |
      <p>mybinder.org is public infrastructure operated by the <a href="https://jupyterhub-team-compass.readthedocs.io/en/latest/team.html#binder-team">Binder Project team</a>.<br /><br />
      The Binder Project is a member of <a href="https://jupyter.org">Project Jupyter</a>, which is a fiscally

We should definitely add this to the documentation. See here for an issue I just opened:

psychemedia · May 16, 2019, 9:43am

Been thinking again about a “littlest BinderHub” and how it might relate to a couple of contexts:

- a class that provides several Jupyter Book books that each run against a different Docker container;
- Electron-wrapped courses, as per the spaCy course (ANN) (which in turn makes me wonder: Electron-wrapped Jupyter Book?)

In this case, it might be useful to have an easily installed BinderHub environment that can launch one or more containers in order to support a range of courses / books?

See also the discussion around snakestagram / custom conda envts for baking into nteract app.

sgibson91 · May 16, 2019, 1:03pm

Thank you @choldgraf!

For anyone interested in this, the syntax is as follows:

config:
  GitHubRepoProvider:
    banned_specs:
      - ^(?!org\/repo).*$

where org and repo are the ones you want to allow.
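To see what this pattern does and does not ban, here is a small check in Python. It assumes that each `banned_specs` entry is tested against the start of the spec with a case-insensitive `re.match`, and that specs look like `org/repo/ref`; both are assumptions about BinderHub, and `is_banned` and `myorg/workshop` are hypothetical names used only for illustration.

```python
import re

# The pattern from the post above, with a hypothetical org/repo filled in.
ALLOW_ONLY = r"^(?!myorg\/workshop).*$"


def is_banned(spec, banned_specs=(ALLOW_ONLY,)):
    """Return True if any banned pattern matches the start of the spec."""
    return any(re.match(pattern, spec, re.IGNORECASE) for pattern in banned_specs)


for spec in ["myorg/workshop/master",
             "someone/else/master",
             "myorg/workshop-fork/master"]:
    print(spec, "->", "banned" if is_banned(spec) else "allowed")
```

The negative lookahead only checks a prefix, so a repository whose name merely starts with the allowed one (the third spec here) also gets through. If specs really are `org/repo/ref`, ending the lookahead with `\/` would probably close that gap.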

Will also add to the issue.

yuvipanda · May 19, 2019, 12:18am

I spent all day procrastinating on other important work I had to do, so here’s a demo of an instance of The Littlest Binder!

You can access it at http://34.74.126.139. I’ll probably keep that running for a few more days or so.

The core of this is a new repo2dockerspawner. Images are built if needed with repo2docker every time a user server starts. So if you are pointing to ‘master’ and it has new commits, new users will automatically get an image built from them.
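The "built if needed" idea can be sketched as follows. This is not the actual repo2dockerspawner code: the function names, the image naming scheme, and the three injected callables (resolve a ref to a commit, check whether an image exists, build an image) are all hypothetical, chosen so the logic can run without Docker.

```python
import hashlib


def image_name_for(repo_url, commit_sha):
    """Derive a deterministic image name from the repository URL and commit."""
    repo_hash = hashlib.sha256(repo_url.encode("utf-8")).hexdigest()[:12]
    return f"littlest-binder-{repo_hash}:{commit_sha[:12]}"


def ensure_image(repo_url, ref, resolve_ref, image_exists, build):
    """Return an image for the current commit of `ref`, building only if missing."""
    commit_sha = resolve_ref(repo_url, ref)
    image = image_name_for(repo_url, commit_sha)
    if not image_exists(image):
        build(repo_url, commit_sha, image)
    return image


if __name__ == "__main__":
    # In-memory stand-ins: a fake branch head and a fake image store.
    head = {"sha": "a" * 40}
    store = set()

    def start_server():
        return ensure_image(
            "https://githost.com/myworkshop.git", "master",
            resolve_ref=lambda repo, ref: head["sha"],
            image_exists=lambda image: image in store,
            build=lambda repo, sha, image: store.add(image),
        )

    first = start_server()
    second = start_server()      # same commit, so no rebuild
    head["sha"] = "b" * 40       # a new commit lands on master
    third = start_server()       # the next user gets a fresh image
    print(first == second, third != first, len(store))
```

Because the image name is keyed on the commit, a repeat launch reuses the existing image, while a new commit on the branch produces a new name and therefore a new build.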

There’s a lot of work left to do, but this is a good start. Soon enough, we can have a TLJH Plugin that turns it into The Littlest Binder. repo2dockerspawner probably has a lot of other single-node uses too.

psychemedia · May 19, 2019, 10:50am

Ooooh… exciting…

So is this “just” a JupyterHub with a spawner that gives it the ability to launch against a repo, rebuilding if necessary? So presumably the JupyterHub landing page could also give a user a selection of several environments - named docker containers, git repos - from which they could launch a specific environment?

And TLJH plugins look really interesting too…

minrk · May 19, 2019, 10:57am

Yeah, DockerSpawner already lets you pick from a whitelist of images, so mapping these onto repos instead should be a pretty small change.

If you require the admin to build images beforehand rather than enabling builds as part of spawn, then the default DockerSpawner + image_whitelist should be all you need. It’s less automatic and therefore less “bindery” but might fit, depending on your needs.
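A sketch of what that pre-built variant might look like in `jupyterhub_config.py`. The image names and labels are hypothetical. Inside a real config file JupyterHub supplies the `c` object itself; the stand-in on the first lines exists only so the snippet runs on its own. As far as I know, newer DockerSpawner releases call this option `allowed_images`, so check the version you have installed.

```python
from types import SimpleNamespace

# Stand-in for the config object JupyterHub normally provides as `c`.
# Drop these two lines when pasting into a real jupyterhub_config.py.
c = SimpleNamespace(JupyterHub=SimpleNamespace(), DockerSpawner=SimpleNamespace())

# Launch each user server in a Docker container.
c.JupyterHub.spawner_class = "dockerspawner.DockerSpawner"

# Label shown to the user -> image the admin built beforehand,
# for example with repo2docker (hypothetical names).
c.DockerSpawner.image_whitelist = {
    "Workshop, part 1": "workshop-part1:latest",
    "Workshop, part 2": "workshop-part2:latest",
}
```

With more than one entry, users get to pick an image when starting their server; nothing is built at spawn time, so updating an environment means rebuilding the image by hand.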

psychemedia · May 19, 2019, 12:34pm

So… erm… could / does TLJH support a py / conda env spawner? So you could have separate envts, user selectable, on TLJH?

[Update: hmm; I suppose a better way to offer different environments is just to install different kernels that a user could open a notebook with. ]

yuvipanda · May 19, 2019, 5:20pm

Yup, this is an easy next step.

Plugins help keep TLJH small and focused while allowing other cool things to build on top of it.

psychemedia · May 19, 2019, 7:50pm

Yes… the plugins look like a really useful thing…

And made me wonder… If I have a binder directory with a conda.yml file (for example), then it should be relatively easy to figure out what I’d need to put into an environment customisation plugin?

yuvipanda · May 19, 2019, 9:27pm

Yep! In fact, I’ve already written most of the code for that in https://github.com/pangeo-data/pangeo-stacks/blob/193204e330bb3771855b1225fa41ad4663d63aca/onbuild/r2d_overlay.py. Needs to be extracted into its own package though.

psychemedia · May 20, 2019, 11:04am

Just trying to keep up by explaining to myself what I think this all means / makes possible / requires!
