Repo2DockerSpawner - alternative version

betatim · April 15, 2020, 8:09am

We have had “a ZIP file provider” on the roadmap for a while for repo2docker: https://github.com/jupyter/repo2docker/issues/812

This would be a good contribution to get started learning about how the content provider part of repo2docker works. I think we already have some ZIP file (or archive) handling in the Zenodo/Figshare providers that you can look at for inspiration.

I’d implement the caching based on the value of the ETag header that a server sends. This needs the server to cooperate a bit (i.e. send an ETag header), but I think almost all web servers do that today. My idea would be to use the ETag value the same way we use the resolved commit hash of a Git repository. This means a ZIP file content provider would make a HEAD request to get the ETag value and, based on that, decide whether it needs to build or not.
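
A rough sketch of the HEAD request idea, using only the Python standard library. The names `etag_to_ref`, `fetch_etag`, `needs_build` and the `built_refs` set are made up for illustration; they are not part of repo2docker's content provider API. Treating weak and strong ETags as equivalent and hashing the value are also my own simplifications.

```python
import hashlib
import urllib.request


def etag_to_ref(etag):
    """Turn a raw ETag header value into a fixed-length reference string."""
    if not etag:
        return None
    value = etag.strip()
    # Drop the weak-validator prefix and the surrounding quotes: W/"abc" -> abc
    if value.startswith("W/"):
        value = value[2:]
    value = value.strip('"')
    if not value:
        return None
    # Hash the value so it can play the role a resolved commit hash plays for Git.
    return hashlib.sha1(value.encode("utf-8")).hexdigest()


def fetch_etag(url, timeout=10):
    """Make a HEAD request and return the ETag header, or None if it is absent."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.headers.get("ETag")


def needs_build(url, built_refs, fetch=fetch_etag):
    """Decide whether the archive at `url` has to be built (again).

    `built_refs` is a hypothetical set of references that already have an image.
    """
    ref = etag_to_ref(fetch(url))
    # Without an ETag we cannot tell whether the content changed, so build.
    return ref is None or ref not in built_refs
```

The `fetch` parameter only exists so the decision logic can be tried without a network connection, for example `needs_build(url, built_refs, fetch=lambda u: '"abc123"')`.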

I think a ZIP file fits very well with the Binder philosophy. While it all started with Git repositories on GitHub we now support lots of other content providers. In hindsight maybe repo2docker is doubly misnamed:

it should be “directory-like-thing” instead of “repo”
it should be “container”, not “docker”

Though I guess repo2docker is a bit more catchy than directory-like-thing2container. For sure it is less to type.
