Testing notebooks

hamel · June 6, 2020, 7:33am

Yes please do let me know if anyone wants to use that Action or has any feedback, I will gladly make changes or improvements!

MSeal · July 7, 2020, 4:08pm

Also our Google Summer of Code project in nteract had it’s first initial release for testing notebooks: https://test-book.readthedocs.io/en/latest/. It treats ipynb files as py files for testing purposes. Try it out, @rohitsanj is looking for early users to get feedback on the library.

Given a code cell in a Jupyter Notebook:

def func(a, b):
   return a + b

You can write a unit test using testbook in a Python file as follows:

import testbook

@testbook.testbook('/path/to/notebook.ipynb', execute=True)
def test_func(tb):
   func = tb.ref("func")

   assert func(1, 2) == 3

betatim · July 7, 2020, 8:26pm

Can I pass/set a relative path as well in the testbook decorator? Or asking my real question: what is the idea for dealing with notebooks and their tests being moved around on the filesystem?

MSeal · July 8, 2020, 5:05am

Yes the notebook path can be relative, it just does an open based on the current working directory. Though what you’re maybe getting as is it should be relative to the test file? It doesn’t do that today, but I believe is possible to do and would be a good issue to open if you wanted to chat through options / expectations.

rohitsanj · July 8, 2020, 6:10am

what is the idea for dealing with notebooks and their tests being moved around on the filesystem?

The idea was to follow conventional unit testing norms of having the source code (notebook) and tests as separate entities.

As to tests being moved around… that’s again an issue where the user needs to make sure that all the test modules are organized and can provide a uniform relative path to the notebooks which could be at least one directory up.

Something like this…

notebooks/
tests/

And the relative path in the testbook decorators will be ../notebooks/foo.ipynb and so on.

betatim · July 8, 2020, 9:36am

Relative paths sound great. I was mainly thinking about the case where there is a repository of notebooks + tests that you checkout to some different absolute path than I. Or me moving the directory around.

Relative paths sounds like a good solution. I don’t have any experience/inputs, just curious to find out what you were thinking as you are one of the world wide experts on this because you actively work on this

hamel · July 15, 2020, 1:40pm

Yes I have an example:

github.com

github/covid19-dashboard/blob/fae7d2c634eddd0d43136c162ac9fd55ad65fc43/.github/workflows/update-nb.yaml#L66


      
              sudo apt-get update -y
              sudo apt-get -y --force-yes install chromium-chromedriver
              npm install -g electron@6.1.4 orca
              pip3 install -r ./_notebooks/requirements.txt
              python3 -m ipykernel install --user --name python3
              sudo chmod -R 777 .
          
          - name: update notebooks
            id: update_nb
            run: |
              ./_action_files/run_notebooks.sh
          
          - name: Create an issue if notebook update failure occurs
            if: github.event_name == 'schedule' && steps.update_nb.outputs.error_bool == 'true'
            uses: actions/github-script@0.6.0
            with:
              github-token: ${{secrets.GITHUB_TOKEN}}
              script: |
                var err = process.env.ERROR_STRING;
                var run_id = process.env.RUN_ID;
                github.issues.create({

This calls a bash script

github.com

github/covid19-dashboard/blob/fae7d2c634eddd0d43136c162ac9fd55ad65fc43/_action_files/run_notebooks.sh#L12


      
          set -e
          cd $(dirname "$0")/..
          cd _notebooks/
          
          ERRORS=""
          
          for file in *.ipynb
          do
              if [ "${file}" = "2020-03-16-covid19_growth_bayes.ipynb" ]; then
                  echo "Skipping ${file}"
              elif papermill --kernel python3 "${file}" "${file}"; then
                  echo "Sucessfully refreshed ${file}\n\n\n\n"
              else
                  echo "ERROR Refreshing ${file}"
                  ERRORS="${ERRORS}, ${file}"
              fi
          done
          
          # Emit Errors If Exists So Downstream Task Can Open An Issue
          if [ -z "$ERRORS" ]
          then

This is used to refresh notebooks with papermill with CI on a recurring schedule for https://covid19dashboards.com/

cc: @choldgraf

MSeal · July 16, 2020, 12:40am

This is used to refresh notebooks with papermill with CI on a recurring schedule for https://covid19dashboards.com/

Those are really neat btw @hamel ! And that’s a good example of some easily combined tools to add real value for folks.

Several jupyter related libraries use a pattern like this: https://github.com/nteract/papermill/blob/2d26912065575e955595e711fee3f4c415e8ea81/tox.ini#L31-L35 to also test notebooks used in documentation or example directories. I’m probably going to transition those to testbook in combination with How to parametrize fixtures and test functions — pytest documentation to make it more visible and have better debugging for failures in the future.

edublancas · July 5, 2022, 11:48am

I started working on a project to test notebooks with non-deterministic outputs. For example, a cell that prints the accuracy of a model is expected to vary from one run to the other. The idea is to keep a record of the previous cell’s outputs and throw an error when the output deviates too much from the expected range (right now the range is defined as 3 standard deviations from the mean).

Currently, it only works with numeric outputs. I’ve been thinking of adding support for plots.

I’d love to hear what others think!

Topic		Replies	Views
Binder Notebook Builder Bot Binder	6	1254	April 14, 2020
Binder as part of a test framework Binder	5	721	December 3, 2018
Creating a future infrastructure for notebooks to be submitted and peer-reviewed Publishing	25	4778	September 17, 2020
GitHub Actions + Binder Binder community , how-to	7	2354	November 22, 2019
Embed binder-related metadata in notebook? Binder	8	1338	August 11, 2021

Testing notebooks

Related topics