Analysis on thousands of notebooks for JupyterCon 2020: what would you like to see?

dcgrigsby · August 21, 2020, 4:11pm

I’ve collected a few tens of thousands of Jupyter notebooks upon which I’ve been doing analysis. I’ll be turning what I’ve learned into a short presentation for the upcoming JupyterCon.

Alongside the presentation, I’ll be publishing the corpus of notebooks and the code for running the map-reduce jobs that do the analysis.

Ahead of that, I thought I’d write and ask for suggestions from this community. Are there things you’d like to learn, questions you’d like to answer, visualizations you’d like to see, or anything related to using Jupyter notebooks as a data source in and of themselves?

I’ve got enough tooling that I should be able to incorporate your suggestions into the presentation before the deadline.

Thank you,

Dan Grigsby

nawar29 · August 26, 2020, 3:39am

Maybe more on integrations with file systems like S3 and whatnot. JupyterDash stuff would also be a pretty useful thing to analyze.

nicolaskruchten · August 27, 2020, 7:34pm

I’d love to see some stats on which libraries people are importing, and which functions within those modules people are calling! I saw a study where folks tried to figure out what kinds of charts people were making in their notebooks by looking at which matplotlib functions were being called. That’s really interesting to me, although I’d want to know the same for Plotly.
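For what it’s worth, here is a minimal sketch of how the import-counting part might look for a single notebook. It is not the tooling from the original post. It assumes the notebook is in the standard nbformat 4 JSON layout and that the code cells are Python; the function name `count_imports` is made up for illustration.

```python
import ast
import json
from collections import Counter


def count_imports(notebook_json: str) -> Counter:
    """Count top-level package names imported in a notebook's code cells.

    Hypothetical helper: assumes nbformat 4 JSON and Python code cells.
    """
    notebook = json.loads(notebook_json)
    counts = Counter()
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        # nbformat stores source as either one string or a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        # Drop IPython magics and shell escapes, which are not valid Python.
        lines = [
            line for line in source.splitlines()
            if not line.lstrip().startswith(("%", "!"))
        ]
        try:
            tree = ast.parse("\n".join(lines))
        except SyntaxError:
            continue  # skip cells that still do not parse
        for node in ast.walk(tree):
            if isinstance(node, ast.Import):
                for alias in node.names:
                    counts[alias.name.split(".")[0]] += 1
            elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
                counts[node.module.split(".")[0]] += 1
    return counts
```

Summing the returned counters across notebooks would give corpus-wide totals, which fits a map-reduce style job. Counting which functions get called is harder, since it means tracking aliases such as `px` or `plt` back to the module they came from.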
