A few times in the Governance meetings we have mentioned being able to get some statistics that could help us better understand the different communities. I know @choldgraf has done some work and produced some summary reports based on the data found in the github repositories.
I also work on some open source projects at Apache and one of the projects there I think could be very useful in gathering community statistics. The project is called Apache Kibble and even though it is in incubation and still being developed, we can still use it to gather some potentially interesting information.
So what I’d like to do as a Proof of Concept (PoC) is to load some of the Jupyter repos into Kibble and see what we get. So I selected notebook and have loaded the data from the github repository and also the discourse forum.
Here are a couple of examples of what it can give us (This is the Notebook Github repo - I ran this yesterday and the time period is over the last 6 months)
And this is analysis of the Notebook discourse forum that ran yesterday and once again the time period is over the last 6 months.
I haven’t checked anything so would really like to get some feedback on the information to see whether it could be useful. I’d also like to know whether here in Governance the best place to have the discussion or should I take it to the Notebook community as this it might be more relevant to them?
If anyone wants access to the PoC Kibble instance then let me know and also if anyone has any suggestions for any other area or areas we could use import data for as part of the PoC then also please let me know.