How can we organise for Gigantum to join forces with/lead an effort to build a Jupyter wide telemetry system? The benefit for Gigantum is that data is collected from the larger Jupyter user and the advantage for Jupyter is that we get the tooling needed to collect telemetry. Seems like a win win.
To increase the chances that users opt-in to this data collection it would have to make an effort to go above and beyond in terms of privacy, anonymisation and the need to trust those who are collecting the data. IMHO a goal to strive for is to setup things so that notebook users at Netflix/Bloomberg/etc could be allowed to turn this on.
Adding some links on how others have approached the privacy challenges:
- RAPPOR https://ai.google/research/pubs/pub42852
- Prio https://crypto.stanford.edu/prio/
- https://hacks.mozilla.org/2018/10/testing-privacy-preserving-telemetry-with-prio/
I need to do a bit more digging in my history/bookmarks for more resources from the big tech organisations who have a solution (even if it is a crappy one) to most of the privacy and technical issues involved.
As with all things crypto: the only easy thing is to screw it up