I’m here at CSVConf and just heard about an interesting project that’s sponsored by the Open Knowledge Foundation. It’s called FrictionlessData and it seems to be an attempt at defining specifications for data of varying kinds, with the goal of making it easier to share, discover, ingest, etc.
It seems like it might be an interesting avenue to pursue if we wanted to extend the repo2docker spec to include data that doesn’t live within the repo itself.
I’ll try to keep looking into it while I’m here, but flagging it here in case anybody else has worked with it before.
Docs here: https://frictionlessdata.io/docs/
Specs page here: https://frictionlessdata.io/specs/
For example, tabular data spec here: https://frictionlessdata.io/specs/tabular-data-resource/