Tracking Software Contributionsdoi: 10.6084/m9.figshare.705892Joint ORCID – DRYAD Symposium on Research Attribution23 May 2013, OxfordNeil Chue Hong (@npch)ORCID: 0000-0002-8876-7606N.ChueHong@software.ac.uk Unless otherwise indicatedslides licensed under
Lifecycle Authorship Research Papers, data, software areall research outputs ofwhat is a continuous cycle. With software, technologymakes it easier to trackand measure, but notreward. And we cannot separatepapers, data and softwarewhen we release research. • Cite • Identify Index Reuse
Versioning • Why do we version? • To indicate a change • To allow sharing • To confer special status Version control systems make this easy Public v1 Public v2 Public v3 Personal v3 Personal v3a Personal v1 Personal v2 Personal v2a Personal v2a
Granularity Function Algorithm Program Library / Suite / Package …
Boundary • What do we choose to identify: • Workflow? • Software that runs workflow? • Software referenced by workflow? • Software dependencies? • What’s the minimum citable part?
Authorship Authorship • Which authors have had what impact on each version of the software? • Who had the largest contribution to the scientific results in a paper? • Can micro-attribution work? Can track author, but not contribution? • http://beyond-impact.org/?p=175 OGSA-DAI projects statistics from Ohloh
5 Stars of Research Software • Community • There is a community infrastructure • Open • Software has permissive license • Defined • Accurate metadata for the software • Extensible • Usable, modifiable for my purpose • Runnable • I can access and run software C R O E D • c.f. • 5 Stars of Linked Data (Berners-Lee) • 5 Stars of Online Journals (Shotton) “Golden Star” Originally by Ssolbergj CC-BY