Gehani, A., Kim, M., & Zhang, J. (2009, February). Steps toward managing lineage metadata in grid clusters. In First workshop on Theory and practice of provenance (pp. 1-9).
The lineage of a piece of data is useful in a wide range of domains. Several application-specific extensions have been built to track the origin of the output that software produces. To provide such support to existing programs, efforts have recently been made to develop operating system functionality that audits filesystem activity to infer lineage relationships. We report on our exploration of mechanisms to manage lineage metadata in Grid clusters.