For various use-cases we're working on overhauling the logging in timely dataflow, with a few requirements in mind:
- instrumentation should be low/zero overhead when it's not in use;
- we should be able to stream the logs out to a separate computation/system for real-time analysis (this is specifically for some research use-cases and should be discussed);
- we'd like to be able to replay the instrumentation in the same computation that produced it, if at all possible.