Hey HN!
Pipevals is early and rough (this is a learning project), but usable.
It currently lets you:
- build evaluation pipelines as graphs
- run them against datasets
- track how output quality changes over time