Skip to main content

Long eval clean up times

Speed up Weave evaluation cleanup by flushing data and increasing client parallelism.

What is pairwise evaluation and how do I do it?

Compare two model outputs with pairwise evaluation using a custom scorer in Weave.