2 comments

  • rbalicki 2 days ago ago

    Very cool! This lets you grade output across different base models. Does it also allow you grade output across different prompts?

    • randall 2 days ago ago

      that’s the next step… we have a structured approach to prompting too that we think will help people build better prompts too.