Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
If you’ve ever spent a Sunday afternoon building worksheets from scratch, formatting questions, writing answer keys, and adjusting everything for three different reading levels, you already know how ...