Comparing models is just as important in Large Language Model Ops (LLMOps) as it is in MLOps, but the process for doing so is a little less clear. In “classical” machine learning, it usually suffices to compare models on a set of clear numerical metrics; the model with the better score w...