Arthur Offers Tool to Compare Large Language Model Systems
t was fun while it lasted, but the “how cool is that?” stage of generative AI adoption is now being replaced by the boring work of figuring which products are best for your purpose. Arthur is making things a bit less painful with Arthur Bench, an open source tool to compare performance of large language model systems. They’ve also published the first in a series of comparisons among popular products.