Tag: Evaluating

Everything You Should Know About Evaluating Large Language Models

As open source language models become more readily available, getting lost in all the options is easy. How do we determine their performance and compare them? And how can we confidently say that one model is better than another? This article provides some answers by presenting training and eva...

Everything You Should Know About Evaluating Large Language Models

As open source language models become more readily available, getting lost in all the options is easy. How do we determine their performance and compare them? And how can we confidently say that one model is better than another? This article provides some answers by presenting training and eva...

Evaluating Superblocks

The implementation of the Superblock model in Barcelona has been a topic of fervent debate, drawing contrasting responses from various stakeholders. While the Poblenou Superblock project has garnered significant media attention globally, it has not been without its share of criticisms. In this artic...

Observing without evaluating

When I first saw this quote a few years ago, it was mystifying to me. I couldn’t reconcile decoupling evaluation from observation. But the more I considered the quote, and the more I familiarized myself with design thinking, the wisdom of this “ability” revealed itself. I discov...

Evaluating Image Segmentation Models

What is Image Segmentation? Image Segmentation is the process of partitioning a digital image into different image classes or objects. This is a very important part of self-driving cars as correctly detecting what objects are in front of the car and where they are located are critical inputs to t...

Evaluating Text Output in NLP: BLEU at your own risk

One question I get fairly often from folks who are just getting into NLP is how to evaluate systems when the output of that system is text, rather than some sort of classification of the input text. These types of problems, where you put some text into your model and get some other text out of it, a...