18. Case Study: How to Weigh a Donkey#

Donkeys play important roles in rural Kenya. People need them to move crops and water, plow fields, and for personal transportation. When a donkey gets sick, the veterinarian needs to figure out how much the donkey weighs in order to prescribe the right amount of medicine. But, many vets in rural Kenya don’t have access to a scale to weigh the donkey so they need to guess how much the donkey’s weight. Too little medicine and an infection can re-emerge; too much medicine can cause a harmful overdose. There are over 1.8 million donkeys in Kenya, so it’s important to have a simple, accurate way to estimate the weight of a donkey.

In this case study, we follow the work of Kate Milner and Jonathan Rougier to create a model that veterinarians in the Kenyan countryside can use to make accurate estimates of a donkey’s weight. As usual, we walk through the steps of the data science lifecycle, but this time our work departs from the basics covered so far in this book. You can think of this case study as an opportunity to reflect on many of the core principles of working with data and understand how they can be extended to address the context of the situation. We directly evaluate sources of measurement error, design a special loss function that reflects the concern about an overdose, build a model while keeping applicability utmost in mind, and evaluate model predictions using special criterion that are relative to the donkey’s size.

We begin with the scope of the data.