{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Overview\n", "## Motivation\n", "\n", "Usually when faced with prediction problems involving ordered labels (i.e. low, medium, high) and tabular data, data scientists turn to regular multinomial classifiers from the gradient boosted tree family of models, because of their ease of use, speed of fitting, and good performance. Parametric ordinal models have been around for a while, but they have not been popular because of their poor performance compared to the gradient boosted models, especially for larger datasets.\n", "\n", "Although classifiers can predict ordinal labels adequately, they require building as many classifiers as there are labels to predict. This approach, however, leads to slower training times, and confusing feature interpretations. For example, a feature which is positively associated with the increasing order of the label set (i.e. as the feature's value grows, so do the probabilities of the higher ordered labels), will va a positive association with the highest ordered label, negative with the lowest ordered, and a \"concave\" association with the middle ones.\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
| \n",
" | \n",
" \n",
" | \n",
" \n",
" | \n",
"
|---|