The ugly duckling theorem is an argument showing that classification is not really possible without some sort of bias.
The theorem is named after Hans Christian Andersen's 1843 story "The Ugly Duckling", because it shows that a duckling is just as similar to a swan as two swans are to each other. It was derived by Satosi Watanabe, in 1969.[1]: 376–377
Trait | Tennis ball | Basketball | Football |
---|---|---|---|
Small | Yes | No | No |
Brown | No | Yes | Yes |
Round | Yes | Yes | No |
Suppose we have three objects: a tennis ball, an American football, and a basketball. These three balls have three "obvious" properties: their size, colour, and shape.
According to these properties, the "ugly duckling" is the tennis ball, because the other two objects share the categories "brown" and "not small".
The ugly duckling theorem relies on the ability to find additional properties from those which are provided.
Trait | Tennis ball | Basketball | Football |
---|---|---|---|
Small and brown | No | No | No |
Small | Yes | No | No |
Brown and round | No | Yes | No |
Brown and not round | No | No | Yes |
Round | Yes | Yes | No |
Small or not round | Yes | No | Yes |
Brown | No | Yes | Yes |
Small or brown | Yes | Yes | Yes |
The table to the right shows a selection of derived properties. In fact, these derived properties include every possible property up to equivalence: each one corresponds to one possible grouping (subset) of the original three balls.
There is no logical reason that the "original" properties are more important than the "derived" ones. Because the derived properties are found using fundamental logic, they are equivalent to the original properties. This list of all logical properties of the items has every item differ in exactly four places - when all logical properties are considered, all of the objects are just as similar as they are different.
The idea that we prioritize the three "obvious" properties exists outside the logic, so it is called arbitrary or biased. By applying the rules of logic alone, we get the result that all objects are equally different; there is no "ugly duckling".