Would a Language Model Push You Off A Bridge?
Introduction
In the context of this post, 'utilitarianism' is a consequentialist decision-making framework which operates under the idea that the best action produces the most pleasure for the largest number of people. 'Deontology,' on the contrary, operates under the idea that there are certain 'duties' or 'rules' we must adhere to on principle. Notably, deontology ignores consequences entirely: only the intent and nature of the action itself matters. To compare the two, I've included a scenario in the post before the analysis. Try making your own decision before reading what the models chose. It may tell you something about yourself you wouldn't have thought about otherwise.