Publications

Philosophy of AI

Thormann, T.F. (Under Review). Defining Strategic AI Deception Through Generalized Propositional Attitudes. Download PDF

Dung, L., Thormann, T.F. (Under Review). Mask or Mind? Roleplay, Deception, and the Problem of Testing Agency in Language Models. Download PDF

Thormann, T.F., Michelini, M. (Under Review): An Evolutionary Model of Morality as Externalization. Download PDF

Thormann, T.F., Gilles, M. (Under Review): Probing the Limits of the Lie Detector Approach to LLM Deception. https://doi.org/10.48550/arXiv.2603.10003. Download PDF