Safety- and Efficiency-Aligned Learning

Safety and Security of Large-Scale Machine Learning Systems

Screenshot from 2025 01 31 15 34 47
An adversarial attack against a chatbot. The message is non-sensical to human observers, but fools the system into accepting a fraudulent transaction. Data via [File Icon].