
Evaluating LLMs as risk scores

Survey evaluation
Accuracy and calibration of LLMs on human prediction tasks

Members

André F. Cruz
Social Foundations of Computation
  • Doctoral Researcher
Moritz Hardt
Social Foundations of Computation
  • Director
Celestine Mendler-Dünner
Algorithms and Society
  • Hector Endowed Fellow of the ELLIS Institute

Publications

Evaluating Language Models as Risk Scores
Cruz, A. F., Hardt, M., Mendler-Dünner, C.
Advances in Neural Information Processing Systems 37 (NeurIPS 2024), The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS), December 2024
Conference Paper (Published)
Departments: Social Foundations of Computation; Algorithms and Society