Social Foundations of Computation Members Publications

Evaluating Language Models as Risk Scores

Folktexts
Folktexts encodes tabular data rows as prompts, runs inference on a model, and decodes a risk score from the language model. This makes it possible to use LLMs on tabular data in much the same way that sklearn would do it.

Members

Thumb ticker sm andre innsbruck face 2
Social Foundations of Computation
  • Doctoral Researcher
Thumb ticker sm 20241104 hardt moritz 12 cleaned kleiner
Social Foundations of Computation
  • Director
Thumb ticker sm portrait celestine
Algorithms and Society
Hector Endowed Fellow of the ELLIS Institute

Publications

Social Foundations of Computation Algorithms and Society Conference Paper Evaluating Language Models as Risk Scores Cruz, A. F., Hardt, M., Mendler-Dünner, C. Advances in Neural Information Processing Systems 37 (NeurIPS 2024), The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS), December 2024 (Published) ArXiv Code URL BibTeX