Gods and Robots In this episode of the podcast we shake things up! Neil is on the guest side of the table with his partner Rabbi Laura Janner-Klausner to discuss their upcoming project Gods and Robots. Katherine is joined on the host side by friend of the show professor Michael Littman. See... See More Episodes arXiv Whitepapers AI Agents That Matter AI agents are an exciting new research direction, and agent development is driven by benchmarks. Our analysis of current agent benchmarks and evaluation practices reveals several shortcomings that hinder their usefulness in real-world applications. First, there is a narrow focus on accuracy without... Allocation Requires Prediction Only if Inequality Is Low Algorithmic predictions are emerging as a promising solution concept for efficiently allocating societal resources. Fueling their use is an underlying assumption that such systems are necessary to identify individuals for interventions. We propose a principled framework for assessing this assumption... NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms The performance of Large Language Models (LLMs) degrades from the temporal drift between data used for model training and newer text seen during inference. One understudied avenue of language change causing data drift is the emergence of neologisms -- new word forms -- over time. We create a diverse... More featured content News Articles Understanding AI outputs: study shows pro-western cultural bias in the way AI decisions are explained How to spot fake online reviews (with a little help from AI) Stay in the loop. Subscribe to our newsletter for a weekly update on the latest podcast, news, events, and jobs postings. E-mail Leave this field blank Supermarket facial recognition failure: why automated systems must put the human factor first From shrimp Jesus to fake self-portraits, AI-generated images have become the latest form of social media spam Building fairness into AI is crucial – and hard to get right Beware businesses claiming to use trailblazing technology. They might just be ‘AI washing’ to snare investors Generative AI could leave users holding the bag for copyright violations Something felt ‘off’ – how AI messed with our human research, and what we learned Face recognition technology follows a long analog history of surveillance and control based on identifying physical features Emotion-tracking AI on the job: Workers fear being watched – and misunderstood More news