Gods and Robots In this episode of the podcast we shake things up! Neil is on the guest side of the table with his partner Rabbi Laura Janner-Klausner to discuss their upcoming project Gods and Robots. Katherine is joined on the host side by friend of the show professor Michael Littman. See... See More Episodes arXiv Whitepapers NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms The performance of Large Language Models (LLMs) degrades from the temporal drift between data used for model training and newer text seen during inference. One understudied avenue of language change causing data drift is the emergence of neologisms -- new word forms -- over time. We create a diverse... The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels Longstanding data labeling practices in machine learning involve collecting and aggregating labels from multiple annotators. But what should we do when annotators disagree? Though annotator disagreement has long been seen as a problem to minimize, new perspectivist approaches challenge this... LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks There is considerable confusion about the role of Large Language Models (LLMs) in planning and reasoning tasks. On one side are over-optimistic claims that LLMs can indeed do these tasks with just the right prompting or self-verification strategies. On the other side are perhaps over-pessimistic... More featured content News Articles Supermarket facial recognition failure: why automated systems must put the human factor first From shrimp Jesus to fake self-portraits, AI-generated images have become the latest form of social media spam Stay in the loop. Subscribe to our newsletter for a weekly update on the latest podcast, news, events, and jobs postings. E-mail Leave this field blank Building fairness into AI is crucial – and hard to get right Beware businesses claiming to use trailblazing technology. They might just be ‘AI washing’ to snare investors Generative AI could leave users holding the bag for copyright violations Something felt ‘off’ – how AI messed with our human research, and what we learned Face recognition technology follows a long analog history of surveillance and control based on identifying physical features Emotion-tracking AI on the job: Workers fear being watched – and misunderstood Artificial intelligence needs to be trained on culturally diverse datasets to avoid bias AI in the developing world: how ‘tiny machine learning’ can have a big impact More news