Gods and Robots

In this episode of the podcast we shake things up! Neil is on the guest side of the table with his partner Rabbi Laura Janner-Klausner to discuss their upcoming project Gods and Robots. Katherine is joined on the host side by friend of the show professor Michael Littman. See...

arXiv Whitepapers

Language model developers should report train-test overlap
Language models are extensively evaluated, but correctly interpreting evaluation results requires knowledge of train-test overlap which refers to the extent to which the language model is trained on the very data it is being tested on. The public currently lacks adequate information about train-test...

When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1
In "Embers of Autoregression" (McCoy et al., 2023), we showed that several large language models (LLMs) have some important limitations that are attributable to their origins in next-word prediction. Here we investigate whether these issues persist with o1, a new system from OpenAI that differs from...

Confident Teacher, Confident Student? A Novel User Study Design for Investigating the Didactic Potential of Explanations and their Impact on Uncertainty
Evaluating the quality of explanations in Explainable Artificial Intelligence (XAI) is to this day a challenging problem, with ongoing debate in the research community. While some advocate for establishing standardized offline metrics, others emphasize the importance of human-in-the-loop (HIL)...

News Articles

Generative AI could leave users holding the bag for copyright violations
Something felt ‘off’ – how AI messed with our human research, and what we learned
Face recognition technology follows a long analog history of surveillance and control based on identifying physical features
Emotion-tracking AI on the job: Workers fear being watched – and misunderstood
Artificial intelligence needs to be trained on culturally diverse datasets to avoid bias
AI in the developing world: how ‘tiny machine learning’ can have a big impact
Why AI can’t replace air traffic controllers
Taylor Swift deepfakes: new technologies have long been weaponised against women. The solution involves us all
Study: Smart devices’ ambient light sensors pose imaging privacy risk
AI companies are merging or collaborating to even out the gap in access to vital datasets