I migliori LessWrong Curated podcast (2025)

1
“Negative Results for SAEs On Downstream Tasks and Deprioritising SAE Research (GDM Mech Interp Team Progress Update #2)” by Neel Nanda, lewis smith, Senthooran Rajamanoharan, Arthur Conmy, Callum McDougall ... 57:32

4d ago57:32

57:32

Audio note: this article contains 31 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description. Lewis Smith*, Sen Rajamanoharan*, Arthur Conmy, Callum McDougall, Janos Kramar, Tom Lieberum, Rohin Shah, Neel Nanda * = equal contribution The following piece is a list of snippet…

1
[Linkpost] “Playing in the Creek” by Hastings 4:12

5d ago4:12

4:12

This is a link post. When I was a really small kid, one of my favorite activities was to try and dam up the creek in my backyard. I would carefully move rocks into high walls, pile up leaves, or try patching the holes with sand. The goal was just to see how high I could get the lake, knowing that if I plugged every hole, eventually the water would …

1
“Thoughts on AI 2027” by Max Harms 40:27

5d ago40:27

40:27

This is part of the MIRI Single Author Series. Pieces in this series represent the beliefs and opinions of their named authors, and do not claim to speak for all of MIRI. Okay, I'm annoyed at people covering AI 2027 burying the lede, so I'm going to try not to do that. The authors predict a strong chance that all humans will be (effectively) dead i…

1
“Short Timelines don’t Devalue Long Horizon Research” by Vladimir_Nesov 2:10

6d ago2:10

2:10

Short AI takeoff timelines seem to leave no time for some lines of alignment research to become impactful. But any research rebalances the mix of currently legible research directions that could be handed off to AI-assisted alignment researchers or early autonomous AI researchers whenever they show up. So even hopelessly incomplete research agendas…

1
“Alignment Faking Revisited: Improved Classifiers and Open Source Extensions” by John Hughes, abhayesian, Akbir Khan, Fabien Roger 41:04

7d ago41:04

41:04

In this post, we present a replication and extension of an alignment faking model organism: Replication: We replicate the alignment faking (AF) paper and release our code. Classifier Improvements: We significantly improve the precision and recall of the AF classifier. We release a dataset of ~100 human-labelled examples of AF for which our classifi…

1
“METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman 11:09

8d ago11:09

11:09

Summary: We propose measuring AI performance in terms of the length of tasks AI agents can complete. We show that this metric has been consistently exponentially increasing over the past 6 years, with a doubling time of around 7 months. Extrapolating this trend predicts that, in under five years, we will see AI agents that can independently complet…

1
“Why Have Sentence Lengths Decreased?” by Arjun Panickssery 9:08

11d ago9:08

9:08

“In the loveliest town of all, where the houses were white and high and the elms trees were green and higher than the houses, where the front yards were wide and pleasant and the back yards were bushy and worth finding out about, where the streets sloped down to the stream and the stream flowed quietly under the bridge, where the lawns ended in orc…

1
“AI 2027: What Superintelligence Looks Like” by Daniel Kokotajlo, Thomas Larsen, elifland, Scott Alexander, Jonas V, romeo 54:30

12d ago54:30

54:30

In 2021 I wrote what became my most popular blog post: What 2026 Looks Like. I intended to keep writing predictions all the way to AGI and beyond, but chickened out and just published up till 2026. Well, it's finally time. I'm back, and this time I have a team with me: the AI Futures Project. We've written a concrete scenario of what we think the f…

1
“OpenAI #12: Battle of the Board Redux” by Zvi 18:01

13d ago18:01

18:01

Back when the OpenAI board attempted and failed to fire Sam Altman, we faced a highly hostile information environment. The battle was fought largely through control of the public narrative, and the above was my attempt to put together what happened.My conclusion, which I still believe, was that Sam Altman had engaged in a variety of unacceptable co…

1
“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit 27:39

13d ago27:39

27:39

Epistemic status: This post aims at an ambitious target: improving intuitive understanding directly. The model for why this is worth trying is that I believe we are more bottlenecked by people having good intuitions guiding their research than, for example, by the ability of people to code and run evals. Quite a few ideas in AI safety implicitly us…

1
“You will crash your car in front of my house within the next week” by Richard Korzekwa 1:52

14d ago1:52

1:52

I'm not writing this to alarm anyone, but it would be irresponsible not to report on something this important. On current trends, every car will be crashed in front of my house within the next week. Here's the data: Until today, only two cars had crashed in front of my house, several months apart, during the 15 months I have lived here. But a few h…

1
“My ‘infohazards small working group’ Signal Chat may have encountered minor leaks” by Linch 10:33

14d ago10:33

10:33

Remember: There is no such thing as a pink elephant. Recently, I was made aware that my “infohazards small working group” Signal chat, an informal coordination venue where we have frank discussions about infohazards and why it will be bad if specific hazards were leaked to the press or public, accidentally was shared with a deceitful and discredite…

1
“Leverage, Exit Costs, and Anger: Re-examining Why We Explode at Home, Not at Work” by at_the_zoo 6:16

14d ago6:16

6:16

Let's cut through the comforting narratives and examine a common behavioral pattern with a sharper lens: the stark difference between how anger is managed in professional settings versus domestic ones. Many individuals can navigate challenging workplace interactions with remarkable restraint, only to unleash significant anger or frustration at home…

1
“PauseAI and E/Acc Should Switch Sides” by WillPetillo 3:31

14d ago3:31

3:31

In the debate over AI development, two movements stand as opposites: PauseAI calls for slowing down AI progress, and e/acc (effective accelerationism) calls for rapid advancement. But what if both sides are working against their own stated interests? What if the most rational strategy for each would be to adopt the other's tactics—if not their ulti…

1
“VDT: a solution to decision theory” by L Rudolf L 8:58

14d ago8:58

8:58

Introduction Decision theory is about how to behave rationally under conditions of uncertainty, especially if this uncertainty involves being acausally blackmailed and/or gaslit by alien superintelligent basilisks. Decision theory has found numerous practical applications, including proving the existence of God and generating endless LessWrong comm…

1
“LessWrong has been acquired by EA” by habryka 1:33

14d ago1:33

1:33

Dear LessWrong community, It is with a sense of... considerable cognitive dissonance that I announce a significant development regarding the future trajectory of LessWrong. After extensive internal deliberation, modeling of potential futures, projections of financial runways, and what I can only describe as a series of profoundly unexpected coordin…

1
“We’re not prepared for an AI market crash” by Remmelt 3:46

14d ago3:46

3:46

Our community is not prepared for an AI crash. We're good at tracking new capability developments, but not as much the company financials. Currently, both OpenAI and Anthropic are losing $5 billion+ a year, while under threat of losing users to cheap LLMs. A crash will weaken the labs. Funding-deprived and distracted, execs struggle to counter coor…

1
“Conceptual Rounding Errors” by Jan_Kulveit 6:21

18d ago6:21

6:21

Epistemic status: Reasonably confident in the basic mechanism. Have you noticed that you keep encountering the same ideas over and over? You read another post, and someone helpfully points out it's just old Paul's idea again. Or Eliezer's idea. Not much progress here, move along. Or perhaps you've been on the other side: excitedly telling a friend …

1
“Tracing the Thoughts of a Large Language Model” by Adam Jermyn 22:18

19d ago22:18

22:18

[This is our blog post on the papers, which can be found at https://transformer-circuits.pub/2025/attribution-graphs/biology.html and https://transformer-circuits.pub/2025/attribution-graphs/methods.html.] Language models like Claude aren't programmed directly by humans—instead, they‘re trained on large amounts of data. During that training process…

1
“Recent AI model progress feels mostly like bullshit” by lc 14:29

21d ago14:29

14:29

About nine months ago, I and three friends decided that AI had gotten good enough to monitor large codebases autonomously for security problems. We started a company around this, trying to leverage the latest AI models to create a tool that could replace at least a good chunk of the value of human pentesters. We have been working on this project si…

1
“AI for AI safety” by Joe Carlsmith 34:07

21d ago34:07

34:07

(Audio version here (read by the author), or search for "Joe Carlsmith Audio" on your podcast app. This is the fourth essay in a series that I’m calling “How do we solve the alignment problem?”. I’m hoping that the individual essays can be read fairly well on their own, but see this introduction for a summary of the essays that have been released t…

1
“Policy for LLM Writing on LessWrong” by jimrandomh 4:17

22d ago4:17

4:17

LessWrong has been receiving an increasing number of posts and contents that look like they might be LLM-written or partially-LLM-written, so we're adopting a policy. This could be changed based on feedback. Humans Using AI as Writing or Research Assistants Prompting a language model to write an essay and copy-pasting the result will not typically …

1
“Will Jesus Christ return in an election year?” by Eric Neyman 7:48

22d ago7:48

7:48

Thanks to Jesse Richardson for discussion. Polymarket asks: will Jesus Christ return in 2025? In the three days since the market opened, traders have wagered over $100,000 on this question. The market traded as high as 5%, and is now stably trading at 3%. Right now, if you wanted to, you could place a bet that Jesus Christ will not return this year…

1
“Good Research Takes are Not Sufficient for Good Strategic Takes” by Neel Nanda 6:58

24d ago6:58

6:58

TL;DR Having a good research track record is some evidence of good big-picture takes, but it's weak evidence. Strategic thinking is hard, and requires different skills. But people often conflate these skills, leading to excessive deference to researchers in the field, without evidence that that person is good at strategic thinking specifically. Int…

Podcast che vale la pena ascoltare

Podcast di LessWrong Curated

Podcast che vale la pena ascoltare

1
LessWrong (Curated & Popular)

LessWrong

1
“Negative Results for SAEs On Downstream Tasks and Deprioritising SAE Research (GDM Mech Interp Team Progress Update #2)” by Neel Nanda, lewis smith, Senthooran Rajamanoharan, Arthur Conmy, Callum McDougall ... 57:32

1
[Linkpost] “Playing in the Creek” by Hastings 4:12

1
“Thoughts on AI 2027” by Max Harms 40:27

1
“Short Timelines don’t Devalue Long Horizon Research” by Vladimir_Nesov 2:10

1
“Alignment Faking Revisited: Improved Classifiers and Open Source Extensions” by John Hughes, abhayesian, Akbir Khan, Fabien Roger 41:04

1
“METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman 11:09

1
“Why Have Sentence Lengths Decreased?” by Arjun Panickssery 9:08

1
“AI 2027: What Superintelligence Looks Like” by Daniel Kokotajlo, Thomas Larsen, elifland, Scott Alexander, Jonas V, romeo 54:30

1
“OpenAI #12: Battle of the Board Redux” by Zvi 18:01

1
“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit 27:39

1
“You will crash your car in front of my house within the next week” by Richard Korzekwa 1:52

1
“My ‘infohazards small working group’ Signal Chat may have encountered minor leaks” by Linch 10:33

1
“Leverage, Exit Costs, and Anger: Re-examining Why We Explode at Home, Not at Work” by at_the_zoo 6:16

1
“PauseAI and E/Acc Should Switch Sides” by WillPetillo 3:31

1
“VDT: a solution to decision theory” by L Rudolf L 8:58

1
“LessWrong has been acquired by EA” by habryka 1:33

1
“We’re not prepared for an AI market crash” by Remmelt 3:46

1
“Conceptual Rounding Errors” by Jan_Kulveit 6:21

1
“Tracing the Thoughts of a Large Language Model” by Adam Jermyn 22:18

1
“Recent AI model progress feels mostly like bullshit” by lc 14:29

1
“AI for AI safety” by Joe Carlsmith 34:07

1
“Policy for LLM Writing on LessWrong” by jimrandomh 4:17

1
“Will Jesus Christ return in an election year?” by Eric Neyman 7:48

1
“Good Research Takes are Not Sufficient for Good Strategic Takes” by Neel Nanda 6:58

Guida rapida