Elon Musk May Be Transitioning to Bipolar Type I — LessWrong
Published on March 11, 2025 5:45 PM GMTEpistemic status: Speculative pattern-matching based on public information. In 2023,...
How Language Models Understand Nullability — LessWrong
Published on March 11, 2025 3:57 PM GMTTL;DR Large language models have demonstrated an emergent ability...
Forethought: a new AI macrostrategy group — LessWrong
Published on March 11, 2025 3:39 PM GMTForethought[1] is a new AI macrostrategy research group cofounded by Max...
Preparing for the Intelligence Explosion — LessWrong
Published on March 11, 2025 3:38 PM GMTThis is a linkpost for a new paper called Preparing...
AI Control May Increase Existential Risk — LessWrong
Published on March 11, 2025 2:30 PM GMTEpistemic status: The following isn't an airtight argument, but...
When is it Better to Train on the Alignment Proxy? — LessWrong
Published on March 11, 2025 1:35 PM GMTThis is a response to Matt's earlier post. If...
Do reasoning models use their scratchpad like we do? Evidence from distilling paraphrases — LessWrong
Published on March 11, 2025 11:52 AM GMTTL;DR: We provide some evidence that Claude 3.7 Sonnet...
A Hogwarts Guide to Citizenship — LessWrong
Published on March 11, 2025 5:50 AM GMTThose engaged with questions of how to make the...
Cognitive Reframing—How to Overcome Negative Thought Patterns and Behaviors — LessWrong
Published on March 11, 2025 4:56 AM GMTCognitive reframing is a powerful psychological technique that encourages...
Trojan Sky — LessWrong
Published on March 11, 2025 3:14 AM GMTYou learn the rules as soon as you’re old...
Have you actually tried raising the birth rate? — LessWrong
Published on March 10, 2025 6:06 PM GMTI just saw on twitter someone claiming that we...
Split Personality Training: Revealing Latent Knowledge Through Personality-Shift Tokens — LessWrong
Published on March 10, 2025 4:07 PM GMTProduced as part of the ML Alignment & Theory...
We Have No Plan for Preventing Loss of Control in Open Models — LessWrong
Published on March 10, 2025 3:35 PM GMTNote: This post is intended to be the first...
Lock-In Threat Models — LessWrong
Published on March 10, 2025 10:22 AM GMTEpistemic status: a combination and synthesis of others' work,...
Book Review: Affective Neuroscience — LessWrong
Published on March 10, 2025 6:50 AM GMTAfter years of clumsily trying to pick up neuroscience...
The chessboard world — LessWrong
Published on March 10, 2025 1:26 AM GMTrelevant roon Our new friend in the cloud As...
when will LLMs become human-level bloggers? — LessWrong
Published on March 9, 2025 9:10 PM GMT"Short AI timelines" have recently become mainstream. One now...
Everything I Know About Semantics I Learned From Music Notation — LessWrong
Published on March 9, 2025 6:09 PM GMTThis video provides a lot of background: https://www.youtube.com/watch?v=Eq3bUFgEcb4 and...
Phoenix Rising — LessWrong
Published on March 9, 2025 11:53 AM GMTPreserving the memory, and the cells, of the best...
How well can Claude write coding questions? — LessWrong
Published on March 9, 2025 5:29 AM GMTI'm curious as to how well Claude can write...
The machine has no mouth and it must scream — LessWrong
Published on March 8, 2025 4:40 PM GMTI'm in a coworking space on the 25th floor...
HPMOR Anniversary Party — LessWrong
Published on March 7, 2025 7:45 PM GMTDetails will follow.see https://www.lesswrong.com/posts/KGSidqLRXkpizsbcc/it-s-been-ten-years-i-propose-hpmor-anniversary-partiesDiscuss
How Do We Fix the Education Crisis? — LessWrong
Published on March 8, 2025 2:59 AM GMTKey points:Standardized assessments do not provide signals for the...
GPT-4.5 Can Play Losing Chess — LessWrong
Published on March 8, 2025 12:58 AM GMTAfter recently playing some chess against GPT-4.5 (it is...
#1 — LessWrong
Published on March 7, 2025 8:09 PM GMTTheir comment was a paranoid and conspiratorial thought process...
are "almost-p-zombies" possible? — LessWrong
Published on March 7, 2025 10:58 PM GMTIt's probably not possible to have a twin of...
Sufficiently Decentralized Intelligence is Indistinguishable from Synchronicity — LessWrong
Published on March 7, 2025 9:50 PM GMTLoosely inspired by a submission to a hackathon on...
Amplifying the Computational No-Coincidence Conjecture — LessWrong
Published on March 7, 2025 9:29 PM GMTIntroductionRecently, the Computational No-Coincidende Conjecture[1] was proposed, presented as an...
[ages 16-21] Apply to PAIR & ESPR, Summer AI & Rationality Programs — LessWrong
Published on March 7, 2025 7:49 PM GMTTL;DR: PAIR on AI & Reasoning. ESPR on Everything,...
Forecasting newsletter #3/2025: Long march through the institutions — LessWrong
Published on March 7, 2025 6:17 PM GMTHighlights:Manifold ending (a) cash markets, Kalshi slapped by regulators...
Childhood and Education #9: School is Hell — LessWrong
Published on March 7, 2025 12:40 PM GMTThis complication of tales from the world of school...
The Insanity Detector and Writing — LessWrong
Published on March 7, 2025 11:19 AM GMTA clinically insane person is detectable as such. Talking...
So how well is Claude playing Pokémon? — LessWrong
Published on March 7, 2025 5:54 AM GMTBackground: After the release of Claude 3.7 Sonnet,[1] an Anthropic...
Are recent LLMs better at reasoning or better at memorizing? — LessWrong
Published on March 7, 2025 2:44 AM GMTTLDR; By carefully designing a reasoning benchmark that counteracts...
The Dead Planet Theory — LessWrong
Published on March 7, 2025 2:43 AM GMTHi, this is my first post on LessWrong but...
Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan — LessWrong
Published on March 6, 2025 10:38 PM GMTNote: This is an automated crosspost from Anthropic. The...
How Can Average People Contribute to AI Safety? — LessWrong
Published on March 6, 2025 10:50 PM GMTIntroductionBy now you've probably read about how AI and...
Lots of brief thoughts on Software Engineering — LessWrong
Published on March 6, 2025 7:50 PM GMTI have lots of thoughts about software engineering, some...
What the Headlines Miss About the Latest Decision in the Musk vs. OpenAI Lawsuit — LessWrong
Published on March 6, 2025 7:49 PM GMTDiscuss
AISN #49: Superintelligence Strategy — LessWrong
Published on March 6, 2025 5:46 PM GMTWelcome to the AI Safety Newsletter by the Center...
Anthropic Decision Theory and the Strength of Life-Filled Futures — LessWrong
Published on March 6, 2025 5:23 PM GMTThis post was written by prompting chatgptIntroductionDiscussions of anthropic...
Decision-Relevance of worlds and ADT implementations — LessWrong
Published on March 6, 2025 4:57 PM GMTCrossposted on the EA Forum.Lots of effort has been...
AI #106: Not so Fast — LessWrong
Published on March 6, 2025 3:40 PM GMTThis was GPT-4.5 week. That model is not so...
Can a finite physical device be Turing equivalent? — LessWrong
Published on March 6, 2025 3:02 PM GMTHere's some very useful quotes by the article, and...
We should start looking for scheming "in the wild" — LessWrong
Published on March 6, 2025 1:49 PM GMTTLDR: AI models are now capable enough that we might...
Publish your genomic data — LessWrong
Published on March 6, 2025 12:39 PM GMTPublish your genomic data to the public domain as...
Which meat to eat: CO₂ vs Animal suffering — LessWrong
Published on March 6, 2025 12:37 PM GMTAnimal agriculture generates an ungodly amount of animal suffering...
Musings on Scenario Forecasting and AI — LessWrong
Published on March 6, 2025 12:28 PM GMTI have yet to write detailed scenarios for AI...
What is Lock-In? — LessWrong
Published on March 6, 2025 11:09 AM GMTEpistemic status: a combination and synthesis of others' work,...
A Bear Case: My Predictions Regarding AI Progress — LessWrong
Published on March 5, 2025 4:41 PM GMTThis isn't really a "timeline", as such – I...