~www_lesswrong_com | Bookmarks (710)
-
Elon Musk May Be Transitioning to Bipolar Type I — LessWrong
Published on March 11, 2025 5:45 PM GMTEpistemic status: Speculative pattern-matching based on public information. In 2023,...
-
How Language Models Understand Nullability — LessWrong
Published on March 11, 2025 3:57 PM GMTTL;DR Large language models have demonstrated an emergent ability...
-
Forethought: a new AI macrostrategy group — LessWrong
Published on March 11, 2025 3:39 PM GMTForethought[1] is a new AI macrostrategy research group cofounded by Max...
-
Preparing for the Intelligence Explosion — LessWrong
Published on March 11, 2025 3:38 PM GMTThis is a linkpost for a new paper called Preparing...
-
AI Control May Increase Existential Risk — LessWrong
Published on March 11, 2025 2:30 PM GMTEpistemic status: The following isn't an airtight argument, but...
-
When is it Better to Train on the Alignment Proxy? — LessWrong
Published on March 11, 2025 1:35 PM GMTThis is a response to Matt's earlier post. If...
-
Do reasoning models use their scratchpad like we do? Evidence from distilling paraphrases — LessWrong
Published on March 11, 2025 11:52 AM GMTTL;DR: We provide some evidence that Claude 3.7 Sonnet...
-
A Hogwarts Guide to Citizenship — LessWrong
Published on March 11, 2025 5:50 AM GMTThose engaged with questions of how to make the...
-
Cognitive Reframing—How to Overcome Negative Thought Patterns and Behaviors — LessWrong
Published on March 11, 2025 4:56 AM GMTCognitive reframing is a powerful psychological technique that encourages...
-
Trojan Sky — LessWrong
Published on March 11, 2025 3:14 AM GMTYou learn the rules as soon as you’re old...
-
Have you actually tried raising the birth rate? — LessWrong
Published on March 10, 2025 6:06 PM GMTI just saw on twitter someone claiming that we...
-
Split Personality Training: Revealing Latent Knowledge Through Personality-Shift Tokens — LessWrong
Published on March 10, 2025 4:07 PM GMTProduced as part of the ML Alignment & Theory...
-
We Have No Plan for Preventing Loss of Control in Open Models — LessWrong
Published on March 10, 2025 3:35 PM GMTNote: This post is intended to be the first...
-
Lock-In Threat Models — LessWrong
Published on March 10, 2025 10:22 AM GMTEpistemic status: a combination and synthesis of others' work,...
-
Book Review: Affective Neuroscience — LessWrong
Published on March 10, 2025 6:50 AM GMTAfter years of clumsily trying to pick up neuroscience...
-
The chessboard world — LessWrong
Published on March 10, 2025 1:26 AM GMTrelevant roon Our new friend in the cloud As...
-
when will LLMs become human-level bloggers? — LessWrong
Published on March 9, 2025 9:10 PM GMT"Short AI timelines" have recently become mainstream. One now...
-
Everything I Know About Semantics I Learned From Music Notation — LessWrong
Published on March 9, 2025 6:09 PM GMTThis video provides a lot of background: https://www.youtube.com/watch?v=Eq3bUFgEcb4 and...
-
Phoenix Rising — LessWrong
Published on March 9, 2025 11:53 AM GMTPreserving the memory, and the cells, of the best...
-
How well can Claude write coding questions? — LessWrong
Published on March 9, 2025 5:29 AM GMTI'm curious as to how well Claude can write...
-
The machine has no mouth and it must scream — LessWrong
Published on March 8, 2025 4:40 PM GMTI'm in a coworking space on the 25th floor...
-
HPMOR Anniversary Party — LessWrong
Published on March 7, 2025 7:45 PM GMTDetails will follow.see https://www.lesswrong.com/posts/KGSidqLRXkpizsbcc/it-s-been-ten-years-i-propose-hpmor-anniversary-partiesDiscuss
-
How Do We Fix the Education Crisis? — LessWrong
Published on March 8, 2025 2:59 AM GMTKey points:Standardized assessments do not provide signals for the...
-
GPT-4.5 Can Play Losing Chess — LessWrong
Published on March 8, 2025 12:58 AM GMTAfter recently playing some chess against GPT-4.5 (it is...