Bookmarks (710)

  • screenshot

    Elon Musk May Be Transitioning to Bipolar Type I — LessWrong

    Published on March 11, 2025 5:45 PM GMTEpistemic status: Speculative pattern-matching based on public information. In 2023,...

  • screenshot

    How Language Models Understand Nullability — LessWrong

    Published on March 11, 2025 3:57 PM GMTTL;DR Large language models have demonstrated an emergent ability...

  • screenshot

    Forethought: a new AI macrostrategy group — LessWrong

    Published on March 11, 2025 3:39 PM GMTForethought[1] is a new AI macrostrategy research group cofounded by Max...

  • screenshot

    Preparing for the Intelligence Explosion — LessWrong

    Published on March 11, 2025 3:38 PM GMTThis is a linkpost for a new paper called Preparing...

  • screenshot

    AI Control May Increase Existential Risk — LessWrong

    Published on March 11, 2025 2:30 PM GMTEpistemic status: The following isn't an airtight argument, but...

  • screenshot

    When is it Better to Train on the Alignment Proxy? — LessWrong

    Published on March 11, 2025 1:35 PM GMTThis is a response to Matt's earlier post. If...

  • screenshot

    Do reasoning models use their scratchpad like we do? Evidence from distilling paraphrases — LessWrong

    Published on March 11, 2025 11:52 AM GMTTL;DR: We provide some evidence that Claude 3.7 Sonnet...

  • screenshot

    A Hogwarts Guide to Citizenship — LessWrong

    Published on March 11, 2025 5:50 AM GMTThose engaged with questions of how to make the...

  • screenshot

    Cognitive Reframing—How to Overcome Negative Thought Patterns and Behaviors — LessWrong

    Published on March 11, 2025 4:56 AM GMTCognitive reframing is a powerful psychological technique that encourages...

  • screenshot

    Trojan Sky — LessWrong

    Published on March 11, 2025 3:14 AM GMTYou learn the rules as soon as you’re old...

  • screenshot

    Have you actually tried raising the birth rate? — LessWrong

    Published on March 10, 2025 6:06 PM GMTI just saw on twitter someone claiming that we...

  • screenshot

    Split Personality Training: Revealing Latent Knowledge Through Personality-Shift Tokens — LessWrong

    Published on March 10, 2025 4:07 PM GMTProduced as part of the ML Alignment & Theory...

  • screenshot

    We Have No Plan for Preventing Loss of Control in Open Models — LessWrong

    Published on March 10, 2025 3:35 PM GMTNote: This post is intended to be the first...

  • screenshot

    Lock-In Threat Models — LessWrong

    Published on March 10, 2025 10:22 AM GMTEpistemic status: a combination and synthesis of others' work,...

  • screenshot

    Book Review: Affective Neuroscience — LessWrong

    Published on March 10, 2025 6:50 AM GMTAfter years of clumsily trying to pick up neuroscience...

  • screenshot

    The chessboard world — LessWrong

    Published on March 10, 2025 1:26 AM GMTrelevant roon Our new friend in the cloud As...

  • screenshot

    when will LLMs become human-level bloggers? — LessWrong

    Published on March 9, 2025 9:10 PM GMT"Short AI timelines" have recently become mainstream.  One now...

  • screenshot

    Everything I Know About Semantics I Learned From Music Notation — LessWrong

    Published on March 9, 2025 6:09 PM GMTThis video provides a lot of background: https://www.youtube.com/watch?v=Eq3bUFgEcb4 and...

  • screenshot

    Phoenix Rising — LessWrong

    Published on March 9, 2025 11:53 AM GMTPreserving the memory, and the cells, of the best...

  • screenshot

    How well can Claude write coding questions? — LessWrong

    Published on March 9, 2025 5:29 AM GMTI'm curious as to how well Claude can write...

  • screenshot

    The machine has no mouth and it must scream — LessWrong

    Published on March 8, 2025 4:40 PM GMTI'm in a coworking space on the 25th floor...

  • screenshot

    HPMOR Anniversary Party — LessWrong

    Published on March 7, 2025 7:45 PM GMTDetails will follow.see https://www.lesswrong.com/posts/KGSidqLRXkpizsbcc/it-s-been-ten-years-i-propose-hpmor-anniversary-partiesDiscuss

  • screenshot

    How Do We Fix the Education Crisis? — LessWrong

    Published on March 8, 2025 2:59 AM GMTKey points:Standardized assessments do not provide signals for the...

  • screenshot

    GPT-4.5 Can Play Losing Chess — LessWrong

    Published on March 8, 2025 12:58 AM GMTAfter recently playing some chess against GPT-4.5 (it is...

  • screenshot

    #1 — LessWrong

    Published on March 7, 2025 8:09 PM GMTTheir comment was a paranoid and conspiratorial thought process...

  • screenshot

    are "almost-p-zombies" possible? — LessWrong

    Published on March 7, 2025 10:58 PM GMTIt's probably not possible to have a twin of...

  • screenshot

    Sufficiently Decentralized Intelligence is Indistinguishable from Synchronicity — LessWrong

    Published on March 7, 2025 9:50 PM GMTLoosely inspired by a submission to a hackathon on...

  • screenshot

    Amplifying the Computational No-Coincidence Conjecture — LessWrong

    Published on March 7, 2025 9:29 PM GMTIntroductionRecently, the Computational No-Coincidende Conjecture[1] was proposed, presented as an...

  • screenshot

    [ages 16-21] Apply to PAIR & ESPR, Summer AI & Rationality Programs — LessWrong

    Published on March 7, 2025 7:49 PM GMTTL;DR: PAIR on AI & Reasoning. ESPR on Everything,...

  • screenshot

    Forecasting newsletter #3/2025: Long march through the institutions — LessWrong

    Published on March 7, 2025 6:17 PM GMTHighlights:Manifold ending (a) cash markets, Kalshi slapped by regulators...

  • screenshot

    Childhood and Education #9: School is Hell — LessWrong

    Published on March 7, 2025 12:40 PM GMTThis complication of tales from the world of school...

  • screenshot

    The Insanity Detector and Writing — LessWrong

    Published on March 7, 2025 11:19 AM GMTA clinically insane person is detectable as such. Talking...

  • screenshot

    So how well is Claude playing Pokémon? — LessWrong

    Published on March 7, 2025 5:54 AM GMTBackground: After the release of Claude 3.7 Sonnet,[1] an Anthropic...

  • screenshot

    Are recent LLMs better at reasoning or better at memorizing? — LessWrong

    Published on March 7, 2025 2:44 AM GMTTLDR; By carefully designing a reasoning benchmark that counteracts...

  • screenshot

    The Dead Planet Theory — LessWrong

    Published on March 7, 2025 2:43 AM GMTHi, this is my first post on LessWrong but...

  • screenshot

    Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan — LessWrong

    Published on March 6, 2025 10:38 PM GMTNote: This is an automated crosspost from Anthropic. The...

  • screenshot

    How Can Average People Contribute to AI Safety? — LessWrong

    Published on March 6, 2025 10:50 PM GMTIntroductionBy now you've probably read about how AI and...

  • screenshot

    Lots of brief thoughts on Software Engineering — LessWrong

    Published on March 6, 2025 7:50 PM GMTI have lots of thoughts about software engineering, some...

  • screenshot

    AISN #49: Superintelligence Strategy — LessWrong

    Published on March 6, 2025 5:46 PM GMTWelcome to the AI Safety Newsletter by the Center...

  • screenshot

    Anthropic Decision Theory and the Strength of Life-Filled Futures — LessWrong

    Published on March 6, 2025 5:23 PM GMTThis post was written by prompting chatgptIntroductionDiscussions of anthropic...

  • screenshot

    Decision-Relevance of worlds and ADT implementations — LessWrong

    Published on March 6, 2025 4:57 PM GMTCrossposted on the EA Forum.Lots of effort has been...

  • screenshot

    AI #106: Not so Fast — LessWrong

    Published on March 6, 2025 3:40 PM GMTThis was GPT-4.5 week. That model is not so...

  • screenshot

    Can a finite physical device be Turing equivalent? — LessWrong

    Published on March 6, 2025 3:02 PM GMTHere's some very useful quotes by the article, and...

  • screenshot

    We should start looking for scheming "in the wild" — LessWrong

    Published on March 6, 2025 1:49 PM GMTTLDR: AI models are now capable enough that we might...

  • screenshot

    Publish your genomic data — LessWrong

    Published on March 6, 2025 12:39 PM GMTPublish your genomic data to the public domain as...

  • screenshot

    Which meat to eat: CO₂ vs Animal suffering — LessWrong

    Published on March 6, 2025 12:37 PM GMTAnimal agriculture generates an ungodly amount of animal suffering...

  • screenshot

    Musings on Scenario Forecasting and AI — LessWrong

    Published on March 6, 2025 12:28 PM GMTI have yet to write detailed scenarios for AI...

  • screenshot

    What is Lock-In? — LessWrong

    Published on March 6, 2025 11:09 AM GMTEpistemic status: a combination and synthesis of others' work,...

  • screenshot

    A Bear Case: My Predictions Regarding AI Progress — LessWrong

    Published on March 5, 2025 4:41 PM GMTThis isn't really a "timeline", as such – I...