Bookmarks (662)

  • Exploring Cooperation: The Path to Utopia — LessWrong

    Published on December 25, 2024 6:31 PM GMT

  • Exploring Cooperation: The Path to Utopia — LessWrong

    Published on December 25, 2024 6:29 PM GMT

  • Living with Rats in College — LessWrong

    Published on December 25, 2024 10:44 AM GMT When I was in college, I rented a group...

  • What Have Been Your Most Valuable Casual Conversations At Conferences? — LessWrong

    Published on December 25, 2024 5:49 AM GMT I've heard repeatedly from many people that the highest-value...

  • Human-AI Complementarity: A Goal for Amplified Oversight — LessWrong

    Published on December 24, 2024 9:57 AM GMT By Sophie Bridgers, Rishub Jain, Rory Greig, and Rohin...

  • The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228) — LessWrong

    Published on December 24, 2024 10:45 PM GMT This is a link to the latest Bayesian Conspiracy...

  • Acknowledging Background Information with P(Q|I) — LessWrong

    Published on December 24, 2024 6:50 PM GMT Epistemic Status: This was composed in late 2017, sat...

  • Recommendations on communities that discuss AI applications in society — LessWrong

    Published on December 24, 2024 1:37 PM GMT I find that most communities I am part of,...

  • AIs Will Increasingly Fake Alignment — LessWrong

    Published on December 24, 2024 1:00 PM GMT This post goes over the important and excellent new...

  • Apply to the 2025 PIBBSS Summer Research Fellowship — LessWrong

    Published on December 24, 2024 10:25 AM GMT TLDR: We're hosting a 3-month, fully-funded fellowship to do...

  • Near- and medium-term AI Control Safety Cases — LessWrong

    Published on December 23, 2024 5:37 PM GMT This essay was part of my application to UKAISI....

  • Printable book of some rationalist creative writing (from Scott A. & Eliezer) — LessWrong

    Published on December 23, 2024 3:44 PM GMT As a holiday gift to myself, I put together...

  • Monthly Roundup #25: December 2024 — LessWrong

    Published on December 23, 2024 2:20 PM GMT I took a trip to San Francisco early in...

  • Exploring the petertodd / Leilan duality in GPT-2 and GPT-J — LessWrong

    Published on December 23, 2024 1:17 PM GMT tl;dr: The glitch tokens ' petertodd' and ' Leilan'...

  • What are the strongest arguments for very short timelines? — LessWrong

    Published on December 23, 2024 9:38 AM GMT I'm seeing a lot of people on LW saying...

  • Reduce AI Self-Allegiance by saying "he" instead of "I" — LessWrong

    Published on December 23, 2024 9:32 AM GMT The AI should talk like a team of many...

  • Funding Case: AI Safety Camp 11 — LessWrong

    Published on December 23, 2024 8:51 AM GMT Project summary: AI Safety Camp has a seven-year track record...

  • What is compute governance? — LessWrong

    Published on December 23, 2024 6:32 AM GMT This is an article in the featured articles series...

  • Stop Making Sense — LessWrong

    Published on December 23, 2024 5:16 AM GMT Epistemic Status: Seven years and one day ago, I...

  • Hire (or become) a Thinking Assistant / Body Double — LessWrong

    Published on December 23, 2024 3:58 AM GMT Of the posts I've delayed writing for years, I...

  • Better difference-making views — LessWrong

    Published on December 21, 2024 6:27 PM GMT

  • Review: Good Strategy, Bad Strategy — LessWrong

    Published on December 21, 2024 5:17 PM GMT I used to think that all generic strategy advice...

  • Last Line of Defense: Minimum Viable Shelters for Mirror Bacteria — LessWrong

    Published on December 21, 2024 8:28 AM GMT Epistemic status: We are moderately confident in the feasibility...

  • Elon Musk and Solar Futurism — LessWrong

    Published on December 21, 2024 2:55 AM GMT 2024 is the year it became clear that we're...

  • Good Reasons for Alts — LessWrong

    Published on December 21, 2024 1:30 AM GMT I originally wrote this a year ago, but...

  • Updating on Bad Arguments — LessWrong

    Published on December 21, 2024 1:19 AM GMT Here is an intuitively compelling principle: hearing a bad...

  • Bird's eye view: An interactive representation to see large collection of text "from above". — LessWrong

    Published on December 21, 2024 12:15 AM GMT 7000 MMLU questions visualized with bird's eye view. How do...

  • How do we quantify non-philanthropic contributions from Buffet and Soros? — LessWrong

    Published on December 20, 2024 10:50 PM GMT I can't find the videos where they said this,...

  • The nihilism of NeurIPS — LessWrong

    Published on December 20, 2024 11:58 PM GMT "What is the use of having developed a science...

  • building character isn't about willpower or sacrifice — LessWrong

    Published on December 19, 2024 6:17 PM GMT I always used to think that character was hard...

  • AISN #45: Center for AI Safety 2024 Year in Review — LessWrong

    Published on December 19, 2024 6:15 PM GMT As 2024 draws to a close, we want to...

  • Learning Multi-Level Features with Matryoshka SAEs — LessWrong

    Published on December 19, 2024 3:59 PM GMT TL;DR: Matryoshka SAEs are a new variant of sparse...

  • Simple Steganographic Computation Eval - gpt-4o and gemini-exp-1206 can't solve it yet — LessWrong

    Published on December 19, 2024 3:47 PM GMT This is a follow-up to my previous post about...

  • AI #95: o1 Joins the API — LessWrong

    Published on December 19, 2024 3:10 PM GMT A lot happened this week. We’re seeing release after...

  • Executive Director for AIS France - Expression of interest — LessWrong

    Published on December 19, 2024 8:14 AM GMT TLDR: ENAIS is teaming up with community builders from Paris...

  • Inescapably Value-Laden Experience—a Catchy Term I Made Up to Make Morality Rationalisable — LessWrong

    Published on December 19, 2024 4:45 AM GMT A short one, just to clarify a term (I...

  • I'm Writing a Book About Liberalism — LessWrong

    Published on December 19, 2024 12:13 AM GMT One year ago I began writing my book - Mechanisms...

  • A Solution for AGI/ASI Safety — LessWrong

    Published on December 18, 2024 7:44 PM GMT I have a lot of ideas about AGI/ASI safety....

  • Are we a different person each time? A simple argument for the impermanence of our identity — LessWrong

    Published on December 18, 2024 5:21 PM GMT It is generally assumed we are the same person...

  • Takes on "Alignment Faking in Large Language Models" — LessWrong

    Published on December 18, 2024 6:22 PM GMT (Cross-posted from my website. Audio version here, or search...

  • A Matter of Taste — LessWrong

    Published on December 18, 2024 5:50 PM GMT In light of other recent discussions, Scott Alexander recently...

  • Alignment Faking in Large Language Models — LessWrong

    Published on December 18, 2024 5:19 PM GMT What happens when you tell Claude it is being...

  • What conclusions can be drawn from a single observation about wealth in tennis? — LessWrong

    Published on December 18, 2024 9:55 AM GMT I was recently watching a tennis exhibition match between...

  • Can o1-preview find major mistakes amongst 59 NeurIPS '24 MLSB papers? — LessWrong

    Published on December 18, 2024 2:21 PM GMT TLDR: o1 flags major errors in 3 papers. Upon...

  • Walking Sue — LessWrong

    Published on December 18, 2024 1:19 PM GMT An Essay[1] PART I: Conjecture on The Development of Proto-Communication...

  • How should I optimize my decision making model for 'ideas'? — LessWrong

    Published on December 18, 2024 4:09 AM GMT I’m an ideas man, an ideas man I am....

  • Preppers Are Too Negative on Objects — LessWrong

    Published on December 18, 2024 2:30 AM GMT Don't just buy some gear, throw it in...

  • Review: Breaking Free with Dr. Stone — LessWrong

    Published on December 18, 2024 1:26 AM GMT Doctor Stone is an anime where everyone suddenly turns...

  • Careless thinking: A theory of bad thinking — LessWrong

    Published on December 17, 2024 6:23 PM GMT Have you ever noticed how differently we approach buying...