Exploring Cooperation: The Path to Utopia — LessWrong
Published on December 25, 2024 6:31 PM GMTDiscuss
Exploring Cooperation: The Path to Utopia — LessWrong
Published on December 25, 2024 6:29 PM GMTDiscuss
Living with Rats in College — LessWrong
Published on December 25, 2024 10:44 AM GMTWhen I was in college, I rented a group...
What Have Been Your Most Valuable Casual Conversations At Conferences? — LessWrong
Published on December 25, 2024 5:49 AM GMTI've heard repeatedly from many people that the highest-value...
Human-AI Complementarity: A Goal for Amplified Oversight — LessWrong
Published on December 24, 2024 9:57 AM GMTBy Sophie Bridgers, Rishub Jain, Rory Greig, and Rohin...
The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228) — LessWrong
Published on December 24, 2024 10:45 PM GMTThis is a link to the latest Bayesian Conspiracy...
Acknowledging Background Information with P(Q|I) — LessWrong
Published on December 24, 2024 6:50 PM GMTEpistemic Status: This was composed in late 2017, sat...
Recommendations on communities that discuss AI applications in society — LessWrong
Published on December 24, 2024 1:37 PM GMTI find that most communities I am part of,...
AIs Will Increasingly Fake Alignment — LessWrong
Published on December 24, 2024 1:00 PM GMTThis post goes over the important and excellent new...
Apply to the 2025 PIBBSS Summer Research Fellowship — LessWrong
Published on December 24, 2024 10:25 AM GMTTLDR: We're hosting a 3-month, fully-funded fellowship to do...
Near- and medium-term AI Control Safety Cases — LessWrong
Published on December 23, 2024 5:37 PM GMTThis essay was part of my application to UKAISI....
Printable book of some rationalist creative writing (from Scott A. & Eliezer) — LessWrong
Published on December 23, 2024 3:44 PM GMTAs a holiday gift to myself, I put together...
Monthly Roundup #25: December 2024 — LessWrong
Published on December 23, 2024 2:20 PM GMTI took a trip to San Francisco early in...
Exploring the petertodd / Leilan duality in GPT-2 and GPT-J — LessWrong
Published on December 23, 2024 1:17 PM GMTtl;dr: The glitch tokens ' petertodd' and ' Leilan'...
What are the strongest arguments for very short timelines? — LessWrong
Published on December 23, 2024 9:38 AM GMTI'm seeing a lot of people on LW saying...
Reduce AI Self-Allegiance by saying "he" instead of "I" — LessWrong
Published on December 23, 2024 9:32 AM GMTThe AI should talk like a team of many...
Funding Case: AI Safety Camp 11 — LessWrong
Published on December 23, 2024 8:51 AM GMTProject summaryAI Safety Camp has a seven-year track record...
What is compute governance? — LessWrong
Published on December 23, 2024 6:32 AM GMTThis is an article in the featured articles series...
Stop Making Sense — LessWrong
Published on December 23, 2024 5:16 AM GMTEpistemic Status: Seven years and one day ago, I...
Hire (or become) a Thinking Assistant / Body Double — LessWrong
Published on December 23, 2024 3:58 AM GMTOf the posts I've delayed writing for years, I...
Better difference-making views — LessWrong
Published on December 21, 2024 6:27 PM GMTDiscuss
Review: Good Strategy, Bad Strategy — LessWrong
Published on December 21, 2024 5:17 PM GMTI used to think that all generic strategy advice...
Last Line of Defense: Minimum Viable Shelters for Mirror Bacteria — LessWrong
Published on December 21, 2024 8:28 AM GMTEpistemic status: We are moderately confident in the feasibility...
Elon Musk and Solar Futurism — LessWrong
Published on December 21, 2024 2:55 AM GMT2024 is the year it became clear that we're...
Good Reasons for Alts — LessWrong
Published on December 21, 2024 1:30 AM GMT I originally wrote this a year ago, but...
Updating on Bad Arguments — LessWrong
Published on December 21, 2024 1:19 AM GMTHere is an intuitively compelling principle: hearing a bad...
Bird's eye view: An interactive representation to see large collection of text "from above". — LessWrong
Published on December 21, 2024 12:15 AM GMT7000 MMLU questions visualized with bird's eye viewHow do...
How do we quantify non-philanthropic contributions from Buffet and Soros? — LessWrong
Published on December 20, 2024 10:50 PM GMTI can't find the videos where they said this,...
The nihilism of NeurIPS — LessWrong
Published on December 20, 2024 11:58 PM GMT"What is the use of having developed a science...
Forecast 2025 With Vox's Future Perfect Team — $2,500 Prize Pool — LessWrong
Published on December 20, 2024 11:00 PM GMTDiscuss
building character isn't about willpower or sacrifice — LessWrong
Published on December 19, 2024 6:17 PM GMTI always used to think that character was hard...
AISN #45: Center for AI Safety 2024 Year in Review — LessWrong
Published on December 19, 2024 6:15 PM GMTAs 2024 draws to a close, we want to...
Learning Multi-Level Features with Matryoshka SAEs — LessWrong
Published on December 19, 2024 3:59 PM GMTTL;DR: Matryoshka SAEs are a new variant of sparse...
Simple Steganographic Computation Eval - gpt-4o and gemini-exp-1206 can't solve it yet — LessWrong
Published on December 19, 2024 3:47 PM GMTThis is a follow-up to my previous post about...
AI #95: o1 Joins the API — LessWrong
Published on December 19, 2024 3:10 PM GMTA lot happened this week. We’re seeing release after...
Executive Director for AIS France - Expression of interest — LessWrong
Published on December 19, 2024 8:14 AM GMTTLDR:ENAIS is teaming up with community builders from Paris...
Inescapably Value-Laden Experience—a Catchy Term I Made Up to Make Morality Rationalisable — LessWrong
Published on December 19, 2024 4:45 AM GMTA short one, just to clarify a term (I...
I'm Writing a Book About Liberalism — LessWrong
Published on December 19, 2024 12:13 AM GMTOne year ago I began writing my book - Mechanisms...
A Solution for AGI/ASI Safety — LessWrong
Published on December 18, 2024 7:44 PM GMTI have a lot of ideas about AGI/ASI safety....
Are we a different person each time? A simple argument for the impermanence of our identity — LessWrong
Published on December 18, 2024 5:21 PM GMTIt is generally assumed we are the same person...
Takes on "Alignment Faking in Large Language Models" — LessWrong
Published on December 18, 2024 6:22 PM GMT(Cross-posted from my website. Audio version here, or search...
A Matter of Taste — LessWrong
Published on December 18, 2024 5:50 PM GMTIn light of other recent discussions, Scott Alexander recently...
Alignment Faking in Large Language Models — LessWrong
Published on December 18, 2024 5:19 PM GMTWhat happens when you tell Claude it is being...
What conclusions can be drawn from a single observation about wealth in tennis? — LessWrong
Published on December 18, 2024 9:55 AM GMTI was recently watching a tennis exhibition match between...
Can o1-preview find major mistakes amongst 59 NeurIPS '24 MLSB papers? — LessWrong
Published on December 18, 2024 2:21 PM GMTTLDR: o1 flags major errors in 3 papers. Upon...
Walking Sue — LessWrong
Published on December 18, 2024 1:19 PM GMTAn Essay[1]PART I: Conjecture on The Development of Proto-Communication...
How should I optimize my decision making model for 'ideas'? — LessWrong
Published on December 18, 2024 4:09 AM GMTI’m an ideas man, an ideas man I am....
Preppers Are Too Negative on Objects — LessWrong
Published on December 18, 2024 2:30 AM GMT Don't just buy some gear, throw it in...
Review: Breaking Free with Dr. Stone — LessWrong
Published on December 18, 2024 1:26 AM GMTDoctor Stone is an anime where everyone suddenly turns...
Careless thinking: A theory of bad thinking — LessWrong
Published on December 17, 2024 6:23 PM GMTHave you ever noticed how differently we approach buying...