~www_lesswrong_com | Bookmarks (664)
-
Win/continue/lose scenarios and execute/replace/audit protocols — LessWrong
Published on November 15, 2024 3:47 PM GMTIn this post, I’ll make a technical point that...
-
Proposing the Conditional AI Safety Treaty (linkpost TIME) — LessWrong
Published on November 15, 2024 1:59 PM GMTTechnological progress can excite us, politics can infuriate us,...
-
Seven lessons I didn't learn from election day — LessWrong
Published on November 14, 2024 6:39 PM GMTI spent most of my election day -- 3pm...
-
Effects of Non-Uniform Sparsity on Superposition in Toy Models — LessWrong
Published on November 14, 2024 4:59 PM GMTAbstractThis post summarises my findings on the effects of...
-
The Early Christian Strategy — LessWrong
Published on November 14, 2024 5:02 PM GMTScott Alexander's latest today discusses Robert Axelrod's Prisoner’s Dilemma...
-
'Estimat - Values and Data’s For Starters'- A Necessary Proposal? — LessWrong
Published on November 14, 2024 2:37 PM GMT1. PROBLEM In today’s digital era, teenagers face a dual...
-
AI #90: The Wall — LessWrong
Published on November 14, 2024 2:10 PM GMTAs the Trump transition continues and we try to...
-
Evolutionary prompt optimization for SAE feature visualization — LessWrong
Published on November 14, 2024 1:06 PM GMTTLDR:Fluent dreaming for language models is an algorithm based on...
-
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems — LessWrong
Published on November 14, 2024 7:00 AM GMTYouTube link Do language models understand the causal structure...
-
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI — LessWrong
Published on November 14, 2024 6:13 AM GMTFrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in...
-
Concrete Methods for Heuristic Estimation on Neural Networks — LessWrong
Published on November 14, 2024 5:07 AM GMTThanks to Erik Jenner for helpful comments and discussion(Epistemic...
-
Heresies in the Shadow of the Sequences — LessWrong
Published on November 14, 2024 5:01 AM GMTReligions are collections of cherished but mistaken principles. So...
-
Current Attitudes Toward AI Provide Little Data Relevant to Attitudes Toward AGI — LessWrong
Published on November 12, 2024 6:23 PM GMTEpistemic status: Sudden public attitude shift seems quite possible,...
-
Basics of Handling Disagreements with People — LessWrong
Published on November 12, 2024 5:55 PM GMTEpistemic Status: This is a collection of useful heuristics...
-
Registrations Open for 2024 NYC Secular Solstice & Megameetup — LessWrong
Published on November 12, 2024 5:50 PM GMTOn December 14th, New York City will have a...
-
2024 NYC Secular Solstice & Megameetup — LessWrong
Published on November 12, 2024 5:46 PM GMTSecular Solstice is a celebration of hope in darkness....
-
2025 Q1 Pivotal Research Fellowship (Technical & Policy) — LessWrong
Published on November 12, 2024 10:56 AM GMTWe’re excited to announce that applications are now open...
-
Theories With Mentalistic Atoms Are As Validly Called Theories As Theories With Only Non-Mentalistic Atoms — LessWrong
Published on November 12, 2024 6:45 AM GMT[ This is supposed to be a didactic post....
-
The lying p value — LessWrong
Published on November 12, 2024 6:12 AM GMTQuick check: do you agree or disagree with the...
-
The Packaging and the Payload — LessWrong
Published on November 12, 2024 3:07 AM GMTI.As I've run and studied meetups, there's a useful...
-
Consider tabooing "I think" — LessWrong
Published on November 12, 2024 2:00 AM GMTPeople say "I think" a lot. Here are some...
-
Festival Stats 2024 — LessWrong
Published on November 12, 2024 2:00 AM GMT Each year ( 2014, 2015, 2016, 2017, 2018,...
-
Personal AI Planning — LessWrong
Published on November 10, 2024 2:00 PM GMT LLMs are getting much more capable, and progress...
-
AI alignment via civilizational cognitive updates — LessWrong
Published on November 10, 2024 9:33 AM GMT(This started as a reply to @Tamsin Leake 's...