~www_lesswrong_com | Bookmarks (664)
-
AI Safety at the Frontier: Paper Highlights, September '24 — LessWrong
Published on October 2, 2024 9:49 AM GMT This is a selection of AI safety paper highlights...
-
Self-Help Corner: Loop Detection — LessWrong
Published on October 2, 2024 8:33 AM GMT The more I work on myself, the more I...
-
The murderous shortcut: a toy model of instrumental convergence — LessWrong
Published on October 2, 2024 6:48 AM GMT Suppose you can tell your AI to meet a...
-
Switching to a Yamaha P-121 Keyboard — LessWrong
Published on October 2, 2024 2:20 AM GMT The keyboard is a bit of an awkward...
-
Foresight Vision Weekend 2024 — LessWrong
Published on October 1, 2024 9:59 PM GMT Vision Weekend US, Foresight’s annual festival, is approaching, offering...
-
Knowledge Base 1: Could it increase intelligence and make it safer? — LessWrong
Published on September 30, 2024 4:00 PM GMT This series of posts presents the idea of building...
-
Point of Failure: Semiconductor-Grade Quartz — LessWrong
Published on September 30, 2024 3:57 PM GMT (Image caption: ChatGPT 4o’s interpretation of semiconductor grade quartz.) We rarely think...
-
on bacteria, on teeth — LessWrong
Published on September 30, 2024 3:56 PM GMT You may have heard that tooth decay is caused...
-
SB 1047 gets vetoed — LessWrong
Published on September 30, 2024 3:49 PM GMT Just what it says on the tin. Covered most...
-
California Governor Gavin Newsom vetoes AI Safety Bill SB 1047 — LessWrong
Published on September 30, 2024 2:54 PM GMT NPR writes: Gov. Gavin Newsom of California on Sunday vetoed...
-
Of Birds and Bees — LessWrong
Published on September 30, 2024 10:52 AM GMT The Hierarchy: There is a hierarchy in life from simple...
-
A new process for mapping discussions — LessWrong
Published on September 30, 2024 8:57 AM GMT Recently my team and I have been working on...
-
MATS Alumni Impact Analysis — LessWrong
Published on September 30, 2024 2:35 AM GMT Summary: This winter, MATS will be running our seventh program. In...
-
Most capable publicly available agents? — LessWrong
Published on September 30, 2024 12:04 AM GMT Looking to do a little compare and contrast.
-
the case for CoT unfaithfulness is overstated — LessWrong
Published on September 29, 2024 10:07 PM GMT [Meta note: quickly written, unpolished. Also, it's possible that...
-
The Geometry of Feelings and Nonsense in Large Language Models — LessWrong
Published on September 27, 2024 5:49 PM GMT This post has some ablation results around the thesis...
-
What is Randomness? — LessWrong
Published on September 27, 2024 5:49 PM GMT epistemic status: my intuition after reading and watching a...
-
Why is o1 so deceptive? — LessWrong
Published on September 27, 2024 5:27 PM GMT The o1 system card reports: 0.8% of o1-preview’s responses got...
-
The Offense-Defense Balance of Gene Drives — LessWrong
Published on September 27, 2024 4:47 PM GMT I recently wrote a twitter thread for Works In...
-
Book Review: On the Edge: The Future — LessWrong
Published on September 27, 2024 2:00 PM GMT Previously: The Fundamentals, The Gamblers, The Business. We have...
-
Is cybercrime really costing trillions per year? — LessWrong
Published on September 27, 2024 8:44 AM GMT Many sources report that cybercrime costs the global economy...
-
Australian AI Safety Forum 2024 — LessWrong
Published on September 27, 2024 12:40 AM GMT We're excited to announce the inaugural Australian AI Safety...
-
Gell-Mann checks — LessWrong
Published on September 26, 2024 10:45 PM GMT tl;dr: "Gell-Mann amnesia" is a cognitive bias, an observation of...
-
Doing Nothing Utility Function — LessWrong
Published on September 26, 2024 10:05 PM GMT One of the questions I've heard asked is "how...