~www_lesswrong_com | Bookmarks (708)
-
Brainrot — LessWrong
Published on January 26, 2025 5:35 AM GMTJanuary: In early 2026, Meta launches a fleet of...
-
Notes on Argentina — LessWrong
Published on January 26, 2025 3:51 AM GMTFitz Roy Massif, El Chalten, ArgentinaI recently got back...
-
Recommendations for Recent Posts/Sequences on Instrumental Rationality? — LessWrong
Published on January 26, 2025 12:41 AM GMTI absolutely love the Science of Winning at Life...
-
Anomalous Tokens in DeepSeek-V3 and r1 — LessWrong
Published on January 25, 2025 10:55 PM GMT“Anomalous”, “glitch”, or “unspeakable” tokens in an LLM are...
-
The Rising Sea — LessWrong
Published on January 25, 2025 8:48 PM GMTAnd then we hit a wall. Nobody expected it. Well......
-
Liron Shapira vs Ken Stanley on Doom Debates. A review — LessWrong
Published on January 24, 2025 6:01 PM GMTI summarize my learnings and thoughts on Liron Shapira's...
-
Is there such a thing as an impossible protein? — LessWrong
Published on January 24, 2025 5:12 PM GMTThis is something I’ve been thinking about since my...
-
Stargate AI-1 — LessWrong
Published on January 24, 2025 3:20 PM GMTThere was a comedy routine a few years ago....
-
QFT and neural nets: the basic idea — LessWrong
Published on January 24, 2025 1:54 PM GMTPreviously in the series: The laws of large numbers...
-
Eliciting bad contexts — LessWrong
Published on January 24, 2025 10:39 AM GMTSay an LLM agent behaves innocuously in some context...
-
Insights from "The Manga Guide to Physiology" — LessWrong
Published on January 24, 2025 5:18 AM GMTPhysiology seemed like a grab-bag of random processes which...
-
Do you consider perfect surveillance inevitable? — LessWrong
Published on January 24, 2025 4:57 AM GMTA lot of my recent research work focusses on:1....
-
Uncontrollable: A Surprisingly Good Introduction to AI Risk — LessWrong
Published on January 24, 2025 4:30 AM GMTI recently read Darren McKee's book "Uncontrollable: The Threat...
-
Contra Dances Getting Shorter and Earlier — LessWrong
Published on January 23, 2025 11:30 PM GMT I think of a standard contra dance as...
-
Starting Thoughts on RLHF — LessWrong
Published on January 23, 2025 10:16 PM GMTCross posted from SubstackContinuing the Stanford CS120 Introduction to...
-
Recursive Self-Modeling as a Plausible Mechanism for Real-time Introspection in Current Language Models — LessWrong
Published on January 22, 2025 6:36 PM GMT(and as a completely speculative hypothesis for the minimum...
-
Ut, an alternative gender-neutral pronoun — LessWrong
Published on January 22, 2025 5:36 PM GMTThis post is about ‘ut’, a gender-neutral pronoun I...
-
Mechanisms too simple for humans to design — LessWrong
Published on January 22, 2025 4:54 PM GMTCross-posted from Telescopic TurnipAs we all know, humans are...
-
Training Data Attribution: Examining Its Adoption & Use Cases — LessWrong
Published on January 22, 2025 3:41 PM GMTNote: This report was conducted in June 2024 and...
-
Training Data Attribution (TDA): Examining Its Adoption & Use Cases — LessWrong
Published on January 22, 2025 3:40 PM GMTNote: This report was conducted in June 2024 and...
-
The Quantum Mars Teleporter: An Empirical Test Of Personal Identity Theories — LessWrong
Published on January 22, 2025 11:48 AM GMTtl;dr: If a copy is not identical to the...
-
Bayesian Reasoning on Maps — LessWrong
Published on January 22, 2025 10:45 AM GMTThis is a linkpost for an article I've written...
-
Against blanket arguments against interpretability — LessWrong
Published on January 22, 2025 9:46 AM GMTOn blanket criticism and refutationIn his long post on...
-
Evolution and the Low Road to Nash — LessWrong
Published on January 22, 2025 7:06 AM GMTSolution concepts in game theory—like the Nash equilibrium and...