Supermen of the (Not so Far) Future — LessWrong
Published on May 2, 2025 3:55 PM GMTDespite being fairly well established as a discipline, genetics...
AI Welfare Risks — LessWrong
Published on May 2, 2025 5:49 PM GMTMy paper "AI Welfare Risks" has been accepted for...
Steering Language Models in Multiple Directions Simultaneously — LessWrong
Published on May 2, 2025 3:27 PM GMTNarmeen developed, ideated and validated K-steering at Martian. Luke...
RA x ControlAI video: What if AI just keeps getting smarter? — LessWrong
Published on May 2, 2025 2:19 PM GMTThe video is about extrapolating the future of AI...
OpenAI Preparedness Framework 2.0 — LessWrong
Published on May 2, 2025 1:10 PM GMTRight before releasing o3, OpenAI updated its Preparedness Framework...
Ex-OpenAI employee amici leave to file denied in Musk v OpenAI case? — LessWrong
Published on May 2, 2025 12:27 PM GMTSeveral ex-employees of OpenAI filed an amicus brief in...
The Continuum Fallacy and its Relatives — LessWrong
Published on May 2, 2025 2:58 AM GMTNote: I didn't write this essay, nor do I...
Roads are at maximum efficiency always — LessWrong
Published on May 2, 2025 10:29 AM GMTOn a theoretical road, the number of cars traveling...
My Research Process: Understanding and Cultivating Research Taste — LessWrong
Published on May 1, 2025 11:08 PM GMTThis is post 3 of a sequence on my...
AI Governance to Avoid Extinction: The Strategic Landscape and Actionable Research Questions — LessWrong
Published on May 1, 2025 10:46 PM GMTWe’re excited to release a new AI governance research...
AI #114: Liars, Sycophants and Cheaters — LessWrong
Published on May 1, 2025 2:00 PM GMTGemini 2.5 Pro is sitting in the corner, sulking....
Slowdown After 2028: Compute, RLVR Uncertainty, MoE Data Wall — LessWrong
Published on May 1, 2025 1:54 PM GMTIt'll take until ~2050 to repeat the level of...
Anthropomorphizing AI might be good, actually — LessWrong
Published on May 1, 2025 1:50 PM GMTIt is often noted that anthropomorphizing AI can be...
Dont focus on updating P doom — LessWrong
Published on May 1, 2025 11:10 AM GMTMotivation: Improving group epistemics.TL; DR (Changes to) P doom/alignment...
Prioritizing Work — LessWrong
Published on May 1, 2025 2:00 AM GMT I recently read a blog post that concluded...
Don't rely on a "race to the top" — LessWrong
Published on May 1, 2025 12:33 AM GMTTo make frontier AI safe enough, we need to...
Meta-Technicalities: Safeguarding Values in Formal Systems — LessWrong
Published on April 30, 2025 11:43 PM GMTFormal Systems Create CoordinationWe live in a mesh of...
Obstacles in ARC's agenda: Finding explanations — LessWrong
Published on April 30, 2025 11:03 PM GMTAs an employee of the European AI Office, it's...
GPT-4o Responds to Negative Feedback — LessWrong
Published on April 30, 2025 8:20 PM GMTWhoops. Sorry everyone. Rolling back to a previous version.Here’s...
State of play of AI progress (and related brakes on an intelligence explosion) [Linkpost] — LessWrong
Published on April 30, 2025 7:58 PM GMTThis time around, I'm sharing a post on Interconnects...
Misrepresentation as a Barrier for Interp (Part I) — LessWrong
Published on April 29, 2025 5:07 PM GMTJohn: So there’s this thing about interp, where most...
AISN #53: An Open Letter Attempts to Block OpenAI Restructuring — LessWrong
Published on April 29, 2025 4:13 PM GMTWelcome to the AI Safety Newsletter by the Center...
What could Alphafold 4 look like? — LessWrong
Published on April 29, 2025 3:45 PM GMTI made another biology-ML podcast! Two hours long, deeply...
Sealed Computation: Towards Low-Friction Proof of Locality — LessWrong
Published on April 29, 2025 3:26 PM GMTInference CertificatesAs a prerequisite for the virtuality.network, we need...
Dating Roundup #4: An App for That — LessWrong
Published on April 29, 2025 1:10 PM GMTPreviously: #1, #2, #3. As time goes by, the...
Talk on letters to AI (London) — LessWrong
Published on April 29, 2025 9:50 AM GMTOn Thursday 1 May 2025 1900 (UK time) I'm...
D&D.Sci Tax Day: Adventurers and Assessments Evaluation & Ruleset — LessWrong
Published on April 29, 2025 2:00 AM GMTThis is a follow-up to last week's D&D.Sci scenario:...
Fundamentals of Safe AI (Phase 1) – Applications Open for the Global Cohort — LessWrong
Published on April 28, 2025 8:52 PM GMTIntroductionAI systems are advancing rapidly.At the same time, concerns...
How to Build a Third Place on Focusmate — LessWrong
Published on April 28, 2025 11:46 PM GMTIntroductionFocusmate changed my life. I started using it mid-2023...
China's Petition System: It Looks Like Democracy — But It Isn't — LessWrong
Published on April 28, 2025 8:56 PM GMTIn most democracies,the very way people file complaints tells...
Victoria, BC — LessWrong
Published on April 27, 2025 5:08 PM GMTI am starting a meetup for anyone in Victoria....
Our Reality: A Simulation Run by a Paperclip Maximizer — LessWrong
Published on April 27, 2025 4:17 PM GMTOur universe is probably a computer simulation created by...
Questions for old LW members: how have discussions about AI changed compared to 10+ years ago? — LessWrong
Published on April 27, 2025 4:11 PM GMTI wasn't on LessWrong 10 years ago, during the...
The case for multi-decade AI timelines [Linkpost] — LessWrong
Published on April 27, 2025 3:31 PM GMTSo this post is an argument that multi-decade timelines...
After Internet Dependency — LessWrong
Published on April 27, 2025 8:18 AM GMT ChatGPT recently introduced improved cross-chat memory, and I've found...
My Research Process: Key Mindsets - Truth-Seeking, Prioritisation, Moving Fast — LessWrong
Published on April 27, 2025 2:38 PM GMTThis is post 2 of a sequence on my...
I doubt model collapse will happen — LessWrong
Published on April 27, 2025 2:08 PM GMTA common idea held by people debating AI art...
Propaganda-Bot: A Sketch of a Possible RSI — LessWrong
Published on April 27, 2025 12:15 PM GMTSomeone asked me for an example of Recursive Self...
Emergence of superintelligence from AI hiveminds: how to make it human-friendly? — LessWrong
Published on April 27, 2025 4:51 AM GMT"Hive mind" may not be the ideal term. What...
"The Urgency of Interpretability" (Dario Amodei) — LessWrong
Published on April 27, 2025 4:31 AM GMTDario Amodei posted a new essay titled "The Urgency...
How Democratic Is Effective Altruism — Really? — LessWrong
Published on April 25, 2025 4:02 PM GMTIntroductionEffective Altruism (EA) is a social movement that aims...
Will Programmer Compensation Decouple from Productivity? — LessWrong
Published on April 25, 2025 3:32 PM GMTSince the 1970s, productivity has outpaced wage growth in...
Zstd Window Size — LessWrong
Published on April 25, 2025 2:40 PM GMT At work we've recently been using zstd as...
List of petitions against OpenAI's for-profit move — LessWrong
Published on April 25, 2025 10:03 AM GMTLetters to attorney generals, etc, to block OpenAI from...
A review of "Why Did Environmentalism Become Partisan?" — LessWrong
Published on April 25, 2025 5:12 AM GMTI was recently encouraged to read Jeffrey Heninger's report...
LLM Pareto Frontier But Live — LessWrong
Published on April 24, 2025 9:22 PM GMTTLDR: I really like the graph where they show...
Modifying LLM Beliefs with Synthetic Document Finetuning — LessWrong
Published on April 24, 2025 9:15 PM GMTIn this post, we study whether we can modify...
This prompt (sometimes) makes ChatGPT think about terrorist organisations — LessWrong
Published on April 24, 2025 9:15 PM GMTYesterday, I couldn't wrap my head around some programming...
Token and Taboo — LessWrong
Published on April 24, 2025 8:17 PM GMTWhat in retrospect seem like serious moral crimes were...
Trouble at Miningtown: Prologue — LessWrong
Published on April 24, 2025 7:09 PM GMTIn late 2019 I wrote a TTRPG.The theme was...