Bookmarks (704)

screenshot

Supermen of the (Not so Far) Future — LessWrong

lesswrong.com

screenshot

1

screenshot

AI Welfare Risks — LessWrong

lesswrong.com

screenshot

1

screenshot

Steering Language Models in Multiple Directions Simultaneously — LessWrong

lesswrong.com

screenshot

1

screenshot

RA x ControlAI video: What if AI just keeps getting smarter? — LessWrong

lesswrong.com

screenshot

1

screenshot

OpenAI Preparedness Framework 2.0 — LessWrong

lesswrong.com

screenshot

1

screenshot

Ex-OpenAI employee amici leave to file denied in Musk v OpenAI case? — LessWrong

lesswrong.com

screenshot

1

screenshot

The Continuum Fallacy and its Relatives — LessWrong

lesswrong.com

screenshot

1

screenshot

Roads are at maximum efficiency always — LessWrong

lesswrong.com

screenshot

1

screenshot

My Research Process: Understanding and Cultivating Research Taste — LessWrong

lesswrong.com

screenshot

1

screenshot

AI Governance to Avoid Extinction: The Strategic Landscape and Actionable Research Questions — LessWrong

lesswrong.com

screenshot

1

screenshot

AI #114: Liars, Sycophants and Cheaters — LessWrong

lesswrong.com

screenshot

1

screenshot

Slowdown After 2028: Compute, RLVR Uncertainty, MoE Data Wall — LessWrong

lesswrong.com

screenshot

1

screenshot

Anthropomorphizing AI might be good, actually — LessWrong

lesswrong.com

screenshot

1

screenshot

Dont focus on updating P doom — LessWrong

lesswrong.com

screenshot

1

screenshot

Prioritizing Work — LessWrong

lesswrong.com

screenshot

1

screenshot

Don't rely on a "race to the top" — LessWrong

lesswrong.com

screenshot

1

screenshot

Meta-Technicalities: Safeguarding Values in Formal Systems — LessWrong

lesswrong.com

screenshot

1

screenshot

Obstacles in ARC's agenda: Finding explanations — LessWrong

lesswrong.com

screenshot

1

screenshot

GPT-4o Responds to Negative Feedback — LessWrong

lesswrong.com

screenshot

1

screenshot

State of play of AI progress (and related brakes on an intelligence explosion) [Linkpost] — LessWrong

lesswrong.com

screenshot

1

screenshot

Misrepresentation as a Barrier for Interp (Part I) — LessWrong

lesswrong.com

screenshot

1

screenshot

AISN #53: An Open Letter Attempts to Block OpenAI Restructuring — LessWrong

lesswrong.com

screenshot

1

screenshot

What could Alphafold 4 look like? — LessWrong

lesswrong.com

screenshot

1

screenshot

Sealed Computation: Towards Low-Friction Proof of Locality — LessWrong

lesswrong.com

screenshot

1

screenshot

Dating Roundup #4: An App for That — LessWrong

lesswrong.com

screenshot

1

screenshot

Talk on letters to AI (London) — LessWrong

lesswrong.com

screenshot

1

screenshot

D&D.Sci Tax Day: Adventurers and Assessments Evaluation & Ruleset — LessWrong

lesswrong.com

screenshot

1

screenshot

Fundamentals of Safe AI (Phase 1) – Applications Open for the Global Cohort — LessWrong

lesswrong.com

screenshot

1

screenshot

How to Build a Third Place on Focusmate — LessWrong

lesswrong.com

screenshot

1

screenshot

China's Petition System: It Looks Like Democracy — But It Isn't — LessWrong

lesswrong.com

screenshot

1

screenshot

Victoria, BC — LessWrong

lesswrong.com

screenshot

1

screenshot

Our Reality: A Simulation Run by a Paperclip Maximizer — LessWrong

lesswrong.com

screenshot

1

screenshot

Questions for old LW members: how have discussions about AI changed compared to 10+ years ago? — LessWrong

lesswrong.com

screenshot

1

screenshot

The case for multi-decade AI timelines [Linkpost] — LessWrong

lesswrong.com

screenshot

1

screenshot

After Internet Dependency — LessWrong

lesswrong.com

screenshot

1

screenshot

My Research Process: Key Mindsets - Truth-Seeking, Prioritisation, Moving Fast — LessWrong

lesswrong.com

screenshot

1

screenshot

I doubt model collapse will happen — LessWrong

lesswrong.com

screenshot

1

screenshot

Propaganda-Bot: A Sketch of a Possible RSI — LessWrong

lesswrong.com

screenshot

1

screenshot

Emergence of superintelligence from AI hiveminds: how to make it human-friendly? — LessWrong

lesswrong.com

screenshot

1

screenshot

"The Urgency of Interpretability" (Dario Amodei) — LessWrong

lesswrong.com

screenshot

1

screenshot

How Democratic Is Effective Altruism — Really? — LessWrong

lesswrong.com

screenshot

1

screenshot

Will Programmer Compensation Decouple from Productivity? — LessWrong

lesswrong.com

screenshot

1

screenshot

Zstd Window Size — LessWrong

lesswrong.com

screenshot

1

screenshot

List of petitions against OpenAI's for-profit move — LessWrong

lesswrong.com

screenshot

1

screenshot

A review of "Why Did Environmentalism Become Partisan?" — LessWrong

lesswrong.com

screenshot

1

screenshot

LLM Pareto Frontier But Live — LessWrong

lesswrong.com

screenshot

1

screenshot

Modifying LLM Beliefs with Synthetic Document Finetuning — LessWrong

lesswrong.com

screenshot

1

screenshot

This prompt (sometimes) makes ChatGPT think about terrorist organisations — LessWrong

lesswrong.com

screenshot

1

screenshot

Token and Taboo — LessWrong

lesswrong.com

screenshot

1

screenshot

Trouble at Miningtown: Prologue — LessWrong

lesswrong.com

screenshot

1