LESSWRONGThe Best of LessWrong
LW

The Best of LessWrong

Here you can find the best posts of LessWrong. When posts turn more than a year old, the LessWrong community reviews and votes on how well they have stood the test of time. These are the posts that have ranked the highest for all years since 2018 (when our annual tradition of choosing the least wrong of LessWrong began).

For the years 2018, 2019 and 2020 we also published physical books with the results of our annual vote, which you can buy and learn more about here.

Sort by:

curatedyear

+

Rationality

Eliezer Yudkowsky

Local Validity as a Key to Sanity and Civilization

"Other people are wrong" vs "I am right"

Strong Evidence is Common

You Are Not Measuring What You Think You Are Measuring

Gears-Level Models are Capital Investments

How to Ignore Your Emotions (while also thinking you're awesome at emotions)

Scott Garrabrant

Yes Requires the Possibility of No

Scott Alexander

Trapped Priors As A Basic Problem Of Rationality

[DEACTIVATED] Duncan Sabien

Split and Commit

A Sketch of Good Communication

Eliezer Yudkowsky

Meta-Honesty: Firming Up Honesty Around Its Edge-Cases

[DEACTIVATED] Duncan Sabien

Lies, Damn Lies, and Fabricated Options

[DEACTIVATED] Duncan Sabien

CFAR Participant Handbook now available to all

What Are You Tracking In Your Head?

The First Sample Gives the Most Information

[DEACTIVATED] Duncan Sabien

Shoulder Advisors 101

Feature Selection

Mistakes with Conservation of Expected Evidence

Scott Alexander

Varieties Of Argumentative Experience

Eliezer Yudkowsky

Toolbox-thinking and Law-thinking

The Felt Sense: What, Why and How

[DEACTIVATED] Duncan Sabien

Cup-Stacking Skills (or, Reflexive Involuntary Mental Motions)

The Costly Coordination Mechanism of Common Knowledge

Jacob Falkovich

Seeing the Smoke

Epistemic Legibility

Daniel Kokotajlo

Taboo "Outside View"

Gears vs Behavior

Noticing Frame Differences

[DEACTIVATED] Duncan Sabien

Reality-Revealing and Reality-Masking Puzzles

Eliezer Yudkowsky

ProjectLawful.com: Eliezer's latest story, past 1M words

Eliezer Yudkowsky

Self-Integrity and the Drowning Child

Jacob Falkovich

The Treacherous Path to Rationality

Scott Garrabrant

Tyranny of the Epistemic Majority

Most Prisoner's Dilemmas are Stag Hunts; Most Stag Hunts are Schelling Problems

Being a Robust Agent

Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Reason isn't magic

Integrity and accountability are core parts of rationality

The Schelling Choice is "Rabbit", not "Stag"

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Propagating Facts into Aesthetics

Simulacrum 3 As Stag-Hunt Strategy

Catching the Spark

Jacob Falkovich

Is Rationalist Self-Improvement Real?

Excerpts from a larger discussion about simulacra

Simulacra Levels and their Interactions

Radical Probabilism

sarahconstantin

Naming the Nameless

Comment reply: my low-quality thoughts on why CFAR didn't get farther with a "real/efficacious art of rationality"

Rationalism before the Sequences

The Rationalists of the 1950s (and before) also called themselves “Rationalists”

+

Optimization

sarahconstantin

The Pavlov Strategy

Coordination as a Scarce Resource

What should you change in response to an "emergency"? And AI risk

Prediction Markets: When Do They Work?

Being the (Pareto) Best in the World

Is Success the Enemy of Freedom? (Full)

How factories were made safe

HoldenKarnofsky

All Possible Views About Humanity's Future Are Wild

Why has nuclear power been a flop?

Simple Rules of Law

Power Buys You Distance From The Crime

Eliezer Yudkowsky

Is Clickbait Destroying Our General Intelligence?

Scott Alexander

The Tails Coming Apart As Metaphor For Life

Asymmetric Justice

Nuclear war is unlikely to cause human extinction

Moloch Hasn’t Won

Motive Ambiguity

Can crimes be discussed literally?

The Real Rules Have No Exceptions

Lars Doucet's Georgism series on Astral Codex Ten

When Money Is Abundant, Knowledge Is The Real Wealth

HoldenKarnofsky

This Can't Go On

Scott Alexander

Studies On Slack

Working With Monsters

Why haven't we celebrated any major achievements lately?

The Credit Assignment Problem

Inadequate Equilibria vs. Governance of the Commons

The Amish, and Strategic Norms around Technology

Discontinuous progress in history: an update

Scott Alexander

Rule Thinkers In, Not Out

A voting theory primer for rationalists

HoldenKarnofsky

Nonprofit Boards are Weird

Beyond Astronomical Waste

+

World

The Redaction Machine

On the Loss and Preservation of Knowledge

Introduction to abstract entropy

Swiss Political System: More than You ever Wanted to Know (I.)

Interfaces as a Scarce Resource

Transportation as a Constraint

There’s no such thing as a tree (phylogenetically)

Scott Alexander

Is Science Slowing Down?

Anti-social Punishment

Research: Rescuers during the Holocaust

Toni Kurz and the Insanity of Climbing Mountains

Book Review: Design Principles of Biological Circuits

Literature Review: Distributed Teams

The Intelligent Social Web

Unconscious Economics

Spaghetti Towers

Historical mathematicians exhibit a birth order effect too

What Money Cannot Buy

Scott Alexander

Book Review: The Secret Of Our Success

Specializing in Problems We Don't Understand

Why did everything take so long?

[Answer] Why wasn't science invented in China?

Scott Alexander

Mental Mountains

My attempt to explain Looking, insight meditation, and enlightenment in non-mysterious terms

Evolution of Modularity

Science in a High-Dimensional World

How uniform is the neocortex?

Building up to an Internal Family Systems model

My computational framework for the brain

Counter-theses on Sleep

What makes people intellectually active?

Birth order effect found in Nobel Laureates in Physics

Elephant seal 2

Anti-Aging: State of the Art

Steelmanning Divination

Book summary: Unlocking the Emotional Brain

+

Practical

Pain is not the unit of Effort

Staring into the abyss as a core life skill

Rest Days vs Recovery Days

Notes from "Don't Shoot the Dog"

Luck based medicine: my resentful story of becoming a medical miracle

How To Write Quickly While Maintaining Epistemic Rigor

[DEACTIVATED] Duncan Sabien

Ruling Out Everything Else

Paper-Reading for Gears

Forum participation as a research strategy

Butterfly Ideas

Eliezer Yudkowsky

Your Cheerful Price

To listen well, get curious

HoldenKarnofsky

Useful Vices for Wicked Problems

The Curse Of The Counterfactual

Leaky Delegation: You are not a Commodity

Losing the root for the tree

The Onion Test for Personal and Institutional Honesty

“PR” is corrosive; “reputation” is not.

You Get About Five Words

HoldenKarnofsky

Learning By Writing

Noticing the Taste of Lotus

Do you fear the rock or the hard place?

Slack Has Positive Externalities For Groups

Limerence Messes Up Your Rationality Real Bad, Yo

Cryonics signup guide #1: Overview

microCOVID.org: A tool to estimate COVID risk from common activities

The Loudest Alarm Is Probably False

"Can you keep this confidential? How do you know?"

[DEACTIVATED] Duncan Sabien

+

AI Strategy

Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover

Daniel Kokotajlo

Cortés, Pizarro, and Afonso as Precedents for Takeover

Daniel Kokotajlo

The date of AI Takeover is not the day the AI takes over

paulfchristiano

What failure looks like

Daniel Kokotajlo

What 2026 looks like

It Looks Like You're Trying To Take Over The World

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

paulfchristiano

Another (outer) alignment failure story

Draft report on AI timelines

Eliezer Yudkowsky

Biology-Inspired AGI Timelines: The Trick That Never Works

HoldenKarnofsky

Reply to Eliezer on Biological Anchors

AGI safety from first principles: Introduction

Daniel Kokotajlo

Fun with +12 OOMs of Compute

AI Safety "Success Stories"

Counterarguments to the basic AI x-risk case

Reframing Superintelligence: Comprehensive AI Services as General Intelligence

What an actually pessimistic containment strategy looks like

Eliezer Yudkowsky

MIRI announces new "Death With Dignity" strategy

Chris Olah’s views on AGI safety

Comments on Carlsmith's “Is power-seeking AI an existential risk?”

The Parable of Predict-O-Matic

Let’s think about slowing down AI

human psycholinguists: a critical appraisal

larger language models may disappoint you [or, an eternally unfinished draft]

Daniel Kokotajlo

Against GDP as a metric for timelines and takeoff speeds

paulfchristiano

Arguments about fast takeoff

Eliezer Yudkowsky

Six Dimensions of Operational Adequacy in AGI Projects

+

Technical AI Safety

Some AI research areas and their relevance to existential safety

EfficientZero: How It Works

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

Decision theory does not imply that we get to have nice things

Reward is not the optimization target

Worlds Where Iterative Design Fails

Specification gaming examples in AI

Inner Alignment: Explain like I'm 12 Edition

An overview of 11 proposals for building safe advanced AI

Alignment By Default

How To Go From Interpretability To Alignment: Just Retarget The Search

Search versus design

Selection vs Control

The Solomonoff Prior is Malign

paulfchristiano

My research methodology

Eliezer Yudkowsky

The Rocket Alignment Problem

Eliezer Yudkowsky

AGI Ruin: A List of Lethalities

A central AI alignment problem: capabilities generalization, and the sharp left turn

Reframing Impact

Scott Garrabrant

Robustness to Scale

paulfchristiano

Inaccessible information

Seeking Power is Often Convergently Instrumental in MDPs

On how various plans miss the hard bits of the alignment challenge

Alignment Research Field Guide

paulfchristiano

The strategy-stealing assumption

Optimality is the tiger, and agents are its teeth

Models Don't "Get Reward"

The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables

Language models seem to be much better than humans at next-token prediction

An Untrollable Mathematician Illustrated

An Orthodox Case Against Utility Functions

Selection Theorems: A Program For Understanding Agents

Coherence arguments do not entail goal-directed behavior

The ground of optimization

paulfchristiano

Where I agree and disagree with Eliezer

Eliezer Yudkowsky

Ngo and Yudkowsky on alignment difficulty

Embedded Agents

Risks from Learned Optimization: Introduction

chinchilla's wild implications

Why Agent Foundations? An Overly Abstract Explanation

Paul's research agenda FAQ

Eliezer Yudkowsky

Coherent decisions imply consistent utilities

paulfchristiano

Open question: are minimal circuits daemon-free?

Gradient hacking

Causal Scrubbing: a method for rigorously testing interpretability hypotheses [Redwood Research]

Humans provide an untapped wealth of evidence about alignment

A Mechanistic Interpretability Analysis of Grokking

How "Discovering Latent Knowledge in Language Models Without Supervision" Fits Into a Broader Alignment Scheme

Understanding “Deep Double Descent”

The shard theory of human values

Inner and outer alignment decompose one hard problem into two extremely hard problems

Eliezer Yudkowsky

Challenges to Christiano’s capability amplification proposal

Scott Garrabrant

Finite Factored Sets

paulfchristiano

ARC's first technical report: Eliciting Latent Knowledge

Introduction To The Infra-Bayesianism Sequence