Posts

29 · My scorched-earth policy on New Year’s resolutions

24 · Choosing battles (on the Internet)

14 · Book Review: Denial of Death

Comments

How do you know you are right when debating? Calculate your AmIRight score.

PatrickDFarley · 12h · 1 · 0

These things are indeed correlated with being right, but aren't you risking Goodharting? What does it really mean to "be right" about things? If you're native to LessWrong you'll probably answer something like, "to accurately anticipate future sensory experiences". Isn't that all you need? Find an opportunity for you and your friend to predict measurably different futures, then see who wins. All the rest is distraction.

And if you predict all the same things, then you have no real disagreement, just semantic differences.
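
One way to make "see who wins" concrete is sketched below. It is only an illustration under stated assumptions: both people write down probabilities for the same yes/no questions, and the Brier score (mean squared error, lower is better) is the scoring rule. The comment above does not name a scoring rule, so that choice is an assumption, and the function names `brier_score` and `who_wins` are hypothetical.

```python
def brier_score(forecasts, outcomes):
    """Mean squared error between probabilities and 0/1 outcomes (lower is better)."""
    if not forecasts or len(forecasts) != len(outcomes):
        raise ValueError("need one outcome per forecast, and at least one forecast")
    return sum((p - o) ** 2 for p, o in zip(forecasts, outcomes)) / len(forecasts)


def who_wins(mine, theirs, outcomes, tol=1e-9):
    """Compare two forecasters on the same questions (hypothetical helper)."""
    if len(mine) != len(theirs):
        raise ValueError("both forecasters must answer the same questions")
    # Identical predictions everywhere: nothing measurable separates the two views.
    if all(abs(a - b) <= tol for a, b in zip(mine, theirs)):
        return "no real disagreement"
    my_score = brier_score(mine, outcomes)
    their_score = brier_score(theirs, outcomes)
    if abs(my_score - their_score) <= tol:
        return "tie"
    return "me" if my_score < their_score else "friend"


# Two questions; the first resolved yes (1), the second resolved no (0).
print(who_wins([0.9, 0.2], [0.6, 0.5], [1, 0]))
```

With only a handful of questions the result is noisy, so a sketch like this is more useful over many resolved predictions than over one.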

Understanding new terms via etymology

PatrickDFarley · 1y · 1 · 0

Fun to do with names. Patrick: the English version of the Latin name Patricius, which means "noble" and refers to the Roman nobility, originally composed of the patres familias, the heads of large families. Patricius comes from pater (father), which is Latin but goes back to Proto-Indo-European, from the Proto-Indo-European root pah, which means "to protect/shepherd".

Simulacra Levels Summary

PatrickDFarley · 1y · 4 · 0

I really like that last bit about chronological cycles of increasing S-level to "win against" the current level, until physical reality smacks us in the face and we reset. Let me try something:

(Physically) hard times create S1 men; S1 men create (physically) good times.
(Physically) good times create S2 men (because there's free alpha in manipulating S1); S2 men create (socially) hard times (because now you don't know whom to trust about S1 issues).
(Socially) hard times create S3 men (because tribalism builds/confirms social trust); S3 men create (socially) good times (you have a whole tribe or church or culture war faction that you trust).
(Socially) good times create S4 men (because there's free alpha in manipulating S3); S4 men create (physically) hard times (because they're disconnected from physical reality).

A thought experiment

Answer by PatrickDFarley · Dec 10, 2022 · 1 · 0

I'm gonna be lazy and say:

"If it comes up tails, you get nothing."

If that ^ is a given premise in this hypothetical, then we know for certain it is not a simulation (because in a simulation, after tails, you'd get something). Therefore the probability of receiving a lollipop here is 0 (unless you receive one for a completely unrelated reason).

Jailbreaking ChatGPT on Release Day

PatrickDFarley · 2y · 5 · -8

The next step will be to write a shell app that takes your prompt, gets the GPT response, and uses GPT to check whether the response was a "graceful refusal". If so, it embeds your original prompt into one of these loophole formats and tries again, repeating until it gets a response that is not a graceful refusal, which it then returns to you. So the user experience is a bot with no content filters.

EY is right: these safety features are trivial.