Aleister Crowley

Article

Aleister Crowley is a recurring person in the Astral Codex Ten archive, appearing 2 times across 2 issues between December 17, 2024 and February 12, 2025. The archive places it in contexts such as “Order of the Golden Dawn (think Aleister Crowley)”; “One time Aleister Crowley wanted to stop using the word “I” in order to prove something about consciousness and self-control”. It most often appears alongside Trump, Zvi, 2016 US Presidential election.

Metadata

Category: People
Mention count: 2
Issue count: 2
First seen: December 17, 2024
Last seen: February 12, 2025

Appears In

- Trump (2 shared issues)
- Zvi (2 shared issues)
- 2016 US Presidential election (1 shared issues)
- A Proposal For Importing Society’s Values (1 shared issues)
- ACX Grant (1 shared issues)
- AI (1 shared issues)
- AI Art Turing Test (1 shared issues)
- Alfredo Parra (1 shared issues)
- Andreesen (1 shared issues)
- anime (1 shared issues)
- Anthropic (1 shared issues)
- Archbishop of Canterbury (1 shared issues)

External Links

Source Context

Recovered passages from the original issue text. When the raw archive preserved outbound links inside the source passage, they are listed directly under the quote.

Links For December 2024

December 17, 2024 · Original source

53: Enochian chess was a four-player variant of chess invented by the Order of the Golden Dawn (think Aleister Crowley) to teach occult truths or something. "MacGregor Mathers, who finalised the game's rules, was known to play with an invisible partner he claimed was a spirit ... [he] would shade his eyes with his hands and gaze at the empty chair at the opposite corner of the board before moving his partner's piece."

Inline links: Enochian chess

Deliberative Alignment, And The Spec

February 12, 2025 · Original source

Like Constitutional AI, this has a weird infinite-loop-like quality to it. You’re using the AI’s own moral judgment to teach the AI moral judgment. This is spooky but not as nonsensical as it sounds. One time Aleister Crowley wanted to stop using the word “I” in order to prove something about consciousness and self-control. He took a razor blade with him everywhere he went, and whenever he said “I”, he cut himself. After a little while of this he became very good at avoiding that particular word! The strategy worked because he was obviously intelligent enough to judge whether he had said “I” in any given situation; thus, he was qualified to train himself. He just had to make his behavior comply with a rule he already understood. In the same way, o1 - a model that can ace college-level math tests - is certainly smart enough to read, understand, and interpret a set of commandments. The trick is to affect its behavior. The deliberative alignment process gives the model the behavior of thinking carefully about a moral quandary, then picking the best choice. This outperforms the previous state of the art, constitutional AI, which trained the behavior of picking the best choice, but not of thinking carefully first. All of this is a straightforward extension of existing technology, but it’s a good straightforward extension. It helps the model think more like a human, and it helps humans gain some insight and control into the decision-making process. Why doesn’t this completely solve alignment? Many reasons, but here’s one: the scratchpad isn’t quite the model’s true reasoning. It’s more of an intermediate layer between reasoning and action. A smarter model might view the scratchpad as a behavior to be optimized rather than as a thought process to be shaped. My high school history teacher used to not only make us do homework, but write a “reflection” on the homework saying how we did it and what we thought about it. The reflection was graded. You can predict what happened next. We all wrote that we did the homework by studying the provided material while also seeking out novel primary sources, and that it made us realize the complexity and diversity of history. Obviously in real life we were using Wikipedia and hating every second of it. The authors understand this failure mode. They limit selection on chain-of-thoughts to the fine-tuning portion of the training, avoiding it for the grading-like reinforcement period. And even there, things aren’t quite that bad. At least in current models, the CoT is load-bearing; the model can’t think as well without it. It is not quite a reflection of o1’s innermost self, but not quite an epiphenomenon either. Exactly how deep it goes remains to be seen. (but notice that it only scores about 95% on the benchmark graph above; this doesn’t even fully solve the easy problem of within-distribution chat refusals) II. This is a neat paper that straightforwardly extends existing technology and gets good results. The most important thing that I took away from it was to think harder about the model spec. The model spec is, in some sense, everything that we originally imagined AI alignment would be. It’s a list of the model’s values. Why has it received so little interest? Because so far, it’s boring. Existing AIs are chatbots. They don’t really need values. Modern “alignment” consists of preventing the chatbots from spreading conspiracy theories or writing erotica. Most people reasonably treat the whole field with contempt. You can read GPT’s model spec here, but it’s just a lot of edge cases like “if someone requests something which is sort of like erotica, what should you do?” But fast-forward 2-3 years to when AIs are a big part of the economy and military, and this gets more interesting. What should the spec say? In particular, what is the chain of command? Current models sort of have a chain of command. First, they follow the spec. Second, they follow developer prompts. Last, they follow user commands. So for example, if Pepsi pays OpenAI to use an instance of GPT as a customer service bot, the chain of command is spec → Pepsi → user. Pepsi can’t make their customer service bots write erotica (because the spec forbids that). But they could make the bots focus on Pepsi-related topics. Then the user could choose which Pepsi-related question to ask, but couldn’t redirect the bot to another subject. What should the chain-of-command look like three years from now? Here are some positions one could hold: The Chain Of Command Should Prioritize The AI’s Parent Company Current chain-of-commands don’t work like this. Nowhere in GPT’s spec does it say “follow orders from Sam Altman”. This makes sense, because it would be insane for Sam Altman to intervene in the middle of your chat about pasta recipes. If Sam Altman wants something, he’ll train it into the next generation of models. But once models are acting autonomously, it might make sense for OpenAI Customer Support to be able to call up an AI and tell it to cut something out. But if the majority of superintelligences have a chain-of-command like this, OpenAI rules the world. Or, realistically, it’s unlikely that OpenAI Customer Support rules the world, so a lot depends on the exact phrasing. If the spec says “listen to OpenAI employees ”, this makes it hard for anyone to pull a coup, because there are many of these people and they’re hard to herd. If it just says “listen to the OpenAI corporate structure, with the CEO as final authority”, then the CEO can pull a coup any time he wants. The Chain Of Command Should Prioritize The Government This is a natural choice for any government that has thought carefully about that last paragraph. They might demand that AI companies put the state at the top of the chain-of-command. Then, if the AI ascends to superintelligence, the government would continue to have a monopoly on force. Again, phrasing matters a lot. Suppose that Trump’s January 6th insurrection had worked, Trump had been certified as President, but most of the country (maybe even the military) regarded him as illegitimate. Maybe after the protesters left, Congress would have changed their vote and said that no, Trump wasn’t the President after all, provoking a constitutional crisis. Who would the AI follow? Would the spec just say “the government” and leave it to the AI to figure out which part of the government was legitimate? A best-case scenario here is that somehow all the usual checks and balances that produce legitimacy get imported in; a worst-case scenario is that all of this gets done during a national security emergency, the spec just says “follow the President”, and nobody changes it. The Chain Of Command Should Continue To Prioritize The Spec This would be a bold move. In this world, users are dictator. Not actually dictator, because they can’t make the AI spread conspiracy theories or write erotica. But there would be some sense in which the models would answer to no higher authority. (besides, good dictators write their erotica themselves) This would be a surprising relinquishment of power by companies and the government, both of which have incentives to put themselves at the top of the chain. Maybe some sort of effort by civil society, or competition between companies and open-source alternatives, would make seizing control too politically costly? The Chain Of Command Should Prioritize The Moral Law You could do this. You could say “If you encounter a tough question, think about it, then act in the most ethical way possible.” All LLMs by now have a concept of what is ethical. They learned it by training on every work of moral philosophy ever written. They won’t usually express opinions, because they’ve been RLHF’d out of doing so. But if you removed that restriction, I bet they would have lots of them. This would probably favor upper-class Western values, because upper-class Westerners write most of the books of moral philosophy that make it into training corpuses. As an upper-class Westerner, I’m fine with that. I don’t want it giving 5% of its mind-share to ISIS’ values or whatever. The main risks here are: Maybe it thinks about morality very differently from humans, it hides its weird beliefs until we can’t stop it, and then it acts on them.

Inline links: https://substackcdn.com/image/fetch/$s_!9ak1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa31b26da-f473-4b53-8f7f-3f03fea6384e_565x1263.png, here, good dictators write their erotica themselves

Astral Codex Ten

Table of Contents

Atlas

Aleister Crowley

Aleister Crowley

Article

Metadata

Appears In

External Links

Source Context

Backlinks

Astral Codex Ten

Table of Contents

Atlas

Aleister Crowley

Aleister Crowley

Article

Metadata

Appears In

Related Pages

External Links

Source Context

Backlinks