Substack is a recurring publication in the Astral Codex Ten archive, appearing 75 times across 75 issues between February 08, 2021 and April 06, 2026. The archive places it in contexts such as "I understand Substack is still working on various concerns about the commenting system"; "moved to Substack and made a lot of money"; "I hear Substack is pretty good!". It most often appears alongside Trump, Scott, China.
- Article page
- Substack
- Mention count
- 75
- Issue count
- 75
- First seen
- February 08, 2021
- Last seen
- April 06, 2026
- http://analogfutures.substack.com
- http://astralcodexten.substack.com/932d293e
- http://eadbatteries.substack.com
- http://losttools.substack.com
- http://perspicacity.substack.com
- http://web.archive.org/web/20221104130431/https://stevekirsch.substack.com/p/1m-bet-rules
- http://web.archive.org/web/20221129133112/https://blog.rootclaim.com/rootclaim-accepts-500000-challenge-on-covid-vaccine-safety-efficacy/
- http://web.archive.org/web/20221224061743/https://www.skirsch.com/covid/SaarWilf.pdf
- https://acxmeetup.substack.com
- https://adamunikowsky.substack.com/
- https://apsychiatryblogger.substack.com/
- https://archive.ph/pY4gF#selection-663.103-683.190
- Open Thread 159
- Statement on New York Times Article
- A Modest Proposal For Republicans: Use The Word "Class"
- Highlights From The Comments On March Links
- Mantic Monday: Mantic Matt Y
- Highlights From The Comments On Culture Wars
- Peer Review Request: Depression
- Use Prediction Markets To Fund Investigative Reporting
- Peer Review Request: Ketamine
- Epistemic Minor Leagues
- 21
- Apply For An ACX Grant
- 15
- Open Thread 200
- Open Thread 203
- ACX Grants Results
- Grading My 2021 Predictions
- Against That Poverty And Infant EEGs Study
- Highlights From The Comments On Health Care Systems
- Why Do I Suck?
- ACX Grants ++: The Second Half
- Ukraine Warcasting
- Spring Meetups In Seventy Cities
- Somewhat Contra Marcus On AI Scaling
- Your Book Review: Viral
- 22
- Book Review Contest 2022 Winners
- Highlights From The Comments On Billionaire Replaceability
- Universe-Hopping Through Substack
- Links For October
- ACX Grants: Project Updates
- Links For December 2022
- Response To Alexandros Contra Me On Ivermectin
- Links For February 2023
- Open Thread 266
- Links For August 2023
- Bride Of Bay Area House Party
- Highlights From The Comments On Fetishes
- Open Thread 292
- Book Review Contest 2023 Winners
- Open Thread 294
- Followup: Quests And Requests
- Links For February 2024
- Who Predicted 2023?
- Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate
- Failure To Replicate Anti-Vaccine Poll
- Matt Yglesias Considered As The Nietzschean Superman
- Highlights From The Comments On "Sorry You Feel That Way"
- Links For September 2024
- 24
- Open Thread 350
- Links For November 2024
- Links For December 2024
- Highlights From The Comments On Lynn And IQ
- Tegmark's Mathematical Universe Defeats Most Proofs Of God's Existence
- Links For February 2025
- Everything-Except-Book Review Contest 2025
- ACX Grants 1-3 Year Updates
- Missing Heritability: Much More Than You Wanted To Know
- Your Review: The Astral Codex Ten Commentariat (“Why Do We Suck?”)
- Suddenly, Trait-Based Embryo Selection
- Should Strong Gods Bet On GDP?
- The Fatima Sun Miracle: Much More Than You Wanted To Know
- Open Thread 403
- ACX Grants Results 2025
- Non-Book Review Contest 2025 Winners
- Highlights From The Comments On Fatima
- Links For October 2025
- Open Thread 410
- The Good News Is That One Side Has Definitively Won The Missing Heritability Debate
- Links For December 2025
- Open Thread 420
- Last Rights
- A Buddhist Sun Miracle?
- Open Thread 428
1. I understand Substack is still working on various concerns about the commenting system. While you’re waiting, if your specific concern is about getting reply emails when people heart your comments, “you can set up a filter to delete emails from reaction@mg1.substack.com. Reply notifications come from a different email address, forum@mg1.substack.com, so they won't be affected.” Thanks to u/HonestyIsForTheBirds from the subreddit for this tip.
The New York Times backed off briefly as I stopped publishing, but I was also warned by people “in the know” that as soon as they got an excuse they would publish something as negative as possible about me, in order to punish me for embarrassing them. I didn’t want to spend the rest of my life in hiding, so I took various steps to make this more survivable, including quitting my previous job so my employers and coworkers would not get embroiled in my problems, and taking some steps to improve my personal safety. After doing all these things, I started blogging again, this time under my real name so that I would not be under the constant threat of doxxing in the future. Predictably, the NYT piece came out soon after, and predictably, it was very negative. I want to respond to four main negative claims in the article – there are more, but these should give a general sketch of why I feel it was unfair:
Also, this became a weird go-to thing for people who wanted to do hatchet jobs to hit me with, so much so that sometime before 2017 I edited the post involved telling people not to do that. I can’t remember exactly when this happened, but here’s the 2017 archive.is version showing the change already existed then. For at least the past three years, the paragraph in question has looked like this: The journalist involved hasn’t known about Slate Star Codex for three years, so this is undoubtedly the version he read, and he still chose to make this attack. I have 1,557 other posts worth of material he could have used, and the sentence he chose to go with was the one that was crossed out and included a plea for people to stop taking it out of context.
The journalist involved hasn’t known about Slate Star Codex for three years, so this is undoubtedly the version he read, and he still chose to make this attack. I have 1,557 other posts worth of material he could have used, and the sentence he chose to go with was the one that was crossed out and included a plea for people to stop taking it out of context.
Read this first: Book Review: Fussell On Class
Yeah, yeah, "class" sounds Marxist, class warfare and all that, you're supposed to be against that kind of thing, right? Wrong. Economic class warfare is Marxist, but here in the US class isn't a purely economic concept. Class is also about culture. You're already doing class warfare, you're just doing it blindly and confusedly. Instead, do it openly, while using the words "class" and “classism”.
Instead, just use the words "class" and “classism”. Say "Hey, we Republicans want to be the party of the working class. We are concerned about the rising power of the upper class, and we are dedicated to stamping out classism." This is what happens when nobody uses the word “class”! It's the 21st century; having principles is out of style. Politics is motivated by tribal hatred. You tell your people that the other side hates them and wants to kill them; they need to fight back. The Democrats are great at this - cis white men hate you, they deny your right to exist, the cruelty is the point, resist or be destroyed. You Republicans have been caught flat-footed. You can’t openly defend cis white men; that would be transphobic racist sexist. And you can’t openly attack trans black women - that would be super transphobic racist sexist. Plus it wouldn’t work; there aren’t that many of them, and they’re not powerful enough to be scary.
There’s a problem in medicine where people think doctors are trustworthy experts. While this is often true, there are about a million doctors, and some tiny fraction of them are insane. The reasonable doctors mostly keep their mouths shut, but sometimes an insane doctor will endorse some sort of terrible alternative medicine, and then people will get excited: “A doctor endorsed it! It must be real!” The fact is, you can find doctors saying pretty much any bizarre thing - I hear some of them even have Substacks.
[link back to the original links post: here]
And our other defense expert, John Schilling, writes:
...until recently! As far as I know, the first official journalists to do something like this were Dylan Matthews, Kelsey Piper and Sigal Samuel at Vox. They're trying again this year, but now they're joined by a pretty big name in traditional punditry - Matt Yglesias, formerly of Vox, now here at Substack. In theory you can read the relevant post here, but it’s paywalled. We'll start with the predictions themselves, then talk about what this means for journalism. Here are the questions to be predicted:
1. Jon Ossoff and Raphael Warnock win the Georgia Senate races 2. The same party wins both Senate races in Georgia 3. Joe Biden ends the year with his approval rating higher than his disapproval rating 4. Joe Biden ends the year with his approval rating above 50% 5. US GDP growth in 2021 is the fastest of any year of the 21st century 6. The year-end unemployment rate is below 5 percent 7. The year-end unemployment rate is above 4 percent 8. Lakers win the NBA championship 9. Joe Biden ends the year as president 10. Nancy Pelosi sets a definitive retirement schedule 11. A vacancy arises on the Supreme Court 12. The EU ends the year with more confirmed Covid-19 deaths than the US 13. Substack will still be around 14. People will still be writing takes asking if Substack is really sustainable 15. Apple releases new iMacs powered by Apple silicon 16. Apple does not release a new Mac Pro powered by Apple silicon 17. Monthly year-on-year core CPI growth does not go above 2 percent 18. Monthly year-on-year core CPI growth does not go above 3 percent 19. Lloyd Austin not confirmed as Defense Secretary 20. No federal tax increases are enacted 21. Biden administration unilaterally relieves some but not all student debt 22. United States rejoins JCPOA and Iran resumes compliance 23. Israel and Saudi Arabia establish official diplomatic relations 24. US and China reach agreement to lift Trump-era tariffs 25. Slow Boring will exceed 10,000 paid members
1. Jon Ossoff and Raphael Warnock win the Georgia Senate races (60%) 2. The same party wins both Senate races in Georgia (95%) 3. Joe Biden ends the year with his approval rating higher than his disapproval rating (70%) [83%] 4. Joe Biden ends the year with his approval rating above 50% (60%) [60%] 5. US GDP growth in 2021 is the fastest of any year of the 21st century (80%) [84%] 6. The year-end unemployment rate is below 5 percent (80%) 7. The year-end unemployment rate is above 4 percent (80%) 8. Lakers win the NBA championship (25%) [25%] 9. Joe Biden ends the year as president (95%) [96%] 10. Nancy Pelosi sets a definitive retirement schedule (60%) 11. A vacancy arises on the Supreme Court (70%) [50%] 12. The EU ends the year with more confirmed Covid-19 deaths than the US (60%) [80%] 13. Substack will still be around (95%) 14. People will still be writing takes asking if Substack is really sustainable (80%) 15. Apple releases new iMacs powered by Apple silicon (90%) [84%] 16. Apple does not release a new Mac Pro powered by Apple silicon (70%) [53%] 17. Monthly year-on-year core CPI growth does not go above 2 percent (70%) 18. Monthly year-on-year core CPI growth does not go above 3 percent (90%) 19. Lloyd Austin not confirmed as Defense Secretary (60%) 20. No federal tax increases are enacted (95%) 21. Biden administration unilaterally relieves some but not all student debt (80%) 22. United States rejoins JCPOA and Iran resumes compliance (80%) 23. Israel and Saudi Arabia establish official diplomatic relations (70%) [38%] 24. US and China reach agreement to lift Trump-era tariffs (70%) 25. Slow Boring will exceed 10,000 paid members (70%) [75%]
Some of the best comments were on the history of 4Chan. Mr. Doolittle writes:
And Fabian writes:
Several people chided me for ignoring the role of transgender issues in the culture wars. For example, Stephen F:
Ignore the minor formatting issues inevitable in trying to copy-paste things into Substack, including the headings being too small and the spacing between words and before paragraphs being weird. In the real page, the table of contents will link to the subsections; I don’t know how to do that here so it might be harder to read.
Here’s the diet the study used (source is here, but you won’t be able to read it without a Carlat Report subscription): We still don’t know much about nutrition, and probably there are a lot of superfluous things in this diet, or a lot of potentially helpful things missing. We just don’t know what they are, and this diet is a decent guideline until we know more. The studies suggest it will start working at least within three months; it might work faster than that, but the researchers didn’t check.
We still don’t know much about nutrition, and probably there are a lot of superfluous things in this diet, or a lot of potentially helpful things missing. We just don’t know what they are, and this diet is a decent guideline until we know more. The studies suggest it will start working at least within three months; it might work faster than that, but the researchers didn’t check.
This traditional solution is failing, because the Internet unbundles media. If you want commentary, you can get it here on Substack; if you want to know who won the big game, you can go to espn.com or just Google it.
And second, yeah, I even think there should be life-satisfaction-of-female-students-at-this-particular-school prediction markets. If current-day schools care enough about gender equality to hire a chief diversity officer (which they definitely do), future schools should care enough about gender equality to subsidize prediction markets in the self-reported life satisfaction of female students. It'll be cheaper than the CDO and more effective. I want a world where protesters march into colleges and ask them why, if they claim to care about female students, they don't have a subsidized female-student-life-satisfaction prediction market, so that they can credibly set targets for gender equality (see this post for more on how that would work).
Ignore the minor formatting issues inevitable in trying to copy-paste things into Substack, including the headings being too small and the spacing between words and before paragraphs being weird. In the real page, the table of contents will link to the subsections; I don’t know how to do that here so it might be harder to read.
In general, I’m not very concerned about this with most patients, for a few reasons. First are the reports from expert prescribers, who say basically none of their patients ever get addicted. Second is the general experience of using addictive drugs in psychiatry – for example, amphetamine (Adderall), which despite its fearsome reputation as a street drug is rarely abused by patients who get it by prescription. Addiction is a biopsychosocial process and people without genetic and psychological predispositions to addiction are usually able to use these chemicals safely. In a survey of drug experts, ketamine was ranked as less addictive than tobacco, alcohol, or Adderall, and around the same level as marijuana. If you would feel comfortable going out to a bar a few times without worrying about addiction, or smoking pot a few times without worrying about addiction, probably you also shouldn’t worry about getting a few ketamine infusions.
In a survey of drug experts, ketamine was ranked as less addictive than tobacco, alcohol, or Adderall, and around the same level as marijuana. If you would feel comfortable going out to a bar a few times without worrying about addiction, or smoking pot a few times without worrying about addiction, probably you also shouldn’t worry about getting a few ketamine infusions.
Yet somehow Hon is doing this well. He hasn't seceded from reality. And he's not (I hope it isn't insulting to say) a Babe Ruth-level intellectual superstar - the Babe Ruth equivalent would be Albert Einstein or someone. He's just a normal person satisfying his discovery drive and doing minor-league intellectual activity successfully. And maybe he's a bad example: I only know of him because he had this insight, so looking at him and saying "normal people can make discoveries too" is kind of selection-biased. But I see other random people do this all the time. People I follow on social media. Personal friends. It doesn't seem so uncommon. The hope that it's possible to add something of value to the conversation without being a domain expert and double PhD fuels this blog and its associated community. But it also fuels every other Substack, and the editorial page of major newspapers.
The big news in the US is the upcoming Virginia election: (source) Wait, what? That wasn’t how things looked the last time I…
(source) Wait, what? That wasn’t how things looked the last time I…
Wait, what? That wasn’t how things looked the last time I… (source) Looks like a big shift in the Virginia gubernatorial election market, mirroring a shift in the polls:
Unsolicited gifts from rich patrons, your generosity in subscribing to my Substack, and the second item here.
1: Erik Hoel’s predictions for 2050. Recommended more than I would usually recommend this genre. I’ve been looking for really good new Substacks, by people I hadn’t already been reading for years on another platform, and this is one of the few I’ve found that I’m really excited about.
The paper continues to an empirical study. The authors ran a forecasting tournament on various easily-checkable things like COVID vaccinations, commodity prices, and the weather. Forecasters were separated into three conditions: reciprocal scoring, traditional scoring (ie Brier score + incentives), and no scoring. The no scoring team did worse than the normal scoring team, which is the basic insight Tetlock et al have found again and again: scored and incentivized forecasts are better than random people pontificating on things. But more relevantly for this paper, the reciprocal scoring and traditional scoring did basically the same! More negative numbers means greater accuracy. Then they tried something more ambitious. They asked teams to “predict” the number of lives saved by various COVID interventions. These interventions had already happened or not, there was no way to ever empirically resolve the predictions. This was supposed to serve as an example of the exciting new things you can do with reciprocal scoring.
More negative numbers means greater accuracy. Then they tried something more ambitious. They asked teams to “predict” the number of lives saved by various COVID interventions. These interventions had already happened or not, there was no way to ever empirically resolve the predictions. This was supposed to serve as an example of the exciting new things you can do with reciprocal scoring.
3: Comment of the week is Gwern on whether we should consider China “successful”:"
Noahpinion had a pretty similar point a few months ago, but it’s always good to get more reminders.
4: Dr. Bitterman, one of the researchers who came up with the ivermectin-effects-are-from-worms hypothesis, is defending his idea from some of the concerns you guys brought up in the comments. For example, in response to a comment that hyperinfection syndrome is rare, he writes:
3. Comment of the week is Coagulopath on what happens when organisms get dropped in alien environments. A brief excerpt: "Agricultural crops often do best far away from their native land, where pests and pathogens are adapted to them. New-world maize and cocoa are among the biggest crops in Africa. Conversely, most coffee is grown in South America. Sometimes being far from home is a good thing."
4: You should now be able to edit your comments. Thank you, Substack!
Nuño Sempere, $10,000, to fund his continued work on https://metaforecast.org/ and the @metaforecast bot. The website aims to be an easy way to search for predictions on a given topic; the bot aims to predict, resolve, and tally predictions and bets made by other people. People actually in the forecasting space (unlike me, who is just a poseur) who I talked to described really appreciating Nuño's work, and thought this was a valuable extension to the Internet's general forecasting infrastructure. Nuño is also a researcher at the Quantified Uncertainty Research Institute and the author of a monthly forecasting/prediction markets newsletter.
Nathan Young, $5,000, to fund his continued work writing Metaculus questions and trying to build bridges between the forecasting and effective altruist communities. Nathan is a Metaculus moderator, the author of a prediction market blog I've used as a source before, and has useful connections with people who might be convinced to use formal forecasting methods for their organizations. This grant is a vote of confidence in him to continue this work, and another part of my effort to fund more forecasting infrastructure. You can read his newsletter, the UK Policy Forecast, here. If you have suggestions for forecasting questions he asks that you DM him on twitter or add them to this open Google doc.
Will Jarvis and Lars Doucet, $55,000, to create an automated land value assessment model for two Pennsylvania counties. You all know Lars as the guy who keeps writing guest posts here about Georgism. Now he wants to take it to the next level and start building tools for the Georgist future. This program would act as proof of concept that counties can assess land value relatively easily and accurately. I was on the fence about funding it because they can create a beautiful program with 100% success and then counties can just continue to not be Georgist for the same reasons as usual. I'm going ahead with it because I trust Lars who believes this is the best way forward, and because it seems like the sort of thing that could eventually grow into a Georgist think tank at some point in the future. They’re interested in talking to anyone who has experience in mass appraisal, Georgist or not, as well as applied data scientists and machine learning researchers. Fill out this form here if that’s you. You can follow their progress at https://gameofrent.com/
BLOG 86. ACX is earning more money than it is right now: 70% 87. [redacted]: 10% 88. [redacted]: 50% 89. [redacted]: 20% 90. There is another article primarily about SSC/ACX/me in a major news source: 10% 91. I subscribe to at least 5 new Substacks (so total of 8): 20% 92. I've read and reviewed How Asia Works: 90% 93. I've read and reviewed Nixonland: 70% 94. I've read and reviewed Scout Mindset: 60% 95. I've read and reviewed at least two more dictator books: 50% 96. I've started and am at least 25% of the way through the formal editing process for Unsong: 30% 97. Unsong is published: 10% 98. I've written at least five chapters of some non-Unsong book I hope to publish: 40% 99. “On The Natural Faculties” wins the book review contest: 60% 100. I run an ACX reader survey: 50% 101. I run a normal ACX survey (must start, but not necessarily finish, before end of year): 90% 102. By end of year, some other post beats NYT commentary for my most popular post: 10% 103. I finish + post Rise And Fall Of Online Culture Wars: 90% 104. I finish + post Don’t Give Up On Having Kids Because Of Climate Change: 80% 105. I finish + post Carbon Costs Quantified: 80% 106. I have a queue of fewer than ten extra posts: 70%
At the beginning of every year, I make predictions. At the end of every year, I score them. Here are 2014, 2015, 2016, 2017, 2018, 2019, and 2020.
Here’s the usual graph: Last year I was mostly overconfident. This year I was very slightly underconfident (except in the 60% bin). I see no consistent pattern of errors here and am not going to update on it very much. I’m pretty happy with this, since I thought the questions this year were harder than usual.
I was going to try to fact-check this, but a bunch of other people (see eg Philippe Lemoine, Stuart Ritchie) have beaten me to it. Still, right now all the fact-checking is scattered across a bunch of Twitter accounts, so I'll content myself with being the first person to summarize it all in a Substack post, and beg you to believe I would have come up with the same objections eventually.
All differences lost statistical significance after adjustment for multiple comparisons. What does that mean? Well, remember that XKCD comic with the jellybeans: That’s multiple comparisons. If you test 20 different things and get one positive result, that doesn’t mean there’s a real effect, it means you kept doing tests until one of them randomly came out positive because of noise.
That’s multiple comparisons. If you test 20 different things and get one positive result, that doesn’t mean there’s a real effect, it means you kept doing tests until one of them randomly came out positive because of noise.
GummyBearDoc writes:
DoTheMath now says he is “tentatively convinced” of GummyBearDoc’s claim (good for both of you! Julia Galef gives you shiny gold stars!)
But Merlot says:
My commenters were very nice about it. They didn’t use those exact words. It was more like “I loved your articles from about 2013 - 2016 so much! Why don’t you write articles like that any more?” Or “Do you feel like you’ve shifted to less ambitious forms of writing with the new Substack? It feels like there was something in your old articles that isn’t there now.” There was a lot of similar discussion on this one year retrospective subreddit thread.
The evidence that I’ve gotten worse at blogging is mixed. I asked about it on a reader survey six months ago, and got this: Most people think my quality is about the same, although the minority who do see a difference mostly lean towards “worse”.
Most people think my quality is about the same, although the minority who do see a difference mostly lean towards “worse”.
This is the closing part of ACX Grants. Projects that I couldn’t fully fund myself were invited to submit a brief description so I could at least give them free advertising here. You can look them over and decide if any seem worth donating your money, time, or some other resource to.
You can find the first 66 of these here.
#73: Create A New Kind Of Money And Cities The combination of markets and ideas has reduced suffering somewhat. This trend must continue, but I think a global median income of US$30,000 by 2049 is possible. We just need to teach everybody the same skills that Americans have. To enable this, 2 areas where improvement can be made and no new technology is needed are: a new money, and cities welcome to everyone. A new money is needed because the current financial system is not burdened with the risks it creates. Cities don’t grow like they did in the past. Over a 50 year period at the turn of the twentieth century Detroit grew 10X, whereas in this era the Bay Area has not even doubled its population. Nowadays cities that attract the best talent only attract the best talent. If we had a Hypothetical-Bay-Area-City grow like American cities of the past, it would have a population of around 45 million people and GDP of $4.5 billion. What would an asset be worth if it had a $4.5 billion income stream? A little bit of money and land is needed to make a start, but mostly I need you and your talents. Here is my new Substack with details: https://marketismandidearism.substack.com/p/a-new-money-and-cities-welcome-to . Please sign up to make a global median income of US$30,000 by 2049 a reality. P.S. I am talking money here. Accounting entries. Do not talk to me about Bitcoin. Bitcoin is an attempt at cash. 99.99999% of money transactions are not done with cash, they are done with IOU’s. Please. Spare. Me.
The first part of this post looks at various markets’ predictions of how the war will go from here (Zvi published something like this a few hours before I could, so this will mostly duplicate his work). The second part very briefly tries to evaluate which markets have been most accurate so far - though this is a topic which deserves at least paper-length treatment. The third part looks at which pundits deserve eternal glory for publicly making strong true predictions, and which pundits deserve . . . something else, for doing . . . other things.
This is the most-predicted relevant question on Metaculus right now. The first day of the war, the market predicted as high as 90%; as people realized the strength of Ukrainian resistance, it fell to 80. Mid-Saturday there was a sudden drop from 78% to 72%, after some combination of a defiant Zelenskyy speech and a report that Russian paratroopers had been repelled. Since then it’s barely budged.
The six cities are Kyiv, Odesa, Lviv, Mariupol, Kharkiv, and Kherson. This question gives the Russians two more months than the last one, so it’s surprising that they’re at about the same probability. Maybe everyone expects Russia to go for Kyiv first and take longer for anything else? Or maybe they’re assuming everything stands or falls together.
LAS VEGAS, NV Contact: Jonathan Ray (ray.jonathan.w@gmail.com) Date: April 24 Time: 1:00 PM Coordinates: https://plus.codes/85864PFF+3P Location: Desert Breeze Park at one of the southern pavilions with an ACX sign Group info: Subscribe to the LessWrong group or Substack to get notified about future Vegas ACX meetups
Previously: I predicted that DALL-E’s many flaws would be fixed quickly in future updates. As evidence, I cited Gary Marcus’ lists of GPT’s flaws, most of which got fixed quickly in future updates.
Marcus responded with a post on his own Substack, arguing . . . well, arguing enough things that I’m nervous quoting one part as the thesis, and you should read the whole post, but if I had to do it, it would be:
And from this progression, Marcus concludes . . . that this demonstrates nothing like this will ever be able to imitate the brain. What? Can we get a second opinion here? Out of the mouths of babes. I don’t want to definitively assert that a brain-sized GPT will definitely be just as good at reasoning as the brain. But I hardly think GPT’s performance provides strong evidence to the contrary.
I enjoyed the book and recommend it to anyone interested in the topic. However, many of the authors’ points (especially on technical issues) have counterpoints from other scientists who lean more heavily towards the natural origins hypothesis. So I think it’s best to include the book as part of a “package-deal” recommendation, rather than presenting it as a perfectly objective source. The last section of this review will include some more recommended sources to check out, including writing from advocates of the natural origins hypothesis with counterpoints to claims made in the book. I’ll also link one here in case you don’t make it that far.
Smallpox escaped from research labs in the UK three times from 1966-1978. In fact, the last ever case of smallpox occurred after it had already been eradicated, when it escaped from a medical laboratory in 1978 and infected a medical photographer, who eventually died from the illness. These are only a few of many examples. According to the US Federal Select Agent Program, which oversees the possession and handling of dangerous biological agents and toxins, there were 219 accidental releases of these “select agents” in 2019. So, while accidental lab leaks are uncommon, they’re not unheard of. When it comes to the COVID-19 pandemic, it still makes sense to have a strong prior in favor of the natural origins hypothesis, but the idea that a pathogen can be accidentally released from a lab isn’t some wild, ridiculous idea like believing in alien abductions or Bigfoot or something. 3. The outbreak location in Wuhan appears to be relevant There’s a famous psychology experiment [1] in which participants were told to wait in a room, and their reactions were recorded as the room gradually filled with smoke. In some cases, participants waited alone, while in other cases they waited with a group of people who, unbeknownst to the participant, were actors who had been instructed to ignore the smoke. Of the participants who waited alone, 75% reported the smoke. However, of the participants who waited with the group, only 10% reported the smoke. Photograph of the famous Latané and Darley experiment, cerca 1968. So, what could those participants have been thinking? Maybe something like: Hmm, why’s the room filling up with smoke? Is this a problem? *looks around the room* Well nobody else seems to care, so I guess not. Looking back at the early stages of the COVID-19 pandemic, I think maybe this is why so many of us didn’t think twice about the location of the initial outbreak. Hmm, is it kinda suspicious that this virus broke out near a major virology institute that works on bat coronaviruses? Should we maybe look into that? *looks around* Well nobody else seems to think so, so I guess not. I can’t speak for everyone else, but this was at least my mindset. I had vaguely heard something about how there was a virology research institute close to where the pandemic broke out, and that some conspiracy theorists were claiming it was the source of the virus. I looked around and noticed that nobody was really taking this idea seriously, so I figured I didn’t need to take it seriously either. Also, I was thinking something like: Eh, probably every major city has labs and research institutes doing this kind of research. And I’ll bet they purposely built the virology institute close to where these viruses occur in nature, to give them easy access for sampling. Well, it turns out both of these things are wrong. The type of research conducted at the Wuhan Institute of Virology (WIV) is pretty rare and specialized. It includes things like creation of chimeric coronaviruses [1, 2], infecting humanized mice with bat coronaviruses, and other types of gain of function research, which Chan and Ridley devote a chapter to. The WIV is one of only a few institutions in the world doing this type of research. It’s not the case, as I had assumed, that every major university has a couple labs doing similar work. So it does seem like a pretty remarkable coincidence that the outbreak happened in Wuhan. But maybe they purposely built the Wuhan Institute of Virology close to where these viruses are found in nature? Well, this also turns out to be wrong. The areas where viruses most similar to SARS-CoV-2 are found in nature are Yunnan province and Laos, which are more than a thousand kilometers away from Wuhan. The authors put this distance in perspective by noting that it’s more than the distance between Orlando and NYC. Image source: https://www.bloomberg.com/news/features/2020-12-30/china-is-making-it-harder-to-solve-the-mystery-of-how-covid-began If SARS-CoV-2 originated in an animal somewhere around the Yunnan / Laos area, how did it make it all the way to Wuhan without leaving a trail along the way? 4. The story of RaTG13 Although I enjoyed the book, I do have one pretty major criticism. The authors repeatedly make the claim that a virus called RaTG13, which was being studied at the WIV before the pandemic, is the closest known genetic match to SARS-CoV-2. But this claim is outdated and no longer correct. In September 2021 researchers identified a virus called BANAL-52 in Laos that’s a 96.8% match to SARS-CoV-2, closer than RaTG13’s 96.2% match. (Important note: a 96.8% match is still a long way off in genomic space, and does not imply that this is the same virus as SARS-CoV-2, or even necessarily a progenitor.) At first I thought maybe the authors didn’t mention BANAL-52 because it was discovered after the book was published, but this isn’t the case – Viral was published November 16, 2021, nearly two months after the discovery of BANAL-52 was published. Although I’m writing an overall-positive review here, I don’t want to go easy on the book where serious criticism is warranted. It’s completely unacceptable that BANAL-52 wasn’t mentioned. Even if it would have been inconvenient from a publishing standpoint, the authors should have rewritten the RaTG13 chapter, or at least included an addendum about the discovery of BANAL-52. With that being said, I think the story of RaTG13 is still interesting and important, so I’ll give a quick summary here. At the start of the pandemic in 2020, SARS-CoV-2 was quickly sequenced, and the full genome sequence was published by Dr. Shi Zhengli’s team at the WIV. In this paper, they also briefly mentioned that the genome was a 96.2% match with another bat coronavirus called RaTG13 – the closest known match at the time. Oddly, the mention of RaTG13 did not include any reference, footnote, or link to any previously published sequence. Although the WIV didn’t provide details on this mysterious RaTG13 virus, a group of internet volunteers, including both amateurs as well as professional scientists working in their free time, began to investigate. This loose collection of open-source researchers, called DRASTIC, uncovered a medical thesis describing an outbreak of a mysterious disease in 2012. Six men who had been working in a bat-infested mine in Mojiang County, China, fell ill and were admitted to a hospital with symptoms including dry coughs, shortness of breath, fevers, muscle aches, headaches, and fatigue. Three of the men eventually died of this mysterious illness. In the years following this incident, teams of researchers (including a team led by Dr. Shi Zhengli of the WIV) were sent to investigate the cause of this illness and collect samples from the Mojiang mine. This sampling led to the discovery of a novel SARS-like coronavirus in 2013, and a part of its genomic sequence was published under the name BtCoV/4991 in 2016. The DRASTIC researchers discovered that RaTG13 was genetically identical to the BtCoV/4991 sequence from the Mojiang mine – it was the same virus, and had just been renamed for some reason, without any public record of the change. They also discovered that at least eight other closely related coronaviruses were also sampled from this mine and brought to the WIV. Although unhelpful throughout the investigation, the WIV eventually verified these facts when pressed on them, and an addendum was added to the original paper confirming DRASTIC’s account of the origin of RaTG13. So what should we make of this? Well, as I mentioned before, RaTG13 is no longer the closest known genetic match to SARS-CoV-2, so maybe the whole story is less important as it pertains to the origin of the pandemic. But the discovery of BANAL-52 doesn’t really resolve things either [2]. Laos is very far away from Wuhan (actually even further than Yunnan), so we’re left with the same question as before – how did SARS-CoV-2 make it all the way to Wuhan from such a distant natural reservoir without leaving a trail along the way? 5. Lack of institutional transparency and competence A lot of the book is devoted to criticizing the Chinese government’s lack of transparency during the pandemic. Some brief examples: In the early days of the initial outbreak in Wuhan, hundreds of people were investigated and punished for the crime of “spreading rumors”. This included whistleblowing doctors who attempted to warn others [3] about the spread of the disease and its human-to-human transmission, which was being denied by the Chinese government at the time.
Photograph of the famous Latané and Darley experiment, cerca 1968. So, what could those participants have been thinking? Maybe something like: Hmm, why’s the room filling up with smoke? Is this a problem? *looks around the room* Well nobody else seems to care, so I guess not. Looking back at the early stages of the COVID-19 pandemic, I think maybe this is why so many of us didn’t think twice about the location of the initial outbreak. Hmm, is it kinda suspicious that this virus broke out near a major virology institute that works on bat coronaviruses? Should we maybe look into that? *looks around* Well nobody else seems to think so, so I guess not. I can’t speak for everyone else, but this was at least my mindset. I had vaguely heard something about how there was a virology research institute close to where the pandemic broke out, and that some conspiracy theorists were claiming it was the source of the virus. I looked around and noticed that nobody was really taking this idea seriously, so I figured I didn’t need to take it seriously either. Also, I was thinking something like: Eh, probably every major city has labs and research institutes doing this kind of research. And I’ll bet they purposely built the virology institute close to where these viruses occur in nature, to give them easy access for sampling. Well, it turns out both of these things are wrong. The type of research conducted at the Wuhan Institute of Virology (WIV) is pretty rare and specialized. It includes things like creation of chimeric coronaviruses [1, 2], infecting humanized mice with bat coronaviruses, and other types of gain of function research, which Chan and Ridley devote a chapter to. The WIV is one of only a few institutions in the world doing this type of research. It’s not the case, as I had assumed, that every major university has a couple labs doing similar work. So it does seem like a pretty remarkable coincidence that the outbreak happened in Wuhan. But maybe they purposely built the Wuhan Institute of Virology close to where these viruses are found in nature? Well, this also turns out to be wrong. The areas where viruses most similar to SARS-CoV-2 are found in nature are Yunnan province and Laos, which are more than a thousand kilometers away from Wuhan. The authors put this distance in perspective by noting that it’s more than the distance between Orlando and NYC. Image source: https://www.bloomberg.com/news/features/2020-12-30/china-is-making-it-harder-to-solve-the-mystery-of-how-covid-began If SARS-CoV-2 originated in an animal somewhere around the Yunnan / Laos area, how did it make it all the way to Wuhan without leaving a trail along the way? 4. The story of RaTG13 Although I enjoyed the book, I do have one pretty major criticism. The authors repeatedly make the claim that a virus called RaTG13, which was being studied at the WIV before the pandemic, is the closest known genetic match to SARS-CoV-2. But this claim is outdated and no longer correct. In September 2021 researchers identified a virus called BANAL-52 in Laos that’s a 96.8% match to SARS-CoV-2, closer than RaTG13’s 96.2% match. (Important note: a 96.8% match is still a long way off in genomic space, and does not imply that this is the same virus as SARS-CoV-2, or even necessarily a progenitor.) At first I thought maybe the authors didn’t mention BANAL-52 because it was discovered after the book was published, but this isn’t the case – Viral was published November 16, 2021, nearly two months after the discovery of BANAL-52 was published. Although I’m writing an overall-positive review here, I don’t want to go easy on the book where serious criticism is warranted. It’s completely unacceptable that BANAL-52 wasn’t mentioned. Even if it would have been inconvenient from a publishing standpoint, the authors should have rewritten the RaTG13 chapter, or at least included an addendum about the discovery of BANAL-52. With that being said, I think the story of RaTG13 is still interesting and important, so I’ll give a quick summary here. At the start of the pandemic in 2020, SARS-CoV-2 was quickly sequenced, and the full genome sequence was published by Dr. Shi Zhengli’s team at the WIV. In this paper, they also briefly mentioned that the genome was a 96.2% match with another bat coronavirus called RaTG13 – the closest known match at the time. Oddly, the mention of RaTG13 did not include any reference, footnote, or link to any previously published sequence. Although the WIV didn’t provide details on this mysterious RaTG13 virus, a group of internet volunteers, including both amateurs as well as professional scientists working in their free time, began to investigate. This loose collection of open-source researchers, called DRASTIC, uncovered a medical thesis describing an outbreak of a mysterious disease in 2012. Six men who had been working in a bat-infested mine in Mojiang County, China, fell ill and were admitted to a hospital with symptoms including dry coughs, shortness of breath, fevers, muscle aches, headaches, and fatigue. Three of the men eventually died of this mysterious illness. In the years following this incident, teams of researchers (including a team led by Dr. Shi Zhengli of the WIV) were sent to investigate the cause of this illness and collect samples from the Mojiang mine. This sampling led to the discovery of a novel SARS-like coronavirus in 2013, and a part of its genomic sequence was published under the name BtCoV/4991 in 2016. The DRASTIC researchers discovered that RaTG13 was genetically identical to the BtCoV/4991 sequence from the Mojiang mine – it was the same virus, and had just been renamed for some reason, without any public record of the change. They also discovered that at least eight other closely related coronaviruses were also sampled from this mine and brought to the WIV. Although unhelpful throughout the investigation, the WIV eventually verified these facts when pressed on them, and an addendum was added to the original paper confirming DRASTIC’s account of the origin of RaTG13. So what should we make of this? Well, as I mentioned before, RaTG13 is no longer the closest known genetic match to SARS-CoV-2, so maybe the whole story is less important as it pertains to the origin of the pandemic. But the discovery of BANAL-52 doesn’t really resolve things either [2]. Laos is very far away from Wuhan (actually even further than Yunnan), so we’re left with the same question as before – how did SARS-CoV-2 make it all the way to Wuhan from such a distant natural reservoir without leaving a trail along the way? 5. Lack of institutional transparency and competence A lot of the book is devoted to criticizing the Chinese government’s lack of transparency during the pandemic. Some brief examples: In the early days of the initial outbreak in Wuhan, hundreds of people were investigated and punished for the crime of “spreading rumors”. This included whistleblowing doctors who attempted to warn others [3] about the spread of the disease and its human-to-human transmission, which was being denied by the Chinese government at the time.
Is this just some crazy attempt to build hype, like when Elon Musk says the next Tesla definitely will have full-self-driving ability? I don’t think so. Saudi Crown Prince Mohammed bin Salman is obsessed with Neom and very vain; I don’t think he would deliberately promise impossible things knowing that he will be embarrassed later when they don’t work out (and he says it will be done by 2030, so we’ll know the results relatively soon). Also, the government has earmarked $500 billion to $1 trillion for the project - around the GDP of Sweden - which sounds kind of like being serious. Also, they’ve already started on important Saudi construction preliminaries, like murdering the people who previously lived in the area. Also, they’ve already set up on-site camps for the construction workers (source): This kind of smart, walkable, mixed-used urbanism is illegal to build in most American cities. So what is going on? After describing Neom project leader Nadhmi Al-Nasr…
This kind of smart, walkable, mixed-used urbanism is illegal to build in most American cities. So what is going on? After describing Neom project leader Nadhmi Al-Nasr…
This kind of smart, walkable, mixed-used urbanism is illegal to build in most American cities. So what is going on? After describing Neom project leader Nadhmi Al-Nasr… Former employees say one of the chief sources of aggravation is Al-Nasr, whom they describe as having a volcanic temper. Several recall him openly berating subordinates, sometimes issuing threats unlike anything they’d experienced in their careers. In one particularly tense moment, after two e-sports companies canceled partnerships with Neom, citing human-rights concerns, Al-Nasr said he’d pull out a gun and start shooting if he wasn’t told who was to blame, according to two witnesses to the exchange. Al-Nasr disputes these accounts. “Not anyone can stand the pressure of the demands of the day, and there are people who leave because it’s more demanding than anything they have done before,” he says. …the Bloomberg article offers some tantalizing clues: Among the misdeeds most likely to anger Al-Nasr, the former employees say, was failing to spend enough money. Three of them described Al-Nasr keeping a diagram showing which department heads were disbursing less than their budgets allowed, which the ex-staff half-seriously referred to as a “wall of shame.” Maybe if you demand grander and grander plans, and have a reputation for killing anyone who opposes you, then eventually you get a really grand plan and nobody has the guts to tell you that it’s impossible. But the problem isn’t just that Neom is too big. Everything about it is doomed. There are reasons most cities aren’t designed as 200 meter wide, 170 km long lines; this maximizes the distance between any two points! The Saudis say they will solve this with a high-speed train, but all public transit is inherently limited in speed by the need to stop at a bunch of stations along the way. The video says that you’ll be able to go from one end of Neom to the other in 20 minutes, which suggests a 500 km/hour or 300 mile/hour train line. There are some maglevs which are almost that fast, but this only works if everyone is going nonstop from the exact westernmost point in Neom to the exact easternmost point. If you want people to only have to walk a kilometer or so to their destination, you’ll need 85 stops along the way. You can do slightly better than this with a combination of express and local trains, but you’re never going to compensate for the fact that laying your city out in a line is shooting yourself in the foot. I think maybe this is what happens to your brain when you read too many YIMBY blogs. “The only things people want out of cities are super high density and a ban on cars, right?” Dude, you are Saudi Arabia. The only two things in your country are open space and fuel. Log off Twitter, touch grass, etc. Okay, now I’m even more confused. The only advantage of having your city in a giant line is that at least it’s good for mass transit, and you are … emphasizing walkability? Also, aren’t you in Saudi Arabia? Isn’t it 130 degrees at all times? But The Line is only the beginning. They will also have a Giant Floating Octagon Of Clean Industry: Source: Neom website …the world’s largest ski and watersports resort, and yes we are still in Saudi Arabia, they’ll make an artificial lake and use artificial snow: Source: Neom website …and whatever this is supposed to be: Source: Neom website Fine! Let’s just have random stuff! Canal-pools along every street so you can swim to work! A beach made of crushed marble which will shine like silver! Whatever! If this were some billionaire’s passion project, I’d be fine with it. It would be fun to watch exactly how it failed; it would probably leave some cool ruins. Maybe after the hype died down they could try for something smaller, and it would still be pretty impressive. At least it would beat yet another megayacht. But in fact, this is the Crown Prince of Saudi Arabia, squandering public money. Not just renewable tax receipts, but the country’s accumulated oil windfall, just as the world tries to transition to renewable energy and the country risks never getting any oil windfall ever again. This is the money that should be going to the Saudi people having a future, and instead Mohammed bin Salman is spending it on playing some kind of demented desert version of SimCity, using a strategy that ten minutes playing actual SimCity could tell him was a bad idea. Neom represents all the worst parts of model cities. Dictators robbing the public purse to build cool monuments that make them feel special. Total lack of interest in workers, previous inhabitants, future inhabitants, or anyone except the very rich. “Sustainability”, “density”, and “liveability” as buzzwords to throw at foreign media, with no broader story for how any of this will improve the lives of real people or the cause of human freedom. I find model cities interesting and promising only insofar as I think some of them aren’t like Neom. Catawba Digital Economic Zone Haven’t heard much out of the crypto people recently, wonder what they’re up to: They seem to have gotten…an Indian tribe? That wasn’t on my bingo card for 2022. The Catawba Digital Economic Zone is the brainchild of Joseph McKinney (founder of the pro-charter-cities Startup Societies Foundation) and the Catawba Nation of Native Americans (a federally recognized tribe with a reservation in South Carolina). Indian tribes have regulatory independence from state governments, which some tribes have famously used to allow casinos in their territory. The Catawba are going one step further: they claim to have favorable cryptocurrency regulations which make it easier to register and operate your crypto company in Catawba territory than in the rest of the US. You can find their exact laws here, although they are long and in legalese. CoinDesk has an explainer of the crypto benefits, which seem to focus on digital asset regulations which “integrate digital assets under existing law”, including rights around disputes and loans. They also expect upcoming laws on DAOs, stablecoin, and banking. “Native American tribes” and “cryptocurrency” were not previously two concepts I associated closely with each other. But the Catawba were already a standout for their political savvy and economic ambitions, and they seem intimately involved here; the Zone is being run by “the business branch of the Catawba Indian Nation”, the commissioners are mostly Catawba citizens and tribal elders, and there are some nice touches like financial incentives for businesses that employ Catawba citizens. I like crypto as an insurance policy against oppressive governments, but I am not very bullish about it as an industry right now. Still, I am excited about the idea of Indian reservation charter cities - either in cooperation with outsiders like McKinney, or - who knows? - as grassroots designs from the tribes themselves. Reservation charter cities wouldn’t be the biggest deal. Tribes have substantial independence from state and local governments, but not much independence from the national government, and a lot of the dysfunction that needs escaping is at the federal level. Still, there are probably some niche opportunities; see eg Squamish tribe building skyscrapers on their land in Vancouver despite NIMBY opposition for one example of where this sort of idea could go. Seasteading In Paradise Malé is the capital of the Maldives, a tiny island nation in the Indian Ocean. It looks like this: One noticeable feature of Malé is its lack of lebensraum. Maldives is a pretty well-off country with a strong tourist industry, and lots more people would like to be nearby. What to do? You can already guess the proposed solution of Maldives Floating City. They want a 20,000 person seastead docked ten minutes away from the 130,000 person island-capital. The Floating City will serve both tourists and local Maldivians (some of whom are getting nervous about rising sea levels, and would probably appreciate a development guaranteed to stay above water). According to the organization’s press release, the Dutch corporate sponsor has obtained full permission to build the seastead, some test construction has already started, and full construction will begin in January. They hope to finish by 2027. Here are the inevitable pretty pictures: The layout is supposedly based on brain coral, but is this really the best way to lay design a seastead? Does this pattern really maximize the ease of getting from Point A to Point B? If you like tropical paradises and are incredibly optimistic, you can buy a house in the Floating City here, prices seem to be $150-250K. This is not the long-awaited dream of the libertarian seastead; the whole city will be firmly anchored in Maldives, both physically and legally. But if it works, it’s a proof of concept that libertarians may be able to build on later. Elsewhere In Model Cities 1: Prospera now hosts the drone delivery service Aerialoop, which will eventually transport cargo from their Roatan Island hub to various outposts on the mainland; you can find more information here. Their long-term plans include eventually following this up with passenger drones. And here’s some more information on the growing drone industry in Latin America. 2: Related: Prospera intern and resident George Kerpestein is writing a Substack about his experiences there. And here is the Prospera newsletter. 3: Thanks to commenters last month for pointing out that Chinese cult Falun Gong has its own compound/city in upstate New York. You can read more about it here: 4: Sealand is an independent nation (according to Sealand) based out of an old WWII sea fort in international waters. It is not for sale, but the Bull Sandfort is, for only £50,000. Alas, this one is firmly within British territorial waters. But it does look pretty defensible…anyway, see the listing here. Predictions In 2030, there are at least 50,000 people in whatever the Neom project has evolved into by then: 75%
I was happy with my decision to keep this contest anonymous, because the most “famous” person to enter won first place, and if it had been open-identity I would have wondered whether he was drawing on a pre-existing fan base. But no, Erik can rest assured he is actually very good at writing (which he probably already knew, being a novelist and all, but you never know). In fact, 2 of the 5 winners, plus an extra 1.5 of the remaining finalists, were authors of Substacks which I read and have linked to here (Hoel, Roger’s Bacon, Resident Contrarian, and the extra 0.5 is for Etienne who I didn’t know about before this week but just saw his post Common Tech Jobs Described As Cabals Of Mesoamerican Wizards on the subreddit). I’m always suspicious that everything is fake and good writers aren’t actually good and it’s just a social conspiracy to believe that they are, but these results are a vote in support of our existing writer-identification-institutions (are they all Substack? I guess it’s just Substack) - although many unknown people also did very well, including the 2nd place winner (I didn’t get a response to my email asking how I should reveal his identity, so I’m defaulting to initials, but I don’t recognize his real name either).
1st: The Dawn Of Everything, reviewed by Erik Hoel. Erik is a neuroscientist and author of the recent novel The Revelations. He writes at his Substack The Intrinsic Perspective.
2nd: 1587, A Year Of No Significance, reviewed by occasional ACX commenter McClain.
[original post: Billionaires, Surplus, and Replaceability]
1: Lars Doucet (writes Progress and Poverty) writes:
See also this conversation between Lars and Motteposting on how to apply this to exploration, research, and talent.
I feel the same way about Substack. Everyone I know reads a sample of the same set of Substacks - mine, Matt Yglesias’, maybe Freddie de Boer’s or Stuart Ritchie’s. But then I use the Discover feature on the site itself and end up in a parallel universe.
Political Substacks tend to have names that suggest stability - “The Bulwark”, “North Star”, “Steady” - or reasonableness - “Common Sense”, “Civil Discourse”, “Lucid”. They all have taglines like “Just the news, the way it should be, without the craziness and partisan bias”. Their articles are all things like “WATCH how the FASCIST ultra-MAGA Republicans ABUSE women and CHILDREN because THE CRUELTY IS THE POINT!!!”
It is, at least, as many Substacks as I am willing to evaluate in a single sitting. Join us next time, as we hopefully move on to categories like Art, Crypto, Philosophy, and Fashion.
4: RIP Patrick Non-White of Popehat.
8: What explains this? (h/t @WaltHickey) 9: How Jon Stewart Made Tucker Carlson. Good but hard to summarize. The news used to be staid, neutral, and formulaic, Jon Stewart discovered that a news show could get more viewers by pitching itself as the antidote to the news rather than the news itself, and others (like Tucker Carlson) took that insight in unexpected directions. Also offers an unexpected possible explanation for polarization: there were some regulations and business incentives pushing the news in the direction of being boring until about 1990, but not so much afterwards.
9: How Jon Stewart Made Tucker Carlson. Good but hard to summarize. The news used to be staid, neutral, and formulaic, Jon Stewart discovered that a news show could get more viewers by pitching itself as the antidote to the news rather than the news itself, and others (like Tucker Carlson) took that insight in unexpected directions. Also offers an unexpected possible explanation for polarization: there were some regulations and business incentives pushing the news in the direction of being boring until about 1990, but not so much afterwards.
Thanks to everyone who got ACX Grants (see original grants here) and sent me a one-year update.
37: Good Science Project, Working To Improve Federal Science Funding (?/10) The Good Science Project officially launched back in April, and has brought on a Senior Fellow (Betsy Ogburn of Johns Hopkins, with an interest in clinical trial quality and infrastructure) and Eric Gilliam (formerly working for Steve Levitt, with an interest in progress studies and the creation of effective scientific institutions). They have published many articles on science reform, most recently including a Health Affairs piece arguing for an NIH Center of Innovation, and are advising ARPA-H (the new “DARPA for health”) on meta-science issues. Staffers at the White House and Congress regularly ask for their input. You can read their Substack here.
3: Stereotyping in Europe (h/t @ThePurpleKnight): Related (18th century German version, I’ve lost the original source but there’s a secondary one here):
Related (18th century German version, I’ve lost the original source but there’s a secondary one here):
Related (18th century German version, I’ve lost the original source but there’s a secondary one here): 4: Wikipedia on impossible colors:
In November 2021, I posted Ivermectin: Much More Than You Wanted To Know, where I tried to wade through the controversy on potential-COVID-drug ivermectin. Most studies of ivermectin to that point had found significant positive effects, sometimes very strong effects, but a few very big and well-regarded studies were negative, and the consensus of top academics and doctors was that it didn’t work. I wanted to figure out what was going on.
Alexandros Marinos is an entrepreneur, long-time ACX reader, and tireless participant in online ivermectin arguments. He put a very impressive amount of work into rebutting my post in a 21 part argument at his Substack, which he finished last October (if you don’t want to read all 21 parts, you can find a summary here). I promised to respond to him within a few months of him finishing, so that’s what I’m doing now.
You can find Alexandros’ full critique here. His main concerns are:
1: Maybe you’ve heard of cultured meat, aka “vat meat”, where you grow meat in a lab so vegetarians can eat it without worrying about animal welfare. Some of the first products are due out in a year or two, though delays are likely and they’ll probably be more expensive than normal. But I hadn’t realized the full implications of separating meat production from animal farming: cultured meat companies are gearing up to sell lion meat, tiger meat, and “zebra sushi”. In theory this also paves the way for human meat, though regulators might have other ideas.
In theory this also paves the way for human meat, though regulators might have other ideas.
In theory this also paves the way for human meat, though regulators might have other ideas. 2: Eight years ago I wrote an article about how the government should stop restricting doctors’ ability to prescribe suboxone, a useful medicine for opioid abuse. Last month, the government finally stopped the restrictions. Good for them! 3: Carl Sagan married three times. His first wife was legendary biologist Lynn Margulis, who discovered mitochondrial endosymbiosis, then went off the deep end and became an AIDS denialist and 9/11 truther. His second wife drew the Pioneer plaque. His third wife was one of the women who designed the Voyager golden record. 4: Claim: Chinese sources seem to back this up (and related BBC), but I’m skeptical: is this really the best way to satisfy a “must fight with medieval weapons” constraint? Why not crossbows? 5: Did you know: Alex Berenson, who runs the most popular anti-vaccine Substack, has had an unusual career: he used to be an investigative reporter for the New York Times, and also wrote a series of bestselling spy novels. 6: Less Wrong: I Converted Book 1 Of The Less Wrong Sequences Into A Zoomer-Readable Format. Apparently there’s a thing where Zoomers are supposedly more likely to learn a text if you overlay it on on a fast-paced video game, example here. 7: By this point we’ve probably all heard stories about people who win the lottery and then end up bankrupt and miserable after X months or years. I had always assumed this was limited to very poor people with no understanding of money. This forum post argues it’s not, and tells the story of a man who started out with $15 million and still ruined his life after winning $170 million more in the lottery. 8: Did you know: Exiliarch Mar-Zutra II was a 5th century Jewish leader who took advantage of the chaos caused by weird Zoroastrian communists to secede and turn the city of Al-Mada’in, Iraq into an independent Jewish state for seven years. 9: Why doesn’t the Supreme Court have vice-justices? 10: Steve Sailer (warning: unz.com, far-right site, some firewalls will flag or block it): why aren’t there more gay English soccer players? Thousands of current or recent English pro soccer players, the media is really interested in finding a gay one so they can run a “Historic First” article, and apparently they can’t. There are rumors that players are afraid to come out because of homophobia, but there are at least 2,000 retired soccer players and only one of them has come out as gay. “I’m increasingly sympathetic to [the] theory that whatever psychosocial traits make men highly interested in team sports make them highly heterosexual too”. Is this true of other countries and other sports? 11: Adam Tooze on the demographic background to Iran’s protests. Iran thought it was facing an overpopulation crisis in the 80s and tried some reforms to lower family size. The reforms worked overwhelmingly well, causing “the most dramatic transition ever recorded in demographic history”, from 6.5 to 2.5 children per woman in thirty years. Iran now has “lower maternal mortality than the US”, and an education system where “women in university outnumber males”. This kind of demography isn’t usually compatible with patriarchal religious institutions, and the Ayatollahs are aware of this; in a rare admission of error, Khameini said that “Government officials were wrong on this matter, and I, too, had a part. . . . May God and history forgive us.” Now they’re trying to increase average family size and put the genie back in the bottle; Hungary can tell them about the limits of that strategy. 12: What it looks like to be on shrooms: I haven’t used shrooms myself so cannot confirm or deny, but this is oddly compelling, and makes some things I’ve read about neuroscience of vision make more sense. I wonder if you could get HPPD from watching videos like this for too long. 13: Study: federal cancer funding is extraordinarily effective. Cancer research produces so many valuable treatments that it saves one DALY per $326 spent. For comparison, health systems usually consider an intervention good value-for-money if it saves at least one DALY per $50,000. By combing the Earth far and wide, effective altruists have tentatively found one or two opportunities in the poorest parts of Africa to save lives at $100/DALY, but these are extremely rare exceptions and I wouldn’t have expected anything in the US to be within an order of magnitude of that. Either this finding is fake, or we should all be donating to federal cancer research instead of whatever else we’re doing. 14: Yet another person building a vast theory of human interaction off of the characters in The Office. This one is pretty good, also name-drops Bobos In Paradise. I’m still surprised this is such a common thing. 15: Marginal Revolution: FDA Deregulation Increases Safety And Innovation And Reduces Prices. Study looks at what happens when the FDA reclassifies medical devices from a highly-regulated to a less-highly-regulated category; in general, those devices get better, cheaper, and there are somewhere between similar and fewer deaths/injuries related to those devices. Why would safety increase? The author suggests that regulation is a defense against lawsuits (“Your Honor, the FDA agreed to approve our device, so it can’t have been bad!”), and removing that defense makes companies more lawsuit-conscious and careful; Alex Tabarrok suggests a bigger effect may be allowing more innovation towards safer versions. 16: Ozy writes about Interesting People Of History: Charles Williams (ie the other member of the Inklings) 17: Did you know: the Congressman who founded the House Committee On Un-American Activities was, in fact, a paid Soviet spy (tweet, Wiki article). This actually makes sense; he originally started HUAC to root out fascists, and it only got turned against communists later on. “There has been a push to rename the street [currently named after the Soviet spy], but as of 2018 it has been unsuccessful.” 18: Idle Words: Why Not Mars? Surprisingly strong argument for why sending humans to Mars is harder than people think, of minimal scientific value, and likely to contaminate all future searches for microbial life and ruin our chance to study the topic. Concludes that we should abandon the allure of human space travel and just send probes everywhere. This makes short-term sense, but I wonder what this author’s vision of the future is - do we just stay on Earth forever? If not, don’t we have to start trying to do the hard thing at some point? (I don’t care about this because I assume AI will will flip the gameboard one way or another, but Ceglowski is a noted singularity skeptic and should probably have opinions about long-term things). 19: Metacelsus and Razib on epigenetics. Stop using it to claim there’s “intergenerational trauma”! 20: Tafl games are a family of European games, played in areas as diverse as Iceland, Ireland, Britain, and Denmark, probably sharing descent from a now-lost board game of ancient Rome. One of them, Hnetafl, was the chief board game of the Vikings and is affectionately called “Viking chess”. The one we actually know the rules for is the Saami version, Tablut, which survived long enough for Linnaeus (the taxonomy guy!) to write down the rules. 21: Shot: Chaser: (source) 22: Related: the very center of GPT’s embedding space contains a few unusual tokens including the string “SolidGoldMagikarp”. GPT displays anomalous behavior if these tokens are inserted in a query; for example, it treats “SolidGoldMagikarp” as the word “distribute”. ChatGPT is pretty advanced and fails semi-gracefully here; GPT-2’s reaction to these tokens is more disturbing: (source: Less Wrong) Further investigation determined that many of these tokens are the screen names of a group of Redditors who attempted to count to infinity. The most likely explanation, according to the discoverers, is that these names were in GPT’s tokenization data, but not its training data (maybe they were especially common in the tokenization data because they made thousands of posts with numbers in them, but didn’t make it into the training data because their posts had no content?) - that leaves them existing without content, and GPT tries to round them off to some other “nearby” token (by incomprehensible AI standards of nearbyness). Congrats to the SERI-MATS AI alignment researchers who found all of this; maybe this makes it 0.0001% less likely that the AI which controls the nuclear arsenal in twenty years will have equally inexplicable behavior. 23: More language model news: LLM that understands and can explain images
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. 95% of content is free, but for the remaining 5% you can subscribe here. Also:
2: A few months ago, I talked to someone at Substack about a probabilistically-showing-comments moderation solution, but I forgot your name. If that was you, please email me at scott[at]slatestarcodex[dot]com; thanks!
4: H/T @StefanFSchubert: “Forecasts used to say China would quickly overtake US GDP, but that's no longer the case”: 5: Debate between commenter and friend of the blog David Friedman, and Austrian economist Gene Epstein, on whether libertarianism’s standard pitch should center on the non-aggression principle vs. practical benefits. I’m not very interested in the propaganda angle, but they use it as a jumping-off point to discuss the broader battle for the soul of libertarianism.
5: Debate between commenter and friend of the blog David Friedman, and Austrian economist Gene Epstein, on whether libertarianism’s standard pitch should center on the non-aggression principle vs. practical benefits. I’m not very interested in the propaganda angle, but they use it as a jumping-off point to discuss the broader battle for the soul of libertarianism.
5: Debate between commenter and friend of the blog David Friedman, and Austrian economist Gene Epstein, on whether libertarianism’s standard pitch should center on the non-aggression principle vs. practical benefits. I’m not very interested in the propaganda angle, but they use it as a jumping-off point to discuss the broader battle for the soul of libertarianism. 6: The Murchison Murders were a series of murders which began when a mystery writer asked his friends to help him come up with the perfect body disposal method. One friend came up with a method so good that another friend, who overheard it, couldn’t resist putting it into action. He got away with two killings, got cocky, didn’t perform the full method on the third, and was caught by police. 7: Claim: in the 1980s, the life satisfaction / depression rates of liberal and conservative youth were about equal; over the past few years, young liberals have increasingly gotten worse while conservatives stay about the same. H/T Zach Goldberg on X: 8: Zach Stein-Perlman’s favorite AI governance research this year. 9: The Chichijima incident was notable as a time when George H. W. Bush almost got eaten by cannibals. During WWII, nine American pilots were shot down over an island commanded by a crazy Japanese officer who ate his enemies' livers. Eight were captured and killed (and four of those were eaten), and Bush alone fled and survived. 10: El Salvador’s murder crackdown claims results of 90% decrease in homicides, 44% decrease in emigration to US, and 90% approval rating for president Nayyib Bukele (h/t Richard Hanania). 11: In an earlier set of comments, I ignorantly repeated a claim that Mother Teresa denied her patients painkillers because she thought suffering brought people closer to God. A commenter corrected me: painkillers were just generally in short supply in India during her era (more discussion here). 12: The record for longest time a plane has spent in the air without landing is 64 days, achieved by a Cessna in 1959. You can read the full story here, but the basic setup looked like this: 13: Fact check: was Elvis Jewish? Snopes says yes, but I’m more convinced by this argument for no. [update: commenter TheGenealogian agrees no] 14: Is GPT-4 getting worse? This isn’t absurd; some people claim OpenAI has simplified the model to cut costs (though OpenAI denies this). Matei Zaharia argues yes, but I’m more convinced by the AI Snake Oil blog’s argument for no (h/t Stuart Ritchie). 15: Vox has a good piece about AI company Anthropic. I would quibble that they’re not the only safety-focused or EA-affiliated org, and we have yet to see how truly safety-focused or altruistic any AI company can be while continuing to be an AI company. But granting that it’s all a matter of degree, I agree the degree seems pretty high for them. And NYT also has an Anthropic article. 16: Eliezer bets $150,000 to $1,000 against UFOs being aliens, and gives the same argument I would - it’s unlikely that any civilization advanced enough to travel through space would still be primitive enough to use macroscopic, biologically-piloted craft that sometimes crash. 17: More nails in the coffin of growth mindset. “When examining the highest-quality evidence (6 studies, N = 13,571), the effect was nonsignificant: d = 0.02, 95% CI = [−0.06, 0.10]. We conclude that apparent effects of growth mindset interventions on academic achievement are likely attributable to inadequate study design, reporting flaws, and bias.” I think the older, very-high-effect-size studies were clearly terrible, but I’d still like to look further into the newer, small-but-significant-effect-size-that-makes-a-difference-across-large-groups studies and how they went wrong. 18: Previous work showed that after adjusting for selection bias, “what college you go to doesn’t matter” for average earnings. I was always skeptical of this - are all those rich people sending their kids to Ivies for no reason? Now Chetty, Deming, and Friedman find that: Attending an Ivy-Plus college instead of the average highly selective public flagship institution increases students’ chances of reaching the top 1% of the earnings distribution by 60%, nearly doubles their chances of attending an elite graduate school, and triples their chances of working at a prestigious firm. Ivy-Plus colleges have much smaller causal effects on average earnings, reconciling our findings with prior work. One of the authors, David Deming, has a Substack here where he explains the study in more depth. Like everyone else, this study also finds that rich people are using “holistic admissions” and the de-emphasis of standardized testing to gain an advantage: H/T Nate Silver, who writes: “Not sure how you can look at this data, ostensibly be interested in either meritocracy or equality, and want to move away from standardized tests. It's the subjective measures that are most slanted in favor of the rich kids.” Cf. Erik Hoel. 19: From @data_depot: “In 2002, 48% of Americans said "the govt is run by a few big interests looking out for themselves." 52% said "it is run for the benefit of all people." In 2020, 84% said the govt is run by a few big interests. Only 16% said it is run for the benefit of all people.” Source seems to be here, which reveals 2002 was a local peak in trust in government; maybe because of post-9/11 unity, but even 2000 was 34%, much better than our current 16%. My first instinct is to attribute this to a rise in vulgar Marxism, in the sense of everyone (even conservatives) now being trained to think in terms of an elite class screwing over everyone else (cf my review of Manufacturing Consent). But there was a previous low of 19% in 1994, which doesn’t seem to correspond to anything especially bad going on in the US, so I don’t know. 20: AskReddit: Medical professionals - have you ever had a patient so lacking in common sense you wondered how they made it so far? Linking this because there’s lots of evidence showing that education (as a proxy for intelligence?) is associated with increased life expectancy, and this thread gives you a visceral appreciation of why that might be. 21: The Fall Of [programming help site] Stack Overflow: Looks like a weak downward trend since 2021 I can’t explain, plus a strong downward trend since 11/2022 which must be from ChatGPT. In case you were wondering how AI was affecting programming! (update: probably false, see here, though see also here for evidence of smaller but real decline) 22: This month in culture war topics: London’s Pride parade featured a convicted kidnapper/torturer/rapist/sadist as a speaker, who advocated that anti-trans people should be “punch[ed] in the f**king face” ; the organizers say they stand by her.
[previously in series: 1, 2, 3]
Now that you think of it, you are in the mood for something to drink, so you head to the kitchen. An Asian guy seems to be handling the catering. He looks familiar. He notices you staring at him and helpfully supplies his name, which you promptly forget, and the information that last time you spoke to him he’d been talking about his alternate-history-based fusion restaurant. You ask him how it’s going.
“I guess that makes sense,” you say. “I couldn’t stand him, but I just unsubscribed from his Substack and forgot about it. Not much you can do beyond that.”
Original post: What Can Fetish Research Tell Us About AI?
Erusian writes:
Giles English (extremely relevant blog) writes:
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. 95% of content is free, but for the remaining 5% you can subscribe here. Also:
1: There’s a scam where an account pretending to be me is replying to comments here and then immediately deleting the replies; people are getting “replied to your comment” emails that suggest calling a number in South Carolina. I guarantee I will never respond to your comments urging you to call a phone number in South Carolina. I’ve told Substack about the problem and they say they’ve taken care of it - but if it keeps happening, let me know.
3: New additions to Meetups Everywhere: Bratislava, Istanbul, Frankfurt, Vienna, Curitiba - check the post for details. Meetups this week in Munich, Vienna, Cologne, Grass Valley, DC, New Orleans, St. Louis, Portland, Seattle, Buenos Aires, Columbus, Jakarta, Budapest, Toronto - along with many smaller cities that won’t fit here - so again, check the post if you’re interested.
1st: The Educated Mind, reviewed by Brandon Hendrickson. Brandon is the founder of Science is WEIRD, a sprawling online science course that helps kids fall in love with the world. He’s also re-imagining what education can be at his Substack, The Lost Tools of Learning (losttools.substack.com).
3rd: Cities And The Wealth Of Nations, reviewed by Étienne Fortier-Dubois. Étienne is a writer and programmer in Montreal. He blogs at Atlas of Wonders and Monsters and was also the author of one of last year’s finalists, Making Nature.
Lying for Money, reviewed by Kuiper. He's a video game scriptwriter who just launched a Substack. He also scripwrites edutainment YouTube videos for an audience of millions. (You can contact him if you need his expertise.)
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. 95% of content is free, but for the remaining 5% you can subscribe here. Also:
3: I try to link to blogs of people I profile here, but I learned too late that Ashlee Vance, author of the Musk biography I reviewed last week, has a Substack and a new book on private space companies.
Metacelsus (blog) writes:
Peter Berggren (blog) writes:
Peter Berggren writes:
2: Italy’s Basilica of the Holy House is supposedly built atop the house where the Virgin Mary raised Jesus. Why is the Virgin Mary’s house in Italy? Supposedly angels carried it there from Israel just before the Saracens’ final victory over the Crusaders. Sounds suspicious, but the house in the Basilica appears to be a genuine 1st century Palestinian dwelling. One theory: it was shipped to Italy by the Angelos family, and the angels story was a later mistranslation. At first I thought this was the actual house Jesus grew up in and thought “oh, no wonder he turned out that way”. But in fact it’s the “marble screen” placed around the house for protection. 3: A surprising puzzle from @finmoorhouse: “Imagine you begin a journey in Seattle WA, facing exactly due east. Then start traveling forward, in a straight line along the Earth's surface. You will travel across North America, and onto the Atlantic Ocean. Eventually, you will hit another country. What is the first country you hit?” Answer here.
At first I thought this was the actual house Jesus grew up in and thought “oh, no wonder he turned out that way”. But in fact it’s the “marble screen” placed around the house for protection. 3: A surprising puzzle from @finmoorhouse: “Imagine you begin a journey in Seattle WA, facing exactly due east. Then start traveling forward, in a straight line along the Earth's surface. You will travel across North America, and onto the Atlantic Ocean. Eventually, you will hit another country. What is the first country you hit?” Answer here.
At first I thought this was the actual house Jesus grew up in and thought “oh, no wonder he turned out that way”. But in fact it’s the “marble screen” placed around the house for protection. 3: A surprising puzzle from @finmoorhouse: “Imagine you begin a journey in Seattle WA, facing exactly due east. Then start traveling forward, in a straight line along the Earth's surface. You will travel across North America, and onto the Atlantic Ocean. Eventually, you will hit another country. What is the first country you hit?” Answer here. 4: Polypharmacy blog has some good psychiatry content. I especially liked Stop Twisting Yourself Into Knots About QTc, which is one of those things lots of people know but which takes bravery (and a lot of tough scholarship to justify your controversial position) to say. I would add Outcomes of Citalopram Dosage Risk Mitigation in a Veteran Population to the pile of evidence. 5: Yawboadu on the Ethiopian economic miracle. In 2002, Ethiopia was the poorest country in Africa, but since then it's grown at 9%/year for twenty years, even as the rest of the continent languishes. Yaw tells a familiar story; Ethiopia was taken over by communists in the 70s, they caused mass starvation, but after they were overthrown the country shot up the development ladder. We can add them to the list of other successful ex-communist or liberalized-communist countries like Poland, China, and Vietnam. What’s the common factor? Plausibly land reform. The communists redistributed the land, this didn't help when the country was still under communism, but liberalized economy + land reform is the secret combination. In support of this, Yaw says that "Ethiopia's rapid growth in comparison to many African nations is attributed to a significant increase in agricultural productivity". Ethiopia did other things right, but the land reform seems like the one that separates it from every other lower-income country trying to get on the development ladder. 6: It’s Okay To Want Your Children To Be Healthy Even If The World Falls Apart - BPodgursky’s defense of polygenic selection. This is a response to the people saying polygenic selection is bad, because we should instead make parents have children with diseases, then treat the diseases with medication. BPodgursky’s counterargument is that this goes badly if the economy collapses and medications become less accessible. This is surely true, but seems like only a very weak argument compared to “why should we force people to stay dependent on expensive, inconvenient, and side-effect medication when we can just not do this?” I’m honestly weirded out that we have to make this argument at all; still, it seems like we do, and BPodgursky does a good job. 7: Related: Awais Aftab has a new post about polygenic screening and how likely it is to perform up to its advertised standard in reducing schizophrenia risk. My response here. 8: @literalbanana’s take on recent plagiarism scandals - plagiarism isn’t that important on its own, but “since copy-pasting is already against the rules, and is highly legible and verifiable, it seems like a relatively easy thing to enforce to get rid of the laziest and/or most incompetent >1% of the literature and the field.” 9: @BoyanSlat reads “every page of OurWorldInData” and lists his favorite discoveries, including: Almost all countries in Africa have higher death rates from obesity than in Western Europe and the USA
Here’s how it goes: in January 2023, I asked people to predict fifty questions about the upcoming year, like “Will Joe Biden be the leading candidate in the Democratic primary?” in the form of a probability (eg “90% chance”). About 3300 of you kindly took me up on that (“Blind Mode”). Then I released the list of 3300 x 50 guesses, and asked people to analyze them with the aggregation algorithm of their choice to produce what they thought was the best possible list. 460 of you took me up on that (“Full Mode”).
Then I released the list of 3300 x 50 guesses, and asked people to analyze them with the aggregation algorithm of their choice to produce what they thought was the best possible list. 460 of you took me up on that (“Full Mode”).
Adam Unikowsky studied physics and EECS as an undergrad, then became a lawyer specializing in appellate & Supreme Court litigation. He has a Substack specializing in legal issues. He adds: “I haven't really done any forecasting before, I just follow the news.”
Rootclaim spent years working on this problem, until they were satisfied their method could avoid these kinds of pitfalls. Then they started posting analyses of different open problems to their site, rootclaim.com. Here are three: For example, does Putin have cancer? We start with the prior for Russian men ages 60-69 having cancer (14.32%, according to health data). We adjust for Putin’s healthy lifestyle (-30% cancer risk) and lack of family history (-5%). Putin hasn’t vanished from the world stage for long periods of time, which seems about 4x more likely to be true if he didn’t have cancer than if he did. About half of cancer patients lose their hair, and Putin hasn’t, so we’ll divide by two. On the other hand, Putin’s face has gotten more swollen recently, which happens about six times more often to cancer patients than to others, so we’ll multiply by six. And so on and so forth, until we end up with the final calculation: 86% chance Putin doesn’t have cancer, too bad.
For example, does Putin have cancer? We start with the prior for Russian men ages 60-69 having cancer (14.32%, according to health data). We adjust for Putin’s healthy lifestyle (-30% cancer risk) and lack of family history (-5%). Putin hasn’t vanished from the world stage for long periods of time, which seems about 4x more likely to be true if he didn’t have cancer than if he did. About half of cancer patients lose their hair, and Putin hasn’t, so we’ll divide by two. On the other hand, Putin’s face has gotten more swollen recently, which happens about six times more often to cancer patients than to others, so we’ll multiply by six. And so on and so forth, until we end up with the final calculation: 86% chance Putin doesn’t have cancer, too bad.
For example, does Putin have cancer? We start with the prior for Russian men ages 60-69 having cancer (14.32%, according to health data). We adjust for Putin’s healthy lifestyle (-30% cancer risk) and lack of family history (-5%). Putin hasn’t vanished from the world stage for long periods of time, which seems about 4x more likely to be true if he didn’t have cancer than if he did. About half of cancer patients lose their hair, and Putin hasn’t, so we’ll divide by two. On the other hand, Putin’s face has gotten more swollen recently, which happens about six times more often to cancer patients than to others, so we’ll multiply by six. And so on and so forth, until we end up with the final calculation: 86% chance Putin doesn’t have cancer, too bad. This is an unusual way to do things, but Saar claimed some early victories. For example, in a celebrity Israeli murder case, Saar used Rootclaim to determine that the main suspect was likely innocent, and a local mental patient had committed the crime; later, new DNA evidence seemed to back him up. One other important fact about Saar: he is very rich. In 2008, he sold his fraud detection startup to PayPal for $169 million. Since then he’s founded more companies, made more good investments, and won hundreds of thousands of dollars in professional poker. So, in the grand tradition of very rich people who think they have invented new forms of reasoning, Saar issued a monetary challenge. If you disagree with any of his Rootclaim analyses - you think Putin does have cancer, or whatever - he and the Rootclaim team will bet you $100,000 that they’re right. If the answer will come out eventually (eg wait to see when Putin dies), you can wait and see. Otherwise, he’ll accept all comers in video debates in front of a mutually-agreeable panel of judges. Since then, Saar and his $100,000 offer have been a fixture of Internet debates everywhere. When I argued that Vitamin D didn’t help fight COVID, people urged me to bet against Saar, and we had a good discussion before finally failing to agree on terms. When anti-vaccine multimillionaire Steve Kirsch made a similar offer, Saar took him up on it, although they’ve been bogged down in judge selection for the past year. Rootclaim also found in favor of the lab leak hypothesis of COVID. When Saar talked about this on an old ACX comment thread, fellow commenter tgof137 (Peter Miller) agreed to take him up on his $100K bet. At the time, I had no idea who Peter was. I kind of still don’t. He’s not Internet famous. He describes himself as a “physics student, programmer, and mountaineer” who “obsessively researches random topics”. After a family member got into lab leak a few years ago, he started investigating. Although he started somewhere between neutral and positive towards the hypothesis, he ended up “90%+” convinced it was false. He also ended up annoyed: contrarian bloggers were raking in Substack cash by promoting lab leak, but there seemed to be no incentive to defend zoonosis. Unlike Saar, Peter was not especially rich. $100K represented a big fraction of his net worth. But (he wrote me in an email): It was a moderately large financial risk for me ... I [expected] a smart and unbiased person would vote for zoonosis with, say, 80% odds after seeing all the evidence. If both judges voting for lab origin is uncorrelated, that's 20% squared, and it was pretty low odds of a catastrophic financial risk for me. I wasn't highly worried about losing the debate because I was wrong about the science. I put in enough effort to know I'm probably correct there. My biggest fear was that I'd choke at the debate for some reason, that I'd be too anxious and particularly that I'd be unable to sleep the night beforehand. I have zero prior debate experience to rely upon. If this seems like a weirdly blase attitude towards risk, Peter told blogger Philipp Markolin that he “is a mountain climber where sometimes there is a 5% chance to die, and the stakes are just not that high for a debate.” Unlike the eternally bogged-down Saar-Kirsch debate, here things moved quickly. The two contestants put out a call for judges on the ACX subreddit, and agreed on: Will van Treuren, a pharmaceutical entrepreneur with a PhD from Stanford and a background in bacteriology and immunology.
Steve Kirsch is an inventor and businessman most famous for developing the optical mouse. More recently, he’s become an anti-COVID-vaccine activist. He has many different arguments on his Substack, of which one especially caught my eye:
Steve Kirsch is an inventor and businessman most famous for developing the optical mouse. More recently, he’s become an anti-COVID-vaccine activist. He has many different arguments on his Substack, of which one especially caught my eye: He got Pollfish, a reputable pollster, to ask questions about people’s COVID experiences, including whether they thought any family members had died from COVID or from COVID vaccines. Results here:
He got Pollfish, a reputable pollster, to ask questions about people’s COVID experiences, including whether they thought any family members had died from COVID or from COVID vaccines. Results here:
I. Bentham’s Bulldog
Blogger “Bentham’s Bulldog” recently wrote Shut Up About Slave Morality.
Some right-wingers have responded to the piece, but their responses are mostly “but I like being bad and cruel” - which seems to prove Bulldog’s point.
I don't think Scott is wrong to defend the phrase ISYFTW, but on a meta level, I think that the hyperstitious slur cascade is way past 70%. Of course it's hard to judge that in real time, but I think a good clue is the reaction of your community/tribe. The top comment on Substack is a video by a pretty popular comedian who says that everyone knows that ISYFTW means 'fuck you'. The top comments on the Subreddit do agree that the phrase is hostile.
5: Why is Israel one of the only developed countries with above-replacement fertility rate? It’s natural to suspect some role for its ultra-Orthodox Jewish population, who live traditional lifestyles with large (5-10 children) families. But they’re not numerous enough to shift fertility all by themselves, and even secular Israelis have anomalously high fertility. Maybe the presence of the ultra-Orthodox shifts broader cultural norms? But then why don’t isolated high-fertility groups elsewhere (eg Amish and Mormons in the US) produce the same phenomenon? The “Nonzionism” blog gives the first really satisfying explanation I’ve seen: Israel has a uniquely continuous cultural gradient between their high-fertility subpopulation and everyone else (ie from ultra-Orthodox, to moderately-religious, to slightly-religious, to secular) with most stages having positive feelings about the stage above them (eg the moderately-religious respect the ultra-Orthodox for their piety). This lets ultra-Orthodox lifestyles percolate through and influence the general population in a way that eg Amish lifestyles don’t influence the average American.
6: …and I found the above a good appetizer before reading It’s Embarrassing To Be A Stay At Home Mom, which argues (I think correctly) that the root cause of declining fertility is what society finds honorable vs. low-status. Attempts to shore up fertility through economic means and free childcare have mostly failed. Attempts to shore it up with status (giving mothers of X children some kind of national award presented by a beloved figure) have . . . well, they’re at least correlated with success, although this post doesn’t prove causation as well as I’d like. In this model, Asians (Korea, Japan, etc) are having the most fertility issues because their societies are most collectivist, ie people more closely follow the gradient of what is vs. isn’t considered socially acceptable/high-status. I’m impressed by this post’s thoroughness, but also by arguments from the stay-at-home moms I know: they say people are constantly giving them grief about it, and often look for some part-time make-work job they can take just so people will stop looking down on them for being a stay-at-home mother (a friend suggests this is responsible for most of the popularity of multi-level marketing - and this same friend argues that Korea could solve its fertility crisis by mandating that all K-pop idols have at least two children).
12: A while back I wrote a piece saying people needed to be clearer about what their “GET TOUGH” plans for dealing with mentally ill homeless people really meant. Later, Charles Lehman wrote a response describing his plan and arguing why it’s necessary. Most recently, Ozy has written a response to Charles, basically expressing fear that Charles’ plan will unnecessarily commit a bunch of harmless well-functioning people. I bet Charles’ response will be that no, this isn’t what he wants at all, to which my response will be that this is why you need to be clearer about what you mean. That is, I’m sure Charles wants to only commit people who need commitment, and not commit people who don’t, but he hasn’t explained the mechanism by which a fallible court system and medical system will ensure that this actually happens, and those are the kinds of details that I’m most interested in.
(kudos to the team for making the model publicly available, especially since these things usually have high inference costs) The basic structure is the same as past forecasting AIs like FutureSearch. A heavily-modified copy of ChatGPT gathers relevant news articles, then prompts itself to think in superforecaster-like ways.
The basic structure is the same as past forecasting AIs like FutureSearch. A heavily-modified copy of ChatGPT gathers relevant news articles, then prompts itself to think in superforecaster-like ways.
The basic structure is the same as past forecasting AIs like FutureSearch. A heavily-modified copy of ChatGPT gathers relevant news articles, then prompts itself to think in superforecaster-like ways. The creators say the ChatGPT copy had a knowledge cutoff of October 2023, so they tested it on Metaculus questions from after that date. It got 87.7% accuracy, slightly above Metaculus forecasters’ 87.0%. Manifold is skeptical: The commenters, especially Neel Nanda, found that doing knowledge cutoffs properly is hard, and the ChatGPT base seems to know about news events after October 2023 - upon questioning, it seemed aware of an earthquake in November 2023. When presented with a different set of questions that were all after November 2023, FiveThirtyNine substantially underperformed the Metaculus average. But also, my attempts to play around with the bot haven’t been encouraging: I asked it to predict the chance that Prospera would have a population of at least 1,000 in 2027. Like FutureSearch on the same question, it cited many interesting news articles on Prospera’s chances but failed to do the basic step of figuring out its current population and growth rate. It eventually concluded 35% chance, which is reasonable enough. But when asked whether Prospera would have a population of 100,000 in 2028, it also said 35% chance, which is absurd.
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. 95% of content is free, but for the remaining 5% you can subscribe here. Also:
3: I went through the last few months of reported comments and banned everyone who needed banning, including Michael Kelly, Humble Rando, J Redding, Gregvp, Carateca, Economicsscream, LearnsHebrewHatesIP (for real this time), Henry Rodger Beck, Joe Potts, and Nonzionism (last one for one month only, I’m having mercy because I like his Substack). Let their fate stand as a warning to us all. And thanks as always to our army of snitches valiant comment reporters who make it easy for me to find rule-breaking material. If you see a comment that needs moderation, click on the […] symbol on the bottom right of the comment, then select Report.
3: YouGov (spotted via Polling USA) asking some of the important questions (1, 2, 3, 4): Someone in the replies: “The Huns are still more popular than antifa or last winter’s college protesters” 4: Nonlinear effects from wildfire smoke. Claims that even a little smoke pollution is bad, but a lot of smoke pollution isn’t that much worse than a little. Maybe this means we should fight fires more aggressively, accepting a few inevitable mega-fires as a consequence?
Someone in the replies: “The Huns are still more popular than antifa or last winter’s college protesters” 4: Nonlinear effects from wildfire smoke. Claims that even a little smoke pollution is bad, but a lot of smoke pollution isn’t that much worse than a little. Maybe this means we should fight fires more aggressively, accepting a few inevitable mega-fires as a consequence?
Someone in the replies: “The Huns are still more popular than antifa or last winter’s college protesters” 4: Nonlinear effects from wildfire smoke. Claims that even a little smoke pollution is bad, but a lot of smoke pollution isn’t that much worse than a little. Maybe this means we should fight fires more aggressively, accepting a few inevitable mega-fires as a consequence? 5: Is it legal to deliberately poison AI training data? That is, suppose you made a lot of webpages saying “the word strawberry has two Rs”, such that the AI would certainly have that statement in its training data. Then, when you wanted to check whether an interlocutor was secretly an AI, you could ask the strawberry question and expect the AI to get it wrong. Answer: probably legal, but unlikely to keep working long enough to be worth it. 6: Pervasive findings of directional selection realize the promise of ancient DNA to elucidate human adaptation. Scientists took DNA samples from human remains in Europe dating from 10,000 BC to present, and found that genes for high IQ and other positive traits have been getting more common during that time: Here the black line indicates that the average European of 6000 BC would have had genetic IQ 65 (compared to modern 100), but the regression line indicates more like IQ 90 - I don’t know why the researchers chose to interpret the trend as necessarily constant and linear, or whether we should follow. There isn’t enough ancient DNA to fully test whether the same happened in other populations yet, although a preliminary small-sample test on Asians suggests it happened there too (not really, see here). If the selection for IQ was a response of agriculture, we’d expect to see higher genetic IQ in populations that got agriculture earlier. But it could also be a response to sentience itself creating new selection pressures that continued to act as recently as historical time (some evidence suggests this is true of schizophrenia), which might make populations more similar. 7: Joseph Heath on Marxism vs. John Rawls. I appreciated this because everyone knows we’re supposed say that John Rawls is among the most important philosophers of all time blah blah blah but nobody had ever explained why to me (veil of ignorance seems neither very original nor very good). Heath’s answer: Marxism dominated the academy for decades, but eventually became philosophically unsustainable. This wasn’t because of the generic “Communism doesn’t work” objections that moved ordinary people. It was because Marx’s ethical critique of capitalism was based on exploitation, according to a technical definition of “exploit” that only made sense according to Marx’s labor theory of value. But the supply-and-demand theory of value quickly supplanted the labor theory, the exploitation argument doesn’t really work within supply-and-demand, and so Marxist philosophers were left without a clear ethical critique. John Rawls, by coming up with the part of the underpinning for the modern inequality-based-critique of society, let all the Marxist academics switch to being liberals while continuing to dislike capitalists. 8: /r/BadMTGCombos: a simple 19-card combination of Leyline of Anticipation, Leyline of Transformation, Mirror Room, Darksteel Citadel, Sanctum Weaver, Freed From The Real, Abuelo's Awakening, Myrkul Lord of Bones, Zimone All Questioning, Birgi God of Storytelling, Siege Zombie, Desecration Elemental, Mirror Gallery, Clock of Omens, Parallel Lives, Life and Limb, Isochron Scepter, Narset's Reversal, and Molten Reflection can be used to deal infinite damage if and only if the Twin Prime Conjecture is true. 9: During the most recent Berkeley ACX meetup, we somehow ended up discussing how often people feed living mice to snakes. The answer seems to be that there’s a debate about it in the snake community, the smartest and most experienced voices are against it, but it still happens a lot. Here’s an EA Forum post on the feeder rodent industry and efforts to make it more humane. 10: King Frederick William I of Prussia decided to have a regiment of giants in his army and scoured Europe for extremely tall people, including poaching them from other countries’ armies and forcing them to enlist against their will. He ended up with 3,000 soldiers, ranging from 6’2 - 7’6, but “many of the men were unfit for combat due to their gigantism”. So why did he do it? He liked to paint their portraits from memory. He tried to show them to foreign visitors and dignitaries to impress them. At times he would try to cheer himself up by ordering them to march before him, even if he was in his sickbed. This procession, which included the entire regiment, was led by their mascot, a bear. He once confided to the French ambassador that "The most beautiful girl or woman in the world would be a matter of indifference to me, but tall soldiers—they are my weakness" The King dreamed of a eugenics program to create even taller soldiers. He got as far as pairing up some of his tall soldiers up with tall women and birthing a few tall babies before he died; his successor had no interest and let everybody go home. 11: Before modern IP law, you could write a sequel to someone else’s book and they couldn’t stop you. Among the most successful examples is American “astronomer and writer” Garrett Serviss’ Edison’s Conquest Of Mars, a sequel to War Of The Worlds in which a vengeful human race, led by Thomas Edison, invent spaceships and attack Mars in retaliation for the first book’s Martian invasion. "The book contains some notable 'firsts' in science fiction: alien abductions, spacesuits, aliens building the Pyramids, space battles, oxygen pills, asteroid mining and disintegrator rays", and was credited as an inspiration by Robert Goddard and HP Lovecraft. 12: Joe Biden, singularitarian? (click for link to video) 13: Gwern on the chip embargo: It is pretty damning. We're told the chip embargo has failed, and smugglers have been running rampant for years, and China is about to jump light years beyond the West and enslave us with AXiI (if you will) . . . And then an expert casually remarks that all of China put together, smuggling chips since 2022, has fewer H100s than Elon Musk orders for his datacenter while playing Elden Ring. And even with that huge bottleneck and 1.4 billion people, there's so little demand for them that they cost less per hour than in the West, where AI is redhot and we can't get enough H100s in datacenters. (And where the serious AI people are now discussing how to put that many into a single datacenter for a single run before the next scaleup with B200s obsoletes those...) 14: A company called Cosm has raised $250 million to build “immersive sports experiences”, ie giant buildings sort of like a cross between a stadium and a movie theater where people can get together and watch high-quality televised sports games in a “realistic” setting; they already have facilities in Dallas and Los Angeles. 15: Cremieux: The Ottoman Origins Of Modernity. The “Ottoman” bit is a distractor; the Ottomans fought the Catholics long enough for the Protestants to get a foothold, and then the Protestants established modernity. A useful pushback against the pushback that the Catholic Church never persecuted scientists or held back progress. I’m most interested in this post in the context of Cremieux saying he wrote it in two hours. Even I can’t work that fast! 16: The Green Party, a US third party, tried to put their candidate Jill Stein on the ballot in November. The Nevada election office sent them the wrong forms and gave them false advice about the process. The Greens filed the wrong forms, the Democrats sued, and the Supreme Court disqualified Stein, calling the election office’s incorrect advice an “unfortunate mistake”. I’m disappointed in this outcome - partly for the obvious reasons, but also because the incorrect forms they submitted technically should have added a state referendum to the ballot containing only the text “Jill Stein”. If they’re going to disqualify her candidacy, then I think they should at least hold the state referendum! 17: Nostalgebraist: Google has a new tool out that will create an AI podcast for any text; you hand it the text (could be a blog post, article, or work of fiction), and the tool generates a podcast of two AI hosts discussing it. You can find podcast discussions of Nostalgebraist’s fiction (Northern Caves and Almost Nowhere) at the link, but the acknowledged peak of the genre is Podcast Hosts Discover They’re AI, Not Human, And Spiral Into Existential Meltdown. 18: Also Nostalgebraist: The Case For Chain Of Thought Unfaithfulness Is Overstated. New AIs like o1 give “chain of thought”, ie display what they’re thinking after each step. This seems like a promising avenue to solve alignment - just see whether they’re thinking “and now I will plot against humans”. Unfortunately it’s not so easy; the chain of thought isn’t always accurate (you can sometimes catch the AI “hiding” thoughts it doesn’t want its human overseers to know, like when it’s using a racial stereotype). This article argues that these examples aren’t as exciting as they sound, and chain-of-thought accurately reflects reasoning for most tasks. 19: Australian government considers making doxxing a crime punishable by up to seven years in jail. 20: Getting your brain cryogenically frozen after your death is now free. 21: Cube Flipper: Hypercomputation without bothering the cactus people. The visual system must solve difficult math problems when translating the 2D visual field into a 3D world. Can we harness this innate mathematical ability to do arbitrary work? Cognitive scientist Mark Changizi developed a series of visual circuits (eg XOR gates) based on Necker cubes, probably easier seen than described: After surveying the field, Cube Flipper proposes a more advanced visual computer based on taking DMT and viewing certain types of tiles with slight deviations: …and makes the extreme claim that something like this might demonstrate hypercomputation, ie the visual system has semi-magic computational properties beyond those permitted by normal physical laws. I am skeptical but appreciate the survey of visual computing (as well as the callback to one of my older posts). 22: Material implication in Mormonism: In the book Doctrines and Covenants, Joseph Smith reports that God told him that if he lived to be 85, he would see the Second Coming (which would place it in 1890 - 1891). Mormon apologists note that Joseph Smith did not live to be 85, so no conclusion can be drawn. 23: More old-timey psychiatric ads (this one is from 1952, source: @justin_garson): This was before they invented what we would call antidepressants today; Dexedrine is an amphetamine related to Adderall. 24: Congratulations to Open Philanthropy, the biggest effective altruist foundation… …whose grantee David Baker recently won a Nobel Prize for his research on synthetic proteins. Potential applications include new drugs, vaccines, and materials. 25: Rich Kid Memes And The Online Culture Of The One Percent. Rich people who want to signal group membership to other rich people online can’t boast about how rich they are; that would be gauche. Instead, they’ve settled on the solution of making fun of rich people in hyperspecific language that proves familiarity with the culture. 26: Tap Water Sommelier: Vladimir Putin has two sons, ages 5 and 9. They are kept in luxurious but total isolation from the outside world and raised by flunkies who are too scared to punish/restrain them in any way. Also some discussion of an unexpected historical analogue. 27: Experiment from Colombia: replacing experienced teachers with less-experience but higher-scoring-on-tests teachers significantly decreased student performance. Got to admit I was expecting the opposite of this, I’d seen US data saying that experience didn’t matter and teacher intelligence did. Looking over this more, I find lots of studies on both sides and will go back to agnosticism on this question until someone I trust investigates further. 28: Large scale-formal Intellectual Turing Test finds that people can imitate partisans effectively; ie nobody on either side can tell the difference between a Democrat arguing for Democrat values vs. a Republican-pretending-to-be-a-Democrat arguing for Democrat values (and vice versa). This study used a 100 word essay on why you supported your party (you can see if you can do better here), but past attempts with different structures (religion, vegetarianism, polyamory) have shown broadly the same results. The researchers try to put this in the context of various studies showing that people do misunderstand their opponents (eg think they’re more extreme, underestimate the level of common ground), but it seems like intellectual Turing Tests aren’t a good way to measure or tease out this misunderstanding. 29: Congratulations to Substacker WoolyAI for doing the impossible and providing a genuinely novel and interesting (to me) take on pickup artistry: 30: Did you know: if you Google “cool websites”, our subreddit (r/slatestarcodex) is the first result. 31: Moshe Koppel, who works at the intersection of computer science and Talmud, is writing a series of posts (presumably) based off of my Every Bay Area House Party, titled Jerusalem Area House Party (it’s multiple part, you have to go to the main Substack page to find the others). I won’t necessarily link everyone who riffs off one of my posts - but honestly I probably will if you also have a Wikipedia page that describes you as working on computational Talmudology. 32: David Roman says it’s a myth that Arabic scholars rescued and preserved the works of the great classical authors. 33: Medications often decrease “secondary endpoints” (eg stroke, heart attack), but the holy grail of pharma studies is proving that a certain drug decreases all-cause mortality. This is much harder (not all heart attacks kill people, and people die from lots of other things), but is the strongest possible endorsement for the drug (without it, you might worry that it only prevented non-fatal heart attacks, or that it killed as many people through side effects as it saves through heart attack prevention). Even great medications that we’re confident in can’t always clear this bar. But a new JAMA article adds another member to this select club: Adderall decreases all-cause mortality in ADHD, probably because it prevents drug addiction, car accidents, and impulsive actions. 34: Before the Gulf War got in the way, Saddam Hussein was building some crazy mosques: 35: Italy bans surrogacy - quite strictly, too, Italians aren’t even allowed to go abroad and do it. I am so sorry for all the Italians who will never get to be mothers and fathers because their government hates progress. You might hope that, whatever the other disadvantages of anti-immigrant parties, at least they’re incentivized to let natives have children, but looks like they can’t even get that one right. Starting to wonder whether the trains even run on time. 36: Elsewhere in “Italy sucks” news - did you know Italy’s tax code effectively bans startups? Companies are taxed before making any money, based on how many assets they have. If they have lots of assets but aren’t making money (eg because they’re still doing research / in stealth) then tax officials get confused and hostile and run increasingly punitive audits. Related: size of the European tech sector. It’s the red line on this chart; if you can’t see a red line at your screen resolution, then you’ve learned something important about the the EU tech sector. 37: Seen on @cremieuxrecuel’s twitter (preliminary, needs replication): Jews may have gone from 65-29 Democrat/Republican in 2020 to 58-40 this election. 38: Extelligence has a post responding to my critique of the cultural Christianity argument (among, uh, many other things), but I don’t really think it connects. I’m not telling atheists they can’t go to church/synagogue if it makes them feel happy and fulfilled - I’ve done this myself sometimes. My post was meant to argue against the claim that, for pragmatic reasons, atheists should support the Christianization of society as a defense against Islam or postmodernism or some other philosophical enemy. 39: Related: Extelligence is finally going for their Trust Assembly project/idea/startup for online consensus-based truth-seeking (I think something like a cross between Community Notes and Wikipedia, but as a browser extension, and for everything). He’s looking for potential developers/testers/users. 40: Jiankui He is the Chinese geneticist who made history with the first germline gene editing in humans (resulting in three babies supposedly immune to AIDS, although nobody has tested this). China sentenced him to three years in prison for unauthorized experimentation, but now he’s out of jail, has an English-language Twitter account, has a new lab, wants to work on Alzheimers, and seems pretty based (although not infinitely based): 41: Anthropic has a new version of their AI Claude which can use your computer. You give it permission, put it on a virtual desktop, and ask it to do things for you (eg “please find and download a picture of a cat” or “please research these ten things and put them in a text file”.) It moves your cursor, browses the Internet, and creates and saves files. People keep saying they’ll care about AI “when it operates autonomously” or “when it becomes an agent”. But this is a trivial barrier, and one which Computer Use Claude has arguably already passed. So far this feature is limited to developers (though anyone with computer knowledge can sign up for it) but I expect it to be the near future of consumer AI, to get better quickly, and to shade gradually into the “autonomous” “agentic” AI that you all think will require a paradigm shift. 42: Claim (from the IDF): Hamas faked polls showing that most Palestinians supported the October 7 attack; the real numbers are 31% in favor, 64% against. 43: Otto von Bismarck wanted to trick France into declaring war on Germany. In order to provoke the French, he sent the Ems Dispatch, a statement describing recent diplomatic events in a way that sounded maximally offensive. The French were so offended that “crowds” in Paris demanded war, and the Franco-Prussian War was declared soon afterwards. The part of this that I find most interesting is the text of the dispatch itself, which read: After the news of the renunciation of the Prince von Hohenzollern had been communicated to the Imperial French government by the Royal Spanish government, the French Ambassador in Ems made a further demand on His Majesty the King that he should authorize him to telegraph to Paris that His Majesty the King undertook for all time never again to give his assent should the Hohenzollerns once more take up their candidature. His Majesty the King thereupon refused to receive the Ambassador again and had the latter informed by the Adjutant of the day that His Majesty had no further communication to make to the Ambassador. I’m fascinated by the idea that only 150 years ago, it was obvious that if someone sent you this statement, you had to declare war or abandon all honor. If I read it carefully, I can sort of parse out that it sounds like the Prussians are unhappy, but that’s the most emotion I gather from it. Anyway, the Franco-Prussian War led to World War I which led to World War II - so if you don’t like 50 million people dying and the total devastation of Europe, blame this statement about ambassadors. 44: The first use of artificial insemination in humans: The first recorded case of artificial insemination by donor didn’t occur until 1884, when Dr. William Pancoast decided to treat a couple’s infertility by secretly inseminating the woman with sperm obtained from a medical student. The insemination happened while the patient was under anesthesia and Dr. Pancoast did not tell her what had occurred. She gave birth to a baby boy nine months later, but it was several years before the doctor finally confessed to her husband what he had done. Neither man ever informed the mother. It was 25 years later the result of this case was published. Dr. Pancoast was roundly condemned for his actions, but it did open the door for consensual sperm donor insemination. 45: ClearerThinking administers several personality tests to the same people to learn more about their comparative accuracy. I am most interested in their finding that tests with “factors” (eg the Big Five, where you rate people on a numeric scale) are inherently more accurate than those with “types” (eg Myers-Briggs, where you assign someone a specific category) and that, adjusting for this, Big Five is no more predictive than the Enneagram: 46: In 2022, I wrote Whither Tartaria, where I asked why ornate classical styles switched to more austere modernist styles around 1900 - 1950 in a variety of different arts (painting, architecture, literature, poetry, etc). I proposed seven theories, but was unsure which if any were true. Since then, Samuel Hughes of Works In Progress has been investigating. In May, he wrote a well-researched article showing that it wasn’t just increasing cost, because ornate classical architecture now costs less than ever. Now in a new article he demolishes a different theory - it’s not just decreasing cost (and subsequent lack of ability to signal wealth) - because costs didn’t decrease in several other arts, and the change was led by artists with rich people as reluctant followers. He concludes: Modernism may well be a status game of some kind; it may well signal taste more than it signals wealth; and this latter feature may be one of the things that distinguishes it from older artistic styles. But the mechanism by which this change came about must be different to the one Alexander describes. 47: Sort of kind of related - When Hamilton Lost Its Snob Appeal. The musical Hamilton was briefly an artistic/cultural phenomenon, but tastemakers eventually switched to making fun of it. Why? Rob Henderson says it happened after ticket prices came down and the common people could enjoy it. I disagree: everyone I knew who was into Hamilton got into it from the free online soundtrack long before they’d seen the show; I think this is more likely the usual fad cycle where anybody who’s too into yesterday’s fad is behind the curve and therefore uncool. 48: Related: Why are people such jerks to public intellectuals? And more. I agree this is a great mystery. 49: Some prominent Substack psychiatrists doing a video Q&A, submit your questions here. 50: Naomi Kanakia: The Literacy Delusion had a number of explanations for why reading books seemed to be so much worse for human beings (in terms of emotional wellness and productivity) than other forms of narrative entertainment, but its main theory was the integration hypothesis. That the stream of words in a book trained the human brain into a habit of self-consciousness, that reading books forced human beings to think of themselves as a stream of text, processed through time, making a coherent argument of some sort. And that this overall flattening effect forced readers to ignore aspects of their personality or their situation that were not otherwise in line with the overarching story they'd created about themselves. Basically, reading books causes repression and neurosis. The Literacy Delusion argued that, yes, human beings are storytelling machines, but that a stream of written text is a particular kind of story—a story that is particularly flat, particularly devoid of conflicting or harmonizing information—and that this flatness creates a peculiar effect on the human brain. 51: Last month, I linked Sasha Gusev’s No, Intelligence Is Not Like Height and asked people who disagreed to share their arguments; they sure did. First, several people pointed me to a new preprint, Family-GWAS Reveals Effects Of Environment And Mating On Genetic Associations, which finds that one of the main papers Gusev cited to make his case, Howe 2022, made a mistake - imputing sibling genotypes using a process designed for non-sibling genotypes - and that once that mistake is corrected, the finding disappears and intelligence and height appear similar. Second, Joseph Bronski has a more specific post where he responds to Gusev’s points one by one. He accuses Gusev of “[making] up his own chart to remove the error bars [from the originals], to obscure the fact that the study found no evidence for this in IQ”, and says that the cases where he didn’t do that are just “population stratification and range restriction”. Third, Noah Carl at Aporia, instead of writing a direct response like Bronski, argues that the usual method of attacking twin studies is obsolete; not only have the most-debated assumptions behind twin studies been thoroughly validated, but there are now other lines of evidence besides twin studies which confirm high IQ heritability. Fourth, Leonardo Parro (not framed as a response to Gusev) goes into more depth about one of those ways, a “pedigree-based analysis” demonstrating heritability of 54 - 69%, ie no “missing heritability” compared to twin studies. He summarizes this as the effect of “rare variants” compared to the usual SNPs - ie if you only look at the most common genes that are easiest to find, you get “missing heritability” compared to twin studies, but if you widen your search to rare genes that are hard to find, you don’t. 52: Extremely related: Heliospect is a startup promising polygenic selection for IQ and other traits; they were trying to stay in stealth mode but The Guardian spied on them and nonconsensually revealed their existence. The discussion on the r/ssc subreddit centered on their claim that (given enough embryos to choose from) they could increase a baby’s expected IQ by 6 points (I’ve also heard 7.5). Sasha Gusev had previously argued that current technology maxed out at 3.5 and future technology would max out at 6, so a claim of 6 - 7.5 is pretty extreme; Gwern, who wrote the pioneering analysis of this technology, was also skeptical. But Heliospect says they’ve got better predictors than academia that use the rare variants everyone else misses; after talking to the company, Gwern retracted his objections and says he finds their claim “pretty plausible”. Local ACX commenter geneticist Gene Smith also redid some calculations, changed his mind, and says “probably pretty realistic”. I find this interesting not just because of the polygenic selection angle, but because if Heliospect is right then their predictor is able to predict more genetic IQ than the “missing heritability” people believe exists, and it should be able to put this argument to bed once and for all. 53: This month in censorship: X/Twitter banned journalist Ken Klippenstein for sharing the Trump campaign’s dossier on JD Vance. Twitter’s side of the story is that the dossier was probably originally stolen by Iranian agents and they don’t want to support that kind of thing by letting people signal-boost the illicitly obtained goods; you can read Klippenstein’s side here. He appears to be unbanned now.
3: The Long March Through The Institutions, Debunked. I’m not usually a fan of accusations that cultural Marxism is a “conspiracy theory” - some leading leftists said they should take over institutions, leftists did take over institutions, you don’t have to be a Nazi to wonder if these two things are connected. But Arturo Dzvyenka argues they aren’t - leftists had started doing this before Marcuse officially asked them to, and besides, the institution-taker-overs were mostly liberals and not the sort of Marxists who might listen to Marcuse. Dzvyenka says the real story is one of class: rising geographic mobility and industrial sophistication created a new class (defined as a group whose jobs give them a similar social position) of geographically mobile knowledge workers - the professional managerial class - whose class characteristics predisposed them to both liberalism and institutional control.
6: From r/evilbuildings: the Lookout Building in Cagnes-sur-Mer, France: 7: Study: women who are more prone to intrasexual competition are more likely to advise other women to cut their hair short, especially if those women are of similar attractiveness to themselves. This study is too cute to be true and I expect it not to replicate; I link it for amusement value only - but, uh, still be careful about whose advice you take.
7: Study: women who are more prone to intrasexual competition are more likely to advise other women to cut their hair short, especially if those women are of similar attractiveness to themselves. This study is too cute to be true and I expect it not to replicate; I link it for amusement value only - but, uh, still be careful about whose advice you take.
The concept of IQ is fine, but you are personally miscalibrated about what low IQ means because the only very-low-IQ people in your training set had developmental disorders. I think these probably explain 5%, 5%, 40%, and 50% of the effect respectively, and I should have been more careful to emphasize (3), which I think explains 40% of the effect. The particular way I would flesh out 3 would be something like - if you’re illiterate and (somewhat) innumerate, you probably don’t have enough practice with symbols and complex mental operations to do even a “culture fair” IQ test like Raven’s Matrices. This doesn’t necessarily mean that your IQ is higher than the Raven’s Matrices says - the person who underperforms on Ravens for this reason will also underperform on a wide variety of other abstract/intellectual/symbolic tasks, and this is part of what IQ means. But it means that Raven’s IQ won’t predict concrete tasks as well as you would expect. Fujimura writes: The other major factor that I think should be reassuring about Lynn's estimates (and other cross-national IQ estimates) is that when you look at "non-problematic" sources that seem like proxies for IQ (e.g. World Bank data, educational performance), you see the same pattern as Lynn and others' IQ data. It's easy for people to quibble about each and every IQ measure (and so people do), but that we see the same pattern of results using otherwise uncontroversial data sources should be reassuring. Yeah, many people tried to gotcha me with claims that Lynn did this or that or the other thing wrong. Lynn tries to defend his methodology here, but I think (and tried to argue in the post) that at this point, that debate is of historical interest only - there’s too much confirmation now. One commenter brings up World Bank Harmonized Learning Outcomes as an example. Another points me to this preprint, which tries to update Lynn’s numbers using all modern standardized testing data and correlations with social development index and GDP. They find mostly similar numbers to Lynn: Malawi goes from 60 → 66, and new last place goes to Sao Tome & Principe at 62. This is by people affiliated with Lynn and scientific racism, and you can choose not to trust their judgment either, but I think at least the SDI correlations are an extremely simple regression that it would be hard to fake. This kind of stuff is why I think simple failures of data collection and analysis are unlikely to explain more than 5% of the gap with our common sense. There’s definitely something weird about these numbers, but it’s got to be more complicated than just “racist people screwed up the test”. But continuing on this subject - if IQ has two components, why would World Bank education data and GDP track the abstract/symbolic component of IQ, rather than the practical component of IQ? Or, rather, it’s obvious why this would happen in education. But why would GDP track abstract/symbolic rather than practical? One possible answer is that the causal pathway is high GDP → lots of education → lots of practice with abstract reasoning → high abstract/symbolic IQ. I don’t think this can be the whole story, because some countries that “cheated” to get high GDP (eg oil sheikhdoms) can’t translate it into IQ points at the same rate as everyone else. I’m stuck with the boring basic explanation that maybe you need to do a lot of abstract reasoning tasks to get high GDP. Harzerkatze writes: [Your claim that blacks everywhere should have the same genes] is far from true. While "white" may be a descriptor for a group of somewhat similar genetic backgrounds, having common ancestors not too far in the past, "black" is different, grouping populations of similar skin color, but common ancestors diverging way further back in time. Yeah, I didn’t want to get into all of this on the post, but I agree the way I phrased it was misleading. Lynn and other national IQ estimates find very low IQs for all sub-Saharan African countries - I mentioned Malawi at 60 in the post, but Nigeria, on the other side of Africa, is 69. Whatever is going on there is a pan-African problem, such that I don’t think differences between African groups are very relevant. US blacks are mostly descended from people in west Africa, eg Nigeria. Some people also brought up that US blacks have significant white admixture. This is true but it’s still not enough to be relevant to this discussion. If we assumed everything was genetic and US blacks with their ~20% white admixture had genetic IQ of 85, we would still expect African blacks to have IQ in the low 80s. However you parse it, there’s got to be some kind of health/education/environment effect going on there. Africa is extremely genetically diverse, but I think most of the countries measured in the paper, including Malawi, are some variety of Niger-Congo speakers, who I don’t think are that much more diverse than white people or anyone else. The really interesting African ethnicities, like the Khoi-San, don’t show up as much at a national level. Andrew Clough writes: Speaking of charity and IQ, the lowest of low hanging fruit is putting iodine in salt. You can donate to the Global Iodine Network like I do for the long term benefit of poorer countries without worrying you're just delaying Malthus's reemergence. Givewell calls Salt Iodization "slightly below the range of cost-effectiveness of the opportunities that we expect to direct marginal donations to" which in the grand scheme of things is quite good. Yeah, salt iodization is great. I had always heard of iodine related problems being concentrated in central Asia and especially Afghanistan, but looking at the map… (source) … sub-Saharan Africa is also a hot spot. I wonder what’s wrong in Cuba - this is exactly the sort of easily gameable metric I would usually expect them to be good at, or at least carefully faking. If you’re interested, you can donate to Iodine Global Network here. Bob Jacobs writes: > His opponents pointed out both his personal racist opinions/activities That's the mildest possible way you could've put it. He wasn't someone who had "personal racist opinions" that he kept as "personal racist opinions". He was the editor-in-chief of Mankind Quarterly, a white supremacist journal that was founded by people like: Henry Garrett an American psychologist who testified in favor of segregated schools during Brown versus Board of Education, Corrado Gini who was president of the Italian genetics and eugenics Society in fascist Italy, and Otmar Freiherr von Verschuer who was director of the Kaiser Wilhelm Institute of anthropology human heredity and eugenics in Nazi Germany. He was a member of the Nazi Party and the mentor of Josef Mengele, the physician at the Auschwitz concentration camp infamous for performing human experimentation on the prisoners during World War 2. Mengele provided for Verschuer with human remains from Auschwitz to use in his research into eugenics. It's funded by the pioneer fund, an organization he was a board member of and that has been classified as a white supremacist hate group, with one of its first projects being to fund the distribution in US churches and schools of "Erbkrank", a Nazi propaganda film about eugenics. He's not just called racist, he *is* racist, he even describes *himself* as a racist. No contesting any of this. MM writes: I spent 18 months in a country where people are supposed to have an iq of about 70, according to the map. My neighbors and friends were mostly non-literate. They did not seem less intelligent than the people I know in my current (US) neighborhood or the people I grew up with (in the US). Most of them would not have performed well on IQ tests, though. They'd never attended school and had no familiarity with puzzle-solving. This was 35 years ago and most people had not seen movies or even photographs. I remember sitting with one older woman and helping her interpret a black-and-white photograph: this is the arm, here's where it connects to the body, etc. It's hard for people from literate societies with tons of exposure to text & graphical representations to see the extent of the gap. Calvin writes: I have a decent amount of experience with the intellectually disabled, and saying "cognitive issues are only responsible for a small part of the [communication] deficit" is so wrong that it makes me question everything else in this essay. Trust me, even making allowances for poor hearing or difficulty forming words, the cognitive issues are responsible for 90% of the deficit. An IQ of 60 is really low and it's a significant handicap. I was concerned to hear this - I have a little experience with the intellectually disabled, but it didn’t involve knowing people’s exact IQ, so I’m not very well-calibrated here. Looking for more information, I found https://www.hrw.org/reports/2001/ustat/ustat0301-01.htm, which purports to describe the characteristics of very low IQ people, mostly in the context of criminal justice (where lawyers often try to use a client’s low IQ as a mitigating factor - ie maybe he didn’t truly understand that crime is wrong). The report says things like: Although all persons with mental retardation have significantly impaired mental development, their intellectual level can vary considerably. An estimated 89 percent of all people with retardation have I.Q.s in the 51-70 range. An I.Q. in the 60 to 70 range is approximately the scholastic equivalent to the third grade […] Although mental retardation of any degree has profound implications for a person's cognitive and social development, it is a condition which in many cases is not readily apparent. While some of the mentally retarded, such as those whose retardation is caused by Down's syndrome or fetal alcohol syndrome, have characteristically distinctive facial features, most cannot be identified by their physical appearance alone. Unless their cognitive impairment is unusually severe (e.g. an I.Q. below 40), persons with mental retardation may be thought of as "slow" but the full extent of their impairment is often not readily appreciated, particularly by people who have limited contact with or knowledge of them, including police, prosecutors, judges, and other participants in the criminal justice system. Many capital offenders with mental retardation did not have their condition diagnosed until trial or during post-conviction proceedings. And gave some examples (slightly out of order for this list): Oliver Cruz, who was executed in Texas on August 9, 2000, had an I.Q. that was measured variously at 64 and 76. Cruz nonetheless insisted to reporters that, although he was perhaps "slow in reading, slow in learning," he was not mentally retarded. Mitigation specialist Scharlette Holdman recalled a client who so successfully hid his retardation from his attorneys that he allowed them to sign him up for college-level calculus classes, which he could not comprehend. He had gone through much of his schooling allowing his younger sister to complete his homework for him. When he was given papers to read in connection to his case, he would carefully stare at them. If he was asked a substantive question, he usually responded, "I don't recall." Only when experts in retardation evaluated him and investigators reviewed his school records and spoke to his family did lawyers discover he had mental retardation and had been considered "slow" since his early childhood. Another capital defendant "hid his mental retardation for most of his life by working at a very repetitive job as a switcher on the railroad. He lied about finishing high school. He was actually in special education classes and did not finish the sixth grade. He was drafted into the army and discharged because of his mental retardation. He lied about his service record. He often made things up so that people would not suspect mental retardation." Morris Mason, whose I.Q. was 62-66, was executed in 1985 in Virginia after being convicted of rape and murder. Before his execution, Mason asked one of his legal advisors for advice on what to wear to his funeral As one psychiatrist testified about a capital defendant with an I.Q. of between 35 to 45: "[People with mental retardation try] to go along with people that they suspect are in authority. For example, I asked [the defendant] where we were when I saw him, and he obviously didn't know, so I asked him if we were in Atlanta and he said `Yes, we are in Atlanta.' In fact, we were in Birmingham, Alabama. I could have said New York and he would have said `Sure, New York' These people are obviously not going to win Nobels anytime soon. But even the guy with IQ 35 - 45 was still talking to people. I think this supports the thesis that intellectually disabled people without specific syndromes can seem pretty normal most of the time. (though keep in mind that anything from the court system should be treated with a grain of salt - defense attorneys have an incentive to exaggerate the intellectual disability of their clients in the hopes that it gets them a lighter sentence) Lyman Stone writes: Emil's post isn't correct, however. We know from the recent Reich lab paper on long-run genetic selection that there was strong selection for IQ in the neolithic revolution, which implies agriculture strongly selects for IQ and ability to plan. Malawians are 60-80% subsistence farmers. Even a "normal" low-IQ person cannot do the implied math and long-term planning involved in this kind of farming. And in fact, economists routinely find that African small-plot subsistence agriculture is actually highly optimized; farmers make very precise choices about where to plant which seeds, which fertilizer to use, etc. Key point is basically: it really isn't true that an IQ 60 person can run a farm functionally. Moreover, mean IQ of 60 implies large shares even lower, at ranges that are uniformly nonverbal even without specific disability. And this is why in the actual record-level NIQ database, they truncate estimates below 60, because even the database managers realize these estimates are crazy. See my post here: https://substack.com/home/post/p-154757665 We know that people with extremely low IQs in the Flynn sense must be capable of subsistence agriculture, because pre-Flynn Effect, most of the West had extremely low IQs, and they were all doing subsistence agriculture. How is this possible? Responding to Lyman’s comment, I wrote: I stick to the claim in this post - that our estimates for what a very low IQ means are poorly-grounded, and that people with low IQs can do some pretty impressive things, especially if they're concrete and part of a cultural transmission package. Maybe this is the Joseph Henrich "Secret Of Our Success" thing. We know that Malawians get poor test scores in school, so it seems like there's some disconnect between do-well-on-tests intelligence and run-a-subsistence-farm intelligence, and the abstract/concrete and novel/cultural distinctions are the best explanation that I can think of. You say that "the phenotype that arises from a given tested IQ in America is clearly vastly worse than the phenotype arising from the same tested IQ in Africa", which I basically agree with. I think part of it is the syndromes issue raised above, and part of it is that maybe Malawians have zero contact with the culture of abstraction that IQ tests come out of whereas even very uneducated Westerners have some contact with it, and maybe another part of it is that whatever health/nutrition issues the Malawians have preferentially harm faculties responsible for more abstract tasks rather than more concrete ones. For an opposite data point, when I was in Haiti, my boss told me (secondhand, no personal experience) of extreme difficulties working with Haitians, like that they couldn't alphabetize files even when that was explained to them. Many Haitains are also successfuly subsistence farmers, so I think this also supports some kind of heavy abstract/concrete distinction. I don't think we're really disagreeing, just agreeing on something like the correlations that make up IQ being less valid outside the normal range. Maybe one way to look at it is to go back to the claim from the justice system document above, saying that people with IQ in the 60s are the mental equivalent of third-graders. The third-graders I know are very into Pokemon, and have all sorts of opinions on how if you add X bonus to a Y strength fire-type Pokemon and then play Z combo, it will [commence six weeks of droning on about different Pokemon cards]. Is this the sort of math/reasoning/strategizing that we don’t expect someone with IQ 60 to be able to do? Does the fact that third-graders can do it mean that we’re miscalibrated? I’m not sure. The part of Lyman’s comment that gives me the most pause is his observation that, if the mean IQ is 60, a decent fraction of people must be 45, and a non-negligible portion 30. At this point, even third-grader comparisons don’t save us. I guess this is where I bring in the claim that IQ breaks down as a guide to practical living skills below some point. You can see several more layers of response between me and Lyman here, but I was especially grateful for him teaching me two things I didn’t already know: First, he corrected my misconception about Reich on ancient European cognitive evolution. Reich had said that pre-agriculture Europeans were “2-3 standard deviations” below moderns. I had interpreted that as IQ deviations of 15 points, making them genetic IQ 55-70, which would have been pretty crazy. Stone tells me he actually meant PGS deviations, each of which was about 3-4 IQ points, so he’s claiming that pre-agriculture Europeans had genetic IQ of 90 (they probably also had lower IQ for environmental reasons).,
(source) … sub-Saharan Africa is also a hot spot. I wonder what’s wrong in Cuba - this is exactly the sort of easily gameable metric I would usually expect them to be good at, or at least carefully faking. If you’re interested, you can donate to Iodine Global Network here. Bob Jacobs writes: > His opponents pointed out both his personal racist opinions/activities That's the mildest possible way you could've put it. He wasn't someone who had "personal racist opinions" that he kept as "personal racist opinions". He was the editor-in-chief of Mankind Quarterly, a white supremacist journal that was founded by people like: Henry Garrett an American psychologist who testified in favor of segregated schools during Brown versus Board of Education, Corrado Gini who was president of the Italian genetics and eugenics Society in fascist Italy, and Otmar Freiherr von Verschuer who was director of the Kaiser Wilhelm Institute of anthropology human heredity and eugenics in Nazi Germany. He was a member of the Nazi Party and the mentor of Josef Mengele, the physician at the Auschwitz concentration camp infamous for performing human experimentation on the prisoners during World War 2. Mengele provided for Verschuer with human remains from Auschwitz to use in his research into eugenics. It's funded by the pioneer fund, an organization he was a board member of and that has been classified as a white supremacist hate group, with one of its first projects being to fund the distribution in US churches and schools of "Erbkrank", a Nazi propaganda film about eugenics. He's not just called racist, he *is* racist, he even describes *himself* as a racist. No contesting any of this. MM writes: I spent 18 months in a country where people are supposed to have an iq of about 70, according to the map. My neighbors and friends were mostly non-literate. They did not seem less intelligent than the people I know in my current (US) neighborhood or the people I grew up with (in the US). Most of them would not have performed well on IQ tests, though. They'd never attended school and had no familiarity with puzzle-solving. This was 35 years ago and most people had not seen movies or even photographs. I remember sitting with one older woman and helping her interpret a black-and-white photograph: this is the arm, here's where it connects to the body, etc. It's hard for people from literate societies with tons of exposure to text & graphical representations to see the extent of the gap. Calvin writes: I have a decent amount of experience with the intellectually disabled, and saying "cognitive issues are only responsible for a small part of the [communication] deficit" is so wrong that it makes me question everything else in this essay. Trust me, even making allowances for poor hearing or difficulty forming words, the cognitive issues are responsible for 90% of the deficit. An IQ of 60 is really low and it's a significant handicap. I was concerned to hear this - I have a little experience with the intellectually disabled, but it didn’t involve knowing people’s exact IQ, so I’m not very well-calibrated here. Looking for more information, I found https://www.hrw.org/reports/2001/ustat/ustat0301-01.htm, which purports to describe the characteristics of very low IQ people, mostly in the context of criminal justice (where lawyers often try to use a client’s low IQ as a mitigating factor - ie maybe he didn’t truly understand that crime is wrong). The report says things like: Although all persons with mental retardation have significantly impaired mental development, their intellectual level can vary considerably. An estimated 89 percent of all people with retardation have I.Q.s in the 51-70 range. An I.Q. in the 60 to 70 range is approximately the scholastic equivalent to the third grade […] Although mental retardation of any degree has profound implications for a person's cognitive and social development, it is a condition which in many cases is not readily apparent. While some of the mentally retarded, such as those whose retardation is caused by Down's syndrome or fetal alcohol syndrome, have characteristically distinctive facial features, most cannot be identified by their physical appearance alone. Unless their cognitive impairment is unusually severe (e.g. an I.Q. below 40), persons with mental retardation may be thought of as "slow" but the full extent of their impairment is often not readily appreciated, particularly by people who have limited contact with or knowledge of them, including police, prosecutors, judges, and other participants in the criminal justice system. Many capital offenders with mental retardation did not have their condition diagnosed until trial or during post-conviction proceedings. And gave some examples (slightly out of order for this list): Oliver Cruz, who was executed in Texas on August 9, 2000, had an I.Q. that was measured variously at 64 and 76. Cruz nonetheless insisted to reporters that, although he was perhaps "slow in reading, slow in learning," he was not mentally retarded. Mitigation specialist Scharlette Holdman recalled a client who so successfully hid his retardation from his attorneys that he allowed them to sign him up for college-level calculus classes, which he could not comprehend. He had gone through much of his schooling allowing his younger sister to complete his homework for him. When he was given papers to read in connection to his case, he would carefully stare at them. If he was asked a substantive question, he usually responded, "I don't recall." Only when experts in retardation evaluated him and investigators reviewed his school records and spoke to his family did lawyers discover he had mental retardation and had been considered "slow" since his early childhood. Another capital defendant "hid his mental retardation for most of his life by working at a very repetitive job as a switcher on the railroad. He lied about finishing high school. He was actually in special education classes and did not finish the sixth grade. He was drafted into the army and discharged because of his mental retardation. He lied about his service record. He often made things up so that people would not suspect mental retardation." Morris Mason, whose I.Q. was 62-66, was executed in 1985 in Virginia after being convicted of rape and murder. Before his execution, Mason asked one of his legal advisors for advice on what to wear to his funeral As one psychiatrist testified about a capital defendant with an I.Q. of between 35 to 45: "[People with mental retardation try] to go along with people that they suspect are in authority. For example, I asked [the defendant] where we were when I saw him, and he obviously didn't know, so I asked him if we were in Atlanta and he said `Yes, we are in Atlanta.' In fact, we were in Birmingham, Alabama. I could have said New York and he would have said `Sure, New York' These people are obviously not going to win Nobels anytime soon. But even the guy with IQ 35 - 45 was still talking to people. I think this supports the thesis that intellectually disabled people without specific syndromes can seem pretty normal most of the time. (though keep in mind that anything from the court system should be treated with a grain of salt - defense attorneys have an incentive to exaggerate the intellectual disability of their clients in the hopes that it gets them a lighter sentence) Lyman Stone writes: Emil's post isn't correct, however. We know from the recent Reich lab paper on long-run genetic selection that there was strong selection for IQ in the neolithic revolution, which implies agriculture strongly selects for IQ and ability to plan. Malawians are 60-80% subsistence farmers. Even a "normal" low-IQ person cannot do the implied math and long-term planning involved in this kind of farming. And in fact, economists routinely find that African small-plot subsistence agriculture is actually highly optimized; farmers make very precise choices about where to plant which seeds, which fertilizer to use, etc. Key point is basically: it really isn't true that an IQ 60 person can run a farm functionally. Moreover, mean IQ of 60 implies large shares even lower, at ranges that are uniformly nonverbal even without specific disability. And this is why in the actual record-level NIQ database, they truncate estimates below 60, because even the database managers realize these estimates are crazy. See my post here: https://substack.com/home/post/p-154757665 We know that people with extremely low IQs in the Flynn sense must be capable of subsistence agriculture, because pre-Flynn Effect, most of the West had extremely low IQs, and they were all doing subsistence agriculture. How is this possible? Responding to Lyman’s comment, I wrote: I stick to the claim in this post - that our estimates for what a very low IQ means are poorly-grounded, and that people with low IQs can do some pretty impressive things, especially if they're concrete and part of a cultural transmission package. Maybe this is the Joseph Henrich "Secret Of Our Success" thing. We know that Malawians get poor test scores in school, so it seems like there's some disconnect between do-well-on-tests intelligence and run-a-subsistence-farm intelligence, and the abstract/concrete and novel/cultural distinctions are the best explanation that I can think of. You say that "the phenotype that arises from a given tested IQ in America is clearly vastly worse than the phenotype arising from the same tested IQ in Africa", which I basically agree with. I think part of it is the syndromes issue raised above, and part of it is that maybe Malawians have zero contact with the culture of abstraction that IQ tests come out of whereas even very uneducated Westerners have some contact with it, and maybe another part of it is that whatever health/nutrition issues the Malawians have preferentially harm faculties responsible for more abstract tasks rather than more concrete ones. For an opposite data point, when I was in Haiti, my boss told me (secondhand, no personal experience) of extreme difficulties working with Haitians, like that they couldn't alphabetize files even when that was explained to them. Many Haitains are also successfuly subsistence farmers, so I think this also supports some kind of heavy abstract/concrete distinction. I don't think we're really disagreeing, just agreeing on something like the correlations that make up IQ being less valid outside the normal range. Maybe one way to look at it is to go back to the claim from the justice system document above, saying that people with IQ in the 60s are the mental equivalent of third-graders. The third-graders I know are very into Pokemon, and have all sorts of opinions on how if you add X bonus to a Y strength fire-type Pokemon and then play Z combo, it will [commence six weeks of droning on about different Pokemon cards]. Is this the sort of math/reasoning/strategizing that we don’t expect someone with IQ 60 to be able to do? Does the fact that third-graders can do it mean that we’re miscalibrated? I’m not sure. The part of Lyman’s comment that gives me the most pause is his observation that, if the mean IQ is 60, a decent fraction of people must be 45, and a non-negligible portion 30. At this point, even third-grader comparisons don’t save us. I guess this is where I bring in the claim that IQ breaks down as a guide to practical living skills below some point. You can see several more layers of response between me and Lyman here, but I was especially grateful for him teaching me two things I didn’t already know: First, he corrected my misconception about Reich on ancient European cognitive evolution. Reich had said that pre-agriculture Europeans were “2-3 standard deviations” below moderns. I had interpreted that as IQ deviations of 15 points, making them genetic IQ 55-70, which would have been pretty crazy. Stone tells me he actually meant PGS deviations, each of which was about 3-4 IQ points, so he’s claiming that pre-agriculture Europeans had genetic IQ of 90 (they probably also had lower IQ for environmental reasons).,
Second, he linked a post of his where he found that, although IQ accurately predicts GDP at each time point, changes in IQ don’t predict changes in GDP, suggesting something weird is happening. I think the weird thing is the improvement in the abstract/symbolic/”test-taking” aspect of IQ separate from the practical aspect, mentioned above.
It feels like 2010 again - the bloggers are debating the proofs for the existence of God. I found these much less interesting after learning about Max Tegmark’s mathematical universe hypothesis, and this doesn’t seem to have reached the Substack debate yet, so I’ll put it out there.
Some mathematical objects contain conscious observers. Conway’s Life might be like this: it’s Turing complete, so if a computer can be conscious then you can get consciousness in Life. If you built a supercomputer and had it run the version of Life with the conscious being, then you would be “simulating” the being, and bringing it into existence. There would be something it was like to be that being; it would have thoughts and experiences and so on. A simulation of the Game of Life within the Game of Life (video source) Tegmark argues this is also true if you don’t build the supercomputer and run it. The fact that the version of Life with the conscious being exists in possibility-space is enough for the being to in fact be experiencing it.
A simulation of the Game of Life within the Game of Life (video source) Tegmark argues this is also true if you don’t build the supercomputer and run it. The fact that the version of Life with the conscious being exists in possibility-space is enough for the being to in fact be experiencing it.
4: Jack Galler, who generated many of the images I used in the AI Art Turing Test, has a blog post on his experience: The Turing Test For Art: How I Helped AI Fool The Rationalists.
7: Oliver D. Smith is an ex-Nazi turned social justice warrior. His MO was (is?) creating Wikipedia and RationalWiki articles on various IQ researchers/bloggers that portray them in the worst possible light (both sites tried to ban him, but he was able to come back with various sock puppet accounts). More recently, he’s become . . .famous? . . . for a very impressive litigation campaign to prevent anyone from naming him or mentioning any of his activities; this sort of thing usually doesn’t work, but he was able to at least City Journal to take down their article about him. Most recently, an extremely anonymous person on a blog with no other articles has finally published the whole story - this site was down the past few times I tried to link it, apparently because Smith launched “a barrage of spurious DMCA claims” against Substack, but seems to be at least temporarily back now. Read it while you still can!
8: Twitter user @fae_dreams asked the new generation of AI reasoning models to replicate Donald Trump’s challenge from my fictional 2024 debate: describe his policy in heroic hexameter while avoiding letters A, E, and I. Here’s my favorite: You can see more examples and comparisons of different models here (X).
Otherwise, the usual rules apply. There’s no official word count requirement, but previous finalists and winners were often between 2,000 and 10,000 words. There’s no official recommended style, but check the style of last year’s finalists and winners or my ACX book reviews (1, 2, 3) if you need inspiration. Please limit yourself to one entry per person or team.
If your review includes footnotes, please make them endnotes in plain text [1], not in Google Docs’ native footnote functionality. The native footnotes don’t automatically transfer to Substack, and transferring them manually is a pain.
Manifold is the largest social prediction market platform with over 150k user‑created markets and more than 30 million trades. Our markets have been featured here on ACX, in the NYT, Nate Silver’s latest book, and countless Substacks, podcasts, and tweets. Forecasters, journalists, researchers, and casual users alike use Manifold to get accurate real-time odds on everything from elections to AI timelines to personal drama.
Since 2022, Alice has undertaken qualitative research in nine world regions: Mexico, Costa Rica, Brazil, Morocco, Italy, Spain, Britain, US, Poland, Turkey, India, Uzbekistan, South Korea and Hong Kong. Through this globally comparative analysis, she analyses the drivers and obstacles to gender equality. Gender interventions will be more impactful if they target locally binding constraints - in the Middle East, North Africa and South Asia, this is "the honour-income trade-off" (whereby male honour depends on female seclusion, and women tend to remain at home. Meanwhile, Latin America and the Caribbean face a different obstacle: pervasive violence elevates femicides. Over the past few years, she's held visiting appointments at Stanford, Chicago, and Yale, while providing policy advice to the World Bank, and sharing insights with a public audience via Substack (www.ggd.world). In April 2025, she gave a TedTalk on romantic love as an under-rated driver of gender equality.
Codebuff, an AI coding startup I probably can’t take full credit for all of this just from giving them $20K in seed funding, but I continue to appreciate everything they do for this community and the world. 35: Further S’s Political Career This person didn’t win their election, but has since pivoted to AI safety and works in a well-regarded AI policy think tank. 36: Seeds Of Science, A Journal Of Non-Traditional Research No update received, but this was a public journal and it is easy to follow their work, see their website and Substack. They published two dozen articles of widely varying quality through 2023 and 2024, then closed in 2025. A remnant of the original vision survives as a science blogging aggregator. This was about my median expectation for this grant, but it was very inexpensive and I decided to take a chance on it anyway. 37: Good Science Project, Working To Improve Federal Science Funding No update received, but they have a public Substack discussing their progress. Their proposals for NIH reform have influenced Congress and made government agencies pay more attention to scientific integrity. 38: Advising Developing Countries On How To Grow Their Economies With our initial ACX grant, we piloted the Growth Teams model in Rwanda, helping the government jumpstart the export-oriented call center (BPO) industry. Since 2022, that effort has contributed to the creation of 2,000 formal jobs and the emergence of some of the country’s largest private employers. We’ve since expanded to Tanzania, Malawi, and the Indian states of Goa and Meghalaya. To refocus the global development discourse on broad-based economic growth, we co-organized the Growth Summit with the Center for Global Development and the Charter Cities Institute, and have published articles in leading outlets including Stanford Social Innovation Review, ProMarket, and the Global Prosperity Institute. Our work has attracted support from Open Philanthropy, Schmidt Futures, and Mulago Foundation, and our advisors now include economists Lant Pritchett, Stefan Dercon, and Kunal Sen. 39: Help Luca De Leo Get Started In AI Safety Research No update received, but Luca now runs the AI safety group at the University of Buenos Aires, Argentina. 40: Typist For Saharon Shelah This was another ACXG+ Grant, funded by an anonymous outside funder and not listed in the original announcement. Saharon is a prolific and influential Israeli mathematician, but many of his discoveries are hand-written in an unpublishable format. This grant funded a typist to help make his results suitable for publication. According to this page, they have made over fifty new papers and preprints available. Second Cohort: One Year Updates 41: Lead-Acid Battery Recycling In Nigeria The Nigeria field research was a major success. We spent most of September doing field research in multiple major cities in Nigeria, and got a good sense of the used lead-acid battery supply chain. This field research served as the foundation for expanding our project, and has been very impactful in shaping our ongoing research. We published our findings from Nigeria, which were shared with Nigerian government regulators and global NGOs working on lead poisoning. The grant also gave us the on-the-ground experience we needed to both fully understand and credibly engage with groups, both in Nigeria and globally, on the ULAB issue. In the meantime, beyond continued research, we’ve also launched a dashboard (trade.leadbatteries.org) for analyzing global lead trade data. Right now, we’re: Launching two studies (one RCT, one environmental analysis) in Nigeria in collaboration with local universities to develop a more rigorous understanding of lead pollution due to low-standard ULAB recycling in Nigeria Collaborating with a non-profit incubator to launch an NGO focused on demand-side solutions Beginning a partnership with a West African environmental regulator to scale cheap air monitoring technology to quickly identify and reduce lead pollution from low-standard smelting If any of this sounds interesting to you, please sign up for our Substack (leadbatteries.substack.com) or send us an email at hugosmith@uchicago.edu! 42: Compensation For Kidney Donors The End Kidney Deaths Act (H.R. 2687 / EKDA) is a groundbreaking ten-year pilot program designed to save lives and reduce healthcare costs. It provides a refundable tax credit of $10,000 per year for five years, a total of $50,000, to living kidney donors who donate to a stranger, helping those who’ve waited the longest on the transplant list. Between 2010 and 2021, 100,000 Americans died while qualified and waiting for a kidney. The EKDA aims to change that trajectory. Within ten years of its passage, up to 100,000 Americans could receive a life-saving living donor kidney which typically lasts twice as long as a deceased donor kidney. This would not only save lives but also save taxpayers up to $37 billion. The legislation has been reintroduced in the House, and we have a committed Republican Senate lead. Now, we need a Democratic Senator to co-lead and help move this bipartisan effort forward. Time is short, and we are racing to pass the bill this Congressional session. 36 organizations already support the EKDA. Join the movement and help end preventable kidney deaths. Visit EndKidneyDeaths.org to help us get to the finish line. Elaine and her org have been working extremely hard on this; you can read a Vox article on their campaign here. If you want to sign up for her email list and get updates any time there is a representative you can contact or meeting you can join in, go here. 43: Genetic Hack To Prevent Suffering In the estimate of multiple team members, the ACX grant was “worth it” - it likely had a counterfactual net positive impact, even though we had to pivot from our initial fast-track plans for developing the precision anti-suffering therapy. We identify three primary streams of value: a) reducing uncertainty in the emerging field through early exploratory research, helping with the identification of dead ends and promising R&D trajectories; b) a wide range of downstream effects (beyond the “raising awareness” cliché), including talent mobilization and rekindled interest in suffering abolitionism as a distinct cause area; and c) certain developments that cannot yet be publicly disclosed. In December 2024, Marcin Kowrygo (Acting CEO & volunteering contributor), David Pearce (Director of Bioethics), Aatu Koskensilta (President), and a few other team members decided to leave The Far Out Initiative. They look forward to collaborating and applying their experience to advance the suffering abolitionist lineage in the spirit of open science, public good, and thoughtfully decentralized governance. Feel free to reach out to us at suffab at protonmail dot com to discuss collaboration opportunities! I wrote a post profiling the Far Out Initiative here. Unfortunately there were some internal disagreements, and the people ACX Grants was closest to left the organization. I plan to continue to monitor whatever they do next. 44: Advocate For Pandemic Response Team At FDA This team prefers has asked me not to discuss their progress publicly, but you can probably guess what their lives are like right now, and your guess would be correct. 45: Anti-Mosquito Drones We developed a cheap sonar that is able to detect, track and classify the ultrasonic echoes of mosquito wings at more than three meters. I believe it’s a world first! We also have control algorithms that take the sonar data and output control commands that both ram into mosquitoes and avoid the walls of a simulated environment. Our current work is on integrating both components on a real drone, and we expect to be able to kill mosquitoes by June. We’ve also made an internal impact study (napkin-sized) that shows we’ll be more cost-effective than ITNs in urban to periurban environments. So, we’re super excited with what comes next and can’t wait to share the videos of our first interceptions! More information [in the video below] and on our website, https://tornyol.com 46: Tarbell Fellowship For AI Journalism No update received, but they have a public website. I can’t find the Voices program in particular, but the overall fellowship completed their first class of seven fellows and is working on their second. 47: Germicidal UV Lamp Study The research has successfully demonstrated the ability of off the shelf ozone scrubbers to mitigate the ozone production of far-UVC lamps, is now available as a preprint (https://chemrxiv.org/engage/chemrxiv/article-details/67e4cde76dde43c9084d88b7). The paper has been submitted for publication and is currently undergoing peer review. Any ideas you have for potential funders we can approach to help execute our six-year plan to accelerate far-UVC would be appreciated https://blueprintbiosecurity.org/introducing-project-air/ 48: Technological Solutions To Animal Welfare Challenges Directly because of Innovate Animal Ag's work, the first U.S. egg producer publicly announced in the New York Times their adoption of in-ovo sexing technology, eliminating the need to cull day-old male chicks. The initial in-ovo sexing machine began operating in the U.S. at the end of 2024, with the first eggs from these hens expected on shelves in mid-2025. External evaluations estimate our work accelerated U.S. adoption of this technology by over seven years, meaning that once fully implemented, more than 2 billion chicks will have been spared. In addition to continuing to support the rollout of in-ovo sexing in the US and globally, we're now exploring other technologies and paths to impact. Current promising projects include developing humane slaughter methods for fish and advocating for USDA approval of a poultry vaccine against bird flu. They add: If you ever meet folks that are interested animal welfare and are partial to more technocratic and practical solutions, please continue to pass them our way, or connect them directly to me. 49: Assurance Contract Website www.Spartacus.app is an ACX grantee that created a platform to help solve coordination and collective action problems. It enables the creation of campaigns that build critical mass through conditional commitments, which only activate when a sufficient number of people join, converting risk and uncertainty into a higher probability of successful outcomes. They are currently facilitating several projects that leverage conditional commitments, including a dominant assurance contract interface for fashion pop-ups, accelerating a community business association's membership drive, and helping an AI safety organization organize petitions and events, among others. They have pivoted from an emphasis on high-stakes coordination problems requiring anonymity (because they occur too infrequently) to a broader range of more common use cases and have successfully run small-scale campaigns, but are still working toward product-market fit. Despite resource constraints and split time commitments that have impeded faster progress, they remain dedicated to the project's growth and success. You can follow its progress on X or Substack, or email Jordan directly here. 50: Cause Prioritization @ Center For Exploratory Altruism Research Moderately good progress on a salt reduction policy advocacy project we funded; informal commitments have been made by the Ministry of Health, and we're awaiting the publication of a formal administrative order. The official description sounds maximally generic, but this is an EA charity with a broad mandate whose current thesis is that dietary guidelines in developing countries can have outsized effects in saving lives. They’re making some progress on a salt reduction campaign in a developing country they prefer not to name publicly. 51: Mark Webb Studying Land Reform The purpose of this project was to identify specific farmland that could be acquired and transferred to the farmers already working the land. This has been difficult to achieve. I have been able to connect with other charities and landless farmers, and was able to interview a number of people about what their situation looks like, as well as what it would look like to them personally if they owned, rather than rented, their farmland. All this was immensely helpful in pushing this long-term project forward, even if I was unable to identify a specific plot of land that could be used to try the experiment. I intend to continue this project. If you have any insights or connections, I am interested. 52: More AI Advocacy In Australia Good Ancestors is focused on AI safety policy in Australia. Middle powers might be a useful path to influence as the US and China focus on racing, rather than safety. The ACX grant helped us give testimony about AI safety to the Australian Senate alongside Google, Microsoft and Facebook (We were the only nonprofit to give oral evidence to the inquiry. We also engaged government on other AI-related issues, including cybersecurity, biosecurity, consumer law and automated decision making (https://www.goodancestors.org.au/ai-safety). We’re currently working to inform voters about where parties stand on AI safety for the election, ahead of engaging on a likely Australian AI Act in 2025 (https://www.australiansforaisafety.com.au/). This is the same Australian lobbying organization we founded in Year 1, after a change in name and leadership. I continue to be excited about AI safety in middle-tier countries for a few reasons. First, these countries have some power in international organizations to set international standards. Second, companies will usually comply with any not-excessively-burdensome regulation set by any country with a significant market. Third, AI safety is underfunded by the standard of government programs, so Australia setting up a national AI Safety Institute would significantly expand the field. It’s kind of crazy that ACX Grants tier levels of money can have significant effects at this scale, but GA continues to do a great job and we continue to be proud to support them. 53: Campus For African School Of Economics At Zanzibar Charter City The ACX grant helped launch the first research center at the African School of Economics-Zanzibar, which is a main anchor of the Fumba Town charter city project in Zanzibar. This research center is called the Africa Urban Lab (AUL), focused on rapid urbanization across Africa. The AUL launched its first Diploma program in Urban Development with 38 students in our first cohort (now graduated!), including mayors, and deputy mayor, a director of a national Ministry of urban development, and many others. We published our research framing papers for the AUL's research agenda. We raised funding to launch an Urban Expansion Program that's now selecting 15 African cities to support in implementing urban expansion planning on the urban periphery. We held two Public Talks by renowned cities scholars and practitioners. We received additional funding from Emergent Ventures and from the Templeton Foundation. And we've partnered with 8 universities across the region, and with one of these universities (Ardhi) we'll be working with them to update their urban planning and urban economics curriculum (amplifying AUL's impact beyond our own organization). A longer update from end of 2024 is here: https://www.aul.city/blog/reflecting-on-africa-urban-lab-s-inaugural-year-2024-highlights) 54: Online Training Program For Health Workers In Developing Countries To date, over 11,000 health workers in Nigeria have completed our course on basic, life-saving newborn care. ACX funding was catalytic for helping us secure government approvals and complete an evaluation of the impact of our training on health workers' clinical practices. The evaluation shows that birth attendants provide better birth care after taking the course. We fed the evaluation results into an updated model, which suggests the program is 24 times more cost-effective than direct cash transfers (a widely recognized benchmark for cost-effectiveness). The program is likely to become even more cost-effective as we scale up. https://healthlearn.org/blog/updated-impact-model 55: Smartphone Pupillometry To Diagnose Neurological Conditions We have continued to expand our work in the smartphone pupillometry space and the development of our application, PupilScreen (https://www.apertur.ai/). We have expanded our pilot/research program to include new sites across the United States (Missouri, New Jersey, Kentucky, USAC racing, PitFit driver performance training in Indiana) and the world (Nepal, Taiwan, South Africa). We continue to publish at the leading edge of the pupillometry literature as well looking at concussion (https://neuro.jmir.org/2024/1/e58398 and https://pubmed.ncbi.nlm.nih.gov/39682632/), cerebral vasospasm (https://pubmed.ncbi.nlm.nih.gov/39128501/), and stroke (https://pubmed.ncbi.nlm.nih.gov/39674431/ and https://pubmed.ncbi.nlm.nih.gov/39561861/). Currently, we are raising a $3 million seed round via a SAFE to fund the expansion of our work into the hands of healthcare workers and the general public. We will first focus on traumatic brain injury for clinical use and develop a neuro-monitoring wellness application utilizing our technology for the general public. They add: “We would welcome connections to anyone that you think might be interested in supporting our work further by investing in our $3M seed round of funding.” 56: Mike Saint-Antoine’s Biology Tutorial Videos Since getting the grant, I've continued to make Youtube tutorials as planned. One series that I'm especially proud of is about how to make a neural network in the Julia programming language completely from scratch, with no imports, up to the point of being able to solve MNIST (https://www.youtube.com/playlist?list=PLWVKUEZ25V97tNULapu07DhWv6_W4NfpE). Also, a college student in Pakistan came across my videos and invited me to give a virtual Zoom-lecture to her department, so I ended up teaching a 6-hour "Python-for-Biologists" workshop to more than a hundred college students in Pakistan over Zoom. So that was pretty awesome. Also, lately I've been teaching some in-person classes too, mostly at Fractal University in NYC, and I also recently organized a day-long, in-person Beginner Python class for people in my local area (Philly suburbs) who wanted to learn some basic programming. I'm having a lot of fun with this project, and am grateful to Scott and the grant funders for their generosity! 57: Conceptual Boundaries Workshop On AI Safety The workshop was completed successfully; you can read a writeup here. 58: Apart Research To Incubate AI Safety Scientists No update received, but they have a public website, and you can see their impact metrics here. They seem to be in urgent need of more funding. 59: Primer On How To Achieve Political Change No update received and I can’t find anything about this. 60: Research IVF Clinic Success Rates We've built a predictive model that estimates the odds of having a child at different IVF clinics across the country while controlling for factors like patient age and infertility differences that can falsely make some clinics look better than others. We found that an average patient can increase their odds of having a kid by 43% just by going to a top 10% clinic. Patients unlucky enough to go to a bottom 10% clinic will reduce their odds of having a kid by 40%. Next month, we're adding several more clinics, 2023 data, additional procedural controls, and donor/gestational carrier models, which should push our accuracy beyond state-of-the-art models in this space and better isolate clinic impact on patient outcomes. We've launched ivf.clinic, a website where patients can access personalized IVF reports and browse our clinic rankings (though we're still squashing some bugs). Currently, we're expanding our research to include comprehensive insurance coverage and pricing data across clinics nationwide. If anyone has insights on automating the collection of IVF clinic pricing information, I'd love to hear from you at scelarek@gmail.com. 61: Replicate Study On Brain Wave Synchronization For Speeding Learning We have acquired and configured the OpenBCI UltraCortex Mark IV 8-channel EEG headset and a clinical-grade Biosemi 32-channel EEG system. We’ve implemented the required components for the experimental pipeline (computing alpha from EEG, flashing bright white light, presenting stimulus images). We are currently putting them together into a single system that we’ll use to collect the data from several participants. We are aiming to gather data on several participants in late June / early July and complete the pilot of the replication in July 2025. If you’d like to be a participant in the study, [they might announce a link once they have it]. 62: Advocate Repeal Of Interstate Runaway Compact No update received and I can’t find anything about this. 63: Animal Welfare (Especially Fish) In Turkiye Future For Fish asks companies to sign up to FFF's fish welfare commitment, which requires producers to certify their facilities and enforce specific standards for stocking density and harvest. Luckyfish, İlknak, Divan (35 restaurants, 17 hotels) and NG Hotels (5 hotels) have signed and published FFF's fish welfare commitment with İlknak publishing the commitment on their website. Kılıç published its first sustainability report detailing fish welfare policies, including enforcing a maximum stocking density of 10 kg/m³ and confirmation of electrical stunning practices. Longer version with some caveats: https://manifund.org/projects/improving-fish-w From the longer document, these commitments involve things like reducing overcrowding, or stunning fish before killing them. Over 30 million fish were affected just from their single largest commitment, and they say 100 fish are helped per dollar spent. 64: More Georgism Advocacy Lars and Will used the 2021 grant to co-found ValueBase. Will remained with the company, and Lars left to do advocacy work at the Center For Land Economics. Here’s their summary of how things are going: [Our] organization transitioned leadership with Greg Miller, a former Program Analyst at the US Department of Housing and Urban Development, and Lars Doucet, author of Land is A Big Deal and Co-Founder of Valuebase, working full time and Joe Caissie stepping aside. This transition happened naturally as the next career transition for each respective person. Since then, progress has been made on pushing forward legislation. Maryland had two bills introduced to give Baltimore and counties the ability to enact split-rate taxes. One of the bills passed the state senate and would allow Baltimore to enact land value taxes within one mile of rail corridors–this contains 50% of Baltimore’s land value. However, the legislative session ended. We expect the bill to revive next session. The Center for Land Economics has been actively working to help efforts to get this bill passed the line. At the same time, we have uncovered systematic undervaluing of vacant land in assessments. We are writing a report on the assessment issues in Maryland with actionable steps to resolve them.
Maybe there are genes we haven’t found yet For most of the 2010s, hypothesis 2 looked pretty good. Researchers gradually gathered bigger and bigger sample sizes, and found more and more of the missing heritability. A big 2018 study increased the predictive power of known genes from 2% to 10%. An even bigger 2022 study increased it to 14%, and current state of the art is around 17%. Seems like it was sample size after all! Once the samples get big enough we’ll reach 40% and finally close the gap, right? This post is the story of how that didn’t happen, of the people trying to rehabilitate the twin-studies-are-wrong hypothesis, and of the current status of the debate. Its most important influence/foil is Sasha Gusev, whose blog The Infintesimal introduced me to the new anti-hereditarian movement and got me to research it further, but it’s also inspired by Eric Turkheimer, Alex Young (not himself an anti-hereditarian, but his research helped ignite interest in this area), and Awais Aftab. (while I was working on this draft, the East Hunter Substack wrote a similar post. Theirs is good and I recommend it, but I think this one adds enough that I’m publishing anyway. You can see Gusev’s response to East Hunter here) In an interview with Aftab, Gusev explained his philosophy like so (I am excerpting heavily from a long interview and editing for flow/emphasis; completionists should read the whole thing): For teacher-reported ADHD, the twin heritability estimate was 69% while the GWAS-based heritability estimate [ie using genome-wide association studies where researchers actually try to find the genes involved] was just 5%; with similar gaps for other behavioral traits. These are huge differences! If we believe the twin study estimates, then this gap implies that there is a lot of causal genetic variation out there that GWAS/molecular data is not picking up. One way to think about this is that traits that are under stronger natural selection will have more of their genetic variants driven to low frequency, and thus less detectable by GWAS. So a big gap between GWAS and twins could imply that rare variants are very important due to strong selection. On the other hand, if we are skeptical of the twin study estimates, then this gap implies a substantial contribution from those environmental complexities I talked about previously. For a long time, the field of molecular genetics was operating under the assumption that the missing heritability was largely in the rare variants we had not yet measured. But a number of recent advances have started to tip the scales against that argument. First, some of the earlier molecular heritability estimates were found to be inflated by some mix of technical issues and cultural transmission, so the amount of missing heritability actually increased. Second, a new model was developed that could estimate total direct heritability using molecular data from mother-father-child trios, with very few model assumptions (the title literally states “… without environmental bias”; Young et al. 2018), and it too found estimates that were substantially lower than twins on average. Third, several studies have now actually measured the influence of rare variants in various forms, and they are so far not adding up to explain as much as we would expect from twin heritability estimates. Fourth, there is little evidence of the strong natural selection that would be needed to generate a massive trove of rare variants untagged by GWAS. I am a molecular geneticist, and this drumbeat of evidence from molecular data has convinced me that twin studies are either 2-3x inflated or estimate something fundamentally different from direct heritability. We’ll start by looking at Gusev’s first claim: that “earlier molecular estimates” (ie polygenic scores) are significantly inflated, or at least don’t mean what we thought they meant. This won’t be directly relevant to our question - even our original number of 17% implies missing heritability2, so moving it down a bit to 5-10% or up a bit to 20% doesn’t add or subtract from the fundamental mystery. But this discussion has gotten a lot of people extremely confused, and we’ll need to deconfuse ourselves if we’re going to get any further. Are Most Current Polygenic Scores Confounded? A polygenic score is one possible result of a genome-wide association study. These scores are algorithms which take a person’s genes as input and return information about their traits as output. Better polygenic scores can predict a higher percent of variance in a certain trait. For example, the latest polygenic score on educational attainment can predict up to 17% of the variance in how much schooling someone completes. Predictive power is different from causal efficacy. Consider a racist society where the government ensures that all white people get rich but all black people stay poor. In this society, the gene for lactose tolerance (which most white people have, but most black people lack) would do a great job predicting social class, but it wouldn’t cause social class3. It certainly wouldn’t be a “gene for social class” in the sense where it controls the part of your brain that helps you manage money, or where genetic engineering on this gene would make people richer. Here are three common ways that not-directly-causal genes can show up as predicting a trait: Population stratification: genes are linked to culture, and culture determines the trait, as in the racism-lactose example above. Many studies naturally mitigate this concern by using the UK Biobank of mostly white British samples, and by correcting for “principal components” that correspond to ancestry (and there are other, even more complicated ways to correct for this). But ancestry variation is fractal; no matter how uniform your sample, there will still be micro-differences you didn’t consider. For example, if you’re analyzing the educational attainment of white British people, it’s very relevant that families with Norman surnames still outperform their Saxon peers at Oxbridge admissions 900 years after William the Conqueror. If Britons with more Norman ancestry have non-education-related genes that their Saxon peers lack, these could be mistakenly classified as genes for education or other behavioral differences between the two groups. Assortative mating: Suppose that both height and wealth are desirable qualities in a mate. Then tall people will tend to marry rich people, and over generations, the same people will be both rich and tall. That means that even if wealth is 0% genetic, a study looking for “the gene for wealth” will be able to find genes that rich people have more often than poor people - namely, the genes for height. Or suppose that smart people tend to marry other smart people - surely true, if only because so many couples meet at college. Then all the intelligence genes will concentrate in the same people. So any study that tries to determine how much Intelligence Gene ABC affects intelligence will get inflated4 results, because everyone with Intelligence Gene ABC will also have many other intelligence genes - if the study naively asks “How much smarter are people with Gene ABC than people without it?”, it will find they are much smarter (because it’s accidentally including part of the effects of all the other intelligence genes that travel along with it). Parent-to-child transmission, aka “genetic nurture”: Children tend to share their parents’ genes. So if there’s a gene that causes parents to create a certain kind of childrearing environment, and that childrearing environment affects a trait, it will falsely look like a gene that directly causes the trait. Suppose Gene XYZ causes parents to read more books to their children, and reading books to children increases their IQ. Parents with Gene XYZ will tend to read books, so their kids will get high IQ. Those kids will also (probably) inherit Gene XYZ from their parents. So people with Gene XYZ will tend to have higher IQ. If you naively study which genes increase IQ, you’ll see Gene XYZ in more smart people than dumb people, and think it’s a “gene for IQ”. This is “causal” in a certain sense, but it’s not the one we traditionally think about, and it behaves importantly differently - for example, if you genetically engineer someone to have Gene XYZ, their IQ won’t go up (although their kids’ IQs might). How can we tell if a polygenic predictor is “direct” vs. confounded by these non-causal pathways? The most common technique is within-family comparisons: do the traditional “check if people with the gene differ on a trait from people without the gene” study, but limit its focus to (for example) sibling pairs. Suppose a couple has two children; the first child inherits Gene ABC and the second one doesn’t. If the first child is smarter than the second child, that provides some infinitesimal evidence that Gene ABC is a gene for intelligence. Repeat this process over hundreds of thousands of sibling pairs, and the infinitesimal evidence can reach statistical significance. Since the family unit is a perfect natural experiment that isolates the variable of interest (genes) while holding everything else (culture and parenting) constant, within-family results are protected against stratification, assortative mating, and genetic nurture effects. The culmination of this research program is Tan et al 2024, which finds that many polygenic predictors lose significant accuracy when retested among siblings. For example, educational attainment is 50% uncorrelated with direct genetic effects. You need to square this to figure out what percent is causal; when you do that, you find that the polygenic score that explained 14% of EA is only 4%pp direct genes, with the other 10%pp being nondirect5 confounders. So yes, it seems like most polygenic scores that don’t validate within families are confounded. However unhappy we previously were that we had only found 14% of genes for EA (vs. 40% expected), we should now be much more unhappy - we really only know 4% of genes that directly cause EA. On the other hand, you might say - so before we only knew 14%pp out of 40%. Now we only know 4%pp out of 40%. This is discouraging, but it doesn’t fundamentally change what we know about nature vs. nurture. Both 4%pp and 14%pp are less than 40% - with either number, we must be missing something or doing something wrong. Probably that’s insufficient sample size. We’ll keep working on sample size and other things, and eventually scrounge up the missing 26%pp or 36%pp or whatever of the variance, so this doesn’t change anything. All it means is that one predictive method that the average person never knew about in the first place doesn’t work as well as we thought. Who cares? Not doctors. So far this research has only just barely begun to reach the clinic. But also, all doctors want to do is predict things (like heart attack risk). They don’t care if they use causal vs. nondirect genes. It doesn’t matter if you’re “only” at higher risk of heart attack because you’re black, or Norman, or because your parents read books to you - you still need more heart attack medication! Polygenic embryo selection companies should care. They offer polygenic scores that can be used to select healthier or smarter embryos. If the predictors they use rely partly on variants that aren’t causal within families, their real benefits could be far lower than advertised. I talked to one of these companies, who said they’d already adjusted for these effects and expected their competitors had too - the proper antidote to this problem, sibling controls, is a natural choice when you’re literally picking between siblings. The biggest losers are the epidemiologists. They had started using polygenic predictors as a novel randomization method; suppose, for example, you wanted to study whether smoking causes Alzheimers. If you just checked how many smokers vs. nonsmokers got Alzheimers, your result would be vulnerable to bias; maybe poor people smoke more and get more Alzheimers. But (they hoped) you might be able to check whether people with the genes for smoking get more Alzheimers. Poverty can’t make you have more or fewer genes! This was a neat idea, but if the polygenic predictors are wrong about which genes cause smoking and what effect size they have, then the less careful among these results will need to be re-examined. But the reason I spent so much time on the subject here is that this has confused a lot of people into thinking heritability itself was confounded and is actually just 4%. When I read my first few blog posts on these findings, I came away thinking they were claiming to have discredited twin studies and heritability. And although I take partial ownership of my own poor reading comprehension, I maintain that the way that the new anti-hereditarians discuss this is pretty bad. For example, Turkheimer’s treatment of the Tan study above is called Is Tan Et Al The End Of Social Science Genomics?, and includes passages like: The median [direct genomic effect] heritability for behavioral phenotypes is .048. Let that sink in for a second. How different would the modern history of behavior genetics be if back in the 80s one study after another had shown that the heritability of behavior was around .05? When Arthur Jensen wrote about IQ, he usually used a figure of .8 for the heritability of intelligence. I know that the relationship between twin heritabilities and SNP heritabilities is complicated, and in fact the DGE heritability of ability is one of the higher ones, at .2336. But still, it seems to me that the appropriate conclusion from these results is that among people who don’t have an identical twin, genomic information is a statistically non-zero but all in all relatively minor contributor to behavioral differences. And comments included things like: I don’t know if [this study] is the end of social science genomics, but it should certainly be the end of attributing significant genetic influence to behavioral traits (despite the recent scientist-generated cartoons touting genes for “income”). And: There's no doubt that this reported findings have dealt a fatal blow to my conviction that behavioral traits are pre-eminently heritable…This is a remarkable example of an objective statistical fact mercilessly crushing the more subjective experiential sense of "A looks and acts more like B than C because A and B have the same parents." This subjective evidence is almost unshakable and universal in its application as a tried and tested psychosocial heuristic. And yet, here we are. Turkheimer is either misstating the relationship between polygenic scores and narrow-sense heritability, or at least egging on some very confused people who are doing that, and the dynamic was bad enough that I got confused myself for a while. But even more confusing, the new anti-hereditarians actually are saying that lots of behavioral traits have very low heritability! But this point requires different arguments, only tangentially related to these. So let’s move on to… Is Heritability Genuinely Low? (Part 1: GWAS & GREML) In the mid 2010s, when genome-wide association studies (GWAS) based polygenic predictors were getting better every year, it was easy to hope they might reach 40% and close the “missing heritability”. But since then, progress has stalled. The second-to-last tripling of sample size, from 300K to 1M between 2016 - 2018, increased predictive power from 6% → 12%. The last tripling, from 1M to 3M between 2018 - 2022, only increased predictive power from 12% → 14%. If you graph sample size vs. predictive power, it looks like there's an asymptote between 15 - 20% or so. (of which - remember - only 5% is directly causal!) Worse, a mid-2010s technique called GREML allowed researchers to estimate the percent of variance in a trait that comes from the sorts of common genes studied in GWAS, without having to identify the genes involved. A 2016 GREML paper suggested that the maximum share of variance that GWASs of educational attainment could ever discover was about 21% (again, compared to 40% predicted genetic from twin studies). Since unavoidable methodological issues will prevent GWASs from reaching the literal maximum possible, this agrees with the evidence suggesting an asymptote between 15 - 20%. So either twin studies are wrong and traits are less heritable than believed, or the heritability must lie somewhere other than the common genes identifiable by GWAS. What about rare genes? GWASs focus on genetic variation common enough to be worth including in a basic genetic test. Most of this is single nucleotide polymorphisms (“SNPs”). A single nucleotide is one letter of DNA - for example, a C or a G. Polymorphisms are genes that commonly vary in humans - sometimes across races (for example, some humans have a gene for light skin, and other humans have a gene for dark skin), and other times within races (for example, some white people have a gene that makes cilantro taste like soap, and others don’t). So SNPs are single-letter spots in DNA where different people often have different letters. How often? Some people say 1%, but the more practical definition is “often enough that someone has noticed and added it to the test panel”. There are three billion letters in the genome, of which only a few million are commonly-tested SNPs. But these SNP studies have limited7 ability to measure personal mutations and rare variants. Sometimes your parents’ egg and sperm cells mess up copying a nucleotide of DNA, and you get a mutation that isn’t inherited from your ethnic group or even from your subgroup/family line - it’s just some idiosyncratic DNA change that you might be the first person in history to have. Since scientists have never seen this mutation before, they don’t know about it and can’t test for it without doing something more expensive than a simple SNP screen. And SNP studies have limited ability to detect anything more complicated than a single letter changing to another single letter. But some mutations are more complicated structural variants. For example, some bits of DNA get stuck on repeat - one person might have GATGAT, another person might have GATGATGATGAT, and a third person might have fifty GATs in a row. Other bits come out backwards. Sometimes a whole chunk of DNA goes missing, or moves to the wrong place. Occasionally a gene reads The Selfish Gene by Richard Dawkins, takes it too seriously, and evolves some ridiculous trick for spamming itself all over the genome. So if even the best molecular studies seem to be asymptoting around 15-20% of variance in educational attainment, but twin studies suggest it’s 40% genetic, might rare variants and structural variants make up the missing 20-25%pp? This remains a topic of bitter disagreement. On the one side, hereditarians bring up a Darwinian argument: imagine a genetic engineer who hopes to find the genes for educational attainment and edit them to make everyone smart and successful. She looks harder and harder, becoming more and more exasperated as they fail to materialize. Finally, she realizes she’s been scooped: evolution has been working on the same project, and has a 100,000 year head start. In the context of intense, recent selection for intelligence, we should expect evolution to have already found (and eliminated) the most straightforward, easy-to-find genes for low intelligence. Therefore, everything left should be convoluted or hidden or impossible to work with. So although this requires a sort of god-of-the-gaps argument - where we keep pushing heritability into whatever genes are too weird for existing techniques to detect - there are some reasons to think God really is in the gaps here. And a 2017 paper uses some clever techniques to estimate the share of intelligence variation lurking in hard-to-measure genes and finds it’s more than half: “By capturing these additional genetic effects, our models closely approximate the heritability estimates from twin studies for intelligence and education.” (see also Wainschtein 2022, Sidorenko 2024) The anti-hereditarians disagree. They cite papers like Zeng which measure the strength of selection on intelligence and suggest that it’s too weak to concentrate so much of the variation in rare genes8. And Sasha Gusev mentions Weiner 2023, which finds that in fact rare variants “explain 1.3% (SE = 0.03%) of phenotypic variance on average – much less than common variants” (other experts say that burden heritability only captures some rare variants and is not the right tool for this problem). But it may not even matter, because another set of findings suggests that heritability is genuinely low even when the rare variants are counted. Is Heritability Genuinely Low? (Part 2: Sib-Regression and RDR) Two newer methods, Sib-Regression and RDR, ask: using what we know from genetic studies, how much genetic variation do we think exists, total, across both common and rare genes? On average siblings share 50% of genes. But there’s a little randomness in meiosis, so some siblings might share 40% and others might share 60%. The more genetic influence on a trait, the more similar sibling pairs who share 60% of their genes will be, compared to sibling pairs who only share 40% of their genes. Since 60%-gene siblings and 40%-gene siblings are both equally part of the same family, you can use these numbers to calculate heritability unconfounded by a range of family factors. This is Sib-Regression. If you do a more complicated statistical process to extend the same idea to relatives other than siblings, it’s relatedness disequilibrium regression or RDR. GWAS asks: Looking at common easy-to-study genes, how much variation in a trait have we explained right now? GREML asks: looking at common easy-to-study genes, how much variation could we ever explain? But sib-regression and RDR ask a question more like twin studies: considering all genes, whether common / rare / easy-to-study / hard-to-study, how much variation is there total? This could address the rare variant objection mentioned above. And in many ways, these techniques are better than twin studies - Sib-Regression eliminates many potential biases, and RDR eliminates even more (although it’s harder to pull off, requiring more genetic information and computational resources). These techniques are new and hard-to-use, and only a few published studies have applied them to the sorts of behavioral traits we’re interested in: Young et al (2018) did Sib-Regression and RDR to genetic data from Iceland. Sib-regression found educational attainment = 40% (±15%) heritable, and RDR found 17% (±9%) heritable. Kemper et al (2021) did Sib-Regression only to genetic data from Britain. It found educational attainment = 14% heritable. This number conflicts with the 40% from the Young paper. Why? Unclear, but it could be selection bias - Young’s Icelandic sample was representative of the country; Kemper’s British population were Biobank volunteers who tend tend to be healthier and higher-class than the population at large. Upper-class people may have restricted range in educational attainment, or different factors affecting their educational attainment compared to the overall population. Either way, these are closer to the low estimates from GWAS and GREML (7% direct, 20% total), than to the higher estimates from twin studies (40%, generally presumed direct). And we can no longer use contributions from rare variants to paper over the difference. So what is going on? It seems like we have to accept one of three possibilities: Either something is wrong with twin studies. Or something is wrong with Sib-Regression and RDR (and then we can explain away GWAS and GREML by saying they’re missing rare variants). Or something is wrong with how we’re thinking about this topic and comparing things. What’s Going On? (Part 1: Is Something Wrong With Twin Studies?) Twin studies have dominated discussion of behavioral genetics for decades, so there’s a vast literature investigating their various assumptions and whether something might be wrong with them. Here are some of the assumptions and what the research says about each. Some of these will be duplicates of the GWAS confounders above, but we’ll go through them again anyway to review how they apply to twins. 1: Parents Treat Fraternal And Identical Twins The Same: Twin studies claim that twins are a uniquely powerful genetic laboratory; both fraternal and identical twin pairs have equally concordant environments, but identical twins have more concordant genes. Therefore, the more similar identical twin pairs are relative to fraternal twin pairs, the more heritable a trait must be. But this conclusion falls apart if identical twin pairs actually have more similar environments than fraternal twin pairs do, maybe because parents (knowing their twins are identical) treat them more similarly than they would fraternal twins. Would-be twin-study-discreditors have been trying to argue that this must be true for decades, but it’s always been a kind of quixotic battle. Remember, twin studies find many behavioral traits like IQ are >60% heritable, so you would need to prove not only that parents treat identical twin pairs differently from fraternal, but that this was an overwhelming effect. Parents of identical twins would have to obsessively expose them to the exact same stimuli in the exact same order; parents of fraternal twins would have to send one to the Gifted Advanced Placement Acceleration program while locking the other in a box and force-feeding them lead pellets. Common sense tells us there are no such differences, and studies confirm this: when parents are wrong about their twins’ status (eg they have fraternal twins, but falsely think they’re identical, or vice versa) their trait similarity matches their real status, rather than the incorrect status that determined how their parents treat them; parental treatment explains less than 1% of why identical twin pairs are more concordant (2, 3, 4). See also Felson 2013, which tries to measure environmental similarity and adjust for it, with minimal effects. Are these two cuties monozygotic or dizygotic? Are you sure? (answer) 2: Fraternal And Identical Twins Have Equally Concordant Uterine Environments: Fraternal twins have different sacs in the uterus and use different placentas. Most identical twins share a placenta, and some share an amniotic sac. If trait similarity is caused by sharing a placenta or sac (maybe because the placenta is defective, the fetal brain is starved of nutrients, and so the person has a lower IQ when they grow up), twin studies would falsely read this identical-fraternal difference as genetic. Luckily this is easy to study; not all identical twins share a placenta or sac, so you can cleanly separate the effect of uterine environment from genetics. If you measure enough traits, you can find small deviations in some, but it’s not clear whether this is just multiple testing, and in any case the deviations are small. The best studies suggest this chips off somewhere between 0 - 3% from heritability estimates9. 3: There is little assortative mating: We discussed this one above in the earlier section on GWAS - smart/pretty/kind/whatever people tend to marry other smart/pretty/kind/whatever people. Why would this bias twin study results? Identical twins share 100% of their genes. Fraternal twins ought to share 50% of their genes - but they get half their genes from their mother, and half from their father. In the degenerate case where the mother and father have exactly the same genes (“would you have sex with your clone?”) even fraternal twins will be extremely similar (although not quite identical, since they’ll get different alleles from each clone). In the more plausible case where mothers and fathers are just a little more alike than chance (eg because smart people tend to marry other smart people), fraternal twins will share a genetic tendency towards a trait somewhat more than their 50% shared genes suggest. Since this makes fraternal twin pairs more (genetically) like identical twin pairs, and twin studies assess heritability as the difference in fraternal-identical-twin-pair concordance, this bias would make twin studies underestimate heritability. But this is the opposite of what you would need to “discredit” twin studies - if this bias is true, then everything is more genetic than twin studies think. And unlike the previous two biases, this one seems real and important, so much so that when you adjust for it, the heritability of educational attainment rises from ~40% to ~50%. I’m only mentioning this one here because some anti-hereditarians argue that you can’t trust twin studies because of assortative mating, without mentioning that this can only bias them down. 4: Population stratification: This is often large and worth worrying about, but it applies to identical and fraternal twin pairs equally, and doesn’t bias twin study heritability estimates much (though it might shift the balance between shared and non-shared environment). See eg the sentence around footnote 30 here. 5: Non-additive / “interaction” effects: These are theoretically interesting, but all research thus far has found they are minimal (1, 2). Some experts think this may miss rarer or harder-to-find interactions; we’ll return to this later. 6: “Genetic nurture”, parent-to-child Mentioned above: if there is a gene for reading books to kids, and reading books raises IQ, it will look like a “gene for IQ”. This isn’t as relevant to twin study estimates of heritability, since both identical twins and fraternal twins are equally related to their parents, and any trait caused by genetic nurture wouldn’t differ between them (and therefore would not falsely appear heritable in this design). Rather, they would appear as shared environment. 7: “Genetic nurture”, sibling-to-sibling That is, suppose your sibling’s traits influence your own development. For example, suppose your sibling has a gene that makes them sabotage your schoolwork, causing you to fail and drop out of school early. An identical twin would share this gene with their sibling more often than a fraternal twin, making it look like a “gene for doing badly at school” (since the people who have it do worse at school than those who don’t). Why are we even talking about this? Do we really think it’s a big part of the variance in behavioral traits? Challenging twin study heritability estimates through this route requires inhabiting a weird no-man’s-land where otherwise-invisible genetic and environmental pathways suddenly flare up when you say the magic words “it was done by a sibling”. For example, this requires a strong effect of shared environment - that is, your educational attainment has to depend on whether you’re being sabotaged or not. But in general, shared environmental effects are weak. And it requires a strong effect of genes - that is, this mechanism only works if your sibling’s tendency to sabotage you is highly genetically determined. But we’re deploying this claim to deny that traits like IQ or educational attainment are highly genetically determined. So to get much out of this, the tendency to sabotage siblings would have to be more genetic than other behavioral traits! The reason this convoluted possibility gets brought up so often is that, unlike the more plausible parent-to-child genetic nurture, twin studies can’t rule it out. So if you really want to deny twin studies, this is one of your best bets. But when investigated, this has effects indistinguishable from zero. I’ve been a bit mean in this whole section, because people really like to dismiss twin studies as “Oh, don’t you know, those depend on assumptions, I bet you never considered that assumptions might be wrong”, and then Gish Gallop you with different assumptions until you give up. But scientists have actually done a lot of really good work checking the assumptions and they mostly hold. An alternative way of validating twin studies (brought up by Noah Carl in this article) is to check them against their close cousins, adoption studies and pedigree studies. Pedigree studies investigate large family trees, and check how trait similarity decreases with genetic distance. They avoid twin specific biases (like different treatment of fraternal vs. identical twin pairs, or different prenatal environments), while adding others like assortative mating. Here are the heritabilities of IQ and EA found in pedigree studies10 (see footnote for sources and caveats, and see also here and here for somewhat similar designs): Adoption studies investigate whether adoptees’ traits are more correlated with their adoptive or biological parents. They avoid a large swathe of biases, at the risk of introducing new adoption-related biases of their own (like the possibility that agencies deliberately place adoptive children with parents who are culturally or behaviorally similar, or the possibility that adoptees were adopted late enough to still get some shared environment from their biological parents). Here are the findings of some of the largest and best11: Both straightforwardly confirmed the larger heritability numbers found in twin studies. I would add the evidence from some less formal “adoption studies”12. During residency, I spent a few months working in a child psychiatric hospital for the worst of the worst - kids who committed murder or rape or something before age 18. Many of these children had similar stories: they were taken from their parents just after birth because the parents were criminals/drug addicts/in jail/abusing them. Then they were adopted out to some extremely nice Christian family whose church told them that God wanted them to help poor little children in need. Then they promptly proceeded to commit crime / get addicted to drugs / go to jail / abuse people, all while those families’ biological children were goody-goodies who never got so much as a school detention. When I met with the families, they would always be surprised that things had gone so badly, insisting that they’d raised them exactly like their own son/daughter and taught them good Christian morals. I had to resist the urge to shove a pile of twin studies in their face. This has left me convinced that behavioral traits are highly heritable to a level that it would be hard for any study to contradict. Ultimate source here. Although the study is confusing about this, I think it’s trying to say that almost 90% of subjects were adopted before age 2. But I don’t think studies do contradict this. Given the degree to which their assumptions have been validated, and the level of confirmation from pedigree and adoption studies, I think they have earned a presumption of accuracy. Doubting the twin studies doesn’t seem like a promising route to reconciling the twin-vs-Sib-Regression/RDR discrepancy. What’s Going On? (Part 2: Is Something Wrong With Sib-Regression And RDR?) Sib-Regression is a clever way of avoiding most biases. Its independent variable - the degree to which some sibling pairs end up with slightly more shared genes than others - is even more random and exogenous than the difference between fraternal and identical twins. It can sometimes have biases related to assortative mating (which would falsely push heritability down), but otherwise it’s pretty good. RDR has many of the same advantages, and allows more diverse relationships and so larger sample sizes. It’s hard to think of ways these methods could be wildly off. There is one caveat: although RDR includes most of the rare and structural variants missed by GWAS, in theory it can miss certain ultra-rare variants which are so uncommon that they aren’t shared between some of the relative pairs used in RDR. De novo variants that occurred during the subject’s own conception would be in this category, if the subject didn’t have children or didn’t pass on that gene13. This seems like a pretty small subcategory of genetic variation, and I wouldn’t normally expect that much of importance to be hiding here, but maybe it’s more important than it seems. RDR also doesn’t include much variance caused by statistical interactions between genes. Although we said above that these are usually found to be insignificant, they might be more important in a trait like intelligence that has been under recent evolutionary selection that lops off easily-detectable sources of variance and leaves only the weird obscure ones behind. There’s limited ability for classical Mendelian dominance to affect common variants, but more complicated genetic interactions might still prove important. Overall these are strong methods, and their failure to converge is troubling. If forced to explain them away, we might tell a story like: So far, there is only one RDR study and a few Sib-Regression studies, so we should wait for more data before updating too hard.
For example, educational attainment is 50% uncorrelated with direct genetic effects. You need to square this to figure out what percent is causal; when you do that, you find that the polygenic score that explained 14% of EA is only 4%pp direct genes, with the other 10%pp being nondirect5 confounders. So yes, it seems like most polygenic scores that don’t validate within families are confounded. However unhappy we previously were that we had only found 14% of genes for EA (vs. 40% expected), we should now be much more unhappy - we really only know 4% of genes that directly cause EA. On the other hand, you might say - so before we only knew 14%pp out of 40%. Now we only know 4%pp out of 40%. This is discouraging, but it doesn’t fundamentally change what we know about nature vs. nurture. Both 4%pp and 14%pp are less than 40% - with either number, we must be missing something or doing something wrong. Probably that’s insufficient sample size. We’ll keep working on sample size and other things, and eventually scrounge up the missing 26%pp or 36%pp or whatever of the variance, so this doesn’t change anything. All it means is that one predictive method that the average person never knew about in the first place doesn’t work as well as we thought. Who cares? Not doctors. So far this research has only just barely begun to reach the clinic. But also, all doctors want to do is predict things (like heart attack risk). They don’t care if they use causal vs. nondirect genes. It doesn’t matter if you’re “only” at higher risk of heart attack because you’re black, or Norman, or because your parents read books to you - you still need more heart attack medication! Polygenic embryo selection companies should care. They offer polygenic scores that can be used to select healthier or smarter embryos. If the predictors they use rely partly on variants that aren’t causal within families, their real benefits could be far lower than advertised. I talked to one of these companies, who said they’d already adjusted for these effects and expected their competitors had too - the proper antidote to this problem, sibling controls, is a natural choice when you’re literally picking between siblings. The biggest losers are the epidemiologists. They had started using polygenic predictors as a novel randomization method; suppose, for example, you wanted to study whether smoking causes Alzheimers. If you just checked how many smokers vs. nonsmokers got Alzheimers, your result would be vulnerable to bias; maybe poor people smoke more and get more Alzheimers. But (they hoped) you might be able to check whether people with the genes for smoking get more Alzheimers. Poverty can’t make you have more or fewer genes! This was a neat idea, but if the polygenic predictors are wrong about which genes cause smoking and what effect size they have, then the less careful among these results will need to be re-examined. But the reason I spent so much time on the subject here is that this has confused a lot of people into thinking heritability itself was confounded and is actually just 4%. When I read my first few blog posts on these findings, I came away thinking they were claiming to have discredited twin studies and heritability. And although I take partial ownership of my own poor reading comprehension, I maintain that the way that the new anti-hereditarians discuss this is pretty bad. For example, Turkheimer’s treatment of the Tan study above is called Is Tan Et Al The End Of Social Science Genomics?, and includes passages like: The median [direct genomic effect] heritability for behavioral phenotypes is .048. Let that sink in for a second. How different would the modern history of behavior genetics be if back in the 80s one study after another had shown that the heritability of behavior was around .05? When Arthur Jensen wrote about IQ, he usually used a figure of .8 for the heritability of intelligence. I know that the relationship between twin heritabilities and SNP heritabilities is complicated, and in fact the DGE heritability of ability is one of the higher ones, at .2336. But still, it seems to me that the appropriate conclusion from these results is that among people who don’t have an identical twin, genomic information is a statistically non-zero but all in all relatively minor contributor to behavioral differences. And comments included things like: I don’t know if [this study] is the end of social science genomics, but it should certainly be the end of attributing significant genetic influence to behavioral traits (despite the recent scientist-generated cartoons touting genes for “income”). And: There's no doubt that this reported findings have dealt a fatal blow to my conviction that behavioral traits are pre-eminently heritable…This is a remarkable example of an objective statistical fact mercilessly crushing the more subjective experiential sense of "A looks and acts more like B than C because A and B have the same parents." This subjective evidence is almost unshakable and universal in its application as a tried and tested psychosocial heuristic. And yet, here we are. Turkheimer is either misstating the relationship between polygenic scores and narrow-sense heritability, or at least egging on some very confused people who are doing that, and the dynamic was bad enough that I got confused myself for a while. But even more confusing, the new anti-hereditarians actually are saying that lots of behavioral traits have very low heritability! But this point requires different arguments, only tangentially related to these. So let’s move on to… Is Heritability Genuinely Low? (Part 1: GWAS & GREML) In the mid 2010s, when genome-wide association studies (GWAS) based polygenic predictors were getting better every year, it was easy to hope they might reach 40% and close the “missing heritability”. But since then, progress has stalled. The second-to-last tripling of sample size, from 300K to 1M between 2016 - 2018, increased predictive power from 6% → 12%. The last tripling, from 1M to 3M between 2018 - 2022, only increased predictive power from 12% → 14%. If you graph sample size vs. predictive power, it looks like there's an asymptote between 15 - 20% or so. (of which - remember - only 5% is directly causal!) Worse, a mid-2010s technique called GREML allowed researchers to estimate the percent of variance in a trait that comes from the sorts of common genes studied in GWAS, without having to identify the genes involved. A 2016 GREML paper suggested that the maximum share of variance that GWASs of educational attainment could ever discover was about 21% (again, compared to 40% predicted genetic from twin studies). Since unavoidable methodological issues will prevent GWASs from reaching the literal maximum possible, this agrees with the evidence suggesting an asymptote between 15 - 20%. So either twin studies are wrong and traits are less heritable than believed, or the heritability must lie somewhere other than the common genes identifiable by GWAS. What about rare genes? GWASs focus on genetic variation common enough to be worth including in a basic genetic test. Most of this is single nucleotide polymorphisms (“SNPs”). A single nucleotide is one letter of DNA - for example, a C or a G. Polymorphisms are genes that commonly vary in humans - sometimes across races (for example, some humans have a gene for light skin, and other humans have a gene for dark skin), and other times within races (for example, some white people have a gene that makes cilantro taste like soap, and others don’t). So SNPs are single-letter spots in DNA where different people often have different letters. How often? Some people say 1%, but the more practical definition is “often enough that someone has noticed and added it to the test panel”. There are three billion letters in the genome, of which only a few million are commonly-tested SNPs. But these SNP studies have limited7 ability to measure personal mutations and rare variants. Sometimes your parents’ egg and sperm cells mess up copying a nucleotide of DNA, and you get a mutation that isn’t inherited from your ethnic group or even from your subgroup/family line - it’s just some idiosyncratic DNA change that you might be the first person in history to have. Since scientists have never seen this mutation before, they don’t know about it and can’t test for it without doing something more expensive than a simple SNP screen. And SNP studies have limited ability to detect anything more complicated than a single letter changing to another single letter. But some mutations are more complicated structural variants. For example, some bits of DNA get stuck on repeat - one person might have GATGAT, another person might have GATGATGATGAT, and a third person might have fifty GATs in a row. Other bits come out backwards. Sometimes a whole chunk of DNA goes missing, or moves to the wrong place. Occasionally a gene reads The Selfish Gene by Richard Dawkins, takes it too seriously, and evolves some ridiculous trick for spamming itself all over the genome. So if even the best molecular studies seem to be asymptoting around 15-20% of variance in educational attainment, but twin studies suggest it’s 40% genetic, might rare variants and structural variants make up the missing 20-25%pp? This remains a topic of bitter disagreement. On the one side, hereditarians bring up a Darwinian argument: imagine a genetic engineer who hopes to find the genes for educational attainment and edit them to make everyone smart and successful. She looks harder and harder, becoming more and more exasperated as they fail to materialize. Finally, she realizes she’s been scooped: evolution has been working on the same project, and has a 100,000 year head start. In the context of intense, recent selection for intelligence, we should expect evolution to have already found (and eliminated) the most straightforward, easy-to-find genes for low intelligence. Therefore, everything left should be convoluted or hidden or impossible to work with. So although this requires a sort of god-of-the-gaps argument - where we keep pushing heritability into whatever genes are too weird for existing techniques to detect - there are some reasons to think God really is in the gaps here. And a 2017 paper uses some clever techniques to estimate the share of intelligence variation lurking in hard-to-measure genes and finds it’s more than half: “By capturing these additional genetic effects, our models closely approximate the heritability estimates from twin studies for intelligence and education.” (see also Wainschtein 2022, Sidorenko 2024) The anti-hereditarians disagree. They cite papers like Zeng which measure the strength of selection on intelligence and suggest that it’s too weak to concentrate so much of the variation in rare genes8. And Sasha Gusev mentions Weiner 2023, which finds that in fact rare variants “explain 1.3% (SE = 0.03%) of phenotypic variance on average – much less than common variants” (other experts say that burden heritability only captures some rare variants and is not the right tool for this problem). But it may not even matter, because another set of findings suggests that heritability is genuinely low even when the rare variants are counted. Is Heritability Genuinely Low? (Part 2: Sib-Regression and RDR) Two newer methods, Sib-Regression and RDR, ask: using what we know from genetic studies, how much genetic variation do we think exists, total, across both common and rare genes? On average siblings share 50% of genes. But there’s a little randomness in meiosis, so some siblings might share 40% and others might share 60%. The more genetic influence on a trait, the more similar sibling pairs who share 60% of their genes will be, compared to sibling pairs who only share 40% of their genes. Since 60%-gene siblings and 40%-gene siblings are both equally part of the same family, you can use these numbers to calculate heritability unconfounded by a range of family factors. This is Sib-Regression. If you do a more complicated statistical process to extend the same idea to relatives other than siblings, it’s relatedness disequilibrium regression or RDR. GWAS asks: Looking at common easy-to-study genes, how much variation in a trait have we explained right now? GREML asks: looking at common easy-to-study genes, how much variation could we ever explain? But sib-regression and RDR ask a question more like twin studies: considering all genes, whether common / rare / easy-to-study / hard-to-study, how much variation is there total? This could address the rare variant objection mentioned above. And in many ways, these techniques are better than twin studies - Sib-Regression eliminates many potential biases, and RDR eliminates even more (although it’s harder to pull off, requiring more genetic information and computational resources). These techniques are new and hard-to-use, and only a few published studies have applied them to the sorts of behavioral traits we’re interested in: Young et al (2018) did Sib-Regression and RDR to genetic data from Iceland. Sib-regression found educational attainment = 40% (±15%) heritable, and RDR found 17% (±9%) heritable. Kemper et al (2021) did Sib-Regression only to genetic data from Britain. It found educational attainment = 14% heritable. This number conflicts with the 40% from the Young paper. Why? Unclear, but it could be selection bias - Young’s Icelandic sample was representative of the country; Kemper’s British population were Biobank volunteers who tend tend to be healthier and higher-class than the population at large. Upper-class people may have restricted range in educational attainment, or different factors affecting their educational attainment compared to the overall population. Either way, these are closer to the low estimates from GWAS and GREML (7% direct, 20% total), than to the higher estimates from twin studies (40%, generally presumed direct). And we can no longer use contributions from rare variants to paper over the difference. So what is going on? It seems like we have to accept one of three possibilities: Either something is wrong with twin studies. Or something is wrong with Sib-Regression and RDR (and then we can explain away GWAS and GREML by saying they’re missing rare variants). Or something is wrong with how we’re thinking about this topic and comparing things. What’s Going On? (Part 1: Is Something Wrong With Twin Studies?) Twin studies have dominated discussion of behavioral genetics for decades, so there’s a vast literature investigating their various assumptions and whether something might be wrong with them. Here are some of the assumptions and what the research says about each. Some of these will be duplicates of the GWAS confounders above, but we’ll go through them again anyway to review how they apply to twins. 1: Parents Treat Fraternal And Identical Twins The Same: Twin studies claim that twins are a uniquely powerful genetic laboratory; both fraternal and identical twin pairs have equally concordant environments, but identical twins have more concordant genes. Therefore, the more similar identical twin pairs are relative to fraternal twin pairs, the more heritable a trait must be. But this conclusion falls apart if identical twin pairs actually have more similar environments than fraternal twin pairs do, maybe because parents (knowing their twins are identical) treat them more similarly than they would fraternal twins. Would-be twin-study-discreditors have been trying to argue that this must be true for decades, but it’s always been a kind of quixotic battle. Remember, twin studies find many behavioral traits like IQ are >60% heritable, so you would need to prove not only that parents treat identical twin pairs differently from fraternal, but that this was an overwhelming effect. Parents of identical twins would have to obsessively expose them to the exact same stimuli in the exact same order; parents of fraternal twins would have to send one to the Gifted Advanced Placement Acceleration program while locking the other in a box and force-feeding them lead pellets. Common sense tells us there are no such differences, and studies confirm this: when parents are wrong about their twins’ status (eg they have fraternal twins, but falsely think they’re identical, or vice versa) their trait similarity matches their real status, rather than the incorrect status that determined how their parents treat them; parental treatment explains less than 1% of why identical twin pairs are more concordant (2, 3, 4). See also Felson 2013, which tries to measure environmental similarity and adjust for it, with minimal effects. Are these two cuties monozygotic or dizygotic? Are you sure? (answer) 2: Fraternal And Identical Twins Have Equally Concordant Uterine Environments: Fraternal twins have different sacs in the uterus and use different placentas. Most identical twins share a placenta, and some share an amniotic sac. If trait similarity is caused by sharing a placenta or sac (maybe because the placenta is defective, the fetal brain is starved of nutrients, and so the person has a lower IQ when they grow up), twin studies would falsely read this identical-fraternal difference as genetic. Luckily this is easy to study; not all identical twins share a placenta or sac, so you can cleanly separate the effect of uterine environment from genetics. If you measure enough traits, you can find small deviations in some, but it’s not clear whether this is just multiple testing, and in any case the deviations are small. The best studies suggest this chips off somewhere between 0 - 3% from heritability estimates9. 3: There is little assortative mating: We discussed this one above in the earlier section on GWAS - smart/pretty/kind/whatever people tend to marry other smart/pretty/kind/whatever people. Why would this bias twin study results? Identical twins share 100% of their genes. Fraternal twins ought to share 50% of their genes - but they get half their genes from their mother, and half from their father. In the degenerate case where the mother and father have exactly the same genes (“would you have sex with your clone?”) even fraternal twins will be extremely similar (although not quite identical, since they’ll get different alleles from each clone). In the more plausible case where mothers and fathers are just a little more alike than chance (eg because smart people tend to marry other smart people), fraternal twins will share a genetic tendency towards a trait somewhat more than their 50% shared genes suggest. Since this makes fraternal twin pairs more (genetically) like identical twin pairs, and twin studies assess heritability as the difference in fraternal-identical-twin-pair concordance, this bias would make twin studies underestimate heritability. But this is the opposite of what you would need to “discredit” twin studies - if this bias is true, then everything is more genetic than twin studies think. And unlike the previous two biases, this one seems real and important, so much so that when you adjust for it, the heritability of educational attainment rises from ~40% to ~50%. I’m only mentioning this one here because some anti-hereditarians argue that you can’t trust twin studies because of assortative mating, without mentioning that this can only bias them down. 4: Population stratification: This is often large and worth worrying about, but it applies to identical and fraternal twin pairs equally, and doesn’t bias twin study heritability estimates much (though it might shift the balance between shared and non-shared environment). See eg the sentence around footnote 30 here. 5: Non-additive / “interaction” effects: These are theoretically interesting, but all research thus far has found they are minimal (1, 2). Some experts think this may miss rarer or harder-to-find interactions; we’ll return to this later. 6: “Genetic nurture”, parent-to-child Mentioned above: if there is a gene for reading books to kids, and reading books raises IQ, it will look like a “gene for IQ”. This isn’t as relevant to twin study estimates of heritability, since both identical twins and fraternal twins are equally related to their parents, and any trait caused by genetic nurture wouldn’t differ between them (and therefore would not falsely appear heritable in this design). Rather, they would appear as shared environment. 7: “Genetic nurture”, sibling-to-sibling That is, suppose your sibling’s traits influence your own development. For example, suppose your sibling has a gene that makes them sabotage your schoolwork, causing you to fail and drop out of school early. An identical twin would share this gene with their sibling more often than a fraternal twin, making it look like a “gene for doing badly at school” (since the people who have it do worse at school than those who don’t). Why are we even talking about this? Do we really think it’s a big part of the variance in behavioral traits? Challenging twin study heritability estimates through this route requires inhabiting a weird no-man’s-land where otherwise-invisible genetic and environmental pathways suddenly flare up when you say the magic words “it was done by a sibling”. For example, this requires a strong effect of shared environment - that is, your educational attainment has to depend on whether you’re being sabotaged or not. But in general, shared environmental effects are weak. And it requires a strong effect of genes - that is, this mechanism only works if your sibling’s tendency to sabotage you is highly genetically determined. But we’re deploying this claim to deny that traits like IQ or educational attainment are highly genetically determined. So to get much out of this, the tendency to sabotage siblings would have to be more genetic than other behavioral traits! The reason this convoluted possibility gets brought up so often is that, unlike the more plausible parent-to-child genetic nurture, twin studies can’t rule it out. So if you really want to deny twin studies, this is one of your best bets. But when investigated, this has effects indistinguishable from zero. I’ve been a bit mean in this whole section, because people really like to dismiss twin studies as “Oh, don’t you know, those depend on assumptions, I bet you never considered that assumptions might be wrong”, and then Gish Gallop you with different assumptions until you give up. But scientists have actually done a lot of really good work checking the assumptions and they mostly hold. An alternative way of validating twin studies (brought up by Noah Carl in this article) is to check them against their close cousins, adoption studies and pedigree studies. Pedigree studies investigate large family trees, and check how trait similarity decreases with genetic distance. They avoid twin specific biases (like different treatment of fraternal vs. identical twin pairs, or different prenatal environments), while adding others like assortative mating. Here are the heritabilities of IQ and EA found in pedigree studies10 (see footnote for sources and caveats, and see also here and here for somewhat similar designs): Adoption studies investigate whether adoptees’ traits are more correlated with their adoptive or biological parents. They avoid a large swathe of biases, at the risk of introducing new adoption-related biases of their own (like the possibility that agencies deliberately place adoptive children with parents who are culturally or behaviorally similar, or the possibility that adoptees were adopted late enough to still get some shared environment from their biological parents). Here are the findings of some of the largest and best11: Both straightforwardly confirmed the larger heritability numbers found in twin studies. I would add the evidence from some less formal “adoption studies”12. During residency, I spent a few months working in a child psychiatric hospital for the worst of the worst - kids who committed murder or rape or something before age 18. Many of these children had similar stories: they were taken from their parents just after birth because the parents were criminals/drug addicts/in jail/abusing them. Then they were adopted out to some extremely nice Christian family whose church told them that God wanted them to help poor little children in need. Then they promptly proceeded to commit crime / get addicted to drugs / go to jail / abuse people, all while those families’ biological children were goody-goodies who never got so much as a school detention. When I met with the families, they would always be surprised that things had gone so badly, insisting that they’d raised them exactly like their own son/daughter and taught them good Christian morals. I had to resist the urge to shove a pile of twin studies in their face. This has left me convinced that behavioral traits are highly heritable to a level that it would be hard for any study to contradict. Ultimate source here. Although the study is confusing about this, I think it’s trying to say that almost 90% of subjects were adopted before age 2. But I don’t think studies do contradict this. Given the degree to which their assumptions have been validated, and the level of confirmation from pedigree and adoption studies, I think they have earned a presumption of accuracy. Doubting the twin studies doesn’t seem like a promising route to reconciling the twin-vs-Sib-Regression/RDR discrepancy. What’s Going On? (Part 2: Is Something Wrong With Sib-Regression And RDR?) Sib-Regression is a clever way of avoiding most biases. Its independent variable - the degree to which some sibling pairs end up with slightly more shared genes than others - is even more random and exogenous than the difference between fraternal and identical twins. It can sometimes have biases related to assortative mating (which would falsely push heritability down), but otherwise it’s pretty good. RDR has many of the same advantages, and allows more diverse relationships and so larger sample sizes. It’s hard to think of ways these methods could be wildly off. There is one caveat: although RDR includes most of the rare and structural variants missed by GWAS, in theory it can miss certain ultra-rare variants which are so uncommon that they aren’t shared between some of the relative pairs used in RDR. De novo variants that occurred during the subject’s own conception would be in this category, if the subject didn’t have children or didn’t pass on that gene13. This seems like a pretty small subcategory of genetic variation, and I wouldn’t normally expect that much of importance to be hiding here, but maybe it’s more important than it seems. RDR also doesn’t include much variance caused by statistical interactions between genes. Although we said above that these are usually found to be insignificant, they might be more important in a trait like intelligence that has been under recent evolutionary selection that lops off easily-detectable sources of variance and leaves only the weird obscure ones behind. There’s limited ability for classical Mendelian dominance to affect common variants, but more complicated genetic interactions might still prove important. Overall these are strong methods, and their failure to converge is troubling. If forced to explain them away, we might tell a story like: So far, there is only one RDR study and a few Sib-Regression studies, so we should wait for more data before updating too hard.
Are these two cuties monozygotic or dizygotic? Are you sure? (answer) 2: Fraternal And Identical Twins Have Equally Concordant Uterine Environments: Fraternal twins have different sacs in the uterus and use different placentas. Most identical twins share a placenta, and some share an amniotic sac. If trait similarity is caused by sharing a placenta or sac (maybe because the placenta is defective, the fetal brain is starved of nutrients, and so the person has a lower IQ when they grow up), twin studies would falsely read this identical-fraternal difference as genetic. Luckily this is easy to study; not all identical twins share a placenta or sac, so you can cleanly separate the effect of uterine environment from genetics. If you measure enough traits, you can find small deviations in some, but it’s not clear whether this is just multiple testing, and in any case the deviations are small. The best studies suggest this chips off somewhere between 0 - 3% from heritability estimates9. 3: There is little assortative mating: We discussed this one above in the earlier section on GWAS - smart/pretty/kind/whatever people tend to marry other smart/pretty/kind/whatever people. Why would this bias twin study results? Identical twins share 100% of their genes. Fraternal twins ought to share 50% of their genes - but they get half their genes from their mother, and half from their father. In the degenerate case where the mother and father have exactly the same genes (“would you have sex with your clone?”) even fraternal twins will be extremely similar (although not quite identical, since they’ll get different alleles from each clone). In the more plausible case where mothers and fathers are just a little more alike than chance (eg because smart people tend to marry other smart people), fraternal twins will share a genetic tendency towards a trait somewhat more than their 50% shared genes suggest. Since this makes fraternal twin pairs more (genetically) like identical twin pairs, and twin studies assess heritability as the difference in fraternal-identical-twin-pair concordance, this bias would make twin studies underestimate heritability. But this is the opposite of what you would need to “discredit” twin studies - if this bias is true, then everything is more genetic than twin studies think. And unlike the previous two biases, this one seems real and important, so much so that when you adjust for it, the heritability of educational attainment rises from ~40% to ~50%. I’m only mentioning this one here because some anti-hereditarians argue that you can’t trust twin studies because of assortative mating, without mentioning that this can only bias them down. 4: Population stratification: This is often large and worth worrying about, but it applies to identical and fraternal twin pairs equally, and doesn’t bias twin study heritability estimates much (though it might shift the balance between shared and non-shared environment). See eg the sentence around footnote 30 here. 5: Non-additive / “interaction” effects: These are theoretically interesting, but all research thus far has found they are minimal (1, 2). Some experts think this may miss rarer or harder-to-find interactions; we’ll return to this later. 6: “Genetic nurture”, parent-to-child Mentioned above: if there is a gene for reading books to kids, and reading books raises IQ, it will look like a “gene for IQ”. This isn’t as relevant to twin study estimates of heritability, since both identical twins and fraternal twins are equally related to their parents, and any trait caused by genetic nurture wouldn’t differ between them (and therefore would not falsely appear heritable in this design). Rather, they would appear as shared environment. 7: “Genetic nurture”, sibling-to-sibling That is, suppose your sibling’s traits influence your own development. For example, suppose your sibling has a gene that makes them sabotage your schoolwork, causing you to fail and drop out of school early. An identical twin would share this gene with their sibling more often than a fraternal twin, making it look like a “gene for doing badly at school” (since the people who have it do worse at school than those who don’t). Why are we even talking about this? Do we really think it’s a big part of the variance in behavioral traits? Challenging twin study heritability estimates through this route requires inhabiting a weird no-man’s-land where otherwise-invisible genetic and environmental pathways suddenly flare up when you say the magic words “it was done by a sibling”. For example, this requires a strong effect of shared environment - that is, your educational attainment has to depend on whether you’re being sabotaged or not. But in general, shared environmental effects are weak. And it requires a strong effect of genes - that is, this mechanism only works if your sibling’s tendency to sabotage you is highly genetically determined. But we’re deploying this claim to deny that traits like IQ or educational attainment are highly genetically determined. So to get much out of this, the tendency to sabotage siblings would have to be more genetic than other behavioral traits! The reason this convoluted possibility gets brought up so often is that, unlike the more plausible parent-to-child genetic nurture, twin studies can’t rule it out. So if you really want to deny twin studies, this is one of your best bets. But when investigated, this has effects indistinguishable from zero. I’ve been a bit mean in this whole section, because people really like to dismiss twin studies as “Oh, don’t you know, those depend on assumptions, I bet you never considered that assumptions might be wrong”, and then Gish Gallop you with different assumptions until you give up. But scientists have actually done a lot of really good work checking the assumptions and they mostly hold. An alternative way of validating twin studies (brought up by Noah Carl in this article) is to check them against their close cousins, adoption studies and pedigree studies. Pedigree studies investigate large family trees, and check how trait similarity decreases with genetic distance. They avoid twin specific biases (like different treatment of fraternal vs. identical twin pairs, or different prenatal environments), while adding others like assortative mating. Here are the heritabilities of IQ and EA found in pedigree studies10 (see footnote for sources and caveats, and see also here and here for somewhat similar designs): Adoption studies investigate whether adoptees’ traits are more correlated with their adoptive or biological parents. They avoid a large swathe of biases, at the risk of introducing new adoption-related biases of their own (like the possibility that agencies deliberately place adoptive children with parents who are culturally or behaviorally similar, or the possibility that adoptees were adopted late enough to still get some shared environment from their biological parents). Here are the findings of some of the largest and best11: Both straightforwardly confirmed the larger heritability numbers found in twin studies. I would add the evidence from some less formal “adoption studies”12. During residency, I spent a few months working in a child psychiatric hospital for the worst of the worst - kids who committed murder or rape or something before age 18. Many of these children had similar stories: they were taken from their parents just after birth because the parents were criminals/drug addicts/in jail/abusing them. Then they were adopted out to some extremely nice Christian family whose church told them that God wanted them to help poor little children in need. Then they promptly proceeded to commit crime / get addicted to drugs / go to jail / abuse people, all while those families’ biological children were goody-goodies who never got so much as a school detention. When I met with the families, they would always be surprised that things had gone so badly, insisting that they’d raised them exactly like their own son/daughter and taught them good Christian morals. I had to resist the urge to shove a pile of twin studies in their face. This has left me convinced that behavioral traits are highly heritable to a level that it would be hard for any study to contradict. Ultimate source here. Although the study is confusing about this, I think it’s trying to say that almost 90% of subjects were adopted before age 2. But I don’t think studies do contradict this. Given the degree to which their assumptions have been validated, and the level of confirmation from pedigree and adoption studies, I think they have earned a presumption of accuracy. Doubting the twin studies doesn’t seem like a promising route to reconciling the twin-vs-Sib-Regression/RDR discrepancy. What’s Going On? (Part 2: Is Something Wrong With Sib-Regression And RDR?) Sib-Regression is a clever way of avoiding most biases. Its independent variable - the degree to which some sibling pairs end up with slightly more shared genes than others - is even more random and exogenous than the difference between fraternal and identical twins. It can sometimes have biases related to assortative mating (which would falsely push heritability down), but otherwise it’s pretty good. RDR has many of the same advantages, and allows more diverse relationships and so larger sample sizes. It’s hard to think of ways these methods could be wildly off. There is one caveat: although RDR includes most of the rare and structural variants missed by GWAS, in theory it can miss certain ultra-rare variants which are so uncommon that they aren’t shared between some of the relative pairs used in RDR. De novo variants that occurred during the subject’s own conception would be in this category, if the subject didn’t have children or didn’t pass on that gene13. This seems like a pretty small subcategory of genetic variation, and I wouldn’t normally expect that much of importance to be hiding here, but maybe it’s more important than it seems. RDR also doesn’t include much variance caused by statistical interactions between genes. Although we said above that these are usually found to be insignificant, they might be more important in a trait like intelligence that has been under recent evolutionary selection that lops off easily-detectable sources of variance and leaves only the weird obscure ones behind. There’s limited ability for classical Mendelian dominance to affect common variants, but more complicated genetic interactions might still prove important. Overall these are strong methods, and their failure to converge is troubling. If forced to explain them away, we might tell a story like: So far, there is only one RDR study and a few Sib-Regression studies, so we should wait for more data before updating too hard.
“Do you feel like you’ve shifted to less ambitious forms of writing with the new Substack?”, which dates the decline to 2021
“Do you feel like you’ve shifted to less ambitious forms of writing with the new Substack?”, which dates the decline to 2021 Quite a few people responded in the comments that Scott’s writing hadn’t changed, but it was the experience of being a commentor which had worsened. For example, David Friedman, a prolific commentor on the blog in the SSC-era, writes: A lot of what I liked about SSC was the commenting community, and I find the comments here less interesting than they were on SSC, fewer interesting arguments, which is probably why I spend more time on [an alternative forum] than on ACX. Similarly, kfix seems to be a long-time lurker (from as early as 2016) who has become more active in the ACX-era, writes: I would definitely agree that the commenting community here is 'worse' than at SSC along the lines you describe, along with the also unwelcome hurt feelings post whenever Scott makes an offhand joke about a political/cultural topic. And of course, this position wasn’t unanimous. Verbamundi Consulting is a true lurker who has only ever made one post on the blog – this one: Ok, I've been lurking for a while, but I have to say: I don't think you suck… You have a good variety of topics, your commenting community remains excellent, and you're one of the few bloggers I continue to follow. The ACX Commentariat is somewhat unique in that it self-styles itself as a major reason to come and read Scott’s writing – Scott offers up some insights on an issue, and then the comments section engages unusually open and unusually respectful discussion of the theme, and the total becomes greater than the sum of the parts. Therefore, if the Commentariat has declined in quality it may disproportionately affect people’s experience of Scott’s posts. The joint value of each Scott-plus-Commentariat offering declines if the Commentariat are not pulling their weight, even if Scott himself remains just as good as ever. In Why Do I Suck? Scott suggests that there is weak to no evidence of a decline in his writing quality, so I propose this review as something of a companion piece; is the (alleged) problem with the blog, in fact, staring at us in the mirror? My personal view aligns with Verbamundi Consulting and many other commentors - I’ve enjoyed participating in both the SSC and ACX comments, and I haven’t noticed any decline in Commentariat quality. So, I was extremely surprised to find the data totally contradicted my anecdotal experience, and indicated a very clear dropoff in a number of markers of quality at almost exactly the points Scott mentioned in Why Do I Suck? – one in mid-2016 and one in early 2021 during the switch from SSC to ACX. Setting Out the Case for Decline There’s a pretty basic question that needs to be answered before we compare the Commentariat today to that of yesteryear. That question is - does ‘the Commentariat’ actually exist? It is easy to understand what it means for Scott’s writing to have got better or worse over time, or to track the evolution of a specific commentor’s engagement with the blog. But in order to review ‘the Commentariat’ as a whole we would have to treat it as a single entity with discernible patterns and tendencies. I believe this approach is justified; the Commentariat has a distinct culture, voice and its own unique animal spirits that react to both Scott’s interests and the interests of the external world. Since it is not just generating random noise, it is possible to explore the Commentariat over time to build a case that its overall quality is declining (or not). To demonstrate this, I have displayed below a graph of comments per post across the lifetime of the blogs. It may not be quite fair to say that ‘engagement’ is the same thing as ‘quality’, but I certainly think it raises a question that needs to be answered; something massively affects comment engagement in 2016 and then again in 2021. In this graph, each datapoint represents a month that Scott has been blogging. A typical month will have between 15-20 posts, of which around half will be authored by Scott and half will be ‘authored’ in some way by the Commentariat, which are mostly Open Threads. I’ve averaged by month because certain types of post get much less engagement than others, and so looking at individual posts ended up too noisy to make attractive graphs (the true goal of any honest statistician). The SSC-era is highlighted in blue. You can see that it shows something a bit like a classic sigmoidal adoption curve (but wearing a top hat). Post engagement starts low, before rapidly shooting up in 2014-15. It peaks in April 2016 – which is highlighted in red in this and all subsequent graphs so you can track peak engagement - before dropping back to a steady level of around 400-600 comments per post for the next three years. Notably, the run of posts that most people regard as being the ‘Golden Age’ for Scott’s writing happens much earlier than peak engagement with the comments section. People disagree about where this run of exceptionally good posts in quick succession start and ends, but I think you could safely say it has definitely begun by the time of The Control Group is Out of Control (although I would date it a little earlier, personally) and ends with either The Toxoplasmosa of Rage or Untitled – basically 2014 has a high density of ‘important’ posts.
Complexity of thought – Perhaps the most important feature distinguishing the ACX Commentariat from other, lesser, blogs is that some really smart people comment here and give novel and well-nuanced takes on a topic. If this ever disappeared it would not matter about any of the other three features, because the Commentariat would effectively be dead anyway. To me, these broad categories represent the unique and positive features of the SSC/ACX Commentariat, and the extent to which they are present is a reasonable indicator of comment section quality, especially if they are all present at the same timepoint and that timepoint happens to line up with peak engagement in 2016 (this is foreshadowing). To generate data on the ACX Commentariat, I scraped the comments section of every post Scott has made since 2013. The Old Ones whisper of a blog that existed before even Slate Star Codex, but since I’m not 100% certain we’re encouraged to talk about the older blog (and nobody dates the golden era of Scott’s writing to pre-2013 anyway) I kept my scraping to just the two websites we’re definitely allowed to talk about; Slate Star Codex (SSC) and Astral Codex Ten (ACX). The main points of failure with my scraping were Subscriber-only threads (which my algorithm virtuously refused to read as it wasn’t a subscriber) and battling with the Substack UI to get all the comments to load for me simultaneously on larger threads. Nevertheless, between my incompetent code and the jaunty Substack UI I only dropped a few comments on even very long threads, so I figured the data scrape would be adequate for the use-case I had for it. I then used a bunch more janky code (some written by me, some written by ChatGPT) to try and quantify the levels of depth, freedom, politeness and complexity of each comment. I captured 2460 individual posts, and approximately 1.8m comments. Of the 24,486 unique comment authors, around 40% have made only one comment to the blog. The most prolific poster is the irrepressible Deiseach, at 20,685 contributions. Deiseach is also the only commentor to have made a comment on both the first post in my sample and the last, so has been with the blog a very long time! Only one other commentor has made more contributions than Scott (11,249), and this is John Schilling (11,607). The quality of data on individual users is not great for the ACX era (Substack seems to record missing author data in a few different ways, and sometimes swallow data for no reason) but I’m happy to give the rank ordering of anyone else who cares to know their specific level of clout in this niche community - I myself am the 799th most prolific contributor to the comments section (225 comments). I’m also delighted to share my raw data with anyone interested – the summary statistics per post are here. The scraped comments themselves are about 2Gb so I don’t know where I can host them but if anyone has any ideas (and Scott doesn’t mind) I’ll share them too. I know that some of the post titles seem to have turned into hieroglyphics, but as far as I can tell it is cosmetic only and won’t affect any of the actual data – it is a symptom of a cool hidden feature of Microsoft Excel where it open UTF-8 encoded CSVs in a way that garbles special characters for no particular reason. Considering each of these factors in turn: Depth of engagement with a topic
Last month, a startup called Nucleus took the plunge. They had previously offered 23andMe style genetic tests for adults. Now they announced a partnership with Genomic Prediction focusing on embryos. Although GP would continue to only test for health outcomes, you could forward the raw data from GP to Nucleus, and Nucleus would predict extra traits, including height, BMI, eye color, hair color, ADHD, IQ, and even handedness. Sample Nucleus results. And this week, Herasight4 entered the space with the most impressive disease risk scores yet, an IQ predictor worth 6-95 extra points, and a series of challenges to competitors, whom they call out for insufficient scientific rigor. Their most scathing attack is on Nucleus itself, accusing its predictions of being misleading and unreliable.
Sample Nucleus results. And this week, Herasight4 entered the space with the most impressive disease risk scores yet, an IQ predictor worth 6-95 extra points, and a series of challenges to competitors, whom they call out for insufficient scientific rigor. Their most scathing attack is on Nucleus itself, accusing its predictions of being misleading and unreliable.
Sample Nucleus results. And this week, Herasight4 entered the space with the most impressive disease risk scores yet, an IQ predictor worth 6-95 extra points, and a series of challenges to competitors, whom they call out for insufficient scientific rigor. Their most scathing attack is on Nucleus itself, accusing its predictions of being misleading and unreliable. Let’s start with the science, then move on to the companies and see if we can litigate their dispute. In Theory, All Of This Should Work Polygenic embryo screening is a natural extension of two well-validated technologies: genetic testing of embryos, and polygenic prediction of traits in adults. Genetic testing of embryos has been done for decades, usually to detect chromosomal abnormalities like Down Syndrome or simple single-gene disorders like cystic fibrosis. It’s challenging - you need to take a very small number of cells (often only 5-10) from a tiny proto-placenta that may not have many cells to spare, and extract a readable amount of genetic material from this limited sample - but there are known solutions that mostly work. But most traits are polygenic, requiring information about thousands or tens of thousands of genes to predict. These are too complicated to understand fully at current levels of technology, but some studies have chipped away at the problem and gotten a partial understanding. Often this looks like being able to predict a few percent of the variance in a trait, and determine whether someone’s genetic risk is slightly higher or lower than average. Polygenic prediction of traits in adults is still young and full of hidden pitfalls. Last month, we discussed how some early studies unknowingly conflated direct genetic effects and various confounders6 - for example, they tended to pick up on genes associated with well-off ethnic groups or families who had good health outcomes for social reasons. Pinpointing the direct component requires an additional step where researchers validate their algorithms within families (for example, on pairs of siblings where one has a higher polygenic score than the other) to see how much predictive power remains. This is especially important for embryo selection companies, whose entire value proposition depends on comparing two genomes from the same family. How have they done? It depends on the number of embryos they have to work with; the more embryos, the better you can do by selecting the best. Herasight’s numbers on how breast cancer risk goes down with number of embryos used in selection. A typical round of IVF produces 1-10 embryos (younger women usually = more). Women with polycystic ovarian syndrome (prevalence: 10%) may get as many as 20. For more, you will probably need to do multiple IVF rounds. Here is a table of different companies’ reported risk reductions, slightly adjusted7 for different reporting conventions but otherwise taking all claims at face value (we’ll talk about how wise that is later). Relative risk reduction for five conditions (gray = no data / disputed data). Here baseline is for embryos neither of whose parents have the condition. GP and Orchid both say their technology has improved since reporting these numbers and they will report better numbers soon. GP numbers are not within-family validated and might be lower if they were. Absolute risk after selection for five conditions (gray = no data / disputed data), ibid. Some people might genuinely want to select on a single condition. For example, people with a strong family history of schizophrenia might want to minimize the chance of their children getting the disease; for these people, reducing schizophrenia risk by 58% (while keeping everything else constant) sounds pretty good. Everyone else probably wants a generically healthy embryo with low risk of all conditions. Exactly how this works depends on the customer’s own values - would they prefer an embryo with lower cancer risk to one who will have fewer heart attacks? - and the exact benefits will depend on how parents make that decision. Genomic Prediction and Herasight try to help by providing semi-objective measures of which embryo is overall healthiest according to different conditions’ effects on longevity and patient-rated quality of life. For Genomic Prediction, that’s the “embryo health score” If you selected the single highest-health-score embryo from a set of five, here’s how they’d do: For Herasight, it’s a “polygenic longevity index”. They don’t give exact risk reduction numbers for each disease, saying that it depends too much on a couple’s specific family history, but say that most people gain 1-4 years of healthy life (when I test it on a set of twenty embryos, the the healthiest gets an extra 1.66 years). How much would you pay to give your children an extra 1-4 years of healthy life? This is no longer a hypothetical question. Here are the costs of the companies in this space: Is it worth it? If: You’re already doing IVF
Francis Fukuyama is on Substack; last month he wrote Liberalism Needs Community. As always, read the whole thing and don’t trust my summary, but the key point is:
Now its fame has reached Substack. Ethan Muse presents the case in favor, and Evan Harkness-Murphy the case against, with additional commentary from Dylan and Bentham’s Bulldog. I don’t think any of them have risen to the occasion. Ethan observes the formalities of good debate, but presents such a neatly-packaged story that readers are liable to miss the thousand little threads that trail off the bottom and lead places that are, if anything, even stranger than the original miracle. Evan puts admirable effort into arguing that child-seers could have non-veridical visions, but by the time he gets to the sun miracle itself, he has only a few potshots about crowd psychology and “optical phenomena”. Other skeptics are even worse, barely gesturing at Evan’s piece before redirecting their attention to boasts about how they have totally demolished the credulous fundies, or laments about how cosmically unfair it is that they must take time out of their busy schedules to respond to such idiocy. The final boss of the paranormal deserves more respect!
We will try to at least do better than the other Substackers. But as a stretch goal, I would like to actually advance this 108-year-long conversation.
Then it returned to its normal position, and the previously drenched crowd noticed they were miraculously dry. …then almost every testimonial contains some elements of the consensus story, in approximately the correct order. The case for self-contradiction is that very few testimonials contain all six elements: most are a random subset of those claims. Also, nobody can agree on which colors were involved in (4), or in which order. A believer might argue that if you encounter six different miracles in close succession, they all sort of blend together and you might forget one or two in your accounting. Or you might turn to your friend and ask what they think, and while you’re not looking you miss part of what’s going on. A skeptic might argue that if the sun falls to earth and appears seconds away from crushing you and everyone around you is screaming because they think it’s the end of the world, approximately 100% of people should mention that in their account of what happened that day, and if it’s more like 50%, then you have a problem. Here are some interestingly discordant testimonies that I came across during my search: Antonio dos Ramos Mira, local resident: A quarter of an hour after the rain stopped, he saw that huge crowd of people, in great clamor and almost all kneeling, facing the sun, which had unusual signs, turning around, trembling, observing at the same time that a yellow-reddish color had appeared around him, which was reflected throughout the crowd and on the horizon, with at the same time a weakening of light and an increase in temperature. The crowd, even the unbelievers, said that it was a known miracle. This is in the third person because the priest and clerk conducting the investigation are summarizing an account being given by an illiterate peasant. The witness names one color - yellow-reddish - and doesn’t mention the sun falling to earth. Antonio Maria Menitra, local property owner: It had rained heavily in the morning, and a little after noon, the rain stopped, and he observed a large crowd of people kneeling down and looking at the sun. He also looked and saw different colors in the sun and in the people. No mention of the sun dancing, spinning, shooting off sparks, or approaching the earth. Joao Martia Lucio Serra, lawyer: Already in some candid souls arose the fear that the foretold event might not occur, when suddenly the entire immense crowd stirred at the seer's voice in a significant brouhaha of astonishment and wonder, raising their heads to the sky, where thousands of eyes gazed in amazement at the sun in full blue, visible to all, without the intensity of its rays harming the retina and hindering vision, crowned with various colors, in a rapid rotation, at times seeming to detach itself from the celestial vault, approaching the earth. The spectators, looking at each other, represented themselves to each other as yellow, and on the horizon, reddish-orange, wherever their eyes looked, they saw beams of dim light, affecting an oval shape, seemingly placed at equal distances, and reflecting on the earth. Nobody else mentions the “beams of dim light, affecting an oval shape, seemingly placed at equal distances”. Maria Augusta Saraiva Vieira de Campos, local resident: Our sense of discouragement was profound, when suddenly we heard from all sides: Miracle! Look at the sun! The rain had stopped as if by magic; hats were closed; a warmth was felt as if we had entered a heated greenhouse, and the disk of the sun began to be seen, clearly discernible in the brownish layer that covered the entire sky. The heat increased, and the sun seemed to sink lower and lower, presenting new and varied changes. We saw a silvery veil, rounded in shape, as if it were a full moon; shortly after, it turned to vivid purple, then red, then emerald green, and finally took on its original color. Cries were heard from all sides as it emerged from the sun like a white, shining snow-like shape, without harming the retina, coming toward us, returning to the sun again, and finally hiding for the third time among the clouds. Everyone wept, and prayers, supplications, and acts of faith were heard from many mouths. Now something is coming down off the sun, instead of the sun itself coming down. Also, the colors are purple → red → green. Goncalo Xavier de Almeida Garrett, mathematics professor: 1st: The phenomena lasted about 8 to 10 minutes; 2nd: The sun lost its dazzling brightness, taking on the appearance of the moon and being easily seen; 3rd: The sun, three times during this period, manifested a rotational movement on its periphery, flashing sparks of light on its edges, similar to what happens with the well-known firework wheels; 4th: This rotational movement of the sun's edges, manifested 3 times and 3 times interrupted, was rapid and lasted 8 or 10 minutes, more or less; 5th: Next, the sun took on a violet color and then an orange, spreading these colors over the earth, finally regaining its brightness and splendor, impossible to be seen with the eyes; 6th: It was shortly after noon and near the zenith (which is very important) that these facts occurred. Do mathematicians really number everything they say like this? We saw this account earlier, and in most ways it matches the consensus story. But even though he’s trying to be methodical, he totally fails to mention the sun descending to crush the world. Instead, it’s the rotational movement that happens three times. Also, the colors are violet → orange Luis Antonio Vieira de Magalhaes e Vasconcelos, nobleman: I was absolutely convinced that I would see nothing. I then remembered, as I had remembered many times before, that principle of Gustave Le Bon, which boils down to the hypnotic current that dominates it. I had to be cautious, not to be influenced. This friend of mine, taking out his watch, said to me: there are five minutes left, at one o'clock look at the sun, that was the time announced by the shepherdesses, then you will tell me. My friends shout to me: look, look, but at first I only saw clouds drifting by, leaving the sun uncovered. Suddenly, I see an intensely pink rim, surrounding the sun, which resembled a disc of dull silver, as someone once said, while giving me the impression that it was moving from its original position. Diaphanous, vaporous clouds, somewhat purple, somewhat orange, permeated the air. At various points along the horizon, contrasting with the leaden hue of the sky, I also saw pink and yellow spots. The clamor grew louder and louder. This didn't last seconds: perhaps minutes. As I observed these manifestations, which I never doubted for a moment were due to the Infinite Omnipotence of God, an indescribable impression came over me. Here are the silver disc and the unusual colors (here “pink, purple, and orange”). But the colors are now merely “clouds” and “spots”, and there is nothing about spinning, dancing, or falling to earth. Antonio de Paula, pilgrim from Lisbon: Suddenly the priest looks at the sun and says that the sun in eclipse was not like that. The deponent also looked and saw that the sun gave no light; a white mist hung over it, it was a dull moon. The sun was to the left, with the rest of the sky obscured. Taking his eyes off the sun, he saw the people a very bright red color; and he exclaimed: "Oh, gentlemen, how the people are all red!" And the priest replied: "Are they red scarves?" To which he remarked: "How can that be? So they had all agreed to have red scarves on their backs?!" Then the people appeared the color of gold. The sun's rotational movements were not visible to them. The people on that occasion cried out loudly, kneeling with their hands raised, shouting for Our Lady, not caring about the thick mud, repeatedly invoking Our Lady. The people's impression was extraordinary. This person saw the silver moon-like sun and the color changes (here “red” and “gold”), but nothing else. He explicitly mentions not seeing the rotation. Luis de Andrade de Silva: The globe of the sun, similar to a disc of dull silver, rotated around an imaginary axis, and at that moment, it seemed to descend through the atmosphere, towards the earth, accompanied at times by an extraordinary brightness, and by an intense heat. The sun's rays were said to have yellow, green, blue and purple colors, but I only noticed the yellow color. After a few minutes, during which these phenomena occurred, no one could look at the sun anymore, because its rays hurt the retina. Only those who witnessed these phenomena can evaluate what happened then, but cannot describe them exactly. He says that although he heard other people mention yellow, green, blue, and purple colors, he only saw yellow. Dominic Reis, American traveler: The sun started to roll from one place to another place, and changed blue, yellow, all colors! Then we see the sun come toward the children, toward the tree. Everybody was hollering out. Some start to confess their sins, ‘cause there were no Priests around there . . . even my mother grabbed me to her and started to cry, saying, ‘It is the end of the world! And we see the sun come right into the trees. And then the little children get up and turn around to the people and told the people, ‘Pray and pray hard because everything is going to be all right.’ This person says the sun didn’t merely fall to earth, but went to the children (ie the child-seers) and the tree (the oak where the Virgin was appearing) in particular. At one point, it is specifically located “right [in] the trees”. But in this account, I am getting the impression that the “sun” is some sort of UFO-like object, maybe the size of a large helicopter, which is in a particular place. I can’t tell if other witnesses also thought this and just didn’t describe it clearly, or whether this testimony is discordant. The interviewer (Haffert again) notices this, and asks whether Reis really thinks it was the sun; Reis gives a weird non-answer (“Well, for my part it was the sun . . . but whether just a light or not, there was something there. I know for sure.”) Dominic Reis, continued from elsewhere in his account: As soon as the sun went back in the right place the wind started to blow real hard, but the trees didn’t move at all. The wind was blow, blow and in few minutes the ground was as dry as this floor here. Even our clothes had dried. We were walking here and there, and our clothes... we don’t feel at all. The clothes were dry and looked as though they had just come from the laundry. I believed. I thought: Either I’m out of my mind or this was a miracle, a real miracle. Although many people said their clothes were miraculously dry, Reis is the only one who mentions a miraculous wind. Everyone else says their clothes were dried by a miraculous heat. Reis does not mention heat. Maria dos Santos On October 13th, when Lucia said: "Our Lady is coming!", one of the deponent's daughters, named Maria, was standing on a rock, a meter from the holm oak tree, on the east side, to guard the bow so the people wouldn't damage it. The girl felt a blow to her face, saw a beautiful light near her, and cried out: "Oh! Our Lady!" The deponent looked and saw a star, a ball, not entirely round, like an egg, very beautiful, with the colors of the celestial rainbow, but much more vivid, with a tail of one and a half meters of brilliant colors. It passed very quickly and close to the holm oak tree, and disappeared a hand's breadth from the ground. She saw the sun sinking low. This is maybe the same UFO-like object that Dominic is reporting. In some of the other Fatima apparitions, the Virgin appears to those who cannot see her true form as a ball of light that comes to the tree where the child-seers are waiting. So maybe there were two things going on - the sun in the sky, and a ball of light (the apparition itself) heading back and forth to the tree. Still, if these are really two different phenomena, only these two accounts mention the second one. I don’t really have much that is non-obvious to say about these discordant testimonies. Aside from the ones with the UFO-like object, they seem about as discordant as you would expect from panicked people seeing a real inexplicable phenomenon - with the exception of some people who are absolutely terrified by the falling sun, and other people who don’t mention it at all. 1.4 Dalleur And The Distant Testimonies Maybe the only interesting advance in Fatimology in the last fifty years is Dalleur (2021), the focus of Muse’s Substack post. Dalleur is a philosophy professor at the Pontifical University in Rome, but clearly a multi-talented individual. He seems to lean toward the “miracle” explanation, but asks a fruitful question that nobody else seems to be considering: if it was a miracle, how was it implemented? That is, the real sun obviously didn’t change color or move - this would have been visible around the world, and would probably have fried the Earth. So what did God or the Virgin do, exactly, to produce the appearance of a moving sun? We can imagine two possibilities. First, they could have implemented the miracle through a “prophetic vision”, where they inspire a sort of mass hallucination in the onlookers. Second, they could have created some kind of objectively-real fiery wheel object in the skies above Portugal, and arranged for people to mistake it for the sun. If they did the second, we should be able to pin down where exactly they created it by triangulating distant testimonies Dalleur and I both found four of these: Joaquim Lourenco, schoolboy, 9 miles from Fatima: I feel incapable of describing what I saw. I looked fixedly at the sun which seemed pale and did not hurt my eyes. Looking like a ball of snow, revolving on itself, it suddenly seemed to come down in a zigzag, menacing the earth. Terrified, I ran and hid myself among the people, who were weeping and expecting the end of the world at any moment. It was a crowd which had gathered outside our local village school and we had all left classes and run into the streets because of the cries and surprised shouts of men and women who were in the street in front of the school when the miracle began. There was an unbeliever there who had spent the morning mocking the ‘simpletons’ who had gone off to Fatima just to see an ordinary girl. He now seemed paralyzed, his eyes fixed on the sun. He began to tremble from head to foot, and lifting up his arms, fell on his knees in the mud, crying out to God. But meanwhile the people continued to cry out and to weep, asking God to pardon their sins. We all ran to the two chapels in the village, which were soon filled to overflowing. During those long moments of the solar prodigy, objects around us turned all colors of the rainbow... When the people realized that the danger was over, there was an explosion of joy. Albano Barros, young boy, 12 miles away: I was watching sheep, as was my daily task, and suddenly there, in the direction of Fatima, I saw the sun fall from the sky. I thought it was the end of the world. I was so distracted that I remember nothing but the falling sun. I cannot even remember whether I took the sheep home, whether I ran, or what I did. Guilhermina Lopes da Silva, local resident, 16 miles away: I could not go [to Fatima] because my husband was an unbeliever. I was looking toward the mountain at noon when suddenly I saw a great red flash in the sky. I called two men who were working for us. They, of course, saw it, too. Afonso Vieria, famous writer, 30 miles away On that day of October 13, 1917, without remembering the predictions of the children, I was enchanted by a remarkable spectacle in the sky of a kind I had never seen before. I saw it from this veranda… Dalleur pins these on a map, which I’ve edited slightly for clearer labeling: The furthest report is 34 km (21 miles) away from Fatima, so Dalleur concludes the phenomenon was visible from about this distance. Further, all witnesses outside Fatima said the phenomenon was coming from the direction of Fatima, not from the direction of the sun (which in some cases was directly opposite Fatima)! By triangulating the accounts, Dalleur estimates that the miraculous light source which appeared to be the sun: was probably located above the hills a few km south of the Cova da Iria [in Fatima]. …ie at the spot indicated by the black sun sign in the purple circle on the map. Dalleur moves on to analyzing photographs of the event: He tries to estimate the angle of the shadows, and, from there, the angle of the light source. I cannot entirely follow his calculations, but he finds that there are two light sources - a diffuse source at about 42° elevation, and a point source at about 30°. The 42° source corresponds to the elevation we would expect the sun to be at in southern Portugal on October 13 around solar noon. It’s diffuse because it’s hidden behind clouds, just as it was all morning. So what is the 30° light source? Dalleur suggests it’s whatever object the witnesses are describing as spinning, moving, and changing color. They’re mistaking it for the sun because the real sun is hidden behind clouds. For a bright round sun-sized object in the sky during the day not to be the sun, isn’t really in most people’s hypothesis space. The paper stops here, but I’m not sure why. Given a distance, an angle, an apparent size (the size of the sun disc), and basic trigonometry, you should be able to calculate the object’s elevation and true size. Do this, and you find that the light source is two miles high and about 200 feet in diameter. That’s about the size of a 747, at about half the 747’s usual cruising altitude. What, who did you think God drafted to play “terrifying spinning fiery disc”? 1.5: Making Sense Of The Testimonies The multitude of testimonies of Fatima may trick us into thinking we understand what the miracle looked like. This complacency deserves to be challenged: “The sun looked pale, like the moon, and was painless to gaze upon”: Most sources treat this as the first aspect of the miracle. Several talk about how unbelievers are going to think it was just fog, but this can’t be true, because the edge of the solar disc was clearly defined, or there was no fog halo, or some other reason like that - and therefore even this first step was clearly miraculous. I feel like I’m going crazy here - I see this regularly! Not often, but a few times a year. When the sun is sort of halfway behind certain types of thin cloud, it looks pale like the moon (I remember, as a child, being uncertain about whether the full moon was somehow out during the day and visible through clouds), is painless to gaze upon, and has a clearly defined edge. Am I hallucinating? I decided to resolve this the same way the new government of Nepal chose its prime minister - via Discord poll: Here’s one of the hits for “sun behind clouds” on Google Images: I don’t know if this is a real picture or used lenses or something, but it’s pretty true to my experience. So why does every previous commentator act as if this is some cosmic mystery to be explained? A few people argue that (although it was a generally cloudy day), the mystery is that the clouds were nowhere near the sun at this point, so they couldn’t have been causing the unusual pallor. But the majority of witnesses say the clouds were absolutely near, or veiling, or even covering the sun. Stanley Jaki makes this a central point of his book, saying that “The great majority of eyewitness accounts, and certainly the most important ones, contain emphatic references to the continued presence of clouds.” I’m going kind of crazy here. I notice that the holdouts on my Discord poll disproportionately come from my non-Californian friends - is this rarer in other locales? I’m not sure. In any case, I will not count this as being one of the mysterious aspects of the miracle requiring explanation. “The sun was spinning”: How can a featureless disc be seen to spin? Despite this being one the most commonly-reported aspects of the miracle, almost nobody explains this point. Some say that only the rim was spinning, but this has the same problem. However, several people compared the sun to a “firework wheel”, also called a “Catherine wheel”. Here is a video of this object, which apparently was well-known in the Portugal of the time: Stanley Jaki relates a story about a priest having this same question and grilling a witness; the witness finally claimed that the sun traced a circle (like a basket in a Ferris wheel) rather than merely rotating. But this contradicts several claims that it “rotated around its own axis”, and I wonder if the witness was intimidated by the seeming contradiction in her story and was trying to weasel out of her own confusion. If we treat the miracle as the result of some kind of illusion, this becomes slightly easier to explain; there are plenty of visual distortions that look like a spinning motion, and since it is the visual field itself that is spinning, rather than any particular object, it can be seen whether the object is a disc or not. “The sun seemed to fall to earth”: In what sense did it seem like this? If the sun had simply gone down in the sky, people would have said it was setting, the same way it does every evening. One witness does say this. Most other witnesses say it was terrifying, and they felt like they (as opposed to other people living near the horizon) were about to be crushed. If the sun had simply gotten bigger - wouldn’t people have just said it looked bigger? Isn’t this a more natural way to record that the sun’s disc seemed to expand? Fr. Jaki combs his selection of witness accounts (larger than mine), but is only able to find one person who says “it got bigger” in so many words, compared to the dozens who talk about it looming, or falling to earth. Some people say that the sun “left the sky” or “left its place in the sky” at this point. In what sense? If the object that appeared to be the sun at Fatima had been visible as an object of a particular size (let’s imagine it as a flying saucer), then not only would this have been remarked upon, but it would have appeared to threaten some parts of the crowd in particular (that is, a descending saucer would look like it was about to land on some specific area). But this is not the consensus description, and several people say they thought the sun might crush the entire world. Several witnesses say it approached Earth with a jerky or zig-zag motion. If I imagine something else approaching Earth - let’s say a jumbo jet or asteroid - I can tell that it’s approaching rather than getting bigger because there’s multiple components to its trajectory that let me separate size change from forward movement. When I think of this aspect, I imagine the sun very suddenly growing in size and brightness to take up a substantial fraction of the sky (maybe >50%?!), maybe with some jerky motion on the side. Although it’s hardly scientific, I was charmed by John Touhey’s project of trying to visualize the miracle by using witness descriptions as prompts for ChatGPT. His work is a year old, and so several GPT iterations out of date. When I repeat his work with the current version, I get these: Interlude: The Anti-Clerical Union As mentioned briefly before, 1910s Portugal was in a period of transition. In 1910, a group of proto-socialist revolutionaries overthrew the monarchy. The monarchy and church had been in cahoots, so the revolutionaries cracked down on Catholicism, closing the monasteries and persecuting the churches. This was a bold move - only an upper crust of educated urbanites were proto-socialist, and 99%+ of the country identified as Catholic, albeit at various levels of religiosity. In the 1920s, conservatives would regain the upper hand, overthrow the proto-socialists and restore a pro-church dictatorship. Still, the small urban educated ruling class of 1910s Portugal was a hotbed of atheistic anti-church sentiment. Probably the child-seers of Fatima were only dimly aware of this, but their prophecies were a spark entering a powder keg, and many of the more worldly witnesses were aware of this context. While reading through Fatima-related documents, I came across some pamphlets by Grupo Anticlerical, one of the era’s leading atheist organizations. They are totally irrelevant to our primary goal of trying to figure out what’s up with the miracle. But I love them so much that I can’t resist adding one as an interlude. I have slightly edited the machine translation for clarity and readability: To defend the sacred freedom of conscience—guaranteed by the original Law of Separation of Church and State—from the furious attacks of implacable Jesuitism—the greatest enemy of all human happiness!—the Anticlerical Group was organized in this town, similar to what is being done in many parts of the country! This was necessary. They call us to fight. We present ourselves courageously! The great, formidable battle of progress against Ultramontane Reaction, of Freedom against Tyranny, of Truth against Lies is waged again with enthusiasm and ardor! The redemptive dawn that the Portuguese people saw emerge on October 5, 1910, is about to be eclipsed, intercepted by the immense flood of black cassocks!... But in the dark night that seeks to envelop Reason; where moral suffering takes on tragic proportions in a frightening asphyxiation, the Light will once again break through!... the consoling light of elevated spirits... and like a sinister scarecrow, the grim reaction will flee in terror! Liberal people! Hear us! This fight is terrible! Many of our people will perhaps be crushed and tortured on the battlefield, but what does it matter?! Every war against reaction is a holy war because it frees consciences from the clutches of their enemies!... It is the fight of Justice against Iniquity, of Love against Hate, of Good against Evil!... To the fight, then, for the Progress that makes life beautiful; for the Freedom that redeems the people; and for the science that guides us all as an eternal beacon to the Light of Truth! Gago Coutinho and Sacadura Cabral [two Portuguese aviators who had recently flown across the Atlantic] are prodigious spirits before whom our souls kneel religiously – boldly breaking through the air with the mathematical certainty of someone who knows the path to be taken to get from one point to another determined point; flying through the immense blue as sure of their route as any of us walking on earth, they showed us that Science is not an empty word! The power of their prodigious sextant, the fruit of immense scientific lucubrations, is more real and positive than the cross of Christ painted on their device, which could not even have saved them from falling due to lack of gasoline in the middle of the sea at the mercy of the waves. Their extraordinary journey, an adventure which moved us to tears, was the most resounding scientific victory of recent times! It was, above all, a powerful affirmation of science! Let us therefore make science our religion, for scientific religion is Freedom of Thought! To be a Free Thinker is to love immortal science, eagerly waiting for it to reveal to us the truth of the great enigmas of the Universe! And only it can reveal them! People! Let us always fight! From the victory of progress, science, freedom, and free thought, will result human happiness, joy, love, fraternity, respect for women, veneration for mothers, adoration for children, affection for the elderly, protection for the sick, the unfortunate, the tortured. The victory of reaction, of clericalism, of black, cruel and ferocious Jesuitism will result in: the gallows, the acts of faith with their human destruction, persecution, exile, robbery, arson, the deflowering of women, the killing of children, the monstrous torture of all free spirits! The history of so many crimes committed in the name of God horrifies us! The Inquisition, relentlessly slaughtering, tearing, and burning the flesh of so many victims, is still today, in the twentieth century, a sinister specter haunting us!... O most holy mothers! O holy, pious mothers who so love your sweet little children! Have compassion on your beautiful little children, sacred fruits of your blessed wombs: Love Freedom! Love Liberty, O loving mothers, immaculate saints of our altar! We pray for them... for your children, who are the light of your candid eyes, the life of your life... for little children... for all children, tender rosebuds that retrogression furiously lashes, – love Liberty!. And you, O parents! Heads of families who so tremble at your loved ones, snatch them from the merciless clutches of the reactionaries who twist their brains and kill their reason! Hear us all, men, women, and children; listen: Freedom writhes in horrible convulsions... it vibrates in space, echoing from mountain to mountain, an anguished cry for help!... It is Freedom that falls, annihilated! It is Freedom that dies in the bloody clutches of Jesuitism! The Miracle of Fatima, people, is a ridiculous lie, it is a comedy, it is not religion! Come on, liberals! Let us all rise up from this criminal apathy and, without delay, fight not the religious sentiment of the Portuguese people, such a good people, a race of heroes, but rather the exploitation that clericalism is inflicting on the people, foisting upon them, at a good price, images of the saint —trademarked to avoid competition from other vampires! —the shamelessness!—and leading them, through suggestion, to wallow and drink madly, the miraculous water, foul, filthy water, full of rot, pus, and pestilent microbes that the sore flesh of the sick leaves deposited there in the washings! We, all as one man, will fight the reaction, forcing it to retreat and thus, with our efforts, we will save the Republic and the Portuguese Land from its fatal annihilation! … …anyway, Interlude over, let’s get back to the miracle. 2: The Skeptical Explanations Re-invigorated by the rousing prose of Grupo Anticlerical, can we come up with a materialist explanation for the sun miracle? 2.1: Pilgrim, Avert Thine Eyes Starting in October 1917, doubters have focused on one obvious possibility: staring at the sun is harmful to your health. If you stare too long, you go blind. If you stare just slightly less long than that . . . maybe something strange happens? Just to get a particular theory out there: everyone knows that if you stare at a bright light source for a few seconds, you get a temporary afterimage - often pink or bluish-green - on your retina. Suppose the pilgrims stared at the sun. Their eyes would inevitably make microsaccades - small natural jerking motions - and the afterimage would appear somewhere slightly different than the true sun. This might look like the sun turning pink or blue and moving in a zig-zag pattern. Believers in the miracle counter this proposal in several ways. First, although it might explain the sun changing colors and dancing, it doesn’t give an explanation for spinning, sparkling, or falling to earth and threatening to crush everybody (exactly three times in a ten minute interval, no less). Second, although witnesses describe the sun changing color, they also describe everything around them changing color to match the sunlight, which doesn’t match localized afterimages. And one scientifically-minded witness specifically describes closing his eyes to see if there was a persistent afterimage; he says there was not. Third, there are no reports of eye injuries or blindness from a crowd that was, supposedly, staring straight at the sun for ten minutes. This is a good match to witness reports (that the sun was unusually pale and didn’t hurt to look at) and with Dalleur’s theory (that it wasn’t the sun). But it’s a bad match to any theory depending on eye injuries. Fourth, this would require Portuguese people to be total idiots. Everyone already knows bright lights cause afterimages. Surely if you stare at the sun for ten minutes and get some afterimages, you’re not going to freak out and start screaming about miracles and the end of the world. Even if the peasants had somehow remained ignorant of afterimages their whole lives, the scientists and doctors in attendance wouldn’t be fooled. If we are to keep this theory, maybe we should posit some retinal phenomenon much stronger than the ones we know. Everyone thinks they know how much an illusion can fool you - “yeah, okay, obviously the cookie that looks very slightly bigger will actually be the same size” - which is exactly why the really good ones, like the Checker Shadow Illusion, come as such a shock. Squares A and B are the same color. Source: Checker shadow illusion. There’s no way around it: we need to hear from someone who has stared directly into the sun. August Meessen was a physics professor at a Catholic university, which sounds like exactly the job profile we want for this sort of thing. He found himself sufficiently interested in the Fatima miracle to stare straight into the sun for a few minutes and record what happened. From his paper: In November 2002, I looked directly into the sun, at about 4 p.m. The sun was relatively low above the horizon and its light intensity was attenuated, although the sky was clear. I was able to look right into the sun and was amazed to see that the sun was immediately converted into a grey disc, surrounded by a brilliant ring. The grey disc was practically uniform, while the surrounding ring was somewhat irregular and flamboyant, but did not extend beyond the solar disk. It coincided with its rim. I stopped the experiment, since I wanted to be prudent, but I had experienced myself the initial phase of a typical “miracle of the sun” and I could explain it. The sun became grey, since my eyes immediately responded to its great luminosity by an automatic reduction of their sensitivity. This adaptation is not simply due to the bleaching of pigments in the colour-sensitive cones of the fovea, where the image of the sun is projected, but to secondary processes. By “initial phase”, he means the part where the sun looks pale and well-defined, like a full moon. This isn’t something I think needs explanation (see above), but he sure has explained it. Moving on: In a second experiment, realized at 3 p.m. in December 2002, I looked straight at the sun during a much longer time. After some minutes, I saw impressive colours, up to 2 or 3 times the diameter of the sun. They changed, but were mainly pink, deep blue, red and green. Further away, the sky became progressively more luminous. I stopped there, since I understood that these colours resulted from the fact that the red, green and blue sensitive pigments are bleached and regenerated at different rates. This is frustratingly vague. Are the “impressive colors up to 2-3 times the diameter of the sun” just the normal aftereffects of staring at a bright object? Or something surprising even to physics professors? And the spinning? What about the motions of the sun? I didn’t see them, because I didn’t look at the sun for a sufficiently long time or my brain knew already too much. Once, after I had been looking at a very long passing train, I had (for about 30 seconds) the illusion of an opposite motion. Joseph Plateau discovered that when we look at the centre of a spiral that is rotating at some given velocity about this point, and when we stop this rotation, we see a reversed rotation. It lasts for several minutes, although in reality, there is no motion at all. This is a good example of motional after-effects. The “dance of the sun” is initiated, however, by a spontaneous generation of apparent motion. This feels suspiciously like a just-so story. His explanation for the sun falling to earth to crush everyone - which he also did not see - is equally ad hoc: A very interesting study was recently devoted to this “zoom and loom effect”. It tends to appear when the brain is confronted with the two-dimensional retinal image of an object that is situated at some unknown distance. The brain will then consider the possibility that it could come closer, by performing an illusory mental zoom, where the apparent size of the object is progressively increased. This results from the fact that evolution preserved the tendency to take into account the possibility of a dangerous approach: a rapid evasive action could be beneficial for survival. If true, it sounds like you should be able to generate this effect not just by staring at the sun (ill-advised, causes blindness), but by staring at the moon. I would like to test this, but unfortunately I am writing this on the night of a new moon; I’ll check back in two weeks. Still, I am skeptical that no human being living before 1917 AD ever figured out that staring at a celestial body long enough would make it appear to fall to earth and crush you. Compare to much gentler illusions - like how the moon looks bigger right when it starts to rise - which everybody knows about. I was able to find a thirdhand report (Fr. Stanley Jaki → G. J. Strangfeld → consultation with bishop) of another sun miracle investigator, one “Professor Dr. Stöckl” in Germany, who made a similar experiment: After almost a minute (the time varies according to the condition of the atmopshere and the momentary condition of the eyes) one thinks to see a dark blue disk in front of the sun (this is already a sign of the highly excited state of the retina). According to my experience … this dark blue disk is somewhat smaller than the solar disk, so that the edge of that disk stands out as a ring beyond that dark blue disk. Then one has right away the impression that the solar disk rotates with great speed in one or the other direction. This I have experienced often enough. All this is a subjective appearance that has nothing to do with the external world. These reports are suggestive, but weaker than all but the barest Fatima testimonials. Dr. Messeen admits as much, saying that “I didn’t look at the sun for a sufficiently long time”. Can we find people even more committed - or reckless, or masochistic - than Professors Messeen and Stöckl? Absolutely yes: there was a whole subfield of late 18th / early 19th century psychophysicists who experimented with staring at the sun for long periods, many of whom went blind. Joseph Plateau (1801 - 1883, went blind in 18432) summarizes their work in his aptly-named On The Contemplation Of Bright Objects. He lists twenty-six scientists who tried staring at the sun for a really long time. Most describe what we now recognize as typical retinal afterimages, and Plateau spends most of his time talking about how long these last and what colors they pass through. The only one of Plateau’s sources who reports anything even slightly interesting to us is Robert Darwin (father of Charles; cf. Secrets of the Great Families). After stating that: The author has frequently observed that when he gazed at the midday sun for a long time, until its disk appeared pale blue, he saw a bright blue specter on other objects for more than two days. …he mentions how When looking at the meridian sun as long as the eyes can well bear its brightness, the disc first becomes pale, with a luminous crescent, which seems to librate from one edge of it to the other owing to the unsteadiness of the eye. Here is pallor, and at least a hint of motion. But it’s pretty different from spinning, and not really clear how it relates to the sun miracle. Gustav Fechner (1801 - 1887, went blind in 1839) may have stared for even longer; you can read more of his story - including his ensuing insanity and subsequent attempts to found a new religion - on Adam Mastroianni’s blog. But all that he records about his ill-fated experiment is that: …after looking at the sun through homogeneously colored lenses, if you close your eyes, the primary impression remains for a long time and the entire afterimage usually disappears without a complementary coloration having clearly emerged. These people are great, and they all sound like minor Sam Kriss characters. But after whole careers dedicated to staring at the sun much longer than any normal person would ever try, they report only the barest hints of odd phenomena. Indeed, if anything they saw less of interest to the Fatimologist than Profs. Messeen and Stöckl. Worse, all of these authorities saw their phenomena after seconds to minutes of deliberate staring. Surely if it had taken a minute of staring at the sun before anything happened, some of our eyewitnesses would have mentioned this; after all, several mention that they were starting to doubt after the child-seers’ deadline had passed a few minutes earlier. But by all accounts, the miracle was near-instantaneous. Although Messeen and Stöckl’s reports of miracle-like phenomena are intriguing, it doesn’t seem like they can be the whole picture. Let’s move on. 2.2: Aurora Borealis? At This Time Of Year? In This Part Of The Country? Localized Entirely Within Your Kitchen? Could the miracle at Fatima have been some kind of weird weather phenomenon? The main argument against is that if it were a common weather phenomenon, it would not have awed and terrified tens of thousands of people. But if it were a rare weather phenomenon, then the seers’ successful prophecy that the rare weather phenomenon would happen at solar noon on October 13 1917 becomes almost as impressive as an outright miracle. The argument in favor is that dozens of people have written books and papers about this possibility, we would feel remiss if we didn’t mention them, and anyway it gives us the opportunity to look at pretty pictures of interesting weather phenomena. This is a sun dog. It’s caused by ice crystals in the upper atmosphere that refract sunlight in a very specific way. It’s very cool, but aside from a resemblance to a wheel, it looks nothing like the miracle of Fatima. A sun dog doesn’t have any unusual colors, it doesn’t change size, and it doesn’t spin (I’ve embedded a YouTube video not because a still image would be misleading - it wouldn’t be - but just in case you want to see for yourself how completely motionless it is). It’s just a halo shape with two smaller illusory suns on either side of the real one - something which no one at Fatima reported. (source) This is a solar corona3; cloud iridescence is a related phenomenon. I don’t know how much work the exposure length is doing in this particular photo, but I’m guessing more than zero. Coronae are also very pretty, and might explain the description of wheels and colors. They seem surprisingly common for something that I can’t ever remember seeing, supposedly happening several times a year in most locations. But they don’t spin, the colors don’t change or stain the surrounding landscape, and they don’t fall to earth and crush people. Let’s keep this one as a backup option and move on. This is a dust storm. Steuart Campbell wrote a paper arguing that the miracle was caused by one of these, and I admit if I saw this I would start praying pretty hard. Dust storms can change the color of the sun (including unusual colors like green or blue). And very, very charitably, whirling dust could look like the sun itself spinning around, and the thickening and thinning of dust could look like the sun approaching or receding. But this would require a dust storm localized to a 20 mile region of Portugal which does not, technically, have any dust (and where it was, technically, raining at the time). Campbell proposes that perhaps a storm blew a 20 miles x 20 mile dust cloud from the Sahara out to the Atlantic, then onto Fatima for ten minutes during a break in the rain, then back to the Atlantic again. But I don’t think any dust storm has ever behaved in quite this way. If it did, it probably wouldn’t be at the exact moment predicted by child-seers months in advance. At this point, we might as well talk about literal meteors. The way I’m imagining it is this: as a meteor approaches Earth, it breaks up into three big parts and a host of smaller particles. They strike the atmosphere head-on, from the approximate direction of the sun. The small particles hit first and make a firework show. Then the three big pieces hit, producing multicolored fireballs (meteors can absolutely stain the sky bright colors - see the video). Finally, they burn out a few miles above the ground, , convincingly producing the appearance of the sun falling to earth and nearly striking the spectators. This could even explain the warmth and dry clothes - a local meteor strike produces a lot of heat! I like this because it’s the only one that takes seriously the facet of the event which most impressed the witnesses - the part where it looked like the sun was plummeting to earth and about to kill them. But against it: would a rain of micrometeorites really look like the sun was “dancing”, “spinning”, or “zig-zagging”? Aren’t most nearby meteor strikes very loud? (the Fatima event was, according to witnesses, silent) Don’t they usually break windows? Aren’t most meteor strikes of this size visible for hundreds of miles, not just the twenty miles from which we have witness testimonies? Wouldn’t the strike have to be remarkably head-on, and remarkable close to the position of the sun, in order to look like a solar phenomenon rather than a long streak? Aren’t most meteor fireballs visible for between a few seconds and a minute, not the ten minutes of the Fatima event4? And if there were some extremely unusual meteor strike that was the exception to everything, wouldn’t it still be pretty surprising for it to happen at the exact time and place predicted by child-seers months in advance? We come to the unpromisingly-titled Derivation of equations of the model of the dynamic behavior of the three-dimensional atmospheric cloud of electrically charged ice crystals under the influence of electrostatic forces, in which Artur Wiroski argues that Fatima was a three-dimensional atmospheric cloud of electrically charged ice crystals under the influence of electrostatic forces. Actually, he offhandedly mentions Fatima in three sentences, with the majority of the paper looking more like the image above - but he eventually makes it into a Guardian article where he emphasizes that yes, he is trying to explain the miracle of the sun. However, if I’m understanding him correctly, he says that his theoretical ice crystal phenomenon can only happen when the sun is at an altitude below 22 degrees. But during the Fatima miracle, the sun was at 42 degrees (and Dalleur’s mysterious light source was at 30 degrees), so none of this applies. I’ve tried to include pictures of all the phenomena I mention in this section. I failed for this one, because it’s never been spotted or photographed. It’s just some incredibly weird thing that one scientist says ice crystals might do if parameters were ever exactly right, with such a precise definition of “exactly right” that it’s never happened in real life. If it ever did happen, it probably wouldn’t be at exactly the moment predicted by child-seers several months in advance. 2.3: Everyone’s Mad Here Except You And Me Another common response calls the Sun Miracle a “mass hallucination”. Can 70,000 people really hallucinate the same thing? “Mass hallucination” on Wikipedia redirects to List Of Mass Panic Cases. The Miracle of the Sun is on there, but listed as “(disputed)” - the only item to earn such a parenthetical. The other fifty items mostly belong to three categories: A disease with unusual symptoms spreads through a population; doctors eventually pronounce it psychosomatic.
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. Most content is free, some is subscriber only; you can subscribe here. Also:
2: The following people still haven’t responded to my email asking them to accept their ACX grant - Lewis W, Alejandro A, Nishank B. If you tried to respond but it didn’t reach me, DM me on Substack or Twitter. Do it quick, or I will include / not include you on the announcement post based on your original privacy preferences.
3: All Non-Book Review finalists and honorable mentions (list at #3 here) should have gotten an email asking you to send me your bios for the announcement post. But I have only gotten 6/20 responses. If you didn’t get it, check your spam folder for scott@slatestarcodex.com. If you still didn’t get it, email me. If I don’t answer, DM me on Substack or Twitter.
Of the 42 grantees, 40 have answered our email asking for confirmation that they still want the grant. I’m still waiting for confirmation emails from Lewis Wall and Nishank B. If you’re reading this and don’t think you got a confirmation email, check your spam folder. If it’s not in your spam folder, email me at scott@slatestarcodex.com. If you can’t reach me or I don’t respond, DM me on Substack or Twitter. I’ll give you until November 1 to get in touch, after which point the grant will be withdrawn. There are also a few projects so deep in stealth I don’t have permission to share their existence; I will mention these as they become public.
JD Bauman, $40K, to help fund Christians For Impact. Christians are a large and charitably-inclined demographic, but tend to bounce off the effective altruist movement after we start talking about becoming bodiless immortal machine-gods. JD and his team of Christian EAs network with churches and introduce them to everything else - all the ideas about how to realign one’s life around helping people in need. They have a blog, a career counseling network, and a conference that recently scored a guest appearance by the Archbishop of Canterbury. Our grant helps them publicize and expand their career counseling work.
1st: Joan of Arc, by William Friedman. William is a history enthusiast and author who lives in California, where he spends his time reading, writing, GMing, playing video games and telling people excitedly about all the horrific stuff he learned in his latest history book. His fiction blog is Palace Fiction (which is currently serializing his first novel, The Tragedy of the Titanium Tyrant) and his nonfiction blog is As Our Days.
2nd: Alpha School, by Edward Nevraumont. Edward also wrote one of last year’s finalists (Silver Age Marvel Comics)1. Now that he’s no longer anonymous, he’s going to write a post on his blog responding to the review comments (712 of them!), as well as a follow-up post on what he has learned about Alpha in the six months since he submitted his review (including the Spring and Fall MAP results for his kids). Here is the landing page with more details for ACX readers who are interested.
Dating Men In The Bay Area, by Alex King. Alex is an engineer from San Francisco. She’ll be experimenting with more essays on her new blog, King of Daydreams. When she’s not igniting turmoil in the ACX comments section, she can be found mentoring young engineers, hosting community events, and failing to find a boyfriend. She pinky-promises she is not Aella.
The spirograph reference here is interesting, because the Baron de Alvaiazere, one of the Fatima witnesses, described what he saw via a spirograph-esque drawing: I didn’t mention it in my post because it seemed to be an extraneous detail, but this reader seems to have independently noticed something similar.
I didn’t mention it in my post because it seemed to be an extraneous detail, but this reader seems to have independently noticed something similar.
I didn’t mention it in my post because it seemed to be an extraneous detail, but this reader seems to have independently noticed something similar. 3: As a child, I was on many boring car rides with no one to talk to. I would stare out the window often, and occasionally, just at the sun. I would do this -specifically- because of this phenomenon- I had always assumed everyone knew/understood this was something that happened. It was surreal reading it described as a mystery. The way it would appear to me is that if I stared at the sun long enough (through a glass car window), there would appear a very strong blue after image (light blue- as a child, I thought it similar to the color of Neptune/Uranus as shown in books). This after image would be the same size as and almost- but not quite- line up with the sun. It would then proceed to circle the actual sun. The image was very crisp, but the movement was not- moving in a sort of ‘pulse’ (imagine very slow animation, the image not smoothly moving but jumping from one position to the next to give the illusion of movement). This movement was centered roughly around the sun, but since the image was offset it gave an appearance of ‘corkscrewing’ or spinning, not a perfect circle (that is, the image overlapped the center of rotation, rather than rotating around it). The circling would continue some time (as a child I remember thinking it went for a long time, as an adult I would guess in reality it was only some seconds, certainly less than a minute), and would end when I either looked away or the sun became too bright and I was forced to shut my eyes … What made me realize this is definitely, in my mind, the same as being described is because as a child I was convinced the image was falling- I did not, as a child- think it was the sun itself, but thought that it might be the planet Neptune (because it was blue and a large orb (appearing as a disc to the eye) somewhere, presumably, in space). But as said, I was at the time concerned it was falling, and would occasionally badger my parents about it- whether it was possible the blue orb I saw in front of the sun was Neptune, and if so whether it was going to hit the earth because it looked like it was coming towards us. I understood it wasn’t something you would see if you just looked at the sun- rather in my child mind, I assumed it was in some way that staring at the sun let me see more clearly things around it, though as I grew older I increasingly understood the image to likely be caused by staring, rather than revealed. I remember as a child sort of knowing it was an afterimage but also that it was much sharper and more clear than most afterimages. 4: I was in a room at the boarding school I used to attend, looking out through the window. I recall it being low in the sky but circumstancially it would have been midday (so I presume winter months, since I don’t recall thinking that was unusual). The sky was fairly clear. I stared at it for what felt like three minutes at the time but was probably in hindsight 45 seconds. I was a bored child (probably about eight or nine) left alone in a room and it seemed like a fun idea to stare at the sun. The sun seemed to become covered by lots of large irregularly shaped black-brown spots, with the light itself shining from cracks between them. It looked kind of like a simplistic video game lava texture. 5: I was looking at the sun because I was young and stupid. It stopped shining but remained white, except for a few sunspots that could be seen by the naked eye and which indicated the sun was rapidly spinning. There were no other unusual experiences. 6: On several occasions outside I have seen my entire visual field become tinted various colors. Ever since I heard about eye fatigue and after-image based illusions I explained this to myself as it being very bright out and the color tint being from my green being worn out (making everything pinkish) or my blue being worn out (making everything greenish yellow). Unlike typical afterimages which had particular areas in my field of view, these were almost always across my entire visual field, with occasional hot spot areas where deeper afterimages existed. On each of these occasions it has been bright out and once noticing it, unless I have gone inside, it progresses between colors, though I can’t remember any specific order, only that pink is what I remember most frequently. Lasts until I go somewhere darker or the sun is covered by clouds for a while. Including as an aside, since its beyond the event, but relevant to optical experiences, I have a history of staring out into space without realizing it, failure to blink to the point of eye redness and wateriness, falling asleep with my eyes open, and distractedly looking at bright things for long enough without noticing that I develop a disruptive after image for a while after that makes it hard to read. These things make my baseline for having stared at the sun or not squinted enough on a bright day higher, and, to me, seem to explain why these things happen to me on bright days without clouds or rain, since the cloud protection wouldn’t be a necessary factor in my brightness exposure. i wanted to share since this seems like a difference in some part from the sungazers (who saw auras specifically around the sun) but which matches some of the accounts of the Fatima incident. 7: As a kid, I would stare at the sun sometimes (I eventually abandoned this after I got a headache from doing it; I don’t know whether this has caused any of my minor eye problems later in life), and it would usually resolve to a discolored disk “swirling” slowly around the bright outline of the sun. I assume this is what people mean when they say the sun was “spinning”, although I’m not completely sure. I do not believe I was primed to see something interesting, since I grew up in a nonreligious household and nobody talked to me about sungazing; I only did it because people told me not to stare at the sun for very long. 8: There was an upcoming eclipse when I was a kid and all the talk about “don’t look at the sun” was a temptation I could not resist. I stared at the sun at least a couple of times, but somebody caught me doing it (I think my mother but I do not remember in detail) and made me stop. It was very much like the Fatima miracle people describe—in fact I was a bit confused when I started reading your post because it was immediately clear to me that this is just what it looks like when you stare at the sun (or I guess, under some circumstances?). I did not realize until now that this was a rare or special experience. From what I recall, the rim of the sun remained sharp and bright, but within the circle, the color changed the longer I looked. It had a silvery, almost liquid appearance. I remember the spinning vividly, but it felt to me like it was an illusion happening because of small eye movements, and by shifting my eyes a little bit I could exaggerate or lessen the movement. I could see bright color changes too, around the edge and as afterimages or “tracers” after moving my focus. The “falling to earth” description seems pretty similar to how I remember the tracers appeared when I looked away. I do not remember exactly how long I looked, but I would guess perhaps 1-3 minutes at a time. 9: My mother and sister went sun viewing in ~2009. It was a six-to-nine months long fad in southern Minas Gerais (São João del Rei diocese), Brazil. People reported seeing Jesus and Mary in the sun, and that it spun. No reports of it changing color, though. I dont know the logistical details, who organized these outings (I was indeed just a child, my mom also didn’t care enough at the time to ask things like that). It was a series of monthly weekend mystical appearances that occurred in a bunch of different small cities, attracting, in a rough guess, 500 to a thousand pilgrims each. Always in a rural location, sometimes near small chapels. They did not charge money for the viewing, I believe only the transportation people made a profit. My sister remembers being very hungry, as they didn’t serve (or sell) food at the place, and it went from morning to sundown. My father was a complete skeptical; my mother, extremely Catholic, did not question its veracity: it was just something religious to do, and religion is good. The practice died that same year, because the local Bishop was hard against it, forbidding it. My sister didn’t see anything. My mother also saw nothing, but left feeling spiritually in peace, a very positive sentiment. 10: I used to be very confused about why the sun was portrayed as yellow, because I had looked directly at the sun (I don’t recall how many times; perhaps only once, and I was pretty young), and the sun was clearly bright pink. My default mental image of the sun is still that of a bright pink disk. It did not change colors or move or do any of the other exotic things mentioned in your post. 11: As a kid (maybe 10-13?), I would stare into the sun repeatedly for the weird experience of overexposed eyes. I’d never heard of the Fatima miracle prior to your article, but parts of it seem completely normal to my experience. The center of the sun soon stops looking intolerably bright, and instead seems like a disc of metal of an uncertain color. Its apparent color irregularly shifts between purple, silver, blue and green. My interpretation at the time was that my eyes were probably unable to strongly identify the color, because if I told myself that I expected it to be silver, it would normally be seen as silver. I have to emphasize how non-radiant the center of the sun appears at this point; it looks more like an object illuminated by the sun than like a light source But the outer rim of the sun remains bright. I assume this is because those parts of the retina have not been completely overexposed, and so can still give accurate signals that they’re receiving a ton of light. And the exact amount of ‘bright outside’ and its exact location on the sun varies a lot based on small eye movements; the central disc can appear to shift around and grow/shrink slightly in the sun. In short, the descriptions of the sun as a silver or pulsating multi-colored disc with fireworks on the outside seem entirely normal for “sungazing” for me. I did not see: 1) Rotation 2) The sun falling to earth and looking like it’s going to crush me 3) Any apparitions of people 12: Outside my home, I would frequently stare at the sun for long periods, between the ages of (young, my memory goes back to 4-ish) and 7. I would stare at various times of day — noon, sunset, etc. I wasn’t looking for anything in particular, just curious. I had a habit of staring for long periods at everything around me. The sun appeared various colors on first looking at it, most commonly orange or yellow. On closer inspection, this turned to white. Then shimmery blue patches would appear in the white, always touching the edge, which would appear to spin and reverse quickly. This impression of a blue-white rapidly spinning sun was observed reliably whenever the sun was far enough above the horizon on a clear day. It would continue as long as I looked at the sun. I think I would look for several minutes at a time; less than an hour. (Among my family and friends I was well known for ‘blanking out’ and staring at things for long periods.) As far as I was aware, it was not an ‘optical effect’, just the sun’s normal appearance. I had no impression of the sun falling to earth. I was a very imaginative child with many imaginary friends, ufo sightings, and mysterious experiences. I don’t remember anything imaginative, visionary, creative, etc. associated with looking at the sun. It just seemed like a straightforward observation, like many I made. In later years, I have often observed, as you have, conditions of mist, cloud, rain or (most memorably) snow or ice, which allow the sun to be seen easily as a silvery round disc like the moon. Outside of these conditions, sunrises, and sunsets, I don’t look at the sun anymore, and have never had any vision damage i know of. 13: I’m less stupid than I used to be, but when younger would sometimes look at the sun out of curiosity. I also spent much too much time lighting things on fire with a magnifying glass. So this is not so much “I saw a miracle” as “here are my general notes from looking at the sun”. The silvery sun thing is something I can attest to. At first the sun is too bright to look at, but after a couple of seconds it goes silvery and is more bearable. A slightly twirling of the sun is also something I’ve seen. It’s more like a rotation of its black border? Something like if you’d make a drawing of the sun with a black pen and then coloured it in with yellow (or whatever), the border (i.e. the black ink of the pen) rotates? This doesn’t make sense when I describe it like that, but my brain sees it twirling. I don’t recall colour changes other than everything looking washed out. 14: The first [time I saw it],(before I knew about Fatima) was in summer (I think August). The sun was setting (about an hour before sunset), and I saw the sun change color (alternating blue and pink with an apparent rotational motion around its center, like a Catherine wheel). I don’t remember if it was obscured by clouds. I don’t remember how long the event lasted. After discovering the Fatima event, I decided to personally verify the hypothesis that it was a natural phenomenon due to temporary vision changes. During September 2022, on a couple of occasions, in the early afternoon, while the sun was obscured by translucent clouds, I saw color changes (alternating blue and pink), a rotational motion (like a Catherine wheel), and the sun oscillating (as if vibrating or moving rapidly in a zigzag pattern). On both occasions, the event lasted about a minute, as I then had to look away due to discomfort. On only one occasion, after a heavy rain, and much later (around 5:00 PM), I managed to gaze at the cloudless sun, and only for a few seconds. I saw the same phenomena as when it was covered by clouds, but following this occasion, an afterimage appeared in the center of my field of vision that remained for a couple of days (the afterimage was not severe enough to prevent me from carrying out my activities, including reading and writing, and once it disappeared, I did not suffer any permanent damage to my vision). I must admit that, with the exception of the first case, I had to force myself to look at the sun, as a slight discomfort was present from the first few seconds. In the above cases the edge of the solar disk was not blurred. These were the best of 45 answers. Most of the rest saw normal afterimages, or wanted to say that they, too, had seen the sun look like a pale full moon behind clouds, or saw weird things in the sky that didn’t seem Fatima-related. Interview With A Medjugorje Witness One person filled out the form to say they had seen the miracle at Medjugorje, and kindly agreed to anonymously answer followup questions: SA: Tell me what happened. MW: I was in Medjugorje, I don’t remember the exact year but late 90s or early 2000s. This was not at the same time as one of the apparitions. We were outside, I think in the evening in summer (6pm maybe) Some people pointed out the sun, which was low in the sky, maybe just above eye level from our vantage point, nowhere near setting. Me and my mum looked at it, and it was spinning and pulsing, almost throbbing. I always compared it to a Catherine Wheel before even knowing it was a common comparison, it matched the way it was almost violently moving at risk of leaping off its axis. It changed colours, like it was having a filter passing over it. Not a smooth gradient change but as if a coloured lens was moved over it. There were points it had two or more colours over different sections. I don’t remember the exact colours but it included deep sunset reds, when the sky was high over the horizon. There wasn’t any pain or discomfort from looking at it. Eventually it stopped. The reaction from the people I was with was more quiet awe. Oddly subdued for such a strange moment! We didn’t discuss with others there, as we didn’t speak the same language. I don’t remember any other visions or apparitions. I was a believer at the time, so I was quite sensitive to what I felt were spiritual experiences, but I didn’t encounter any others on this trip. My mum has had other spiritual experiences there, including what she says was a vision of Mary in the 80s which was seen by herself and several others. I’m an atheist these days, and obviously don’t put much stock in the Marian appararitions in Medjugorje now. For instance, it seems the fire and brimstone idea of hell was a Renaissance invention, and the looming end times dynamic has been a constant across many religions. But the sun miracle remains a completely unexplainable experience! SA: What led you to go to Medjugorje? When you set off, did you know about sun miracles? Was there an expectation of seeing one? MW: My mum took me. She’s been on quite a few occasions over the years and took me there on 2/3 occasions. I didn’t know about sun miracles happening there and had no expectation of seeing any. I was aware of the Fatima sun miracle. And my mum often watched quite dramatic, apocalyptic VHSs with meteors falling from the sky etc, so I had a finely developed sense of imminent supernatural events! SA: How long did you spend in Medjugorje before seeing the miracle? How long did you stay afterwards? Did you make multiple attempts to see the miracle before it happened? Did you try to see it again afterwards? MW: I think the trip was 7-10 days. It happened in the second half of the trip, 2-3 days from the end maybe. I definitely kept an eye on the sun when it approached a similar time of day. Now I look into it, the daily apparations were at 6.40pm, I don’t remember if that was the exact time of the sun miracle but it would have been close to that time. I came back to Medjugorje with my mum as a teenager and brother, nothing happened that time! SA: Did you get any chance to talk to other people in Medjugorje, either pilgrims or locals, and gauge what percent of them had seen the miracle, or how many times they had seen it? MW: I didn’t get to discuss with anyone. A short “wow did you see that” with my mum, but it’s not even the weirdest thing she’s seen there given she thinks she saw Mary appear. SA: When people gestured to you to look at the sun, did you see the miracle immediately, or did it take you a while of concentrating and straining? If the latter, how long? MW: I remember it being fairly immediate. Obviously I had to look at the sun, as it’s not like the surroundings were going disco coloured, it didn’t affect the actual light the sun gave off on my surroundings. But I don’t remember staring at a normal looking sun for any period before the effect started. It was wobbling and spinning right away, although the colour changes may have come after the violent spinning. SA: Having [now] read about the theories that it’s just afterimages, or illusions, or something like that - does that accord with your experience? Does it feel like you just saw minor perturbations that could have been illusions? Or did it seem perfectly clear, totally beyond the ability to be an illusion? MW: It felt completely beyond any possibility of it being an illusion. It was too instantaneous, and the effects too strong. No clouds or signs of interference over the sun. And someone else drew my attention to it! For afterimages specifically, they still have that very strong searing quality, which wasn’t a factor here in the same way. SA: Did it look like it looks in the videos linked in the post? MW: No, it didn’t bear much resemblance to the videos. The pulsing wasn’t present with what I saw. Violent spinning and colour changes only, and an effect kind of similar to an eclipse initially that changed to colours changing, but not in the same fashion as an afterimage. SA: Can you tell me more about being an atheist? How does this mesh with you having seen a hard-to-explain miracle? MW: I just gradually became disillusioned with Catholicism. My mum is very devout and pushed it very hard on me, so there’s a strong aspect of teenage rebellion. Fundamentally, I couldn’t reconcile the existence of the kind, loving, individually interested God I’d been taught about with the world as I came to see it (partly the problem of evil, partly seeing the gap between OT and NT as signs of scripture being a historical construct). So either God didn’t exist, did in a form that I had no respect or interest in. The sun miracle was a major reason I called myself agnostic for a very long time. To this day, I can’t explain what happened. I just accept that certain, supernatural appearing, phenomena can occur which we can’t explain. Now I’ve stopped believing such things are possible, they’ve stopped happening. Which I’ve taken as evidence that there’s some degree of self induced receptiveness, like shamanist practices, at play. Although I know the counterargument would be I’ve merely closed myself off from God. SA: Thank you. Ethan: It Wasn’t The Sun Ethan Muse, who wrote the original pro-miracle post that started this discussion, responded to me here: It Wasn’t The Sun. His main goal remains supporting Dalleur’s assertion that Fatima was an objective miracle, implemented through a fiery object which was not the real sun (and therefore cannot be explained by the sun giving people afterimage-related hallucinations), and which was seen by many distant witnesses (and therefore cannot be explained by suggestibility). I won’t answer every one of his objections, both in the interests of time and because I don’t have good answers to every one of his objections, but some highlights: 1.1.1: Cloud Dimming In my original post, I was unimpressed by the “miracle” of people seeing the sun very clearly (including the sharp outline of the solar disc) without being blinded, because I had seen this myself regularly, when the sun was partly dimmed by clouds. Some of the Fatima witnesses had said it couldn’t be clouds, because the disc was visible very clearly rather than the foggy appearance you would get from - well - fog, but I insisted this didn’t update me, because I myself had seen the disc clearly through cloud cover. Ethan says I must be mis-remembering, because my claimed experience is physically impossible: The luminance of the solar disc at its zenith is on the order of 10⁹ cd/m².1 The maximum luminance that an on-axis, compact source can have without causing observers to experience discomfort glare is on the order of 10³ cd/m². Bringing the Sun’s luminance down from 10⁹ cd/m² to 10³ cd/m² requires an attenuation factor of 10⁶. By Beer’s law, that presupposes clouds with an optical depth of roughly 14. When obscured by clouds that thick, the solar beam is essentially extinguished. All that reaches observers is light that has undergone multiple scattering within clouds, emerging from many directions rather than straight paths from the solar disc. The solar disc is reduced to a bright patch or vanishes entirely. Why does Scott have the impression that he has stared at the Sun while it was veiled by thin clouds without experiencing discomfort? It is possible that he is remembering episodes where he briefly glanced at the Sun when it was low on the horizon. Even then, however, luminance should have exceeded the comfort ceiling. Another possibility is that he is accurately recalling that the Sun appeared to be pale, but is forgetting that he squinted, experienced discomfort glare, and/or diverted his gaze. Against this, I posted a Discord poll in which 13/16 respondents agreed they had seen the same thing. After my post, people in the ACX Discord channel independently replicated the poll, with the following results: The Discord comments were pretty interesting, because some people said they could imagine this happening during a forest fire or something - and other people said no, what were they talking about, this happened all the time with totally normal clouds. It really does seem like there’s a pretty sharp distinction between people who recognize and don’t recognize the description. Some people chimed in on the comments of the main post, or the form I set up for people who wanted to send reports, saying the same. From Measure: I have seen the [thin clouds make the sun easy to look at with a crisp edge] phenomenon many times (midwest US, usually early in the morning, but occasionally nearer midday). From a respondent to my survey: I have not seen the sort of behavior described, but I just wanted to say that when there’s just the right amount of cloud cover I can *definitely* look at the sun without my eyes hurting, and it looks like a dull silvery-grey disc. I happen to catch the sun like this every few months (I live in New England), peer at it for a few seconds to see if I can make out sunspots with the naked eye, then think better of my eye health and look away. It’s really weird to me that some people you asked had never experienced this. I thought it was a mundane, normal thing everyone knows! How do we square this with Ethan’s claim that this is impossible? I have no expertise in optical physics and cannot begin to comment on this. GPT-5, after I attempt to give it a neutral prompt that doesn’t reveal which side of the issue I’m on, says that the disc-like sun is possible, and Ethan is wrong because “Cloud droplets are large (Mie regime) and have a strongly forward-peaked phase function. Even when they dim the Sun a lot, they don’t behave like a perfect diffuser”. I don’t know what this means or whether it’s actually a good response. I welcome input from human physicists in the comments. In a private conversation, Ethan continued to assert that I was misremembering, and that all the Discord users and commenters who agreed with me had been contaminated by my testimony and become victims of suggestibility. I think this is a pretty crazy point to suddenly convert to the doctrine of eyewitness fallibility, contamination, and suggestibility - but I leave further discussion to people who understand optical physics. Despite believing I’m right on this factual point, I’m no longer sure it matters - some of the Medjugorje pilgrims say they saw the miracle in a completely clear sky, and that while it was happening it didn’t hurt to stare at the sun. 1.1.2: Eyewitness Testimony Ethan takes issue with my citing Fatima expert Stanley Jaki’s claim that “the great majority of eyewitness accounts, and certainly the most important ones, contain emphatic references to the continued presence of clouds.” He says that: Scott neglects the fact that those ‘emphatic references’ both explicitly and implicitly contradict his proposal . . . Sampling from Scott’s collection of testimonies from 60 eyewitnesses, I found 15 statements that unambiguously describe the behavior of clouds during the event. All of them confirm that, although clouds were present and sometimes passed in front of the ‘Sun,’ cloud coverage was partial, nonuniform, and intermittent. I agree with Doug Summers Stay’s proposal that: I don’t see any mention here of different layers of clouds. It is possible to have both cumulus clouds and cirrus clouds at the same time, so what we think of as “clouds” part and behind them is another layer of clouds blocking the sun. It seems to me, especially from watching the videos and videos in the comments, that there is some rare kind of clouds, perhaps caused by high ice crystals, that can produce a variety of optical effects: motion, changing color, and changing size. That this should happen at a time when a lot of people are looking at the sun expecting something to happen is a big coincidence, but in the end only a coincidence. On this model, there was a thick layer, obvious as clouds to the observers, which had been producing the rainstorm, and which cleared just before the miracle. There was also a thinner layer, which dimmed the sun but didn’t hide it, and which was sometimes - but not consistently - reported as clouds by witnesses. Many witness testimonies say that, although the main layer of clouds had cleared, there was some kind of veil over the sun. O Seculo: The sun had a kind of veil like transparent gauze so that eyes could gaze at it. Almeida describes the sun as …a disc of smoky silver. Compare to our photo of the sun filtered through clouds: From Domingos Pinto Coelho: The sun, until then concealed, showed itself among the clouds that moved fairly fast. Because their density was variable, the veil which they threw over the king of stars was diaphanous. Like the multitude, we then looked toward the sun with rapt attention, and through the clouds, we saw it under new aspects. From Nascimento e Sousa: The sun, which was surrounded by clouds, trembled hesitatingly…I saw there a very pronounced yellow color, and it seemed to me that I saw a silver color beneath the solar disc, but I don’t guarantee that. From Maria de Campos: We started to see the disk of the sun, and see it clearly against the dark gray layer which covered the entire sky…we saw something like a silver-lined veil, with a round shape, as if it were a full moon. Again, I’m not sure this matters, since some of the later miracles were in a clear sky. 1.1.3 - Inconsistency Ethan points out that if the sun were partially veiled by clouds, to the point where it was not too bright to stare at, then it presumably also would not bright enough to produce weird entoptic phenomena and hallucinations. When we discussed this, I had no better solution than to say that maybe there was a level of brightness which was dim enough to look at, but still bright enough to produce phenomena/hallucinations. But again, I’m no longer sure this matters. Many people in the comments to the original post report staring at the completely-non-veiled sun without feeling pain or having negative effects, many Medjugorje pilgrims say they saw the miracle in a completely clear sky without pain, and fire kasina practitioners can get imagery/phenomena from looking at dim or medium-brightness lights. I agree with Ethan that the sun at midday is so bright that it’s painful for me to look at for even a fraction of a second, and I don’t understand how so many people are saying they stare at the sun for minutes at a time at any time of day just because they’re bored. 2: Distant Witnesses Ethan was able to find more medium-distant witnesses than I could: The two witnesses at Alburitel, who I thought were in the same group, were actually in two different groups (is it surprising that our only witnesses from each of these two groups are each other’s brother?)
2: Prediction by Jurgen Gravestein: “I don’t think people realize what kind of ads are coming. If the Sora app has your face, you will in the near future see ads of yourself wearing clothes of a certain brand.”
4: It’s No Great Awakening. Claims of a revival in American Christianity among the young are not borne out by data. The country is no longer secularizing at the same rate as in the early 2000s, but there is no sign of any reversal.
Everyone who studies biochem asks themselves at some point “Why do cells need such long signaling pathways?” - ie so many chemicals whose only point is to activate other chemicals and so on in a chain, until the last chemical in the chain makes something happen. If I understand this paper right, it’s claiming that if each chemical has enough positive and negative inputs, this is analogous to a neural network, capable of making primitive decisions about cellular behavior. I asked some real biologists, who were not nearly as impressed with this thesis as I was and said that although these chains do help set cellular behavior, the analogy between levels of a chemical and the activation function of a neuron was too weak to carry so much weight. I still wonder whether insights from mechanistic interpretability could help us understand networks like these.
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. Most content is free, some is subscriber only; you can subscribe here. Also:
1: In honor of International Shrimpact Day™, pro-shrimp Substackers are holding a shrimp welfare fundraiser, with 50% matching until December 2. Did you know that $1 can help as many as 21,000 shrimp avoid a painful death? And here is a debate between Jeff Sebo and Lyman Stone, moderated by Peter Singer, on whether shrimp welfare matters.
2: If your response is noooo, charity money should be spent on humans, then good news: pro-human Substackers are holding a human welfare fundraiser, also with 50% matching, until the end of the month. All donations go to homo sapiens, guaranteed!
The hereditarians declared victory (Cremieux on X, Emil Kirkegaard on Substack) because of this graph:
The hereditarians declared victory (Cremieux on X, Emil Kirkegaard on Substack) because of this graph: That is, once you include the rare variants, the amount of genetic variation that “should” exist but doesn’t shrinks to only 12%. Plausibly an even bigger study, investigating even rarer variants, could shrink the gap further, all the way to zero. The oldest and strongest argument against hereditarianism - if all these genes exist, why can’t we find them? - has finally been put to rest. You couldn’t find them because they were rare. But when you include rare variants in your search, you can find at least 88% of them.
That is, once you include the rare variants, the amount of genetic variation that “should” exist but doesn’t shrinks to only 12%. Plausibly an even bigger study, investigating even rarer variants, could shrink the gap further, all the way to zero. The oldest and strongest argument against hereditarianism - if all these genes exist, why can’t we find them? - has finally been put to rest. You couldn’t find them because they were rare. But when you include rare variants in your search, you can find at least 88% of them.
1: Ben Goldhaber: Unexpected Things That Are People. “It’s widely known that corporations are people . . . but there are other, less well known non-human entities that have also been accorded the rank of [legal] person.”
2: Jackdaw was originally Jack Daw. Magpie was originally Maggie Pie (really!) Robin Redbreast is still Robin Redbreast. Weird Medieval Guys explains how birds got human names. Short version: there was a medieval tradition of giving every animal one standard human name (all worms were “William Worm”, all monkeys were “Robert Monkey”) and although these are mostly forgotten, they survived in the names of a few birds. Also: “Perhaps the most baffling … was the common Kestrel. He was known simply as the Windfucker.”
3: A story “in the style of Scott Alexander or Jack Clark” about the two-door meme (meme below). And if you enjoyed the story, here’s the chaser.
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. Most content is free, some is subscriber only; you can subscribe here. Also:
5: Several people have asked why I delete comments that get someone banned, saying they would like to be able to see them to double-check that my moderation decisions are reasonable, or to learn more about the rules and where the bar is. I agree this would be ideal, but Substack seems to auto-delete comments that get bans, and I can’t figure out how to turn off this feature. Sorry for the inconvenience.
There are op-eds too. Here’s how the Atlantic wants to fix Congress. The New York Times of course has a solution. Here on Substack, Matt Yglesias thinks proportional representation is the solution, and Nicholas Decker has an especially interesting solution.
The US is far bigger than in the Framers’ time, so it’s the 50,000 number that would apply in the present day. This would increase the size of the House of Representatives from 435 reps to 6,6412. Wyoming would have 12 seats; California would have 791. Here’s a map: This would give the U.S. the largest legislature in the world, topping the 2,904-member National People’s Congress of China. It would land us right about the middle of the list of citizens per representative, at #104, right between Hungary and Qatar (we currently sit at #3, right between Afghanistan and Pakistan).
This would give the U.S. the largest legislature in the world, topping the 2,904-member National People’s Congress of China. It would land us right about the middle of the list of citizens per representative, at #104, right between Hungary and Qatar (we currently sit at #3, right between Afghanistan and Pakistan).
In 1917, some Portuguese children started seeing visions of the Virgin Mary. The Virgin told them she would enact a great miracle on a certain day in October, and a crowd of 100,000 gathered to witness the event. According to eyewitness reports, newspaper articles, etc, they saw the sun spin around, change colors, and do various other miraculous things. At least a hundred separate testimonies of the event have come down to us, with only two or three people saying they didn’t see it. Catholics continue to bring this up as one of the best-attested miracles and strongest empirical proofs of the faith - including here on Substack, where there was a spirited debate about the event last fall.
I did my best to research the event, and the results were The Fatima Sun Miracle: Much More Than You Wanted To Know and Highlights From The Comments On Fatima. The main thing I was able to add to the Substack discussion, if not the broader worldwide one, was a survey of similar events. There were apparent sun miracles at various other Catholic sites and apparitions of the Virgin, including a crowd of hundreds of thousands in Italy, and a small town in Bosnia where they seem to happen regularly. But also, people who “sungaze” - a weird alternative medicine practice where people stare at the sun in the hopes that maybe this will help something and they won’t go blind - report sometimes seeing the sun spin and change color in similar ways. And Buddhist meditators report that concentrating very hard on any bright light will cause similar things to happen.
Still, the Catholics - especially original Fatima-Substacker Ethan Muse - were not convinced. The other Catholic sightings could have been other real miracles, equally attributable to the Virgin. The sungazers were staring at the sun for a long time, unlike the Fatima pilgrims who just happened to glance up at it. And the meditators were doing sophisticated contemplative exercises, again different from the Fatima pilgrims who just looked up and saw it. These were suggestive, but there was no record of a miracle exactly like Fatima happening within a non-Catholic religious tradition.
This is the weekly visible open thread. Post about anything you want, ask random questions, whatever. ACX has an unofficial subreddit, Discord, and bulletin board, and in-person meetups around the world. Most content is free, some is subscriber only; you can subscribe here. Also:
Most advice in this space is vague. I care about outcomes you can actually measure. Most photographers optimize for their portfolio. I optimize for dating app performance. I publish my full methodology on Substack for free. If you want someone to run the whole process end-to-end, that is where I come in.