GiveWell

Article

GiveWell is a recurring organization in the Astral Codex Ten archive, appearing 22 times across 22 issues between April 30, 2021 and January 16, 2025. The archive places it in contexts such as “a perennial enough question to warrant its own FAQ item on the GiveWell blog”; “GiveWell top charities in terms of directly improving wellbeing”; “other GiveWell top charities”. It most often appears alongside effective altruism, Rethink Priorities, ACX Grants.

Metadata

  • Category: Organizations
  • Mention count: 22
  • Issue count: 22
  • First seen: April 30, 2021
  • Last seen: January 16, 2025

Appears In

Source Context

Recovered passages from the original issue text. When the raw archive preserved outbound links inside the source passage, they are listed directly under the quote.

April 30, 2021 · Original source
Cult of personality aside, though, Vogt’s arguments may resonate with the utilitarians who believe that average utility matters more than total, so a world with fewer beings who lead happier lives is preferable to a world of unchecked growth that leads to worse outcomes for many individuals, human or dovekie. A version of this strain of thought is a perennial enough question to warrant its own FAQ item on the GiveWell blog. I think the revealed preference of most rationality/EA-aligned folks like me and probably many readers of this blog (as evidenced by the kinds of charitable causes we give to, like GiveWell) is more closely aligned with Borlaug and the Wizards: that it is important and possible to increase both average and total utility, both number of lives and quality of life. But even though it’s hard to imagine people today willingly deciding to stop reproducing above replacement or consuming goods for the sake of the mosquitos and dovekies (no matter how cute they are), it’s not unreasonable to think that as a normative matter, that world may in fact be a better one. The original Prophet solution to attaining that world by actively decreasing human populations may be less en vogue today what with eugenics and Malthusianism not exactly welcome topics in polite company, but it’s been replaced by other fears of overreaching capacity of one kind or another, be it oil, water, or greenhouse gases, and a desire to curb growth in these spheres and focus on conservation. Might there be some validity to the Prophets’ concerns that we can’t have it all (both average and total utility increases), and that we’re growing too quickly for our planet and its resources to keep up with us?
I’m convinced, to be clear, that we have a lot of Borlaugs out there – the pioneers of mRNA who produced vaccines to a novel virus in world-record time surely fit the bill, whether they’re hardscrabble farmers from Iowa with hearts of gold or not. But we also have a lot more fish in a much bigger, fancier tank now than in Borlaug’s day. According to hard Wizardry, all that increased productivity and human capital should have made it easier for us to roll up our sleeves and nip this thing in the bud; instead, we all collectively shat the bed. It wasn’t just red states or populist leaders or microchip truthers that were guilty; everyone in every country, state, and Holy See didn’t adopt masks, close the borders, roll out tests, or vaccinate quickly and effectively enough, and the blood of 2 million people and counting (not to mention the global loss of jobs, social activities, educational quality, and basic human connection) is on our hands. Unlike climate change, pandemics aren’t even a new problem we’ve never encountered, and certainly not a problem Wizards weren’t aware of and politely screaming about for years. So how could we have been caught so off-guard and been so slow and ineffective at responding?
July 05, 2021 · Original source
Now institutional effective altruism has evaluated those claims, in the form of an analysis by trusted EA think tank Rethink Priorities. They conclude that “it is unlikely that charter cities will be more cost-effective than GiveWell top charities in terms of directly improving wellbeing”.
It then fine-tunes some of CCI’s models, incorporating the sort of pessimistic assumptions about growth that make sense in the context of the World Bank study, and finds that although they are nice, they don’t reach the same level of cost-effectiveness as other GiveWell top charities, even on time scales of decades.
July 23, 2021 · Original source
19: Holden Karnofsky, co-founder of GiveWell and CEO of the Open Philanthropy Project, now has a blog, Cold Takes, on “futurism, macrohistory, applied epistemology and ethics, [and] sometimes sports”. Getting to hear from Holden is always a privilege, usually one reserved for people at effective altruism organizations or conferences, and it’s exciting to see he’ll be sharing his thoughts more widely.
December 28, 2021 · Original source
The Oxfendazole Development Group, $150,000, to develop oxfendazole. This is a next-generation antiparasitic drug which may one day replace albendazole and mebendazole, the current choices for deworming. Several hundred million children worldwide suffer from parasitic worm infections; this certainly affects their health, and a growing body of research suggests it might affect their cognitive ability, educational attainment, and future income. GiveWell endorses deworming as one of the most effective charitable interventions; the successful development of new antiparasitics would further this effort. Oxfendazole has done well in early studies and this group wants to follow them up in the hopes of eventually getting FDA approval. To learn more or send a donation, see this site
February 09, 2022 · Original source
GiveWell estimates that if you donate to their top charity, Against Malaria Foundation, you can probably save a life for about $5000. ACX Grants raised $1.5 million. Donated to AMF, that’s enough to save 300 lives. I didn’t donate it to AMF. I believed that small-batch artisanal grant-making could potentially outperform the best well-known megacharities - or at least that it was positive value in expectation to see if that was true. But if your thesis is “Instead of saving 300 lives, which I could totally do right now, I’m gonna do this other thing, because if I do a good job it’ll save even more than 300 lives”, then man, you had really better do a good job with the other thing.
What’s your story for why you need a microgrants program? Why not just donate to GiveWell or OpenPhil or some other charity or foundation you respect?
Your alternative to running a grants round is giving to the best big charities that accept individual donations. GiveWell tries to identify these, and ends up with things like Against Malaria Foundation, which they think can save a life for ~$5,000. So to a first approximation, run a grants round if you think you can do better than this.
May 27, 2022 · Original source
A story that plausibly explains these numbers (either a potential mechanism for an effect, or an explanation of why the effect turned out to be null) If these stories are challenged, it is not because there is no actual evidence for them, but because an economist in the audience has thought of their own preferred theory. If the speaker can find some data point that contradicts the questioner’s idea, this is thought to “confirm” the original story. Since audience members (who often have little specific knowledge of the region) are unlikely to ask questions like “what if this village just has an incredibly complicated set of social conventions around cattle that prevents their sale even without market barriers in place?” or “do the region’s economic challenges have more to do with this very specific regulation in South African immigration law?”, plausible-sounding stories that explain one or two numerical data points tend to gain traction in the literature whether or not they have anything to do with reality. Mark McGovern famously noted this trend in a review of two of Paul Collier’s books, writing: “Much of the intellectual heavy lifting in these books is in fact done at the level of implication or commonsense guessing. And the common sense is surely not that of the inhabitants of the countries being dissected, but that of the highly educated elite located primarily in Western Europe and North America. In those passages where Collier does lay out the thinking behind his explanations, they are always coherent and plausible, but the chain of causal relations makes it evident how fragile these models typically are.” The World Bank report’s fundamental misdiagnosis of the challenges Lesotho faced formed the basis for a series of failed “development initiatives”, most notably the Thaba-Tseka Development Project, a joint venture funded by the Canadian International Development Agency, the World Bank, the Government of Lesotho, and the UK Overseas Development Ministry. The project focused on providing technical solutions to the “problems” the World Bank report had identified: better agricultural techniques, easier access to markets, and increased government capacity to provide public goods. Each piece faced serious problems in execution, largely because interventions shown to have the sorts of “positive effects” randomized experiments might demonstrate elsewhere in Africa were not necessarily well suited to Lesotho’s unforgiving, mountainous terrain. But even more seriously, the project was so enveloped in “development discourse” that nobody thought to question whether they were working on problems their “recipients” cared about, or merely the ones the “tools of development” were capable of solving. As Ferguson writes, “The promise that crop farming could be revolutionized through the application of a well-known package of technical inputs was so firmly written into the project’s design that it was difficult for those on the scene to challenge it, or even to confront it.” Perhaps the only thing that has changed since Ferguson wrote is that we have tools to better identify these failures: the development literature continues to be littered with failed trials and interventions based on unchecked assumptions. One of the most famous is the British Department for International Development’s 90 million pound Tuungane project, whose Congolese incarnation sought to rebuild village governing institutions that the country’s civil war had destroyed. One of the most convincing explanations of its failure is that it may not have been necessary to begin with: the implementers do not seem to have checked whether the institutions had actually been weakened by violence, and baseline reports indicated that residents were relatively satisfied with village governance before the project even started! More research is needed to clarify the situation -- research which might have been useful to carry out before spending a £90 million on a “fix”. Part of this, perhaps, comes from the usual overconfidence that other social scientists like to accuse economists of. But there are much bigger systemic problems at play. Development work tends to run on short timelines: grad students and postdocs need to publish quickly for their careers to advance, NGO funding runs on 5-ish year cycles, and charities (particularly in “high-risk” areas) face extremely high employee turnover rates. This simultaneously limits the accumulation of institutional knowledge, while incentivizing practitioners away from the time-intensive process of understanding a particular context in favor of “getting results quick.” Similarly, the recent introduction of experimental evidence to the development field is a wondrous thing, but the revolution has to continue: randomized experiments can tell us about the effect an intervention had somewhere, but even the best methods of applying this kind of evidence to a specific context remain somewhat arbitrary and subjective. As EA begins to fund more complex (but potentially more effective) interventions, a key step will be to get a more systematic handle on how to gather evidence about specific places-- countries, states, even villages -- and how to match the tools we have to people who might benefit from them. II. The Trouble with Technocrats “But even if the project was in some sense a ‘failure’ as an agricultural development project, it is indisputable that many of its ‘side effects’ had a powerful and far-reaching impact on the Thaba-Tseka region. [...] Indeed, it may be that in a place like Mashai, the most visible of all the project’s effects was the indirect one of increased Government military presence in the region” As the program continued to unfold, the development officials became more and more disillusioned -- not with their own choices, but with the people of Thaba-Tseka, who they perceived as petty, apathetic, and outright self-destructive. A project meant to provide firewood failed because locals kept breaking into the woodlots and uprooting the saplings. An experiment in pony-breeding fell apart when “unknown parties” drove the entire herd of ponies off of cliffs to their deaths. Why, Ferguson’s official contacts bemoaned, weren’t the people of Thaba-Tseka committed to their own “development”? Who could possibly be opposed to trees and horses? Perhaps, the practitioners theorized, the people of Thaba-Tseka were just lazy. Perhaps they “didn’t want to be better.” Perhaps they weren’t in their right mind or had made a mistake. Perhaps poverty makes a person do strange things. Or, as Ferguson points out, perhaps their anger had something to do with the fact that the best plots of land in the village had been forcibly confiscated to make room for wood and pony lots, without any sort of compensation. The central government was all too happy to help find land for the projects, which they took from political enemies and put in the control of party elites, especially when it could use a legitimate anti-poverty program as cover. In Ferguson’s words, the development project was functioning as an “anti-politics machine” the government could use to pretend political power moves were just “objective” solutions to technical problems. A local student’s term paper captured the general discontent: “In spite of the superb aim of helping the people to become self-reliant, the first thing the project did was to take their very good arable land. When the people protested about their fields being taken, the project promised them employment. [...] It employed them for two months, found them unfit for the work, and dismissed them. Without their fields and without employment they may turn up to be very self-reliant. It is rather hard to know.” Two things stand out to me from this story. First, the “development discourse” lens served to focus the practitioners’ attention on a handful of technical variables (quantity of wood, quality of pony), and kept them from thinking about any repercussions they hadn’t thought to measure. This is a serious problem, because “negative effects on things that aren’t your primary outcome” are pretty common in the development literature. High-paying medical NGOs can pull talent away from government jobs. Foreign aid can worsen ongoing conflicts. Unconditional cash transfers can hurt neighbors who didn’t receive the cash. And the literature we have is implicitly conditioned on “only examining the variables academics have thought to look at” -- surely our tools have rendered other effects completely invisible! Second, the project organizers somewhat naively ignored the political goals of the government they’d partnered with, and therefore the extent to which these goals were shaping the project. Lesotho’s recent political history had been tumultuous. The Basotho Nationalist Party (BNP), having gained power upon independence in 1965, refused to give up power after losing the 1970 elections to the Basotho Congress Party (BCP). Blaming the election results on “communists”, BNP Prime Minister Leabua Jonathan declared a state of emergency and began a campaign of terror, raiding the homes of opposition figures and funding paramilitary groups to intimidate, arrest, and potentially kill anyone who spoke up against BNP rule. This had significant effects in Thaba-Tseka, where “villages [...] were sharply divided over politics, but it was not a thing which was discussed openly” due to a fully justified fear of violence. The BNP, correctly sensing the presence of a substantial underground opposition, placed “development committees” in each village, which served primarily as local wings of the national party. These committees spied on potential supporters of the now-outlawed BCP and had deep connections to paramilitary “police” units. When the Thaba-Tseka Development Project started, its international backers partnered directly with the BNP leadership, reasoning that sustainable development and public goods provision could only happen through a government whose role they primarily viewed as bureaucratic. As a result, nearly every decision had to make its way through the village development committees, who used the project to pursue their own goals: jobs and project funds found their way primarily to BNP supporters, while the “necessary costs of development” always seemed to be paid by opposition figures. The funding coalition ended up paying for a number of projects that reinforced BNP power, from establishing a new “district capital” (which conveniently also served as a military base) to constructing new and better roads linking Thaba-Tseka to the district and national capitals (primarily helping the central government tax and police an opposition stronghold). Anything that could be remotely linked to “economic development” became part of the project as funders and practitioners failed to ask whether government power might have alternate, more concerning effects. As we saw earlier, the population being “served” saw this much more clearly than the “servants”, and started to rebel against a project whose “help” seemed to be aimed more at consolidating BNP control than meeting their own needs. When they ultimately resorted to killing ponies and uprooting trees, project officials infatuated with “development” were left with “no idea why people would do such a thing,” completely oblivious to the real and lasting harm their “purely technical decisions” had inflicted. Have any EA projects had this sort of unexpected political side effect? I think it’s genuinely hard to tell without further research, but the possibility is frightening. (There’s been a little bit of research on the quantitative side --Recent research has found, for instance, that GiveDirectly’s 2014 unconditional cash transfer trial increased community participation but did not change voting patterns, so at least in 2014 the Kenyan government wasn’t using the program to stay in power. Was this the right question to test? I am not sure, especially without a more qualitative survey to see if there are other avenues we should be worried about.) III. Takeaways for Effective Altruism So what do we do as effective altruists (hereafter “EAs”)? I see three key takeaways. The first is a clear need for more qualitative research. GiveWell makes some qualitative judgments about charities, but Ferguson’s work illustrates the need for qualitative evaluation of the interventions themselves to see if the underlying studies have captured all of the “right” variables. Randomized experiments are really good at testing hypotheses, but by their very nature they can’t tell you about variables you didn’t decide ahead of time to measure. Are there significant side effects (positive or negative) we’ve missed from massive malaria net distributions? I don’t know, but if so they are not likely to be discovered by a bunch of Americans and Europeans sitting in a room and trying to guess the best things to measure. Rather, they’re probably already known (or suspected) by the people experiencing them, and a first step to finding out is going and asking them. (A second step is finding the right people to ask them -- real expertise in qualitative research is a rare and valuable skill.) Of course, qualitative research is messy and sometimes the people you interview are wrong or have other agendas. So once we have an “on-the-ground” hypothesis or concern, there will often be good reason to use a randomized trial or quasi-experimental method to test it or try to understand how much of a concern it might be! This sort of interdisciplinary approach is starting to gain traction in academia, but it has yet to be seriously applied in the EA sphere. There’s another angle to this: Ferguson’s most incisive insights arise not from studying the people being “served”, but by studying the development practitioners themselves. Other social scientists have continued this trend, from McGovern’s An Anthropologist Among the Mandarins and Robinson’s How Different Social Scientists Think to Marchais, Bazuzi, and Lameke’s The Data is Gold, and We Are The Gold-Diggers and Omar Bah’s webcomic Mzungus in Development and Governments. Each new paper illuminates the research process in new ways, and provides tools both to do better research and to identify potential weaknesses in the pre-existing literature. I think one of the highest impact investments an Effective Altruist fund could make right now would be to hire a handful of trained anthropologists (or other outside experts in qualitative research / ethnography) to hang out in places like GiveWell or the Machine Intelligence Research Institute for a few years and really study how effective altruism works as a system. How are decisions being made, and how is evidence being used to make them? What does “EA discourse” help make visible and which problems and concerns does it hide from our view? How do the positionalities of typical EA researchers affect their views of what’s important or what’s plausible? I have my guesses, and I’m sure you have yours. But I had my guesses about development economics, too, and I missed nearly everything Ferguson (and the authors mentioned two paragraphs up) uncovered. What more are we missing? The second is an emphasis on local context. As funding gaps for “low hanging fruit” like malaria disappear, EA is going to have to focus on more complicated interventions, which are likely to be fairly context-specific -- after all, why should an agriculture program that works in the flattest parts of the Sahel be expected to work the same way in the Maloti Mountains? Ferguson notes about several of the Thaba-Tseka project’s failed arms: “Tanzania may be very different from Lesotho on the ground, but, from the point of view of a development agency’s head office, both may be simply ‘the Africa desk’. In the Thaba-Tseka case, at least, the original project planners knew little about Lesotho’s specific history, politics, and sociology; they were experts on ‘livestock development in Africa,’ and drew largely on experience in East Africa.” For any sort of context-specific intervention to work, an intimate knowledge of the specific history, needs, and geography of individual villages and regions is necessary. The development world has slowly made steps in this direction, but it’s not clear to me that the EA community has a clear way of acquiring, accessing, or working with this information. I don’t think there’s a magic bullet to solve this problem, but in the long run any solution will probably need to involve a) on-the-ground, qualitative research and b) real representation in the EA network from areas EA organizations are interested in working. The development industry has a shameful history of infantilizing and ignoring the opinions of “locals”, and I think the conversations I’m starting to see in EA about diversity and representation of different parts of the Global South need to continue if we’re going to get enough serious knowledge of local contexts to effectively direct funding. The third is a continued need to take politics seriously. This is one of the most challenging issues in charitable giving: when is it okay to work with a government doing terrible things to deliver humanitarian aid? To what extent does an NGO feeding the hungry lend its legitimacy to or cover for an authoritarian regime’s misdeeds? I don’t have anything close to a full answer (and I don’t think anyone does), but Ferguson’s work exposes a possibility I hadn’t thought of before, in which “technical” and “apolitical” projects can expand the power of the state in unforeseen and potentially dangerous ways. After writing The Anti-Politics Machine, Ferguson largely gave up on the idea of charitable or state-based aid. (Understandably, I think, given that he spent most of a decade watching its most horrific side effects first-hand). It’s ironic, then, that I think his book’s practical value is greatest to those of us who still hold onto hope in its possibilities. May we have ears to hear the voices telling us where our work has fallen short, and eyes to see what it could become. Footnotes Ferguson pg. 55
September 18, 2022 · Original source
1: GiveWell asks me to signal-boost a new contest with $20,000 prize - help them change their minds about some of their cost-effectiveness analyses. This is something I hear people complain about a lot, so I hope some of you will complain about it to the people who will give you $20K for doing that.
November 06, 2022 · Original source
7: Effective altruist charity evaluator GiveWell (mostly handles the global health and poverty side of things) is hiring new research analysts:
7: Effective altruist charity evaluator GiveWell (mostly handles the global health and poverty side of things) is hiring new research analysts: 8: And Redwood Research, an AI alignment organization I’ll be writing more about shortly, is looking for applicants for its upcoming interpretability program. They write:
September 28, 2023 · Original source
25: Effective Altruist Forum: The charity Pure Earth, sponsored by GiveWell, claims to have reduced the prevalence of lead in Bangladeshi turmeric from 47% to ~0%. Previously, unsavory producers would add lead to turmeric spice to make it appear more brilliantly yellow, poisoning children who consumed it and lowering IQ. Pure Earth raised awareness among consumers, helped the government crack down, and is now declaring at least preliminary victory. “The preliminary findings are that this program can avert an equivalent DALY for just under $1.”
November 28, 2023 · Original source
Source: AMF says 185,000 deaths prevented here; GiveWell’s evaluation makes this number sound credible. AMF reports revenue of $100M/year and GiveWell reports giving them about $90M/year, so I think GiveWell is most of their funding and it makes sense to think of them as primarily an EA project. GiveWell estimates that Malaria Consortium can prevent one death for $5,000, and EA has donated $100M/year for (AFAICT) several years, so 20,000 lives/year times some number of years. I have rounded these two sources combined off to 200,000. As a sanity check, malaria death toll declined from about 1,000,000 to 600,000 between 2000 and 2015 mostly because of bednet programs like these, meaning EA-funded donations in their biggest year were responsible for about 10% of the yearly decline. This doesn’t seem crazy to me given the scale of EA funding compared against all malaria funding.
Source: this page says about $1 to deworm a child. There are about $50 million worth of grants recorded here, and I’m arbitrarily subtracting half for overhead. As a sanity check, Unlimit Health, a major charity in this field, says it dewormed 39 million people last year (though not necessarily all with EA funding). I think the number I gave above is probably an underestimate. The exact effects of deworming are controversial, see this link for more. Most of the money above went to deworming for schistosomiasis, which might work differently than other parasites. See GiveWell’s analysis here.
Source: this page. See “Evidence Action says Dispensers for Safe Water is currently reaching four million people in Kenya, Malawi, and Uganda, and this grant will allow them to expand that to 9.5 million.” Cf the charity’s website, which says it costs $1.50 per person/year. GiveWell’s grant is for $64 million, which would check out if the dispensers were expected to last ~10 years.
November 30, 2023 · Original source
Cause evaluation works the same way. Every year, I feel bad free-riding off GiveWell. I tell myself I’m going to really look into charities, find the niche underexplored ones that are neglected even by other EAs. Every year (except when I announce ACX Grants and can’t get out of it), I remember on December 27th that I haven’t done any of that yet, grumble, and give to whoever GiveWell puts first (or sometimes EA Funds).
Effective altruism is composed of lots of organizations like GiveWell and GivingWhatWeCan and 80,000 Hours and AI Impacts. Ask the question for each one of them:
Why do we need GiveWell? To help evaluate which charities are most effective. There’s no contradiction between universal support for charity and needing an organization like that.
December 01, 2023 · Original source
12: Open Philanthropy discusses its decision to donate $300 million to GiveWell’s top charities, including fascinating lines like this:
We’ve reduced the annual rate of our funding for GiveWell’s recommendations because our “bar” for funding in our Global Health and Wellbeing (GHW) portfolio has risen substantially. In July 2022, it was roughly in the range of 1100x-1200x; we recently raised it to slightly over 2000x. That means we need to be averting a DALY for ~$50 (because we value DALYs at $100K) or increasing income for 4 people by ~1% for a year for $1 (because we use a logarithmic utility function anchored at $50K).
December 24, 2023 · Original source
3: If you’re one of those people who gives to charity at the very end of the year because you forgot to do it earlier, you might appreciate lists of where GiveWell employees and Open Phil employees made their personal donations this year.
January 04, 2024 · Original source
Do something like donating to charity, but the donation should go to charities that promote capitalism somehow, or be an investment in companies doing charitable things (impact investing) I agree that overall capitalism has produced more good things than charity. But when I try to think at the margin, in Near Mode, I can’t make this argument hang together. Here’s my basic objection: Consider some company. I’m going to pick Instacart, because I like it and use it often. Instacart is like Uber for groceries. It delivers them to your house, so you don’t have to go shopping. It’s great if you’re lazy, or if you’re sick and don’t want to leave the house. I’m not putting my finger on the scales by choosing Instacart here. Instacart is great. Instacart makes yearly profit of $500 million, yearly revenue of $2.5 billion, and has 10 million yearly customers (who I guess pay $250 each per year?) and a market cap of $10 billion. For complicated reasons I’ll relegate to a footnote1, I’m going to summarize the deal that Capitalism offers by allowing Instacart to exist to “For $1 million, you can give 2,000 people a great deal on grocery delivery”. Compare this to a good charity, like GiveWell’s pick Dispensers For Safe Water. If I understand their claim right, per $1 million they can give 50,000 people clean water for ten years, which would probably save about 1,500 lives. So which is a better use of $1 million? Give it to Capitalism, and give 2,000 people a great deal on grocery delivery? Or give it to Charity, and give 50,000 people clean water and save 1,500 lives? Even without being able to exactly quantify the value of grocery delivery deals vs. clean water, common-sensically Charity wins on first-order effects. So the argument for Capitalism must go through something about second-order effects. But what are these? I can think of a few possibilities: Job creation: Along with helping its customers, Instacart employs 10,000 full-time employees and 600,000 gig workers, so our $1 million investment might produce a few dozen jobs. That still doesn’t seem to counterbalance the advantage of Charity. But also (and I admit I have trouble thinking about this), it doesn’t seem obvious that Instacart “causes” jobs. Suppose Instacart had never been founded. Then people would spend whatever money they now spend on Instacart on something else (let’s say booze and porn), which would also create jobs (for brewers, bartenders, and porn stars). There’s no particular reason to think spending the money on Instacart creates more jobs than spending it on those other things would. So how many jobs does Instacart create over replacement? I’m not sure but I think it must be much less than the official number of employees.
January 11, 2024 · Original source
He never really addresses why plugging the cash into an index like the S&P 500 isn't a better use of funds than GiveWell's recommended charity. He chooses Instacart as his exemplar of capitalism, but then concludes that investing $1M in Instacart means "you can give 2,000 people a great deal on grocery delivery." But the whole point of investing is that it isn't one-and-done, that instead it grows exponentially over the long term, building wealth in the form of new and better companies which provide products, services, innovation and technology that are responsible for basically all of the good things you see on Steven Pinker's up-and-to-the-right charts illustrating the improvement of the human condition over time. These are the things that, if all goes well, will eventually lift humanity to the heavens, slay the demons (disease, death, etc.) that have haunted us forever, and awaken the dead matter of the cosmos into flourishing sentience.
So: if you donate $1M to GiveWell's Dispensers For Safe Water charity today, will that end up creating more than $134M of value in 50 years? If not, it's a loser in terms of long-term opportunity cost. If so, then we can get into the more subjective exercise of trying to tabulate the positive externalities of investing.
My criticism of GiveWell style EA is that its causes are systematically akin to donating to underfunded nursing homes. If you view uses of funds as on a spectrum, with pure consumption one end of the spectrum and pure investment on the other, my position is that EA is more like consumption than putting your money into the S&P 500.
January 18, 2024 · Original source
28: Related - back in November some people asked whether Bill Gates counted as EA, or supported it, or was opposed to it, or what. There wasn’t a clear answer then, and still isn’t, but for what it’s worth, he recently endorsed GiveWell.
February 10, 2024 · Original source
HealthLearn, $25,000, for an online training program for healthcare workers in developing countries. This is one of those blindly-follow-the-evidence grants: GiveWell says that training health care workers is one of the most effective interventions known, and HealthLearn hopes to be able to do it at scale. You can support HealthLearn by donating or volunteering your expertise in growing consumer-facing tech products; check out their blog to learn more.
April 01, 2024 · Original source
GiveWell is looking for a new Head of Philanthropy, which I think means mostly fundraising. $200K+ salary, office/remote optional, international candidates welcome.
May 30, 2024 · Original source
I’m not going to make a big deal about Stone’s use of Google Trends, because I think he’s right that SF and Boston are the most EA cities. But taken seriously, it would suggest that Montana is the most Democratic state. Stone could potentially still object that movements aren’t supposed to gather 10,000 committed adherents and grow at 10% per year. They have to take hold of the population! Capture the minds of the masses! Convert >5% of the population of a major metropolitan area! I don’t think effective altruism has succeeded as a mass movement. But I don’t think that’s it’s main strategy - for more on this, see the articles under EA Forum tag “value of movement growth”, which explains: It may seem that, in order for the effective altruism movement to do as much good as possible, the movement should aim to grow as much as possible. However, there are risks to rapid growth that may be avoidable if we aim to grow more slowly and deliberately. For example, rapid growth could lead to a large influx of people with specific interests/priorities who slowly reorient the entire movement to focus on those interests/priorities. Aren’t movements that don’t capture the population doomed to irrelevance? I don’t think so. Effective altruism has managed to get plenty done with only 10,000 people, because they’re the right 10,000 and they’ve influenced plenty of others. Stone fails to prove that effective altruists don’t donate more than other people, because he’s used bad methodology that couldn’t prove that even if it were true. His critique could potentially evolve into an argument that effective altruism hasn’t spread massively throughout the population, but nobody ever claimed that it did. II. You might imagine that a group fixated on “effective altruism” would have a high degree of concentration of giving in a small number of areas. Indeed, EAist groups tend to be hyper-focused on one or two causes, and even big groups like Open Philanthropy or GiveWell often have focus areas of especially intense work. And yet, the list of causes EAists work on is shockingly broad for a group whose whole appeal is supposed to be re-allocating funds towards their most effective uses. Again, click the link I attached above. EAists do everything from supporting malarian bednets (seems cool), to preventing blindness-related conditions (makes sense), to distributing vaccines (okay, I’m following), to developing vaccines in partnership with for profit entities (a bit more oblique but I see where you’re going with it), to institutional/policy interventions (contestable, but there’s a philosophical case I guess), to educational programs in rich countries (sympathetic I guess but hardly the Singer-esque “save the cheapest life” vibe), to promoting kidney transplants (noble to be sure but a huge personal cost for what seems like a modest total number of utils gained), to programs to reduce the pain experienced by shrimp in agriculture (seems… uh… oblique), to lobbying efforts to prevent AI from killing us all (lol), to space flight (what?), to more nebulous “long term risk” (i.e. “pay for PhDs to write white papers”), to other even more alternatively commendable, curious, or crazy causes. My point is not to mock the sillier programs (I’ll do that later). My point is just to question on what basis so broad a range of priorities can reasonably be considered a major gain in efficiency. Is it really the case that EAists have radically shifted our public understandings of the “effectiveness” of certain kinds of “altruism”? A few responses: Technically, it’s only correct to focus on the single most important area if you have a small amount of resources relative to the total amount in the system (Open Phil has $10 billion). Otherwise, you should (for example) spend your first million funding all good shrimp welfare programs until the marginal unfunded shrimp welfare program is worse than the best vaccine program. Then you’ll fund the best vaccine program, and maybe they can absorb another $10 million until they become less valuable than the marginal kidney transplant or whatever. This sounds theoretical when I put it this way, but if you work in charity, it can quickly becomes your whole life. It’s all very nice and well to say “fund kidney transplants”, but actually there are only specific discrete kidney transplant programs, some of them are vastly better than others, and none of them scale to infinity instantaneously or smoothly. The average amount that the charities I deal with most often can absorb is between $100K and $1MM. Again, Open Phil has $10 billion. But even aside from this technical point, people disagree on really big issues. Some people think animals matter and deserve the same rights as humans. Other people don’t care about them at all. Effective altruism can’t and doesn’t claim to resolve every single ancient philosophical dispute on animal sentience or the nature of rights. It just tries to evaluate if charities are good. If you care a lot about shrimp, there’s someone at some effective altruist organization who has a strong opinion on exactly which shrimp-related charity saves shrimp most cost-effectively. But nobody (except philosophers, or whatever) can tell you whether to care about shrimp or not. This is sort of a cop-out. Effective altruism does try to get beyond “I want to donate to my local college’s sports team”. I think this is because that’s an easy question. Usually if somebody says they want to donate there, you can ask “do you really think your local college’s sports team is more important than people starving to death in Sudan?” and they’ll think for a second and say “I guess not”. Whereas if you ask the same question about humans and animals, you’ll get all kinds of answers and no amount of short prompting can solve this disagreement. I think this puts EAs in a few basins of reflective equilibrium, compared to scattered across the map. So is there some sense, as Stone suggests, that “so broad a range of priorities [can’t] reasonably be considered a major gain in efficiency”? I think if you look at donations by the set of non-effective-altruist donors, and the set of effective-altruist donors, there will be much much more variance, and different types of variance, in the non-EAs than the EAs. Here’s where most US charity money goes (source): Try spotting existential risk prevention on here. I don’t think Stone can claim that an EA version of this chart wouldn’t look phenomenally different. But then what’s left of his argument? III. Effective altruists devote absolutely enormous amounts of mental energy and research costs to program assessment, measurement of effectiveness. Those studies yield usually-conflicting results with variable effect sizes across time horizons and model specifications, and tons of different programs end up with overlapping effect estimates. That is to say, the areas where EAist style program evaluations are most compelling are areas where we don’t need them: it’s been obvious for a long time how to reduce malaria deaths, program evaluations on that front have been encouraging and marginally useful, but not gamechanging. On the other hand, in more contestable areas, EAist style program evaluations don’t really yield much clarity. It’s very rare that a program evaluation gets published finding vastly larger benefits than you’d guess from simple back-of-the-envelope guesswork, and the smaller estimates are usually because a specific intervention had first-order failure or long-run tapering, not because “actually tuberculosis isn’t that bad” or something like that. Those kinds of precise program-delivery studies are actually not an EAist specialty, but more IPA’s specialty. My second critique, then is this: there is no evidence that the toolkit and philosophical approach EAists so loudly proclaim as morally superior actually yields any clarity, or that their involvement in global efforts is net-positive vs. similar-scale donations given through near-peer organizations. The IPA mentioned here is Innovations For Poverty Action, a group that studies how to fight poverty. They’re great and do great work. But IPA doesn’t recommend top charities or direct donations. Go to their website, try to find their recommended charities. Unless I’m missing something, there are none. GiveWell does have recommended charities - including ones that they decided to recommend based on IPA’s work - and moves ~$250 million per year to them. If IPA existed, but not GiveWell, the average donor wouldn’t know where to donate, and ~$250 million per year would fail to go to charities that IPA likes. I think from the perspective of people who actually work within this ecosystem, Stone’s concern is like saying “Farms have already solved the making-food problem, so why do we need grocery stores?” (also, effective altruism funds IPA) I’m focusing on IPA here because Stone brought them up, but I think EA does more than this. I don’t think there’s an IPA for figuring out whether asteroid deflection is more cost-effective than biosecurity, whether cow welfare is more effective than chicken welfare, or figuring out which AI safety institute to donate to. I think this is because IPA is working on a really specific problem (which kinds of poverty-related interventions work) and EA is working on a different problem (what charities should vaguely utilitarian-minded people donate to?) These are closely related questions but they’re not the same question - which is why, for example, IPA does (great) research into consumer protection, something EA doesn’t consider comparatively high-impact. And I’m still focusing on donation to charity, again because it’s what Stone brought up, but EA does other things - like incubating charities, or building networks that affect policy. IV. Let’s skip farm animal welfare for a second and look at the next few: Global Aid, “Effective Altruism,” potential AI risks, biosecurity, and global catastrophic risk. These are all definitely disproportionate areas of EAist interest. If you google these topics, you will find a wildly disproportionate number of people who are EAist, or have sex at EAist orgies, or are the friends of people who have sex at EAist orgies. These really are some of the unique social features of EAism. And they largely amount to subsidizing white collar worker wages. I’m sorry but there’s no other way to slice it: these are all jobs largely aimed at giving money to researchers, PhD-holders, university-adjacent-persons, think tanks, etc. That may be fine stuff, but the whole pitch of effective altruism is that it’s supposed to bypass a lot of the conventional nonprofit bureaucracy and its parasitism and just give money to effective charities. But as EAism as matured into a truly unique social movement, it is creating its own bureaucracy of researchers, think tanks, bureaucrats… the very things it critiqued. Suppose an EA organization funded a cancer researcher to study some new drug, and that new drug was a perfect universal cure for cancer. Would Stone reject this donation as somehow impure, because it went to a cancer researcher (a white-collar PhD holder)? EA gives hundreds of millions of dollars directly to malaria treatments that go to the poorest people in the world. It’s also one the main funders of GiveDirectly, a charity that has given money ($750 million so far) directly to the poorest people in the world. But in addition to giving out bednets directly, it sometimes funds malaria vaccines. In addition to giving to poor Africans, it also funds the people who do the studies to see whether giving to poor Africans works. Some of those are white-collar workers. EA has never been about critiquing the existence of researchers and think tanks. In fact, this is part of the story of EA’s founding. In 2007, the only charity evaluators accessible by normal people rated charities entirely on how much overhead they had - whether the money went to white-collar people or to sympathetic poor recipients. EAs weren’t the first to point out that this was a very weak way of evaluating charities. But they were the first to make the argument at scale and bring it into the public consciousness, and GiveWell (and to some degree the greater EA movement) were founded on the principle of “what if there was a charity evaluator that did better than just calculate overhead?” In accordance with this history, if you look on Giving What We Can’s List Of Misconceptions About Effective Altruism, their #1 Misconception about about charity evaluation is that “looking at a charity’s overhead costs is key to evaluating its effectiveness”. This is another part of my argument that EA is more than just IPA++. For years, the state of the art for charity evaluators was “grade them by how much overhead they had”. IPA and all the great people working on evidence-based charity at the time didn’t solve that problem - people either used CharityNavigator or did their own research. GiveWell did solve that problem, and that success sparked a broader movement to come up with a philosophy of charity that could solve more problems. Many individuals have always had good philosophies of charity, but I think EA was a step change in doing it at scale and trying to build useful tools / a community around it. V. You could of course say AI risk is a super big issue. I’m open to that! But surely the solution to AI risk is to invest in some drone-delivered bombs and geospatial data on computing centers! The idea that the primary solution here is going to be blog posts, white papers, podcasts, and even lobbying is just insane. If you are serious about ruinous AI risk, you cannot possibly tell me that the strategy pursued here is optimal vs. say waiting until a time when workers have all gone home and blowing up a bunch of data centers and corporate offices. In particular terrorism as a strategy may be efficient since explosives are rather cheap. To be clear I do not support a strategy of terrorism!!!! But I am questioning why AI-riskers don’t. Logically, they should. I think if you have to write in bold with four exclamation points at the end that you’re not explicitly advocating terrorism, you should step back and think about your assumptions further. So: Should people who worry about global warming bomb coal plants? Should people who worry that Trump is going to destroy American democracy bomb the Republican National Convention? Should people who worry about fertility collapse and underpopulation bomb abortion clinics? EAs aren’t the only group who think there are deeply important causes. But for some reason people who can think about other problems in Near Mode go crazy when they start thinking about EA. (Eliezer Yudkowsky has sometimes been accused of wanting to bomb data centers, but he supports international regulations backed by military force - his model is things like Israel bombing Iraq’s nuclear program in the context of global norms limiting nuclear proliferation - not lone wolves. As far as I know, all EAs are united against this kind of thing.) There are three reasons not to bomb coal plants/data centers/etc. The first is that bombing things is morally wrong. I take this one pretty seriously. The second is that terrorism doesn’t work. Imagine that someone actually tried to bomb a data center. First of all, I don’t have statistics but I assume 99% of terrorists get caught at the “your collaborator is an undercover fed” stage. Another 99% get eliminated at the “blown up by poor bomb hygiene and/or a spam text message” stage. And okay, 1/10,000 will destroy a datacenter, and then what? Google tells me there are 10,978 data centers in the world. After one successful attack, the other 10,977 will get better security. Probably many of these are in China or some other country that’s not trivial for an American to import high explosives into. The third is that - did I say terrorism didn’t work? I mean it massively massively backfires. Hamas tried terrorism, they frankly did a much better job than we would, and now 52% of the buildings in their entire country have been turned to rubble. Osama bin Laden tried terrorism, also did an impressive job, and the US took over the whole country that had supported him, then took over an unrelated country that seemed like the kinds of guys who might support him, then spent ten years hunting him down and killing him and everyone he had ever associated with. One f@#king time, a handful of EAs tried promoting their agenda by committing some crimes which were much less bad than terrorism. Along with all the direct suffering they caused, they destroyed EA’s reputation and political influence, drove thousands of people away from the movement, and everything they did remains a giant pit of shame that we’re still in the process of trying to climb our way out of. Not to bang the same drum again and again, but this is why EA needs to be a coherent philosophy and not just IPA++. You need some kind of theory of what kinds of activism are acceptable and effective, or else people will come up with morally repugnant and incredibly idiotic plans that will definitely backfire and destroy everything you thought you were fighting for. EA hasn’t always been the best at avoiding this failure mode, but at least we manage to outdo our critics. VI. Stone moves on to animal welfare: It’s important to grasp that [caring about animals] is, in evolutionary terms, an error in our programming. The mechanisms involved are entirely about intra-human dynamics (or, some argue, may also be about recognizing the signs of vulnerable prey animals or enabling better hunting). Yes humans have had domestic animals for quite a long time, but our sympathetic responses are far older than that. We developed accidental sympathies for animals and then we made friends with dogs, not vice versa. Again, this is part of why I think it’s useful to have people who think about philosophy, and not just people who do RCTs. People having kids of their own instead of donating to sperm banks is in some sense an “error” in our evolutionary program. The program just wanted us to reproduce; instead we got a bunch of weird proxy goals like “actually loving kids for their own sake”. Art is another error - I assume we were evolutionarily programmed to care about beauty because, I don’t know, flowers indicate good hunting grounds or something, not because evolution wanted us to paint beautiful pictures. Anyone who cares about a future they will never experience, or about people on far off continents who they’ll never meet, is in some sense succumbing to “errors” in their evolutionary programming. Stone describes the original mechanisms as “about intra-human dynamics”, but this is cope - they’re about intra-tribal dynamics. Plenty of cultures have been completely happy to enslave, kill, and murder people outside their tribes, and nothing in their evolutionary mechanism has told them not to. Does Stone think this, too, is an error? At some point you’ve got to go beyond evolutionary programming and decide what kind of person you want to be. I want to be the kind of person who cares about my family, about beauty, about people on other continents, and - yes - about animal suffering. This is the reflective equilibrium I’ve landed in after considering all the drives and desires within me, filtering it through my ability to use Reason, and imagining having to justify myself to whatever God may or may not exist. Stone suggests EAs don’t have answers to a lot of the basic questions around this. I can recommend him various posts like Axiology, Morality, Law, the super-old Consequentialism FAQ, and The Gift We Give To Tomorrow, but I think they’ll only address about half of his questions. The other half of the answers have to come from intuition, common sense, and moral conservatism. This isn’t embarrassing. Logicians have discovered many fine and helpful logical principles, but can’t 100% answer the problem of skepticism - you can fill in some of the internal links in the chain, but the beginning and end stay shrouded in mystery. This doesn’t mean you can ignore the logical principles we do know. It just means that life is a combination of formally-reasonable and not-formally-reasonable bits. You should follow the formal reason where you have it, and not freak out and collapse into Cartesian doubt where you don’t. This is how I think of morality too. Again, I really think it’s important to have a philosophy and not just a big pile of RCTs. Our critics make this point better than I ever could. They start with “all this stuff is just common sense, who needs philosophy, the RCTs basically interpret themselves”, then, in the same essay, digress into: If I wanted to do this stuff, I would try terrorism.
Try spotting existential risk prevention on here. I don’t think Stone can claim that an EA version of this chart wouldn’t look phenomenally different. But then what’s left of his argument? III. Effective altruists devote absolutely enormous amounts of mental energy and research costs to program assessment, measurement of effectiveness. Those studies yield usually-conflicting results with variable effect sizes across time horizons and model specifications, and tons of different programs end up with overlapping effect estimates. That is to say, the areas where EAist style program evaluations are most compelling are areas where we don’t need them: it’s been obvious for a long time how to reduce malaria deaths, program evaluations on that front have been encouraging and marginally useful, but not gamechanging. On the other hand, in more contestable areas, EAist style program evaluations don’t really yield much clarity. It’s very rare that a program evaluation gets published finding vastly larger benefits than you’d guess from simple back-of-the-envelope guesswork, and the smaller estimates are usually because a specific intervention had first-order failure or long-run tapering, not because “actually tuberculosis isn’t that bad” or something like that. Those kinds of precise program-delivery studies are actually not an EAist specialty, but more IPA’s specialty. My second critique, then is this: there is no evidence that the toolkit and philosophical approach EAists so loudly proclaim as morally superior actually yields any clarity, or that their involvement in global efforts is net-positive vs. similar-scale donations given through near-peer organizations. The IPA mentioned here is Innovations For Poverty Action, a group that studies how to fight poverty. They’re great and do great work. But IPA doesn’t recommend top charities or direct donations. Go to their website, try to find their recommended charities. Unless I’m missing something, there are none. GiveWell does have recommended charities - including ones that they decided to recommend based on IPA’s work - and moves ~$250 million per year to them. If IPA existed, but not GiveWell, the average donor wouldn’t know where to donate, and ~$250 million per year would fail to go to charities that IPA likes. I think from the perspective of people who actually work within this ecosystem, Stone’s concern is like saying “Farms have already solved the making-food problem, so why do we need grocery stores?” (also, effective altruism funds IPA) I’m focusing on IPA here because Stone brought them up, but I think EA does more than this. I don’t think there’s an IPA for figuring out whether asteroid deflection is more cost-effective than biosecurity, whether cow welfare is more effective than chicken welfare, or figuring out which AI safety institute to donate to. I think this is because IPA is working on a really specific problem (which kinds of poverty-related interventions work) and EA is working on a different problem (what charities should vaguely utilitarian-minded people donate to?) These are closely related questions but they’re not the same question - which is why, for example, IPA does (great) research into consumer protection, something EA doesn’t consider comparatively high-impact. And I’m still focusing on donation to charity, again because it’s what Stone brought up, but EA does other things - like incubating charities, or building networks that affect policy. IV. Let’s skip farm animal welfare for a second and look at the next few: Global Aid, “Effective Altruism,” potential AI risks, biosecurity, and global catastrophic risk. These are all definitely disproportionate areas of EAist interest. If you google these topics, you will find a wildly disproportionate number of people who are EAist, or have sex at EAist orgies, or are the friends of people who have sex at EAist orgies. These really are some of the unique social features of EAism. And they largely amount to subsidizing white collar worker wages. I’m sorry but there’s no other way to slice it: these are all jobs largely aimed at giving money to researchers, PhD-holders, university-adjacent-persons, think tanks, etc. That may be fine stuff, but the whole pitch of effective altruism is that it’s supposed to bypass a lot of the conventional nonprofit bureaucracy and its parasitism and just give money to effective charities. But as EAism as matured into a truly unique social movement, it is creating its own bureaucracy of researchers, think tanks, bureaucrats… the very things it critiqued. Suppose an EA organization funded a cancer researcher to study some new drug, and that new drug was a perfect universal cure for cancer. Would Stone reject this donation as somehow impure, because it went to a cancer researcher (a white-collar PhD holder)? EA gives hundreds of millions of dollars directly to malaria treatments that go to the poorest people in the world. It’s also one the main funders of GiveDirectly, a charity that has given money ($750 million so far) directly to the poorest people in the world. But in addition to giving out bednets directly, it sometimes funds malaria vaccines. In addition to giving to poor Africans, it also funds the people who do the studies to see whether giving to poor Africans works. Some of those are white-collar workers. EA has never been about critiquing the existence of researchers and think tanks. In fact, this is part of the story of EA’s founding. In 2007, the only charity evaluators accessible by normal people rated charities entirely on how much overhead they had - whether the money went to white-collar people or to sympathetic poor recipients. EAs weren’t the first to point out that this was a very weak way of evaluating charities. But they were the first to make the argument at scale and bring it into the public consciousness, and GiveWell (and to some degree the greater EA movement) were founded on the principle of “what if there was a charity evaluator that did better than just calculate overhead?” In accordance with this history, if you look on Giving What We Can’s List Of Misconceptions About Effective Altruism, their #1 Misconception about about charity evaluation is that “looking at a charity’s overhead costs is key to evaluating its effectiveness”. This is another part of my argument that EA is more than just IPA++. For years, the state of the art for charity evaluators was “grade them by how much overhead they had”. IPA and all the great people working on evidence-based charity at the time didn’t solve that problem - people either used CharityNavigator or did their own research. GiveWell did solve that problem, and that success sparked a broader movement to come up with a philosophy of charity that could solve more problems. Many individuals have always had good philosophies of charity, but I think EA was a step change in doing it at scale and trying to build useful tools / a community around it. V. You could of course say AI risk is a super big issue. I’m open to that! But surely the solution to AI risk is to invest in some drone-delivered bombs and geospatial data on computing centers! The idea that the primary solution here is going to be blog posts, white papers, podcasts, and even lobbying is just insane. If you are serious about ruinous AI risk, you cannot possibly tell me that the strategy pursued here is optimal vs. say waiting until a time when workers have all gone home and blowing up a bunch of data centers and corporate offices. In particular terrorism as a strategy may be efficient since explosives are rather cheap. To be clear I do not support a strategy of terrorism!!!! But I am questioning why AI-riskers don’t. Logically, they should. I think if you have to write in bold with four exclamation points at the end that you’re not explicitly advocating terrorism, you should step back and think about your assumptions further. So: Should people who worry about global warming bomb coal plants? Should people who worry that Trump is going to destroy American democracy bomb the Republican National Convention? Should people who worry about fertility collapse and underpopulation bomb abortion clinics? EAs aren’t the only group who think there are deeply important causes. But for some reason people who can think about other problems in Near Mode go crazy when they start thinking about EA. (Eliezer Yudkowsky has sometimes been accused of wanting to bomb data centers, but he supports international regulations backed by military force - his model is things like Israel bombing Iraq’s nuclear program in the context of global norms limiting nuclear proliferation - not lone wolves. As far as I know, all EAs are united against this kind of thing.) There are three reasons not to bomb coal plants/data centers/etc. The first is that bombing things is morally wrong. I take this one pretty seriously. The second is that terrorism doesn’t work. Imagine that someone actually tried to bomb a data center. First of all, I don’t have statistics but I assume 99% of terrorists get caught at the “your collaborator is an undercover fed” stage. Another 99% get eliminated at the “blown up by poor bomb hygiene and/or a spam text message” stage. And okay, 1/10,000 will destroy a datacenter, and then what? Google tells me there are 10,978 data centers in the world. After one successful attack, the other 10,977 will get better security. Probably many of these are in China or some other country that’s not trivial for an American to import high explosives into. The third is that - did I say terrorism didn’t work? I mean it massively massively backfires. Hamas tried terrorism, they frankly did a much better job than we would, and now 52% of the buildings in their entire country have been turned to rubble. Osama bin Laden tried terrorism, also did an impressive job, and the US took over the whole country that had supported him, then took over an unrelated country that seemed like the kinds of guys who might support him, then spent ten years hunting him down and killing him and everyone he had ever associated with. One f@#king time, a handful of EAs tried promoting their agenda by committing some crimes which were much less bad than terrorism. Along with all the direct suffering they caused, they destroyed EA’s reputation and political influence, drove thousands of people away from the movement, and everything they did remains a giant pit of shame that we’re still in the process of trying to climb our way out of. Not to bang the same drum again and again, but this is why EA needs to be a coherent philosophy and not just IPA++. You need some kind of theory of what kinds of activism are acceptable and effective, or else people will come up with morally repugnant and incredibly idiotic plans that will definitely backfire and destroy everything you thought you were fighting for. EA hasn’t always been the best at avoiding this failure mode, but at least we manage to outdo our critics. VI. Stone moves on to animal welfare: It’s important to grasp that [caring about animals] is, in evolutionary terms, an error in our programming. The mechanisms involved are entirely about intra-human dynamics (or, some argue, may also be about recognizing the signs of vulnerable prey animals or enabling better hunting). Yes humans have had domestic animals for quite a long time, but our sympathetic responses are far older than that. We developed accidental sympathies for animals and then we made friends with dogs, not vice versa. Again, this is part of why I think it’s useful to have people who think about philosophy, and not just people who do RCTs. People having kids of their own instead of donating to sperm banks is in some sense an “error” in our evolutionary program. The program just wanted us to reproduce; instead we got a bunch of weird proxy goals like “actually loving kids for their own sake”. Art is another error - I assume we were evolutionarily programmed to care about beauty because, I don’t know, flowers indicate good hunting grounds or something, not because evolution wanted us to paint beautiful pictures. Anyone who cares about a future they will never experience, or about people on far off continents who they’ll never meet, is in some sense succumbing to “errors” in their evolutionary programming. Stone describes the original mechanisms as “about intra-human dynamics”, but this is cope - they’re about intra-tribal dynamics. Plenty of cultures have been completely happy to enslave, kill, and murder people outside their tribes, and nothing in their evolutionary mechanism has told them not to. Does Stone think this, too, is an error? At some point you’ve got to go beyond evolutionary programming and decide what kind of person you want to be. I want to be the kind of person who cares about my family, about beauty, about people on other continents, and - yes - about animal suffering. This is the reflective equilibrium I’ve landed in after considering all the drives and desires within me, filtering it through my ability to use Reason, and imagining having to justify myself to whatever God may or may not exist. Stone suggests EAs don’t have answers to a lot of the basic questions around this. I can recommend him various posts like Axiology, Morality, Law, the super-old Consequentialism FAQ, and The Gift We Give To Tomorrow, but I think they’ll only address about half of his questions. The other half of the answers have to come from intuition, common sense, and moral conservatism. This isn’t embarrassing. Logicians have discovered many fine and helpful logical principles, but can’t 100% answer the problem of skepticism - you can fill in some of the internal links in the chain, but the beginning and end stay shrouded in mystery. This doesn’t mean you can ignore the logical principles we do know. It just means that life is a combination of formally-reasonable and not-formally-reasonable bits. You should follow the formal reason where you have it, and not freak out and collapse into Cartesian doubt where you don’t. This is how I think of morality too. Again, I really think it’s important to have a philosophy and not just a big pile of RCTs. Our critics make this point better than I ever could. They start with “all this stuff is just common sense, who needs philosophy, the RCTs basically interpret themselves”, then, in the same essay, digress into: If I wanted to do this stuff, I would try terrorism.
Cause evaluation works the same way. Every year, I feel bad free-riding off GiveWell. I tell myself I’m going to really look into charities, find the niche underexplored ones that are neglected even by other EAs. Every year (except when I announce ACX Grants and can’t get out of it), I remember on December 27th that I haven’t done any of that yet, grumble, and give to whoever GiveWell puts first (or sometimes EA Funds).
June 24, 2024 · Original source
1: GiveWell is looking for a Head of Operations, probably someone with many years of leadership experience. Compensation $250K - 300K, remote work acceptable. See here for more details.
December 09, 2024 · Original source
GiveDirectly is a charity that gives money directly to poor families in Africa. GiveWell thinks they’re within an order of magnitude of the most effective charities in the world. You can learn more and donate here.
January 16, 2025 · Original source
The concept of IQ is fine, but you are personally miscalibrated about what low IQ means because the only very-low-IQ people in your training set had developmental disorders. I think these probably explain 5%, 5%, 40%, and 50% of the effect respectively, and I should have been more careful to emphasize (3), which I think explains 40% of the effect. The particular way I would flesh out 3 would be something like - if you’re illiterate and (somewhat) innumerate, you probably don’t have enough practice with symbols and complex mental operations to do even a “culture fair” IQ test like Raven’s Matrices. This doesn’t necessarily mean that your IQ is higher than the Raven’s Matrices says - the person who underperforms on Ravens for this reason will also underperform on a wide variety of other abstract/intellectual/symbolic tasks, and this is part of what IQ means. But it means that Raven’s IQ won’t predict concrete tasks as well as you would expect. Fujimura writes: The other major factor that I think should be reassuring about Lynn's estimates (and other cross-national IQ estimates) is that when you look at "non-problematic" sources that seem like proxies for IQ (e.g. World Bank data, educational performance), you see the same pattern as Lynn and others' IQ data. It's easy for people to quibble about each and every IQ measure (and so people do), but that we see the same pattern of results using otherwise uncontroversial data sources should be reassuring. Yeah, many people tried to gotcha me with claims that Lynn did this or that or the other thing wrong. Lynn tries to defend his methodology here, but I think (and tried to argue in the post) that at this point, that debate is of historical interest only - there’s too much confirmation now. One commenter brings up World Bank Harmonized Learning Outcomes as an example. Another points me to this preprint, which tries to update Lynn’s numbers using all modern standardized testing data and correlations with social development index and GDP. They find mostly similar numbers to Lynn: Malawi goes from 60 → 66, and new last place goes to Sao Tome & Principe at 62. This is by people affiliated with Lynn and scientific racism, and you can choose not to trust their judgment either, but I think at least the SDI correlations are an extremely simple regression that it would be hard to fake. This kind of stuff is why I think simple failures of data collection and analysis are unlikely to explain more than 5% of the gap with our common sense. There’s definitely something weird about these numbers, but it’s got to be more complicated than just “racist people screwed up the test”. But continuing on this subject - if IQ has two components, why would World Bank education data and GDP track the abstract/symbolic component of IQ, rather than the practical component of IQ? Or, rather, it’s obvious why this would happen in education. But why would GDP track abstract/symbolic rather than practical? One possible answer is that the causal pathway is high GDP → lots of education → lots of practice with abstract reasoning → high abstract/symbolic IQ. I don’t think this can be the whole story, because some countries that “cheated” to get high GDP (eg oil sheikhdoms) can’t translate it into IQ points at the same rate as everyone else. I’m stuck with the boring basic explanation that maybe you need to do a lot of abstract reasoning tasks to get high GDP. Harzerkatze writes: [Your claim that blacks everywhere should have the same genes] is far from true. While "white" may be a descriptor for a group of somewhat similar genetic backgrounds, having common ancestors not too far in the past, "black" is different, grouping populations of similar skin color, but common ancestors diverging way further back in time. Yeah, I didn’t want to get into all of this on the post, but I agree the way I phrased it was misleading. Lynn and other national IQ estimates find very low IQs for all sub-Saharan African countries - I mentioned Malawi at 60 in the post, but Nigeria, on the other side of Africa, is 69. Whatever is going on there is a pan-African problem, such that I don’t think differences between African groups are very relevant. US blacks are mostly descended from people in west Africa, eg Nigeria. Some people also brought up that US blacks have significant white admixture. This is true but it’s still not enough to be relevant to this discussion. If we assumed everything was genetic and US blacks with their ~20% white admixture had genetic IQ of 85, we would still expect African blacks to have IQ in the low 80s. However you parse it, there’s got to be some kind of health/education/environment effect going on there. Africa is extremely genetically diverse, but I think most of the countries measured in the paper, including Malawi, are some variety of Niger-Congo speakers, who I don’t think are that much more diverse than white people or anyone else. The really interesting African ethnicities, like the Khoi-San, don’t show up as much at a national level. Andrew Clough writes: Speaking of charity and IQ, the lowest of low hanging fruit is putting iodine in salt. You can donate to the Global Iodine Network like I do for the long term benefit of poorer countries without worrying you're just delaying Malthus's reemergence. Givewell calls Salt Iodization "slightly below the range of cost-effectiveness of the opportunities that we expect to direct marginal donations to" which in the grand scheme of things is quite good. Yeah, salt iodization is great. I had always heard of iodine related problems being concentrated in central Asia and especially Afghanistan, but looking at the map… (source) … sub-Saharan Africa is also a hot spot. I wonder what’s wrong in Cuba - this is exactly the sort of easily gameable metric I would usually expect them to be good at, or at least carefully faking. If you’re interested, you can donate to Iodine Global Network here. Bob Jacobs writes: > His opponents pointed out both his personal racist opinions/activities That's the mildest possible way you could've put it. He wasn't someone who had "personal racist opinions" that he kept as "personal racist opinions". He was the editor-in-chief of Mankind Quarterly, a white supremacist journal that was founded by people like: Henry Garrett an American psychologist who testified in favor of segregated schools during Brown versus Board of Education, Corrado Gini who was president of the Italian genetics and eugenics Society in fascist Italy, and Otmar Freiherr von Verschuer who was director of the Kaiser Wilhelm Institute of anthropology human heredity and eugenics in Nazi Germany. He was a member of the Nazi Party and the mentor of Josef Mengele, the physician at the Auschwitz concentration camp infamous for performing human experimentation on the prisoners during World War 2. Mengele provided for Verschuer with human remains from Auschwitz to use in his research into eugenics. It's funded by the pioneer fund, an organization he was a board member of and that has been classified as a white supremacist hate group, with one of its first projects being to fund the distribution in US churches and schools of "Erbkrank", a Nazi propaganda film about eugenics. He's not just called racist, he *is* racist, he even describes *himself* as a racist. No contesting any of this. MM writes: I spent 18 months in a country where people are supposed to have an iq of about 70, according to the map. My neighbors and friends were mostly non-literate. They did not seem less intelligent than the people I know in my current (US) neighborhood or the people I grew up with (in the US). Most of them would not have performed well on IQ tests, though. They'd never attended school and had no familiarity with puzzle-solving. This was 35 years ago and most people had not seen movies or even photographs. I remember sitting with one older woman and helping her interpret a black-and-white photograph: this is the arm, here's where it connects to the body, etc. It's hard for people from literate societies with tons of exposure to text & graphical representations to see the extent of the gap. Calvin writes: I have a decent amount of experience with the intellectually disabled, and saying "cognitive issues are only responsible for a small part of the [communication] deficit" is so wrong that it makes me question everything else in this essay. Trust me, even making allowances for poor hearing or difficulty forming words, the cognitive issues are responsible for 90% of the deficit. An IQ of 60 is really low and it's a significant handicap. I was concerned to hear this - I have a little experience with the intellectually disabled, but it didn’t involve knowing people’s exact IQ, so I’m not very well-calibrated here. Looking for more information, I found https://www.hrw.org/reports/2001/ustat/ustat0301-01.htm, which purports to describe the characteristics of very low IQ people, mostly in the context of criminal justice (where lawyers often try to use a client’s low IQ as a mitigating factor - ie maybe he didn’t truly understand that crime is wrong). The report says things like: Although all persons with mental retardation have significantly impaired mental development, their intellectual level can vary considerably. An estimated 89 percent of all people with retardation have I.Q.s in the 51-70 range. An I.Q. in the 60 to 70 range is approximately the scholastic equivalent to the third grade […] Although mental retardation of any degree has profound implications for a person's cognitive and social development, it is a condition which in many cases is not readily apparent. While some of the mentally retarded, such as those whose retardation is caused by Down's syndrome or fetal alcohol syndrome, have characteristically distinctive facial features, most cannot be identified by their physical appearance alone. Unless their cognitive impairment is unusually severe (e.g. an I.Q. below 40), persons with mental retardation may be thought of as "slow" but the full extent of their impairment is often not readily appreciated, particularly by people who have limited contact with or knowledge of them, including police, prosecutors, judges, and other participants in the criminal justice system. Many capital offenders with mental retardation did not have their condition diagnosed until trial or during post-conviction proceedings. And gave some examples (slightly out of order for this list): Oliver Cruz, who was executed in Texas on August 9, 2000, had an I.Q. that was measured variously at 64 and 76. Cruz nonetheless insisted to reporters that, although he was perhaps "slow in reading, slow in learning," he was not mentally retarded. Mitigation specialist Scharlette Holdman recalled a client who so successfully hid his retardation from his attorneys that he allowed them to sign him up for college-level calculus classes, which he could not comprehend. He had gone through much of his schooling allowing his younger sister to complete his homework for him. When he was given papers to read in connection to his case, he would carefully stare at them. If he was asked a substantive question, he usually responded, "I don't recall." Only when experts in retardation evaluated him and investigators reviewed his school records and spoke to his family did lawyers discover he had mental retardation and had been considered "slow" since his early childhood. Another capital defendant "hid his mental retardation for most of his life by working at a very repetitive job as a switcher on the railroad. He lied about finishing high school. He was actually in special education classes and did not finish the sixth grade. He was drafted into the army and discharged because of his mental retardation. He lied about his service record. He often made things up so that people would not suspect mental retardation." Morris Mason, whose I.Q. was 62-66, was executed in 1985 in Virginia after being convicted of rape and murder. Before his execution, Mason asked one of his legal advisors for advice on what to wear to his funeral As one psychiatrist testified about a capital defendant with an I.Q. of between 35 to 45: "[People with mental retardation try] to go along with people that they suspect are in authority. For example, I asked [the defendant] where we were when I saw him, and he obviously didn't know, so I asked him if we were in Atlanta and he said `Yes, we are in Atlanta.' In fact, we were in Birmingham, Alabama. I could have said New York and he would have said `Sure, New York' These people are obviously not going to win Nobels anytime soon. But even the guy with IQ 35 - 45 was still talking to people. I think this supports the thesis that intellectually disabled people without specific syndromes can seem pretty normal most of the time. (though keep in mind that anything from the court system should be treated with a grain of salt - defense attorneys have an incentive to exaggerate the intellectual disability of their clients in the hopes that it gets them a lighter sentence) Lyman Stone writes: Emil's post isn't correct, however. We know from the recent Reich lab paper on long-run genetic selection that there was strong selection for IQ in the neolithic revolution, which implies agriculture strongly selects for IQ and ability to plan. Malawians are 60-80% subsistence farmers. Even a "normal" low-IQ person cannot do the implied math and long-term planning involved in this kind of farming. And in fact, economists routinely find that African small-plot subsistence agriculture is actually highly optimized; farmers make very precise choices about where to plant which seeds, which fertilizer to use, etc. Key point is basically: it really isn't true that an IQ 60 person can run a farm functionally. Moreover, mean IQ of 60 implies large shares even lower, at ranges that are uniformly nonverbal even without specific disability. And this is why in the actual record-level NIQ database, they truncate estimates below 60, because even the database managers realize these estimates are crazy. See my post here: https://substack.com/home/post/p-154757665 We know that people with extremely low IQs in the Flynn sense must be capable of subsistence agriculture, because pre-Flynn Effect, most of the West had extremely low IQs, and they were all doing subsistence agriculture. How is this possible? Responding to Lyman’s comment, I wrote: I stick to the claim in this post - that our estimates for what a very low IQ means are poorly-grounded, and that people with low IQs can do some pretty impressive things, especially if they're concrete and part of a cultural transmission package. Maybe this is the Joseph Henrich "Secret Of Our Success" thing. We know that Malawians get poor test scores in school, so it seems like there's some disconnect between do-well-on-tests intelligence and run-a-subsistence-farm intelligence, and the abstract/concrete and novel/cultural distinctions are the best explanation that I can think of. You say that "the phenotype that arises from a given tested IQ in America is clearly vastly worse than the phenotype arising from the same tested IQ in Africa", which I basically agree with. I think part of it is the syndromes issue raised above, and part of it is that maybe Malawians have zero contact with the culture of abstraction that IQ tests come out of whereas even very uneducated Westerners have some contact with it, and maybe another part of it is that whatever health/nutrition issues the Malawians have preferentially harm faculties responsible for more abstract tasks rather than more concrete ones. For an opposite data point, when I was in Haiti, my boss told me (secondhand, no personal experience) of extreme difficulties working with Haitians, like that they couldn't alphabetize files even when that was explained to them. Many Haitains are also successfuly subsistence farmers, so I think this also supports some kind of heavy abstract/concrete distinction. I don't think we're really disagreeing, just agreeing on something like the correlations that make up IQ being less valid outside the normal range. Maybe one way to look at it is to go back to the claim from the justice system document above, saying that people with IQ in the 60s are the mental equivalent of third-graders. The third-graders I know are very into Pokemon, and have all sorts of opinions on how if you add X bonus to a Y strength fire-type Pokemon and then play Z combo, it will [commence six weeks of droning on about different Pokemon cards]. Is this the sort of math/reasoning/strategizing that we don’t expect someone with IQ 60 to be able to do? Does the fact that third-graders can do it mean that we’re miscalibrated? I’m not sure. The part of Lyman’s comment that gives me the most pause is his observation that, if the mean IQ is 60, a decent fraction of people must be 45, and a non-negligible portion 30. At this point, even third-grader comparisons don’t save us. I guess this is where I bring in the claim that IQ breaks down as a guide to practical living skills below some point. You can see several more layers of response between me and Lyman here, but I was especially grateful for him teaching me two things I didn’t already know: First, he corrected my misconception about Reich on ancient European cognitive evolution. Reich had said that pre-agriculture Europeans were “2-3 standard deviations” below moderns. I had interpreted that as IQ deviations of 15 points, making them genetic IQ 55-70, which would have been pretty crazy. Stone tells me he actually meant PGS deviations, each of which was about 3-4 IQ points, so he’s claiming that pre-agriculture Europeans had genetic IQ of 90 (they probably also had lower IQ for environmental reasons).,