Air war in Iraq
Not surprisingly, the publication by the Lancet of new estimates suggesting that over 600 000 people have died (mostly violently) in Iraq, relative to what would have been expected based on death rates in the year before the war, has provoked violent controversy. A lot of the questions raised about the earlier survey, estimating 100 000 excess deaths in the first year or so appear to have been resolved. In particular, the lower bound estimate is now around 400 000, so that unless the survey is rejected completely, there can be no doubt about catastrophic casualties.
One number that is striking, but hasn’t attracted a lot of attention is the estimated death rate from air strikes, 13 per cent of the total or between 50 000 and 100 000 people. Around half the estimated deaths in the last year of the survey, from June 2005 to June 2006. That’s at least 25 000 deaths, or more than 70 per day.
Yet reports of such deaths are very rare. If you relied on media reports you could easily conclude that total deaths from air strikes would only be a few thousand for the entire war. The difference between the numbers of deaths implied by the Lancet study and the reports that shape the “gut perceptions” that the Lancet must have got it wrong are nowhere greater than here. So are the numbers plausible?
I recall seeing only a handful of mentions of air strikes in the mainstream press. In checking my perceptions on this, I found this piece by Norman Solomon (linked by by Dahr Jamil) who notes that a search for “air war” produces zero results for the NYT, Washington Post and Times. Solomon refers to the earlier New Yorker article by Seymour Hersh who makes the same point.
The best source turns out to be the US Air Force Command itself. For October and November 2005, the US Air Force recorded 120 or more air strikes, and this number was on an increasing trend. Most of the strikes appear to be in or near urban areas, and the recorded examples include Hellfire missiles fired by Predators, an F-16 firing a thousand 20mm cannon rounds and an F-15 reported to have fired three GBU-38s, the new satellite-guided 500-pound bomb designed for support of ground troops in close combat. Typical reports of air strikes involve the destruction of buildings in which suspected insurgents are seen taking shelter, or from which fire has been reported. Obviously there is no opportunity to check whether such buildings are occupied by civilians.
An average of 10 fatalities for each air strike seems plausible. If we assume the average number of US plane and missile strikes for the year as a whole was 150 per month, that’s 18000 fatalities for 2005-06. Taking into account strikes by British and other allied forces and by attack helicopters (which seem to be used a lot, but are also rarely reported) it seems likely that Coalition air strikes killed more than 20 000 people in 2005-06.
That’s below the Lancet range of estimates, but in the same ballpark. To explain the gap, I’d suggest that it’s likely that the cause of death has been reported wrongly (or at least, inconsistently with official US accounts) in some cases. I’ve seen quite a few cases where Iraqis have blamed US air strikes for deaths, while the US authorities have denied that there were any strikes in the area and have blamed the deaths on insurgent mortar attacks. That seems to suggest that deaths attributed to air strikes may actually have been caused by artillery on one side or the other.
Based on the survey, and allowing for some misclassification, it seems likely that Coalition air and ground forces have killed between 100 000 and 200 000 people since the war began. The majority of these are military age males, most of whom would have been targeted as suspected insurgents, although we have no real idea how many actually were insurgents and how many were just in the wrong place at the wrong time. Around 70 per cent of all violent deaths in the Lancet survey were of military age males, and presumably the proportion would be higher for the Coalition since they are at least trying to avoid civilian casualties. But even if 80 per cent of those killed were insurgents, that would leave somewhere between 20 000 and 40 000 innocent civilians killed by Coalition forces so far. And of course, the figure also implies that even after 80 000 to 160 000 suspected insurgents have been killed, the situation is going backwards.
For more reactions, see Tim Lambert.
John,
What are your thoughts on the change in non-violent/violent deaths distribution between Lancet 1 and Lancet 2 (see Lambert and DD’s “Death Certifciates” post below for contributions from Mike H specifically)?
I’m no statistician (no, really I’m not) but Mike H’s observations have been out there now for well over 3 days and I’m not seeeing any credible rebuttals. Which, of course, doesn’t mean there are none.
At least for the current survey, it’s impossible to say whether non-violent deaths have risen or fallen. Unless the confidence intervals for non-violent deaths in the previous survey were very tight, I imagine the same was true in that case. Here are the point estimates and confidence intervals for the current survey.
Pre 5·4 (4·1–6·8)
03-04 4·5 (3·2–5·8)
04-05 5·0 (3·8–6·3)
05-06 6·9 (5·1–9·5)
Post-invasion 6·0 (4·8–7·5)
p value 0·523
None of the changes here would meet standard tests of statistical significance. On the other hand, both violent deaths and all deaths have increased significantly.
So, it doesn’t seem to me there’s a story here (unless checking back on the original survey produces something surprising).
Skeptics use the gut-check argument from incredulity a lot. The combination of foreign-language problems, the difficulties of collecting information in war time, and (perhaps most important) successful message control by the military (and self-censorship by the media) altogether mean that American intuitions about Iraq are pretty much worthless. (The idea that the gut is a usable source of information is a dubious one in any case, but especially here.)
A compounding factor is that many Bush loyalists and super-hawks would be happy enough if every single Iraqi were killed. They just don’t want the biased liberal media to report about thing like that.
I thought “air war” is when, say, the USAF and Royal Air Force are fighting Luftwaffe – and those Eye-raki fellas there don’t even have any anti-aircraft machine guns, let alone fighter jets. Isn’t it more like air slaughter?
Here are the non-violent deaths as proportions of the survey totals:
Lancet 1 count: including Fallujah (69/142 [49%]) or excluding Fallujah (68/89 [76%]). 988 households were visited.
Lancet 2 count: (247/547 [46%]). 1,849 households were visited.
If I’ve understood the complaint correctly, it’s this apparent drop in non-violent deaths from 76% of the count to 46% of the count that’s causing difficulty. The two studies used somewhat different methodologies; the sample size and period were different. Death certificates were asked for only in some of the Lancet 1 interviews. Given this, and given the specialism of the maths, I think it would take a statistician with access to population data to interpret these figures meaningfully (pace John Quiggin above; I was busy writing this in the meantime). And my gut feeling is that I don’t see any special reason to assume that non-violent deaths would scale with violent deaths. Perhaps infectious disease epidemics caused in part by degraded sanitation would do it: I don’t see any reason to expect an increase in heart disease, which was said to be the leading cause of death pre-war. And if news reports are to be believed, sectarian gun killings seem to be up; is there any reason for these to lead to a concomitant increase in non-violent deaths?
OK, enough of my speculation. Contrary to what some commenters are suggesting, Lancet 1 had this to say about the non-violent death rate:
It is surprising that beyond the elevation in infant mortality and the rate of violent death, mortality in Iraq seems otherwise to be similar to the period preceding the invasion. This similarity could be a reflection of the skill and function of the Iraqi health system or the capacity of the population to adapt to conditions of insecurity.
And Lancet 2 had this to say:
… families might have misclassified information about the circumstances of death. Deaths could have been over or under-attributed to coalition forces on a consistent basis. The numbers of non-violent deaths were low, thus, estimation of trends with confidence was difficult.
These studies have known limitations which the researchers freely admit to, but that doesn’t render them worthless. If you want to discern specific mortality trends at a higher level of accuracy, I think you need to lobby for a better survey. And if you don’t get that from your government, I think you should assume that the news from Iraq is bad.
“Isn’t it more like air slaughter?”
I thought aerial bombing of civilian areas by an occupying power was a war crime under the Hague Conventions.
What astonishes me is that with 800-900 attacks a week, and with two airstrikes a day why would anyone think that a lot of people were not dying?
Surely, if you were a fiscal conservative you ought to be livid with the administration if they were telling the truth: the huge amount of money being spent on ammunition, expensive air-strikes every day. And yet nobody is dead?
Doesn’t it just seem more logical to agree that lots of people are being killed? Precisely as intended when we drop bombs on them.
rupes: “What astonishes me is that with 800-900 attacks a week, and with two airstrikes a day why would anyone think that a lot of people were not dying?”
A) Because the level of violence has become background noise. Finding 50 tortured bodies is still (probably) front-page news in the US, but no longer headline.
B) Because they need to believe this – otherwise, they’d lose the last shred of argument that this administration is not pure, unadulterated scum.
c) Because they don’t realy mind if Ay-rabs/Sand N*ggers/Muslimocommunofascists/brown foreigners are killed in large numbers.
D) Because the destruction of Iraq as a country was and is a staple of Likkudnik policy.
Mr. Quiggin, I think the concern that brownie has raised is that the previous study had a vastly different proportion of deaths due to violence. I think the argument is that in the first study a very large proportion of excess deaths were due to non-violent causes, while the new study estimates pre invasion deaths to be largely due to violent causes. In response to Brownie I think the explanation is very simple. The original study had a very wide confidence interval, and in the cause of death data (which divides the main data set into smaller groups of counts) the confidence intervals would be even wider. The confidence interval around the number of non-violent deaths in the previous study is probably so wide that it cannot be considered to be statistically significantly different to the figures observed in the new study – i.e. while the point estimates look radically different, the imprecision of the old study prevents any conclusions from being drawn about this.
(I have read the critique of this aspect of the study, which was presented somewhere in comments to Tim Lambert’s blog I think, and this critique was constructed based on the point estimates of deaths only. One needs to consider the errors as well when comparing the old and new studies, and the old study had very wide confidence intervals).
The ony way to get around this problem would be to do a really big study – say, on the scale of the sorts of national surveys which occur in Australia, I think about 5000 households – so that all subgroup analyses (women vs. men, type of death categories by year, etc.) can be performed with narrow confidence intervals. If the governments of the coalition of the willing really cared about Iraqi lives they would have commissioned such a survey using these very techniques several years ago…
I expect that the administration is perfectly well aware that the Lancet numbers are closer to reality than their own. The most obvious explanation of the disparity between the estimates is not some statistical finesse but mere duplicity on the part of people who have been lying relentlessly for the past five years about a great many things.
I’ve posted this elsewhere, but it’s relevant here.
Over the centuries civil wars have caused hideous carnage. 600,000 Americans died in the US Civil war, more than died in World Wars 1 or 2.
Little Bosnia, with its population of 5 million, managed to slaughter 100,000 in its three year war.
England’s Civil War killed 10% of the populace at the time.
The Spanish Civil War, spookily like Iraq in many ways, killed maybe half a million.
Etc etc.
Given that all other major civil wars have indeed caused deaths in the hundreds of thousands, and killed up to 10% of the populace or more, why should Iraq be different?
Why should Iraq, unique amongs civil wars, have a death rate of only 0.2% of the population, 50,000 people? Rather than the normal 2-10%? Why should Iraq get away with only 50,000 dead, when, looking at history, you should expect 500,000? What is so specially peaceful about Iraq that makes it such a fortunate place?
Is it the lack of widespread violence? Is it because all sides are showing admirable restraint? Is it because the Americans are brilliantly precise in their bombing?
I repeat, we expect 2-10% of a population to die in a civil war, that’s what happens. Yet amazingly enough, according to the war mongers, Iraq has got away with an astonishing 0.2%!!
Truly Iraq is a blessed place. The Lucky Country.
John:
Lancet 2004 extrapolated 40,000 excess deaths from non-violent causes, and included them in their 98,000 excess death estimate.
Lancet 2006 erases all those deaths, and replaces them with violent ones.
If the presence of these 40,000 was statistically relevant in 2004, why is their absence in 2006 suddenly not?
Brownie,
Probably the main reason why the statisticians aren’t rushing to respond to Mike H is that this is really difficult to explain without getting into sampling theory. Robert made a good stab at it in comments at Tim Lambert’s:
http://scienceblogs.com/deltoid/2006/10/lancet_roundup.php
I notice that Mike H has responded to Robert in that thread and I think that’s where you are most likely to see an answer to his question. I will just comment on one of his remarks:
I thought mortality studies were as much about the “how” as they were the “how many.”
The more you want to know about precise cause, the larger your sample has to be.
I’ll go back to what I’ve been saying all along: who cares?
I’m willing to grant that people who are expert in statistics are going to want to vet this survey. But I think that they are a minor or vanishing factor in the debate we’re seeing right now.
The survey adds a little more detail to what we’ve known all along: Iraq has become a hellhole. A precise number with precise confidence intervals and rigorous methodology supposedly gives the report more credibility, but not really, because the topic has now been diverted to technical questions about the report rather than the substantive data (WAY TOO MANY PEOPLE KILLED).
The right’s compulsive-obsessive tics are closely tied to their kneejerk politics, and completely disconnected from any understanding of statistics, or of the facts on the ground.
So I’ll settle for 100,000—600,000 extra deaths, both numbers being well within the FAR TOO MANY range for this particular war.
12: Any chance of seeing some working for those figures? Perhaps a year by year breakdown, because after all, Lancet 1 covers a shorter time frame than Lancet 2. I’m not sure where you’re getting the ‘erases’ and ‘replaces’ from; they sound a bit rhetorical to me.
Charlie:
Deltoid link
Lancet 1 does cover a shorter time frame than 2, but there isn’t any similar study to compare Lancet 2 with for the period after Sep 04. On the other hand, we now have 3 studies (Lancet 1 and 2 and the UNDP survey) to evaluate estimates for the first 18 months
Kevin:
“The more you want to know about precise cause, the larger your sample has to be.”
And the largest mortality study done thus far is the UNDP effort. We know how its numbers stack up against Lancet 2 for violent (or “war-related” deaths), at least for year 1.
As I’ve said over at Deltoid, I can see allowing some latitude for discrepancy from study to study when it comes to the various subsets of non-violent and violent, but not the two main categories of death themselves. In order to be considered reasonably accurate, I think the studies have to be reasonably consistent in terms of violent and non-violent death.
Otherwise, this seems an awful lot like resurrecting tens of thousands of dead from study 1, only so you can kill them off again in study 2, but in a markedly different way, in order to balance their bottom lines
John E: I was thinking about your question, and my theory is that the Lancet study serves as a convenient outlet for the pro-war side. For the anti-war side, you’re right that the study only carries a tiny increment of information, so it’s hard to get that worked up about it. For the pro-war side, the news has been so relentlessly bad, that the Lancet study serves as an outlet for frustration. Taking someone’s boxing metaphor of the other day (was it yours?) they’re like boxers who are dead on their feet and don’t know how to stop hitting, but the Lancet study is the best thing they’ve found to punch in nearly two years.
“It is surprising that beyond the elevation in infant mortality and the rate of violent death, mortality in Iraq seems otherwise to be similar to the period preceding the invasion. This similarity could be a reflection of the skill and function of the Iraqi health system or the capacity of the population to adapt to conditions of insecurity.”
Charlie, I meant to address this earlier, but forgot.
Point taken. I’d forgotten about that statement from the first study.
That doesn’t change the fact that the study authors and defenders of the study attributed the increase in non-violent deaths over the baseline extrapolation (whether by infant mortality, accidents, slight increases in heart disease deaths, etc) to the effects of invasion. I spent many hours arguing with learned defenders of the study, their position being that this increase was entirely expected, given there was a war going on and all.
Now, Lancet 2 tells us that Iraqis were actually less likely to die from natural causes and acidents for the first 2 years of the war.
It doesn’t make any sense.
16: OK, so in the comments there you say this:
Recall that in the 2004 survey, the headline-grabbing 100,000 excess death figure owed much of its punch to the 40,000-plus excess deaths attributed to non-violent causes. The 2006 survey tells an exceedingly different tale of mortality in Iraq during the first 18 months of the war. Not only is all of the excess death toll in the 2006 survey the result of violence, it’s actually greater than the entire 112,000 excess death figure, because the death rate from non-violent causes is significantly less than the base line non-violent death rate. My rough math indicates an extrapolated violent death toll of more than 130,000 for the second survey, for the same time frame covered by Lancet 1.
Later in the same thread you quote Lancet 2:
Application of the mortality rates reported here to the period of the 2004 survey gives an estimate of 112,000 (69000 – 155000) excess deaths in Iraq in that period. Thus, the data presented here validates our 2004 study, which conservatively estimated an excess mortality rate of nearly 100,000 as of September 2004 …
I’m not seeing an explicit breakdown of this 112,000 figure in Lancet 2. It seems to be a point estimate total of all deaths for the period of the Lancet 1 study. The authors do, however, provide a table of mortality rates by category and by year on page 4. I think what you’re getting at is that the non-violent death rate there drops from 5.4 pre-war to 4.5 in 2003-4, and then rises again to around 6.9 for 2005-6 (I can’t find equivalent figures in my copy of Lancet 1 for comparison).
Are you saying that this means that the 112,000 figure in Lancet 2 consists entirely of violent deaths?
Bob McManus, apparently unaware that the Hague conventions were signed before the airplane was invented, writes:
I thought aerial bombing of civilian areas by an occupying power was a war crime under the Hague Conventions.
Anyway, the closest that international law comes to banning these types of attacks is in article 51 of Geneva Protocol 1, which bans “indiscriminate attacks…not directed at a specific military objective” or “which may be expected to cause incidental loss of civilian life…which would be excessive in relation to the…military advantage anticipated.”
So I don’t think that these kind of aerial attacks are war crimes, since they’re not indiscriminate.
Charlie,
Exactly. All 112,000 (nearer 130,000 if you include the 20,000 or so fewer non-violent deaths following the fall below pre-invasion levels) are being attributed to violence.
Again, I’m not interested in trying to debunk Lancet 2 even if I thought I could, but claims of mutual corroboration between Lancet 1 and Lancet 2 look a little tendentious.
John Quiggin, on the basis of an article that discusses air attacks at a rate of 35 to 120 per month, concludes that there have been an average of 150 attacks per month. What am I missing?
I also doubt the 10 dead per attack figure. It wouldn’t surprise me if most air attacks killed no one at all. Most military attacks simply miss. The US has shot several billion bullets in Iraq, but they haven’t killed a billion people.
On the other hand, the explanation that most of the Lancet-reported air deaths come from helicopters or artillery makes a lot of sense to me. The US likes to fight at night, and I find it hard to believe that someone who was sleeping when an attack occured has any idea if their loved ones were killed by planes, helicopters, mortars, or what.
“So I don’t think that these kind of aerial attacks are war crimes, since they’re not indiscriminate.”
Or to be more precise, they are not called ‘indiscriminate’ by the people who are doing the bombing.
The people who are actually being bombed might have different ideas.
I got the impression that in most cases it’s local informers settling scores with their personal enemies by means of US-taxpayers-provided ‘smart bombs’. So it’s not entirely indiscriminate.
“The US has shot several billion bullets in Iraq, but they haven’t killed a billion people.”
Well, yes, but it’s hard to imagine that you can drop three 500 pound bombs in a “close combat” situation in an urban area, and not kill anyone. As for the number of attacks, everything I’ve read suggests that the increasing trend to use air attacks, noted in the report, continued through 2005-06.
As regards Mike H’s comments, I think my response at #2 is perfectly adequate. This may be a bit hard to see if you don’t understand sampling theory, but the position is clear to anyone who does.
“Bob McManus, apparently unaware that the Hague conventions were signed before the airplane was invented, writes:”
Actually I was thinking of the Fourth Geneva Convention
“relative to the Protection of Civilian Persons in Time of War” (first adopted in 1949, based on parts of the 1907 Hague Convention IV)”
but I can’t find the relevant article, if it exists. Specifically, I had thought that an occupying power could not use aerial bombing in the pacification of an occupied territory. I am still looking.
“Are you saying that this means that the 112,000 figure in Lancet 2 consists entirely of violent deaths?”
To reiterate what Brownie says, yes. The authors also explicitly state that all of the 112,000 excess deaths extrapolated for Lancet 2. I can’t recall off the top of my head if the statement is in the study itself or the companion paper.
“As regards Mike H’s comments, I think my response at #2 is perfectly adequate. This may be a bit hard to see if you don’t understand sampling theory, but the position is clear to anyone who does.”
I think it’s anything but adequate, John. While I respect your statistical kung fu, I’m having trouble taking it seriously when applying it to the real world, in the context of comparing the two Lancet studies.
Perhaps if we were dealing with a much larger population, and a much larger per annum baseline mortality number, the variances would be easier to dismiss as blips.
But we’re not dealing with a baseline mortality estimate of a million people, we’re dealing with one in the range of 120,000 – 132,000. Juxtapose the swings from study to study between violent and non-violent deaths and we’re looking at numbers that are huge, half the baseline number (although admittedly for an 18 month period).
Insert ” excess ” before ” between violent and non-violent deaths, for comment 29, please.
Left that out.
John:
I’d also be interested in any thoughts you might have on my post to Kevin (17), in relation to the UNDP study, which was conducted using similar methodology to the Lancet studies, but with a sample size many times larger.
The difference between the two in terms of violence is dramatic.
29: While my grasp of the maths involved is minimal, reading around this subject suggests that the p-value quoted for non-violent death rates in Lancet 2 needs to be taken into account. It’s cited as 0.523. I think this may mean that there’s a high chance that the actual figure, if measured, would turn out to be significantly different than the sample.
I guesstimated roughly the same figures for air strike deaths—less than the Lancet number, but still significant.
The more I’ve thought about how you could carry out a study in Iraq under today’s violent conditions, the more skeptical I’ve become of how random the sampling was. It would have been foolhardly for researchers to pick neighborhoods out at random and start nosing around.
What I would have done if I was one of the Iraqi researchers assigned to do random surveys is call around and find neighborhoods where the Man-in-Charge (tribal leader, ayatollah, gang chieftan, whatever) wanted my presence. I’d tell the Americans we did it all randomly, but I’d actually only go where I was invited.
That might have a biasing effect on the results.
“I think this may mean that there’s a high chance that the actual figure, if measured, would turn out to be significantly different than the sample.”
There’s one sure fire way to find out, Charlie. We could ask the authors themselves what their data reveals as the most likely estimate for violent and non-violent deaths in the second study, covering the same time frame as the first.
I’m sure they have these numbers at hand, and it would be interesting to have them on the record.
I’m going to have another go at stating the significance of the p-value because I’m not happy with my last effort. A p-value of 0.52 means that if you used the same methods to sample a (hypothetical) series of identical populations, there’s a 48% chance that you’d observe a smaller difference than your study showed and a 52% chance that you’d observe a larger difference than your study showed. So in the case of Lancet 2, the authors are saying we may as well assume that all years for which deaths were reported were identical in terms of the non-violent death rate.
Compare this with the Lancet 2 p-value for the change in the rate of violent deaths: 0.0001. This states that there would be a 99.99% chance of observing a smaller difference than the study showed if you used the same methods on a (hypothetical) series of identical populations.
34: Mike, that sounds like a reasonable plan.
Mike H, before commenting any further you need to read a text on sampling theory, particularly in relation to confidence intervals and hypothesis testing. Charlie gets it exactly right at #35.
In the recent Israel-Lebanon War, “The IAF flew some 15,500 sorties, including some 10,000 fighter sorties, and attacked a total of around 7,000 targets.” Since Israel killed about 1,000 people by all means, I conclude that on average it takes 10 air attacks to kill 1 person, and that John Quiggin’s figures are inflated by a factor of 100.
Quiggin may find it “hard to imagine that you can drop three 500 pound bombs in a ‘close combat’ situation in an urban area, and not kill anyone,” but that’s exactly what happened most of the time in Lebanon.
On these Lancet discussions, the point has been made repeatedly that “gut feelings” are a poor way to form judgements about things outside our usual experience. Hence, I suggest that people hesitate a little before relying on what their gut tells them about the effects of airpower.
There are many potential biases in the study but how can you tell which way they point?
If Steve is right, the surveyors might either have gone to where someone had a particularly large collection of bleeding stumps making the results a high estimate or they might have missed out the most chaotic and dangerous areas altogether making the results a low estimate.
On the violent/non-violent death split issue, as I understand it the confidence intervals of the two surveys overlap. Likely for some value in the middle neither estimate looks unlikely. Constructing a confidence interval along similar lines provides a lower bound of 400,000 deaths above Saddam. It seems odd to think that the former numbers give cause for concern but not to conclude that since the invasion there have likely been 10,000 excess deaths a month. The confidence intervals are there precisely to address issues like this.
Asking for the data on the exact period doesn’t seem likely to do much beyond the figures John Q quotes and it is, unfortunately, not standard practice to provide all possible breakouts of survey data.
If anyone was better positioned to do a better job of counting the dead it would be the US governent and they have signally failed to provide any meaningful statistics on the matter.
Ragout, I think you have the burden of proof utterly reversed. We have data indicating a large proportion of casualties from airstrikes – although this probably includes helicopters and artillery. Is this possible? Well, yes – that’s Johns point. Lebanon is not a good template for several obvious reasons. Iraq is heavily urbanized while the warfare in Lebanon was concentrated in the southern villages – places, I’d add, where Israel strongly encouraged civilians to flee from. Bombs in cities kill more people than bombs in mostly empty villages do…
Ragout,
your estimate of deaths in the Israel Lebanon War may be low by a factor of two and the sorties are not of the same kind. The Israeli attacks were supposed to be based on excellent intelligence, aimed at destroying infrastructure mostly in evacuated areas and the count includes the massive clusterbombings of evacuated areas in the last days. By contrast the US strikes are not in evacuated areas, with the best will in the world not mostly based on excellent intelligence and mostly in a live combat support role mostly in urban areas.
Still, I will be sure to take arguments from gut feelings skeptically.
Ragout,
Regarding the Israeli “air sorties”.
There is a difference between “sorties” and “attacks” or strikes. In many sorties no attacks are carried out. CENTAF news releases apparently speaks of “weapons released” to refer to sorties where ordinance is fired. The news article referenced by John referred to “military strikes”. It’s unclear whether this sumarizes all air missions involving weapons release.
Whether the Israeli air force uses a similar terminology I don’t know. The lebanon war is probably not comparable to Iraq. The Israeli air force did bomb civilian infrustructure and towns, but the bulk of their missions were directed at suppressing rocket fire from Hizbullah and attempting to eliminate their deeply entrenched bunkers. They would repeatedly bomb hillsides trying to elimate fire. There has been no comparable air barrage in Iraq except for the beginning of the war.
That said, I agree that John had very little basis to hypothesize that a “military strike” would cause on average 10 deaths. Nevertheless he is correct to observe that there has been an increase in the air war in the recent year, which one assumes will have resulted in more casualties, intended and otherwise. Whether the Lancet study accurately capures this, I don’t know. Although in general the methodology is sound.
Ragout, there’s a factor of 10 error in your comparison. The Air Force reports around ten times as many sorties as air strikes.
There is no factor of 10 error in my figures. Four people take me to task for not distinguishing between sorties and attacks. But the sentence (!) I quoted made exactly this distinction: 15,500 sorties attacking 7,000 targets.
Assuming 700 dead from air attacks, that’s 10 air attacks to kill 1 person.
Marc, Jack, and Chew2,
Yes, there are differences between Iraq and Lebanon, but I doubt that the differences were as big as you say. We’re talking orders of magnitude here. Even if US air strikes were 10 times more deadly than Israeli ones, Quiggin is still off by a factor of 10.
Let me use my imagination about how a typical air attack goes down. First, US forces and Iraqi insurgents get in a firefight. When the airstrike called in by US forces arrives, the Iraqis have all taken cover in holes they’ve dug in the ground. These “foxholes” prove a very effective defense, and no Iraqis are killed. However, Iraqi fire is suppressed, as the Americans intended.
In contrast to air strikes, here’s a passage about artillery from Rick Atkinson’s book about the conventional war in 2003 that’s stuck with me:
… [The] earlier suggestion of additional air strikes now seemed more appealing. “What we ought to do is put a precision munition on him instead of walking artillery all over town.”
Ragout,
the Israelis were doing different things. Most of the Israeli targets were supposed to have been evacuated while most attacks in Iraq are supposed to be support for troops in combat, a very different proposition. You are also lowballing the death toll in Lebanon.
.. and what Marc said.
Ragout, the only 10 to 1 ratio in your original post was fighter sorties:deaths, so it was a bit hard to figure out that you actually meant targets:your unstated estimate of air strike deaths.
The fact that the USAF figures show a ratio of 10 missions to an air strike, whereas the Israeli figures are 2 sorties to 1 target reinforces the point that these figures simply aren’t comparable.
As others have stated, the Israelis were trying to destroy hillside rocket emplacements and infrastructure, and, in the latter case to avoid casualties as far as possible. The US use of airstrikes is specifically to kill suspected insurgents in urban areas, and the force protection doctrine clearly allows the destruction of buildings in which they are sheltering.
Ragout, your example doesn’t have an urban feel to it. Why would you assume that’s typical?
“Mike H, before commenting any further you need to read a text on sampling theory, particularly in relation to confidence intervals and hypothesis testing. Charlie gets it exactly right at #35.”
John, the study authors extrapolated 40,000 excess non-violent deaths for the 18 month period covered by Lancet 1. That figure represents a 22% increase over the baseline mortality figure for the 18 months.
If a figure that large isn’t ” statistically significant,” then perhaps the real issue here is whether statistical surveys are being given undue credence when measuring mortality in an environment like Iraq.
John:
Any response to my question directed to you in post 31?
As I said, Mike, before commenting any further you need to read a text on sampling theory, particularly in relation to confidence intervals and hypothesis testing.
Mike H #52 was a response to #50.
On #51/#31, you’re wrong. The UNDP study was based on media reports like IBC.
“As I said, Mike, before commenting any further you need to read a text on sampling theory, particularly in relation to confidence intervals and hypothesis testing.”
I disagree John, and this is looking more and more like a game of dodgeball. And you still haven’t answered my question posed in post 31. It’s a very straightforward request.
As I mentioned at Deltoid earlier tonight, in 2004 I debated knowledgable defenders of Lancet 1, who argued that the 40,000 excess non-violent deaths made sense, and were to be expected as the natural consequence of the country being placed into a war environment. Specific defences were provided to support increases in infant mortality, accidents, etc.
Now Lancet 2 erases these deaths, and replaces them with a much different cause of death, violence. I’m now being told by some of the same previously mentioned defenders of the study that the deaths they lobbied for in 2004 are no longer statistically relevant. Forget about them, I’m told. Go read a book on statistics, I’m told.
Well I think what I really need to do is get to my doctor, real quick.
I’ve had so much smoke blown up my ass over the last 3 days, my lungs probably look like those of two pack a day man.
“On #51/#31, you’re wrong. The UNDP study was based on media reports like IBC.”
Am I now? I could have sworn it was a cluster sample, many times larger than either Lancet study.
Mike H, the significance of an effect size is just not the same thing at all as the statistical significance of an effect. A study can, for example, indicate a very large effect, but in a way that gives us little reason to think that it wasn’t just random noise—big size but little statistical significance. And a study can give us very, very, very good reason to think that there’s a real effect, but that the effect in question is subtle to the point of near-nonexistence—small size but strong statistical significance.
This is a very common layperson’s error in mucking about with statistics: stats people give the word “significant” a special, technical meaning, and you can’t use your ordinary language intuitions about it in trying to suss out what should or shouldn’t count as significant.
Here’s another way to scale Israel in Lebanon. 6 weeks of air assault killed approximately 1,000 people. The Iraq war contains about 28 6-week intervals, so on a pure scaled up basis that would be 28,000 dead, which is in the ballpark of John’s estimate.
Philosopher:
Do you reject the assertion that the huge variance between violent and non-violent deaths has little statistical significance? If so, why? I’d also be interested in what your take is on the cause of the variance.
Which study is giving us the truest picture of mortality in Iraq? Is either? What would you base your choice on?
Sorry Philosopher, my first question should read ” do you reject the assertion that the huge variance between violent and non-violent deaths has statistical significance?
MH, apologies, I misread my source. There’s a UNDP-Lancet comparison here
In Lancet 1, 40% of the 98,000 ci mid-point deaths were attributed to non-violent causes. In Lancet 2, it’s 0%.
A p-value of 0.52 means that if you used the same methods to sample a (hypothetical) series of identical populations, there’s a 48% chance that you’d observe a smaller difference than your study showed and a 52% chance that you’d observe a larger difference than your study showed. So in the case of Lancet 2, the authors are saying we may as well assume that all years for which deaths were reported were identical in terms of the non-violent death rate.
Thanks, Charlie. Are there any conclusions we can draw about the probability that the number of non-violent deaths would fall by 40%? Does a p-value of around .5 not just mean that the figure has an equal chance of being higher or lower? In which case, the non-violent deaths could be even further into the negative than they already are (extrapolating from the pre-invasion baseline)?
At the very least – and even allowing for sampling theory explanations for the discrepancy – are the authors justified claiming mutual corroboration between the first and second studies if the percentage of deaths that can be attributed to non-violent causes has fallen more than 40% according to their own figures, however ‘safe’ they might be based on p-values?
JohnQ,
The UNDP ILCS report was a survey of over 21,000 households and not, as you say, based on media reports.
Sorry John – I cross-posted.
Mike H
I don’t know if this will help, but doing a quick browse to follow up on your points on UNHD survey, I found this http://iraqmortality.org/iraq-mortality
They discuss the differences in the surveys: the questions they asked, how they did the survey, etc (and yes, you are right it was a survey).
There are some big differences in the suveys about how things are classified, what was included, etc. But that link states that the results are consistent.
I apologise, I haven’t read through it all in detail but it might provide some more background.
The interesting analysis above is based on number of missions/strikes/sorties. I think there is a very complementary analysis to be done that is based on amount of bombs dropped, measured by weight.
I think there is reason to conclude that we have dropped more on Iraq than what was dropped on Germany in WWII. I think there are further comparisons which suggest that we could easily have killed 300,000 people just in airstrikes.
Please see detail here.
Also here.
Give me a break. You’re quibbling over factors of two (air attacks vs. sorties, or estimates of Lebanese deaths), when the figures suggest that Quiggin is off by a factor of 100. And I find your argument that there’s a vast difference between airstrikes targeting rockets in Lebanon and airstrikes targetting mortars in Iraq to be unconvincing.
In any event, Quiggin’s only justification for his estimate is that he “can’t imagine” dropping bombs that don’t kill anyone. As I’ve shown, that’s what happened 90% of the time in Lebanon.
By the way, this article puts the number of airstrikes in Iraq at about 300 a year in both 2004 and 2005. That’s a factor of 6 lower than Quiggin’s estimate.
Oh, and another thing. John Quiggin argues that because the US ratio of sorties to airstrikes is a lot higher than Israel’s, that means the figures for airstrikes aren’t comparable. But these figures just mean that the USAF flies a lot more non-combat missions than the IAF (transporting cargo, buzzing crowds in a “show of force,” intelligence-gathering). It certainly does not imply that US combat airstrikes are any more deadly than Israel’s, as Quiggin claims.
ragout
“this article puts the number of airstrikes in Iraq at about 300 a year in both 2004 and 2005”
That article is USA Today, 3/15/06:
But those figures are somewhat at odds with this (WaPo, 12/24/05):
That article I think is very relevant to this discussion.
Similar figures corroborated here (UK Times, 1/1/06):
I was surprised that the leading cause of violent death changes significantly from the 2004 Lancet study in the current Lancet study. Air strikes or artillery were the main cause of violent death in the post-invasion period in Lancet 1. In Lancet 2, gunfire is the leading violent cause of death, including the March 2003-April 2004 period covered by Lancet 1. Since coalition-attributed fatalities in Lancet 2 account for 31% of violent deaths, and air-strikes account for 13% of violent deaths and insurgents don’t have planes, that means that attributed airstrikes alone account for 42% of coalition-inflicted fatalities.
Quiggen’s point about the air war coverage is a bit off the mark. Air strikes account for 12-14% of violent deaths in all three post-invasion periods (see Table 4 in the study). If media reports of the causes of violent fatalities were proportionate to the surveyed causes, we should see no change in media reports about air strike fatalities from March 2003 to the present day. Car-bombing fatalities as a percentage of all fatalities increase monotonically across the three time periods, so we would expect to see more attention given to them. Also, reports tend to focus on mass-fatalities rather than individual engagements. Indeed, most reports about gun-fire combat fatalities arise from reports of US fatalities or discovered bodies. Other engagements are not reported.
Even if “other explosion/ordnance” and unknown causes from table 4 are added to airstrikes (which presumably would cover all not-vehicular IEDs), it is the May 2004-May 2005 period that sees 39% of fatalities due to possible airstrikes. The last period would have possible airstrike account for 26% of all fatalities in that period. Gunfire and car-bombs are still the main story, accounting for 71% of all deaths in the most recent period, and 69% overall.
“do you reject the assertion that the huge variance between violent and non-violent deaths has statistical significance?”
Yes, I am rejecting exactly that; without something confidence intervals, the point-values that you are focusing on cannot be meaningfully compared, no matter how large their difference. (since the confidence intervals might be bigger still).
Hi Rupes:
I’m aware of the site you and John linked to. I think you misunderstood my request of John and Kevin when you typed this:
“There are some big differences in the suveys about how things are classified, what was included, etc. But that link states that the results are consistent.”
Iraq Mortality.Org claims consistency between Lancet 1 and the UNDP. I’m not asking John and Kevin to explain the variances between Lancet 1 and the UNDP study. I’m asking this in relation to Lancet 2.
The difference in terms of violent death is enormous.
Iraq Body Count has come out with its own analysis of the latest Lancet paper. They think it’s rubbish. They’re polite about it, though.
I thought they made a number of very good points, but won’t rehash them here.
I’d rather rehash the one I’ve been making at Deltoid—
The UNDP survey found that the number of war-related deaths (supposedly excluding criminal murders, from what I’ve read) was 19,000 to 28,000 in the first 13 months. The latest Lancet paper happens to have the violent mortality rate for exactly the same stretch of time in Table 3. The 95 percent confidence interval is 1.8 to 4.9 per thousand per year. Assuming a population of 25 million, that’s 45,000 deaths per year at the bottom of the confidence interval, or about 50,000 for 13 months.
To get the UNDP estimate to agree with the lowest end of the confidence interval, you have to tack on 20-30,000 criminal murders. To hit the midrange figure (3.2 per 1000 per year, or about 80,000 deaths per year or 90,000 in the 13 months), you’d have to tack on about 60-70,000 murders.
I don’t think the UNDP and the second Lancet report are in agreement, and it’s not a question of me taking a subset of the data and overinterpreting it—the Lancet authors did the confidence interval for that time period themselves.
Iraq Body Count makes other arguments. I hope people look at them calmly and think about them—there’s been a bit too much talk about how the only reason one could have for doubting this 400-900,000 estimate is rightwing buffoonery or sheer ignorance, and I don’t think that’s quite right.
BTW, John Emerson, there’s an enormous difference between the war as portrayed by Iraq Body Count statistics taken at face value and what one sees in the Lancet report. If you ever bother to read IBC’s analysis of their own data, in most months of the war the number of civilian deaths that the media attributes to American action is extraordinarily low. In most months it is a few dozen people. In the third year of the occupation IBC could clearly attribute 370 civilian deaths to occupation forces. If one takes those numbers at face value, the US troops are doing an extraordinary job avoiding civilian casualties, especially when you couple such (wildly implausible) numbers with the number of insurgents Michael O’Hanlon claims we are killing—he said in late 2005 it was 760-1000 per month.
OTOH, if you take the Lancet numbers as gospel, Iraq is a Vietnam-style free fire zone.
I think the truth is somewhere in-between, but I don’t know where exactly. I think the media is clearly underreporting killing by our side, but don’t know by how much. But it seems to me that American citizens should very much want to know the order of magnitude of the killing our own troops are committing. Tens of thousands of civilians killed per year by our troops in reckless free-fire zone style killing is qualitatively different from a few hundred per year tragically killed by accident.
John:
No problem.
The difference between the UNDP figure of 24,000 war-related deaths (for one year) and the 130,000 plus excess violent deaths I believe Lancet 2 extrapolates for 18 months is rather large, don’t you agree? Given that the size of the UNDP study seems to provide it with a decided advantage over either of the Lancet studies in terms of precision, how would you describe its impact on the credibility of Lancet 2’s violent death figure?
I’d like to harken back to your original comment in this thread, because I missed something important in my initial reading of it:
“At least for the current survey, it’s impossible to say whether non-violent deaths have risen or fallen. Unless the confidence intervals for non-violent deaths in the previous survey were very tight, I imagine the same was true in that case.”
If you look at pg 7 of the companion article, under ” Non-violent death rates,” the study authors make this statement:
“Immediately post-invasion, the death rate due to non-violent causes dropped slightly, then stayed level for the next period, but began to rise in the period from June 2005 until June 2006.”
According to the authors’ own words, there was a decrease (albeit slight) in the rate of non-violent death for the 18 month period immediately after invasion. You’re right that it’s impossible to say if the non-violent rate has gone up or down over the entire length of the study, and the authors confirm that in the same paragraph I cite, but we’re talking about a 40 month period as opposed to an 18 month one.
The point estimate for non-violent excess deaths for the entire 40 months in Lancet 2 was 54,000. However, the non-violent excess death estimate from Lancet 1 was 40,000, over only 18 months. While the authors conclude the former number wasn’t large enough to assert an increase in non-violent mortality, the latter number evidently was big enough, and over a shorter period of time, to constitute an increase in the post-invasion mortality rate, and an increase in the excess death estimate.
BTW, so as not to mislead, IBC attributed about 10,000 civilian deaths to US forces in the first two years. But nearly 7000 of those died in the first two months (during the invasion) and roughly 2000 more died in the two separate attacks on Fallujah. What’s left over are months and months and months where it’s literally dozens of deaths per month caused by coalition forces, in comparison to many hundreds per month (in the first two years remember) caused by criminals or insurgents.
IBC also admits that in the third year there were thousands of deaths it couldn’t attribute to any one actor. But when you stick to what they can attribute, the overwhelming majority of violent deaths after April 2003 are caused by Iraqis (or the foreign jihadists).
Again, that’s a completely different war from the one depicted in the Lancet paper. It makes a difference if you’re an American citizen—it’s not just that the Iraq War was wrong, but it’s also a question of how many civilians our own forces are killing directly.
“Yes, I am rejecting exactly that; without something confidence intervals, the point-values that you are focusing on cannot be meaningfully compared, no matter how large their difference. (since the confidence intervals might be bigger still).”
Philosopher, I’m not sure what you mean in regard to ” without something confidence intervals…..” Did you miss a word?
I don’t want to come off like a snarkasaurus when I say this, philosopher, but I’m a little tired of the cherry-picking and goal post moving that seems to be going on with defenders of the study when it comes to point estimates and confidence intervals. Whenever a point estimate starts taking serious flak, someone pulls the pin on a confidence interval smoke grenade. When the point estimates are holding up well in a debate, defenders of the studies tout them as gospel.
You can’t have it both ways. The authors have gone on the record in both studies with numbers they cite as the most likely values, for all the categories that are of the most consequence. Why don’t we try to stick with them whenever possible?
Any response to my question in relation to which study you think is most accurate, and why?
On the number of strikes, the report I cite, direct from USAF gives more than 300 strikes in the three months Sep-Nov 2005, so there seem to be some big contradictions in reports from the same source.
Looking over the CENTAF reports, they cite 50 or so “close air support” operations per day fairly consistently – this includes reconnaissance and similar, but not transport operations, and there are always several cases of combat support reported. The reported operations in Iraq seem to include a lot less “expenditure of munitions” than those for Afghanistan where the total number of operations is smaller.
So, it seems pretty clear that, if the reporting is correct, air strikes are being used very differently in Iraq than in Afghanistan and certainly than in Lebanon. The descriptions I’ve seen suggest a higher degree of lethality.
These numbers don’t appear to include missile stikes or helicopters.
Re #73, it seems odd that roughly a strike (missile, bomb, or cannon-fire) per day (300 strikes divided by 3 months in Sep-Nov 2005) could produce the fatalities that the 2006 Lancet study finds. Of course, the problem is that we don’t have precise data on US strikes. Indeed, CENTCOM’s Air Component report seems to only cover fixed-wing strikes. For example, of an Apache helicopter strike (or multiple strikes) is not in the Air Component report.
An average daily air-strike death toll of 65 people seems a bit steep, but if near daily fixed-wing strikes and less frequent helicopter strikes are combined with more intense bursts of violence like the inital invasion and the Fallujha battles, then the Lancet figures seem more plausible.
David Kane, every time you raise your head in these forums it is to say that you have “asked for the data” and it wasn`t forthcoming, with the attendant insinuation of fraud or bloodymindedness. Anyone who has worked with these large survey samples knows that the organisations which collect them don`t just give up the data on a whim to anyone who comes along wanting to “replicate it”. They have serious ethics requirements which prevent them from doing that easily. You like to imply you are familiar with statistical analysis and data auditing, in which case you know this is true for almost any survey of this type, and the way you present their reluctance to share the data as evidence of suspicious dealing is quite disingenuous.
If, on the other hand, you have an actual criticism of the methods of the paper – and by “actual” I don`t mean “it disagrees with Dubya” or “that can`t be right!” or “it doesn`t concur with another study which I happen to believe is a fraudulent crock of shit” – then feel free to write a letter to the editor of the Lancet explaining your position. I`m sure they`ll be willing to publish whatever analysis you bring to bear on the substantive elements of the paper – i.e. the sample design, the sample weighting or the statistical comparisons. If you don`t have any argument with these elements of the paper then accept the figures presented and get over it.
Mike H, this issue of confidence intervals is not a smoke grenade or some other dodge, it`s an essential part of the consideration of the results. Given what you and Brownie have presented of the authors` statements in the conclusion one could assume that they have drawn too strong a conclusion about the comparability of the studies, but it is quite common for scientists to overstate the results of their own papers, and such an act in the discussion in no way invalidates the results of the paper. Choosing to disagree with their conclusion that the papers are comparable, and instead concluding that statistical error in the first paper prevents comparison, in no way invalidates the second paper (or the first).
Is it also not the case that in Lancet 1 they excluded the Fallujah cluster? Had they included it the confidence intervals for the violent deaths estimates would have been narrower, and the point estimate more like the point estimate from Lancet 2. In a way this whole discussion is driven by imprecision which the authors themselves introduced in order to make the paper less biassed. I think they should get some credit for that, regardless of whether you think their conclusions about comparability were too strong.
oops, I think I cross-posted.
Given what you and Brownie have presented of the authors` statements in the conclusion one could assume that they have drawn too strong a conclusion about the comparability of the studies,
I don’t think it’s fair to assume that. As Robert pointed out more than once, you can get a significant agreement between studies about the total number of deaths without getting a significant agreement about the total number of non-violent deaths in particular, and the authors never claim the latter.
I would suggest that, given the statistical insignificance of the change in non-violent deaths, one of the authors shouldn’t have later listed the breakdown of excess deaths by cause the way he did after Lancet 1, since apparently the error bars on each of those numbers was so huge as to make them totally uninformative. However, the published study didn’t make that claim.
Is it also not the case that in Lancet 1 they excluded the Fallujah cluster? Had they included it the confidence intervals for the violent deaths estimates would have been narrower, and the point estimate more like the point estimate from Lancet 2.
No, actually, if it’s included the confidence intervals become much larger and the point estimate’s well above Lancet 2. Lancet 2’s between the two estimates you get from Lancet 1, including vs. excluding Falluja.
As I said on another thread, that makes sense to me—Falluja did happen, and really should raise the death count, but the actual Falluja cluster data they got in Lancet 1 is insanely high and (as the authors themselves point out) may not accurately represent the entire 3% of Iraq’s population that cluster was supposed to cover.
Mind you, I have 1 class’ worth of stats knowledge, so whether it makes sense to me or not isn’t particularly important. I’m just saying it doesn’t seem like there’s an intuitively obvious discrepancy so huge we can accuse the authors of incompetence and/or fraud.
c.l.ball, your Apache link is broken
Mike H., confidence intervals are not “smoke grenades” or “kung fu” or any other form of mysticism. They are a basic advanced high school / college into course level statistical concept that you actually need to understand if you are trying to interpret results from random samples. The fact that you don’t seem to is not evidence that you are a no-bullshit, sensible, down-home sort of guy. It just means that your attempt to cherry-pick hopelessly imprecise numbers from subsamples and say that they are implausible is straight-up propaganda.
On the other hand, the stuff on the discrepancy between the UNDP results and the Lancet ones is important. Somebody needs to unpack the exact survey questions and sampling methods here. Steve Sailer’s point at 33 is also an interesting one, although I don’t know how one would follow up on it. Perhaps detailed questions to the surveyors? The authors did try to address this point somewhat in their writeup. Also, the bias that surveying “safer neighborhoods” would introduce into the study is totally unclear—it would seem to bias things toward lower, not higher casualty counts? Finally, Lancet 1 in particular was done before Iraq lapsed into total civil war.
I’d encourage everybody to read the Iraq Body Count press release on the latest Lancet study:
http://www.iraqbodycount.org/press/pr14.php
It’s sensible, from a non-warhawk source, and does raise many reasons to doubt the scale of the Lancet 2 results. They take no position on what if anything might have biased the study sample.
One thing for sure at this point: the initial (Lancet 1) estimate of 100,000 excess civilian deaths is looking not just plausible but actually pretty low. Even the IBC’s passive count based completely on press reports of violent deaths alone is approaching 50,000. And passive counts are almost guaranteed to be underestimates. I suppose a few years down the line the Lancet people will come out with an estimate of 1 million and we’ll all be arguing vociferously over whether it’s only 400,000.
Ragout is right that the 10 deaths per sortie is an unreasonable average.
I crunched the numbers for this a while ago for a few recent wars:
1991 Gulf War: 3,200 civilian deaths in 27,000 bombing sorties
Kosovo War: 500 civilian deaths in 13,000 bombing sorties
2001-2 Afghan War: 1,300 civilian deaths in 6,500 bombing sorties.
There’s all kinds of caveats to attach to this, but there’s as yet no historical basis to claim a civilian death rate due to bombing higher than 0.2 deaths per sortie. Assuming American intelligence were accurate enough to kill four times as many insurgents as civilians (a big assumption), your estimate would still be off by a factor of 10.
“Philosopher, I’m not sure what you mean in regard to ” without something confidence intervals…..” Did you miss a word?” Yeah, sorry, that was supposed to be “something like confidence intervals”, since there are other mathematical ways of getting at significance than the intervals per se. (There’s already been discussion on this thread about p-values, for example.) But nothing in this particular conversation is turning on that.
mq is right about how completely fundamental the distinction between the point-estimate and the confidence intervals are. I’m sure some defenders out there are sloppy about this, too—like I said, it’s a very common misunderstanding that people without any stats training make all the time, regardless of political persuasion. But all the people who seem to know anything about stats are putting a lot of emphasis on the CI, which is as it should be. The point-estimate is of course part of that—the CI is an interval around the point-estimate—and it is also of course easier to report, talk about, and throw around, but it is only part of the story, and a meaningless part without some indication of statistical significance.
Sometimes an extreme hypothetical case can help illustrate matters. Suppose someone did a truly awful survey of Iraqi mortality, in which they only check on 50 people; and with those 50 people, there had been 1 pre-invasion deaths and
4 post-invasion deaths. Even for the lower estimates, this is an entirely plausible kind of result for such a small sample—small samples are more hostage to the luck of the draw than large samples, which is why you want large samples when you do this right. Now, if we don’t check at all for significance, then we have something like a 20 per 1,000 pre-invasion death rate and a 80 per 1,000 post-invasion death rate. Multiply that difference by the population of Iraq, and you get something like
2.1 million excess deaths. Such a huge point estimate! But I hope that I have set this up so it is clear why the size of the point-estimate is simply no indicator whatsoever of whether you’ve actually found something statistically significant. In this case, the sample is so small, that the CI would be absurdly huge. Indeed, I suspect that if one quickly crunched the numbers, this wouldn’t even be a big enough difference, for samples so small, to indicate any statistically significant difference between pre- and post- at all, let alone one of such a magnitude. (It’d be pretty hard to guarantee representativeness in a sample that small, too, but that’s another matter.)
Moral of the story: the bigness of point-estimates or their differences is just no indicator whatsoever of any significance of the effect.
I’ve just looked back at the earlier Lancet paper, and if I didn’t miss anything (and, hey, it’s 3:30am where I’m at, so it might have slipped past me), then there’s something very telling in it, for the particular line of argument that Mike H. is running: the authors never make any claim to statistical significance for an increase in nonviolent deaths. On p. 1861, there is a specific claim about a “58-fold” increase in violent deaths, as well as the increases in the overall death rate—but I’m just not finding anything reported in there about specifically nonviolent deaths that has a CI attached to it. And it looks like Lancet II is the same in this regard. The closest they come to making any such claim is in the following from p. 5, but if you look closely you’ll note that no claims to significance are made there: “Excess mortality is attributed mainly to an increase in the violent death rate; however, an increase in the non-violent death rate was noted in the later part of the post-invasion period (2005–06). The post-invasion non-violent excess mortality rate was 0·7 per 1000 people per year (–1·2 to 3·0).” See? No CI reported, and on p.7 they explicitly say that this was not significant. (Did someone already say this part up-thread somewhere? My apologies for being redundant, if so.)
So, Lancet I makes no claims about nonviolent deaths increasing; and Lancet II likewise finds nothing of statistical interest to report about nonviolent deaths. So in this regard they are totally consonant with each other.
You have a reasonable point, Bruce, but air support seems to be used very differently in Iraq – a much smaller proportion of close air support missions use weapons, but they seem to be directed much more at killing insurgents at close quarters , rather than against dug-in positions. And most of this is taking place in or near urban areas, whereas that was true of only a small proportion of the attacks in the Gulf War, Kosovo and Afghanistan.
I concede that 10 is probably too high, but, given the differences, the value of 1 implied by your analysis (since the 1 to 4 ratio of clearly civilian to putative insurgent appears about right) is too low.
The data on missions vs strikes leads me to think misclassification is probably more important than I first thought. With 50 close air support missions per day by fixed-wing aircraft alone, it’s likely that there will be lots of cases where people are killed by artillery or even small arms but where it seems plausible to those on the ground to blame airstrikes.
There’s all kinds of caveats to attach to this, but there’s as yet no historical basis to claim a civilian death rate due to bombing higher than 0.2 deaths per sortie.
In Hiroshima 100,000 people were killed by one airstrike. So, it’s between 0 and 100,000.
Plus, once again, the Lancet study counts not only the immediate deaths, but also deaths from injuries caused by the airstrikes.
philosopher,
I appreciate your efforts to explain, but even for we non-statisticians the concept of a confidence interval is pretty well understood.
When Kaplan talked about a “dartboard” with respect to Lancet 1, he was ridiculed by supporters of the study who rushed to explain why 8,000 was not as likely a figure as 98,000 for projected excess mortality in Iraq. The net must be literally bursting with examples of comments and posts from sites like CT where the fundamentals were trotted out for the benefit of people like Kaplan. Similarly, it’s not difficult to find quotes from Horton or any of the Hopkins authors in which the 98,000 (more commonly referenced as 100,000) is mentioned as the best estimate we have for excess mortality.
Two years on and when some observers compare the figures in Lancet 1 with Lancet 2, we’re told that we ignore the confidence interval at our peril. Well, okay, let’s not ignore it, but let’s at least be consistent with the emphasis .
the authors never make any claim to statistical significance for an increase in nonviolent deaths.
It really isn’t the argument of those raising questions that the authors make this claim. The fact remains, whilst non-violent deaths account for 40% of the 98,000 point estimate for Lancet 1, they’ve dipped below zero (in that the rate has gone negative from the pre-invasion baseline) for Lancet 2’s 112,000 (or 130,000 extrapolated). Over 40,000 non-violent deaths have been replaced by an additional 60-odd thousand violent deaths.
Provide 99 statistics-based reasons how such a discrepancy may have come about, but then please go on to explain to me why the authors are still within their rights to say the studies are mutually corroborative?
The only sense in which the studies can be said to be mutually corroborative on the specific question of composition of excess deaths, is that the confidence interval’s in both overlap, which, as any fule no, is no corroboration at all.
This is hardly an unimportant point, given mutual corroboration is one of the strongest indicators that we can have faith in the results produced by similar studies. Which is probably why the authors both to claim corroboration at all.
that should be “bother” in the last sentence.
Regarding air operations in Iraq: Close air support sorties are those flown in direct combat support of troops. It’s in the nature of CAS that the aircraft will only fire (or indeed do anything rather than circling) if they get a call from the forward air controller with the troops, unless somebody has a Canadians-in-Afghanistan speed-freak brain storm, so the ratio of strikes to sorties will be low. I’d also point out that more of this will be going on in Iraq than Lebanon, as there are five and a half coalition divisions spread out over a wide area, more than twice the maximum number of troops the Israelis ever sent into Lebanon.
Also, the Israelis launched a quasi-strategic air offensive, which implies they picked a list of targets off the map and set out to bomb them – so, once discounted for the inevitable aircraft that go technical, find the target weathered in, fail to meet the refuelling tanker etc, you’d expect the two numbers to be similar.
IBC on the death certificates/Ministry of Health issue:
http://www.iraqbodycount.org/press/pr14/4.php
Second, the figure of 40,000 claimed as the number of deaths recorded by the MoH in 2002 is false. No specific citation is offered by the Lancet authors for this figure other than a vague attribution to “informed sources in Iraq”. But official Iraqi figures for 2002, forwarded to IBC courtesy of the Los Angeles Times, show that the Ministry registered 84,025 deaths from all causes in that year. This excluded deaths in the Kurdish-administered regions, which contain 12% or more of the population.
Thus, the actual MoH figure for 2002, even while excluding Kurdistan, stands at 70% of the estimate of 120,000 that, per the Lancet authors, “should have been recorded” nation-wide in 2002. It may (or may not, given its post-2004 casualty monitoring system) be true that the “ministry’s numbers are not likely to be more complete or accurate today”.
And by the way, just in case anyone needs convincing that IBC are not ‘warhawks’, as an earlier poster put it, try reading the last paragraph of their response to Lancet 2:
http://www.iraqbodycount.org/press/pr14/6.php
On 9/11 3,000 people were violently killed in attacks on the USA. Those events etched themselves into the soul of every American, and reverberated around the world. In December 2005 President George Bush acknowledged 30,000 known Iraqi violent deaths in a country one tenth the size of the USA. That is already a death toll 100 times greater in its impact on the Iraqi nation than 9/11 was on the USA. That there are more deaths that have not yet come to light is certain, but if a change in policy is needed, the catastrophic roll-call of the already known dead is more than ample justification for that change.
You can believe 655000 Iraqis have died.
But only if you believe it was paradise under Saddam.
The Lancet this week published a report claiming that 655,000 people have been killed in Iraq.
It is based on flawed research that would shame any publication, except one that had previously been responsible for giving Andrew Wakefields MMR “research” the oxygen of publicity.
The research is based on the pre war death rates under Saddam Husseins regime before the 2003 invasion, assessing those afterwards, and labelling the difference war related deaths.
Unfortunately the researchers have made two huge errors.
Their figures only stand up if adult Iraqis living under the Ba’athist regime hardly ever died. Either of natural or violent causes.
Secondly they included the data to undermine their conclusions in their own report.
The researchers claim the death rate in pre war Iraq was 5.5 per thousand and backed up this claim in a country of 25 million people with an analysis of 82 deaths. Thats a death rate, half that of the EU but perfectly possible in a “young” country with a high birth rate, low median age and good life expectancy.
Unbelievably the same research claims infant mortality was 10%.
5.5 deaths per thousand means that 137,500 Iraqis died each year.
10% infant mortality in a country of 34 births per thousand and 25 million means that 85,000 of these deaths were under one year old.
Leaving a death rate for Iraqis over one year old of 2.1 per thousand.
Less than half that of the lowest country in the world.
Paradise.
And on the subject of violent deaths pre invasion?
.
They drew the following conclusion “As there were few violent deaths in the survey population prior to the invasion all violent deaths can be considered “violent excess deaths”
82 deaths analysed.
Read that again. No violent deaths under Saddam.
Not even domestic violence. Prison works.
This may tell you something about the motivation and rigour of the analysis.
Or a desire for headlines.
The World Health Organisation gives a figure of 9.03 deaths per thousand in Iraq pre invasion.
A figure which ,put through the Lancet prism would have reduced its headline death toll by hundreds of thousands.
And the headlines by a few.
Tim
Also, the Apaches (and Cobras) belong to the Army (and Marine Corps), so don’t appear in the Air Component figures.
Secondly they included the data to undermine their conclusions in their own report.
That’s not an “error”. That’s called science.
They drew the following conclusion “As there were few violent deaths in the survey population prior to the invasion all violent deaths can be considered “violent excess deaths”
82 deaths analysed.
Read that again. No violent deaths under Saddam.
Not even domestic violence. Prison works.
Read that again. “Few violent deaths”. “Few” doesn’t mean “no”.
You, Sir, are an intellectually vacuous liar, or else functionally illiterate.
I should point out that we can expect “paradise under Saddam” to be the latest fast-breeding talking point.
alex,
for the purposes of their research they assume none.therefore.
” all violent deaths can be considered “violent excess deaths”
Alex,
That was a pitiful rebuttal. Do you want to try again?
But only if you believe it was paradise under Saddam.
Yes, paradise – or perhaps a well-run police state.
so well run that people hardly ever died.
unless they were under one year old.
Don Johnson, 71: That point is a reasonable one, and some of the statistics and sampling questions are reasonable too, but all reasonable points on these questions are being swamped, and that was inevitable. And the statistical arguing makes swamping easier, not harder.
That the Lancet attempt at precision was, in the context of this particular debate, doomed. (If they cut a few corners to make “the best case”, it was even more doomed, of course).Any report they made whatever would have received enough flak that it wouldn’t be regarded as credible by the media. And while I fully support the political purpose of the study and think that their release date was perfectly timed, I stillthink that their effort was futile.
The population able to understand statistics and willing to have their mind changed by a careful statistical argument on a highly contentious issue is small, and there’s so much junk statistcs out on various issues that the good statistics are automatically devalued (just because so few people really understand stats, and also because so many don’t trust anyone at all).
My opposition to the war is mostly based on its fraudulent justifications, strategic stupidity, and failure, rather than on the brutality or not of the American troops. Based on what you’ve said, Id expect to see lots of war supporters say that we need to “take the gloves off”, since we’re killing too few Iraqis.
89: You say:
It really isn’t the argument of those raising questions that the authors make this claim. The fact remains, whilst non-violent deaths account for 40% of the 98,000 point estimate for Lancet 1, they’ve dipped below zero (in that the rate has gone negative from the pre-invasion baseline) for Lancet 2’s 112,000 (or 130,000 extrapolated). Over 40,000 non-violent deaths have been replaced by an additional 60-odd thousand violent deaths.
I think the problem is that you can’t meaningfully do arithmetic with point estimates.
Charlie,
I think the problem is that you can’t meaningfully do arithmetic with point estimates.
Except, apparently, when you’re looking to use one point estimate in one study to corroborate another in another.
Are you saying that there is no statistical merit in the claims by the authors that the studies are mutually corroborative because the point estimates are in the same ballpark? If so, I’ll settle for that.
The second it bcomes legitimate to do the like for like comparison of point estimates and use this as corroboration, we surely are entitled to point out the glaring contradictions in the constitution of each?
Apple pies and steak pies are both pies, but I’d only put custard on the former.
You’re right, John Emerson, I’d expect a lot of rightwingers to want to “take the gloves off”. A number have already expressed that sentiment. Leaving aside the pro-genocide forces, the one sort of encouraging thing I’ve seen in this debate is that even some rightwingers (such as the editors at the Washington Times) think the death toll is at least 200,000. A week ago I would have thought it impossible to get anyone outside the far left to believe a number that high.
I’m a little perturbed at the near-invisibility of the issue I raised regarding the UNDP/ICLS confidence interval and the corresponding confidence interval in Table 3 of the latest Lancet paper. Only mq in response 82 seems to think it matters. If this latest Lancet paper is plausible, then some sort of explanation for the apparent discrepancy is needed.
103: Are you saying that there is no statistical merit in the claims by the authors that the studies are mutually corroborative because the point estimates are in the same ballpark? If so, I’ll settle for that.
I’m not competent to judge the study’s authors in that way, but it could be that their statement was incautious, yes.
It’s still the case that researchers have (twice now) been to Iraq and asked some of the people there how many in their households had been killed since the invasion. And they came back with some numbers. The problem we are all faced with is finding an explanation for those numbers. The choices here are, essentially:
– The death rate from violence is significantly higher than passive reporting suggests – The respondents were lying – The researchers committed a major error in their sampling methodologyAnd it’s possible that we could combine these explanations in some way.
There is no ‘jury out’ situation here. A criticism that doesn’t amount to more than a criticism of some of the rhetoric of the presentation does little to get you closer to understanding why the numbers are as they are; nor, crucially, does it make the numbers go away. In my view, without a plausible explanation for these numbers, concern is the appropriate response and hand washing should wait.
Let us get down to limiting cases. Baghdad makes up something over 20 per cent of the population, and 40-50 dead shot people are found daily. If the same pattern is national, that would be 160-200 sectarian war victims a day, before the usual 10-20 victims of dramatic guerrilla violence. 3 years, 7 months=1291 days=251,745, and that’s without any allowance for coalition activity or mass-casualty bombings – or for places like Fallujah, Ramadi or Diyala province.
And the pattern is national. Very much so. If you tot up Baghdad, Basra, Mosul, Kirkuk, Kerbala and Najaf out of the top 10 cities, all of which have severe violence, that gives you more than half the population – and we still haven’t included Samara, Baqubah, Fallujah, Ramadi or Diyala.
Very simply, the violence is where the people are.
There is no ‘jury out’ situation here. A criticism that doesn’t amount to more than a criticism of some of the rhetoric of the presentation does little to get you closer to understanding why the numbers are as they are; nor, crucially, does it make the numbers go away. In my view, without a plausible explanation for these numbers, concern is the appropriate response and hand washing should wait.
It’s a little deeper than that, Charlie. One of the strongest indicators we could have that Hopkins has things about right is if Lancet 2 validates and corroborates Lancet 1. I suggest that’s why the authors have claimed this to be the case.
You say it could be that such claims are “incautious” yet don’t believe you are qualified to judge. Well, if the basis for claiming corroboration was a mystery beyond the comprehension of all but statistical sciences PHDs, then you might have point, but the basis for such a claim is a 98,000 point estimate in Lancet 1 and 112,000 in Lancet 2 for the same period. They’re in the same ballpark, aren’t they? The point is that the composition of each figure is entirely contradictory, invalidating any claim of mutual corroboration. Over 40% of deaths from one cause in Lancet 1 have been replaced by the only other possible cause in Lancet 2.
The importance of non-corroboration cannot be overstated, as it immediately invites the conclusion that both cannot be right, which in turn means one must be wrong.
You can fill in the rest.
The point is that the composition of each figure is entirely contradictory, invalidating any claim of mutual corroboration.
This is wrong. If, in sample A, mortality has increased and the main cause is traffic accidents, while in sample B mortality has increased and the main cause is heart disease, the conclusion that mortality has increased is indeed reinforced. As to the main cause in the population as a whole, “further research is required” – a conclusion which appears in all too many studies.
Kevin,
The point of contradiction is not that Lancet 1 said the greater number of deaths was 30-40 year old shoemakers from Baghdad, whereas Lancet 2 claims it is 20-something plumbers from Mosul. The deaths have one of two causes: violence or non-violence. Even allowing for double weight of misattribution, to go from 40%+ non-violent deaths in Lancet 1 to sub-zero in Lancet 2 takes some doing if all bias has been avoided.
Moreover, that mortality has increased is not in dispute. If the authors used the findings of Lancet 2 and Lancet 1 to claim mutual corroboration for an increase in mortality and nothing more, you’ll get little argument from me or anyone else, I suspect. But they specifically went further than this. They highlighted a point estimate of 112,000 deaths for the first 18 months of the conflict and compared this with the 98,000 estimate from Lancet 1. They are explicitly claiming mutual corroboration not just for a rise in mortality, but in the estimation of that rise.
Given the finite number of causes of death is 2 (violent and non-violent) and not 22, the composition of both estimates is significant and where they are found to be contradictory – as they are – it is evidence against mutual corroboration on the specific question of point estimates.
If the authors want to row back on the corroboration claims about the point estimates for the first 18 months of the year, than I’ll accept that composition of those point estimates becomes less significant.
107: The importance of non-corroboration cannot be overstated, as it immediately invites the conclusion that both cannot be right, which in turn means one must be wrong. You can fill in the rest.
The corollary of ‘the studies don’t explicitly support each other’ is not ‘the studies disagree’. The degree of confidence you’d need to claim contradiction is unavailable, as we’ve discussed.
So while logical opposition between the studies, if it existed, would be good for your argumentative drive, I think the best you can claim is that one or more of the studies is incomplete. So more data is needed.
In the meantime, perhaps less of the rhetorical punchlining?
Hey!
With all due respect, I think something important is being ignored.
John: “The best source turns out to be the US Air Force Command itself.”
Yes, but I think you need to pay less attention to what they say about “airstrikes,” and more attention to what they say about tons of bombs dropped.
As far as I can tell, the above analysis is largely focused on number of aitstrikes/sorties, and relies on statistics released by the US. Trouble is, guns don’t kill people; bullets kill people. Airstrikes don’t kill people; bombs kill people. (I realize airstrikes deliver munitions aside from bombs, such as bullets, but I’m more concerned about the bombs.)
While the US does release certain vague figures regarding “airstrikes” (a term which I haven’t seen defined very clearly anywhere), it seems very, very reluctant to tell us the total tonnage of bombs dropped on Iraq.
Information can be found in daily “CENTAF airpower” summaries such as this, but I think the information is purposely vague, and never reveals tonnage, as far as I can tell (we see terminology such as “airstrikes,” “missions” and “sorties”). John cites Jamail’s article which takes a close look at those summaries. By the way, I’m puzzled by this statement of Jamail’s:
I can’t figure out where he got that from (“26 tons”). I also think the idea that Fallujah received only 26 tons “during the November 2004 siege of that city” is dramatically understated.
Anyway, as far as I can tell, the US has released such information (tonnage dropped on Iraq) exactly once, here:
I believe this statement has been used as the basis for reporting by Hersh, here:
(John, you point to Solomon, who points to Hersh, who points to the US press release. However, Solomon and Hersh leave out URLs, which I’ve provided.)
As far as I can tell, the Marines press release has otherwise been overlooked, in general, even though I think it is credible and unintentionally revealing. When that statement is analyzed (please follow the li