<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Polls and Margins</title>
	<atom:link href="http://crookedtimber.org/2003/08/19/polls-and-margins/feed/" rel="self" type="application/rss+xml" />
	<link>http://crookedtimber.org/2003/08/19/polls-and-margins/</link>
	<description>Out of the crooked timber of humanity, no straight thing was ever made</description>
	<lastBuildDate>Mon, 13 Feb 2012 05:39:10 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: claxton6</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2352</link>
		<dc:creator>claxton6</dc:creator>
		<pubDate>Sun, 24 Aug 2003 18:10:41 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2352</guid>
		<description>&gt;I, and many other people, many of whom live in Vermont, have Call-Intercept or Call-Blocking or some such feature that systematically skews who is called in phone surveys.My experience with telephone surveys is largely in Nevada, which may be a little different from Vermont, but we only saw a very small number of households with Call-Intercept or Call-Blocking, and even among those households it was possible to get through to a household member.Of course, I think that presumes that you have a live person doing the calling, since you have to state who&#039;s calling. I think political polls do this, rather than automated dialling like telemarketers, but I don&#039;t know for sure.</description>
		<content:encoded><![CDATA[	<p>>I, and many other people, many of whom live in Vermont, have Call-Intercept or Call-Blocking or some such feature that systematically skews who is called in phone surveys.My experience with telephone surveys is largely in Nevada, which may be a little different from Vermont, but we only saw a very small number of households with Call-Intercept or Call-Blocking, and even among those households it was possible to get through to a household member.Of course, I think that presumes that you have a live person doing the calling, since you have to state who&#8217;s calling. I think political polls do this, rather than automated dialling like telemarketers, but I don&#8217;t know for sure.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: bigring55t</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2351</link>
		<dc:creator>bigring55t</dc:creator>
		<pubDate>Fri, 22 Aug 2003 03:35:35 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2351</guid>
		<description>Actually, despite all the math the solution lies in the realm of psychology. Kos works for the Dean campaign thus the careful wording is simply a  troll prophylactic (borrowed from Atrios) meant to head off  pointless accusations of unfairness.</description>
		<content:encoded><![CDATA[	<p>Actually, despite all the math the solution lies in the realm of psychology. Kos works for the Dean campaign thus the careful wording is simply a  troll prophylactic (borrowed from Atrios) meant to head off  pointless accusations of unfairness.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: kokomo</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2350</link>
		<dc:creator>kokomo</dc:creator>
		<pubDate>Wed, 20 Aug 2003 20:52:17 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2350</guid>
		<description>Given the data, there is a small chance that Dean and Kerry are tied, but a reasonable interpretation is that Dean is in the lead.  This is what Kos communicated.  The problem is an interesting one, but Kos&#039; statement is not a proper subject for the discussion.  </description>
		<content:encoded><![CDATA[	<p>Given the data, there is a small chance that Dean and Kerry are tied, but a reasonable interpretation is that Dean is in the lead.  This is what Kos communicated.  The problem is an interesting one, but Kos&#8217; statement is not a proper subject for the discussion.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: pathos</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2349</link>
		<dc:creator>pathos</dc:creator>
		<pubDate>Wed, 20 Aug 2003 20:24:31 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2349</guid>
		<description>I am surprised people are still doing phone polls.I, and many other people, many of whom  live in Vermont, have Call-Intercept or Call-Blocking or some such feature that systematically skews who is called in phone surveys.  I am guess that the more right-wing you are, the more likely you are to block/screen your calls.This is a new phenomenon, but it explains why the Republicans did so well in 2002, despite all polls showing that it would be much closer.I no longer put any faith in polls conducted over the telephone.  Might as well be an internet poll.</description>
		<content:encoded><![CDATA[	<p>I am surprised people are still doing phone polls.I, and many other people, many of whom  live in Vermont, have Call-Intercept or Call-Blocking or some such feature that systematically skews who is called in phone surveys.  I am guess that the more right-wing you are, the more likely you are to block/screen your calls.This is a new phenomenon, but it explains why the Republicans did so well in 2002, despite all polls showing that it would be much closer.I no longer put any faith in polls conducted over the telephone.  Might as well be an internet poll.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas Dent</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2348</link>
		<dc:creator>Thomas Dent</dc:creator>
		<pubDate>Wed, 20 Aug 2003 18:56:17 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2348</guid>
		<description>What Tim said. Maybe Kos got into the habit of thinking thatthe MOE means subtracting from one guy and adding to theother from looking at two-horse races. If we can assume thatthe distribution of &#039;undecided&#039;s is narrowly peaked andtheir number is uncorrelated with either of the two candidatesthen going to the MOE +4 for Kennedy means -4 for Nixon and vice versa. This is a rather tricky point since what one should be talking about strictly is a probability distribution over the entire multidimensional space of possible results adding up to 100%. Inevitably it doesn&#039;t always makes sense when you try to summarize it in a single MOE. Truman vs. Dewey vs. Thurmond was probably a case where quoting a single MOE would be misleading if you wanted to find the likelihood of the actual numbers being off by a certain number of points.And then you have the problem of Clark (1 percent) - with theMOE being +-4, this should mean that there is a large probabilityof Clark&#039;s actual percentage being negative! This piece of nonsense comes about because MOE assumes that the distributionsare Gaussian, but they can&#039;t be because the Gaussian extendsfrom minus infinity to plus infinity whereas the percentageresult is strictly between 0 and 100.And then you have the fact that MOE represents only the statistical random error, and you still have to contend with systematic biases,for example Dean supporters being more likely to agree to answer the poll because of a peculiar character trait that they are more likely to possess...If another poll with different methods comes out with similar numbersit will be much more clear that Dean has a lead.</description>
		<content:encoded><![CDATA[	<p>What Tim said. Maybe Kos got into the habit of thinking thatthe <span class="caps">MOE</span> means subtracting from one guy and adding to theother from looking at two-horse races. If we can assume thatthe distribution of &#8216;undecided&#8217;s is narrowly peaked andtheir number is uncorrelated with either of the two candidatesthen going to the <span class="caps">MOE </span>+4 for Kennedy means -4 for Nixon and vice versa. This is a rather tricky point since what one should be talking about strictly is a probability distribution over the entire multidimensional space of possible results adding up to 100%. Inevitably it doesn&#8217;t always makes sense when you try to summarize it in a single <span class="caps">MOE</span>. Truman vs. Dewey vs. Thurmond was probably a case where quoting a single <span class="caps">MOE</span> would be misleading if you wanted to find the likelihood of the actual numbers being off by a certain number of points.And then you have the problem of Clark (1 percent) &#8211; with the<span class="caps">MOE</span> being +-4, this should mean that there is a large probabilityof Clark&#8217;s actual percentage being negative! This piece of nonsense comes about because <span class="caps">MOE</span> assumes that the distributionsare Gaussian, but they can&#8217;t be because the Gaussian extendsfrom minus infinity to plus infinity whereas the percentageresult is strictly between 0 and 100.And then you have the fact that <span class="caps">MOE</span> represents only the statistical random error, and you still have to contend with systematic biases,for example Dean supporters being more likely to agree to answer the poll because of a peculiar character trait that they are more likely to possess&#8230;If another poll with different methods comes out with similar numbersit will be much more clear that Dean has a lead.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Tim Lambert</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2347</link>
		<dc:creator>Tim Lambert</dc:creator>
		<pubDate>Wed, 20 Aug 2003 17:54:54 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2347</guid>
		<description>I don&#039;t think you can work out the answer unless you know to what extent Dean and Kerry are competing for the same supporters.If total support for Dean and Kerry is fixed at 49% so that any increase for Dean is matched by a decrease for Kerry, then the 95% confidence interval for the difference is +/- 8% so that a 7% difference is not significant.On the other hand if they are not competing for the same voters (so that half the people will never vote for Kerry and the other half will never vote for Dean) then changes are independent and the 95% confidence interval for the difference is +/- 4sqrt(2) = +/- 5.6% and the difference is significant.Reality is going to be in between these two cases, so the answer is &quot;it depends&quot;.</description>
		<content:encoded><![CDATA[	<p>I don&#8217;t think you can work out the answer unless you know to what extent Dean and Kerry are competing for the same supporters.If total support for Dean and Kerry is fixed at 49% so that any increase for Dean is matched by a decrease for Kerry, then the 95% confidence interval for the difference is +/- 8% so that a 7% difference is not significant.On the other hand if they are not competing for the same voters (so that half the people will never vote for Kerry and the other half will never vote for Dean) then changes are independent and the 95% confidence interval for the difference is +/- 4sqrt(2) = +/- 5.6% and the difference is significant.Reality is going to be in between these two cases, so the answer is &#8220;it depends&#8221;.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Johnson</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2346</link>
		<dc:creator>Jeff Johnson</dc:creator>
		<pubDate>Wed, 20 Aug 2003 15:48:34 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2346</guid>
		<description>Ooops, my explanation of confidence level was misleading.  It&#039;s not necessarily true that exactly 5 out of every 100 polls will be inaccurate at 95% confidence.  That&#039;s only in the limit.</description>
		<content:encoded><![CDATA[	<p>Ooops, my explanation of confidence level was misleading.  It&#8217;s not necessarily true that exactly 5 out of every 100 polls will be inaccurate at 95% confidence.  That&#8217;s only in the limit.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Johnson</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2345</link>
		<dc:creator>Jeff Johnson</dc:creator>
		<pubDate>Wed, 20 Aug 2003 15:05:15 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2345</guid>
		<description>The end of my post disappeared.  It seems that the blogger doesn&#039;t like the less-than sign.  Anyway, I meant to say that there&#039;s an 85% probability that d is between 1% and 13%.</description>
		<content:encoded><![CDATA[	<p>The end of my post disappeared.  It seems that the blogger doesn&#8217;t like the less-than sign.  Anyway, I meant to say that there&#8217;s an 85% probability that d is between 1% and 13%.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Johnson</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2344</link>
		<dc:creator>Jeff Johnson</dc:creator>
		<pubDate>Wed, 20 Aug 2003 15:00:33 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2344</guid>
		<description>I found a z-table and did a few calculations.  Suppose we take the margin of error for Dean and Kerry&#039;s poll numbers to be +/-3% instead of 4%.  Since Dean got 28% and Kerry 21%, the difference here d=7%.  The margin of error for the difference would now be +/-6%.  Given our new margin of error and a sample size of 600, the confidence level would be about 85% instead of 95%.Thus, we might say that there&#039;s a 85% probability that 1% </description>
		<content:encoded><![CDATA[	<p>I found a z-table and did a few calculations.  Suppose we take the margin of error for Dean and Kerry&#8217;s poll numbers to be +/-3% instead of 4%.  Since Dean got 28% and Kerry 21%, the difference here d=7%.  The margin of error for the difference would now be +/-6%.  Given our new margin of error and a sample size of 600, the confidence level would be about 85% instead of 95%.Thus, we might say that there&#8217;s a 85% probability that 1%</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Johnson</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2343</link>
		<dc:creator>Jeff Johnson</dc:creator>
		<pubDate>Wed, 20 Aug 2003 14:09:34 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2343</guid>
		<description>The probability of any particular poll result is very low, given any assumption.  This is not how you want to think of the results.Suppose that the confidence level for the poll is 95%, which is fairly standard and seems to be compatible with the sample size and margin of error.  Now, in response to J. Michael Neal, when you&#039;re estimating the difference between two dependent variables, such as Dean&#039;s and Kerry&#039;s support, the margin of error for the difference is twice the margin of error for the individual variables, so a statistical tie would be within the confidence interval, because the margin of error for the difference would be +/- 8%.What a 95% confidence level means is that if you did 100 polls with the same sample size, 95 of the polls would give results within the margin of error of the actual number in the target population.  5 of the polls, however, would give results which are not within the margin of error of the actual number in the target population.  In other words, 5% of the time the polls are going to be dead wrong, even given the margin of error.Thus, as I think Amit Dubey was suggesting, in order to calculate the probability that Dean is not leading Kerry, you have to take into account, among other things, the possibility that the actual numbers are, for example, Kerry 75% and Dean 3%.</description>
		<content:encoded><![CDATA[	<p>The probability of any particular poll result is very low, given any assumption.  This is not how you want to think of the results.Suppose that the confidence level for the poll is 95%, which is fairly standard and seems to be compatible with the sample size and margin of error.  Now, in response to J. Michael Neal, when you&#8217;re estimating the difference between two dependent variables, such as Dean&#8217;s and Kerry&#8217;s support, the margin of error for the difference is twice the margin of error for the individual variables, so a statistical tie would be within the confidence interval, because the margin of error for the difference would be +/- 8%.What a 95% confidence level means is that if you did 100 polls with the same sample size, 95 of the polls would give results within the margin of error of the actual number in the target population.  5 of the polls, however, would give results which are not within the margin of error of the actual number in the target population.  In other words, 5% of the time the polls are going to be dead wrong, even given the margin of error.Thus, as I think Amit Dubey was suggesting, in order to calculate the probability that Dean is not leading Kerry, you have to take into account, among other things, the possibility that the actual numbers are, for example, Kerry 75% and Dean 3%.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Turnbull</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2342</link>
		<dc:creator>Doug Turnbull</dc:creator>
		<pubDate>Wed, 20 Aug 2003 13:49:36 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2342</guid>
		<description>Agree with the last post that you need to integrate your liklihood function. Plus, In some cases the liklihood function doesn&#039;t sum to 100% (not sure if this is such a case), so you&#039;d want to do the simulation for each possible result and then normalize to that value, which is a lot of work. The other thing that I wonder about is whether your simulation would give you a margin of error of 4%, or whether your assumptions give you a smaller margin than that--it&#039;s possible there are other systemic errors in the polling that increase the error margin above a true random sample. Trying another tack, using the 4% figure and assuming it&#039;s a sigma value (don&#039;t know how they define it), and assuming statistical independance of the Dean and Kerry numbers (certainly not true), then you get a 1/6 probability that Dean&#039;s numbers are 24% or below, and a 1/6 chance that Kerry&#039;s numbers are 25% or above. So you have a roughly 1/36 chance that both are true.Anyway, I agree with your underlying point that most people take margins of error and assume that they mean any number from the measured value +/- the MOE is equally likely, which is not how statistics or measurements work. It always bugs me when people bring out the &quot;statistically tied&quot; verbage, or some such, since it&#039;s just not true.</description>
		<content:encoded><![CDATA[	<p>Agree with the last post that you need to integrate your liklihood function. Plus, In some cases the liklihood function doesn&#8217;t sum to 100% (not sure if this is such a case), so you&#8217;d want to do the simulation for each possible result and then normalize to that value, which is a lot of work. The other thing that I wonder about is whether your simulation would give you a margin of error of 4%, or whether your assumptions give you a smaller margin than that&#8212;it&#8217;s possible there are other systemic errors in the polling that increase the error margin above a true random sample. Trying another tack, using the 4% figure and assuming it&#8217;s a sigma value (don&#8217;t know how they define it), and assuming statistical independance of the Dean and Kerry numbers (certainly not true), then you get a 1/6 probability that Dean&#8217;s numbers are 24% or below, and a 1/6 chance that Kerry&#8217;s numbers are 25% or above. So you have a roughly 1/36 chance that both are true.Anyway, I agree with your underlying point that most people take margins of error and assume that they mean any number from the measured value +/- the <span class="caps">MOE</span> is equally likely, which is not how statistics or measurements work. It always bugs me when people bring out the &#8220;statistically tied&#8221; verbage, or some such, since it&#8217;s just not true.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Amit Dubey</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2341</link>
		<dc:creator>Amit Dubey</dc:creator>
		<pubDate>Wed, 20 Aug 2003 13:27:42 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2341</guid>
		<description>Hi,You should not do this using a simulation.  The probability you got was too low because you also have to simulate all other combinations of them being tied, or Kerry beating Dean, then take the integral.  (This is the last step you were missing).What you want to do is to set up a decision rule testing if one mean really is bigger than the other, and then test the hypotheses.  Most introductory social science statistics texts should cover this.</description>
		<content:encoded><![CDATA[	<p>Hi,You should not do this using a simulation.  The probability you got was too low because you also have to simulate all other combinations of them being tied, or Kerry beating Dean, then take the integral.  (This is the last step you were missing).What you want to do is to set up a decision rule testing if one mean really is bigger than the other, and then test the hypotheses.  Most introductory social science statistics texts should cover this.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: J. Michael Neal</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2340</link>
		<dc:creator>J. Michael Neal</dc:creator>
		<pubDate>Wed, 20 Aug 2003 06:08:36 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2340</guid>
		<description>Then I believe that you have exceeded my statistical competance.  I&#039;ll get back to it when I have my degree in a couple of years.</description>
		<content:encoded><![CDATA[	<p>Then I believe that you have exceeded my statistical competance.  I&#8217;ll get back to it when I have my degree in a couple of years.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Brian Weatherson</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2339</link>
		<dc:creator>Brian Weatherson</dc:creator>
		<pubDate>Wed, 20 Aug 2003 02:23:08 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2339</guid>
		<description>If I ran the simulation correctly, it should have taken into account the fact that it&#039;s more probable that Kerry&#039;s vote is under-reported conditional on Dean&#039;s vote being over-reported. Indeed, if I just multiply the probabilities of Dean getting as high as 28 by that of Kerry getting as low as 21 (all conditional on them both really being at 24.5), the result is under 0.1%.I agree entirely that this isn&#039;t very meaningful 6 months out. I&#039;m just interested in the theoretical question because it&#039;s one that arises fairly frequently, and this looked to be a pretty extreme case.</description>
		<content:encoded><![CDATA[	<p>If I ran the simulation correctly, it should have taken into account the fact that it&#8217;s more probable that Kerry&#8217;s vote is under-reported conditional on Dean&#8217;s vote being over-reported. Indeed, if I just multiply the probabilities of Dean getting as high as 28 by that of Kerry getting as low as 21 (all conditional on them both really being at 24.5), the result is under 0.1%.I agree entirely that this isn&#8217;t very meaningful 6 months out. I&#8217;m just interested in the theoretical question because it&#8217;s one that arises fairly frequently, and this looked to be a pretty extreme case.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: J. Michael Neal</title>
		<link>http://crookedtimber.org/2003/08/19/polls-and-margins/comment-page-1/#comment-2338</link>
		<dc:creator>J. Michael Neal</dc:creator>
		<pubDate>Wed, 20 Aug 2003 01:53:11 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/wp/?p=159#comment-2338</guid>
		<description>Kokomo,No, I don&#039;t think that Dean and Kerry being tied actually is within the 95% confidence interval.  Either Dean being at 24% *or* Kerry being at 25% is, but not both.  This is a case where the very sloppy layman&#039;s use of &quot;margin of error&quot; is incorrect.</description>
		<content:encoded><![CDATA[	<p>Kokomo,No, I don&#8217;t think that Dean and Kerry being tied actually is within the 95% confidence interval.  Either Dean being at 24% <strong>or</strong> Kerry being at 25% is, but not both.  This is a case where the very sloppy layman&#8217;s use of &#8220;margin of error&#8221; is incorrect.</p>
 ]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced

Served from: crookedtimber.org @ 2012-02-13 06:18:58 -->
