<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: If I&#8217;d only known&#8230;</title>
	<atom:link href="http://crookedtimber.org/2007/06/07/if-id-only-known/feed/" rel="self" type="application/rss+xml" />
	<link>http://crookedtimber.org/2007/06/07/if-id-only-known/</link>
	<description>Out of the crooked timber of humanity, no straight thing was ever made</description>
	<lastBuildDate>Sun, 22 Nov 2009 04:23:18 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: lees</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199865</link>
		<dc:creator>lees</dc:creator>
		<pubDate>Sun, 10 Jun 2007 09:44:22 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199865</guid>
		<description>In case you&#039;re not familiar with it, I&#039;ll mention the existence of a book, edited by Richard A. Seltzer, titled Mistakes That Social Scientists Make: Error and Redemption in the Research Process (New York: St. Martin’s Press, 1995). The chapters are retellings by various social scientists of some mistake they&#039;ve made while conducting research.</description>
		<content:encoded><![CDATA[	<p>In case you&#8217;re not familiar with it, I&#8217;ll mention the existence of a book, edited by Richard A. Seltzer, titled Mistakes That Social Scientists Make: Error and Redemption in the Research Process (New York: St. Martin&#8217;s Press, 1995). The chapters are retellings by various social scientists of some mistake they&#8217;ve made while conducting research.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: trane</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199859</link>
		<dc:creator>trane</dc:creator>
		<pubDate>Sun, 10 Jun 2007 07:29:29 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199859</guid>
		<description>And the (second or third) language skills you think are sufficient seldom are.</description>
		<content:encoded><![CDATA[	<p>And the (second or third) language skills you think are sufficient seldom are.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: trane</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199858</link>
		<dc:creator>trane</dc:creator>
		<pubDate>Sun, 10 Jun 2007 07:27:32 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199858</guid>
		<description>I second #29.</description>
		<content:encoded><![CDATA[	<p>I second #29.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Peter</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199798</link>
		<dc:creator>Peter</dc:creator>
		<pubDate>Sat, 09 Jun 2007 17:06:26 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199798</guid>
		<description>Robert Chambers, in his book &quot;Rural Development: Putting the Last First&quot;  (Longman, 1983), has a nice, intemperate discussion of the practical issues which impact the collection and analysis of social science research data, and which are usually ignored in textbooks.  His focus is on research in developing countries but the lessons apply more generally, particularly since the (sub-)culture of the researcher is almost never that of the research subjects.</description>
		<content:encoded><![CDATA[	<p>Robert Chambers, in his book &#8220;Rural Development: Putting the Last First&#8221;  (Longman, 1983), has a nice, intemperate discussion of the practical issues which impact the collection and analysis of social science research data, and which are usually ignored in textbooks.  His focus is on research in developing countries but the lessons apply more generally, particularly since the (sub-)culture of the researcher is almost never that of the research subjects.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Peter</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199782</link>
		<dc:creator>Peter</dc:creator>
		<pubDate>Sat, 09 Jun 2007 13:52:15 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199782</guid>
		<description>Make sure that your software licenses are up to date and in order. Don&#039;t wait for the last minute like so many folks do.

This one was one that my group dodged during one class, but every other group in the course was burned by it. During an IC design class, we completed the design work early in the semester (the other 2 wanted to take time off to visit family in the middle of the semester, so we had to start early). The software license for the design software expired about halfway through the semester, so that we were the &lt;em&gt;only&lt;/em&gt; group that managed to complete the design work that semester. Because the other groups waited until about the last month of the semester to start, the expired license wasn&#039;t rectified until after the semester was over (most serious CAE software expires annually). I&#039;m sure the EE dept was ticked off that most of the course (all but 3) got an Incomplete that semester. 

Like Sarapen, I also carry a notebook of some kind all the time. And like Daniel, I&#039;ve had some embarassing incidents horribly &lt;em&gt;underestimating&lt;/em&gt; how long it takes to perform some tasks.</description>
		<content:encoded><![CDATA[	<p>Make sure that your software licenses are up to date and in order. Don&#8217;t wait for the last minute like so many folks do.</p>

	<p>This one was one that my group dodged during one class, but every other group in the course was burned by it. During an IC design class, we completed the design work early in the semester (the other 2 wanted to take time off to visit family in the middle of the semester, so we had to start early). The software license for the design software expired about halfway through the semester, so that we were the <em>only</em> group that managed to complete the design work that semester. Because the other groups waited until about the last month of the semester to start, the expired license wasn&#8217;t rectified until after the semester was over (most serious <span class="caps">CAE</span> software expires annually). I&#8217;m sure the EE dept was ticked off that most of the course (all but 3) got an Incomplete that semester.</p>

	<p>Like Sarapen, I also carry a notebook of some kind all the time. And like Daniel, I&#8217;ve had some embarassing incidents horribly <em>underestimating</em> how long it takes to perform some tasks.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: leederick</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199781</link>
		<dc:creator>leederick</dc:creator>
		<pubDate>Sat, 09 Jun 2007 12:15:17 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199781</guid>
		<description>The most important thing is to try to arrange your study so you get a result which other scholars will find interesting no matter how the data or evidence comes out.

That can be tricky in some circumstances. If you are doing routine tests of drugs and you find your drug cures cancer, that&#039;s a real result - but if you find your drug doesn&#039;t cure cancer, people are going to be less interested. The outcome you get, and whether you end up on the front page of Nature or page 4568 of Annals of the Properties of Obscure Compounds: Series H, basically depends upon luck.

Social scientists have an easier time of it because the theories they look at are usually more complicated than whether something has a treatment effect or not. If you think hard enough about the problem, you can often arrange things so your data will say something new or interesting about some aspect of the theory you&#039;re trying to address however it comes out. And you get to say something worthwhile to say to your peers either way.

So my advice would be to be very wary of standard research design from the experimental sciences and what the statisticians you consult will say. Standard NP-hypothesis testing and study design will set you up for a situation where you are basically playing Russian Roulette for an interesting result. You&#039;re making a gamble, and if you lose and the results say accept the null or don&#039;t have a large enough effect size you&#039;re not saying anything interesting.

The other thing I would say is to be prepared to be methodologically eclectic when if comes to statistics. Statisticians can&#039;t agree on a theory of inference amongst themselves, so there&#039;s no reason you should feel obliged to stick to one. By all means set up a hypothesis test before you collect the data, but be prepared to throw that out the window and focus your study around exploratory data analysis, or post trial statistics, or Bayesian analysis if this will give you a more interesting result to publish.</description>
		<content:encoded><![CDATA[	<p>The most important thing is to try to arrange your study so you get a result which other scholars will find interesting no matter how the data or evidence comes out.</p>

	<p>That can be tricky in some circumstances. If you are doing routine tests of drugs and you find your drug cures cancer, that&#8217;s a real result &#8211; but if you find your drug doesn&#8217;t cure cancer, people are going to be less interested. The outcome you get, and whether you end up on the front page of Nature or page 4568 of Annals of the Properties of Obscure Compounds: Series H, basically depends upon luck.</p>

	<p>Social scientists have an easier time of it because the theories they look at are usually more complicated than whether something has a treatment effect or not. If you think hard enough about the problem, you can often arrange things so your data will say something new or interesting about some aspect of the theory you&#8217;re trying to address however it comes out. And you get to say something worthwhile to say to your peers either way.</p>

	<p>So my advice would be to be very wary of standard research design from the experimental sciences and what the statisticians you consult will say. Standard NP-hypothesis testing and study design will set you up for a situation where you are basically playing Russian Roulette for an interesting result. You&#8217;re making a gamble, and if you lose and the results say accept the null or don&#8217;t have a large enough effect size you&#8217;re not saying anything interesting.</p>

	<p>The other thing I would say is to be prepared to be methodologically eclectic when if comes to statistics. Statisticians can&#8217;t agree on a theory of inference amongst themselves, so there&#8217;s no reason you should feel obliged to stick to one. By all means set up a hypothesis test before you collect the data, but be prepared to throw that out the window and focus your study around exploratory data analysis, or post trial statistics, or Bayesian analysis if this will give you a more interesting result to publish.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: tom</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199776</link>
		<dc:creator>tom</dc:creator>
		<pubDate>Sat, 09 Jun 2007 09:54:05 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199776</guid>
		<description>Automate your data analysis. If you use SPSS this means learning syntax. You should be able to redo your whole analysis at the press of a button - invaluable when you realise you want to include/exclude/preprocess the data differently, or at review stage you just want to work out exactly how you got from the raw data to the graph on page 4</description>
		<content:encoded><![CDATA[	<p>Automate your data analysis. If you use <span class="caps">SPSS</span> this means learning syntax. You should be able to redo your whole analysis at the press of a button &#8211; invaluable when you realise you want to include/exclude/preprocess the data differently, or at review stage you just want to work out exactly how you got from the raw data to the graph on page 4</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Chris</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199768</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Sat, 09 Jun 2007 03:36:34 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199768</guid>
		<description>I suppose both of these are trivial too, but they&#039;re probably the most important lessons I&#039;ve learned over the years.

1.) When designing an experiment, keep in mind that it&#039;s probably not going to be the only one you need to run. Make sure you leave yourself directions to go next.

2.) Related to 1), when designing a study, try to imagine the criticisms you&#039;ll get. I usually think to myself, &quot;What would reviewers say is missing, or needs to be controlled, what follow-up experiments would they want to see, etc.&quot; Thinking about the flaws that other people will see is a good way of taking a step back and looking at a study with fresh eyes.</description>
		<content:encoded><![CDATA[	<p>I suppose both of these are trivial too, but they&#8217;re probably the most important lessons I&#8217;ve learned over the years.</p>

	<p>1.) When designing an experiment, keep in mind that it&#8217;s probably not going to be the only one you need to run. Make sure you leave yourself directions to go next.</p>

	<p>2.) Related to 1), when designing a study, try to imagine the criticisms you&#8217;ll get. I usually think to myself, &#8220;What would reviewers say is missing, or needs to be controlled, what follow-up experiments would they want to see, etc.&#8221; Thinking about the flaws that other people will see is a good way of taking a step back and looking at a study with fresh eyes.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Craig</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199765</link>
		<dc:creator>Craig</dc:creator>
		<pubDate>Sat, 09 Jun 2007 00:49:37 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199765</guid>
		<description>Your second (or third or ...) language skills are &lt;i&gt;never&lt;/i&gt; as good as you think they are.</description>
		<content:encoded><![CDATA[	<p>Your second (or third or &#8230;) language skills are <i>never</i> as good as you think they are.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Witt</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199739</link>
		<dc:creator>Witt</dc:creator>
		<pubDate>Fri, 08 Jun 2007 19:13:20 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199739</guid>
		<description>Another one I learned the hard way: Before you start the project, practice writing your final report. Seriously. I have found more gaps in my data collection by realizing that I wanted to be able to write a sentence that said &quot;The average participant got a loan of $3000, although the loan program is nationwide and the impact of that $3000 varied greatly....&quot; ooops. Maybe I ought to think about cost-of-living differences around the country and whether those need to be factored in.</description>
		<content:encoded><![CDATA[	<p>Another one I learned the hard way: Before you start the project, practice writing your final report. Seriously. I have found more gaps in my data collection by realizing that I wanted to be able to write a sentence that said &#8220;The average participant got a loan of $3000, although the loan program is nationwide and the impact of that $3000 varied greatly&#8230;.&#8221; ooops. Maybe I ought to think about cost-of-living differences around the country and whether those need to be factored in.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: RSA</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199724</link>
		<dc:creator>RSA</dc:creator>
		<pubDate>Fri, 08 Jun 2007 16:50:50 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199724</guid>
		<description>This may be either too obvious or too detailed, but one of my favorite papers from the 1990s is by Julian Faraway, on the &lt;a href=&quot;http://www.maths.bath.ac.uk/~jjf23/papers/cda.pdf&quot; rel=&quot;nofollow&quot;&gt;cost of data analysis&lt;/a&gt;.

&lt;i&gt;A regression analysis usually consists of several stages such as variable selection, transformation and residual diagnosis. Inference is often made from the selected model without regard to the model selection methods that preceeded it. This can result in overoptimistic and biased inferences.&lt;/i&gt;

The technical content is about regression analysis, but there are good general lessons as well.</description>
		<content:encoded><![CDATA[	<p>This may be either too obvious or too detailed, but one of my favorite papers from the 1990s is by Julian Faraway, on the <a href="http://www.maths.bath.ac.uk/~jjf23/papers/cda.pdf" rel="nofollow">cost of data analysis</a>.</p>

	<p><i>A regression analysis usually consists of several stages such as variable selection, transformation and residual diagnosis. Inference is often made from the selected model without regard to the model selection methods that preceeded it. This can result in overoptimistic and biased inferences.</i></p>

	<p>The technical content is about regression analysis, but there are good general lessons as well.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: SamChevre</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199715</link>
		<dc:creator>SamChevre</dc:creator>
		<pubDate>Fri, 08 Jun 2007 15:46:54 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199715</guid>
		<description>Another in the &quot;obvious, but often missed&quot; category: if your study could be applicable to a political controversy, be aware of the critiques made of studies by partisans on both sides.  Yes, some of them are ridiculous nit-picks--but often some very good, but non-obvious, points are well-known to partisanns and ignored elsewhere.  (For example, the wedge between &quot;wages&quot; and &quot;compensation&quot; has grown very fast in the last 20 years.)</description>
		<content:encoded><![CDATA[	<p>Another in the &#8220;obvious, but often missed&#8221; category: if your study could be applicable to a political controversy, be aware of the critiques made of studies by partisans on both sides.  Yes, some of them are ridiculous nit-picks&#8212;but often some very good, but non-obvious, points are well-known to partisanns and ignored elsewhere.  (For example, the wedge between &#8220;wages&#8221; and &#8220;compensation&#8221; has grown very fast in the last 20 years.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: joe</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199706</link>
		<dc:creator>joe</dc:creator>
		<pubDate>Fri, 08 Jun 2007 14:25:38 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199706</guid>
		<description>Greetings from a GWU colleague of Henry&#039;s.  Two points when working with quantitative data.

1. When entering data from surveys one has administered on one&#039;s own, don&#039;t worry excessively about how to code responses to questions.  Settle on a consistent coding scheme that captures the relevant variation in the data, and THEN when you are ready to analyze the data, let your software do the work of creatingn appropriate variables (e.g. dummy variables) by using logical statements to create new variables.

2.  Don&#039;t &quot;hardwire&quot; your data when you enter it.  Subject to memory capacity and similar constraints, keep the data in as flexible a form as possible, because as one begins to do the analysis, one may often find a need to create new variables, and this is easiest to do when data are not entered into a pre-existing format.  This can save time back-tracking later one.</description>
		<content:encoded><![CDATA[	<p>Greetings from a <span class="caps">GWU</span> colleague of Henry&#8217;s.  Two points when working with quantitative data.</p>

	<p>1. When entering data from surveys one has administered on one&#8217;s own, don&#8217;t worry excessively about how to code responses to questions.  Settle on a consistent coding scheme that captures the relevant variation in the data, and <span class="caps">THEN</span> when you are ready to analyze the data, let your software do the work of creatingn appropriate variables (e.g. dummy variables) by using logical statements to create new variables.</p>

	<p>2.  Don&#8217;t &#8220;hardwire&#8221; your data when you enter it.  Subject to memory capacity and similar constraints, keep the data in as flexible a form as possible, because as one begins to do the analysis, one may often find a need to create new variables, and this is easiest to do when data are not entered into a pre-existing format.  This can save time back-tracking later one.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Barry</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199699</link>
		<dc:creator>Barry</dc:creator>
		<pubDate>Fri, 08 Jun 2007 13:48:09 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199699</guid>
		<description>Eudoxis:
&quot;This is, undoubtedly, common knowledge, but it is one of those lessons that seem difficult to learn. Consult a statistician during project design, not after data collection, even if you know some statistics.&quot;

As a statistician, I really, really support this.  There&#039;s an old saying, &#039;post hoc is post-mortem&#039;, or &#039;when the statistician is consulted after an experiment is run, frequently the only thing that he can do is to confirm the cause of failure&#039;.

In addition, good data management will save a lot of time, and might make the difference between success and failure.  Having an idea of what variables are to be collected, how data is to be linked between/within subjects, times, locations, facilities, interviewers, etc., might save a person-year or so of re-do work.  .

Last:  if you&#039;re scaling up (e.g., from a single research/facility to multiple researchers/facilities), don&#039;t assume that you can do things in the manner which worked for smaller, simpler studies.  You wouldn&#039;t approach building a large house in the same manner as building a garden shed; it works similarly in research.</description>
		<content:encoded><![CDATA[	<p>Eudoxis:<br />
&#8220;This is, undoubtedly, common knowledge, but it is one of those lessons that seem difficult to learn. Consult a statistician during project design, not after data collection, even if you know some statistics.&#8221;</p>

	<p>As a statistician, I really, really support this.  There&#8217;s an old saying, &#8216;post hoc is post-mortem&#8217;, or &#8216;when the statistician is consulted after an experiment is run, frequently the only thing that he can do is to confirm the cause of failure&#8217;.</p>

	<p>In addition, good data management will save a lot of time, and might make the difference between success and failure.  Having an idea of what variables are to be collected, how data is to be linked between/within subjects, times, locations, facilities, interviewers, etc., might save a person-year or so of re-do work.  .</p>

	<p>Last:  if you&#8217;re scaling up (e.g., from a single research/facility to multiple researchers/facilities), don&#8217;t assume that you can do things in the manner which worked for smaller, simpler studies.  You wouldn&#8217;t approach building a large house in the same manner as building a garden shed; it works similarly in research.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Kieran Healy</title>
		<link>http://crookedtimber.org/2007/06/07/if-id-only-known/comment-page-1/#comment-199697</link>
		<dc:creator>Kieran Healy</dc:creator>
		<pubDate>Fri, 08 Jun 2007 13:31:07 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/2007/06/07/if-id-only-known/#comment-199697</guid>
		<description>&lt;i&gt;The very obvious one is draw lots of graphs.&lt;/i&gt;

Yes. Look at your data.</description>
		<content:encoded><![CDATA[	<p><i>The very obvious one is draw lots of graphs.</i></p>

	<p>Yes. Look at your data.</p>
 ]]></content:encoded>
	</item>
</channel>
</rss>
