<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health</title>
	<atom:link href="http://www.aspirenaturalhealth.com/blog/comments/feed" rel="self" type="application/rss+xml" />
	<link>http://www.aspirenaturalhealth.com/blog</link>
	<description>Using natural medicine to live a high quality life</description>
	<lastBuildDate>Sun, 18 Jul 2010 03:46:40 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>Comment on Believe it or Not!  Saturated Fat is NOT bad for you by Dr. Tim Gerstmar</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/282/comment-page-1#comment-287</link>
		<dc:creator>Dr. Tim Gerstmar</dc:creator>
		<pubDate>Sun, 18 Jul 2010 03:46:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/?p=282#comment-287</guid>
		<description>KC Craichy,
   Yes, unfortunately the saturated fat is bad theory, like a snowball rolling down a hill has gained a lot of speed and sucked up a lot of time and money to no avail.  Unfortunately so much ego is tied up in maintaining this bogus theory in the face of a wealth of scientific evidence that it looks like we&#039;re going to have to wait for a changing of the guard before this myth dies.  In the meantime check out some of the good resources, educate yourself and understand that the real problem seems to lie with an excess of omega-6 polyunsaturated fats, not saturated fat.

Wishing you the best,
Dr. Tim Gerstmar</description>
		<content:encoded><![CDATA[<p>KC Craichy,<br />
   Yes, unfortunately the saturated fat is bad theory, like a snowball rolling down a hill has gained a lot of speed and sucked up a lot of time and money to no avail.  Unfortunately so much ego is tied up in maintaining this bogus theory in the face of a wealth of scientific evidence that it looks like we&#8217;re going to have to wait for a changing of the guard before this myth dies.  In the meantime check out some of the good resources, educate yourself and understand that the real problem seems to lie with an excess of omega-6 polyunsaturated fats, not saturated fat.</p>
<p>Wishing you the best,<br />
Dr. Tim Gerstmar</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Destroying China (the Study that Is) by Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &#187; Blog Archive &#187; Destroying China (the Study that Is) &#124; myezpharmacy</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/310/comment-page-1#comment-272</link>
		<dc:creator>Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &#187; Blog Archive &#187; Destroying China (the Study that Is) &#124; myezpharmacy</dc:creator>
		<pubDate>Fri, 16 Jul 2010 10:52:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/310#comment-272</guid>
		<description>[...] Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &#187;... [...]</description>
		<content:encoded><![CDATA[<p>[...] Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &raquo;&#8230; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Believe it or Not!  Saturated Fat is NOT bad for you by KC Craichy</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/282/comment-page-1#comment-263</link>
		<dc:creator>KC Craichy</dc:creator>
		<pubDate>Thu, 15 Jul 2010 19:24:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/?p=282#comment-263</guid>
		<description>Over the years, saturated fats have become the dietary scapegoat for nearly everything.</description>
		<content:encoded><![CDATA[<p>Over the years, saturated fats have become the dietary scapegoat for nearly everything.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Measuring what really counts by Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &#187; Blog Archive &#187; Measuring what really counts &#124; myezpharmacy</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/313/comment-page-1#comment-259</link>
		<dc:creator>Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &#187; Blog Archive &#187; Measuring what really counts &#124; myezpharmacy</dc:creator>
		<pubDate>Thu, 15 Jul 2010 08:39:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/313#comment-259</guid>
		<description>[...] Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &#187;... [...]</description>
		<content:encoded><![CDATA[<p>[...] Dr. Gerstmar&#039;s Thoughts on Health, Happiness, and Well-Being from Aspire Natural Health &raquo;&#8230; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Want to be healthier? Live in a third world country on the equator by Dr. Tim Gerstmar</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/300/comment-page-1#comment-239</link>
		<dc:creator>Dr. Tim Gerstmar</dc:creator>
		<pubDate>Wed, 14 Jul 2010 17:00:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/300#comment-239</guid>
		<description>Hi Kids Health,
   Eating &quot;real&quot; food and avoiding fake &quot;food-like products&quot; is one of the most important things you can do for your health along with managing your stress levels, getting enough sleep at night, and regularly moving your body.  I&#039;d add making sure your vitamin D levels are optimal is cheap, easy, and likely profoundly important as well.

Thanks for your comment.

Best,
Dr. Tim Gerstmar</description>
		<content:encoded><![CDATA[<p>Hi Kids Health,<br />
   Eating &#8220;real&#8221; food and avoiding fake &#8220;food-like products&#8221; is one of the most important things you can do for your health along with managing your stress levels, getting enough sleep at night, and regularly moving your body.  I&#8217;d add making sure your vitamin D levels are optimal is cheap, easy, and likely profoundly important as well.</p>
<p>Thanks for your comment.</p>
<p>Best,<br />
Dr. Tim Gerstmar</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Want to be healthier? Live in a third world country on the equator by Dr. Tim Gerstmar</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/300/comment-page-1#comment-238</link>
		<dc:creator>Dr. Tim Gerstmar</dc:creator>
		<pubDate>Wed, 14 Jul 2010 16:58:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/300#comment-238</guid>
		<description>Hi Fermina,
   I&#039;m not sure what you mean by your comment.  Getting enough Vitamin D, while not absolutely 100% &quot;proven&quot;, is accumulating enough data to be one of the easiest, cheapest and single most useful things you can do for your health.  If you get your vitamin D through sensible sun exposure, it&#039;s free!
   If you meant something else by your comment, feel free to update it and let me know.

Best,
Dr. Tim Gerstmar</description>
		<content:encoded><![CDATA[<p>Hi Fermina,<br />
   I&#8217;m not sure what you mean by your comment.  Getting enough Vitamin D, while not absolutely 100% &#8220;proven&#8221;, is accumulating enough data to be one of the easiest, cheapest and single most useful things you can do for your health.  If you get your vitamin D through sensible sun exposure, it&#8217;s free!<br />
   If you meant something else by your comment, feel free to update it and let me know.</p>
<p>Best,<br />
Dr. Tim Gerstmar</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Destroying China (the Study that Is) by Dr. Tim Gerstmar</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/310/comment-page-1#comment-237</link>
		<dc:creator>Dr. Tim Gerstmar</dc:creator>
		<pubDate>Wed, 14 Jul 2010 16:53:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/310#comment-237</guid>
		<description>Hi Freelee,
   I am the first to admit that I am not a statistician, and I&#039;m sure there are things that could be done to improve Denise&#039;s critique which she freely admits.  The point, in my mind, is not that her critique is &quot;right&quot; but that it shows enough holes in Campbell&#039;s interpretation of the data to render it invalid.

   Thank you for your comments.

Best,
Dr. Tim Gerstmar</description>
		<content:encoded><![CDATA[<p>Hi Freelee,<br />
   I am the first to admit that I am not a statistician, and I&#8217;m sure there are things that could be done to improve Denise&#8217;s critique which she freely admits.  The point, in my mind, is not that her critique is &#8220;right&#8221; but that it shows enough holes in Campbell&#8217;s interpretation of the data to render it invalid.</p>
<p>   Thank you for your comments.</p>
<p>Best,<br />
Dr. Tim Gerstmar</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Destroying China (the Study that Is) by freelee</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/310/comment-page-1#comment-221</link>
		<dc:creator>freelee</dc:creator>
		<pubDate>Tue, 13 Jul 2010 11:43:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/310#comment-221</guid>
		<description>Denise conducted the analysis incorrectly and came to a heavily flawed conclusion.  Denise is a great writer however this is about science. Here is the correct procedure from a cancer epidemiologist.

Hi Denise,

As promised, I&#039;m posting my response to your email [yes, she emailed me] on your site.  You asked that I provide some tips on where to start and how to proceed.  BTW, you mentioned &quot;epidemiology secrets&quot; and I just want to say: no &quot;secrets&quot;!!  Epidemiology is just critical thinking, but with numbers.  It&#039;s no different from many other disciplines.  Maybe some time you can help me with writing (scientists are generally terrible writers, hehe).  

Note: I&#039;ve included some comments on what went wrong and how it can be corrected merely for demonstrative purposes - not at all malicious attacks, OK?  This is how we all learn after all.  In caps, I will highlight steps in the action plan for you.

STEP 0: Do a literature search.  I find it helpful to keep an excel spreadsheet with columns for author, title, journal, year, summary of paper, strengths of the study, weaknesses, and concluding remarks.  This is essential, as one shouldn&#039;t just blindly go into an analysis without having at least some background information on the subject matter.  No need to be an expert, but good to know what&#039;s already out there, and what needs to be done.  

1. Correlations:
For this discussion, the outcome will be colorectal cancer, since you used it on your post.  Similarly, the primary exposure of interest will be total cholesterol.  By by basing your conclusions on uncorrected correlations alone, you&#039;ve made a huge leap that doesn&#039;t have much ground to stand on.  The simple correlations are biased, as you yourself pointed out when evaluating total cholesterol, schistomiasis, and colorectal cancer.  As such, if you don&#039;t adjust for potential confounders via multiple regression, the association you observe is biased.  We almost always need to adjust for confounders, and this is very true in your case.

STEP 1: It&#039;s a good habit to evaluate the correlations between all exposures and also between all exposures and the outcome &lt;i&gt;at the individual level&lt;/i&gt;.  So, for *every* analysis you plan on doing, run create scatterplots for every X against X and every X against Y, using the *individual* data (where possible), and provide the correlation + 95% confidence interval for each.

STEP 2: Create histograms for every exposure of that is categoric and density plots (or you can create histograms with very narrow bars) for every exposure that is continuous.  This will tell you how the variables are distributed and what the appropriate summary statistics for them would be.  For example, if total cholesterol is not normally distributed (follow a bell curve) then *median* total cholesterol might be a better summary statistic then *mean* total cholesterol (good to know when you present descriptive statistics of the data you&#039;re using).  Sometimes it&#039;s useful to present different stats for a single variable.

2. Individual data vs. aggregated data:
You stated you didn&#039;t see much curvature, but keep in mind that you were presenting with aggregated data (eg. average total cholesterol for all individuals) instead of including individual-level data (the exposure and outcome for a single individual).  Consequently, there was a big loss in information, and you can&#039;t make accurate decisions on how to model your data if you plot aggregated data.  Related to this, your analysis was ecologic (used aggregated/grouped data) but you made individual-level conclusions when you used the term &quot;risk factor.&quot;  This is referred to as an ecologic fallacy - and it&#039;s just that.  A fallacy.  For example, all we can say based on your cholesterol-colorectal cancer example (the one that doesn&#039;t account for schistomiasis) is that the counties with higher mean total cholesterol tend to have higher incidence rates of colorectal cancer.  We can&#039;t make the leap to calling cholesterol a *risk factor* for colorectal cancer.

STEP 3: Don&#039;t aggregate your data in  your analysis.  Why?  You lose A LOT of information when you aggregate data and you can bias your results.  So keep that data at the individual-level.  For descriptive tables, by all means, aggregated data is necessary for obvious reasons.  But in your analysis, individual-level data when you&#039;ve got it is essential.  

3. The right regression model:
One of your outcomes was incidence rates of colorectal cancer.  When you do your analysis with individual-level data, with incidence rates of colorectal cancer as your outcome, linear regression = WRONG model.  Make sure you know which models to use and when.  To start - when modeling &quot;raw&quot; rates (case counts and person time), we almost always use Poisson regression, and often we need to account for overdispersion as well.  Get to know some of the other common regression models as well.    

STEP 4: Write out all of the primary exposures of interest you want to investigate and the corresponding outcome of interest and how you&#039;re setting up your outcome variable (are you interested in colorectal cancer *incidence rates*, *prevalence*, a simple yes/no the person has colorectal cancer?)

STEP 5: Write out what the appropriate regression model would be for the different analyses you plan to conduct.  

4. Confounders:
These are factors that are related to the exposure and the outcome of interest such that *not* adjusting for them will produce a biased association between exposure and outcome.  As you saw, schistomiasis might be a confounder.  And in fact, county might be too - and is actually upstream of schistomiasis in some sense, right?  Two confounders that almost *always* must be included in a model are AGE and SEX (provided your analysis isn&#039;t restricted to one sex).  This is especially true for chronic disease (eg. cardiovascular disease and cancer).  In this particular case, body mass index (BMI) would be very important to include as well.  County may also be important.  

STEP 6: For every analysis you do, write out all potential confounders you can think of and why.  You know the data better than I do as you&#039;ve worked with it extensively.  And, from STEP 0, you&#039;ll know your context.  

STEP 7: Write out *how* the confounders are related to the exposure and outcome.  Is the confounder protective (i.e. decrease risk) for the outcome?  Or is it a risk factor?  How is it associated with the primary exposure of interest?  This is where those scatterplots in STEP 1 come in handy!  The purpose of this is to give you an idea of *how* an observed association might be biased if you *don&#039;t* adjust for certain confounders.  It is tedious, but thorough and, like STEP 6, will allow you to approach your analyses with more contextual background.

5.  &quot;Cleaning&quot; and &quot;recoding&quot; your data:
Raw data is not *in and of itself* a bad thing.  It is simply the data in its original form.  But in order to be useful for analysis we often need to &quot;clean&quot; it and &quot;recode&quot; it.  When I say &quot;clean&quot; it, I mean setting up the *dataset* that is free (to the greatest extent possible) of unnecessary data (for example, if you&#039;re interested in ovarian cancer, you wouldn&#039;t include men), or mistakes (for example, if an individual in the data was coded as being a man with ovarian cancer, this is clearly wrong).  In this case, you might either omit it since you don&#039;t have a way to check which is correct or, based on other data for that individual choose to change &quot;man&quot; to &quot;woman&quot; or &quot;ovarian cancer&quot; to &quot;no ovarian cancer.&quot;  &quot;Recoding&quot; means setting up the *variables* to be useful.  For example, we might recode BMI in categories of underweight, normal, overweight, and obese rather than leave it as continuous.  Some variables may already be categoric, if the corresponding data were collected that way.

STEP 8: Clean your data.  You will likely need to set up multiple datasets.  

STEP 9: Write out *how* you&#039;ve cleaned your data.  (This is good record keeping.)

STEP 10: Recode your data.  This might include combining variables too.  

STEP 11: Create a &quot;data dictionary&quot; similar to the one on the Oxford site.  But in addition, include a description of how you&#039;ve coded your data (eg. 1=underweight, 2=normal, 3=overweight, 4=obese).  Again, good for record keeping, but also &quot;keeps you honest&quot; so others know how you set up your data.  This will often be apparent when you present your results, but not always.  It&#039;s a good habit to keep track of this, in any event. 

STEP 12: Replot all newly *categorized* variables against the outcome(s) of interest.  Why?  Because the categorized data may reveal non-linear relationships with the outcome (in fact, this is a strength of categorizing data - that we can account for some non-linear relationships).  For example, underweight might be a risk for something, whereas normal BMI is protective, while overweight and obese are a risk (&quot;U-shaped&quot;).  

6. Exploration of your data through descriptive statistics:
Almost all scientific papers start out with a &quot;Table 1&quot; which presents a description of the data.  It tells us things like What&#039;s the % of women and men in our data, What is the proportion of people with and without the exposure and with and without the outcome?

STEP 13: Create descriptive tables of all relevant variables.  This includes your primary exposure of interest, confounders, and outcome.  Obviously, you will have different tables for each analysis as you&#039;re interested in different primary exposures (cholesterol? meat? total caloric intake?) and outcomes (cardiovascular disease? colorectal cancer? bladder cancer?).  To save time, you might include all relevant exposures and confounders in rows, and cross-classify them with all outcomes of interest in columns.  

6.  Analysis: 
The fun part.  

STEP 14: Run your models.  Keep track of what you include in your models b/c oftentimes we will evaluate several models for each analysis depending on what&#039;s called &quot;fit statistics.&quot;  Since you are familiar with p-values and I assume interpretation of beta coefficients, use these to help inform you of which variables to include in your final model *within the context of the analysis at hand* (this is key - if you have reason to believe that a confounder is important to include, keep it in the model even if it&#039;s non-significant).  

STEP 15: Create tables for results from *all* analyses (including the models you decide to can in favor for another one) and what regression model was used.  This is much more transparent than simply producing your final model.

There&#039;s more &quot;post-analysis&quot; stuff that should be done, but really Steps 1-15 is a pretty thorough. 

7. Publish:
I can&#039;t stress this enough.  This is a long-term goal for sure, especially as you will likely end up with multiple papers!  But once you think you&#039;ve got the data set-up and analyses down, you need to write it up and send it on for peer-review.  Peer-review is not perfect for sure, but it is the best measure we have for good science.  It gives credibility to your efforts.  Besides, you *do* want to be acknowledged for your efforts, right?  By publishing in a peer-reviewed journal, you&#039;re more likely to gain more widely publicized attention, which I think should be the goal of most epidemiological studies; we want to improve public health through informing not only our peers, but also the public.  

As a last note, I know this is a huge undertaking, but these are steps to a thorough analysis.  I have no doubt you&#039;re capable of tackling it.  

Best wishes.

PS. I&#039;m sure you already planned to do this, but make all of the above available.  With your large readership you can make this a collaborative effort.</description>
		<content:encoded><![CDATA[<p>Denise conducted the analysis incorrectly and came to a heavily flawed conclusion.  Denise is a great writer however this is about science. Here is the correct procedure from a cancer epidemiologist.</p>
<p>Hi Denise,</p>
<p>As promised, I&#8217;m posting my response to your email [yes, she emailed me] on your site.  You asked that I provide some tips on where to start and how to proceed.  BTW, you mentioned &#8220;epidemiology secrets&#8221; and I just want to say: no &#8220;secrets&#8221;!!  Epidemiology is just critical thinking, but with numbers.  It&#8217;s no different from many other disciplines.  Maybe some time you can help me with writing (scientists are generally terrible writers, hehe).  </p>
<p>Note: I&#8217;ve included some comments on what went wrong and how it can be corrected merely for demonstrative purposes &#8211; not at all malicious attacks, OK?  This is how we all learn after all.  In caps, I will highlight steps in the action plan for you.</p>
<p>STEP 0: Do a literature search.  I find it helpful to keep an excel spreadsheet with columns for author, title, journal, year, summary of paper, strengths of the study, weaknesses, and concluding remarks.  This is essential, as one shouldn&#8217;t just blindly go into an analysis without having at least some background information on the subject matter.  No need to be an expert, but good to know what&#8217;s already out there, and what needs to be done.  </p>
<p>1. Correlations:<br />
For this discussion, the outcome will be colorectal cancer, since you used it on your post.  Similarly, the primary exposure of interest will be total cholesterol.  By by basing your conclusions on uncorrected correlations alone, you&#8217;ve made a huge leap that doesn&#8217;t have much ground to stand on.  The simple correlations are biased, as you yourself pointed out when evaluating total cholesterol, schistomiasis, and colorectal cancer.  As such, if you don&#8217;t adjust for potential confounders via multiple regression, the association you observe is biased.  We almost always need to adjust for confounders, and this is very true in your case.</p>
<p>STEP 1: It&#8217;s a good habit to evaluate the correlations between all exposures and also between all exposures and the outcome <i>at the individual level</i>.  So, for *every* analysis you plan on doing, run create scatterplots for every X against X and every X against Y, using the *individual* data (where possible), and provide the correlation + 95% confidence interval for each.</p>
<p>STEP 2: Create histograms for every exposure of that is categoric and density plots (or you can create histograms with very narrow bars) for every exposure that is continuous.  This will tell you how the variables are distributed and what the appropriate summary statistics for them would be.  For example, if total cholesterol is not normally distributed (follow a bell curve) then *median* total cholesterol might be a better summary statistic then *mean* total cholesterol (good to know when you present descriptive statistics of the data you&#8217;re using).  Sometimes it&#8217;s useful to present different stats for a single variable.</p>
<p>2. Individual data vs. aggregated data:<br />
You stated you didn&#8217;t see much curvature, but keep in mind that you were presenting with aggregated data (eg. average total cholesterol for all individuals) instead of including individual-level data (the exposure and outcome for a single individual).  Consequently, there was a big loss in information, and you can&#8217;t make accurate decisions on how to model your data if you plot aggregated data.  Related to this, your analysis was ecologic (used aggregated/grouped data) but you made individual-level conclusions when you used the term &#8220;risk factor.&#8221;  This is referred to as an ecologic fallacy &#8211; and it&#8217;s just that.  A fallacy.  For example, all we can say based on your cholesterol-colorectal cancer example (the one that doesn&#8217;t account for schistomiasis) is that the counties with higher mean total cholesterol tend to have higher incidence rates of colorectal cancer.  We can&#8217;t make the leap to calling cholesterol a *risk factor* for colorectal cancer.</p>
<p>STEP 3: Don&#8217;t aggregate your data in  your analysis.  Why?  You lose A LOT of information when you aggregate data and you can bias your results.  So keep that data at the individual-level.  For descriptive tables, by all means, aggregated data is necessary for obvious reasons.  But in your analysis, individual-level data when you&#8217;ve got it is essential.  </p>
<p>3. The right regression model:<br />
One of your outcomes was incidence rates of colorectal cancer.  When you do your analysis with individual-level data, with incidence rates of colorectal cancer as your outcome, linear regression = WRONG model.  Make sure you know which models to use and when.  To start &#8211; when modeling &#8220;raw&#8221; rates (case counts and person time), we almost always use Poisson regression, and often we need to account for overdispersion as well.  Get to know some of the other common regression models as well.    </p>
<p>STEP 4: Write out all of the primary exposures of interest you want to investigate and the corresponding outcome of interest and how you&#8217;re setting up your outcome variable (are you interested in colorectal cancer *incidence rates*, *prevalence*, a simple yes/no the person has colorectal cancer?)</p>
<p>STEP 5: Write out what the appropriate regression model would be for the different analyses you plan to conduct.  </p>
<p>4. Confounders:<br />
These are factors that are related to the exposure and the outcome of interest such that *not* adjusting for them will produce a biased association between exposure and outcome.  As you saw, schistomiasis might be a confounder.  And in fact, county might be too &#8211; and is actually upstream of schistomiasis in some sense, right?  Two confounders that almost *always* must be included in a model are AGE and SEX (provided your analysis isn&#8217;t restricted to one sex).  This is especially true for chronic disease (eg. cardiovascular disease and cancer).  In this particular case, body mass index (BMI) would be very important to include as well.  County may also be important.  </p>
<p>STEP 6: For every analysis you do, write out all potential confounders you can think of and why.  You know the data better than I do as you&#8217;ve worked with it extensively.  And, from STEP 0, you&#8217;ll know your context.  </p>
<p>STEP 7: Write out *how* the confounders are related to the exposure and outcome.  Is the confounder protective (i.e. decrease risk) for the outcome?  Or is it a risk factor?  How is it associated with the primary exposure of interest?  This is where those scatterplots in STEP 1 come in handy!  The purpose of this is to give you an idea of *how* an observed association might be biased if you *don&#8217;t* adjust for certain confounders.  It is tedious, but thorough and, like STEP 6, will allow you to approach your analyses with more contextual background.</p>
<p>5.  &#8220;Cleaning&#8221; and &#8220;recoding&#8221; your data:<br />
Raw data is not *in and of itself* a bad thing.  It is simply the data in its original form.  But in order to be useful for analysis we often need to &#8220;clean&#8221; it and &#8220;recode&#8221; it.  When I say &#8220;clean&#8221; it, I mean setting up the *dataset* that is free (to the greatest extent possible) of unnecessary data (for example, if you&#8217;re interested in ovarian cancer, you wouldn&#8217;t include men), or mistakes (for example, if an individual in the data was coded as being a man with ovarian cancer, this is clearly wrong).  In this case, you might either omit it since you don&#8217;t have a way to check which is correct or, based on other data for that individual choose to change &#8220;man&#8221; to &#8220;woman&#8221; or &#8220;ovarian cancer&#8221; to &#8220;no ovarian cancer.&#8221;  &#8220;Recoding&#8221; means setting up the *variables* to be useful.  For example, we might recode BMI in categories of underweight, normal, overweight, and obese rather than leave it as continuous.  Some variables may already be categoric, if the corresponding data were collected that way.</p>
<p>STEP 8: Clean your data.  You will likely need to set up multiple datasets.  </p>
<p>STEP 9: Write out *how* you&#8217;ve cleaned your data.  (This is good record keeping.)</p>
<p>STEP 10: Recode your data.  This might include combining variables too.  </p>
<p>STEP 11: Create a &#8220;data dictionary&#8221; similar to the one on the Oxford site.  But in addition, include a description of how you&#8217;ve coded your data (eg. 1=underweight, 2=normal, 3=overweight, 4=obese).  Again, good for record keeping, but also &#8220;keeps you honest&#8221; so others know how you set up your data.  This will often be apparent when you present your results, but not always.  It&#8217;s a good habit to keep track of this, in any event. </p>
<p>STEP 12: Replot all newly *categorized* variables against the outcome(s) of interest.  Why?  Because the categorized data may reveal non-linear relationships with the outcome (in fact, this is a strength of categorizing data &#8211; that we can account for some non-linear relationships).  For example, underweight might be a risk for something, whereas normal BMI is protective, while overweight and obese are a risk (&#8220;U-shaped&#8221;).  </p>
<p>6. Exploration of your data through descriptive statistics:<br />
Almost all scientific papers start out with a &#8220;Table 1&#8243; which presents a description of the data.  It tells us things like What&#8217;s the % of women and men in our data, What is the proportion of people with and without the exposure and with and without the outcome?</p>
<p>STEP 13: Create descriptive tables of all relevant variables.  This includes your primary exposure of interest, confounders, and outcome.  Obviously, you will have different tables for each analysis as you&#8217;re interested in different primary exposures (cholesterol? meat? total caloric intake?) and outcomes (cardiovascular disease? colorectal cancer? bladder cancer?).  To save time, you might include all relevant exposures and confounders in rows, and cross-classify them with all outcomes of interest in columns.  </p>
<p>6.  Analysis:<br />
The fun part.  </p>
<p>STEP 14: Run your models.  Keep track of what you include in your models b/c oftentimes we will evaluate several models for each analysis depending on what&#8217;s called &#8220;fit statistics.&#8221;  Since you are familiar with p-values and I assume interpretation of beta coefficients, use these to help inform you of which variables to include in your final model *within the context of the analysis at hand* (this is key &#8211; if you have reason to believe that a confounder is important to include, keep it in the model even if it&#8217;s non-significant).  </p>
<p>STEP 15: Create tables for results from *all* analyses (including the models you decide to can in favor for another one) and what regression model was used.  This is much more transparent than simply producing your final model.</p>
<p>There&#8217;s more &#8220;post-analysis&#8221; stuff that should be done, but really Steps 1-15 is a pretty thorough. </p>
<p>7. Publish:<br />
I can&#8217;t stress this enough.  This is a long-term goal for sure, especially as you will likely end up with multiple papers!  But once you think you&#8217;ve got the data set-up and analyses down, you need to write it up and send it on for peer-review.  Peer-review is not perfect for sure, but it is the best measure we have for good science.  It gives credibility to your efforts.  Besides, you *do* want to be acknowledged for your efforts, right?  By publishing in a peer-reviewed journal, you&#8217;re more likely to gain more widely publicized attention, which I think should be the goal of most epidemiological studies; we want to improve public health through informing not only our peers, but also the public.  </p>
<p>As a last note, I know this is a huge undertaking, but these are steps to a thorough analysis.  I have no doubt you&#8217;re capable of tackling it.  </p>
<p>Best wishes.</p>
<p>PS. I&#8217;m sure you already planned to do this, but make all of the above available.  With your large readership you can make this a collaborative effort.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Destroying China (the Study that Is) by Alternative Medicine Expertise from Aspire Natural Health &#187; Blog Archive &#187; Destroying China (the Study that Is) &#124; myezpharmacy</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/310/comment-page-1#comment-201</link>
		<dc:creator>Alternative Medicine Expertise from Aspire Natural Health &#187; Blog Archive &#187; Destroying China (the Study that Is) &#124; myezpharmacy</dc:creator>
		<pubDate>Sat, 10 Jul 2010 17:43:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/310#comment-201</guid>
		<description>[...] Alternative Medicine Expertise from Aspire Natural Health &#187; Blog Archive &#187; Destroying Ch... [...]</description>
		<content:encoded><![CDATA[<p>[...] Alternative Medicine Expertise from Aspire Natural Health &raquo; Blog Archive &raquo; Destroying Ch&#8230; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Want to be healthier? Live in a third world country on the equator by Fermina Boulerice</title>
		<link>http://www.aspirenaturalhealth.com/blog/archives/300/comment-page-1#comment-200</link>
		<dc:creator>Fermina Boulerice</dc:creator>
		<pubDate>Sat, 10 Jul 2010 17:04:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.aspirenaturalhealth.com/blog/archives/300#comment-200</guid>
		<description>nice that will be healthy :)</description>
		<content:encoded><![CDATA[<p>nice that will be healthy <img src='http://www.aspirenaturalhealth.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>
