<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: on using the data twice&#8230;</title>
	<atom:link href="http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/feed/" rel="self" type="application/rss+xml" />
	<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/</link>
	<description>an attempt at bloggin, from scratch...</description>
	<lastBuildDate>Sun, 19 May 2013 09:02:13 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: xi'an</title>
		<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/comment-page-1/#comment-13273</link>
		<dc:creator><![CDATA[xi'an]]></dc:creator>
		<pubDate>Mon, 16 Jan 2012 11:13:38 +0000</pubDate>
		<guid isPermaLink="false">http://xianblog.wordpress.com/?p=14187#comment-13273</guid>
		<description><![CDATA[Thanks for the precision! Even though I had handled a previous version of this paper, I did not remember the re-run part (and I have not re-checked the paper so far). Now, this is entering a grey area albeit far from the Mordor of &quot;using the data twice&quot; (to keep with the Tolkienian theme!). In a Bayesian analysis with different models, the global posterior distribution is the weighted sum of all the posteriors for the different models. Once a model is selected based on this global posterior, we do not rerun the chosen model since it already is part of the global model. I would thus say it&#039;s mostly kosher.]]></description>
		<content:encoded><![CDATA[<p>Thanks for the precision! Even though I had handled a previous version of this paper, I did not remember the re-run part (and I have not re-checked the paper so far). Now, this is entering a grey area albeit far from the Mordor of &#8220;using the data twice&#8221; (to keep with the Tolkienian theme!). In a Bayesian analysis with different models, the global posterior distribution is the weighted sum of all the posteriors for the different models. Once a model is selected based on this global posterior, we do not rerun the chosen model since it already is part of the global model. I would thus say it&#8217;s mostly kosher.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matthieu Authier</title>
		<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/comment-page-1/#comment-13272</link>
		<dc:creator><![CDATA[Matthieu Authier]]></dc:creator>
		<pubDate>Mon, 16 Jan 2012 10:47:44 +0000</pubDate>
		<guid isPermaLink="false">http://xianblog.wordpress.com/?p=14187#comment-13272</guid>
		<description><![CDATA[Hello,

thanks very much for the answer. I was asking about the horseshoe because a part of the Biometrika paper deals with thresholding, and conditional on the threshold dropping some variables. I was feeling that using the horseshoe prior and reporting posteriors from this model is enough. That is, we don&#039;t need to re-run a model with only the predictors that were selected before, since that would be using the data twice (but given my experience so far with the publication process in the field of ecology, this is probably what will be asked by reviewers).
Hence my question about the horseshoe. I feel that it could fall also in your category (e) in practice but really is fine on its own (d).
Or in Tolkienian prose, the horseshoe may be then some sort of &quot;One model to find them all and in the prior shrink them&quot;...
Sorry for bugging you with probably naïve questions and mille mercis d&#039;avoir pris le temps de répondre.
Cheers
Matthieu]]></description>
		<content:encoded><![CDATA[<p>Hello,</p>
<p>thanks very much for the answer. I was asking about the horseshoe because a part of the Biometrika paper deals with thresholding, and conditional on the threshold dropping some variables. I was feeling that using the horseshoe prior and reporting posteriors from this model is enough. That is, we don&#8217;t need to re-run a model with only the predictors that were selected before, since that would be using the data twice (but given my experience so far with the publication process in the field of ecology, this is probably what will be asked by reviewers).<br />
Hence my question about the horseshoe. I feel that it could fall also in your category (e) in practice but really is fine on its own (d).<br />
Or in Tolkienian prose, the horseshoe may be then some sort of &#8220;One model to find them all and in the prior shrink them&#8221;&#8230;<br />
Sorry for bugging you with probably naïve questions and mille mercis d&#8217;avoir pris le temps de répondre.<br />
Cheers<br />
Matthieu</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: xi'an</title>
		<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/comment-page-1/#comment-13241</link>
		<dc:creator><![CDATA[xi'an]]></dc:creator>
		<pubDate>Sat, 14 Jan 2012 11:20:03 +0000</pubDate>
		<guid isPermaLink="false">http://xianblog.wordpress.com/?p=14187#comment-13241</guid>
		<description><![CDATA[Matthieu: thanks for the comments. There are several issues:
(a) &quot;using the data twice&quot; is not related to Bayesian model selection per se in that we can run proper Bayesian model selection w/o using the data twice. It is also possible to &quot;use the data twice&quot; in non-Bayesian settings.
(a&#039;) by the way variable selection &lt;em&gt;is&lt;/em&gt; model choice [for me].
(b) Lasso is a penalised likelihood technique for variable comparison / nested model comparison. It has a vague Bayesian flavour that was precised by Park and Casella (2008).
(c) AIC, BIC, &amp;C, Lasso, are on principle free from the sin of &quot;using the data twice&quot;, being all of the likelihood ratio type. 
(c&#039;) this is less clear for DIC!
(d) Carvahlo, Scott and Polson have reanalysed the Bayesian lasso by picking &lt;em&gt;another prior&lt;/em&gt; on the penalty coefficients. This is strictly Bayesian, hence does not use the data twice.
(e) in practice, people use mixes of Bayesian and non-Bayesian techniques, like pluggin-hyperparameters, which end up using the data more than once...

Merci et à bientôt!]]></description>
		<content:encoded><![CDATA[<p>Matthieu: thanks for the comments. There are several issues:<br />
(a) &#8220;using the data twice&#8221; is not related to Bayesian model selection per se in that we can run proper Bayesian model selection w/o using the data twice. It is also possible to &#8220;use the data twice&#8221; in non-Bayesian settings.<br />
(a&#8217;) by the way variable selection <em>is</em> model choice [for me].<br />
(b) Lasso is a penalised likelihood technique for variable comparison / nested model comparison. It has a vague Bayesian flavour that was precised by Park and Casella (2008).<br />
(c) AIC, BIC, &amp;C, Lasso, are on principle free from the sin of &#8220;using the data twice&#8221;, being all of the likelihood ratio type.<br />
(c&#8217;) this is less clear for DIC!<br />
(d) Carvahlo, Scott and Polson have reanalysed the Bayesian lasso by picking <em>another prior</em> on the penalty coefficients. This is strictly Bayesian, hence does not use the data twice.<br />
(e) in practice, people use mixes of Bayesian and non-Bayesian techniques, like pluggin-hyperparameters, which end up using the data more than once&#8230;</p>
<p>Merci et à bientôt!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: xi'an</title>
		<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/comment-page-1/#comment-13236</link>
		<dc:creator><![CDATA[xi'an]]></dc:creator>
		<pubDate>Sat, 14 Jan 2012 07:53:06 +0000</pubDate>
		<guid isPermaLink="false">http://xianblog.wordpress.com/?p=14187#comment-13236</guid>
		<description><![CDATA[Thanks! Using a Bayesian approach where all unknowns are endowed with prior probabilities (for models) or prior distributions (for parameters), and where the errors are evaluated under a loss function, the data is only used once in the (global) posterior, which is then the only entity needed to conduct the decision process. A genuine Bayesian approach cannot therefore &quot;use the data twice&quot;. Only approximations to Bayesian procedures open the Pandora box of re-using the data, some of them in obviously detrimental fashions... Correcting for this can only be done in an un-Bayesian way, using asymptotics and indeed the sampling distribution.]]></description>
		<content:encoded><![CDATA[<p>Thanks! Using a Bayesian approach where all unknowns are endowed with prior probabilities (for models) or prior distributions (for parameters), and where the errors are evaluated under a loss function, the data is only used once in the (global) posterior, which is then the only entity needed to conduct the decision process. A genuine Bayesian approach cannot therefore &#8220;use the data twice&#8221;. Only approximations to Bayesian procedures open the Pandora box of re-using the data, some of them in obviously detrimental fashions&#8230; Correcting for this can only be done in an un-Bayesian way, using asymptotics and indeed the sampling distribution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mayod@vt.edu</title>
		<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/comment-page-1/#comment-13231</link>
		<dc:creator><![CDATA[mayod@vt.edu]]></dc:creator>
		<pubDate>Sat, 14 Jan 2012 02:34:46 +0000</pubDate>
		<guid isPermaLink="false">http://xianblog.wordpress.com/?p=14187#comment-13231</guid>
		<description><![CDATA[Rather than make it an &quot;all or nothing&quot; principle, one needs an epistemological criterion to distinguish pejorative from unpejorative double counting.  That is what I hope the &quot;severity principle&quot; achieves, at least for someone who cares to evaluate procedures according to how well or poorly specific errors are controlled/ruled out.  When double counting does warrant an adjustment, we appeal essentially to the sampling distribution.  But I&#039;m not sure what the Bayesian justification for caring is.  Unless one foists it on the prior, but I&#039;d still want to know the grounds, then the evidential relationship between data and hypothesis, for likelihoodists, should have nothing to do with looking at or using up the data---isn&#039;t that part of the Bayesian magic?]]></description>
		<content:encoded><![CDATA[<p>Rather than make it an &#8220;all or nothing&#8221; principle, one needs an epistemological criterion to distinguish pejorative from unpejorative double counting.  That is what I hope the &#8220;severity principle&#8221; achieves, at least for someone who cares to evaluate procedures according to how well or poorly specific errors are controlled/ruled out.  When double counting does warrant an adjustment, we appeal essentially to the sampling distribution.  But I&#8217;m not sure what the Bayesian justification for caring is.  Unless one foists it on the prior, but I&#8217;d still want to know the grounds, then the evidential relationship between data and hypothesis, for likelihoodists, should have nothing to do with looking at or using up the data&#8212;isn&#8217;t that part of the Bayesian magic?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Authier Matthieu</title>
		<link>http://xianblog.wordpress.com/2012/01/13/on-using-the-data-twice/comment-page-1/#comment-13216</link>
		<dc:creator><![CDATA[Authier Matthieu]]></dc:creator>
		<pubDate>Fri, 13 Jan 2012 11:45:07 +0000</pubDate>
		<guid isPermaLink="false">http://xianblog.wordpress.com/?p=14187#comment-13216</guid>
		<description><![CDATA[Hello,
as an ecologist, I am very often confronted to the issue of model selection. Most of the time, this is done by some automated procedure that computes the AIC of all possible variable combination. I suspect then what we&#039;re doing more variable selection than model selection. Anyway, the process is tedious and looks a bit like an overkill : most of the models considered have not been thought about (after all, R does this on its own without thinking) and some are just not plausible. 
I read recently the Biometrika paper of Carvahlo, Scott and Polson about the horsehoe estimator for sparse signals. I would be very curious to know your thoughts about this procedure. I have used the horseshoe in a logistic regression about clutch size in a seabird. It was very useful in shrinking some spurious signals (evaluated from posterior predictive checks... I guess this is using the data twice...).
My more focus question is then : do you think the horseshoe prior can solve this &quot;using the data twice&quot; problem? 
Yours sincerely
Matthieu]]></description>
		<content:encoded><![CDATA[<p>Hello,<br />
as an ecologist, I am very often confronted to the issue of model selection. Most of the time, this is done by some automated procedure that computes the AIC of all possible variable combination. I suspect then what we&#8217;re doing more variable selection than model selection. Anyway, the process is tedious and looks a bit like an overkill : most of the models considered have not been thought about (after all, R does this on its own without thinking) and some are just not plausible.<br />
I read recently the Biometrika paper of Carvahlo, Scott and Polson about the horsehoe estimator for sparse signals. I would be very curious to know your thoughts about this procedure. I have used the horseshoe in a logistic regression about clutch size in a seabird. It was very useful in shrinking some spurious signals (evaluated from posterior predictive checks&#8230; I guess this is using the data twice&#8230;).<br />
My more focus question is then : do you think the horseshoe prior can solve this &#8220;using the data twice&#8221; problem?<br />
Yours sincerely<br />
Matthieu</p>
]]></content:encoded>
	</item>
</channel>
</rss>
