Slices and crumbs [arXiv:1011.4722]

An interesting note was arXived a few days ago by Madeleine Thompson and Radford Neal. Besides the nice touch of mixing crumbs and slices, the neat idea is to have multiple-try proposals for simulating within a slice and to decrease the dimension of the simulation space at each try. This dimension reduction is achieved via the construction of an orthogonal basis based on the gradients of the log density at previously rejected proposals,

$\mathbf{J}=(\nabla\log f(x_1),\ldots,\nabla\log f(x_k))$

until all dimensions are exhausted, in which case the scale of the Gaussian proposal is reduced. (The paper comes with R and C code.) Provided the gradient can be computed (or at least approximated), this is a fairly general method (even though I have not tested it and so cannot say how much calibration it requires). An interesting point is that, contrary to the delayed-rejection method of Antonietta Mira and co-authors, the repeated proposals do not complicate the slice acceptance probability. I am less convinced by the authors’ conclusion that the method compares with adaptive Metropolis techniques, in the sense that the “shrinking rank” method forgets about past experience as it starts from scratch at each iteration: it is thus not really learning… (Now, in terms of performance, this may be the case!)
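To make the mechanism more concrete, here is a minimal Python sketch of one such update, written from the description above and not taken from the authors' R or C code. It is a simplification: a new direction is added after every rejection until only one dimension is left, which may differ from the rule used in the paper, and the names `project` and `shrinking_rank_step` as well as the defaults `sigma_c=1.0` and `theta=0.95` are assumptions made for illustration. The toy target is a correlated bivariate Gaussian.

```python
import numpy as np

def project(J, v):
    """Project v onto the orthogonal complement of the (orthonormal) columns of J."""
    return v if J.shape[1] == 0 else v - J @ (J.T @ v)

def shrinking_rank_step(x0, log_f, grad_log_f, rng, sigma_c=1.0, theta=0.95):
    """One slice-sampling update with Gaussian crumbs and a shrinking proposal space.

    sigma_c (initial crumb scale) and theta (shrink factor) are illustrative defaults.
    """
    p = x0.size
    level = log_f(x0) - rng.exponential()   # log-height defining the slice
    J = np.zeros((p, 0))                    # directions removed so far
    sigma = sigma_c
    prec_sum = 0.0                          # running sum of sigma_i^-2
    weighted_crumbs = np.zeros(p)           # running sum of sigma_i^-2 * crumb_i
    while True:
        # Crumb: Gaussian offset from x0, restricted to the current subspace.
        crumb = project(J, sigma * rng.standard_normal(p))
        prec_sum += sigma ** -2
        weighted_crumbs += crumb * sigma ** -2
        # Proposal: drawn from the Gaussian implied by all crumbs so far,
        # again restricted to the current subspace.
        mean = weighted_crumbs / prec_sum
        sd = prec_sum ** -0.5
        x = x0 + project(J, mean + sd * rng.standard_normal(p))
        if log_f(x) >= level:
            return x                        # inside the slice: accept
        # Rejected: remove the (projected) gradient direction if possible,
        # otherwise reduce the crumb scale.
        g_star = project(J, grad_log_f(x))
        norm = np.linalg.norm(g_star)
        if J.shape[1] < p - 1 and norm > 1e-12:
            J = np.column_stack([J, g_star / norm])
        else:
            sigma *= theta

# Toy target (an assumption for the demo): zero-mean Gaussian with correlation 0.8.
cov = np.array([[1.0, 0.8], [0.8, 1.0]])
prec = np.linalg.inv(cov)

def log_f(x):
    return -0.5 * x @ prec @ x

def grad_log_f(x):
    return -prec @ x

rng = np.random.default_rng(1)
x = np.zeros(2)
draws = np.empty((3000, 2))
for t in range(draws.shape[0]):
    x = shrinking_rank_step(x, log_f, grad_log_f, rng)
    draws[t] = x
```

Comparing `draws.mean(axis=0)` and `np.cov(draws.T)` with the target mean and covariance gives a quick sanity check. Note that nothing is carried over from one call of `shrinking_rank_step` to the next, which is the "starting from scratch" feature discussed above.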

Computing evidence

The book Random effects and latent variable model selection, edited by David Dunson in 2008 as a Springer Lecture Note, contains several chapters dealing with evidence approximation in mixed effect models. (Incidentally, I would be interested in the story behind the Lecture Note as I found no explanation on the back cover or in the preface. Some chapters but not all refer to a SAMSI workshop on model uncertainty…) The final chapter, written by Joyee Ghosh and David Dunson (similar to a corresponding paper in JCGS), contains in particular the interesting identity that the Bayes factor of model h-1 against model h can be unbiasedly approximated by (the average of the terms)

$\dfrac{f(x|\theta_{i,h},\mathfrak{M}=h-1)}{f(x|\theta_{i,h},\mathfrak{M}=h)}$

when

• $\mathfrak{M}$ is the model index,
• the $\theta_{i,h}$’s are simulated from the posterior under model h,
• the model $\mathfrak{M}=h-1$ only considers the h-1 first components of $\theta_{i,h}$,
• the prior under model h-1 is the projection of the prior under model h. (Note that this marginalisation is not the projection used in Bayesian Core.)
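To see the identity at work, here is a small Python sketch on a toy example of my own choosing (not taken from the chapter): a Gaussian linear regression with unit noise variance and independent N(0, tau2) priors on the coefficients, where model h-1 drops the last coefficient. The prior under model h-1 is then the marginal of the prior under model h, and both evidences are available in closed form for comparison. All function names and numerical settings are assumptions made for illustration.

```python
import numpy as np

def log_lik(y, X, theta):
    """Gaussian log-likelihood (unit noise variance), one value per row of theta."""
    resid = y[None, :] - theta @ X.T
    return -0.5 * np.sum(resid ** 2, axis=1) - 0.5 * y.size * np.log(2 * np.pi)

def log_evidence(y, X, tau2):
    """Exact log marginal likelihood: y ~ N(0, I + tau2 * X X^T)."""
    n = y.size
    cov = np.eye(n) + tau2 * X @ X.T
    _, logdet = np.linalg.slogdet(cov)
    return -0.5 * (n * np.log(2 * np.pi) + logdet + y @ np.linalg.solve(cov, y))

def bayes_factor_estimate(y, X, tau2, n_draws, rng):
    """Average of f(x | theta, M=h-1) / f(x | theta, M=h) over posterior draws under model h."""
    h = X.shape[1]
    V = np.linalg.inv(X.T @ X + np.eye(h) / tau2)    # posterior covariance under model h
    m = V @ X.T @ y                                  # posterior mean under model h
    theta = rng.multivariate_normal(m, V, size=n_draws)
    # Model h-1 only uses the first h-1 components of each draw.
    log_ratio = log_lik(y, X[:, :h - 1], theta[:, :h - 1]) - log_lik(y, X, theta)
    return np.mean(np.exp(log_ratio))

rng = np.random.default_rng(0)
n, h, tau2 = 30, 3, 0.1
X, _ = np.linalg.qr(rng.normal(size=(n, h)))         # orthonormal columns, for simplicity
y = X @ np.array([0.5, -0.3, 0.4]) + rng.normal(size=n)

bf_hat = bayes_factor_estimate(y, X, tau2, 100_000, rng)
bf_exact = np.exp(log_evidence(y, X[:, :h - 1], tau2) - log_evidence(y, X, tau2))
print(bf_hat, bf_exact)
```

In this toy setting the two numbers should agree up to Monte Carlo error. The ratio being averaged can be heavy-tailed when the data are much more informative than the prior about the dropped component, so the average may be unstable in less favourable cases; the small prior variance used here was chosen to avoid that.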

Puhdistus

I read Purge (Puhdistus in Finnish) by Sofi Oksanen (in French) this summer when flying to San Francisco from Vancouver. This is a strong and gripping novel, as others have noticed before me. It takes place in post-communist Estonia, where the widow of a (very low-ranking) member of the communist nomenklatura is forced into considering her past choices and the lies she told herself and others when her grand-niece pops in, pursued by Russian-mafia-style gangsters who had enslaved her into a cruel prostitution scheme in Germany… This may sound like a cheap plot, but the slow unravelling of the old woman’s (horrific) deeds and of the compromises she was led to endure makes for a much deeper read. There is also a historical level about Estonia being as ruthlessly occupied by Soviet soldiers as by Nazis, and about the hopeless fight of the local partisans followed by massive deportations to the Russian Far East. The book is thus multifaceted and the end, although predictable, is a nice tale of redemption for the old Aliide Truu, who would otherwise appear as a remorseless criminal… An impressive and recommended tour-de-force! (Note that, despite some misguided criticisms found on Amazon, this is not a thriller!)

First snow

