## 06 November 2009

### Getting Started In: Bayesian NLP

This isn't so much a post in the "GSI" series as a pointer to two links that recently appeared. Kevin Knight and Philip Resnik have both just released tutorials for Bayesian NLP. They're both excellent, and almost entirely non-redundant. I highly recommend reading both. And I thank Kevin and Philip from the bottom of my heart, since I'd been toying with the idea of writing such a thing (for a few years!) and they've saved me the effort. I'd probably start with Kevin's and then move on to Philip's (which is more technically meaty), but either order is really fine.

Thanks again to both of them. (And if you haven't read Kevin's previous workbook on SMT -- which promises free beer! -- I highly recommend that, too.)

Bob Carpenter said...

I've been prepping my own tutorial. For practice, I wrote a series of blog entries, starting with What is Bayesian Statistical Inference?, walking through point estimates for a few posts [using baseball batting ability as a simple binomial example], then concluding with a full hierarchical beta-binomial.
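To make the progression from point estimates to a beta-binomial model concrete, here is a minimal sketch. It is not code from those blog entries: the function name, the batting numbers, and the Beta(a, b) prior parameters are all made up for illustration. It assumes hits ~ Binomial(at_bats, theta) with a conjugate theta ~ Beta(a, b) prior, so the posterior is Beta(a + hits, b + at_bats - hits).

```python
def beta_binomial_estimates(hits, at_bats, a=1.0, b=1.0):
    """Point estimates of a success probability theta.

    Assumed model (illustrative, not taken from the post):
        hits  ~ Binomial(at_bats, theta)
        theta ~ Beta(a, b)
    By conjugacy the posterior is Beta(a + hits, b + at_bats - hits).
    """
    if at_bats <= 0 or not 0 <= hits <= at_bats:
        raise ValueError("need at_bats > 0 and 0 <= hits <= at_bats")

    post_a = a + hits
    post_b = b + at_bats - hits

    # Maximum-likelihood estimate ignores the prior entirely.
    mle = hits / at_bats

    # Posterior mean of a Beta(post_a, post_b) distribution.
    posterior_mean = post_a / (post_a + post_b)

    # Posterior mode (MAP); the closed form holds only when both
    # posterior parameters exceed 1.
    if post_a > 1 and post_b > 1:
        map_estimate = (post_a - 1) / (post_a + post_b - 2)
    else:
        map_estimate = None

    return {"mle": mle, "map": map_estimate, "posterior_mean": posterior_mean}


# Hypothetical player: 3 hits in 10 at-bats.
flat = beta_binomial_estimates(3, 10)                    # uniform Beta(1, 1) prior
informed = beta_binomial_estimates(3, 10, a=81, b=219)   # made-up informative prior
```

With the uniform prior the MAP estimate coincides with the maximum-likelihood estimate, while the informative prior pulls both the MAP and the posterior mean toward the prior mean a / (a + b). A full hierarchical model would go further and estimate a and b from many players rather than fixing them by hand.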

One of the hardest things I had to overcome in learning stats is sloppy random variable notation. I talk about this in a related post on what's wrong with probability notation.

Resnik and Hardisty need to keep event notation, Pr(X=x), separate from density notation, p(x). What's confusing to the novice audiences for whom they're writing is that for a discrete random variable X, Pr(X=x) = p(x). The problem arises for continuous random variables, where Pr(X=x) = 0, because you're integrating over a single point (a set of measure zero) rather than an interval of non-zero width. When they write Pr(\pi) in equation (4) for the first MAP example, the value is zero, because p(\pi) is a continuous density. They really should write p(\pi) instead.
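A small numerical sketch of the distinction, under assumptions that are mine rather than the tutorial's: take a Beta(2, 2) variable on [0, 1], chosen only because its density and CDF have simple closed forms. The probability of an interval around a point shrinks to zero with the interval's width, while the density at that point stays finite and non-zero.

```python
def density(x):
    """Density p(x) = 6x(1 - x) of a Beta(2, 2) variable on [0, 1]."""
    return 6.0 * x * (1.0 - x) if 0.0 <= x <= 1.0 else 0.0


def cdf(x):
    """CDF of Beta(2, 2): the integral of the density from 0 to x."""
    x = min(max(x, 0.0), 1.0)  # clamp to the support
    return 3.0 * x ** 2 - 2.0 * x ** 3


def interval_probability(center, width):
    """Pr(center - width/2 <= X <= center + width/2)."""
    return cdf(center + width / 2.0) - cdf(center - width / 2.0)


point = 0.5
for width in (0.1, 0.01, 0.001, 0.0):
    prob = interval_probability(point, width)
    # For width > 0, prob / width approaches the density at the point.
    ratio = prob / width if width > 0 else float("nan")
    print(f"width={width:<6} Pr(interval)={prob:.6g}  Pr/width={ratio:.6g}")

print(f"density at {point}: {density(point)}")
```

At width 0 the event is exactly X = 0.5 and its probability is zero, which is why Pr(\pi) is the wrong thing to maximize in a MAP estimate; the quantity that carries information is the density p(\pi).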

Philip said...

Thanks, Hal, for posting this, and thanks, Bob, for your very reasonable point regarding probability notation. I'm collecting this and other comments for the next version.

Hi. I'm working as a consultant and one of my clients wants to make sense of his text data. Can you suggest which products are available that would help me analyze his data? Also, how long does it take to create a program from open source? Does it require a highly skilled coder with a good understanding of the math involved, or can an average Java coder do it? I'm not exactly a computer science guy, hence the confusion. It would be great if you could help me out with this. Thank you.

Nikhil Gupta
nikhil_gupta_iitd@yahoo.co.in

