my biased thoughts on the fields of natural language processing (NLP), computational linguistics (CL) and related topics (machine learning, math, funding, etc.)
30 May 2009
Semi-supervised or Semi-unsupervised? (SSL-NLP invited papers)
Kevin Duh not-so-recently asked me to write a "position piece" for the workshop he's co-organizing on semi-supervised learning in NLP. I not-so-recently agreed. And recently I actually wrote said position piece. You can also find a link off the workshop page. I hope people recognize that it's intentionally a bit tongue-in-cheek. If you want to discuss this stuff or related things in general, come to the panel at NAACL from 4:25 to 5:25 on 4 June at the workshop! You can read the paper for more information, but my basic point is that we can typically divide semi-supervised approached into one lump (semi-supervised) that work reasonably well with only labeled data and are just improved with unlabeled data and one lump (semi-unsupervised) that work reasonably well with only unlabeled data and are just improved with labeled data. The former are typically encode lots of prior information; the latter do not. Let's combine! (Okay, my claim is more nuanced than that, but that's the high-order bit.)
I was tempted to come to that workshop but I've signed up for the two semantics ones instead. Looks like I have a full week of semantics ahead, actually.
ReplyDeleteRachel ~ writing from the Ballroom in UMC!
I don't see the generative/discriminative distinction as such a dichotomy. One attractive feature of Bayesian inference is that once you've set up a model, arbitrary subsets of the random variables may be supervised, which amounts to simply fixing their outcomes.
ReplyDeleteWith hierarchical Bayesian GLMs, one typically models (a) the hierarchical mean and variance parameters, (b) the regression coefficients, and (c) the predicted outcomes including noise. One cannot by definition model (d) the highest level priors. Typically, one does not model (e) the item level predictors.
But really this choice is arbitrary. You can choose not to model the hierarchical variance parameters or you can set the hiearchical prior means to zero, for instance. For missing data imputation, you see the predictors themselves being modeled to fill in missing data (often with an ad-hoc regression based on the other predictors).
Very nice information. Thanks for this. Please come visit my site Fortworth Yellow Page Business Directory when you got time.
ReplyDeleteVery nice information. Thanks for this. Please come visit my site Local Business Directory Of Fortworth U.S.A. when you got time.
ReplyDeleteI enjoyed reading your work! GREAT post! I looked around for this… but I found you! Anyway, would you mind if I threw up a backlink from my site? Please come visit my site Phone Directory Of Columbus City Ohio OH State when you got time.
ReplyDeleteI enjoyed reading your work! GREAT post! I looked around for this… but I found you! Anyway, would you mind if I threw up a backlink from my site? Please come visit my site Business Directory Columbus when you got time.
ReplyDeletethe predicted outcomes including noise. One cannot by definition model (d) the highest level priors. Typically, one does not model (e) the item level predictors.
ReplyDeleteAssignments | Theses | Essays
arbitrary subsets of the random variables may be supervised, which amounts to simply fixing their outcomes.
ReplyDeleteA Level Coursework | Dissertations
Hey,
ReplyDeleteVery useful piece of information indeed! Thanks for sharing and keep posting the good work.
Business Plan Presentation
Latest film "Nightmare on Elm Street" Plot
ReplyDelete"The new Nightmare on Elm Street" played by the screenwriter Wesley Stricker, will follow the original director Wes Craven's Queen's routine, to stimulate a bloody killer scenes big screen again. According to reports, film the story will be a nightmare killer Freddy is how to burn the first incarnation of devils as the main history.
We look forward to this wonderful movie please right..............................................
New Nightmare on Elm Street "will be April 30, 2010 capture of the North American cinemas.
i would definitely go with the supervised. I think you have to go with this in the future.
ReplyDeletepalm beach cosmetic dentist