Comments on natural language processing blog: "Structured prediction is *not* RL"

Simon Lacoste-Julien (2017-08-06):
Hi Hal, an interesting blog post!

You might be interested in our recent paper, which adapts the LOLS approach to training RNNs:
"SEARNN: Training RNNs with Global-Local Losses"
Rémi Leblond, Jean-Baptiste Alayrac, Anton Osokin, Simon Lacoste-Julien
arXiv:1706.04499 (https://arxiv.org/abs/1706.04499)

It will be presented at the ICML 2017 Workshop on Deep Structured Prediction (https://deepstruct.github.io/ICML17/ac/), in case you are around.

Chris Brew (2017-04-07):
There is a lot of psychological evidence about what happens during sentence processing, including evidence that there is some kind of representation of non-determinism; for example, Marslen-Wilson's Cohort Model (https://en.wikipedia.org/wiki/Cohort_model). But models like this are largely silent on the details of the representations used, which computational models cannot be.

hal (2017-04-03):
@Chris: thanks! I don't know enough about how the brain works to say something interesting, but this is cool to think about. In eye-tracking, people do look back, for instance when they get garden-pathed, which isn't necessarily maintaining multiple hypotheses, but it is maintaining some sort of uncertainty. (Like Jinho Choi's selective branching.)

@Dipendra: agreed, good point. CPI also assumes you can reset (which is one reason we chose it), which is a totally fine assumption in SP. Even if you do ten passes over the data, though, you're still only trying a very, very small subset of the possible trajectories. But this definitely goes to the question of: would I rather expand more now, or do this sentence again in a few hours?

Anonymous (2017-04-03):
I wonder whether the argument that "RL does not build out the whole search tree" is true in practice. Most people perform epochs over the dataset and therefore reset the world to a start state. The agent can then execute a trajectory different from what it tried earlier, and keep doing so to eventually explore the entire search space. In fact, papers such as Trust Region Policy Optimization (Figure 1, https://arxiv.org/pdf/1502.05477.pdf) perform multiple rollouts from the same state (to be fair, they do mention that it works in simulation). Of course this won't hold in the real world: you cannot remake a glass jar that a robot has accidentally broken while exploring, and you cannot reset to a start state that easily.

Chris Brew (2017-04-03):
One of the advantages of RL is that it does NOT build out the whole search tree, so it does not require detailed symbolic representations of multiple alternatives at the same time. Neither the brain's neural hardware nor the various fancy-dan deep learning networks comfortably accommodate the representation of detailed symbolic alternatives, as far as we know. So it is well worth pursuing RL and similar methods, in which the representation of prior context and current uncertainty is less necessarily cut-and-dried than in (say) CRFs.
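[Editorial sketch] The exchange above, about whether resetting to a start state each epoch lets an RL agent eventually cover the search tree, can be made concrete with a minimal toy simulation. Everything here (the horizon, the binary action space, the random policy) is a hypothetical illustration, not anything from the post or the TRPO paper:

```python
import random

# Hypothetical toy setup: an episode is a length-10 sequence of binary
# actions, so the full search tree has 2**10 = 1024 distinct trajectories.
HORIZON, N_ACTIONS = 10, 2

def rollout(rng):
    """One episode from the (reset) start state: a random trajectory."""
    return tuple(rng.randrange(N_ACTIONS) for _ in range(HORIZON))

rng = random.Random(0)
seen = set()
for epoch in range(10):      # "ten passes over the data"
    seen.add(rollout(rng))   # reset to the start state, try a trajectory

print(f"explored {len(seen)} of {N_ACTIONS**HORIZON} trajectories")
```

Running this shows both points at once: resets do let the agent sample fresh trajectories each epoch (the commenter's observation), yet after ten passes it has still visited at most 10 of 1024 paths, the "very small subset" Hal mentions, and the gap only widens as the horizon grows.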