{"id":136,"date":"2023-12-16T00:02:26","date_gmt":"2023-12-16T00:02:26","guid":{"rendered":"https:\/\/wp.lancs.ac.uk\/colab\/?p=136"},"modified":"2024-12-16T00:04:31","modified_gmt":"2024-12-16T00:04:31","slug":"information-guided-planning-an-online-approach-for-partially-observable-problems","status":"publish","type":"post","link":"https:\/\/wp.lancs.ac.uk\/colab\/2023\/12\/16\/information-guided-planning-an-online-approach-for-partially-observable-problems\/","title":{"rendered":"Information-guided Planning: An Online Approach for Partially Observable Problems"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/ibpomcp-1024x382.png\" alt=\"\" width=\"676\" height=\"252\" class=\"aligncenter size-large wp-image-137\" srcset=\"https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/ibpomcp-1024x382.png 1024w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/ibpomcp-300x112.png 300w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/ibpomcp-768x287.png 768w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/ibpomcp-676x252.png 676w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/ibpomcp.png 1350w\" sizes=\"auto, (max-width: 676px) 100vw, 676px\" \/><\/p>\n<p>Our work &#8220;Information-guided Planning: An Online Approach for Partially Observable Problems&#8221; was presented at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023). The work integrates entropy into the decision-making process of the Monte Carlo simulations of an on-line planner, improving the agent&#8217;s performance, especially in scenarios with sparse rewards. The paper is <a href=\"https:\/\/papers.nips.cc\/paper_files\/paper\/2023\/file\/da5498f88193ff61f0daea1940b819da-Paper-Conference.pdf\">freely available<\/a>. Our source code is also available in the <a href=\"https:\/\/github.com\/lsmcolab\/ib-pomcp\/\">paper&#8217;s GitHub<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our work &#8220;Information-guided Planning: An Online Approach for Partially Observable Problems&#8221; was presented at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023). The work integrates entropy into the decision-making process of the Monte Carlo simulations of an on-line planner, improving the agent&#8217;s performance, especially in scenarios with sparse rewards. The paper is freely [&hellip;]<\/p>\n","protected":false},"author":1432,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3],"tags":[],"class_list":["post-136","post","type-post","status-publish","format-standard","hentry","category-news"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts\/136","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/users\/1432"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/comments?post=136"}],"version-history":[{"count":1,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts\/136\/revisions"}],"predecessor-version":[{"id":138,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts\/136\/revisions\/138"}],"wp:attachment":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/media?parent=136"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/categories?post=136"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/tags?post=136"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}