{"id":126,"date":"2023-02-14T23:30:51","date_gmt":"2023-02-14T23:30:51","guid":{"rendered":"https:\/\/wp.lancs.ac.uk\/colab\/?p=126"},"modified":"2024-12-14T23:35:24","modified_gmt":"2024-12-14T23:35:24","slug":"certified-policy-smoothing-for-cooperative-multi-agent-reinforcement-learning","status":"publish","type":"post","link":"https:\/\/wp.lancs.ac.uk\/colab\/2023\/02\/14\/certified-policy-smoothing-for-cooperative-multi-agent-reinforcement-learning\/","title":{"rendered":"Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/advAttack-1024x425.png\" alt=\"\" width=\"676\" height=\"281\" class=\"aligncenter size-large wp-image-127\" srcset=\"https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/advAttack-1024x425.png 1024w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/advAttack-300x124.png 300w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/advAttack-768x319.png 768w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/advAttack-676x280.png 676w, https:\/\/wp.lancs.ac.uk\/colab\/files\/2024\/12\/advAttack.png 1087w\" sizes=\"auto, (max-width: 676px) 100vw, 676px\" \/><\/p>\n<p>Our work &#8220;Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning&#8221; was presented at AAAI 2023. The paper presents a new approach to certify the robustness of multi-agent cooperative reinforcement learning systems, using policy smoothing techniques. The paper is freely available <a href=\"https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/26756\/26528\">here<\/a>. The source code is also available on <a href=\"https:\/\/github.com\/TrustAI\/CertifyCMARL\">GitHub<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our work &#8220;Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning&#8221; was presented at AAAI 2023. The paper presents a new approach to certify the robustness of multi-agent cooperative reinforcement learning systems, using policy smoothing techniques. The paper is freely available here. The source code is also available on GitHub.<\/p>\n","protected":false},"author":1432,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3],"tags":[],"class_list":["post-126","post","type-post","status-publish","format-standard","hentry","category-news"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts\/126","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/users\/1432"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/comments?post=126"}],"version-history":[{"count":1,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts\/126\/revisions"}],"predecessor-version":[{"id":128,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/posts\/126\/revisions\/128"}],"wp:attachment":[{"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/media?parent=126"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/categories?post=126"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/colab\/wp-json\/wp\/v2\/tags?post=126"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}