{"id":3819,"date":"2021-04-06T08:02:34","date_gmt":"2021-04-06T07:02:34","guid":{"rendered":"https:\/\/www.ceessnoek.info\/?p=3819"},"modified":"2021-04-06T08:02:46","modified_gmt":"2021-04-06T07:02:46","slug":"cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space","status":"publish","type":"post","link":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/","title":{"rendered":"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space"},"content":{"rendered":"\n<p>The CVPR 2021 cam-ready \u201c<em>Few-Shot Transformation of Common Actions into Time and Space<\/em>\u201d by Pengwan Yang, Pascal Mettes and Cees Snoek is\u00a0<a href=\"https:\/\/isis-data.science.uva.nl\/cgmsnoek\/pub\/yang-common-time-space-cvpr2021.pdf\">now available<\/a>.\u00a0This paper introduces the task of few-shot common action localization in time and space. Given a few trimmed support videos containing the same but unknown action, we strive for spatio-temporal localization of that action in a long untrimmed query video. We do not require any class labels, interval bounds, or bounding boxes. To address this challenging task, we introduce a novel few-shot transformer architecture with a dedicated encoder-decoder structure optimized for joint commonality learning and localization prediction, without the need for proposals. Experiments on reorganizations of the AVA and UCF101-24 datasets show the effectiveness of our approach for few-shot common action localization, even when the support videos are noisy. Although our approach is not specifically designed for common localization in time only, it also compares favorably against the few-shot and one-shot state of the art in this setting. 
Lastly, we demonstrate that the few-shot transformer is easily extended to common action localization per pixel.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><a href=\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png\"><img loading=\"lazy\" decoding=\"async\" width=\"924\" height=\"668\" src=\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png\" alt=\"\" class=\"wp-image-3816\" srcset=\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png 924w, https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space-300x217.png 300w, https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space-768x555.png 768w\" sizes=\"auto, (max-width: 924px) 100vw, 924px\" \/><\/a><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>The CVPR 2021 cam-ready \u201cFew-Shot Transformation of Common Actions into Time and Space\u201d by Pengwan Yang, Pascal Mettes and Cees Snoek is\u00a0now available.\u00a0This paper introduces the task of few-shot common action localization in time and space. 
Given a few trimmed support videos containing the same but unknown action, we strive for spatio-temporal localization of that [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[],"class_list":["post-3819","post","type-post","status-publish","format-standard","hentry","category-science"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space - Cees Snoek<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space - Cees Snoek\" \/>\n<meta property=\"og:description\" content=\"The CVPR 2021 cam-ready \u201cFew-Shot Transformation of Common Actions into Time and Space\u201d by Pengwan Yang, Pascal Mettes and Cees Snoek is\u00a0now available.\u00a0This paper introduces the task of few-shot common action localization in time and space. 
Given a few trimmed support videos containing the same but unknown action, we strive for spatio-temporal localization of that [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/\" \/>\n<meta property=\"og:site_name\" content=\"Cees Snoek\" \/>\n<meta property=\"article:published_time\" content=\"2021-04-06T07:02:34+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-04-06T07:02:46+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png\" \/>\n<meta name=\"author\" content=\"Cees\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Cees\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/\",\"url\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/\",\"name\":\"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space - Cees 
Snoek\",\"isPartOf\":{\"@id\":\"https:\/\/www.ceessnoek.info\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png\",\"datePublished\":\"2021-04-06T07:02:34+00:00\",\"dateModified\":\"2021-04-06T07:02:46+00:00\",\"author\":{\"@id\":\"https:\/\/www.ceessnoek.info\/#\/schema\/person\/4bca975b7c432aeb5dced40bdbc204c1\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#primaryimage\",\"url\":\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png\",\"contentUrl\":\"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png\",\"width\":924,\"height\":668},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.ceessnoek.info\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"CVPR 2021: Few-Shot Transformation of Common Actions into Time and 
Space\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.ceessnoek.info\/#website\",\"url\":\"https:\/\/www.ceessnoek.info\/\",\"name\":\"Cees Snoek\",\"description\":\"research on video and image ai\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.ceessnoek.info\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.ceessnoek.info\/#\/schema\/person\/4bca975b7c432aeb5dced40bdbc204c1\",\"name\":\"Cees\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.ceessnoek.info\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/756ccb993852c1e8e3af39a228d11a7305b2a937750f26dc5799d5df019b0f51?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/756ccb993852c1e8e3af39a228d11a7305b2a937750f26dc5799d5df019b0f51?s=96&d=mm&r=g\",\"caption\":\"Cees\"},\"sameAs\":[\"http:\/\/www.CeesSnoek.info\"],\"url\":\"https:\/\/www.ceessnoek.info\/index.php\/author\/admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space - Cees Snoek","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/","og_locale":"en_US","og_type":"article","og_title":"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space - Cees Snoek","og_description":"The CVPR 2021 cam-ready \u201cFew-Shot Transformation of Common Actions into Time and Space\u201d by Pengwan Yang, Pascal Mettes and Cees Snoek is\u00a0now available.\u00a0This paper introduces the task of few-shot common action localization in time and space. Given a few trimmed support videos containing the same but unknown action, we strive for spatio-temporal localization of that [&hellip;]","og_url":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/","og_site_name":"Cees Snoek","article_published_time":"2021-04-06T07:02:34+00:00","article_modified_time":"2021-04-06T07:02:46+00:00","og_image":[{"url":"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png","type":"","width":"","height":""}],"author":"Cees","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Cees","Est. 
reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/","url":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/","name":"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space - Cees Snoek","isPartOf":{"@id":"https:\/\/www.ceessnoek.info\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#primaryimage"},"image":{"@id":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#primaryimage"},"thumbnailUrl":"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png","datePublished":"2021-04-06T07:02:34+00:00","dateModified":"2021-04-06T07:02:46+00:00","author":{"@id":"https:\/\/www.ceessnoek.info\/#\/schema\/person\/4bca975b7c432aeb5dced40bdbc204c1"},"breadcrumb":{"@id":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-space\/#primaryimage","url":"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png","contentUrl":"https:\/\/www.ceessnoek.info\/wp-content\/uploads\/2021\/03\/pengwang-common-time-space.png","width":924,"height":668},{"@type":"BreadcrumbList","@id":"https:\/\/www.ceessnoek.info\/index.php\/cvpr-2021-few-shot-transformation-of-common-actions-into-time-and-spa
ce\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ceessnoek.info\/"},{"@type":"ListItem","position":2,"name":"CVPR 2021: Few-Shot Transformation of Common Actions into Time and Space"}]},{"@type":"WebSite","@id":"https:\/\/www.ceessnoek.info\/#website","url":"https:\/\/www.ceessnoek.info\/","name":"Cees Snoek","description":"research on video and image ai","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ceessnoek.info\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.ceessnoek.info\/#\/schema\/person\/4bca975b7c432aeb5dced40bdbc204c1","name":"Cees","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.ceessnoek.info\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/756ccb993852c1e8e3af39a228d11a7305b2a937750f26dc5799d5df019b0f51?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/756ccb993852c1e8e3af39a228d11a7305b2a937750f26dc5799d5df019b0f51?s=96&d=mm&r=g","caption":"Cees"},"sameAs":["http:\/\/www.CeesSnoek.info"],"url":"https:\/\/www.ceessnoek.info\/index.php\/author\/admin\/"}]}},"_links":{"self":[{"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/posts\/3819","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/comments?post=3819"}],"version-history":[{"count":1,"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/posts\/3819\/revisions"}],"predecessor-version":[{
"id":3820,"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/posts\/3819\/revisions\/3820"}],"wp:attachment":[{"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/media?parent=3819"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/categories?post=3819"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ceessnoek.info\/index.php\/wp-json\/wp\/v2\/tags?post=3819"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}