312 entries « 1 of 7 »

2025

Aritra Bhowmik, Pascal Mettes, Martin R Oswald, Cees G M Snoek: Union-over-Intersections: Object Detection beyond Winner-Takes-All. In: ICLR, 2025. (Type: Proceedings Article)
Ivona Najdenkoska, Mohammad Mahdi Derakhshani, Yuki M Asano, Nanne van Noord, Marcel Worring, Cees G M Snoek: TULIP: Token-length Upgraded CLIP. In: ICLR, 2025. (Type: Proceedings Article)
Christina Sartzetaki, Gemma Roig, Cees G M Snoek, Iris I A Groen: One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment. In: ICLR, 2025. (Type: Proceedings Article)
Jie Liu, Pan Zhou, Yingjun Du, Ah-Hwee Tan, Cees G M Snoek, Jan-Jakob Sonke, Efstratios Gavves: CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation. In: ICLR, 2025. (Type: Proceedings Article)
Zehao Xiao, Shilin Yan, Jack Hong, Jiayin Cai, Xiaolong Jiang, Yao Hu, Jiayi Shen, Cheems Wang, Cees G M Snoek: DynaPrompt: Dynamic Test-Time Prompt Tuning. In: ICLR, 2025. (Type: Proceedings Article)
Piyush Bagad, Makarand Tapaswi, Cees G M Snoek, Andrew Zisserman: The Sound of Water: Inferring Physical Properties from Pouring Liquids. In: ICASSP, 2025. (Type: Proceedings Article)
Sameer Ambekar, Zehao Xiao, Xiantong Zhen, Cees G M Snoek: GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts. In: WACV, 2025. (Type: Proceedings Article)
Duy-Kien Nguyen, Martin R Oswald, Cees G M Snoek: SimPLR: A Simple and Plain Transformer for Scaling-Efficient Object Detection and Segmentation. In: Transactions on Machine Learning Research, 2025, (Pending minor revision). (Type: Journal Article)
Huabin Liu, Filip Ilievski, Cees G M Snoek: Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning. arXiv:2501.05069, 2025. (Type: Unpublished)

2024

Yingjun Du, Wenfang Sun, Cees G M Snoek: IPO: Interpretable Prompt Optimization for Vision-Language Models. In: NeurIPS, 2024. (Type: Proceedings Article)
Mohammadreza Salehi, Nikolaos Apostolikas, Efstratios Gavves, Cees G M Snoek, Yuki M Asano: Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection. In: ACCV, 2024, (Oral presentation). (Type: Proceedings Article)
Aozhu Chen, Hazel Doughty, Xirong Li, Cees G M Snoek: Beyond Coarse-Grained Matching in Video-Text Retrieval. In: ACCV, 2024, (Oral presentation). (Type: Proceedings Article)
Hazel Doughty, Fida Mohammad Thoker, Cees G M Snoek: LocoMotion: Learning Motion-Focused Video-Language Representations. In: ACCV, 2024, (Oral presentation). (Type: Proceedings Article)
Wenfang Sun, Yingjun Du, Gaowen Liu, Cees G M Snoek: QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain. arXiv:2411.19534, 2024. (Type: Unpublished)
Yunhua Zhang, Hazel Doughty, Cees G M Snoek: Day2Dark: Pseudo-Supervised Activity Recognition beyond Silent Daylight. In: International Journal of Computer Vision, 2024, (In press). (Type: Journal Article)
Zehao Xiao, Cees G M Snoek: Beyond Model Adaptation at Test Time: A Survey. arXiv:2411.03687, 2024. (Type: Unpublished)
Yingjun Du, Gaowen Liu, Yuzhang Shang, Yuguang Yao, Ramana Kompella, Cees G M Snoek: Prompt Diffusion Robustifies Any-Modality Prompt Learning. arXiv:2410.20164, 2024. (Type: Unpublished)
Daniel Cores, Michael Dorkenwald, Manuel Mucientes, Cees G M Snoek, Yuki M Asano: TVBench: Redesigning Video-Language Evaluation. arXiv:2410.07752, 2024. (Type: Unpublished)
Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker, Efstratios Gavves, Cees G M Snoek, Yuki M Asano: SIGMA: Sinkhorn-Guided Masked Video Modeling. In: ECCV, 2024. (Type: Proceedings Article)
Sarah Rastegar, Mohammadreza Salehi, Yuki M Asano, Hazel Doughty, Cees G M Snoek: SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery. In: ECCV, 2024. (Type: Proceedings Article)
Luc Sträter, Mohammadreza Salehi, Efstratios Gavves, Cees G M Snoek, Yuki M Asano: GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features. In: ECCV, 2024. (Type: Proceedings Article)
Sameer Ambekar, Zehao Xiao, Jiayi Shen, Xiantong Zhen, Cees G M Snoek: Probabilistic Test-Time Generalization by Variational Neighbor-Labeling. In: CoLLAs, 2024. (Type: Proceedings Article)
Zenglin Shi, Pascal Mettes, Cees G M Snoek: Focus for Free in Density-Based Counting. In: International Journal of Computer Vision, vol. 132, iss. 7, pp. 2600-2617, 2024. (Type: Journal Article)
Yunhua Zhang, Hazel Doughty, Cees G M Snoek: Low-Resource Vision Challenges for Foundation Models. In: CVPR, 2024, (Best paper FGVC2024 workshop). (Type: Proceedings Article)
Michael Dorkenwald, Nimrod Barazani, Cees G M Snoek, Yuki M Asano: PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs. In: CVPR, 2024. (Type: Proceedings Article)
Zehao Xiao, Jiayi Shen, Mohammad Mahdi Derakhshani, Shengcai Liao, Cees G M Snoek: Any-Shift Prompting for Generalization over Distributions. In: CVPR, 2024. (Type: Proceedings Article)
Duy-Kien Nguyen, Mahmoud Assran, Unnat Jain, Martin R Oswald, Cees G M Snoek, Xinlei Chen: An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels. arXiv:2406.09415, 2024. (Type: Unpublished)
Sarah Rastegar, Hazel Doughty, Cees G M Snoek: Background No More: Action Recognition Across Domains by Causal Interventions. In: Computer Vision and Image Understanding, vol. 242, 2024. (Type: Journal Article)
Duy-Kien Nguyen, Vaibhav Aggarwal, Yanghao Li, Martin R Oswald, Alexander Kirillov, Cees G M Snoek, Xinlei Chen: R-MAE: Regions Meet Masked Autoencoders. In: ICLR, 2024. (Type: Proceedings Article)
Miltiadis Kofinas, Boris Knyazev, Yan Zhang, Yunlu Chen, Gertjan J Burghouts, Efstratios Gavves, Cees G M Snoek, David W Zhang: Graph Neural Networks for Learning Equivariant Representations of Neural Networks. In: ICLR, 2024, (Oral presentation). (Type: Proceedings Article)
Wenfang Sun, Yingjun Du, Gaowen Liu, Ramana Kompella, Cees G M Snoek: Training-Free Semantic Segmentation via LLM-Supervision. arXiv:2404.00701, 2024. (Type: Unpublished)
Vincent Tao Hu, Di Wu, Yuki M Asano, Pascal Mettes, Basura Fernando, Björn Ommer, Cees G M Snoek: Flow Matching for Conditional Text Generation in a Few Sampling Steps. In: EACL, 2024. (Type: Proceedings Article)
Vincent Tao Hu, David W Zhang, Mang Tang, Pascal Mettes, Deli Zhao, Cees G M Snoek: Latent Space Editing in Transformer-Based Flow Matching. In: AAAI Conference on Artificial Intelligence, 2024. (Type: Proceedings Article)
Yingjun Du, Haoliang Sun, Xiantong Zhen, Jun Xu, Yilong Yin, Ling Shao, Cees G M Snoek: MetaKernel: Learning Variational Random Features with Limited Labels. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 46, iss. 3, pp. 1464-1478, 2024. (Type: Journal Article)

2023

Vincent Tao Hu, Yunlu Chen, Mathilde Caron, Yuki M Asano, Cees G M Snoek, Björn Ommer: Guided Diffusion from Self-Supervised Diffusion Features. arXiv:2312.08825, 2023. (Type: Unpublished)
Vincent Tao Hu, Wenzhe Yin, Pingchuan Ma, Yunlu Chen, Basura Fernando, Yuki M Asano, Efstratios Gavves, Pascal Mettes, Björn Ommer, Cees G M Snoek: Motion Flow Matching for Human Motion Synthesis and Editing. arXiv:2312.08895, 2023. (Type: Unpublished)
Mohammad Mahdi Derakhshani, Menglin Xia, Harkirat Behl, Cees G M Snoek, Victor Rühle: Unlocking Spatial Comprehension in Text-to-Image Diffusion Models. arXiv:2311.17937, 2023. (Type: Unpublished)
Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G M Snoek, Marcel Worring, Yuki M Asano: Small Visual Language Models can also be Open-Ended Few-Shot Learners. arXiv:2310.00500, 2023. (Type: Unpublished)
Yingjun Du, Zehao Xiao, Shengcai Liao, Cees G M Snoek: ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion. In: NeurIPS, 2023. (Type: Proceedings Article)
Sarah Rastegar, Hazel Doughty, Cees G M Snoek: Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery. In: NeurIPS, 2023. (Type: Proceedings Article)
Yunhua Zhang, Hazel Doughty, Cees G M Snoek: Learning Unseen Modality Interaction. In: NeurIPS, 2023. (Type: Proceedings Article)
Mohammad Mahdi Derakhshani, Enrique Sanchez, Adrian Bulat, Victor Guilherme Turrisi da Costa, Cees G M Snoek, Georgios Tzimiropoulos, Brais Martinez: Bayesian Prompt Learning for Image-Language Model Generalization. In: ICCV, 2023. (Type: Proceedings Article)
Aritra Bhowmik, Martin R Oswald, Yu Wang, Nora Baka, Cees G M Snoek: Detecting Objects with Graph Priors and Graph Refinement. In: ICCV, 2023. (Type: Proceedings Article)
Fida Mohammad Thoker, Hazel Doughty, Cees G M Snoek: Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization. In: ICCV, 2023. (Type: Proceedings Article)
Pengwan Yang, Cees G M Snoek, Yuki M Asano: Self-Ordering Point Clouds. In: ICCV, 2023, (Oral presentation). (Type: Proceedings Article)
Mengmeng Jing, Xiantong Zhen, Jingjing Li, Cees G M Snoek: Order-preserving Consistency Regularization for Domain Adaptation and Generalization. In: ICCV, 2023. (Type: Proceedings Article)
Mohammadreza Salehi, Efstratios Gavves, Cees G M Snoek, Yuki M Asano: Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations. In: ICCV, 2023. (Type: Proceedings Article)
Tom van Sonsbeek, Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G M Snoek, Marcel Worring: Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models. In: MICCAI, 2023, (Oral presentation). (Type: Proceedings Article)
Tao Hu, William Thong, Pascal Mettes, Cees G M Snoek: Query by Activity Video in the Wild. In: ICIP, 2023. (Type: Proceedings Article)
Haoliang Sun, Xiankai Lu, Haochen Wang, Yilong Yin, Xiantong Zhen, Cees G M Snoek, Ling Shao: Attentional Prototype Inference for Few-Shot Segmentation. In: Pattern Recognition, vol. 142, 2023. (Type: Journal Article)
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.