Publications
You can also find my publications on my Google Scholar profile.
Year: 2025 2024 2023 2022 2021 2020 Before 2020
2025
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Yuxiang Lu, Shengcao Cao, Yu-Xiong Wang
ICLR 2025
[PDF]
RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning
Qianlan Yang, Yu-Xiong Wang
ICLR 2025
[PDF]
3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing
Jiahua Dong, Yu-Xiong Wang
ICLR 2025
[Website]
[PDF]
[Code]
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang*, Xu Xin*, Yu-Xiong Wang
ICLR 2025
[Website]
[PDF]
[Code]
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
Shuhong Zheng, Zhipeng Bao, Ruoyu Zhao, Martial Hebert, Yu-Xiong Wang
ICLR 2025
[Website]
[PDF]
2024
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
Shengcao Cao, Liang-Yan Gui, Yu-Xiong Wang
arXiv, 2024
[Website]
[PDF]
[Code]
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan, Alex Schwing, Yu-Xiong Wang
NeurIPS, 2024 (Spotlight)
[Website]
[PDF]
[Code]
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs
Bowen Jin, Ziqi Pang, Bingjun Guo, Yu-Xiong Wang, Jiaxuan You, Jiawei Han
NeurIPS, 2024
[Website]
[PDF]
[Code]
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing
Jun-Kun Chen, Yu-Xiong Wang
NeurIPS, 2024
[Website]
[PDF]
SceneCraft: Layout-Guided 3D Scene Generation
Xiuyu Yang, Yunze Man, Jun-Kun Chen, Yu-Xiong Wang
NeurIPS, 2024
[Website]
[PDF]
[Code]
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
Sirui Xu, Ziyin Wang, Yu-Xiong Wang, Liang-Yan Gui
NeurIPS, 2024.
[Website]
[PDF]
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Reasoning
Yunze Man, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Liang-Yan Gui, Yu-Xiong Wang
NeurIPS, 2024.
[Website]
[PDF]
[Code]
Floating No More: Object-Ground Reconstruction from a Single Image
Yunze Man, Yichen Sheng, Jianming Zhang, Liang-Yan Gui, Yu-Xiong Wang
arXiv, 2024.
[Website]
[PDF]
AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation
Ri-Zhao Qiu, Yu-Xiong Wang*, Kris Hauser*
ECCV, 2024 (Oral).
[PDF]
[Code]
Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models
Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang
ICML, 2024.
[Website]
[PDF]
[Code]
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
ƒ
Kai Yan, Alex Schwing, Yu-Xiong Wang
ICML, 2024.
[Website]
[PDF]
[Code]
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
Qianlan Yang, Yu-Xiong Wang
ICML, 2024.
[Website]
[PDF]
Aligning Large Multimodal Models with Factually Augmented RLHF
Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liangyan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
ACL Findings, 2024.
[Website]
[PDF]
[Code]
Separate-and-Enhance: Compositional Finetuning for Text2Image Diffusion Models
Zhipeng Bao, Yijun Li, Krishna Kumar Singh, Yu-Xiong Wang, Martial Hebert
SIGGRAPH, 2024.
[PDF]
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion
Linzhan Mou*, Jun-Kun Chen*, Yu-Xiong Wang
CVPR, 2024.
[Website]
[PDF]
[Poster]
[Video]
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing
Jun-Kun Chen, Samuel Rota Bulò, Norman Müller, Lorenzo Porzi, Peter Kontschieder, Yu-Xiong Wang
CVPR, 2024.
[Website]
[PDF]
[Poster]
[Video]
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
Zhihao Zhang, Shengcao Cao, Yu-Xiong Wang
CVPR, 2024.
[Website]
[PDF]
[Code]
[Video]
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man, Liangyan Gui, Yu-Xiong Wang
CVPR, 2024.
[Website]
[PDF]
[Code]
[Video]
Restricted Memory Banks Improve Video Object Segmentation: A Revisit
Junbao Zhou, Ziqi Pang, Yu-Xiong Wang
CVPR, 2024.
[Website]
[PDF]
[Code]
[Video]
Region Representations Revisited
Michal Shlapentokh-Rothman* , Ansel Blume*, Yao Xiao, Yuqun Wu, Sethurame TV, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem
CVPR, 2024.
[Website]
[PDF]
[Code]
[Poster]
[Video]
Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models
Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang
arXiv
[Website] [PDF] [Code]
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang, Ziyang Xie*, Yunze Man*, Yu-Xiong Wang
ICLR 2024 (Spotlight, Top 5%)
[Website] [Code]
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
Shengcao Cao, Jiuxiang Gu, Jason Kuen, Hao Tan, Ruiyi Zhang, Handong Zhao, Ani Nenkova, Liang-Yan Gui, Tong Sun, Yu-Xiong Wang
ICLR, 2024.
[PDF]
2023
YouTubePD: A Multimodal Benchmark for Parkinson’s Disease Analysis
Andy Zhou*, Samuel Li*, Pranav Sriram*, Xiang Li*, Jiahua Dong*, Ansh Sharma, Yuanyi Zhong, Shirui Luo, Maria Jaromin, Volodymyr Kindratenko, George Heintz, Christopher Zallek, Yu-Xiong Wang
NeurIPS Datasets and Benchmarks Track, 2023.
[Website] [PDF]
ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields
Jiahua Dong, Yu-Xiong Wang
NeurIPS, 2023.
[Website] [PDF] [Code]
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang
NeurIPS, 2023.
[Website] [PDF] [Code] [Video]
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan, Alexander G. Schwing, Yu-Xiong Wang
NeurIPS, 2023.
[Website] [PDF] [Code] [Poster] [Video]
Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
Andy Zhou, Jindong Wang, Yu-Xiong Wang, Haohan Wang
NeurIPS, 2023.
[PDF] [Code]
Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study
Yuanyi Zhong*, Haoran Tang*, Junkun Chen*, Yu-Xiong Wang
ICCV, 2023.
Improving Equivariance in State-of-the-Art Supervised Depth and Normal Predictors
Yuanyi Zhong, Anand Bhattad, Yu-Xiong Wang, David A. Forsyth
ICCV, 2023.
Video State-Changing Object Segmentation
Jiangwei Yu*, Xiang Li*, Xinran Zhao, Hongming Zhang, Yu-Xiong Wang
ICCV, 2023.
[Website] [PDF]
Multi-task View Synthesis with Neural Radiance Fields
Shuhong Zheng*, Zhipeng Bao*, Martial Hebert, Yu-Xiong Wang
ICCV, 2023.
[Website] [PDF] [Code]
InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion
Sirui Xu, Zhengyuan Li, Yu-Xiong Wang*, Liangyan Gui*
ICCV, 2023.
[Website] [PDF] [Code] [Video]
Revisiting Deformable Convolution for Depth Completion
Xinglong Sun, Jean Ponce, Yu-Xiong Wang
IROS, 2023.
Streaming Motion Forecasting for Autonomous Driving
Ziqi Pang, Deva Ramanan, Mengtian Li, Yu-Xiong Wang
IROS, 2023.
[Website] [PDF] [Code]
Learning Lightweight Object Detectors via Progressive Knowledge Distillation
Shengcao Cao, Mengtian Li, James Hays, Deva Ramanan, Yu-Xiong Wang, Liangyan Gui
ICML, 2023.
[PDF] [Code]
NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds
Jun-Kun Chen, Jipeng Lyu, Yu-Xiong Wang
CVPR, 2023.
[Website] [PDF]
Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking
Ziqi Pang, Jie Li, Pavel Tokmakov, Dian Chen, Sergey Zagoruyko, Yu-Xiong Wang
CVPR, 2023.
[Website] [PDF] [Code] [Video]
Contrastive Mean Teacher for Domain Adaptive Object Detectors
Shengcao Cao, Dhiraj Joshi, Liangyan Gui, Yu-Xiong Wang
CVPR, 2023.
[PDF] [Code] [Video]
Object Discovery from Motion-Guided Tokens
Zhipeng Bao, Pavel Tokmakov, Yu-Xiong Wang, Adrien Gaidon, Martial Hebert
CVPR, 2023.
[Website] [PDF] [Code]
GAPS: Few-Shot Incremental Semantic Segmentation via Guided Copy-Paste Synthesis
Ri-Zhao Qiu, Peiyi Chen, Wangzhe Sun, Yu-Xiong Wang, Kris Hauser
CVPR Workshop on Learning with Limited Labelled Data, 2023.
From N to N+1: Learning to Detect Novel Animals with SAM in the Wild
Garvita Allabadi, Ana Lucic, Yu-Xiong Wang, Vikram Adve
CVPR Workshop on CV4Animals, 2023.
Stochastic Multi-Person 3D Motion Forecasting
Sirui Xu, Yu-Xiong Wang*, Liangyan Gui*
ICLR, **Notable-Top-25%**, 2023.
[Website] [PDF] [Code]
Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields
Mingtong Zhang*, Shuhong Zheng*, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
WACV, 2023.
[PDF] [Code] [Video]
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan, Alexander G. Schwing, Yu-Xiong Wang
NeurIPS Workshop on Optimal Transport and Machine Learning, 2023.
[Website] [PDF] [Code] [Poster] [Video]
Aligning Large Multimodal Models with Factually Augmented RLHF
Zhiqing Sun*, Sheng Shen*, Shengcao Cao*, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Yan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
arXiv, 2023.
[Website] [PDF] [Code]
2022
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations
Kai Yan, Alexander G. Schwing, Yu-Xiong Wang
NeurIPS, 2022.
[Website] [PDF] [Code] [Poster] [Video]
LECO: Learning with an Evolving Class Ontology
Zhiqiu Lin, Deepak Pathak, Yu-Xiong Wang, Deva Ramanan, Shu Kong
NeurIPS, 2022.
[Website] [PDF]
The Curse of Zero Task Diversity: On the Failure of Transfer Learning to Outperform MAML and their Empirical Equivalence
Brando Miranda, Patrick Yu, Yu-Xiong Wang, Oluwasanmi Koyejo
NeurIPS Workshop on Meta-Learning, **Contributed Talk**, 2022.
[PDF] [Poster]
Towards Overcoming Data Scarcity in Materials Science: Unifying Models and Datasets with a Mixture of Experts Framework
Rees Chang, Yu-Xiong Wang, Elif Ertekin
Nature npj Computational Materials, 2022.
[PDF] [Code]
Generalized Few-Shot Node Classification on Graphs
Zhe Xu, Kaize Ding, Yu-Xiong Wang, Huan Liu, Hanghang Tong
ICDM, 2022.
[PDF]
Diverse Human Motion Prediction Guided by Multi-Level Spatial Temporal Anchors
Sirui Xu, Yu-Xiong Wang*, Liang-Yan Gui*
ECCV, **Oral Presentation**, 2022.
[Website] [PDF] [Code]
PointTree: Transformation Robust Point Cloud Encoder with Relaxed K-D Trees
Junkun Chen, Yu-Xiong Wang
ECCV, 2022.
[PDF] [Code]
Generative Modeling for Multi-Task Visual Learning
Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
ICML, 2022.
[PDF] [Code]
Is Self-Supervised Contrastive Learning More Robust Than Supervised Learning?
Yuanyi Zhong, Haoran Tang, Junkun Chen, Jian Peng, Yu-Xiong Wang
ICML Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward, 2022.
[PDF]
DIVeR: Real-time and Accurate Neural Radiance Fields with Deterministic Integration for Volume Rendering
Liwen Wu, Jae Yong Lee, Anand Bhattad, Yu-Xiong Wang, David Forsyth
CVPR, **Best Paper Award Finalist**, 2022.
[Website] [PDF]
Discovering Objects that Can Move
Zhipeng Bao, Pavel Tokmakov, Allan Jabri, Yu-Xiong Wang, Adrien Gaidon, Martial Hebert
CVPR, 2022.
[Website] [PDF]
Long-Tailed Recognition via Weight Balancing
Shaden Alshammari, Yu-Xiong Wang, Deva Ramanan, Shu Kong
CVPR, 2022.
[PDF] [Code]
Embracing Single Stride 3D Object Detector with Sparse Transformer
Lue Fan, Ziqi Pang, Tianyuan Zhang, Yu-Xiong Wang, Hang Zhao, Feng Wang, Naiyan Wang, Zhaoxiang Zhang
CVPR, 2022.
[Website] [PDF] [Code]
On the Importance of Firth Bias Reduction in Few-Shot Classification
Saba Ghaffari*, Ehsan Saleh*, David Forsyth, Yu-Xiong Wang
ICLR, 2022.
[Website] [PDF]
2021
SIRfyN: Single Image Relighting from your Neighbors
David Forsyth, Anand Bhattad, Pranav Asthana, Yuanyi Zhong, Yu-Xiong Wang
Arxiv, 2021.
[PDF]
Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
Yuanyi Zhong, Bodi Yuan, Hong Wu, Zhiqiang Yuan, Jian Peng, Yuxiong Wang
ICCV, 2021.
[PDF]
Learning to Hallucinate Examples from Extrinsic and Intrinsic Supervision
Liangke Gui*, Adrien Bardes*, Ruslan Salakhutdinov, Alexander Hauptmann, Martial Hebert, Yuxiong Wang
ICCV, 2021 (\* indicates equal contribution).
[PDF]
On the Importance of Distractors for Few-Shot Learning
Rajshekhar Das, Yuxiong Wang, José M. F. Moura
ICCV, 2021.
[PDF]
Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection
Nadine Chang, Zhiding Yu, Yuxiong Wang, Anima Anandkumar, Sanja Fidler, Jose M. Alvarez
ICML, 2021.
[PDF]
Hallucination Improves Few-Shot Object Detection
Weilin Zhang, Yuxiong Wang
CVPR, 2021.
[PDF]
DAP: Detection-Aware Pre-training with Weak Supervision
Yuanyi Zhong, Jianfeng Wang, Lijuan Wang, Jian Peng, Yuxiong Wang, , Lei Zhang
CVPR, 2021.
[PDF]
Unlocking the Full Potential of Small Data with Diverse Supervision
Ziqi Pang, Zhiyuan Hu, Pavel Tokmakov, Yuxiong Wang, Martial Hebert
CVPR Workshop on Learning from Limited or Imperfect Data, 2021.
[Website] [PDF] [Code]
Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis
Zhipeng Bao, Yuxiong Wang, Martial Hebert
ICLR, 2021.
[PDF]
[code]
2020
Towards Streaming Perception
Mengtian Li, Yuxiong Wang, Deva Ramanan
**Oral Presentation, Best Paper Honorable Mention**, ECCV, 2020.
[PDF]
Before 2020
Meta-Learning to Detect Rare Objects
Yuxiong Wang, Deva Ramanan, Martial Hebert,
ICCV, 2019.
[PDF] [Poster]
Learning Compositional Representations for Few-Shot Recognition
Pavel Tokmakov, Yuxiong Wang, Martial Hebert
ICCV, 2019.
[PDF] [Poster]
Image Deformation Meta-Networks for One-Shot Learning
Zitian Chen, Yanwei Fu, Yuxiong Wang, Lin Ma, Wei Liu, Martial Hebert
Oral Presentation, Best Paper Award Finalist, CVPR, 2019.
[PDF] [Poster]
Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent
Yuqian Fu, Chengrong Wang, Yanwei Fu, Yuxiong Wang, Cong Bai, Xiangyang Xue, Yu-Gang
Jiang, Lin Ma, Wei Liu, Martial Hebert
[PDF]
Teaching Robots to Predict Human Motion
Liang-Yan Gui, Kevin Zhang, Yuxiong Wang, Xiaodan Liang, José M. F. Moura, Manuela M. Veloso
Oral Presentation, IROS, 2018.
[PDF]
Few-Shot Human Motion Prediction via Meta-Learning
Liang-Yan Gui, Yuxiong Wang, Deva Ramanan, José M. F. Moura
ECCV, 2018.
[PDF] [Poster]
Adversarial Geometry-Aware Human Motion Prediction
Yuxiong Wang*, Liang-Yan Gui*, Xiaodan Liang, José M. F. Moura
Oral Presentation, ECCV, 2018 (* indicates equal contribution).
[PDF] [Poster]
Low-Shot Learning from Imaginary Data
Yuxiong Wang, Ross Girshick, Martial Hebert, Bharath Hariharan
Spotlight Oral Presentation, CVPR, 2018.
[PDF] [Poster]
Factorized Convolutional Networks: Unsupervised Fine-Tuning for Image Clustering
Liang-Yan Gui, Liangke Gui, Yuxiong Wang, Louis-Philippe Morency, José M. F. Moura
Oral Presentation, WACV, 2018.
[PDF]
Learning to Model the Tail
Yuxiong Wang, Deva Ramanan, Martial Hebert
NeurIPS, 2017.
[PDF] [Poster]
Few-Shot Hash Learning for Image Retrieval
Yuxiong Wang, Liangke Gui, Martial Hebert
ICCV Workshops, 2017.
[PDF] [Poster]
Growing a Brain: Fine-Tuning by Increasing Model Capacity
Yuxiong Wang, Deva Ramanan, Martial Hebert
CVPR, 2017.
[PDF] [Poster]
Learning from Small Sample Sets by Combining Unsupervised Meta-Training with CNNs
Yuxiong Wang, Martial Hebert
NeurIPS, 2016.
[PDF] [Poster]
Learning to Learn: Model Regression Networks for Easy Small Sample Learning
Yuxiong Wang, Martial Hebert
ECCV, 2016.
[PDF] [Poster]
Learning by Transferring from Unsupervised Universal Sources
Yuxiong Wang, Martial Hebert
Oral Presentation, AAAI, 2016.
[PDF]
Model Recommendation: Generating Object Detectors from Few Samples
Yuxiong Wang, Martial Hebert
CVPR, 2015.
[PDF] [Poster]
Self-Explanatory Sparse Representation for Image Classification
Yuxiong Wang*, Baodi Liu*, Bin Shen, Yu-Jin Zhang, Martial Hebert
ECCV, 2014 (* indicates equal contribution).
[PDF] [Poster]
Non-Negative Matrix Factorization: A Comprehensive Review
Yuxiong Wang, Yu-Jin Zhang
IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 6, pp.1336-1353, 2013.
[PDF]