Recent Publications

Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning

Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning

Yuxiang Lu, Shengcao Cao, Yu-Xiong Wang

ICLR 2025

[PDF]

RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning

RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning

Qianlan Yang, Yu-Xiong Wang

ICLR 2025

[PDF]

3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing

3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing

Jiahua Dong, Yu-Xiong Wang

ICLR 2025

[Website] [PDF] [Code]

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Ziqi Pang*, Xu Xin*, Yu-Xiong Wang

ICLR 2025

[Website] [PDF] [Code]

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Shuhong Zheng, Zhipeng Bao, Ruoyu Zhao, Martial Hebert, Yu-Xiong Wang

ICLR 2025

[Website] [PDF]

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Shengcao Cao, Liang-Yan Gui, Yu-Xiong Wang

arXiv, 2024

[Website] [PDF] [Code]

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers

Kai Yan, Alex Schwing, Yu-Xiong Wang

NeurIPS, 2024 (Spotlight)

[Website] [PDF] [Code]

InstructG2I: Synthesizing Images from Multimodal Attributed Graphs

InstructG2I: Synthesizing Images from Multimodal Attributed Graphs

Bowen Jin, Ziqi Pang, Bingjun Guo, Yu-Xiong Wang, Jiaxuan You, Jiawei Han

NeurIPS, 2024

[Website] [PDF] [Code]

ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing

ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing

Jun-Kun Chen, Yu-Xiong Wang

NeurIPS, 2024

[Website] [PDF]

SceneCraft: Layout-Guided 3D Scene Generation

SceneCraft: Layout-Guided 3D Scene Generation

Xiuyu Yang, Yunze Man, Jun-Kun Chen, Yu-Xiong Wang

NeurIPS, 2024

[Website] [PDF] [Code]

InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction

InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction

Sirui Xu, Ziyin Wang, Yu-Xiong Wang, Liang-Yan Gui

NeurIPS, 2024.

[Website] [PDF]

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Reasoning

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Reasoning

Yunze Man, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Liang-Yan Gui, Yu-Xiong Wang

NeurIPS, 2024.

[Website] [PDF] [Code]

More Publications