Presentation
Schedule
The team has a total of 8 papers (including 1 oral and 3 highlights) accepted to CVPR 2026.
| Paper |
|---|
|
Editing
MatAnyone2: Scaling Video Matting via a Learned Quality Evaluator
P. Yang, S. Zhou, K. Hao, Q. Tao in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR, Highlight) [arXiv] [Project Page] [Demo] |
|
Editing
Precise Object and Effect Removal with Adaptive Target-Aware Attention
J. Zhao, Z. Wang, P. Yang, S. Zhou in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR) [arXiv] [Project Page] [Demo] |
|
Editing
Linear Image Generation by Synthesizing Exposure Brackets
Y. Dai, Z. Zhang, S. Zhou, N. Zhao in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR) [Coming Soon] |
|
Generation
Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Y. Zhou, Z. Xiao, T. Wei, S. Yang, X. Pan in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR, Highlight) [arXiv] [Project Page] |
|
Generation
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
A. Liang, L. Kong, T. Yan, H. Liu, W. Yang, Z. Huang, W. Yin, J. Zuo, Y. Hu, D. Zhu, D. Lu, Y. Liu, G. Jiang, L. Li, X. Li, L. Zhuo, L. X. Ng, B. R. Cottereau, C. Gao, L. Pan, W. T. Ooi, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR, Oral) [arXiv] [Project Page] |
|
3D Spatial
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
H. Peng, H. Li, Y. Dai, Y. Lan, Y. Luo, T. Qi, Z. Zhang, Y. Zhan, J. Zhang, W. Xu, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR, Highlight) [arXiv] [Project Page] |
|
3D Generation
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image
Z. Cao, F. Hong, Z. Chen, L. Pan, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR) [arXiv] [Project Page] |
|
Spatial Multimodal
Scaling Spatial Intelligence with Multimodal Foundation Models
Z. Cai, R. Wang, C. Gu, F. Pu, J. Xu, Y. Wang, W. Yin, Z. Yang, C. Wei, T. Zhou, Q. Sun, H. E. Pang, J. Li, O. Qian, Z. Lin, X. Shi, D. Kewang, X. Han, Z. Chen, X. Fan, H. Deng, L. Lu, L. Pan, B. Li, Z. Liu, Q. Wang, D. Lin, L. Yang in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2026 (CVPR) [arXiv] [Project Page] |