2024

Journal

  1. Efficient Diffusion Model for Image Restoration by Residual Shifting
    Z. Yue, J. Wang, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page]
  2. Playing for 3D Human Recovery
    Z. Cai, M. Zhang, J. Ren, C. Wei, D. Ren, J. Li, Z. Lin, H. Zhao, S. Yi, L. Yang, C. C. Loy, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page]
  3. Transformer-Based Visual Segmentation: A Survey
    X. Li, H. Ding, W. Zhang, H. Yuan, J. Pang, G. Cheng, K. Chen, Z. Liu, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page]
  4. DifFace: Blind Face Restoration with Diffused Error Contraction
    Z. Yue, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page] [Demo]
  5. PERF: Panoramic Neural Radiance Field from a Single Panorama
    G. Wang, P. Wang, Z. Chen, W. Wang, C. C. Loy, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page]
  6. Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond
    Y. Dai, C. Li, S. Zhou, R. Feng, Y. Luo, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page]
  7. Talk-to-Edit: Fine-Grained 2D and 3D Facial Editing via Dialog
    Y. Jiang, Z. Huang, T. Wu, X. Pan, C. C. Loy, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [DOI] [Project Page]
  8. Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Network
    F. Hong, L. Kong, H. Zhou, X. Zhu, H. Li, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [DOI] [arXiv] [Project Page]
  9. MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
    M. Zhang, Z. Cai, L. Pan, F. Hong, X. Guo, L. Yang, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (TPAMI)
    [arXiv] [Project Page]
  10. MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
    J. Xie, W. Li, X. Li, Z. Liu, Y. S. Ong, C. C. Loy
    International Journal of Computer Vision, 2024 (IJCV)
    [arXiv] [Project Page]
  11. Contextual Object Detection with Multimodal Large Language Models
    Y. Zang, W. Li, J. Han, K. Zhou, C. C. Loy
    International Journal of Computer Vision, 2024 (IJCV)
    [arXiv] [Project Page]
  12. Exploiting Diffusion Prior for Real-World Image Super-Resolution
    J. Wang, Z. Yue, S. Zhou, K. C. K. Chan, C. C. Loy
    International Journal of Computer Vision, 2024 (IJCV)
    [PDF] [DOI] [arXiv] [Project Page]
  13. Position-Guided Point Cloud Panoptic Segmentation Transformer
    Z. Xiao, W. Zhang, T. Wang, C. C. Loy, D. Lin, J. Pang
    International Journal of Computer Vision, 2024 (IJCV)
    [DOI] [arXiv] [Project Page]
  14. ReliTalk: Relightable Talking Portrait Generation from a Single Video
    H. Qiu, Z. Chen, Y. Jiang, H. Zhou, X. Fan, L. Yang, W. Wu, Z. Liu
    International Journal of Computer Vision, 2024 (IJCV)
    [arXiv] [Project Page]
  15. Generalized Out-of-Distribution Detection: A Survey
    J. Yang, K. Zhou, Y. Li, Z. Liu
    International Journal of Computer Vision, 2024 (IJCV)
    [arXiv] [Project Page]
  16. Robust Partial-to-Partial Point Cloud Registration in a Full Range
    L. Pan, Z. Cai, Z. Liu
    IEEE Robotics and Automation Letters, 2024 (RAL)
    [arXiv] [Project Page]
  17. Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning
    Y. Liu, H. Zhao, K. C. K. Chan, X. Wang, C. C. Loy, Y. Qiao, C. Dong
    Computational Visual Media, 2024 (CVM)
    [DOI] [arXiv] [Project Page]

Conference

  1. Video Diffusion Models are Training-free Motion Interpreter and Controller
    Z. Xiao, Y. Zhou, S. Yang, X. Pan
    in Proceedings of Neural Information Processing Systems, 2024 (NeurIPS)
    [arXiv] [Project Page]
  2. Generalizable Implicit Motion Modeling for Video Frame Interpolation
    Z. Guo, W. Li, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2024 (NeurIPS)
    [arXiv] [Project Page]
  3. Learning 3D Garment Animation from Trajectories of A Piece of Cloth
    Y. Shao, C. C. Loy, B. Dai
    in Proceedings of Neural Information Processing Systems, 2024 (NeurIPS)
    [Coming Soon]
  4. L4GM: Large 4D Gaussian Reconstruction Model
    J. Ren, K. Xie, A. Mirzaei, H. Liang, X. Zeng, K. Kreis, Z. Liu, A. Torralba, S. Fidler, S. W. Kim, H. Ling
    in Proceedings of Neural Information Processing Systems, 2024 (NeurIPS)
    [arXiv] [Project Page]
  5. OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
    T. Zhang, X. Li, H. Fei, H. Yuan, S. Wu, S. Ji, C. C. Loy, S. Yan
    in Proceedings of Neural Information Processing Systems, 2024 (NeurIPS)
    [arXiv] [Project Page]
  6. I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
    O. Wenqi, Y. Dong, L. Yang, J. Si, X. Pan
    SIGGRAPH ASIA, 2024 (SIGGRAPH ASIA)
    [arXiv] [Project Page]
  7. ReVersion: Diffusion-Based Relation Inversion from Images
    Z. Huang, T. Wu, Y. Jiang, K. C. K. Chan, Z. Liu
    SIGGRAPH ASIA, 2024 (SIGGRAPH ASIA)
    [arXiv] [Project Page]
  8. LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
    Y. Lan, F. Hong, S. Yang, S. Zhou, X. Meng, B. Dai, X. Pan, C. C. Loy
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  9. Kalman-Inspired Feature Propagation for Video Face Super-Resolution
    R. Feng, C. Li, C. C. Loy
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  10. GroupDiff: Diffusion-based Group Portrait Editing
    Y. Jiang, N. Zhao, Q. Liu, K. K. Singh, S. Yang, C. C. Loy, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  11. Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
    H. Yuan, X. Li, C. Zhou, Y. Li, K. Chen, C. C. Loy
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  12. Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing
    Y. Lan, F. Tan, D. Qiu, Q. Xu, K. Genova, Z. Huang, R. Pandey, S. Fanello, T. Funkhouser, C. C. Loy, Y. Zhang
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  13. Octopus: Embodied Vision-Language Programmer from Environmental Feedback
    J. Yang, Y. Dong, S. Liu, B. Li, Z. Wang, C. Jiang, H. Tan, J. Kang, Y. Zhang, K. Zhou, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  14. FunQA: Towards Surprising Video Comprehension
    B. Xie, S. Zhang, Z. Zhou, B. Li, Y. Zhang, J. Hessel, J. Yang, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  15. FreeInit: Bridging Initialization Gap in Video Diffusion Models
    T. Wu, C. Si, Y. Jiang, Z. Huang, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page] [YouTube]
  16. LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
    J. Tang, Z. Chen, X. Chen, T. Wang, G. Zeng, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV, Oral)
    [arXiv] [Project Page]
  17. MMBench: Is Your Multi-modal Model an All-around Player?
    Y. Liu, H. Duan, Y. Zhang, B. Li, S. Zhang, W. Zhao, Y. Yuan, J. Wang, C. He, Z. Liu, K. Chen, D. Lin
    European Conference on Computer Vision, 2024 (ECCV, Oral)
    [arXiv] [Project Page]
  18. Large Motion Model for Unified Multi-Modal Motion Generation
    M. Zhang, D. Jin, C. Gu, F. Hong, Z. Cai, J. Huang, C. Zhang, X. Guo, L. Yang, Y. He, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  19. Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
    T. Liu, G. Wang, S. Hu, L. Shen, X. Ye, Y. Zang, Z. Cao, W. Li, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  20. ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
    Y. Chen, T. Wang, T. Wu, X. Pan, K. Jia, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  21. WHAC: World-grounded Humans and Cameras
    W. Yin, Z. Cai, C. Wei, F. Wang, R. Wang, H. Mei, W. Xiao, Z. Yang, Q. Sun, A. Yamashita, Z. Liu, L. Yang
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  22. StructLDM: Structured Latent Diffusion for 3D Human Generation
    T. Hu, F. Hong, Z. Liu
    European Conference on Computer Vision, 2024 (ECCV)
    [arXiv] [Project Page]
  23. Digital Life Project: Autonomous 3D Characters with Social Intelligence
    Z. Cai, J. Jiang, Z. Qing, X. Guo, M. Zhang, Z. Lin, H. Mei, C. Wei, R. Wang, W. Yin, X. Fan, H. Du, L. Pan, P. Gao, Z. Yang, Y. Gao, J. Li, T. Ren, Y. Wei, X. Wang, C. C. Loy, L. Yang, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  24. Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
    S. Zhou, P. Yang, J. Wang, Y. Luo, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR, Highlight)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  25. Learning Inclusion Matching for Animation Paint Bucket Colorization
    Y. Dai, S. Zhou, Q. Li, C. Li, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  26. OMG-Seg: Is One Model Good Enough For All Segmentation?
    X. Li, H. Yuan, W. Li, H. Ding, S. Wu, W. Zhang, Y. Li, K. Chen, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  27. Towards Language-Driven Video Inpainting via Multimodal Large Language Models
    J. Wu, X. Li, C. Si, S. Zhou, J. Yang, J. Zhang, Y. Li, K. Chen, Y. Tong, Z. Liu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  28. VideoBooth: Diffusion-based Video Generation with Image Prompts
    Y. Jiang, T. Wu, S. Yang, C. Si, D. Lin, Y. Qiao, C. C. Loy, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  29. FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
    S. Yang, Y. Zhou, Z. Liu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  30. When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation
    X. Li, X. Hou, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  31. MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior
    H. Chen, C. C. Loy, X. Pan
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  32. CityDreamer: Compositional Generative Model of Unbounded 3D Cities
    H. Xie, Z. Chen, F. Hong, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  33. FreeU: Free Lunch in Diffusion U-Net
    C. Si, Z. Huang, Y. Jiang, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  34. VBench: Comprehensive Benchmark Suite for Video Generative Models
    Z. Huang, Y. He, J. Yu, F. Zhang, C. Si, Y. Jiang, Y. Zhang, T. Wu, Q. Jin, N. Chanpaisit, Y. Wang, X. Chen, L. Wang, D. Lin, Y. Qiao, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR, Highlight)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  35. SurMo: Surface-based 4D Motion Modeling for Dynamic Human Rendering
    T. Hu, F. Hong, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  36. GauHuman: Articulated Gaussian Splatting for Real-Time 3D Human Rendering
    S. Hu, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  37. DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing
    K. Zhang, Y. Zhou, X. Xu, X. Pan, Bo Dai
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2024 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  38. DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
    J. Tang, J. Ren, H. Zhou, Z. Liu, G. Zeng
    International Conference on Learning Representations, 2024 (ICLR, Oral)
    [PDF] [arXiv] [Project Page]
  39. CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
    S. Wu, W. Zhang, L. Xu, S. Jin, X. Li, W. Liu, C. C. Loy
    International Conference on Learning Representations, 2024 (ICLR, Spotlight)
    [PDF] [arXiv] [Project Page]
  40. SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
    X. Chen, Y. Wang, L. Zhang, S. Zhuang, X. Ma, J. Yu, Y. Wang, D. Lin, Y. Qiao, Z. Liu
    International Conference on Learning Representations, 2024 (ICLR)
    [PDF] [arXiv] [Project Page]
  41. FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling
    H. Qiu, M. Xia, Y. Zhang, Y. He, X. Wang, Y. Shan, Z. Liu
    International Conference on Learning Representations, 2024 (ICLR)
    [PDF] [arXiv] [Project Page]
  42. Large-Vocabulary 3D Diffusion Model with Transformer
    Z. Cao, F. Hong, T. Wu, L. Pan, Z. Liu
    International Conference on Learning Representations, 2024 (ICLR)
    [PDF] [arXiv] [Project Page]
  43. Adaptive Window Pruning for Efficient Local Motion Deblurring
    H. Li, J. Zhao, S. Zhou, H. Feng, C. Li, C. C. Loy
    International Conference on Learning Representations, 2024 (ICLR)
    [PDF] [arXiv] [Project Page]
  44. Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment
    L. Siyao, T. Gu, Z. Yang, Z. Lin, Z. Liu, H. Ding, L. Yang, C. C. Loy
    International Conference on Learning Representations, 2024 (ICLR)
    [PDF] [arXiv] [Project Page]
  45. CLIM: Contrastive Language-Image Mosaic for Region Representation
    S. Wu, W. Zhang, L. Xu, S. Jin, W. Liu, C. C. Loy
    in Proceedings of AAAI Conference on Artificial Intelligence, 2024 (AAAI)
    [arXiv] [Project Page]
  46. PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation
    J. Yu, H. Zhu, L. Jiang, C. C. Loy, W. Cai, W. Wu
    in Proceedings of AAAI Conference on Artificial Intelligence, 2024 (AAAI)
    [arXiv]
  47. Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations
    Q. Li, J. Wang, C. C. Loy, B. Dai
    in Proceedings of IEEE/CVF Winter Conference on Applications of Computer Vision, 2024 (WACV)
    [arXiv]

Technical Report

  1. EgoLM: Multi-Modal Language Model of Egocentric Motions
    F. Hong, V. Guzov, H. J. Kim, Y. Ye, R. Newcombe, Z. Liu, L. Ma
    Technical report, arXiv:2409.18127, 2024
    [arXiv] [Project Page]
  2. Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
    H. E. Pang, S. Liu, Z. Cai, L. Yang, T. Zhang, Z. Liu
    Technical report, arXiv:2409.17280, 2024
    [arXiv]
  3. 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
    Z. Chen, J. Tang, Y. Dong, Z. Cao, F. Hong, Y. Lan, T. Wang, H. Xie, T. Wu, S. Saito, L. Pan, D. Lin, Z. Liu
    Technical report, arXiv:2409.12957, 2024
    [arXiv] [Project Page]
  4. LLaVA-OneVision: Easy Visual Task Transfer
    B. Li, Y. Zhang, D. Guo, R. Zhang, F. Li, H. Zhang, K. Zhang, Y. Li, Z. Liu, C. Li
    Technical report, arXiv:2408.03326, 2024
    [arXiv] [Project Page] [Demo]
  5. LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models
    K. Zhang, B. Li, P. Zhang, F. Pu, J. A. Cahyono, K. Hu, S. Liu, Y. Zhang, J. Yang, C. Li, Z. Liu
    Technical report, arXiv:2407.12772, 2024
    [arXiv] [Project Page]
  6. CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation
    X. Guo, M. Zhang, H. Xie, C. Gu, Z. Liu
    Technical report, arXiv:2407.06188, 2024
    [arXiv] [Project Page]
  7. Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
    H. Yuan, X. Li, L. Qi, T. Zhang, M.-H. Yang, S. Yan, C. C. Loy
    Technical report, arXiv:2406.19369, 2024
    [arXiv]
  8. Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration
    K. Liao, Z. Yue, Z. Wang, C. C. Loy
    Technical report, arXiv:2406.18516, 2024
    [arXiv] [Project Page]
  9. FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models
    H. Qiu, Z. Chen, Z. Wang, Y. He, M. Xia, Z. Liu
    Technical report, arXiv:2406.16863, 2024
    [arXiv] [Project Page]
  10. Long Context Transfer from Language to Vision
    P. Zhang, K. Zhang, B. Li, G. Zeng, J. Yang, Y. Zhang, Z. Wang, H. Tan, C. Li, Z. Liu
    Technical report, arXiv:2406.16852, 2024
    [arXiv] [Project Page]
  11. AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation
    X. Hou, X. Li, C. C. Loy
    Technical report, arXiv:2406.12805, 2024
    [arXiv] [Project Page]
  12. GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation
    H. Xie, Z. Chen, F. Hong, Z. Liu
    Technical report, arXiv:2406.06526, 2024
    [arXiv] [Project Page]
  13. F-LMM: Grounding Frozen Large Multimodal Models
    S. Wu, S. Jin, W. Zhang, L. Xu, W. Liu, W. Li, C. C. Loy
    Technical report, arXiv:2406.05821, 2024
    [arXiv] [Project Page]
  14. DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation
    Z. Cao, F. Hong, T. Wu, L. Pan, Z. Liu
    Technical report, arXiv:2405.08055, 2024
    [arXiv] [Project Page]
  15. Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving
    L. Kong, X. Xu, J. Ren, W. Zhang, L. Pan, K. Chen, W. T. Ooi, Z. Liu
    Technical report, arXiv:2405.05258, 2024
    [arXiv] [Project Page]
  16. WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
    Y. Zhang, K. Zhang, B. Li, F. Pu, C. A. Setiadharma, J. Yang, Z. Liu
    Technical report, arXiv:2405.03272, 2024
    [arXiv] [Project Page]
  17. Point-In-Context: Understanding Point Cloud via In-Context Learning
    M. Liu, Z. Fang, X. Li, J. M. Buhmann, X. Li, C. C. Loy
    Technical report, arXiv:2404.12352, 2024
    [arXiv] [Project Page]
  18. MOWA: Multiple-in-One Image Warping Model
    K. Liao, Z. Yue, Z. Wu, C. C. Loy
    Technical report, arXiv:2404.10716, 2024
    [arXiv] [Project Page]
  19. MMInA: Benchmarking Multihop Multimodal Internet Agents
    Z. Zhang, S. Tian, L. Chen, Z. Liu
    Technical report, arXiv:2404.09992, 2024
    [arXiv] [Project Page]
  20. FashionEngine: Interactive Generation and Editing of 3D Clothed Humans
    Y. Zhang, K. Zhang, B. Li, F. Pu, C. A. Setiadharma, J. Yang, Z. Liu
    Technical report, arXiv:2404.01655, 2024
    [arXiv] [Project Page]
  21. AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
    Q. Sun, Y. Wang, A. Zeng, W. Yin, C. Wei, W. Wang, H. Mei, C. S. Leung, Z. Liu, L. Yang, Z. Cai
    Technical report, arXiv:2403.17934, 2024
    [arXiv] [Project Page]
  22. InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting
    J. Tang, R. Lu, X. Chen, X. Wen, G. Zeng, Z. Liu
    Technical report, arXiv:2403.11878, 2024
    [arXiv] [Project Page]
  23. 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
    F. Hong, J. Tang, Z. Cao, M. Shi, T. Wu, Z. Chen, T. Wang, L. Pan, D. Lin, Z. Liu
    Technical report, arXiv:2403.02234, 2024
    [arXiv] [Project Page]
  24. Control Color: Multimodal Diffusion-based Interactive Image Colorization
    Z. Liang, Z. Li, S. Zhou, C. Li, C. C. Loy
    Technical report, arXiv:2402.10855, 2024
    [arXiv] [Project Page]

2023

Journal

  1. SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
    Z. Chen, G. Wang, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 (TPAMI)
    [arXiv] [Project Page]
  2. Bailando++: 3D Dance GPT with Choreographic Memory
    L. Siyao. W. Yu, T. Gu, C. Lin, Q. Wang, C. Qian, C. C. Loy, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 (TPAMI)
    [DOI] [Project Page]
  3. GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation
    S. Yang, L. Jiang, Z. Liu, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 (TPAMI)
    [DOI] [arXiv] [Project Page]
  4. Towards Real-World Visual Tracking with Temporal Contexts
    Z. Cao, Z. Huang, L. Pan, S. Zhang, Z. Liu, C. Fu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 (TPAMI)
    [DOI] [arXiv] [Project Page]
  5. Correspondence Distillation from NeRF-based GAN
    Y. Lan, C. C. Loy, B. Dai
    International Journal of Computer Vision, 2023 (IJCV)
    [PDF] [DOI] [arXiv] [Project Page]
  6. Semi-Supervised Domain Generalization with Stochastic StyleMatch
    K. Zhou, C. C. Loy, Z. Liu
    International Journal of Computer Vision, 2023 (IJCV)
    [PDF] [DOI] [arXiv] [Project Page]
  7. Variational Relational Point Completion Network for Robust 3D Classification
    L. Pan, X. Chen, Z. Cai, J. Zhang, H. Zhao, S. Yi, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 (TPAMI)
    [DOI] [arXiv] [Project Page]
  8. Reference-based Image and Video Super-Resolution via C2-Matching
    Y. Jiang, K. C. K. Chan, X. Wang, C. C. Loy, Z. Liu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 (TPAMI)
    [DOI] [arXiv] [Project Page]
  9. Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup
    G. Xu, Z. Liu, C. C. Loy
    Pattern Recognition, 2023 (PR)
    [DOI] [arXiv] [Project Page]
  10. Network Pruning via Resource Reallocation
    Y. Hou, Z. Ma, C. Liu, Z. Wang, C. C. Loy
    Pattern Recognition, 2023 (PR)
    [DOI] [arXiv] [Project Page]

Conference

  1. PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance
    P. Yang, S. Zhou, Q. Tao, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  2. ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting
    Z. Yue, J. Wang, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS, Spotlight)
    [PDF] [arXiv] [Project Page]
  3. Rubik's Cube: High-Order Channel Interactions with a Hierarchical Receptive Field
    N. Zheng, M. Zhou, C. Zhou, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [Project Page]
  4. Explore In-Context Learning for 3D Point Cloud Understanding
    Z. Fang, X. Li, X. Li, J. M. Buhmann, C. C. Loy, M. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS, Spotlight)
    [PDF] [arXiv] [Project Page]
  5. 4D Panoptic Scene Graph Generation
    J. Yang, J. Cen, W. Peng, S. Liu, F. Hong, X. Li, K. Zhou, Q. Chen, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS, Spotlight)
    [PDF] [arXiv] [Project Page]
  6. Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
    Y. Liu, L. Kong, J. Cen, R. Chen, W. Zhang, L. Pan, K. Chen, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS, Spotlight)
    [PDF] [arXiv] [Project Page]
  7. PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation
    Z. Chen, F. Hong, H. Mei, G. Wang, L. Yang, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  8. InsActor: Instruction-driven Physics-based Characters
    J. Ren, M. Zhang, C. Yu, X. Ma, L. Pan, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  9. FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing
    M. Zhang, H. Li, Z. Cai, J. Ren, L. Yang, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  10. Language Models are Visual Reasoning Coordinators
    L. Chen, B. Li, S. Shen, J. Yang, C. Li, K. Keutzer, T. Darrell, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  11. What Makes Good Examples for Visual In-Context Learning?
    Y. Zhang, K. Zhou, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  12. Towards Robust and Expressive Whole-body Human Pose and Shape Estimation
    H. E. Pang, Z. Cai, L. Yang, T. Zhang, Q. Tao, Z. Wu, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  13. RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars
    D. Pan, L. Zhuo, J. Piao, H. Luo, W. Cheng, Y. Wang, S. Fan, S. Liu, L. Yang, B. Dai, Z. Liu, C. C. Loy, C. Qian, W. Wu, D. Lin, K.-Y. Lin
    in Proceedings of Neural Information Processing Systems (Datasets and Benchmarks Track), 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  14. SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation
    Z. Cai, W. Yin, A. Zeng, C. Wei, Q. Sun, Y. Wang, H. E. Pang, H. Mei, M. Zhang, L. Zhang, C. C. Loy, L. Yang, Z. Liu
    in Proceedings of Neural Information Processing Systems (Datasets and Benchmarks Track), 2023 (NeurIPS)
    [PDF] [arXiv] [Project Page] [YouTube]
  15. Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
    S. Yang, Y. Zhou, Z. Liu, C. C. Loy
    SIGGRAPH ASIA, 2023 (SIGGRAPH ASIA)
    [arXiv] [Project Page]
  16. Video Infilling with Rich Motion Prior
    X. Hou, L. Jiang, R. Shao, C. C. Loy
    in Proceedings of British Machine Vision Conference, 2023 (BMVC)
    [PDF] [Supplementary Material]
  17. Text2Performer: Text-Driven Human Video Generation
    Y. Jiang, S. Yang, T. L. Koh, W. Wu, C. C. Loy, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  18. UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation
    J. Fu, S. Li, Y. Jiang, K.-Y. Lin, W. Wu, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  19. StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation
    Y. Wang, L. Jiang, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  20. StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
    S. Yang, L. Jiang, Z. Liu, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [Demo]
  21. Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
    Y. Jiang, L. Jiang, S. Yang, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [Demo]
  22. ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
    M. Zhang, X. Guo, L. Pan, Z. Cai, F. Hong, H. Li, L. Yang, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  23. Towards Multi-Layered 3D Garments Animation
    Y. Shao, C. C. Loy, B. Dai
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  24. Robo3D: Towards Robust and Reliable 3D Perception Against Corruptions
    L. Kong, Y. Liu, X. Li, R. Chen, W. Zhang, J. Ren, L. Pan, K. Chen, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  25. SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling
    Z. Yang, Z. Cai, H. Mei, S. Liu, Z. Chen, W. Xiao, Y. Wei, Z. Qing, C. Wei, B. Dai, W. Wu, C. Qian, D. Lin, Z. Liu, L. Yang
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  26. DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields
    J. Zhang, Y. Lan, S. Yang, F. Hong, Q. Wang, C. K. Yeo, Z. Liu, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  27. SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis
    G. Wang, Z. Chen, C. C. Loy, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  28. SHERF: Generalizable Human NeRF from a Single Image
    S. Hu, F. Hong, L. Pan, H. Mei, L. Yang, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  29. DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering
    W. Cheng, R. Chen, K. Chen, Z. Cai, B. Dai, S. Fan, Y. Gao, Z. Lin, D. Lin, Z. Liu, K.-Y. Lin, C. C. Loy, C. Qian, D. Ren, W. Wu, J. Wang, Z. Yu, W. Yin, L. Yang
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  30. Rethinking Range View Representation for LiDAR Segmentation
    L. Kong, Y. Liu, R. Chen, Y. Ma, X. Zhu, Y. Li, Y. Hou, Y. Qiao, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  31. Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
    X. Li, H. Yuan, W. Zhang, G. Cheng, J. Pang, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  32. Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
    J. Wu, X. Li, H. Ding, X. Li, G. Cheng, Y. Tong, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  33. MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
    H. Ding, C. Liu, S. He, X. Jiang, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Project Page]
  34. Iterative Prompt Learning for Unsupervised Backlit Image Enhancement
    Z. Liang, C. Li, S. Zhou, R. Feng, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV, Oral)
    [PDF] [arXiv] [Project Page]
  35. ProPainter: Improving Propagation and Transformer for Video Inpainting
    S. Zhou, C. Li, K. C. K. Chan, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  36. Deep Geometrized Cartoon Line Inbetweening
    L. Siyao, T. Gu, W. Xiao, H. Ding, Z. Liu, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2023 (ICCV)
    [PDF] [Supplementary Material] [Project Page]
  37. F2-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories
    P. Wang, Y. Liu, Z. Chen, L. Liu, Z. Liu, T. Komura, C. Theobalt, W. Wang
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR, Highlight)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  38. LaserMix for Semi-Supervised LiDAR Semantic Segmentation
    L. Kong, J. Ren, L. Pan, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR, Highlight)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  39. OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
    T. Wu, J. Zhang, X. Fu, Y. Wang, J. Ren, L. Pan, W. Wu, L. Yang, J. Wang, C. Qian, D. Lin, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR, Award Candidate)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  40. Nighttime Smartphone Reflective Flare Removal using Optical Center Symmetry Prior
    Y. Dai, Y. Luo, S. Zhou, C. Li, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR, Highlight)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  41. Detecting and Grounding Multi-Modal Media Manipulation
    R. Shao, T. Wu, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  42. Collaborative Diffusion for Multi-Modal Face Generation and Editing
    Z. Huang, K.C.K. Chan, Y. Jiang, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  43. Correlational Image Modeling for Self-Supervised Visual Pre-Training
    W. Li, J. Xie, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  44. Aligning Bag of Regions for Open-Vocabulary Object Detection
    S. Wu, W. Zhang, S. Jin, W. Liu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  45. Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion
    Y. Lan, X. Meng, S. Yang, C. C. Loy, B. Dai
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  46. Learning Generative Structure Prior for Blind Text Image Super-resolution
    X. Li, W. Zuo, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  47. Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera
    R. Feng, C. Li, H. Chen, S. Li, J. Gu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  48. Panoptic Video Scene Graph Generation
    J. Yang, W. Peng, X. Li, Z. Guo, L. Chen, B. Li, Z. Ma, W. Zhang, K. Zhou, C. C. Loy, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF]
  49. Siamese DETR
    Z. Chen, G. Huang, W. Li, J. Teng, K. Wang, J. Shao, C. C. Loy, L. Sheng
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  50. CelebV-Text: A Large-Scale Facial Text-Video Dataset
    J. Yu, H. Zhu, L. Jiang, C. C. Loy, W. Cai, W. Wu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2023 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  51. Flexible Piecewise Curves Estimation for Photo Enhancement
    C. Li, C. Guo, Q. Ai, S. Zhou, C. C. Loy
    in Workshop Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, NTIRE, 2023 (CVPRW)
    [arXiv]
  52. BeautyREC: Robust, Efficient, and Content-preserving Makeup Transfer
    Q. Yan, C. Guo, J. Zhao, Y. Dai, C. C. Loy, C. Li
    in Workshop Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, NTIRE, 2023 (CVPRW)
    [arXiv] [Project Page]
  53. The Nuts and Bolts of Adopting Transformer in GANs
    R. Xu and X. Xu and K. Chen and B. Zhou and C. C. Loy
    in Workshop Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, AI4CC, 2023 (CVPRW)
    [arXiv] [Project Page]
  54. Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement
    C. Li, C. Guo, M. Zhou, Z. Liang, S. Zhou, R. Feng, C. C. Loy
    International Conference on Learning Representations, 2023 (ICLR, Oral)
    [arXiv] [Project Page]
  55. Sparse Mixture-of-Experts are Domain Generalizable Learners
    B. Li, Y. Shen, J. Yang, Y. Wang, J. Ren, T. Che, J. Zhang, Z. Liu
    International Conference on Learning Representations, 2023 (ICLR, Oral)
    [arXiv] [Project Page]
  56. EVA3D: Compositional 3D Human Generation from 2D Image Collections
    F. Hong, Z. Chen, Y. Lan, L. Pan, Z. Liu
    International Conference on Learning Representations, 2023 (ICLR, Spotlight)
    [arXiv] [Project Page] [YouTube]
  57. DiffMimic: Efficient Motion Mimicking with Differentiable Physics
    J. Ren, C. Yu, S. Chen, X. Ma, L. Pan, Z. Liu
    International Conference on Learning Representations, 2023 (ICLR)
    [PDF] [arXiv] [Project Page]
  58. Masked Frequency Modeling for Self-Supervised Visual Pre-Training
    J. Xie, W. Li, X. Zhan, Z. Liu, Y. S. Ong, C. C. Loy
    International Conference on Learning Representations, 2023 (ICLR)
    [arXiv] [Project Page]
  59. Improving Data Augmentation for Multi-Modality 3D Object Detection
    W. Zhang, Z. Wang, C. C. Loy
    International Conference on Learning Representations Workshop, 2023 (ICLRW)
    [arXiv] [Project Page]
  60. Exploring CLIP for Assessing the Look and Feel of Images
    J. Wang, K. C. K. Chan, C. C. Loy
    in Proceedings of AAAI Conference on Artificial Intelligence, 2023 (AAAI)
    [arXiv] [Project Page]

Technical Report

  1. DreamGaussian4D: Generative 4D Gaussian Splatting
    J. Ren, L. Pan, J. Tang, C. Zhang, A. Cao, G. Zeng, Z. Liu
    Technical report, arXiv:2312.17142, 2023
    [arXiv] [Project Page] [YouTube]
  2. EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
    C. Zhou, X. Li, C. C. Loy, B. Dai
    Technical report, arXiv:2312.06660, 2023
    [arXiv] [Project Page] [Demo]
  3. Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing
    Y. Lan, F. Tan, D. Qiu, Q. Xu, K. Genova, Z. Huang, S. Fanello, R. Pandey, T. Funkhouser, C. C. Loy, Y. Zhang
    Technical report, arXiv:2312.03763, 2023
    [arXiv] [Project Page]
  4. OtterHD: A High-Resolution Multi-modality Model
    B. Li, P. Zhang, J. Yang, Y. Zhang, F. Pu, Z. Liu
    Technical report, arXiv:2311.04219, 2023
    [arXiv] [Project Page]
  5. DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
    S. Xu, X. Li, S. Wu, W. Zhang, Y. Li, G. Cheng, Y. Tong, K. Chen, C. C. Loy
    Technical report, arXiv:2310.09458, 2023
    [arXiv] [Project Page]
  6. LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
    Y. Wang, X. Chen, X. Ma, S. Zhou, Z. Huang, Y. Wang, C. Yang, Y. He, J. Yu, P. Yang, Y. Guo, T. Wu, C. Si, Y. Jiang, C. Chen, C. C. Loy, B. Dai, D. Lin, Y. Qiao, Z. Liu
    Technical report, arXiv:2309.15103, 2023
    [arXiv] [Project Page]
  7. Robust Sequential DeepFake Detection
    R. Shao, T. Wu, Z. Liu
    Technical report, arXiv:2309.14991, 2023
    [arXiv] [Project Page]
  8. Detecting and Grounding Multi-Modal Media Manipulation and Beyond
    R. Shao, T. Wu, J. Wu, L. Nie, Ziwei Liu
    Technical report, arXiv:2309.14203, 2023
    [arXiv] [Project Page]
  9. Interpret Vision Transformers as ConvNets with Dynamic Convolutions
    C. Zhou, C. C. Loy, B. Dai
    Technical report, arXiv:2309.10713, 2023
    [arXiv]
  10. PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds
    Z. Cai, L. Pan, C. Wei, W. Yin, F. Hong, M. Zhang, C. C. Loy, L. Yang, Z. Liu
    Technical report, arXiv:2308.14492, 2023
    [arXiv] [Project Page]
  11. HumanLiff: Layer-wise 3D Human Generation with Diffusion Model
    S. Hu, F. Hong, T. Hu, L. Pan, H. Mei, W. Xiao, L. Yang, Z. Liu
    Technical report, arXiv:2308.09712, 2023
    [arXiv] [Project Page]
  12. Hierarchy Flow For High-Fidelity Image-to-Image Translation
    W. Fan, J. Chen, Z. Liu
    Technical report, arXiv:2308.06909, 2023
    [arXiv] [Project Page]
  13. Benchmarking and Analyzing Generative Data for Visual Recognition
    B. Li, H. Liu, L. Chen, Y. J. Lee, C. Li, Z. Liu
    Technical report, arXiv:2307.13697, 2023
    [arXiv]
  14. Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
    J. Wang, Z. Wen, X. Li, Z. Guo, J. Yang, Z. Liu
    Technical report, arXiv:2307.08699, 2023
    [arXiv] [Project Page]
  15. InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
    Y. Wang, Y. He, Y. Li, K. Li, J. Yu, X. Ma, X. Chen, Y. Wang, P. Luo, Z. Liu, Y. Wang, L. Wang, Y. Qiao
    Technical report, arXiv:2307.06942, 2023
    [arXiv] [Project Page]
  16. OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection
    J. Zhang, J. Yang, P. Wang, H. Wang, Y. Lin, H. Zhang, Y. Sun, X. Du, K. Zhou, W. Zhang, Y. Li, Z. Liu, Y. Chen, H. Li
    Technical report, arXiv:2306.09301, 2023
    [arXiv] [Project Page]
  17. MIMIC-IT: Multi-Modal In-Context Instruction Tuning
    B. Li, Y. Zhang, L. Chen, J. Wang, F. Pu, J. Yang, C. Li, Z. Liu
    Technical report, arXiv:2306.05425, 2023
    [arXiv] [Project Page]
  18. DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
    R. Shao, T. Wu, L. Nie, Z. Liu
    Technical report, arXiv:2306.00863, 2023
    [arXiv] [Project Page]
  19. Learning without Forgetting for Vision-Language Models
    D.-W. Zhou, Y. Zhang, J. Ning, H.-J. Ye, D.-C. Zhan, Z. Liu
    Technical report, arXiv:2305.19270, 2023
    [arXiv]
  20. SAD: Segment Any RGBD
    J. Cen, Y. Wu, K. Wang, X. Li, J. Yang, Y. Pei, L. Kong, Z. Liu, Q. Chen
    Technical report, arXiv:2305.14207, 2023
    [arXiv] [Project Page]
  21. ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis
    S. Hu, K. Zhou, K. Li, L. Yu, L. Hong, T. Hu, Z. Li, G. H. Lee, Z. Liu
    Technical report, arXiv:2305.11031, 2023
    [arXiv] [Project Page]
  22. Otter: A Multi-Modal Model with In-Context Instruction Tuning
    B. Li, Y. Zhang, L. Chen, J. Wang, J. Yang, Z. Liu
    Technical report, arXiv:2305.03726, 2023
    [arXiv] [Project Page]
  23. RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions
    S. Xie, L. Kong, W. Zhang, J. Ren, L. Pan, K. Chen, Z. Liu
    Technical report, arXiv:2304.06719, 2023
    [arXiv] [Project Page]

2022

Book Chapter

  1. Talking Faces: Audio-to-Video Face Generation
    Y. Wang, L. Song, W. Wu, C. Qian, R. He, C. C. Loy
    In C. Rathgeb, R. Tolosana, R. Vera-Rodriguez, C. Busch (Eds.), Handbook of Digital Face Manipulation and Detection, Springer, 2022
    [Book Link]
  2. DeepFakes Detection: the DeeperForensics Dataset and Challenge
    L. Jiang, W. Wu, C. Qian, C. C. Loy
    In C. Rathgeb, R. Tolosana, R. Vera-Rodriguez, C. Busch (Eds.), Handbook of Digital Face Manipulation and Detection, Springer, 2022
    [Book Link]

Journal

  1. Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
    Y. Zang, K. Zhou, C. Huang, C. C. Loy
    International Journal of Computer Vision, 2022 (IJCV)
    [PDF] [DOI] [arXiv] [Project Page]
  2. Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
    Z. Chen, G. Wang, Z. Liu
    ACM Transactions on Graphics, 2022 (SIGGRAPH ASIA - TOG)
    [DOI] [arXiv] [Project Page] [YouTube]
  3. VToonify: Controllable High-Resolution Portrait Video Style Transfer
    S. Yang, L. Jiang, Z. Liu, C. C. Loy
    ACM Transactions on Graphics, 2022 (SIGGRAPH ASIA - TOG)
    [DOI] [arXiv] [Project Page] [YouTube]
  4. Audio-driven Dubbing for User Generated Contents via Style-aware Semi-parametric Synthesis
    L. Song, W. Wu, C. Fu, C. C. Loy, R. He
    IEEE Transactions on Circuits and Systems for Video Technology, 2022 (TCVST)
    [DOI] [arXiv]
  5. Delving into Inter-Image Invariance for Unsupervised Visual Representations
    J. Xie, X. Zhan, Z. Liu, Y. S. Ong, C. C. Loy
    International Journal of Computer Vision, 2022 (IJCV)
    [PDF] [DOI] [arXiv] [Project Page]
  6. Open Long-Tailed Recognition in a Dynamic World
    Z. Liu, Z. Miao, X. Zhan, J. Wang, B. Gong, S. X. Yu
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 (TPAMI)
    [DOI] [arXiv]
  7. Self-Supervised Representation Learning: Introduction, Advances and Challenges
    L. Ericsson, H. Gouk, C. C. Loy, T. M. Hospedales
    IEEE Signal Processing Magazine, 2022 (SPM)
    [DOI] [arXiv]
  8. Domain Generalization: A Survey
    K. Zhou, Z. Liu, Y. Qiao, T. Xiang, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 (TPAMI)
    [DOI] [arXiv]
  9. Learning to Prompt for Vision-Language Models
    K. Zhou, J. Yang, C. C. Loy, Z. Liu
    International Journal of Computer Vision, 2022 (IJCV)
    [DOI] [arXiv] [Project Page]
  10. GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond
    K. C. K. Chan, X. Wang, X. Xu, J. Gu, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 (TPAMI)
    [DOI] [arXiv] [Project Page]
  11. AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
    F. Hong, M. Zhang, L. Pan, Z. Cai, L. Yang, Z. Liu
    ACM Transactions on Graphics, 2022 (SIGGRAPH - TOG)
    [DOI] [arXiv] [Project Page]
  12. Text2Human: Text-Driven Controllable Human Image Generation
    Y. Jiang, S. Yang, H. Qiu, W. Wu, C. C. Loy, Z. Liu
    ACM Transactions on Graphics, 2022 (SIGGRAPH - TOG)
    [DOI] [arXiv] [Project Page] [YouTube]
  13. Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory
    Y. Rong, Z. Liu, C. C. Loy
    IEEE Transactions on Image Processing, 2022 (TIP)
    [DOI] [arXiv] [Project Page]
  14. Everybody’s Talkin’: Let Me Talk as You Want
    L. Song, W. Wu, C. Qian, R. He, C. C. Loy
    IEEE Transactions on Information Forensics and Security, 2022 (TIFS)
    [DOI] [arXiv] [Project Page]

Conference

  1. Towards Robust Blind Face Restoration with Codebook Lookup Transformer
    S. Zhou, K. C. K. Chan, C. Li, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2022 (NeurIPS)
    [arXiv] [Project Page] [Demo]
  2. Deep Fourier Up-Sampling
    M. Zhou, H. Yu, J. Huang, F. Zhao, J. Gu, C. C. Loy, D. Meng, C. Li
    in Proceedings of Neural Information Processing Systems, 2022 (NeurIPS)
    [arXiv] [Project Page]
  3. AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies
    L. Siyao, Y. Li, B. Li, C. Dong, Z. Liu, C. C. Loy
    in Proceedings of Neural Information Processing Systems (Datasets and Benchmarks Track), 2022 (NeurIPS)
    [arXiv] [Project Page]
  4. Flare7K: A Phenomenological Nighttime Flare Removal Dataset
    Y. Dai, C. Li, S. Zhou, R. Feng, C. C. Loy
    in Proceedings of Neural Information Processing Systems (Datasets and Benchmarks Track), 2022 (NeurIPS)
    [arXiv] [Project Page]
  5. OpenOOD: Benchmarking Generalized Out-of-Distribution Detection
    J. Yang, P. Wang, D. Zou, Z. Zhou, K. Ding, W. Peng, H. Wang, G. Chen, B. Li, Y. Sun, X. Du, K. Zhou, W. Zhang, D. Hendrycks, Y. Li, Z. Liu
    in Proceedings of Neural Information Processing Systems (Datasets and Benchmarks Track), 2022 (NeurIPS)
    [arXiv] [Project Page]
  6. Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms
    H. E. Pang, Z. Cai, L. Yang, T. Zhang, Z. Liu
    in Proceedings of Neural Information Processing Systems (Datasets and Benchmarks Track), 2022 (NeurIPS)
    [arXiv] [Project Page]
  7. Open-Vocabulary DETR with Conditional Matching
    Y. Zang, W. Li, K. Zhou, C. Huang, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  8. Transformer with Implicit Edges for Particle-based Physics Simulation
    Y. Shao, C. C. Loy, B. Dai
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Project Page]
  9. Extract Free Dense Labels from CLIP
    C. Zhou, C. C. Loy, B. Dai
    European Conference on Computer Vision, 2022 (ECCV, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  10. Mind the Gap in Distilling StyleGANs
    G. Xu, Y. Hou, Z. Liu, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  11. LEDNet: Joint Low-light Enhancement and Deblurring in the Dark
    S. Zhou, C. Li, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  12. BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis
    D. Moltisanti, J. Wu, B. Dai, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  13. StyleLight: HDR Panorama Generation for Lighting Estimation and Editing
    G. Wang, Y. Yang, C. C. Loy, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  14. HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling
    Z. Cai, D. Ren, A. Zeng, Z. Lin, T. Yu, W. Wang, X. Fan, Y. Gao, Y. Yu, L. Pan, F. Hong, M. Zhang, C. C. Loy, L. Yang, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  15. Monocular 3D Object Reconstruction with GAN Inversion
    J. Zhang, D. Ren, Z. Cai, C. K. Yeo, B. Dai, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  16. CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
    H. Zhu, W. Wu, W. Zhu, L. Jiang, S. Tang, L. Zhang, Z. Liu, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  17. StyleGAN-Human: A Data-Centric Odyssey of Human Generation
    J. Fu, S. Li, Y. Jiang, K.-Y. Lin, C. Qian, C. C. Loy, W. Wu, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  18. Dense Siamese Network for Dense Unsupervised Learning
    W. Zhang, J. Pang, K. Chen, C. C. Loy
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  19. Relighting4D: Neural Relightable Human from Videos
    Z. Chen, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  20. Panoptic Scene Graph Generation
    J. Yang, Y. Z. Ang, Z. Guo, K. Zhou, W. Zhang, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  21. Benchmarking Omni-Vision Representation through the Lens of Visual Realms
    Y. Zhang, Z. Yin, J. Shao, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  22. Detecting and Recovering Sequential DeepFake Manipulation
    R. Shao, T. Wu, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  23. Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
    L. Zhuo, G. Wang, S. Li, W. Wu, Z. Liu
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  24. X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
    Y. He, Ge. Huang, S. Chen, J. Teng, K. Wang, Z. Yin, L. Sheng, Z. Liu, Y. Qiao, J. Shao
    European Conference on Computer Vision, 2022 (ECCV)
    [PDF] [arXiv] [Supplementary Material]
  25. Benchmarking and Analyzing Point Cloud Classification under Corruptions
    J. Ren, L. Pan, Z. Liu
    in Proceedings of International Conference on Machine Learning, 2022 (ICML)
    [arXiv] [Project Page]
  26. Investigating Trade-offs in Real-World Video Super-Resolution
    K. C. K. Chan, S. Zhou, X. Xu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  27. BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment
    K. C. K. Chan, S. Zhou, X. Xu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  28. Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
    S. Yang, L. Jiang, Z. Liu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  29. Unsupervised Image-to-Image Translation with Generative Prior
    S. Yang, L. Jiang, Z. Liu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  30. Video K-Net: A Simple, Strong, and Unified Baseline For End-to-End Dense Video Segmentation
    X. Li, W. Zhang, J. Pang, K. Chen, G. Cheng, Y. Tong, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  31. Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
    L. Siyao, W. Yu, T. Gu, C. Lin, Q. Wang, C. Qian, C. C. Loy, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  32. TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
    Y. Xu, Y. Yin, L. Jiang, Q. Wu, C. Zheng, C. C. Loy, B. Dai, W. Wu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  33. Conditional Prompt Learning for Vision-Language Models
    K. Zhou, J. Yang, C. C. Loy, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Project Page]
  34. Versatile Multi-Modal Pre-Training for Human-Centric Perception
    F. Hong, L. Pan, Z. Cai, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR, Oral)
    [PDF] [arXiv] [Project Page]
  35. Balanced MSE for Imbalanced Visual Regression
    J. Ren, M. Zhang, C. Yu, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  36. TCTrack: Temporal Contexts for Aerial Tracking
    Z. Cao, Z. Huang, L. Pan, S. Zhang, Z. Liu, C. Fu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  37. Delving Deep into the Generalization of Vision Transformers under Distribution Shifts
    C. Zhang, M. Zhang, S. Zhang, D. Jin, Q. Zhou, Z. Cai, H. Zhao, S. Yi, X. Liu, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Project Page]
  38. Full-Range Virtual Try-On with Recurrent Tri-Level Transformation
    H. Yang, X. Yu, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [Supplementary Material]
  39. Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis
    J. Wang, Y. Rong, J. Liu, S. Yan, D. Lin, B. Dai
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material]
  40. Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
    X. Liu, Q. Wu, H. Zhou, Y. Xu, R. Qian, X. Lin, X. Zhou, W. Wu, B. Dai, B. Zhou
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  41. Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
    Y. Xu, F. Wei, X. Sun, C. Yang, Y. Shen, B. Dai, B. Zhou, S. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  42. Revisiting Skeleton-based Action Recognition
    H. Duan, Y. Zhao, K. Chen, D. Lin, B. Dai
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2022 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  43. Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets
    K. T. R. Voo, L. Jiang, C. C. Loy
    in Workshop Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, VDU, 2022 (CVPRW)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  44. MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks
    W. Zhu, Z. Yang, Z. Di, W. Wu, Y. Wang, C. C. Loy
    in Proceedings of AAAI Conference on Artificial Intelligence, 2022 (AAAI)
    [arXiv] [Project Page]
  45. Visual Sound Localization in-the-Wild by Cross-Modal Interference Erasing
    X. Liu, R. Qian, H. Zhou, W. lin, Z. Liu, B. Zhou, X. Zhou
    in Proceedings of AAAI Conference on Artificial Intelligence, 2022 (AAAI)
    [PDF] [arXiv]
  46. SepFusion: Finding Optimal Fusion Structures for Visual Sound Separation
    D. Zhou, X. Zhou, D. Hu, H. Zhou, L. Bai, Z. Liu, W. Ouyang
    in Proceedings of AAAI Conference on Artificial Intelligence, 2022 (AAAI)
    [PDF]
  47. TAda! Temporally-Adaptive Convolutions for Video Understanding
    Z. Huang, S. Zhang, L. Pan, Z. Qing, M. Tang, Z. Liu, M. H. Ang Jr
    International Conference on Learning Representations, 2022 (ICLR)
    [arXiv]

Technical Report

  1. Unified Vision and Language Prompt Learning
    Y. Zang, W. Li, K. Zhou, C. Huang, C. C. Loy
    Technical report, arXiv:2210.07225, 2022
    [arXiv]
  2. On-Device Domain Generalization
    K. Zhou, Y. Zhang, Y. Zang, J. Yang, C. C. Loy, Z. Liu
    Technical report, arXiv:2209.07521, 2022
    [arXiv]
  3. StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3
    H. Qiu, Y. Jiang, H. Zhou, W. Wu, Z. Liu
    Technical report, arXiv:2208.07862, 2022
    [arXiv] [Project Page]
  4. CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment
    C. Li, C. Guo, R. Feng, S. Zhou, C. C. Loy
    Technical report, arXiv:2207.14273, 2022
    [arXiv] [Project Page]
  5. Neural Prompt Search
    Y. Zhang, K. Zhou, Z. Liu
    Technical report, arXiv:2206.04673, 2022
    [arXiv] [Project Page]
  6. Robust Face Anti-Spoofing with Dual Probabilistic Modeling
    Y. Zhang, Y. Wu, Z. Yin, J. Shao, Z. Liu
    Technical report, arXiv:2204.12685, 2022
    [arXiv]
  7. Few-shot Forgery Detection via Guided Adversarial Interpolation
    H. Qiu, S. Chen, B. Gan, K. Wang, H. Shi, J. Shao, Z. Liu
    Technical report, arXiv:2204.05905, 2022
    [arXiv]
  8. On the Generalization of BasicVSR++ to Video Deblurring and Denoising
    K. C. K. Chan, S. Zhou, X. Xu, C. C. Loy
    Technical report, arXiv:2204.05308, 2022
    [arXiv] [Project Page]
  9. Full-Spectrum Out-of-Distribution Detection
    J. Yang · K. Zhou · Z. Liu
    Technical report, arXiv:2204.05306, 2022
    [arXiv] [Project Page]
  10. Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
    Y. Zhang, Q. Sun, Y. Zhou, Z. He, Z. Yin, K. Wang, L. Sheng, Y. Qiao, J. Shao, Z. Liu
    Technical report, arXiv:2203.07845, 2022
    [arXiv] [Project Page]

2021

Journal

  1. Low-Light Image and Video Enhancement Using Deep Learning: A Survey
    C. Li, C. Guo, L. Han, J. Jiang, M. Cheng, J. Gu, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 (TPAMI)
    [DOI] [arXiv] [Project Page]
  2. Iterative Human and Automated Identification of Wildlife Images
    Z. Miao, Z. Liu, K. M. Gaynor, M. S. Palmer, S. X. Yu, W. M. Getz
    Nature Machine Intelligence, vol. 3, pp. 885–895, 2021 (Nat Mach Intell)
    [DOI] [arXiv]
  3. Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation
    X. Pan, X. Zhan, B. Dai, D. Lin, C. C. Loy, P. Luo
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 (TPAMI)
    [DOI] [Project Page]
  4. Path-Restore: Learning Network Path Selection for Image Restoration
    K. Yu, X. Wang, C. Dong, X. Tang, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 (TPAMI)
    [DOI] [arXiv] [Project Page]
  5. CARAFE++: Unified Content-Aware ReAssembly of FEatures
    J. Wang, K. Chen, R. Xu, Z. Liu, C. C. Loy, D. Lin
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 (TPAMI)
    [DOI] [arXiv]
  6. Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation
    C. Li, C. Guo, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 (TPAMI)
    [DOI] [arXiv] [Project Page]
  7. Texture Memory-Augmented Deep Patch-Based Image Inpainting
    R. Xu, M. Guo, J. Wang, X. Li, B. Zhou, C. C. Loy
    IEEE Transactions on Image Processing, 2021 (TIP)
    [DOI] [arXiv]

Conference

  1. K-Net: Towards Unified Image Segmentation
    W. Zhang, J. Pang, K. Chen, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  2. Unsupervised Object-Level Representation Learning from Scene Images
    J. Xie, X. Zhan, Z. Liu, Y. S. Ong, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  3. Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data
    L. Jiang, B. Dai, W. Wu, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page] [YouTube]
  4. A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis
    X. Pan, X. Xu, C. C. Loy, C. Theobalt, B. Dai
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  5. Garment4D: Garment Reconstruction from Point Cloud Sequences
    F. Hong, L. Pan, Z. Cai, Z. Liu
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [Project Page]
  6. Few-Shot Object Detection via Association and Discrimination
    Y. Cao, J. Wang, Y. Jin, T. Wu, K. Chen, Z. Liu, D. Lin
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  7. Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion
    T. Wu, L. Pan, J. Zhang, T. Wang, Z. Liu, D. Lin
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  8. Generative Occupancy Fields for 3D Surface-Aware Image Synthesis
    X. Xu, X. Pan, D. Lin, B. Dai
    in Proceedings of Neural Information Processing Systems, 2021 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  9. Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements
    Y. Rong, J. Wang, Z. Liu, C. C. Loy
    in Proceedings of International Conference on 3D Vision, 2021 (3DV)
    [arXiv] [Project Page]
  10. 3D Human Texture Estimation from a Single Image with Transformers
    X. Xu, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV, Oral)
    [PDF] [arXiv] [Project Page]
  11. FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation
    Y. Zang, C. Huang, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  12. ReconfigISP: Reconfigurable Camera Image Processing Pipeline
    K. Yu, Z. Li, Y. Peng, C. C. Loy, J. Gu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  13. Focal Frequency Loss for Image Reconstruction and Synthesis
    L. Jiang, B. Dai, W. Wu, C. C. Loy
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  14. Talk-to-Edit: Fine-Grained Facial Editing via Dialog
    Y. Jiang, Z. Huang, X. Pan, C. C. Loy, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  15. Semantically Coherent Out-of-Distribution Detection
    J. Yang, H. Wang, L. Feng, X. Yan, H. Zheng, W. Zhang, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  16. Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency
    Z. Luo, Z. Cai, C. Zhou, G. Zhang, H. Zhao, S. Yi, S. Lu, H. Li, S. Zhang, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv] [Supplementary Material]
  17. Incorporating Convolution Designs into Visual Transformers
    K. Yuan, S. Guo, Z. Liu, A. Zhou, F. Yu, W. Wu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv]
  18. Differentiable Dynamic Wirings for Neural Networks
    K. Yuan, Q. Li, S. Guo, D. Chen, A. Zhou, F. Yu, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF]
  19. BlockPlanner: City Block Generation with Vectorized Graph Representation
    L. Xu, Y. Xiangli, A. Rao, N. Zhao, B. Dai, Z. Liu, D. Lin
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [Supplementary Material]
  20. Energy-Based Open-World Uncertainty Modeling for Confidence Calibration
    Y. Wang, B. Li, T. Che, K. Zhou, D. Li, Z. Liu
    in Proceedings of IEEE/CVF International Conference on Computer Vision, 2021 (ICCV)
    [PDF] [arXiv]
  21. Retrospective Class Incremental Learning
    Q. Tao, C. C. Loy, J. Cai, Z. Ge, S. See
    in Proceedings of IEEE International Conference on Multimedia and Expo, 2021 (ICME)
    [PDF]
  22. GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution
    K. C. K. Chan, X. Wang, X. Xu, J. Gu, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  23. Adversarial Robustness under Long-Tailed Distribution
    T. Wu, Z. Liu, Q. Huang, Y. Wang, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  24. Variational Relational Point Completion Network
    L. Pan, X. Chen, Z. Cai, J. Zhang, H. Zhao, S. Yi, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  25. ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
    Y. He, B. Gan, S. Chen, Y. Zhou, G. Yin, L. Song, L. Sheng, J. Shao, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  26. BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond
    K. C. K. Chan, X. Wang, K. Yu, C. Dong, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  27. Robust Reference-based Super-Resolution via C2-Matching
    Y. Jiang, K. C. K. Chan, X. Wang, C. C. Loy, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  28. Visually Informed Binaural Audio Generation without Binaural Audios
    X. Xu, H. Zhou, Z. Liu, B. Dai, X. Wang, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  29. Scene-aware Generative Network for Human Motion Synthesis
    J. Wang, S. Yan, B. Dai, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [Supplementary Material] [Project Page]
  30. LiDAR-based Panoptic Segmentation via Dynamic Shifting Network
    F. Hong, H. Zhou, X. Zhu, H. Li, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  31. Unsupervised Feature Learning by Cross-Level Instance-Group Discrimination
    X. Wang, Z. Liu, S. X. Yu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  32. Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network
    R. Feng, C. Li, H. Chen, S. Li, C. C. Loy, J. Gu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  33. Deep Animation Video Interpolation in the Wild
    L. Siyao, S. Zhao, W. Yu, W. Sun, D. Metaxas, C. C. Loy, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  34. Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
    H. Zhou, Y. Sun, W. Wu, C. C. Loy, X. Wang, Z. Liu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  35. Pareidolia Face Reenactment
    L. Song, W. Wu, C. Fu, C. Qian, C. C. Loy, R. He
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  36. Audio-Driven Emotional Video Portraits
    X. Ji, H. Zhou, K. Wang, W. Wu, X. Cao, C. C. Loy, F. Xu
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  37. Positional Encoding as Spatial Inductive Bias in GANs
    R. Xu, X. Wang, K. Chen, B. Zhou, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  38. Seesaw Loss for Long-Tailed Instance Segmentation
    J. Wang, W. Zhang, Y. Zang, Y. Cao, J. Pang, T. Gong, K. Chen, Z. Liu, C. C. Loy, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material]
  39. Unsupervised 3D Shape Completion through GAN Inversion
    J. Zhang, X. Chen, Z. Cai, L. Pan, H. Zhao, S. Yi, C. K. Yeo, B. Dai, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2021 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  40. Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs
    X. Pan, B. Dai, Z. Liu, C. C. Loy, P. Luo
    International Conference on Learning Representations, 2021 (ICLR, Oral)
    [PDF] [arXiv] [Project Page]
  41. Long-Tailed Recognition by Routing Diverse Distribution-Aware Experts
    X. Wang, L. Lian, Z. Miao, Z. Liu, S. X. Yu
    International Conference on Learning Representations, 2021 (ICLR)
    [PDF] [arXiv] [Project Page]
  42. Understanding Deformable Alignment in Video Super-Resolution
    K. C. K. Chan, X. Wang, K. Yu, C. Dong, C. C. Loy
    in Proceedings of AAAI Conference on Artificial Intelligence, 2021 (AAAI)
    [arXiv] [Project Page]
  43. PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback
    Y. Bu, T. Ma, W. Li, H. Zhou, J. Jia, S. Chen, K. Xu, D. Shi, H. Wu, Z. Yang, K. Li, Z. Wu, Y. Shi, X. Lu, Z. Liu
    in ACM Conference on Human Factors in Computing Systems, 2021 (CHI)
    [PDF]

2020

Journal

  1. A Lightweight Optical Flow CNN - Revisiting Data Fidelity and Regularization
    T.-W. Hui, X. Tang, C. C. Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020 (TPAMI)
    [DOI] [arXiv] [Project Page]
  2. High-Quality Video Generation from Static Structural Annotations
    L. Sheng, J. Pan, J. Guo, J. Shao, C. C. Loy
    International Journal of Computer Vision, 2020 (IJCV)
    [DOI]

Conference

  1. Cross-Scale Internal Graph Neural Network for Image Super-Resolution
    S. Zhou, J. Zhang, W. Zuo, C. C. Loy
    in Proceedings of Neural Information Processing Systems, 2020 (NeurIPS)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  2. Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation
    X. Pan, X. Zhan, B. Dai, D. Lin, C. C. Loy, P. Luo
    European Conference on Computer Vision, 2020 (ECCV, Oral)
    [PDF] [arXiv] [Project Page]
  3. LiteFlowNet3: Resolving Correspondence Ambiguity for More Accurate Optical Flow Estimation
    T.-W. Hui, C. C. Loy
    European Conference on Computer Vision, 2020 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  4. MEAD: A Large-scale Audio-visual Dataset for Emotional Talking Face Generation
    K. Wang, Q. Wu, L. Song, Z. Yang, W. Wu, C. Qian, R. He, Y. Qiao, C. C. Loy
    European Conference on Computer Vision, 2020 (ECCV)
    [PDF] [Supplementary Material] [Project Page]
  5. TSIT: A Simple and Versatile Framework for Image-to-Image Translation
    L. Jiang, C. Zhang, M. Huang, C. Liu, J. Shi, C. C. Loy
    European Conference on Computer Vision, 2020 (ECCV, Spotlight)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  6. Side-Aware Boundary Localization for More Precise Object Detection
    J. Wang, W. Zhang, Y. Cao, K. Chen, J. Pang, T. Gong, J. Shi, C. C. Loy, D. Lin
    European Conference on Computer Vision, 2020 (ECCV)
    [PDF] [arXiv] [Supplementary Material]
  7. RGB-D Salient Object Detection with Cross-Modality Modulation and Selection
    C. Li, R. Cong, Y. Piao, Q. Xu, C. C. Loy
    European Conference on Computer Vision, 2020 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  8. MessyTable: Instance Association in Multiple Camera Views
    Z. Cai, J. Zhang, D. Ren, C. Yu, H. Zhao, S. Yi, C. K. Yeo, C. C. Loy
    European Conference on Computer Vision, 2020 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  9. Knowledge Distillation Meets Self-Supervision
    G. Xu, Z. Liu, X. Li, C. C. Loy
    European Conference on Computer Vision, 2020 (ECCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  10. Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement
    C. Guo, C. Li, J. Guo, C. C. Loy, J. Hou, S. Kwong, R. Cong
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  11. Self-Supervised Scene De-occlusion
    X. Zhan, X. Pan, B. Dai, Z. Liu, D. Lin, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR, Oral)
    [PDF] [arXiv] [Supplementary Material] [Project Page] [YouTube]
  12. TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting
    Z. Yang, W. Zhu, W. Wu, C. Qian, Q. Zhou, B. Zhou, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  13. Prime Sample Attention in Object Detection
    Y. Cao, K. Chen, C. C. Loy, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  14. Online Deep Clustering for Unsupervised Representation Learning
    X. Zhan, J. Xie, Z. Liu, Y. S. Ong, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Project Page]
  15. Inter-Region Affinity Distillation for Road Marking Segmentation
    Y. Hou, Z. Ma, C. Liu, T.-W. Hui, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  16. EcoNAS: Finding Proxies for Economical Neural Architecture Search
    D. Zhou, X. Zhou, W. Zhang, C. C. Loy, S. Yi, X. Zhang, W. Ouyang
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material]
  17. DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection
    L. Jiang, R. Li, W. Wu, C. Qian, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  18. Learning to Cluster Faces via Confidence and Connectivity Estimation
    L. Yang, D. Chen, X. Zhan, R. Zhao, C. C. Loy, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2020 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  19. Real or Not Real, That is the Question
    Y. Xiangli, Y. Deng, B. Dai, C. C. Loy, D. Lin
    International Conference on Learning Representations, 2020 (ICLR, Spotlight)
    [PDF] [Project Page]

Technical Report

  1. Feature Pyramid Grids
    K. Chen, Y. Cao, C. C. Loy, D. Lin, C. Feichtenhofer
    Technical report, arXiv:2004.03580, 2020
    [arXiv]
  2. Residual Knowledge Distillation
    M. Gao, Y. Shen, Q. Li, C. C. Loy
    Technical report, arXiv:2002.09168, 2020
    [arXiv]

2019

Journal

  1. Deep Imbalanced Learning for Face Recognition and Attribute Prediction
    C. Huang, Y. Li, C. C. Loy, X. Tang
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019 (TPAMI)
    [DOI] [arXiv]

Conference

  1. CARAFE: Content-Aware ReAssembly of FEatures
    J. Wang, K. Chen, R. Xu, Z. Liu, C. C. Loy, D. Lin
    in Proceedings of International Conference on Computer Vision, 2019 (ICCV, Oral)
    [PDF] [arXiv] [Supplementary Material]
  2. Robust Multi-Modality Multi-Object Tracking
    W. Zhang, H. Zhou, S. Sun, Z. Wang, J. Shi, C. C. Loy
    in Proceedings of International Conference on Computer Vision, 2019 (ICCV)
    [PDF] [arXiv] [Project Page]
  3. Delving Deep into Hybrid Annotations for 3D Human Recovery in the Wild
    Y. Rong, Z. Liu, C. Li, K. Cao, C. C. Loy
    in Proceedings of International Conference on Computer Vision, 2019 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  4. Learning Lightweight Lane Detection CNNs by Self Attention Distillation
    Y. Hou, Z. Ma, C, Liu, C. C. Loy
    in Proceedings of International Conference on Computer Vision, 2019 (ICCV)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  5. Hybrid Task Cascade for Instance Segmentation
    K. Chen, J. Pang, J. Wang, Y. Xiong, X. Li, S. Sun, W. Feng, Z. Liu, J. Shi, W. Ouyang, C. C. Loy, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [arXiv] [Project Page]
  6. Region Proposal by Guided Anchoring
    J. Wang, K. Chen, S. Yang, C. C. Loy, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [arXiv] [Project Page]
  7. Deep Network Interpolation for Continuous Imagery Effect Transition
    X. Wang, K. Yu, C. Dong, X. Tang, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [arXiv] [Project Page]
  8. TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation
    W. Wu, K. Cao, C. Li, C. Qian, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [arXiv] [Supplementary Material] [Project Page]
  9. Deep Flow-Guided Video Inpainting
    R. Xu, X. Li, B. Zhou, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [arXiv] [Project Page]
  10. Dense Intrinsic Appearance Flow for Human Pose Transfer
    Y. Li, C. Huang, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [Supplementary Material] [Project Page]
  11. Self-Supervised Learning via Conditional Motion Propagation
    X. Zhan, X. Pan, Z. Liu, D. Lin, C. C. Loy
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [arXiv] [Project Page]
  12. Learning a Unified Classifier Incrementally via Rebalancing
    S. Hou, X. Pan, C. C. Loy, Z. Wang, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR)
    [PDF] [Project Page]
  13. Learning to Cluster Faces on an Affinity Graph
    L. Yang, X. Zhan, D. Chen, J. Yan, C. C. Loy, D. Lin
    in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019 (CVPR, Oral)
    [PDF] [arXiv] [Project Page]
  14. EDVR: Video Restoration with Enhanced Deformable Convolutional Networks
    X. Wang, C. K. Chan, K. Yu, C. Dong, X. Tang, C. C. Loy
    in Workshop Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, NTIRE, 2019 (CVPRW)
    [PDF] [arXiv] [Project Page]
  15. Disentangling Content and Style via Unsupervised Geometry Distillation
    W. Wu, K. Cao, C. Li, C. Qian, C. C. Loy
    International Conference on Learning Representations Workshop, 2019 (ICLRW)
    [PDF]
  16. One-shot Face Reenactment
    Y. Zhang, S. Zhang, Y. He, C. Li, C. C. Loy, Z. Liu
    in Proceedings of British Machine Vision Conference, 2019 (BMVC, Spotlight)
    [PDF] [arXiv] [Project Page]
  17. Instance-level Facial Attributes Transfer with Geometry-aware Flow
    W. Yin, Z. Liu, C. C. Loy
    in Proceedings of AAAI Conference on Artificial Intelligence, 2019 (AAAI, Spotlight)
    [PDF] [arXiv] [Project Page]
  18. Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks
    Y. Hou, Z. Ma, C. Liu, C. C. Loy
    in Proceedings of AAAI Conference on Artificial Intelligence, 2019 (AAAI, Oral)
    [PDF] [arXiv] [Project Page]

2018

Conference

  1. Non-Local Recurrent Network for Image Restoration
    D. Liu, B. Wen, Y. Fan, C. C. Loy, T. S. Huang
    in Proceedings of Neural Information Processing Systems, 2018 (NeurIPS)
    [PDF] [arXiv] [Project Page]
  2. ReenactGAN: Learning to Reenact Faces via Boundary Transfer
    W. Wu, Y. Zhang, C. Li, C. Qian, C. C. Loy
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [arXiv] [Project Page] [YouTube]
  3. Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation
    X. Li, C. C. Loy
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [arXiv]
  4. PSANet: Point-wise Spatial Attention Network for Scene Parsing
    H. Zhao, Y. Zhang, S. Liu, J. Shi, C. C. Loy, D. Lin, J. Jia
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [Project Page]
  5. Lifelong Learning via Progressive Distillation and Retrospection
    S. Hou, X. Pan, C. C. Loy, Z. Wang, D. Lin
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [Project Page]
  6. Consensus-Driven Propagation in Massive Unlabeled Data for Face Recognition
    X. Zhan, Z. Liu, J. Yan, D. Lin, C. C. Loy
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [arXiv] [Project Page]
  7. The Devil of Face Recognition is in the Noise
    F. Wang, L. Chen, C. Li, S. Huang, Y. Chen, C. Qian, C. C. Loy
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [arXiv] [Project Page]
  8. Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
    G. Yin, L. Sheng, B. Liu, N. Yu, X. Wang, J. Shao, C. C. Loy
    in Proceedings of European Conference on Computer Vision, 2018 (ECCV)
    [PDF] [arXiv]
  9. ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks
    X. Wang, K. Yu, S. Wu, J. Gu, Y. Liu, C. Dong, Y. Qiao, C. C. Loy
    in Workshop Proceedings of European Conference on Computer Vision, 2018 (ECCVW)
    [PDF] [arXiv] [Project Page]

Technical Report

  1. An Embarrassingly Simple Approach for Knowledge Distillation
    M. Gao, Y. Shen, Q. Li, C. C. Loy, X. Tang
    Technical report, arXiv:1812.01819, 2018
    [arXiv]