Lei Zhang
Chair Professor of Computer Vision and Image Analysis
Fellow of IEEE
Office: PQ816
I am also with OPPO Research Institute. |
Education
3/1998~10/2001 |
PhD |
Dept. of Automatic Control,
Northwestern Polytechnical University,
Xi'an, China. |
9/1995~3/1998 |
M.Sc |
Dept. of Automatic Control,
Northwestern Polytechnical University,
Xi'an, China. |
9/1991~7/1995 |
B.Sc |
Dept. of Aeronautical Engineering,
Shenyang Inst. of Aeronautical Engineering,
Shenyang, China. |
Work Experience
7/2017~present |
Chair Professor, Dept. of
Computing, Hong Kong Polytechnic University, Hong Kong. |
7/2015~6/2017 |
Professor, Dept. of
Computing, Hong Kong Polytechnic University, Hong Kong. |
9/2010~6/2015 |
Associate Professor, Dept.
of Computing, Hong Kong Polytechnic University, Hong Kong. |
1/2006~8/2010 |
Assistant Professor, Dept. of
Computing, Hong Kong Polytechnic University, Hong Kong. |
1/2003~1/2006 |
Postdoctoral Fellow, Dept. of Electrical and Computer
Engineering, McMaster University,
Canada. |
1/2001~1/2003 |
Research Assistant/Associate, Dept. of Computing,
Hong Kong Polytechnic University, Hong Kong. |
Visual Computing Lab (our
mission): Y learning and beyond: for future visual enhancement and
understanding. |
My Google Scholar Citation Profile:
http://scholar.google.com/citations?user=tAK5l1IAAAAJ
News
1. The following paper of ours received the 2024 IEEE SPS Best Paper Award:
K. Zhang, W. Zuo, L. Zhang, "FFDNet: Toward a Fast and Flexible Solution for
CNN-based Image Denoising," IEEE Trans. on Image Processing, vol. 27, issue 9,
pp. 4608-4622, Sept. 2018. |
2. Several positions are available for PhD students jointly trained with OPPO Research Institute.
The research topics include Image/Video Restoration/Enhancement, Image/Video Quality
Assessment, Diffusion Models, Vision-Language Models, Efficient Network Architectures, etc.
Please send me your CV if you are interested. |
3. Several Postdoctoral Fellow or Research Associate positions are available in Image/Video
Generation and Restoration, Image/Video Quality Assessment, and Vision-Language Models.
Please send me your CV if you are interested. |
4. Research intern positions in Image/Video Enhancement, Image/Video Quality Assessment,
Diffusion Models, Vision-Language Models, etc., are available at OPPO Research Institute.
Please send me your CV if you are interested. |
Newly accepted
1. D. Chen, T. Wu, K. Ma, L. Zhang, "Toward Generalized Image Quality
Assessment: Relaxing the Perfect Reference Quality Assumption," in CVPR
2025. (paper) (code) (General image quality assessment in the era of generative models!) |
2. C. Xie, M. Li, H. Zeng, J. Luo, L. Zhang, "MaSS13K: A Matting-level
Semantic Segmentation Benchmark," in CVPR 2025. (paper) (code) (High resolution and high precision
semantic segmentation dataset and model!) |
3. Z. Ma, X. Liang, R. Wu, X. Zhu, Z. Lei, L. Zhang, "Progressive Rendering
Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation
without 3D Data," in CVPR 2025. (paper) (code) (Faster and stronger 3D generator!) |
4. L. Sun, R. Wu, Z. Ma, S. Liu, Q. Yi, L. Zhang, "Pixel-level and
Semantic-level Adjustable Super-resolution: A Dual-LoRA
Approach," in CVPR 2025. (paper) (code) (Flexible super-resolution to meet your
preference!) |
5. R. Li, T. Yang, S. Guo, L. Zhang, "RORem:
Training a Robust Object Remover with Human-in-the-Loop," in CVPR 2025. (paper) (code) (A powerful remove-any-object model with a
large-scale paired dataset!) |
6. B. Chen, G. Li, R. Wu, X. Zhang, J. Chen, J. Zhang, L. Zhang, "Adversarial
Diffusion Compression for Real-World Image Super-Resolution," in CVPR
2025. (paper) (code) (Extremely efficient generative super-resolution!) |
7. G. Li, B. Chen, C. Zhao, L. Zhang, J. Zhang, "OSMamba:
Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure
Correction," in CVPR 2025. (paper) (code) |
8. C. Xiao, M. Li, Z. Zhang, D. Meng, L. Zhang, "Spatial-Mamba: Effective
Visual State Space Models via Structure-Aware State Fusion," in ICLR
2025. (paper) (code) (A real visual Mamba model, you just need to scan once!) |
9. Z. Zhang, R. Li, L. Zhang, "FreCaS: Efficient
Higher-Resolution Image Generation via Frequency-aware Cascaded
Sampling," in ICLR 2025. (paper) (code) (Generating higher-resolution images better and faster!) |
10. X. Kong, K. Huang, P. Li, L. Zhang,
"Toward Generalizing Visual Brain Decoding to Unseen Subjects," in
ICLR 2025. (paper) (code) (Can visual brain decoding be generalized?) |
11. M. Ni, Y. Fan, L. Zhang, W. Zuo, "Visual-O1:
Understanding Ambiguous Instructions via Multi-modal Multi-turn
Chain-of-thoughts Reasoning," in ICLR 2025. (paper) (code) |
Preprint
1. Y. Wu, L. Chen, R. Li, S. Wang, C. Xie,
L. Zhang, "InsViE-1M: Effective Instruction-based Video Editing with
Elaborate Dataset Construction," (paper) (code) (A large-scale instruction-based video
editing dataset and an effective model!) |
2. D. Chen, L. Chen, Z. Zhang, L. Zhang,
"Generalized and Efficient 2D Gaussian Splatting for Arbitrary-Scale
Super-Resolution," preprint. (paper) (code) (Effective and efficient ASR with GS
representation!) |
3. H. Wei, S. Liu, C. Yuan, L. Zhang,
"Perceive, Understand and Restore: Real-World Image Super-Resolution with
Autoregressive Multimodal Generative Models," (paper) (code) (Can autoregressive multimodal models do
generative image restoration?) |
4. W. Li, Y. Yuan, J. Liu, D. Tang, S. Wang,
J. Zhu, L. Zhang, "TokenPacker: Efficient
Visual Projector for Multimodal LLM," preprint. (paper) (code) (Up to 89% visual token compression!) |
5. L. Sun, R. Wu, Z. Zhang, H. Yong, L.
Zhang, "Improving the Stability of Diffusion Models for Content
Consistent Super-Resolution," preprint. (paper) (code) |
6. X. Kong, C. Dong, L. Zhang, "Towards Effective Multiple-in-One Image
Restoration: A Sequential and Prompt Learning Strategy," preprint. (paper) (code&data) |