Lei Zhang

Chair Professor of Computer Vision and Image Analysis

Fellow of IEEE
Department of Computing
The Hong Kong Polytechnic University
Hung Hom, Kowloon, Hong Kong

Office: PQ816
Email: cslzhang at comp.polyu dot edu.hk

I am also with OPPO Research Institute.

Education

3/1998~10/2001

PhD

Dept. of Automatic Control, Northwestern Polytechnical University, Xi'an, China.

9/1995~3/1998

M.Sc

Dept. of Automatic Control, Northwestern Polytechnical University, Xi'an, China.

9/1991~7/1995

B.Sc

Dept. of Aeronautical Engineering, Shenyang Inst. of Aeronautical Engineering, Shenyang, China.


Work Experience

7/2017~present

Chair Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

7/2015~6/2017

Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

9/2010~6/2015

Associate Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

1/2006~8/2010

Assistant Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

1/2003~1/2006

Postdoctoral Fellow, Dept. of Electrical and Computer Engineering, McMaster University, Canada.

1/2001~1/2003

Research Assistant/Associate, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.


Visual Computing Lab (our mission):

Y learning and beyond: for future visual enhancement and understanding.

 

My Google Scholar Citation Profile:

http://scholar.google.com/citations?user=tAK5l1IAAAAJ


http://t3.gstatic.com/images?q=tbn:ANd9GcSHajD6zIxvR7ORoWo3YUt1I4QtdrnCXbMSavwRvV19gHyDytAfYgMC900297235[1]

Papers&Codes


News

1.    Several Postdoctoral Fellow or Research Associate positions on Video Generation and Vision-Language Models are available. Please send me your CV if you have interest.

2.    Several PhD Student positions jointly trained with OPPO Research Institute are available. The research topics include Image/Video Restoration/Enhancement, Diffusion Models, Vision-Language Models, Efficient Network Architectures, etc. Please send me your CV if you have interest.

3.    Research Interns on Image Enhancement, Diffusion Models, Vision-Language Models, etc., are available at OPPO Research Institute. Please send me your CV if you have interest.

Newly accepted

1.      R. Wu, L. Sun, Z. Ma, L. Zhang, "One-Step Effective Diffusion Network for Real-World Image Super-Resolution," in NeurIPS 2024. (paper) (code) (High quality and stable super-resolution in just one step diffusion!)

2.      G. Zhang, L. Fan, C. He, Z. Lei, Z. Zhang, L. Zhang, "Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection," in NeurIPS 2024 (spotlight). (paper) (code) (New SOTA on point cloud 3D detection!)

3.      Y. Zhang, L. Zhang, "AdaNeg: Adaptive Negative Proxy Guided OOD Detection with Vision-Language Models," in NeurIPS 2024. (paper) (code) (New SOTA on OOD detection!)

4.      D. Chen, Z. Zhang, J. Liang, L. Zhang, "SSL: A Self-similarity Loss for Improving Generative Image Super-resolution," in ACM MM 2024. (paper) (code) (A simple yet effective loss for generative SR!)

Preprint

1.    C. Xiao, M. Li, Z. Zhang, D. Meng, L. Zhang, "Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion," preprint. (paper) (code) (A real visual Mamba model, you just need to scan once!)

2.    Z. Zhang, R. Li, L. Zhang, "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling," preprint. (paper) (code) (Generating higher-resolution images better and faster!)

3.    X. Kong, K. Huang, P. Li, L. Zhang, "Toward Generalizing Visual Brain Decoding to Unseen Subjects," preprint. (paper) (code) (Can visual brain encoding be generalized?)

4.    M. Ni, Y. Fan, L. Zhang, W. Zuo, "Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning," preprint. (paper) (code)

5.    W. Li, Y. Yuan, J. Liu, D. Tang, S. Wang, J. Zhu, L. Zhang, "TokenPacker: Efficient Visual Projector for Multimodal LLM," preprint. (paper) (code) (Up to 89% visual token compression!)

6.    R. Li, L. Chen, Z. Zhang, V. Jampani, V.M. Patel, L. Zhang, "SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing," preprint. (paper) (code) (Multi-view consistent 3D scene editing!)

7.    L. Sun, R. Wu, Z. Zhang, H. Yong, L. Zhang, "Improving the Stability of Diffusion Models for Content Consistent Super-Resolution," preprint. (paper) (code)

8.    X. Kong, C. Dong, L. Zhang, "Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy," preprint. (paper) (code&data)