Lei Zhang

Chair Professor of Computer Vision and Image Analysis

Fellow of IEEE
Department of Computing
The Hong Kong Polytechnic University
Hung Hom, Kowloon, Hong Kong

Office: PQ816
Email: cslzhang at comp.polyu dot edu.hk

I am also with OPPO Research Institute.

Education

3/1998~10/2001

PhD

Dept. of Automatic Control, Northwestern Polytechnical University, Xi'an, China.

9/1995~3/1998

M.Sc

Dept. of Automatic Control, Northwestern Polytechnical University, Xi'an, China.

9/1991~7/1995

B.Sc

Dept. of Aeronautical Engineering, Shenyang Inst. of Aeronautical Engineering, Shenyang, China.


Work Experience

7/2017~present

Chair Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

7/2015~6/2017

Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

9/2010~6/2015

Associate Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

1/2006~8/2010

Assistant Professor, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.

1/2003~1/2006

Postdoctoral Fellow, Dept. of Electrical and Computer Engineering, McMaster University, Canada.

1/2001~1/2003

Research Assistant/Associate, Dept. of Computing, Hong Kong Polytechnic University, Hong Kong.


Visual Computing Lab (our mission):

Y learning and beyond: for future visual enhancement and understanding.

 

My Google Scholar Citation Profile:

http://scholar.google.com/citations?user=tAK5l1IAAAAJ



Papers & Code


News

1.    PhD student, Research Assistant/Associate, and Postdoc positions in Image/Video Restoration/Enhancement, Diffusion Models, Vision-Language Models, Segmentation, etc., are available. Please send me your CV if you are interested.

2.    Research intern positions in Diffusion Models, Vision-Language Models, Image Enhancement, Segmentation, etc., are available at OPPO Research Institute. Please send me your CV if you are interested.

3.    I have been selected as a "Clarivate Analytics Highly Cited Researcher" from 2015 to 2023 (https://clarivate.com/hcr).

Newly accepted

1.      Z. Ma, Y. Wei, Y. Zhang, X. Zhu, Z. Lei, L. Zhang, "ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation," in ECCV 2024. (paper) (code) (High-quality, prompt-consistent text-to-3D synthesis up to 100k prompts!)

2.      T. Yang, R. Wu, P. Ren, X. Xie, L. Zhang, "Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization," in ECCV 2024. (paper) (code) (Pixel-aware generation for various tasks!)

3.      X. Yang, C. He, J. Ma, L. Zhang, "Motion-Guided Latent Diffusion for Temporally Consistent Real-World Video Super-Resolution," in ECCV 2024. (paper) (code) (Simple yet effective video super-resolution with diffusion priors!)

4.      R.B. Li, R.H. Li, S. Guo, L. Zhang, "Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models," in ECCV 2024. (paper) (code) (Disentangle the source and target prompts for better inversion!)

5.      Y. Wei, Z. Ji, J. Bai, H. Zhang, L. Zhang, W. Zuo, "MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation," in ECCV 2024. (paper) (code) (Identity preserved and flexible personalized image editing!)

6.      M. Ni, Y. Shen, L. Zhang, W. Zuo, "Responsible Visual Editing," in ECCV 2024. (paper) (code) (Making visual editing safer, fairer and more responsible!)

7.      T. Wu, K. Ma, J. Liang, Y. Yang, L. Zhang, "A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment," in ECCV 2024. (paper) (code) (Can MLLM understand image quality?)

8.      Y. Zhang, W. Zhu, C. He, L. Zhang, "LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models," in ECCV 2024. (paper) (code) (New SOTA on full-spectrum OOD detection!)

9.      R. Li, Z. Zhang, C. He, Z. Ma, V. Patel, L. Zhang, "Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding," in ECCV 2024. (paper) (code) (1(text)-2(image)-3(point cloud) aligned for 3D scene understanding!)

10.  P. Wang, Y. Wang, S. Li, Z. Zhang, Z. Lei, L. Zhang, "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation," in ECCV 2024. (paper) (code) (Self-distillation works better with geometry guidance!)

11.  G. Zhang, J. Fan, L. Chen, Z. Zhang, Z. Lei, L. Zhang, "General Geometry-Aware Weakly Supervised 3D Object Detection," in ECCV 2024. (paper) (code) (A unified and strong framework for 3D object detection!)

12.  C. He, R. Li, G. Zhang, L. Zhang, "ScatterFormer: Efficient Voxel Transformer with Scattered Attention," in ECCV 2024. (paper) (code)

13.  J. Li, L. Wang, L. Zhang, B. Wang, "TensoSDF: Roughness-aware Tensorial Representation for Robust Geometry and Material Reconstruction," ACM Transactions on Graphics (Proceedings of SIGGRAPH 2024). (paper) (code)

14.  L. Sun, J. Liang, S. Liu, H. Yong, L. Zhang, "Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspective," IEEE Trans. on Image Processing, 2024. (paper) (code)

Preprint

1.    R. Wu, L. Sun, Z. Ma, L. Zhang, "One-Step Effective Diffusion Network for Real-World Image Super-Resolution," preprint. (paper) (code) (High-quality and stable super-resolution in just one diffusion step!)

2.    W. Li, Y. Yuan, J. Liu, D. Tang, S. Wang, J. Zhu, L. Zhang, "TokenPacker: Efficient Visual Projector for Multimodal LLM," preprint. (paper) (code) (Up to 89% visual token compression!)

3.    G. Zhang, L. Fan, C. He, Z. Lei, Z. Zhang, L. Zhang, "Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection," preprint. (paper) (code) (New SOTA on point cloud 3D detection!)

4.    R. Li, L. Chen, Z. Zhang, V. Jampani, V.M. Patel, L. Zhang, "SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing," preprint. (paper) (code) (Multi-view consistent 3D scene editing!)

5.    L. Sun, R. Wu, Z. Zhang, H. Yong, L. Zhang, "Improving the Stability of Diffusion Models for Content Consistent Super-Resolution," preprint. (paper) (code)

6.    Z. Zhang, R. Li, S. Guo, Y. Cao, L. Zhang, "TMP: Temporal Motion Propagation for Online Video Super-Resolution," preprint. (paper) (code)

7.    X. Kong, C. Dong, L. Zhang, "Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy," preprint. (paper) (code&data)

8.    S. Li, M. Li, P. Wang, L. Zhang, "OpenSD: Unified Open-Vocabulary Segmentation and Detection," preprint. (paper) (code)