Hongxia Yang

Professor
Department of Computing
HK Polytechnic University
Dr. Hongxia Yang received her PhD from Duke University. She has published over 100 papers in top-tier conferences and journals and holds more than 50 patents in the USA and China. Previously, she was a research staff member at the IBM T.J. Watson Research Center, a principal scientist at Yahoo!, an AI scientist and Director at Alibaba DAMO Academy, an adjunct professor at Zhejiang University’s Shanghai Advanced Research Institute, and the Head of Large Language Models at ByteDance US.
Open positions: I am recruiting fully funded Ph.D. students and postdocs interested in Generative AI and decentralized computing. Please feel free to email me your CV. Self-funded visiting students and scholars are also welcome to apply.

Awards and Honors

  • [2016] Recognized by WILEY as one of the world’s most inspirational female data scientists.

  • [2019] World Artificial Intelligence Conference’s highest award, the Super AI Leader (SAIL Award).

  • [2020] National Science and Technology Progress Award Second Class (国家科学技术进步奖二等奖) for Key Technologies in Intelligent Science and Technology Information Mining and Knowledge Services and Their Large-Scale Application (Third Contributor, jointly applied with Tsinghua University).

  • [2020] National Natural Science Foundation of China key project (国家自然基金委重点项目), Research on Autonomous, Efficient, and Generalizable Neural Network Models in Complex E-commerce Environments (jointly applied with Zhejiang University).

  • [2021] Chinese Institute of Electronics Science and Technology Progress Award First Class (中国电子学会科学技术进步奖一等奖) for Ultra-Large Scale High-Performance Graph Neural Network Computing Platform and Its Applications (Second Contributor, jointly applied with Zhejiang University).

  • [2022] Ministry of Education Science and Technology Progress Award First Class (教育部科学技术进步奖一等奖) for Large-Scale Graph Neural Network Edge-Cloud Collaborative Computing Platform and Application Demonstration (Third Contributor, jointly applied with Zhejiang University).

  • [2022] Top 50 Women in Tech by Forbes China.

  • [2023-24] AI 2000 Most Influential Scholar Award.

Selected Publications

    Generative AI

  1. InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models, Linyi Li, Shijie Geng, Zhenwen Li, Yibo He, Hao Yu, Ziyue Hua, Guanghan Ning, Siwei Wang, Tao Xie, Hongxia Yang, NeurIPS, 2024.

  2. DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation, Yuang Ai, Xiaoqiang Zhou, Huaibo Huang, Xiaotian Han, Zhengyu Chen, Quanzeng You, Hongxia Yang, NeurIPS, 2024.

  3. Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model, Haogeng Liu, Quanzeng You, Xiaotian Han, Yongfei Liu, Huaibo Huang, Ran He, Hongxia Yang, NeurIPS, 2024.

  4. Empowering Large Language Model Agents through Action Learning, Haiteng Zhao, Chang Ma, Guoyin Wang, Jing Su, Lingpeng Kong, Jingjing Xu, Zhi-Hong Deng, Hongxia Yang, COLM, 2024.

  5. An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing, Ziwei Chai, Guoyin Wang, Jing Su, Tianjie Zhang, Xuanwen Huang, Xuwu Wang, Jingjing Xu, Jianbo Yuan, Hongxia Yang, Fei Wu, Yang Yang, ACL, 2024.

  6. DeVAn: Dense Video Annotation for Video-Language Models, Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fang, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang, ACL, 2024.

  7. Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction, Yiren Jian, Tingkai Liu, Yunzhe Tao, Chunhui Zhang, Soroush Vosoughi, Hongxia Yang, ACL, 2024.

  8. InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model, Haogeng Liu, Quanzeng You, Yiqi Wang, Xiaotian Han, Bohan Zhai, Yongfei Liu, Wentao Chen, Yiren Jian, Yunzhe Tao, Jianbo Yuan, Ran He, Hongxia Yang, ACL, 2024.

  9. LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild, Ziyu Zhao, Leilei Gan, Guoyin Wang, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu, ACL, 2024.

  10. Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, Zhenyu He, Guhao Feng, Shengjie Luo, Kai Yang, Liwei Wang, Jingjing Xu, Zhi Zhang, Hongxia Yang, Di He, ICML, 2024.

  11. Self-Infilling Code Generation, Lin Zheng, Jianbo Yuan, Zhi Zhang, Hongxia Yang, Lingpeng Kong, ICML, 2024.

  12. InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks, Xueyu Hu, Ziyu Zhao, Shuang Wei, Ziwei Chai, Qianli Ma, Guoyin Wang, Xuwu Wang, Jing Su, Jingjing Xu, Ming Zhu, Yao Cheng, Jianbo Yuan, Jiwei Li, Kun Kuang, Yang Yang, Hongxia Yang, Fei Wu, ICML, 2024.

  13. $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis, Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang, ICLR, 2024.

  14. LEMON: Lossless Model Expansion, Yite Wang, Jiahao Su, Hanlin Lu, Cong Xie, Tianyi Liu, Jianbo Yuan, Haibin Lin, Ruoyu Sun, Hongxia Yang, ICLR, 2024.

  15. Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling, Huangjie Zheng, Zhendong Wang, Jianbo Yuan, Guanghan Ning, Pengcheng He, Quanzeng You, Hongxia Yang, Mingyuan Zhou, ICLR, 2024.

  16. Let Models Speak Ciphers: Multiagent Debate through Embeddings, Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang, ICLR, 2024.