[NEW] We have several opening positions for full-time/part-time PhD/Research Assistants/Project Assistants/Post-docs in our team. Drop me your CV if you are interested in the position. Priority will be given to students with at least 1 paper published/accepted in premier NLP/ML/AI conferences (e.g., ACL, EMNLP, NAACL, COLING).
[NEW] We provided a tutorial in AACL 2022 about "When Cantonese NLP Meets Pre-training: Progress and Challenges". We took Cantonese as an example trying to draw the attention of our community over NLP for low-resource languages. [Slides] [Reference List].
[NEW] Three papers are accepted in EMNLP 2022, two in main conference and one in Findings! One is about comment retrieval for multimodal NLU on social media (Check the paper here), one is about cross-modality discourse on social media (Check the paper here), and the other one is about NER for privacy documents (Check the paper here). Congrats to Chunpu, Kaifa, and Kaifa's chief supervisor Dr. Xiapu Luo.
One paper is accepted in ICSE 2022 about Programing Language Pre-training! Congrats to Zhengran, Hanzhuo, and their supervisor Prof. Yuqun Zhang (SUSTech). Check the paper here.
One paper is accepted in ICML 2022 (spotlight) about Explainable AI! Congrats to Yibing and his supervisor Dr. Shiqi Wang (CityU). Check the paper here.
One paper is accepted in NAACL 2022 (Findings) about Complaints on Social Media! Congrats to Ming and his supervisor Prof. Shi Zong (Nanjing University). Check the paper here.
One paper is accepted in ACL 2022 (long paper, main conference) about Expertise Learning in Health Forums! Congrats to Xiaoxin and Yubo. Check the paper here.
One paper is accepted in WWW 2022 about User Engagements in Conversations! Congrats to Lingzhi and her supervisor Prof. Kam-Fai Wong (CUHK). Check the paper here.
Dr. Li is awarded CCF-Baidu open fund for Short Text Pre-training!
Dr. Jing Li is an Assistant Professor of the Department of Computing, The Hong Kong Polytechnic University (PolyU) since 2019. She is a member of Research Centre of Data Sciences and Artificial Intelligence (RC-DSAI). Before joining PolyU, she worked in the Natural Language Processing Center, Tencent AI Lab as a senior researcher from 2017 to 2019. Jing obtained her PhD degree from the Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong in 2017 under supervision of Professor Kam-Fai Wong. Before that, she received her B.S. degree from Department of Machine Intelligence, Peking University in 2013. Jing has broad research interests on Natural Language Processing (NLP), Computational Social Science (CSS), and Machine Learning (ML). Particularly, she works on novel algorithms for language representation learning, social media language understanding, conversation and social interaction modeling, and robust NLP and multimodal applications in the noisy real-world applications. To know more about our research team, feel free to visit PolyU SMART Group.
- Pre-training and Language Representation Learning
- Natural Language Understanding (NLU) for Social Media Contents
- Online Conversations and Social Interaction Modeling
- Natural Language Processing (NLP) and Multimodal Applications
Research Projects (as PI) |
- Mar 2022 - Feb 2023: Knowledge-Enhanced Automatic Essay Grading Research. Gift Fund from Zhongjiaoyunzhi (matched with RGC-RMGS).
- Jan 2022 - Dec 2024: Social-Transformers: A Deep Pre-training Framework for Social Media Language Understanding. RGC Early Career Scheme (ECS).
- July 2021 - June 2022: Development of a 3-hour Online Programme on Artificial Intelligence and Data Analytics. PolyU Internal Fund under Freshman Seminar for the Online Teaching Development and Educational Research Grant as PI with Co-PI Dr. Richard Lui. Feel free to explore the AIDA Interactive Playground (only for internal use of PolyU students)!
- Jan 2022 - Dec 2022: Pre-training Methods for Short Texts. CCF-Baidu Open Fund (matched with RGC-RMGS).
- Jan 2021 - Dec 2021: Comment-Aware Weakly-Supervised Classification for Social Media Texts. CCF-Tencent Rhino-Bird Young Faculty Open Research Fund (matched with RGC-RMGS).
- Jan 2021 - Dec 2023: Characterize, Detect, and Neutralize: Context-Aware Computational Methods for Media Bias on Social Platforms. NSFC (Young Scientists Fund).
- Oct 2019 - Sep 2022: Discourse Parsing for Online Conversations. PolyU Internal Fund.
- Chunpu Xu and Jing Li
Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification
EMNLP 2022. [Code and Data]
- Chunpu Xu, Hanzhuo Tan, Jing Li, and Piji Li
Understanding Social Media Cross-Modality Discourse in Linguistic Space
EMNLP 2022 Findings. [Code and Data]
- Kaifa Zhao, Le Yu, Shiyao Zhou, Jing Li, Xiapu Luo, Yat Fei Aemon Chiu, Yutong Liu
A Fine-grained Chinese Software Privacy Policy Dataset for Sequence Labeling and Regulation Compliant Identification
EMNLP 2022. [Code and Data]
- Xiaoxin Lu, Yubo Zhang, Jing Li, Shi Zong
Doctor Recommendation in Online Health Forums via Expertise Learning
ACL 2022. [Code and Data]
- Lingzhi Wang, Jing Li, Xingshan Zeng, Kam-Fai Wong
Successful New-entry Prediction for Multi-Party Online Conversations via Latent Topics and Discourse Modeling
WWW 2022. [Code and Data]
- Yuji Zhang, Yubo Zhang, Chunpu Xu, Jing Li, Ziyan Jiang, and Baolin Peng
#HowYouTagTweets: Learning User Hashtagging Preferences via Personalized Topic Attention
EMNLP 2021. [Code and Data]
- Xingshan Zeng, Jing Li, Lingzhi Wang, and Kam-Fai Wong
Modeling Global and Local Interactions for Online Conversation Recommendation
ACM TOIS 2021.
- Rong Xiang, Jing Li, Mingyu Wan, Jinghang Gu, Qin Lu, Wenjie Li, Chu-Ren Huang
Affective Awareness in Neural Sentiment Analysis
KBS Journal Volume 226 (2021).
- Zexin Lu, Keyang Ding, Yuji Zhang, Jing Li, Baolin Peng, and Lemao Liu
Engage the Public: Poll Question Generation for Social Media Posts
ACL-IJCNLP 2021. [Code and Data]
- Lu Ji, Zhongyu Wei, Jing Li, Qi Zhang, and Xuanjing Huang.
Discrete Argument Representation Learning for Interactive Argument Pair Identification
NAACL 2021.
- Lei Chen, Zhongyu Wei, Jing Li, Baohua Zhou, Qi Zhang, and Xuanjing Huang.
Modeling Evolution of Message Interaction for Rumor Resolution [Code]
COLING 2020.
- Keyang Ding, Jing Li, and Yuji Zhang.
Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Find-Grained Social Emotions to Online Topics [Data]
EMNLP 2020 (short paper).
- Lingzhi Wang, Jing Li, Xingshan Zeng, Haisong Zhang, and Kam-Fai Wong.
Continuity of Topic, Interaction, and Query: Learning to Quote in Online Conversations
EMNLP 2020.
- Yue Wang, Jing Li, Michael R. Lyu, and Irwin King.
Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings [Code and Data]
EMNLP 2020.
- Xingshan Zeng, Jing Li, Lu Wang, Zhiming Mao, and Kam-Fai Wong.
Dynamic Online Conversation Recommendation [Code and Data]
ACL 2020.
- Jichuan Zeng, Jing Li, Yulan He, Cuiyun Gao, Michael R. Lyu, and Irwin King.
What Changed Your Mind: The Roles of Dynamic Topics and Discourse in Argumentation Process [Code and Data]
WWW 2020.
- Ming Liao, Jing Li, Haisong Zhang, Lingzhi Wang, Xixin Wu, and Kam-Fai Wong.
Coupling Global and Local Context for Unsupervised Aspect Extraction
EMNLP 2019.
- Xingshan Zeng, Jing Li, Lu Wang, and Kam-Fai Wong.
Neural Conversation Recommendation with Online Interaction Modeling [Code and Data]
EMNLP 2019
- Yue Wang, Jing Li, Hou Pong Chan, Irwin King, Michael R. Lyu, and Shuming Shi.
Topic-Aware Neural Keyphrase Generation for Social Media Language
[Code and Data]
ACL 2019.
- Xingshan Zeng, Jing Li, Lu Wang, and Kam-Fai Wong.
Joint Effects of Context and User History for Predicting Online Conversation Re-entries
[Code and Data]
ACL 2019.
- Yue Wang, Jing Li, Irwin King, Michael R. Lyu, and Shuming Shi.
Microblog Hashtag Generation via Encoding Conversation Contexts
NAACL 2019.
- Jichuan Zeng, Jing Li, Yulan He, Cuiyun Gao, Michael R. Lyu, and Irwin King
What You Say and How You Say it: Joint Modeling of Topics and Discourse in Microblog Conversations [Code and Data]
TACL 2019 (presented in ACL 2019).
- Jing Li, Yan Song, Zhongyu Wei, and Kam-Fai Wong
A Joint Model of Conversational Discourse and Latent Topics on Microblogs
CL 2018. (Volume 44, Issue 4)
- Jichuan Zeng, Jing Li, Yan Song, Cuiyun Gao, Michael R. Lyu, and Irwin King
Topic Memory Networks for Short Text Classification [Code]
EMNLP 2018.
- Dingmin Wang, Yan Song, Jing Li, Jialong Han, and Haisong Zhang
A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check [Code and Data]
EMNLP 2018.
- Xingshan Zeng, Jing Li, Lu Wang, Nicholas Beauchamp, Sarah Schugars, and Kam-Fai Wong
Microblog Conversation Recommendation via Joint Modeling of Topics and Discourse
[Data]
NAACL 2018.
- Yingyi Zhang, Jing Li, Yan Song, and Chengzhi Zhang
Encoding Conversation Context for Neural Keyphrase Extraction from Microblog Posts
[Data]
NAACL 2018.
- Jing Li
Microblog Summarization Using Conversation Structures
PhD thesis. 2017
- Jing Li, Ming Liao, Wei Gao, Yulan He, and Kam-Fai Wong
Topic Extraction from Microblog Posts Using Conversation Structures [Data] [Code]
ACL 2016.
- Jing Li, Wei Gao, Zhongyu Wei, Baolin Peng, and Kam-Fai Wong
Using Content-level Structures for Summarizing Microblog Repost Trees [Data]
EMNLP 2015.
Research Experience
- Visiting PhD at Aston University, Birmingham, UK, Jan - Apr, 2016, Supervisor: Prof. Yulan He (Now in University of Warwick.)
- Visiting Scientist at Northeastern University, Boston, USA, Feb - May, 2017, Host: Prof. Lu Wang (Now in University of Michigan.)
Organization Committee Member:
Programme Committee Member (including area chairs and senior members):
- 2023: EACL, AAAI (senior member), ICASSP (meta reviewer), and ACL.
- 2022: ACL (area chair), ICASSP (meta reviewer), AAAI, and EMNLP.
- 2021: ACL (area chair for NLP Applications), IJCAI (senior member), CCL (area chair for summarization and generation), AAAI, EACL, and NAACL
- 2020: AAAI, ACL, and ICONIP (senior member)
- 2019: ACL, EMNLP, NAACL, and AAAI
- 2018: ACL and EMNLP (Best reviewer award in EMNLP 2018)
- 2017: EACL and EMNLP
- 2016: EMNLP
- 2015: EMNLP
Reviewer
- TACL: July 2021 - June 2023.
- ACL Rolling Review (as a reviewer and action edtior).
- Spring 2023: [COMP1433] Introduction to Data Analytics (co-teaching with Dr. Jibin Wu)
- Spring 2023: [COMP1004] Introduction to Artificial Intelligence and Data Analytics (co-teaching with Prof. Guandong Xu)
- Fall 2022: [COMP1004] Introduction to Artificial Intelligence and Data Analytics (co-teaching with Dr. Richard Lui)
- Spring 2022: [COMP1433] Introduction to Data Analytics.
- Spring 2021: [FH6051] Computational Linguistics (co-teaching with Prof. Chu-Ren Huang)
- Spring 2021: [COMP5511] Artifical Intelligence Concepts.
- Spring 2021: [COMP1433] Introduction to Data Analytics.
- Spring 2020: [COMP1433] Introduction to Data Analytics.
- Fall 2019: [COMP6701] Advanced Topics in Computer Algorithms (co-teaching with Dr. Jesper Jansson)
- Fall 2019: [COMP4122] Game Design and Development (co-teaching with Dr. Ping Li)
Research Students and Staffs in PolyU (we form SMART Group together!):
Alumni in PolyU:
- Xiaoxin Lu, Master student and Project Assistant. Jan 2021-June 2022. Dissertation: Doctor Recommendation in Online Health Forums via Expertise Learning. Publication: ACL 2022.
- Zexin Lu, PhD student. Graduated in Nov 2022. Thesis: Machine-Aided Online User Engagements. Publication: SLT 2021 and ACL-IJCNLP 2021. Co-supervisor: Chair Prof. Qing Li.
- Yibing Liu, Research Asisstant. Apr 2021 - June 2021. Focusing on privacy document analysis. Now a PhD student at City University of Hong Kong. Publication: ICML 2022.
- Keyang Ding, Research Assistant. Apr 2021 - present. Focusing on emotion analysis on social media. Master Dissertation: Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics. Publication: EMNLP 2020 and ACL-IJCNLP 2021. Now a PhD student in Harbin Institute of Technology.
- Bing Wang, Visiting PhD student from University of Oxford. Focusing on computer vision. Mar 2021 - June 2021.
- Junfeng Jiang, Master student. Oct 2019-Mar 2021. Dissertation: Online Medical-consultation Recommendation System with Topic Model. Now a software engineer in Oppo.
- Hongliang Sun, Master student. Oct 2019-Mar 2021. Dissertation: Domain-Specific Language Model Continue Pretraining for Chinese Weibo. Now a PhD student in Harbin Institute of Technology.
- Jiancheng Wen, Research Assistant. Aug 2020-Feb 2021. Focusing on multimodality learning.
Intern Students in Tencent:
- Yingyi Zhang, PhD student from Nanjing University of Science and Technology. Oct 2017 - May 2018. Focusing on keyphrase extraction: NAACL 2018 and JASIST 2019. Now a visiting PhD in University of Pittsburgh.
- Jichuan Zeng, PhD student from The Chinese University of Hong Kong. Dec 2017 - Aug 2019. Focusing on topic modeling: EMNLP 2018 , TACL 2019, and WWW 2020. Obtained PhD degree in 2019 and now a senior research engineer at ByteDance.
- Yue Wang, PhD student from The Chinese University of Hong Kong. May 2018 - Aug 2019. Focusing on keyphrase generation: NAACL 2019, ACL 2019, and EMNLP 2020. Obtained PhD degree in 2020.
- Ming Liao, PhD student from The Chinese University of Hong Kong. Oct 2018 - Aug 2019. Focusing on sentiment analysis: EMNLP 2019.
- Lu Ji, Master student from Fudan University. Apr - Aug 2019. Focusing on conversation discourse analysis: NAACL 2021. Tencent Rhino-Bird Elite Training Program. Graduated in June 2020 and now in Pinduoduo as a research engineer.
- Xiaoxue Liu, Master student from Nanjing University. May - Sep 2018. Focusing on event detection. Graduated in June 2019 and now in Tencent as a research engineer.
Other Student Collaborators
- Xingshan Zeng, PhD student from The Chinese University of Hong Kong. Focusing on conversation recommendation: NAACL 2018, ACL 2019, EMNLP 2019, ACL 2020, and ACM TOIS. Obtained PhD degree 2020 and now in Huawei Noah's ark lab as a researcher.
- Lingzhi Wang, PhD student from The Chinese University of Hong Kong. Focusing on user behavior analysis in social media conversations: EMNLP 2020.
- Luyang Lin, PhD student from The Chinese University of Hong Kong. Focusing on media bias on social media.
- Lei Chen, master student from Fudan University. Focusing on rumor detection. COLING 2020.
|