I am a first-year PhD student in the Computer Science Department at the University of California, Los Angeles. I joined the Center for Vision, Cognition, Learning and Autonomy in 2019. My supervisor is Professor Song-Chun Zhu. Before that, I received my Master of Science degree from the Department of Computer Science and Technology at Tsinghua University, supervised by Professor Jianyong Wang. I received my B.S. degree from the School of Automation in 2015.

I was a research intern with the NLC Group at MSRA from Nov. 2017 to May 2018, working with Dr. Lei Ji, Nan Duan and Ming Zhou. Before that, I was a visiting student with the Multimedia Laboratory at CUHK from Aug. 2016 to Oct. 2017, working with Professors Hongsheng Li and Xiaogang Wang.

I am interested in computer vision, natural language understanding, and data mining, especially multimodal learning and reasoning for vision, language, knowledge, cognition and action.



Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption
Wei Zhang, Yue Ying, Pan Lu, Hongyuan Zha
AAAI 2020  [Paper]  [BibTex]

Knowledge Aware Semantic Concept Expansion for Image-Text Matching
Botian Shi, Lei Ji, Pan Lu, Nan Duan
IJCAI 2019  [Paper]  [BibTex]
Oral Presentation

Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
Peng Gao, Zhengkai Jiang, Haoxuan You, Pan Lu, Steven CH Hoi, Xiaogang Wang, Hongsheng Li
CVPR 2019  [Paper]  [Code]  [BibTex]
Oral Presentation

A Novel Hybrid Sequential Model for Review-based Rating Prediction
Yuanquan Lu, Wei Zhang, Pan Lu, Jianyong Wang
PAKDD 2019  [Paper]  [BibTex]

Knowledge-Aware Deep Dual Networks for Text-Based Mortality Prediction
Ning Liu, Pan Lu, Wei Zhang, Jianyong Wang
ICDE 2019  [Paper]  [BibTex]

R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu, Lei Ji, Wei Zhang, Nan Duan, Ming Zhou, Jianyong Wang
SIGKDD 2018  [Paper]  [Project]  [Video]  [BibTex]
Oral Presentation

Co-attending Free-form Regions and Detections with Multi-modal Multiplicative Feature Embedding for Visual Question Answering
Pan Lu, Hongsheng Li, Wei Zhang, Jianyong Wang, Xiaogang Wang
AAAI 2018  [Paper]  [Code]  [BibTex]
Oral Presentation




Selected Honors

Professional Service


Center for Vision, Cognition, Learning and Autonomy (VCLA)
Boelter Hall
580 Portola Plaza
Los Angeles, CA 90024
lupantech [at] gmail [dot] com
[Google Scholar]  |  [GitHub]  |  [LinkedIn]

© Pan Lu 2020