Pinlong Cai | 蔡品隆

cpl_pic.jpg

Research Scientist, Shanghai Artificial Intelligence Laboratory
Xuhui District, Shanghai, China
caipinlong@pjlab.org.cn
Google scholar | ORCID | ResearchGate

I have long focused on data-driven modeling for intelligent systems, applying these approaches to several domains such as Intelligent Transportation, Autonomous Driving, and Industrial Automation. Yet with the rapid emergence of Artificial General Intelligence (AGI), exemplified by large multimodal models, my perspective has gradually shifted from fitting tasks with data toward enabling machines to understand the world.

I believe true AGI should not merely be larger models, but systems that embrace knowledge-driven learning, perhaps by emulating human cognition, or through continuous self-evolution via interaction with the environment. Rather than replacing human intelligence, AGI will be an awakening of machine intelligence guided and accompanied by humanity. This evolution has the potential to fundamentally reshape how scientific discovery unfolds and profoundly advance the trajectory of human civilization.

I am committed to contributing to this journey, not just for the technology itself, but for the transformative future it may bring.


Work Experience

2021 - Now · Research Scientist · Shanghai Artificial Intelligence Laboratory
Knowledge Engine, Large Multimodal Model, Autonomous Driving
2020 - 2021 · Standard & Strategy Engineer · ZTE Corporation
V2X, Video Codec
2016 - 2017 · Research & Development Engineer · Quanzhou Institute of Equipment Manufacturing (CAS)
Computational Intelligence, Industrial Process Control

Education

2009 - 2020 · B.S. / M.S. / Ph.D. in Traffic Information Engineering and Control · Beihang University
supervised by Prof. Yunpeng Wang and Prof. Guangquan Lu

News

Dec 1, 2025 · I have been selected for the Shanghai Oriental Talent Program (Young Talent)
Sep 03, 2025 · HetaRAG (a hybrid, deep-retrieval RAG framework that unifies multiple heterogeneous data stores) has been released (Code)
Apr 29, 2024 · InternVL 1.5 (an open-source multimodal large language model) has been released (Code, Paper, Model and Dataset)
Dec 12, 2023 · Towards Knowledge-driven Autonomous Driving (review paper) has been released
Jul 15, 2023 · LimSim (a long-term interactive multi-scenario traffic simulator) has been released (Code and Paper)
Dec 14, 2022 · I have been selected for the Shanghai Rising-Star Program