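Liantao Ma (馬連韜) is a Research Assistant Professor (助理研究員) at the National Engineering Research Center for Software Engineering, Peking University, and a master's supervisor. He received his Ph.D. in Computer Software and Theory from Peking University and was a Boya Postdoctoral Fellow in the Department of Computer Science at Peking University. His long-term research lies at the intersection of medicine and informatics, focusing on interpretable deep learning analysis of electronic medical record data and on using large language models (LLMs) to support clinical work and medical research. His results have been applied in smart healthcare for patients with end-stage chronic kidney disease, patients with lymphoma, and obstetric diagnosis and treatment support.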
Research Interests
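Medicine-informatics intersection, smart healthcare, prognosis prediction, diagnosis and treatment support
Analysis of multivariate time-series electronic medical record data
Interpretable deep learning
Large models for the medical vertical domain
Clinical applications: end-stage chronic kidney disease, lymphoma, obstetrics
Research Projects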
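2025.01-2025.12 R Consortium, Infrastructure Steering Committee (ISC) Grant Program, infrastructure for electronic medical record modeling methods aimed at clinical data scientists, co-principal investigator (10 projects funded worldwide per year; the first award to a Chinese research institution since 2016)
2025.01-2027.12 National Natural Science Foundation of China, ongoing, principal investigator
2021.07-2023.07 China Postdoctoral Science Foundation, Special Pre-Station Funding, completed, principal investigator (only 3 recipients nationwide in the software engineering discipline that year)
2023.01-2023.07 China Postdoctoral Science Foundation, General Program, completed, principal investigator
2023.12-2025.04 *** Logistics Support, medicine-informatics intersection *** intelligent monitoring and recommendation system, principal investigator
2021.07-2023.06 Peking University, Boya Postdoctoral Fellowship, completed, principal investigator
2024.01-2026.12 National Natural Science Foundation of China, Regional Joint Key Project, ongoing, key member
2023.01-2025.12 National Natural Science Foundation of China, Special Project, ongoing, key member
2019.10-2021.10 Ministry of Science and Technology, National Key R&D Program, Frontier Science and Technology Innovation Special Project, completed, participant
2025.01-2027.12 Beijing Natural Science Foundation, Frontier Special Project, key member
2025.01-2027.12 Peking University, Medicine+X Pilot Program, key member
Publications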
Wu, Y., Gao, J., Tang, W.*, Su, C., Zhu, Y., ... & Ma, L.* (2025). Exploring the Relationship Between Dietary Intake and Clinical Outcomes in Peritoneal Dialysis Patients. Health Data Science (HDS). Science Partner Journal. Corresponding author.
Liao, W., Zhu, Y., Wang, Z., Chu, X., Wang, Y., & Ma, L.* (2025). Learnable Prompt as Pseudo-Imputation: Reassessing the Necessity of Traditional EHR Data Imputation in Downstream Clinical Prediction. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. CCF-A (top tier of the China Computer Federation's recommended international conferences), corresponding author.
Wang, T., ... & Ma, L.* (2025). Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories. Web Conference (WWW). CCF-A, corresponding author.
Wang, Z., ... & Ma, L.* (2025). ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration. Web Conference (WWW). CCF-A, corresponding author.
Ma, L., Zhang, C., Gao, J., Jiao, X., Yu, Z., Zhu, Y., ... & Wang, T. (2023). Mortality prediction with adaptive feature importance recalibration for peritoneal dialysis patients. Patterns, 4(12). Cell Press journal, cover article, first author.
Gao, J., Zhu, Y., Wang, W., Wang, Z., Dong, G., Tang, W., ... & Ma, L.* (2024). A comprehensive benchmark for COVID-19 predictive modeling using electronic health records in intensive care. Patterns, 5(4). Cell Press journal, corresponding author.
Gao, J., Wang, Z., Tang, W., Wang, Y., Wang, L., Ma, L.*, & Zhu, Y. (2025). An AI–Clinician Interaction System for Transparent and Actionable Clinical Decision Support. Symposium on Artificial Intelligence in Learning Health Systems (SAIL). NEJM AI Top Abstract Nomination, Travel Award.
Yu, Z., Zhang, C., Wang, Y., Tang, W., Wang, J., & Ma, L.* (2024, April). Predict and Interpret Health Risk Using EHR Through Typical Patients. In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1506-1510). IEEE. CCF-B, corresponding author.
Zhu, Y., Wang, Z., He, L., Xie, S., Zheng, X., Ma, L.*, & Pan, C.* (2024, October). PRISM: Mitigating EHR Data Sparsity via Learning from Missing Feature Calibrated Prototype Patient Representations. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM) (pp. 3560-3569). CCF-B, corresponding author.
Wang, T., Zhu, Y., Wang, Z., Tang, W.*, Zhao, X., Wang, T., ... Ma, L.*, & Wang, L.* (2024). Protocol to process follow-up electronic medical records of peritoneal dialysis patients to train AI models. STAR Protocols, 5(4), 103335. Cell Press journal, invited paper, corresponding author.
Wu, H., Zhu, Y., Wang, Z., Zheng, X., Wang, L., Tang, W., ... & Ma, L.* EHRFlow: A Large Language Model-Driven Iterative Multi-Agent Electronic Health Record Data Analysis Workflow. In Artificial Intelligence and Data Science for Healthcare: Bridging Data-Centric AI and People-Centric Healthcare. KDD 2024 Workshop, oral presentation, acceptance rate 20%, corresponding author.
Hong, S., Yin, D., Tang, G., Fu, T., Ma, L., Gao, J., ... & Zhang, L. (2024, August). Artificial Intelligence and Data Science for Healthcare: Bridging Data-Centric AI and People-Centric Healthcare. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 6720-6721). KDD 2024 Workshop co-chair.
Ma, L., Ma, X., Gao, J., et al. (2021). Distilling knowledge from publicly available online EMR data to emerging epidemic for prognosis. In Proceedings of the Web Conference 2021 (pp. 3558-3568). CCF-A, first author.
Ma, L., Zhang, C., Wang, Y., et al. (2020). ConCare: Personalized clinical feature embedding via capturing the healthcare context. In Proceedings of the AAAI Conference on Artificial Intelligence, 34, 833-840. CCF-A, first author.
Ma, L., Gao, J., Wang, Y., et al. (2020). AdaCare: Explainable clinical health status representation learning via scale-adaptive feature extraction and recalibration. In Thirty-Fourth AAAI Conference on Artificial Intelligence. CCF-A, first author.
Ma, L., Zhang, C., Jiao, X., Wang, Y., Tang, W., & Zhao, J. (2021). Dr. Deep: Interpretable evaluation of patient health status based on medical feature context learning (in Chinese). Journal of Computer Research and Development (計算機研究與發(fā)展). CCF-A Chinese core journal, first author.
Ma, L., Wang, Y., Peng, G., et al. (2016). Evaluating the GPS environment friendliness of roads based on bus trajectory data (in Chinese). Journal of Computer Research and Development (計算機研究與發(fā)展), 53(12), 2694-2707. CCF-A Chinese core journal, first author.
Gao, J., Zhu, Y., Wang, W., Wang, Z., Dong, G., Tang, W., Wang, H., Wang, Y., Harrison, E., & Ma, L.* (2023). A comprehensive benchmark for COVID-19 predictive modeling using electronic health records in intensive care. AMIA Summit. American Medical Informatics Association, international presentation, corresponding author.
Liao, W., Liao, Y., Fan, Z., Zhang, J., Li, S., Yang, J., & Ma, L.* (2023). Multi-modal Medical Vision-and-Language Learning for Retinal Vein Occlusion Classification. Health Data Science Summit. HDS Summit oral presentation, outstanding abstract nomination, corresponding author.
Zhu, Y., An, J., Zhou, E., An, L., Gao, J., Li, H., Feng, H., Hou, B., Tang, W., Pan, C., & Ma, L.* (2023). Mitigating Bias in Healthcare Data through Multi-Level and Multi-Sensitive-Attribute Reweighting Method. Health Data Science Summit. HDS Summit poster, corresponding author.
Zhang, C., Gao, X., Ma, L., et al. (2021). GRASP: Generic Framework for Health Status Representation Learning Based on Incorporating Knowledge from Similar Patients. 35th AAAI Conference on Artificial Intelligence (AAAI). CCF-A.
Zhang, C., Chu, X., Ma, L., Zhu, Y., Wang, Y., Wang, J., & Zhao, J. (2022, August). M3Care: Learning with missing modalities in multimodal healthcare data. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 2418-2428). CCF-A.
Ma, X., Wang, Y., Chu, X., Ma, L., et al. (2022). Patient Health Representation Learning via Correlational Sparse Prior of Medical Features. IEEE Transactions on Knowledge and Data Engineering (TKDE). CCF-A.
Ma, X., Chu, X., Wang, Y., Lin, Y., Zhao, J., Ma, L., et al. (2023). Fused Gromov-Wasserstein Graph Mixup for Graph-level Classifications. Advances in Neural Information Processing Systems (NeurIPS). CCF-A.
Wang, J., Wang, Y., Zhang, D., Wang, F., He, Y., & Ma, L. (2017, February). PSAllocator: Multi-task allocation for participatory sensing with sensing capability constraints. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (pp. 1139-1151). CCF-A.
Wang, Y., Ma, L., et al. (2022). Method and system for detecting key health-risk events based on time-window segmentation. Chinese national invention patent. CN112205965B. First student inventor, granted.
Wang, Y., Ma, L., et al. (2023). Method and apparatus for determining potentially important patient information. Chinese national invention patent. CN112289444B. First student inventor, granted.
Lü, Y., Ma, L., et al. (2018). Fundamentals of Machine Learning (planned textbook series for the Big Data Technology and Applications major) (in Chinese). Tsinghua University Press. First student author.
Events Organized
2024.10.31 Seminar on Advancing Healthcare Informatics, Insights from Cell Press Patterns/Matter/iScience and Peking University
2024.08.26 SIGKDD Workshop, Artificial Intelligence and Data Science for Healthcare, Bridging Data-Centric AI and People-Centric Healthcare, Barcelona, Spain
2024.01.07 AI in Medicine League (AIMEL)
Courses Taught
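2024.12 Xuzhou First People's Hospital, Artificial Intelligence and Medicine Intersection
2024.12 Peking University, College of Engineering, Machine Learning and Big Data Analysis
2024.11 Peking University Tianjin Binhai Institute of New-Generation Information Technology, Tianjin University of Finance and Economics, Machine Learning and Data Mining
2024.10 Peking University, School of Software and Microelectronics, Frontiers of Software Engineering (required course for Ph.D. students)
2024.01 Peking University, School of Computer Science, ICS
Invited Talks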
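2025.02.24 Guangxi Medical University, AI Empowering Medicine
2024.12.28 Xuzhou Health Management Society, Chronic Kidney Disease Prevention and Control, Dietary and Nutrition Recommendation for Peritoneal Dialysis Patients
2024.12.22 North China Symposium on Immunotherapy for Hematologic Malignancies, Assessing the Benefit of R Maintenance after First-Line Treatment of Follicular Lymphoma and Medication Recommendation
2024.12.06 China Obstetrics Quality Control Conference, Large Language Model-Supported Assistance for Obstetric Doctor-Patient Communication
2024.10.28 International Symposium on High Confidence Software, High Confidence Software on AI-Medicine Intersection
2024.09.02 Seminar at University of Edinburgh, Building Trustworthy and Accessible Clinical Prediction Framework
2024.06.01 Inner Mongolia Hospital Association Blood Purification Academic Conference, Interpretable Prognosis Prediction for Peritoneal Dialysis Patients
Career and Education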
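2023.07-present Peking University, National Engineering Research Center for Software Engineering, Research Assistant Professor (助理研究員)
2021.07-2023.07 Peking University, Department of Computer Science, Boya Postdoctoral Fellow (co-supervisor: Prof. Yasha Wang)
2016.07-2021.07 Peking University, School of Electronics Engineering and Computer Science, Doctor of Science (advisor: Prof. Bing Xie)
2012.07-2016.07 Beihang University, School of Software, Bachelor of Engineering (advisors: Prof. Hongyi Li and Prof. Yunxiang Lü)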