A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models

Builder & Current Maintainer: Weijieying Ren, YuQing Huang, Jingxi Zhu, Zehao Liu,Tianxiang Zhao and Prof. Vasant Honavar.

Paper List

We have summarized the main branches of works for EHRs modeling, including its downstream tasks and applications. For more details, please refer to our recent survey (paper).

Branch 8: Clinical Applications

8.1 Clinical Understanding

8.1.1 Clinical Notes Summarization

Adapted large language models can outperform medical experts in clinical text summarization
Dave Van Veen, Cara Van Uden, Louis Blankemeier, Jean-Benoit Delbrouck, Asad Aali, Christian Bluethgen, Anuj Pareek, Malgorzata Polacin, Eduardo Pontes Reis, Anna Seehofnerová & others
Nature Medicine 2024
A framework to assess clinical safety and hallucination rates of LLMs for medical text summarisation
Elham Asgari, Nina Montaña-Brown, Magda Dubois, Saleh Khalil, Jasmine Balloch, Joshua Au Yeung & Dominic Pimenta
npj Digital Medicine 2025
Synoptic reporting by summarizing cancer pathology reports using large language models
Sivaraman Rajaganapathy, Shaika Chowdhury, Xiaodi Li, Vincent Buchner, Zhe He, Rui Zhang, Xiaoqian Jiang, Ping Yang, James R. Cerhan & Nansu Zong
npj Health Systems 2025
Cord-19: The covid-19 open research dataset
Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Douglas Burdick, Darrin Eide, Kathryn Funk, Yannis Katsis, Rodney Kinney & others
ArXiv 2020
Generating (factual?) narrative summaries of rcts: Experiments with neural multi-document summarization
Byron C. Wallace, Sayantan Saha, Frank Soboczenski & Iain J. Marshall
AMIA Summits on Translational Science Proceedings 2021
Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension
Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov & Luke Zettlemoyer
arXiv 2019
Pegasus: Pre-training with extracted gap-sentences for abstractive summarization
Jingqing Zhang, Yao Zhao, Mohammad Saleh & Peter Liu
ICML 2020
Prophetnet: Predicting future n-gram for sequence-to-sequence pre-training
Weizhen Qi, Yu Yan, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang & Ming Zhou
arXiv 2020
Automated radiology report summarization using an open-source natural language processing pipeline
Daniel J. Goff & Thomas W. Loehfelm
Journal of Digital Imaging 2018
Dr. summarize: Global summarization of medical dialogue by exploiting local structures
Anirudh Joshi, Namit Katariya, Xavier Amatriain & Anitha Kannan
arXiv 2020
Medically aware GPT-3 as a data generator for medical dialogue summarization
Bharath Chintagunta, Namit Katariya, Xavier Amatriain & Anitha Kannan
MLHC 2021
Aptsumm at biolaysumm task 1: Biomedical breakdown, improving readability by relevancy based selection
A.S. Poornash, Atharva Deshmukh, Archit Sharma & Sriparna Saha
BioNLP 2023
From sights to insights: Towards summarization of multimodal clinical documents
Akash Ghosh, Mohit Tomar, Abhisek Tiwari, Sriparna Saha, Jatin Salve & Setu Sinha
ACL 2024
Ms2: Multi-document summarization of medical studies
Jay DeYoung, Iz Beltagy, Madeleine van Zuylen, Bailey Kuehl & Lucy Lu Wang
arXiv 2021

8.1.2 Concept Extraction

Iterative refinement and goal articulation to optimize large language models for clinical information extraction
David Hein, Alana Christie, Michael Holcomb, Bingqing Xie, A.J. Jain, Joseph Vento, Neil Rakheja, Ameer Hamza Shakur, Scott Christley, Lindsay G. Cowell & others
npj Digital Medicine 2025
Clinical entity augmented retrieval for clinical information extraction
Ivan Lopez, Akshay Swaminathan, Karthik Vedula, Sanjana Narayanan, Fateme Nateghi Haredasht, Stephen P. Ma, April S. Liang, Steven Tate, Manoj Maddali, Robert Joseph Gallo & others
npj Digital Medicine 2025
A critical assessment of using ChatGPT for extracting structured data from clinical notes
Jingwei Huang, Donghan M. Yang, Ruichen Rong, Kuroush Nezafati, Colin Treager, Zhikai Chi, Shidan Wang, Xian Cheng, Yujia Guo, Laura J. Klesse & others
npj Digital Medicine 2024
Social determinants of health extraction from clinical notes across institutions using large language models
Vipina K. Keloth, Salih Selek, Qingyu Chen, Christopher Gilman, Sunyang Fu, Yifang Dang, Xinghan Chen, Xinyue Hu, Yujia Zhou, Huan He & others
npj Digital Medicine 2025

8.1.3 Image-Text Retrieval

Transparent medical image AI via an image-text foundation model grounded in medical literature
Chanwoo Kim, Soham U. Gadgil, Alex J. DeGrave, Jesutofunmi A. Omiye, Zhuo Ran Cai, Roxana Daneshjou, Su-In Lee
Nature Medicine 2024

8.1.4 Question Answering

8.1.6 Clinical Education / Knowledge Sharing

A visual–language foundation model for pathology image analysis using medical Twitter
Zhi Huang, Federico Bianchi, Mert Yuksekgonul, Thomas J. Montine & James Zou
Nature Medicine 2023

8.1.6 General Clinical AI Capabilities

A visual-language foundation model for computational pathology
Ming Y. Lu, Bowen Chen, Drew F.K. Williamson, Richard J. Chen, Ivy Liang, Tong Ding, Guillaume Jaume, Igor Odintsov, Long Phi Le, Georg Gerber & others
Nature Medicine 2024
Towards a general-purpose foundation model for computational pathology
Richard J. Chen, Tong Ding, Ming Y. Lu, Drew F.K. Williamson, Guillaume Jaume, Andrew H. Song, Bowen Chen, Andrew Zhang, Daniel Shao, Muhammad Shaban & others
Nature Medicine 2024
Heterogeneity and predictors of the effects of AI assistance on radiologists
Feiyang Yu, Alex Moehring, Oishi Banerjee, Tobias Salz, Nikhil Agarwal, Pranav Rajpurkar
Nature Medicine 2024

8.2 Clinical Reasoning and Decision Support

8.2.1 Diagnosis Support

8.2.1.1 Diagnosis Prediction

8.2.1.1.1 Early Detection

Multimodal GPT model for assisting thyroid nodule diagnosis and management
Jincao Yao, Yunpeng Wang, Zhikai Lei, Kai Wang, Na Feng, Fajin Dong, Jianhua Zhou, Xiaoxian Li, Xiang Hao, Jiafei Shen & others
npj Digital Medicine 2025
[Pancreatic ductal adenocarcinoma] Large-scale pancreatic cancer detection via non-contrast CT and deep learning.
Kai Cao, Yingda Xia, Jiawen Yao, Xu Han, Lukas Lambert, Tingting Zhang, Wei Tang, Gang Jin, Hui Jiang, Xu Fang, Isabella Nogues, Xuezhou Li, Wenchao Guo, Yu Wang, Wei Fang, Mingyan Qiu, Yang Hou, Tomas Kovarnik, Michal Vocka, Yimei Lu, Yingli Chen, Xin Chen, Zaiyi Liu, Jian Zhou, Chuanmiao Xie, Rong Zhang, Hong Lu, Gregory D. Hager, Alan L. Yuille, Le Lu, Chengwei Shao, Yu Shi, Qi Zhang, Tingbo Liang, Ling Zhang & Jianping Lu
Nature Medicine 2023
[breast cancer] Prospective implementation of AI-assisted screen reading to improve early detection of breast cancer.
Annie Y. Ng, Cary J.G. Oberije, Éva Ambrózay, Endre Szabó, Orsolya Serfőző, Edit Karpati, Georgia Fox, Ben Glocker, Elizabeth A. Morris, Gábor Forrai & others
Nature Medicine 2023
A deep-learning algorithm to classify skin lesions from mpox virus infection
Alexander H. Thieme, Yuanning Zheng, Gautam Machiraju, Chris Sadee, Mirja Mittermaier, Maximilian Gertler, Jorge L. Salinas, Krithika Srinivasan, Prashnna Gyawali, Francisco Carrillo-Perez & others
Nature Medicine 2023

8.2.1.1.2 Differential Diagnosis

AI-based differential diagnosis of dementia etiologies on multimodal data
Chonghua Xue, Sahana S. Kowshik, Diala Lteif, Shreyas Puducheri, Varuna H. Jasodanand, Olivia T. Zhou, Anika S. Walia, Osman B. Guney, J. Diana Zhang, Serena T. Pham & others
Nature Medicine 2024
A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
Xiaohao Mao, Yu Huang, Ye Jin, Lun Wang, Xuanzhong Chen, Honghong Liu, Xinglin Yang, Haopeng Xu, Xiaodong Luan, Ying Xiao & others
npj Digital Medicine 2025
Autonomous medical evaluation for guideline adherence of large language models
Dennis Fast, Lisa C. Adams, Felix Busch, Conor Fallon, Marc Huppertz, Robert Siepmann, Philipp Prucker, Nadine Bayerl, Daniel Truhn, Marcus Makowski & others
npj Digital Medicine 2024
Enhancing diagnostic capability with multi-agents conversational large language models
Xi Chen, Huahui Yi, Mingke You, WeiZhi Liu, Li Wang, Hairui Li, Xue Zhang, Yingman Guo, Lei Fan, Gang Chen & others
npj Digital Medicine 2025

8.2.2 Prognostic Forecasting

8.2.2.1 Risk Prediction

[mental health crisis] Machine learning model to predict mental health crises from electronic health records.
Roger Garriga, Javier Mas, Semhar Abraha, Jon Nolan, Oliver Harrison, George Tadros, Aleksandar Matic.
Nature Medicine 2022
A deep learning system for predicting time to progression of diabetic retinopathy
Ling Dai, Bin Sheng, Tingli Chen, Qiang Wu, Ruhan Liu, Chun Cai, Liang Wu, Dawei Yang, Haslina Hamzah, Yuexing Liu & others
Nature Medicine 2024
Heterogeneous aging across multiple organ systems and prediction of chronic disease and mortality.
Ye Ella Tian, Vanessa Cropley, Andrea B. Maier, Nicola T. Lautenschlager, Michael Breakspear & Andrew Zalesky
Nature Medicine 2023

8.2.2.2 Prognosis Estimation

8.2.2.3 Readmission Prediction

8.2.2.4 Mortality Prediction

8.2.3 Treatment Modeling

8.2.3.1 Treatment Effect Estimation

8.2.3.2 Treatment Recommendation

AI-guided precision parenteral nutrition for neonatal intensive care units
Thanaphong Phongpreecha, Marc Ghanem, Jonathan D. Reiss, Tomiko Oskotsky, Samson J. Mataraso, Davide De Francesco, S. Momsen Reincke, Camilo Espinosa, Philip Chung, Taryn Ng & others
Nature Medicine 2025

8.2.3.3 Counterfactual Simulation

8.2.4 Patient Stratification and Cohort Discovery

8.2.4.1 Subtyping / Endotyping

A foundation model for clinical-grade computational pathology and rare cancers detection
Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Kristen Severson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi & others
Nature Medicine 2024
Data-driven cluster analysis identifies distinct types of metabolic dysfunction-associated steatotic liver disease
Violeta Raverdy, Federica Tavaglione, Estelle Chatelain, Guillaume Lassailly, Antonio De Vincentis, Umberto Vespasiani-Gentilucci, Sami F. Qadri, Robert Caiazzo, Helene Verkindt, Chiara Saponaro & others
Nature Medicine 2024
A data-driven framework for identifying patient subgroups on which an AI/machine learning model may underperform
Adarsh Subbaswamy, Berkman Sahiner, Nicholas Petrick, Vinay Pai, Roy Adams, Matthew C. Diamond & Suchi Saria
npj Digital Medicine 2024

8.2.4.2 Personalized Cohort Selection

A machine learning approach to leveraging electronic health records for enhanced omics analysis
Samson J. Mataraso, Camilo A. Espinosa, David Seong, S. Momsen Reincke, Eloise Berson, Jonathan D. Reiss, Yeasul Kim, Marc Ghanem, Chi-Hung Shu, Tomin James & others
Nature Machine Intelligence 2025

8.3 Clinical Operations Support

8.3.1 Workflow Optimization

8.3.1.1 Triage and Prioritization

Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis
Farieda Gaber, Maqsood Shaik, Fabio Allega, Agnes Julia Bilecz, Felix Busch, Kelsey Goon, Vedran Franke & Altuna Akalin
npj Digital Medicine 2025
Continuous multimodal data supply chain and expandable clinical decision support for oncology
Jee Suk Chang, Hyunwook Kim, Eun Sil Baek, Jeong Eun Choi, Joon Seok Lim, Jin Sung Kim & Sang Joon Shin
npj Digital Medicine 2025

8.3.1.2 Clinical Note Generation

8.3.2.1 Radiology Report Generation

Collaboration between clinicians and vision-language models in radiology report generation
Ryutaro Tanno, David G. T. Barrett, Andrew Sellergren, Sumedh Ghaisas, Sumanth Dathathri, Abigail See, Johannes Welbl, Charles Lau, Tao Tu, Shekoofeh Azizi, Karan Singhal, Mike Schaekermann, Rhys May, Roy Lee, SiWai Man, Sara Mahdavi, Zahra Ahmed, Yossi Matias, Joelle Barral, S. M. Ali Eslami, Danielle Belgrave, Yun Liu, Sreenivasa Raju Kalidindi, Shravya Shetty, Vivek Natarajan, Pushmeet Kohli, Po-Sen Huang, Alan Karthikesalingam & Ira Ktena
Nature Medicine 2025

8.3.1.3 Care Recommendation

8.3.2 Patient-Task Matching

8.3.2.1 Trial Matching

PRISM: Patient Records Interpretation for Semantic clinical trial Matching system using large language models
Shashi Gupta, Aditya Basu, Mauro Nievas, Jerrin Thomas, Nathan Wolfrath, Adhitya Ramamurthi, Bradley Taylor, Anai N. Kothari, Regina Schwind, Therica M. Miller & others
npj Digital Medicine 2024

8.3.3 Clinical Interaction Agents

8.3.3.1 Assistants for Clinical Decision-Making

Toward expert-level medical question answering with large language models
Karan Singhal, Tao Tu, Juraj Gottweis, Rory Sayres, Ellery Wulczyn, Mohamed Amin, Le Hou, Kevin Clark, Stephen R. Pfohl, Heather Cole-Lewis & others
Nature Medicine 2025
Medical foundation large language models for comprehensive text analysis and beyond
Qianqian Xie, Qingyu Chen, Aokun Chen, Cheng Peng, Yan Hu, Fongci Lin, Xueqing Peng, Jimin Huang, Jeffrey Zhang, Vipina Keloth & others
npj Digital Medicine 2025
Large language model agents can use tools to perform clinical calculations
Alex J. Goodell, Simon N. Chu, Dara Rouholiman & Larry F. Chu
npj Digital Medicine 2025
Integrated image-based deep learning and language models for primary diabetes care
Jiajia Li, Zhouyu Guan, Jing Wang, Carol Y. Cheung, Yingfeng Zheng, Lee-Ling Lim, Cynthia Ciwei Lim, Paisan Ruamviboonsuk, Rajiv Raman, Leonor Corsino & others
Nature Medicine 2024
Deep learning-aided decision support for diagnosis of skin disease across skin tones
Matthew Groh, Omar Badri, Roxana Daneshjou, Arash Koochek, Caleb Harris, Luis R. Soenksen, P. Murali Doraiswamy & Rosalind Picard
Nature Medicine 2024
High-precision information retrieval for rapid clinical guideline updates
Florian Borchert, Paul Wullenweber, Annika Oeser, Nina Kreuzberger, Torsten Karge, Thomas Langer, Nicole Skoetz, Lothar H. Wieler, Matthieu-P. Schapranow & Bert Arnrich
npj Digital Medicine 2025
Retrieval augmented generation for 10 large language models and its generalizability in assessing medical fitness
Yu He Ke, Liyuan Jin, Kabilan Elangovan, Hairil Rizal Abdullah, Nan Liu, Alex Tiong Heng Sia, Chai Rick Soh, Joshua Yi Min Tung, Jasmine Chiat Ling Ong, Chang-Fu Kuo & others
npj Digital Medicine 2025
Vision–language foundation model for echocardiogram interpretation.
Matthew Christensen, Milos Vukadinovic, Neal Yuan, David Ouyang
Nature Medicine 2024.
Human-AI collaboration enables more empathic conversations in text-based peer-to-peer mental health support
Ashish Sharma, Inna W. Lin, Adam S. Miner, David C. Atkins & Tim Althoff
Nature Machine Intelligence 2023
Heterogeneity and predictors of the effects of AI assistance on radiologists
Feiyang Yu, Alex Moehring, Oishi Banerjee, Tobias Salz, Nikhil Agarwal, Pranav Rajpurkar
Nature Medicine 2024
Deep learning-aided decision support for diagnosis of skin disease across skin tones
Matthew Groh, Omar Badri, Roxana Daneshjou, Arash Koochek, Caleb Harris, Luis R. Soenksen, P. Murali Doraiswamy, Rosalind Picard
Nature Medicine 2024
Optimization of hepatological clinical guidelines interpretation by large language models: a retrieval augmented generation-based framework
Simone Kresevic, Mauro Giuffrè, Milos Ajcevic, Agostino Accardo, Lory S. Crocè & Dennis L. Shung
npj Digital Medicine 2024
GPT-4 assistance for improvement of physician performance on patient care tasks: a randomized controlled trial
Ethan Goh, Robert J. Gallo, Eric Strong, Yingjie Weng, Hannah Kerman, Jason A. Freed, Joséphine A. Cool, Zahir Kanjee, Kathleen P. Lane, Andrew S. Parsons & others
Nature Medicine 2025
A generalist medical language model for disease diagnosis assistance
Xiaohong Liu, Hao Liu, Guoxing Yang, Zeyu Jiang, Shuguang Cui, Zhaoze Zhang, Huan Wang, Liyuan Tao, Yongchang Sun, Zhu Song & others
Nature Medicine 2025
Prompt engineering in consistency and reliability with the evidence-based guideline for LLMs
Li Wang, Xi Chen, XiangWen Deng, Hao Wen, MingKe You, WeiZhi Liu, Qi Li & Jian Li
npj Digital Medicine 2024
Large language model agents can use tools to perform clinical calculations
Alex J. Goodell, Simon N. Chu, Dara Rouholiman & Larry F. Chu
npj Digital Medicine 2025

8.3.3.2 Assistants for Administrative Communication

Outpatient reception via collaboration between nurses and a large language model: a randomized controlled trial
Peixing Wan, Zigeng Huang, Wenjun Tang, Yulan Nie, Dajun Pei, Shaofen Deng, Jing Chen, Yizhi Zhou, Hongru Duan, Qingyu Chen & others
Nature Medicine 2024

8.3.3.3 Assistive Devices and Technologies

Human-centred design and fabrication of a wearable multimodal visual assistance system
Jian Tang, Yi Zhu, Gai Jiang, Lin Xiao, Wei Ren, Yu Zhou, Qinying Gu, Biao Yan, Jiayi Zhang, Hengchang Bi & others
Nature Machine Intelligence 2025

8.3.4 Administrative Automation

8.3.4.1 Clinical Coding Automation

Optimising the paradigms of human AI collaborative clinical coding
Yue Gao, Yuepeng Chen, Minghao Wang, Jinge Wu, Yunsoo Kim, Kaiyin Zhou, Miao Li, Xien Liu, Xiangling Fu, Ji Wu & others
npj Digital Medicine 2024

8.3.4.2 Scheduling and Resource Allocation

Branch 1: Data-Centric Approaches

Branch 2: Neural Modeling Strategies

2.1 Feature-Aware Modules

2.1.1 Discretization and Binning-Based Methods

Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular Domains.
Kyungeun Lee, Ye Seul Sim, Hye-Seung Cho, Moonjung Eo, Suhee Yoon, Sanghyu Yoon, Woohyung Lim
The 41st International Conference on Machine Learning (ICML 2024)
TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules.
Weijieying Ren, Xiaoting Li, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant Honavar
The 41st International Conference on Machine Learning (ICML 2024)
Byzantine-Robust Learning on Heterogeneous Datasets via Bucketing. (paper)
Sai Praneeth Karimireddy, Lie He, Martin Jaggi
The 10th International Conference on Learning Representations (ICLR 2022)
On Embeddings for Numerical Features in Tabular Deep Learning. (paper)
Yury Gorishniy, Ivan Rubachev, Artem Babenko
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Learning Binary Decision Trees by Argmin Differentiation. (paper)
Valentina Zantedeschi, Matt J. Kusner, Vlad Niculae
The 39th International Conference on Machine Learning (ICML 2022)

2.1.2 Kernel-Based Methods

Feature Learning for Interpretable, Performant Decision Trees. (paper)
Jack H. Good, Torin Kovach, Kyle Miller, Artur Dubrawski
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Kernel Density Decision Trees. (paper)
Jack H. Good, Kyle Miller, Artur Dubrawski
The 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
Heterogeneous Risk Minimization. (paper)
Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen
The 38th International Conference on Machine Learning (ICML 2021)
Kernelized Heterogeneous Risk Minimization. (paper)
Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen
The 35th Annual Conference on Neural Information Processing Systems (NeurIPS 2021)

2.2 Model Architecture Design

2.2.1 Tree-based

GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data. (paper)
Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt
The 12st International Conference on Learning Representations (ICLR 2024)
POETREE: Interpretable Policy Learning with Adaptive Decision Trees. (paper)
Alizée Pace, Alex Chan, Mihaela van der Schaar
The 10th International Conference on Learning Representations (ICLR 2022)
Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees. (paper)
Jonathan Brophy, Daniel Lowd
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Local Contrastive Feature Learning for Tabular Data. (paper)
Zhabiz Gharibshah, Xingquan Zhu.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM 2022)
Subgroup Robustness Grows On Trees: An Empirical Baseline Investigation. (paper)
Joshua P Gardner, Zoran Popovi, Ludwig Schmidt
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2022)

2.2.2 Graph-based

Time-Aware Medication Recommendation via Intervention of Dynamic Treatment Regimes.
Yishuo Li, Qi Zhang, Wenpeng Lu, Xueping Peng, Weiyu Zhang, Jiasheng Si, Yongshun Gong, Liang Hu
Proceedings of the ACM on Web Conference 2025
HI-DR: Exploiting Health Status-Aware Attention and an EHR Graph+ for Effective Medication Recommendation.
Taeri Kim, Jiho Heo, Hyunjoon Kim, Sang-Wook Kim
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2025)
Functional Graph Convolutional Networks: A Unified Multi-Task and Multi-Modal Learning Framework to Facilitate Health and Social-Care Insights.
Tobia Boschi, Francesca Bonin, Rodrigo Ordonez-Hurtado, Cécile Rousseau, Alessandra Pascale, John Dinsmore
arXiv preprint arXiv:2403.10158 (2024)
Graph transformers on EHRs: Better representation improves downstream performance.
Raphael Poulain, Rahmatollah Beheshti
The 12th International Conference on Learning Representations (ICLR 2024)
GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data.
Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt
The 12th International Conference on Learning Representations (ICLR 2024)
On Class Distributions Induced by Nearest Neighbor Graphs for Node Classification of Tabular Data. (paper)
Federico Errica
The 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
GOGGLE: Generative Modelling for Tabular Data by Learning Relational Structure. (paper)
Tennison Liu, Zhaozhi Qian, Jeroen Berrevoets, Mihaela van der Schaar
The 11th International Conference on Learning Representations (ICLR 2023)
HyTrel: Hypergraph-enhanced Tabular Data Representation Learning. (paper)
Pei Chen, Soumajyoti Sarkar, Leonard Lausen, Balasubramaniam Srinivasan, Sheng Zha, Ruihong Huang, George Karypis
The 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
T2G-FORMER: Organizing Tabular Features into Relation Graphs Promotes Heterogeneous Feature Interaction. (paper)
Jiahuan Yan, Jintai Chen, Yixuan Wu, Danny Z. Chen, Jian Wu
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
Graph-text multi-modal pre-training for medical representation learning.
Sungjin Park, Seongsu Bae, Jiho Kim, Tackeun Kim, Edward Choi
Conference on Health, Inference, and Learning (CHIL 2022)
Context-Aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs.
Chang Lu, Tian Han, Yue Ning
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2022)
Graph representation learning in biomedicine and healthcare. (paper)
Marinka Li, Kevin Huang, Marinka Zitnik
Nature Biomedical Engineering, 6(12):1186–1197, 2022
Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features. (paper)
Jiuhai Chen, Jonas Mueller, Vassilis N. Ioannidis, Soji Adeshina, Yangkun Wang, Tom Goldstein, David Wipf
The 10th International Conference on Learning Representations (ICLR 2022)
Collaborative Graph Learning with Auxiliary Text for Temporal Event Prediction in Healthcare.
Chang Lu, Chandan K. Reddy, Prithwish Chakraborty, Samantha Kleinberg, Yue Ning
Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)
Collaborative graph learning with auxiliary text for temporal event prediction in healthcare. (paper)
Chang Lu, Chandan K Reddy, Prithwish Chakraborty, Samantha Kleinberg, Yue Ning
arXiv preprint arXiv:2105.07542, 2021
Variationally regularized graph-based representation learning for electronic health records.
Weicheng Zhu, Narges Razavian
Proceedings of the Conference on Health, Inference, and Learning (CHIL 2021)
Leveraging graph-based hierarchical medical entity embedding for healthcare applications.
Tong Wu, Yunlong Wang, Yue Wang, Emily Zhao, Yilian Yuan
Scientific Reports, 11(1):5858, Nature Publishing Group, 2021
Harmonized representation learning on dynamic EHR graphs.
Dongha Lee, Xiaoqian Jiang, Hwanjo Yu
Journal of Biomedical Informatics, 106:103426, Elsevier, 2020
Learning the graphical structure of electronic health records with graph convolutional transformer. (paper)
Edward Choi, Zhen Xu, Yujia Li, Michael W Dusenberry, Glenn Flores, Ping Xie, Andrew M Dai
Proceedings of the AAAI Conference on Artificial Intelligence, 2020
Pre-training of graph augmented transformers for medication recommendation. (paper)
Junyuan Shang, Tengfei Ma, Cao Xiao, Jimeng Sun
arXiv preprint arXiv:1906.00346, 2019
GRAM: graph-based attention model for healthcare representation learning.
Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F Stewart, Jimeng Sun
The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2017)

2.2.3 Rule-based Models

ConSequence: Synthesizing Logically Constrained Sequences for Electronic Health Record Generation.
Brandon Theodorou, Shrusti Jain, Cao Xiao, Jimeng Sun.
In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 14, pp. 15355–15363, 2024.
TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules.
Weijieying Ren, Xiaoting Li, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant G. Honavar.
In Forty-first International Conference on Machine Learning (ICML), 2024.
Do Machine Learning Models Learn Statistical Rules Inferred from Data?
Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong.
In International Conference on Machine Learning (ICML), pp. 25677–25693, 2023. PMLR.
Logical Activation Functions: Logit-space Equivalents of Probabilistic Boolean Operators.
Scott Lowe, Robert Earle, Jason d'Eon, Thomas Trappenberg, Sageev Oore.
Advances in Neural Information Processing Systems (NeurIPS), vol. 35, pp. 29733–29747, 2022.

2.2.4 Additive-model-based

Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning.
Yi Cheng, Renjun Hu, Haochao Ying, Xing Shi, Jian Wu, Wei Lin
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
Towards Hybrid-Grained Feature Interaction Selection for Deep Sparse Network.
Fuyuan Lyu, Xing Tang, Dugang Liu, Chen Ma, Weihong Luo, Liang Chen, Xue Steve Liu, et al.
Advances in Neural Information Processing Systems (NeurIPS 2023)
GRAND-SLAMIN’: Interpretable Additive Modeling with Structural Constraints.
Shibal Ibrahim, Gabriel Afriat, Kayhan Behdin, Rahul Mazumder
Advances in Neural Information Processing Systems (NeurIPS 2023)
Sparse Interaction Additive Networks via Feature Interaction Detection and Sparse Selection.
James Enouen, Yan Liu
Advances in Neural Information Processing Systems (NeurIPS 2022)

2.2.5 Hierarchical and Structured Temporal Models

Graph transformers on EHRs: Better representation improves downstream performance.
Raphael Poulain, Rahmatollah Beheshti
The 12th International Conference on Learning Representations (ICLR 2024)
Transformer-Based Deep Learning Model for the Diagnosis of Suspected Lung Cancer in Primary Care Based on Electronic Health Record Data.
Lan Wang, Yonghua Yin, Ben Glampson, Robert Peach, Mauricio Barahona, Brendan C. Delaney, Erik K. Mayer
EBioMedicine (2024)
Polynomial-based Self-Attention for Table Representation Learning.
Jayoung Kim, Yehjin Shin, Jeongwhan Choi, Hyowon Wi, Noseong Park
The 41th International Conference on Machine Learning (ICML 2024)
Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning. (paper)
Yi Cheng; Renjun Hu; Haochao Ying; Xing Shi; Jian Wu; Wei Lin
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
TransformEHR: Transformer-Based Encoder-Decoder Generative Model to Enhance Prediction of Disease Outcomes Using Electronic Health Records.
Zhichao Yang, Avijit Mitra, Weisong Liu, Dan Berlowitz, Hong Yu
Nature Communications (2023)
Divide Rows and Conquer Cells: Towards Structure Recognition for Large Tables.(paper)
Huawen Shen, Xiang Gao, Jin Wei, Liang Qiao, Yu Zhou, Qiang Li, Zhanzhan Cheng
the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)
DIWIFT: Discovering Instance-wise Influential Features for Tabular Data.
Dugang Liu, Pengxiang Cheng, Hong Zhu, Xing Tang, Yanyu Chen, Xiaoting Wang, Weike Pan, Zhong Ming, Xiuqiang He
(TheWebConf 2023) (paper)
DL-BERT: A Time-Aware Double-Level BERT-Style Model with Pre-Training for Disease Prediction.
Xianlai Chen, Jiamiao Lin, Ying An
2022 IEEE International Conference on Big Data (Big Data)
Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records.
Raphael Poulain, Mehak Gupta, Rahmatollah Beheshti
Machine Learning for Healthcare Conference (MLHC 2022)
Med-BERT: Pretrained Contextualized Embeddings on Large-Scale Structured Electronic Health Records for Disease Prediction.
Laila Rasmy, Yang Xiang, Ziqian Xie, Cui Tao, Degui Zhi
NPJ Digital Medicine (2021)
RareBERT: Transformer Architecture for Rare Disease Patient Identification Using Administrative Claims.
PKS Prakash, Srinivas Chilukuri, Nikhil Ranade, Shankar Viswanathan
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2021)
Transformer-Based Multi-Target Regression on Electronic Health Records for Primordial Prevention of Cardiovascular Disease.
Raphael Poulain, Mehak Gupta, Randi Foraker, Rahmatollah Beheshti
2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Communication Efficient Federated Generalized Tensor Factorization for Collaborative Health Data Analytics.
Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Joyce C. Ho
Proceedings of the Web Conference (WWW 2021)
RAPT: Pre-Training of Time-Aware Transformer for Learning Robust Healthcare Representation.
Houxing Ren, Jingyuan Wang, Wayne Xin Zhao, Ning Wu
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)
BEHRT: Transformer for Electronic Health Records.
Yikuan Li, Shishir Rao, José Roberto Ayala Solares, Abdelaali Hassaine, Rema Ramakrishnan, Dexter Canoy, Yajie Zhu, Kazem Rahimi, Gholamreza Salimi-Khorshidi
Scientific Reports (2020)
HiTANet: Hierarchical Time-Aware Attention Networks for Risk Prediction on Electronic Health Records.
Junyu Luo, Muchao Ye, Cao Xiao, Fenglong Ma
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2020)
Attention Is All You Need.
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin
Advances in Neural Information Processing Systems (NeurIPS 2017)

2.3 Temporal Dependency Modeling

2.3.1 Irregular/Asynchronous Sampling

Integrating Sequence and Image Modeling in Irregular Medical Time Series Through Self-Supervised Learning.
Chen, Liuqing, Xiao, Shuhong, Ding, Shixian, Hu, Shanhai, Sun, Lingyun
arXiv preprint arXiv:2502.06134 (2025)
SepsisCalc: Integrating Clinical Calculators into Early Sepsis Prediction via Dynamic Temporal Graph Construction.
Yin, Changchang, Fu, Shihan, Yao, Bingsheng, Pham, Thai-Hoang, Cao, Weidan, Wang, Dakuo, Caterino, Jeffrey, Zhang, Ping
arXiv preprint arXiv:2501.00190 (2024)
Predictive modeling with temporal graphical representation on electronic health records.
Chen, Jiayuan, Yin, Changchang, Wang, Yuanlong, Zhang, Ping
IJCAI: proceedings of the conference (2024)
Dna-t: Deformable neighborhood attention transformer for irregular medical time series.
Huang, Jianxuan, Yang, Baoyao, Yin, Kejing, Xu, Jingwen
IEEE Journal of Biomedical and Health Informatics (2024)
A Hybrid Approach for Irregular-Time Series Prediction using Electronic Health Records: an Intensive Care Unit Mortality Case Study.
Zhong, Shaojie, Wang, Li Rong, Zhan, Zhuoxuan, Ng, Yih Yng, Fan, Xiuyi
ACM Transactions on Computing for Healthcare (2024)
Contiformer: Continuous-time transformer for irregular time series modeling.
Chen, Yuqi, Ren, Kan, Wang, Yansen, Fang, Yuchen, Sun, Weiwei, Li, Dongsheng
Advances in Neural Information Processing Systems (2023)
Improving medical predictions by irregular multimodal electronic health records modeling.
Zhang, Xinlu, Li, Shiyang, Chen, Zhiyu, Yan, Xifeng, Petzold, Linda Ruth
International Conference on Machine Learning (2023)
Warpformer: A multi-scale modeling approach for irregular clinical time series.
Zhang, Jiawen, Zheng, Shun, Cao, Wei, Bian, Jiang, Li, Jia
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2023)
Time series as images: Vision transformer for irregularly sampled time series.
Li, Zekun, Li, Shiyang, Yan, Xifeng
Advances in Neural Information Processing Systems (2023)
Self-supervised transformer for sparse and irregularly sampled multivariate clinical time-series.
Tipirneni, Sindhu, Reddy, Chandan K
ACM Transactions on Knowledge Discovery from Data (TKDD) (2022)
Cumulative stay-time representation for electronic health records in medical event time prediction.
Katsuki, Takayuki, Miyaguchi, Kohei, Koseki, Akira, Iwamori, Toshiya, Yanagiya, Ryosuke, Suzuki, Atsushi
arXiv preprint arXiv:2204.13451 (2022)
Imputation of missing values for electronic health record laboratory data.
Li, Jiang, Yan, Xiaowei S, Chaudhary, Durgesh, Avula, Venkatesh, Mudiganti, Satish, Husby, Hannah, Shahjouei, Shima, Afshar, Ardavan, Stewart, Walter F, Yeasin, Mohammed, others
NPJ digital medicine (2021)
Multi-time attention networks for irregularly sampled time series.
Shukla, Satya Narayan, Marlin, Benjamin M
arXiv preprint arXiv:2101.10318 (2021)
Graph-guided network for irregularly sampled multivariate time series.
Zhang, Xiang, Zeman, Marko, Tsiligkaridis, Theodoros, Zitnik, Marinka
arXiv preprint arXiv:2110.05357 (2021)
Set functions for time series.
Horn, Max, Moor, Michael, Bock, Christian, Rieck, Bastian, Borgwardt, Karsten
International Conference on Machine Learning (2020)
Deep mixed effect model using Gaussian processes: a personalized and reliable prediction for healthcare.
Chung, Ingyo, Kim, Saehoon, Lee, Juho, Kim, Kwang Joon, Hwang, Sung Ju, Yang, Eunho
Proceedings of the AAAI conference on artificial intelligence (2020)
Interpolation-prediction networks for irregularly sampled time series.
Shukla, Satya Narayan, Marlin, Benjamin M
arXiv preprint arXiv:1909.07782 (2019)
Recurrent neural networks for multivariate time series with missing values.
Che, Zhengping, Purushotham, Sanjay, Cho, Kyunghyun, Sontag, David, Liu, Yan
Scientific reports (2018)

2.3.2 Multi-Timescale Dynamics

A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories.
Placido, Davide, Yuan, Bo, Hjaltelin, Jessica X, Zheng, Chunlei, Haue, Amalie D, Chmura, Piotr J, Yuan, Chen, Kim, Jihye, Umeton, Renato, Antell, Gregory, others
Nature Medicine (2023)
Modelling long-and short-term multi-dimensional patterns in predictive maintenance with accumulative attention.
Shi, Yong, Zhang, Linzi
Reliability Engineering & System Safety (2023)
Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting.
Zhou, Tian, Ma, Ziqing, Wen, Qingsong, Wang, Xue, Sun, Liang, Jin, Rong
International conference on machine learning (2022)
Clinical risk prediction with temporal probabilistic asymmetric multi-task learning.
Nguyen, A Tuan, Jeong, Hyewon, Yang, Eunho, Hwang, Sung Ju
Proceedings of the AAAI Conference on Artificial Intelligence (2021)
Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting.
Wu, Haixu, Xu, Jiehui, Wang, Jianmin, Long, Mingsheng
Advances in neural information processing systems (2021)
Adacare: Explainable clinical health status representation learning via scale-adaptive feature extraction and recalibration.
Ma, Liantao, Gao, Junyi, Wang, Yasha, Zhang, Chaohe, Wang, Jiangtao, Ruan, Wenjie, Tang, Wen, Gao, Xin, Ma, Xinyu
Proceedings of the AAAI Conference on Artificial Intelligence (2020)
DeepAlerts: deep learning based multi-horizon alerts for clinical deterioration on oncology hospital wards.
Li, Dingwen, Lyons, Patrick G, Lu, Chenyang, Kollef, Marin
Proceedings of the AAAI Conference on Artificial Intelligence (2020)
Stagenet: Stage-aware neural networks for health risk prediction.
Gao, Junyi, Xiao, Cao, Wang, Yasha, Tang, Wen, Glass, Lucas M, Sun, Jimeng
Proceedings of the web conference 2020 (2020)

2.3.3 Conditional Clinical Sequences

A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories.
Placido, Davide, Yuan, Bo, Hjaltelin, Jessica X, Zheng, Chunlei, Haue, Amalie D, Chmura, Piotr J, Yuan, Chen, Kim, Jihye, Umeton, Renato, Antell, Gregory, others
Nature medicine (2023)
Diaformer: Automatic diagnosis via symptoms sequence generation.
Chen, Junying, Li, Dongfang, Chen, Qingcai, Zhou, Wenxiu, Liu, Xin
Proceedings of the AAAI Conference on Artificial Intelligence (2022)
A Novel Sequence-to-Subgraph Framework for Diagnosis Classification.
Chen, Jun, Yuan, Quan, Lu, Chao, Huang, Haifeng
IJCAI (2021)
Change matters: Medication change prediction with recurrent residual networks.
Yang, Chaoqi, Xiao, Cao, Glass, Lucas, Sun, Jimeng
arXiv preprint arXiv:2105.01876 (2021)
Concare: Personalized clinical feature embedding via capturing the healthcare context.
Ma, Liantao, Zhang, Chaohe, Wang, Yasha, Ruan, Wenjie, Wang, Jiangtao, Tang, Wen, Ma, Xinyu, Gao, Xin, Gao, Junyi
Proceedings of the AAAI conference on artificial intelligence (2020)
DeepAlerts: deep learning based multi-horizon alerts for clinical deterioration on oncology hospital wards.
Li, Dingwen, Lyons, Patrick G, Lu, Chenyang, Kollef, Marin
Proceedings of the AAAI Conference on Artificial Intelligence (2020)

2.4 Meta-Architectural Strategies

2.4.1 Meta-Adaptive Modeling

2.4.2 Neural Architecture Search (NAS)

TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets. (paper)
Chengrun Yang, Gabriel Bender, Hanxiao Liu, Pieter-Jan Kindermans, Madeleine Udell, Yifeng Lu, Quoc V Le, Da Huang
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)

Branch 3: Learning-Focused Approaches

3.1 Self-Supervised Learning

SwitchTab: Switched Autoencoders Are Effective Tabular Learners. (paper)
Jing Wu; Suiyao Chen; Qi Zhao; Renat Sergazinov; Chen Li; Shengjie Liu.
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)
STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables.
Jaehyun Nam, Jihoon Tack, Kyungmin Lee, Hankook Lee, Jinwoo Shin
The 11th International Conference on Learning Representations (ICLR 2023)
(paper)
Self-Supervision Enhanced Feature Selection with Correlated Gates. (paper)
Changhee Lee, Fergus Imrie, Mihaela van der Schaar
The 10th International Conference on Learning Representations (ICLR 2022)
Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption. (paper)
Dara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler
The 10th International Conference on Learning Representations (ICLR 2022)
Local Contrastive Feature Learning for Tabular Data. (paper)
Zhabiz Gharibshah, Xingquan Zhu.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM 2022)
CORE: Self- and Semi-supervised Tabular Learning with COnditional REgularizations. (paper)
Xintian Han, Rajesh Ranganath
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
Self-Supervised Adversarial Distribution Regularization for Medication Recommendation.
Yanda Wang, Weitong Chen, Dechang Pi, Lin Yue, Sen Wang, Miao Xu
International Joint Conference on Artificial Intelligence (IJCAI 2021)
SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning. (paper)
Talip Ucar, Ehsan Hajiramezanali, Lindsay Edwards
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain. (paper)
Jinsung Yoon, Yao Zhang, James Jordon, Mihaela van der Schaar
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2020)

3.2 Clustering-Based Methods

Deep Representation Learning for Clustering Longitudinal Survival Data from Electronic Health Records.
Qiu, Jiajun; Hu, Yao; Li, Li; Erzurumluoglu, Abdullah Mesut; Braenne, Ingrid; Whitehurst, Charles; Schmitz, Jochen; Arora, Jatin; Bartholdy, Boris Alexander; Gandhi, Shrey; et al.
Nature Communications (2025)
Identification of Predictive Subphenotypes for Clinical Outcomes Using Real World Data and Machine Learning.
Weishen Pan, Deep Hathi, Zhenxing Xu, Qiannan Zhang, Ying Li, Fei Wang
Nature Communications (2025)
Data-driven cluster analysis identifies distinct types of metabolic dysfunction-associated steatotic liver disease.
Raverdy, Violeta, Tavaglione, Federica, Chatelain, Estelle, Lassailly, Guillaume, De Vincentis, Antonio, Vespasiani-Gentilucci, Umberto, Qadri, Sami F, Caiazzo, Robert, Verkindt, Helene, Saponaro, Chiara, others
Nature Medicine (2024)
ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data.
Jiang, Xiangjian; Margeloiu, Andrei; Simidjievski, Nikola; Jamnik, Mateja.
Proceedings of the 41st International Conference on Machine Learning (ICML) (2024)
Clustering Interval-Censored Time-Series for Disease Phenotyping.
Chen, Irene Y.; Krishnan, Rahul G.; Sontag, David.
Proceedings of the 36th AAAI Conference on Artificial Intelligence (2022)
Learning of Cluster-based Feature Importance for Electronic Health Record Time-series.
Aguiar, Henrique; Santos, Mauro D.; Watkinson, Peter J.; Zhu, Tingting.
Proceedings of the 39th International Conference on Machine Learning (ICML) (2022)
Longitudinal Patient Stratification of Electronic Health Records with Flexible Adjustment for Clinical Outcomes.
Carr, Oliver; Javer, Avelino; Rockenschaub, Patrick; Parsons, Owen; Durichen, Robert.
Proceedings of Machine Learning for Health (2021)
CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age.
Kiyasseh, Dani; Zhu, Tingting; Clifton, David A.
Advances in Neural Information Processing Systems (NeurIPS) (2021)
Transfer Learning via Optimal Transportation for Integrative Cancer Patient Stratification.
Ziyu Liu, Wei Shao, Jie Zhang, Min Zhang, Kun Huang
International Joint Conference on Artificial Intelligence (IJCAI), 2021, pp. 2760–2766.
Deep Representation Learning of Electronic Health Records to Unlock Patient Stratification at Scale.
Landi, Isotta; Glicksberg, Benjamin S.; Lee, Hao-Chih; Cherng, Sarah; Landi, Giulia; Danieletto, Matteo; Dudley, Joel T.; Furlanello, Cesare; Miotto, Riccardo.
npj Digital Medicine (2020)
Data-IQ: Characterizing Subgroups with Heterogeneous Outcomes in Tabular Data.
Seedat, Nabeel; Crabbé, Jonathan; Bica, Ioana; van der Schaar, Mihaela.
Proceedings of the 36th Annual Conference on Neural Information Processing Systems (NeurIPS 2022)
Temporal Phenotyping Using Deep Predictive Clustering of Disease Progression.
Lee, Changhee; van der Schaar, Mihaela.
International Conference on Machine Learning (2020)
ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data.
Xiangjian Jiang, Andrei Margeloiu, Nikola Simidjievski, Mateja Jamnik
The 41th International Conference on Machine Learning (ICML 2024)
PTaRL: Prototype-based Tabular Representation Learning via Space Calibration.
Hangting Ye, Wei Fan, Xiaozhuang Song, Shun Zheng, He Zhao, Dan dan Guo, Yi Chang
The 12th International Conference on Learning Representations (ICLR 2024)
(paper)
TabR: Tabular Deep Learning Meets Nearest Neighbors. (paper)
Yury Gorishniy, Ivan Rubachev, Nikolay Kartashev, Daniil Shlenskii, Akim Kotelnikov, Artem Babenko
The 12th International Conference on Learning Representations (ICLR 2024).
On Class Distributions Induced by Nearest Neighbor Graphs for Node Classification of Tabular Data. (paper)
Federico Errica
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Learning Enhanced Representation for Tabular Data via Neighborhood Propagation. (paper)
Kounianhua Du, Weinan Zhang, Ruiwen Zhou, Yangkun Wang, Xilong Zhao, Jiarui Jin, Quan Gan, Zheng Zhang, David Wipf
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)

3.3 Latent Representation Learning

XTab: Cross-table Pretraining for Tabular Transformers.
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran
Proceedings of the 40th International Conference on Machine Learning (ICML 2023)
Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation.
Bing Xue, Ahmed Sameh Said, Ziqi Xu, Hanyang Liu, Neel Shah, Hanqing Yang, Philip Payne, Chenyang Lu
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023.
Numerical Tuple Extraction from Tables with Pre-training.
Qingping Yang, Yixuan Cao, Yingming Hu, Jianfeng Li, Nanbo Peng, Ping Luo
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
Inferring multimodal latent topics from electronic health records.
Yue Li, Pratheeksha Nair, Xing Han Lu, Zhi Wen, Yuening Wang, Amir Ardalan Kalantari Dehaghi, Yan Miao, Weiqi Liu, Tamas Ordog, Joanna M Biernacka, others
Nature Communications (2020)
Assisting clinical decisions for scarcely available treatment via disentangled latent representation.
Bing Xue, Ahmed Sameh Said, Ziqi Xu, Hanyang Liu, Neel Shah, Hanqing Yang, Philip Payne, Chenyang Lu
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)
Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings.
Klim Kireev, Maksym Andriushchenko, Carmela Troncoso, Nicolas Flammarion
Advances in Neural Information Processing Systems (NeurIPS 2023)
Revisiting Deep Learning Models for Tabular Data.
Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko
Advances in Neural Information Processing Systems (NeurIPS 2021)
ManyDG: Many-domain Generalization for Healthcare Applications.
Chaoqi Yang, M. Brandon Westover, Jimeng Sun
International Conference on Learning Representations (ICLR 2023)
BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model.
Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu, Weijian Li, Ammar Gilani, Hsi-Sheng Goan, Han Liu
The 41st International Conference on Machine Learning (ICML 2024)
Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings. (paper)
Klim Kireev, Maksym Andriushchenko, Carmela Troncoso, Nicolas Flammarion
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Revisiting Deep Learning Models for Tabular Data. (paper)
Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko.
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)

3.4 Causal Representation Learning

3.5 Continual Learning

Branch 4: Learning with External Modalities and Knowledge

4.1 Multi-modal Learning

4.1.1 Cross-model Alignment

4.1.1.1 Global Alignment

Focus on What Matters: Enhancing Medical Vision-Language Models with Automatic Attention Alignment Tuning.
Chang, Aofei and Huang, Le and Boyd, Alex James and others
arXiv (2025)
HealNet: Multimodal Fusion for Heterogeneous Biomedical Data.
Hemker, Konstantin; Simidjievski, Nikola; Jamnik, Mateja
Advances in Neural Information Processing Systems (NeurIPS) (2024)
Eye-gaze guided multi-modal alignment for medical representation learning
Ma, Chong; Jiang, Hanqi; Chen, Wenting; Li, Yiwei; Wu, Zihao; Yu, Xiaowei; Liu, Zhengliang; Guo, Lei; Zhu, Dajiang; Zhang, Tuo; et al.
Advances in Neural Information Processing Systems (NeurIPS) 2024
Multimodal Prototyping for cancer survival prediction
Song, Andrew H and Chen, Richard J and Jaume, Guillaume and Vaidya, Anurag Jayant and Baras, Alexander and Mahmood, Faisal
International Conference on Machine Learning (ICML) 2024
Pathasst: A generative foundation AI assistant towards artificial general intelligence of pathology.
Sun, Yuxuan; Zhu, Chenglu; Zheng, Sunyi; Zhang, Kai; Sun, Lin; Shui, Zhongyi; Zhang, Yunlong; Li, Honglin; Yang, Lin
Proceedings of the AAAI Conference on Artificial Intelligence (2024)
A visual-language foundation model for computational pathology.
Lu, Ming Y; Chen, Bowen; Williamson, Drew FK; Chen, Richard J; Liang, Ivy; Ding, Tong; Jaume, Guillaume; Odintsov, Igor; Le, Long Phi; Gerber, Georg; others
Nature Medicine (2024)
Vision language foundation model for echocardiogram interpretation.
Christensen, Matthew; Vukadinovic, Milos; Yuan, Neal; Ouyang, David
Nature Medicine (2024)
A multimodal generative AI copilot for human pathology
Lu, Ming Y and Chen, Bowen and Williamson, Drew FK and Chen, Richard J and Zhao, Melissa and Chow, Aaron K and Ikemura, Kenji and Kim, Ahrong and Pouli, Dimitra and Patel, Ankush and others
Nature 2024
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
hu, Kangyu and Xia, Peng and Li, Yun and Zhu, Hongtu and Wang, Sheng and Yao, Huaxiu
ICML 2025
Advancing radiograph representation learning with masked record modeling.
Zhou, Hong-Yu; Lian, Chenyu; Wang, Liansheng; Yu, Yizhou
arXiv preprint arXiv:2301.13155 (2023)
Biomedclip: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Zhang, Sheng and Xu, Yanbo and Usuyama, Naoto and Xu, Hanwen and Bagga, Jaspreet and Tinn, Robert and Preston, Sam and Rao, Rajesh and Wei, Mu and Valluri, Naveen and others.
Arxiv 2023
Pmc-clip: Contrastive language-image pre-training using biomedical documents.
Lin, Weixiong; Zhao, Ziheng; Zhang, Xiaoman; Wu, Chaoyi; Zhang, Ya; Wang, Yanfeng; Xie, Weidi
International Conference on Medical Image Computing and Computer-Assisted Intervention (2023)
Imitate: Clinical prior guided hierarchical vision-language pre-training.
Liu, Che; Cheng, Sibo; Shi, Miaojing; Shah, Anand; Bai, Wenjia; Arcucci, Rossella
arXiv preprint arXiv:2310.07355 (2023)
Human-Centred Multimodal Deep Learning Models for Chest X-Ray Diagnosis.
Chihcheng Hsieh
International Joint Conference on Artificial Intelligence (IJCAI 2023)
Prior: Prototype representation from medical images and reports.
Cheng, Pujin; Lin, Li; Lyu, Junyan; Huang, Yijin; Luo, Wenhan; Tang, Xiaoying
Proceedings of the IEEE/CVF International Conference on Computer Vision (2023)
M-flag: Medical vision-language pre-training with frozen language models and latent space geometry optimization.
Liu, Che; Cheng, Sibo; Chen, Chen; Qiao, Mengyun; Zhang, Weitong; Shah, Anand; Bai, Wenjia; Arcucci, Rossella
International Conference on Medical Image Computing and Computer-Assisted Intervention (2023)
Learning to exploit temporal structure for biomedical vision-language processing.
Bannur, Shruthi and Hyland, Stephanie and Liu, Qianchu and Perez-Garcia, Fernando and Ilse, Maximilian and Castro, Daniel C and Boecking, Benedikt and Sharma, Harshita and Bouzid, Kenza and Thieme, Anja and others
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)
A visual language foundation model for pathology image analysis using medical twitter.
Huang, Zhi; Bianchi, Federico; Yuksekgonul, Mert; Montine, Thomas J; Zou, James
Nature Medicine (2023)
Contrastive learning of medical visual representations from paired images and text.
Zhang, Yuhao and Jiang, Hang and Miura, Yasuhide and Manning, Christopher D and Langlotz, Curtis P
Machine Learning for Healthcare Conference (MLHC 2022)
Making the most of text semantics to improve biomedical vision--language processing.
Boecking, Benedikt and Usuyama, Naoto and Bannur, Shruthi and Castro, Daniel C and Schwaighofer, Anton and Hyland, Stephanie and Wetscherek, Maria and Naumann, Tristan and Nori, Aditya and Alvarez-Valle, Javier and others
European Conference on Computer Vision (ECCV 2022)
Integrated multimodal artificial intelligence framework for healthcare applications.
Soenksen, Luis R and Ma, Yu and Zeng, Cynthia and Boussioux, Leonard and Villalobos Carballo, Kimberly and Na, Liangyuan and Wiberg, Holly M and Li, Michael L and Fuentes, Ignacio and Bertsimas, Dimitris
NPJ Digital Medicine 5(1):149 (2022)
Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning.
Tiu, Ekin and Talius, Ellie and Patel, Pujan and Langlotz, Curtis P and Ng, Andrew Y and Rajpurkar, Pranav
Nature Biomedical Engineering 6(12):1399–1406 (2022)

4.1.1.2 Fine-grained Alignment

4.1.1.2.1 Fine-Grained Contextual Understanding.

Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology.
Liu, Pei; Ji, Luping; Gou, Jiaxiang; Fu, Bo; Ye, Mao
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
Prompt as Knowledge Bank: Boost Vision-language Model via Structural Representation for Zero-shot Medical Detection.
Yang, Yuguang; Chen, Tongfei; Huang, Haoyu; Yang, Linlin; Xie, Chunyu; Leng, Dawei; Cao, Xianbin; Zhang, Baochang
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language Models.
Jiang, Songtao and Zhang, Yan and Jin, Yeying and Tang, Zhihang and Wu, Yangyang and Feng, Yang and Wu, Jian and Liu, Zuozhu
arXiv preprint arXiv:2506.00805 (2025)
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Binghong Wu, Lei Liao, Shu Wei, Yongjie Ye, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang
Advances in Neural Information Processing Systems (NeurIPS) 2024
Joint learning of localized representations from medical images and reports.
Müller, Philip and Kaissis, Georgios and Zou, Congyu and Rueckert, Daniel
European Conference on Computer Vision (ECCV 2022)
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis.
Wu, Chaoyi and Zhang, Xiaoman and Zhang, Ya and Wang, Yanfeng and Xie, Weidi
IEEE/CVF International Conference on Computer Vision (ICCV 2023)
Knowledge-enhanced visual-language pre-training on chest radiology images.
Zhang, Xiaoman and Wu, Chaoyi and Zhang, Ya and Xie, Weidi and Wang, Yanfeng
Nature Communications 14(1):4542 (2023)
Imitate: Clinical prior guided hierarchical vision-language pre-training.
Liu, Che and Cheng, Sibo and Shi, Miaojing and Shah, Anand and Bai, Wenjia and Arcucci, Rossella
arXiv preprint arXiv:2310.07355 (2023)

4.1.1.2.2 Extending 2D to 3D Imaging.

Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models.
Cao, Weiwei and Zhang, Jianpeng and Xia, Yingda and Mok, Tony CW and Li, Zi and Ye, Xianghua and Lu, Le and Zheng, Jian and Tang, Yuxing and Zhang, Ling
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities.
Hamamci, Ibrahim Ethem and Er, Sezgin and Almas, Furkan and Simsek, Ayse Gulnihan and Esirgun, Sevval Nil and Dogan, Irem and Dasdelen, Muhammed Furkan and Wittmann, Bastian and Simsar, Enis and Simsar, Mehmet and others
arXiv preprint arXiv:2403.17834 (2024)
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models.
Bai, Fan and Du, Yuxin and Huang, Tiejun and Meng, Max Q.-H. and Zhao, Bo
arXiv preprint arXiv:2404.00578 (2024)
CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios.
Lin, Jingyang and Xia, Yingda and Zhang, Jianpeng and Yan, Ke and Lu, Le and Luo, Jiebo and Zhang, Ling
arXiv preprint arXiv:2404.15272 (2024)
Merlin: A Vision Language Foundation Model for 3D Computed Tomography.
Blankemeier, Louis and Cohen, Joseph Paul and Kumar, Ashwin and Van Veen, Dave and Gardezi, Syed Jamal Safdar and Paschali, Magdalini and Chen, Zhihong and Delbrouck, Jean-Benoit and Reis, Eduardo and Truyts, Cesar and others
arXiv preprint arXiv:2406.06512 (2024)
Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection
Wang, Yuli; Dai, Yuwei; Jones, Craig; Sair, Haris; Shen, Jinglai; Loizou, Nicolas; Hsu, Wen-Chi; Imami, Maliha; Jiao, Zhicheng; Zhang, Paul; et al.
Advances in Neural Information Processing Systems (NeurIPS) 2024
Benchmarking encoder-decoder architectures for biplanar X-ray to 3D bone shape reconstruction.
Shakya, Mahesh; Khanal, Bishesh
Advances in Neural Information Processing Systems (NeurIPS) (2023)

4.1.1.2.3 Region-Level Medical LMMs.

Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment.
Jiang, Yankai; Lei, Wenhui; Zhang, Xiaofan; Zhang, Shaoting
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
Reinforced Correlation Between Vision and Language for Precise Medical AI Assistant
Wang, Haonan and Mao, Jiaji and Wang, Lehan and Zhang, Qixiang and Elbatel, Marawan and Qin, Yi and Hu, Huijun and Li, Baoxun and Deng, Wenhui and Qin, Weifeng and others
arxiv 2025
Training medical large vision-language models with abnormal-aware feedback.
Zhou, Yucheng; Song, Lingran; Shen, Jianbing
arXiv preprint arXiv:2501.01377 (2025)
AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation.
Li, Qingqiu and Cui, Zihang and Bae, Seongsu and Xu, Jilan Mand Yuan, Runtian and Zhang, Yuejie and Feng, Rui and Shen, Quanli and Zhang, Xiaobo and He, Junjun and others
arXiv preprint arXiv:2505.02830 (2025)
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding.
Shui, Zhongyi and Zhang, Jianpeng and Cao, Weiwei and Wang, Sinuo and Guo, Ruizhe and Lu, Le and Yang, Lin and Ye, Xianghua and Liang, Tingbo and Zhang, Qi and others
arXiv preprint arXiv:2501.14548 (2025)
Regiongpt: Towards region understanding vision language model.
Guo, Qiushan and De Mello, Shalini and Yin, Hongxu and Byeon, Wonmin and Cheung, Ka Chun and Yu, Yizhou and Luo, Ping and Liu, Sifei
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction.
Zhang, Yilan; Xu, Yingxue; Chen, Jianqi; Xie, Fengying; Chen, Hao
The Thirteenth International Conference on Learning Representations (ICLR) (2024)
Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation.
Chen, Wenting; Shen, Linlin; Lin, Jingyang; Luo, Jiebo; Li, Xiang; Yuan, Yixuan
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL) (2024)
A refer-and-ground multimodal large language model for biomedicine.
Huang, Xiaoshuang and Huang, Haifeng and Shen, Lingdong and Yang, Yehui and Shang, Fangxin and Liu, Junwei and Liu, Jia
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2024)
Maira-2: Grounded radiology report generation.
Bannur, Shruthi and Bouzid, Kenza and Castro, Daniel C and Schwaighofer, Anton and Thieme, Anja and Bond-Taylor, Sam and Ilse, Maximilian and Pérez-García, Fernando and Salvatelli, Valentina and Sharma, Harshita and others
arXiv preprint arXiv:2406.04449 (2024)
Shikra: Unleashing multimodal LLM's referential dialogue magic.
Chen, Keqin and Zhang, Zhao and Zeng, Weili and Zhang, Richong and Zhu, Feng and Zhao, Rui
arXiv preprint arXiv:2306.15195 (2023)
Gpt4roi: Instruction tuning large language model on region-of-interest.
Zhang, Shilong and Sun, Peize and Chen, Shoufa and Xiao, Min and Shao, Wenqi and Zhang, Wenwei and Liu, Yu and Chen, Kai and Luo, Ping
arXiv preprint arXiv:2307.03601 (2023)

4.1.1.3 Data-Efficient Parallel and Unpaired Alignment.

4.1.1.3.1 Parallel Data Collection.

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation.
Lin, Tianwei; Zhang, Wenqiao; Li, Sijing; Yuan, Yuqian; Yu, Binhe; Li, Haoyuan; He, Wanggui; Jiang, Hao; Li, Mengze; Song, Xiaohui; et al.
Proceedings of the 42nd International Conference on Machine Learning (ICML) (2025)
BiomedCLIP: A multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs.
Zhang, Sheng and Xu, Yanbo and Usuyama, Naoto and Xu, Hanwen and Bagga, Jaspreet and Tinn, Robert and Preston, Sam and Rao, Rajesh and Wei, Mu and Valluri, Naveen and others
arXiv preprint arXiv:2303.00915 (2023)
LLaVA-Med: Training a large language-and-vision assistant for biomedicine in one day.
Li, Chunyuan and Wong, Cliff and Zhang, Sheng and Usuyama, Naoto and Liu, Haotian and Yang, Jianwei and Naumann, Tristan and Poon, Hoifung and Gao, Jianfeng
Advances in Neural Information Processing Systems (NeurIPS 2023)
Med-Flamingo: A multimodal medical few-shot learner.
Moor, Michael and Huang, Qian and Wu, Shirley and Yasunaga, Michihiro and Dalmia, Yash and Leskovec, Jure and Zakka, Cyril and Reis, Eduardo Pontes and Rajpurkar, Pranav
Machine Learning for Health (ML4H 2023)
OpenFlamingo: An open-source framework for training large autoregressive vision-language models.
Awadalla, Anas and Gao, Irena and Gardner, Josh and Hessel, Jack and Hanafy, Yusuf and Zhu, Wanrong and Marathe, Kalyani and Bitton, Yonatan and Gadre, Samir and Sagawa, Shiori and others
arXiv preprint arXiv:2308.01390 (2023)

4.1.1.3.2 Learning from unpaired Data.

Multimodal patient representation learning with missing modalities and labels.
Wu, Zhenbang; Dadu, Anant; Tustison, Nicholas; Avants, Brian; Nalls, Mike; Sun, Jimeng; Faghri, Faraz
The Twelfth International Conference on Learning Representations (ICLR) (2024)
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Xie, Yutong; Chen, Qi; Wang, Sinuo; To, Minh-Son; Lee, Iris; Khoo, Ee Win; Hendy, Kerolos; Koh, Daniel; Xia, Yong; Wu, Qi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
A medical multimodal large language model for future pandemics.
Liu, Fenglin; Zhu, Tingting; Wu, Xian; Yang, Bang; You, Chenyu; Wang, Chenyang; Lu, Lei; Liu, Zhangdaihong; Zheng, Yefeng; Sun, Xu; et al.
NPJ Digital Medicine 6(1):226 (2023)
Towards unifying medical vision-and-language pre-training via soft prompts.
Chen, Zhihong; Diao, Shizhe; Wang, Benyou; Li, Guanbin; Wan, Xiang
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023)
MedCLIP: Contrastive learning from unpaired medical images and text.
Wang, Zifeng; Wu, Zhenbang; Agarwal, Dinesh; Sun, Jimeng
arXiv preprint arXiv:2210.10163 (2022)
M3Care: Learning with missing modalities in multimodal healthcare data.
Zhang, Chaohe; Chu, Xu; Ma, Liantao; Zhu, Yinghao; Wang, Yasha; Wang, Jiangtao; Zhao, Junfeng
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)

4.1.1.3 Data Efficient Parallel Alignment

MedCLIP: Contrastive Learning from Unpaired Medical Images and Text.
Zifeng Wang, Zhenbang Wu, Dinesh Agarwal, Jimeng Sun
arXiv preprint arXiv:2210.10163 (2022)
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts.
Zhihong Chen, Shizhe Diao, Benyou Wang, Guanbin Li, Xiang Wan
Proceedings of the IEEE/CVF International Conference on Computer Vision (2023)
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Yutong Xie, Qi Chen, Sinuo Wang, Minh-Son To, Iris Lee, Ee Win Khoo, Kerolos Hendy, Daniel Koh, Yong Xia, Qi Wu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2024)
LVM-Med: Learning large-scale self-supervised vision models for medical imaging via second-order graph matching
MH Nguyen, Duy; Nguyen, Hoang; Diep, Nghiem; Pham, Tan Ngoc; Cao, Tri; Nguyen, Binh; Swoboda, Paul; Ho, Nhat; Albarqouni, Shadi; Xie, Pengtao; et al.
Nature Digital Medicine 2023

4.1.2 Knowledge-Informed Modeling

4.1.3 Temporal and Asynchronous Integration and Modeling

Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing.
Bannur, Shruthi; Hyland, Stephanie; Liu, Qianchu; Perez-Garcia, Fernando; Ilse, Maximilian; Castro, Daniel C; Boecking, Benedikt; Sharma, Harshita; Bouzid, Kenza; Thieme, Anja; et al.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation.
Yao, Wenfang; Liu, Chen; Yin, Kejing; Cheung, William; Qin, Jing
Advances in Neural Information Processing Systems (NeurIPS) (2024)
Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology.
Liu, Pei; Ji, Luping; Gou, Jiaxiang; Fu, Bo; Ye, Mao
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training Yang, Jinxia and Su, Bing and Zhao, Xin and Wen, Ji-Rong
International Conference on Machine Learning (ICLR) 2024
Time-to-Event Pretraining for 3D Medical Imaging Huo, Zepeng and Fries, Jason Alan and Lozano, Alejandro and Valanarasu, Jeya Maria Jose and Steinberg, Ethan and Blankemeier, Louis and Chaudhari, Akshay S and Langlotz, Curtis and Shah, Nigam H
International Conference on Machine Learning (ICLR) 2024
Addressing asynchronicity in clinical multimodal fusion via individualized chest x-ray generation Yao, Wenfang and Liu, Chen and Yin, Kejing and Cheung, William and Qin, Jing
Advances in Neural Information Processing Systems (Neurips) 2024
Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling
Xinlu Zhang, Shiyang Li, Zhiyu Chen, Xifeng Yan, Linda Petzold
International Conference on Machine Learning (ICML) 2023

4.1.4 Modality-Specific Robustness

4.2 Multi-source Learning

Communication Efficient Federated Generalized Tensor Factorization for Collaborative Health Data Analytics.
Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Joyce C. Ho
Proceedings of the Web Conference 2021, pages 171–182.

4.3 Learning with knowledge Graph

4.4 Learning with External Data Source

4.4.1 External Knowledge via Prompting and Instruction Tuning

AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation
Qingqiu Li, Zihang Cui, Seongsu Bae, Jilan Xu, Runtian Yuan, Yuejie Zhang, Rui Feng, Quanli Shen, Xiaobo Zhang, Junjun He, Shujun Wang
arxiv 2025
Prompt as Knowledge Bank: Boost Vision-language Model via Structural Representation for Zero-shot Medical Detection.
Yang, Yuguang; Chen, Tongfei; Huang, Haoyu; Yang, Linlin; Xie, Chunyu; Leng, Dawei; Cao, Xianbin; Zhang, Baochang
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement
Che Liu, Zhongwei Wan, Cheng Ouyang, Anand Shah, Wenjia Bai, Rossella Arcucci
International Conference on Machine Learning (ICML) 2024

4.4.2 Internalized Knowledge from LLMs.

EMERGE: Enhancing Multimodal Electronic Health Records Predictive Modeling with Retrieval-Augmented Generation
Yinghao Zhu, Changyu Ren, Zixiang Wang, Xiaochen Zheng, Shiyun Xie, Junlan Feng, Xi Zhu, Zhoujun Li, Liantao Ma, Chengwei Pan
ACM International Conference on Information and Knowledge Management (CIKM) 2024
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
Lee, Suhyeon, Won Jun Kim, Jinho Chang, and Jong Chul Ye
In The Twelfth International Conference on Learning Representations (ICLR 2024)

1.4.3 Case-Based Knowledge from Patient Records

Retrieve, reason, and refine: Generating accurate and faithful patient instructions Liu, Fenglin and Yang, Bang and You, Chenyu and Wu, Xian and Ge, Shen and Liu, Zhangdaihong and Sun, Xu and Yang, Yang and Clifton, David
Advances in Neural Information Processing Systems (Neurips) 2022

Branch 5: LLM-Based Modeling and Systems

5.1 Learning with LLMs

5.1.1 Prompt-Based Methods

5.1.2 Pretraining Methods

5.1.3 Fine-Tuning Methods

5.1.4 Retrieval-Augmented Methods

5.2 LLM-Driven Medical Agents

1.2.6 Masking Modeling

MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data.
Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou, Xiangchao Wang, Jie Yang
The 12st International Conference on Learning Representations (ICLR 2024).
(paper)
Data-Efficient and Interpretable Tabular Anomaly Detection.
Chun-Hao Chang, Jinsung Yoon, Sercan Arik, Madeleine Udell, Tomas Pfister
The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023) (paper)
Anomaly detection for tabular data with internal contrastive learning.
Tom Shenkar, Lior Wolf
The 10th International Conference on Learning Representations (ICLR 2022)
(paper)

1.3 Reinforcement Learning

"My nose is running." "Are you also coughing?": Building A Medical Diagnosis Agent with Interpretable Inquiry Logics.
Wenge Liu, Yi Cheng, Hao Wang, Jianheng Tang, Yafei Liu, Ruihui Zhao, Wenjie Li, Yefeng Zheng, Xiaodan Liang
International Joint Conference on Artificial Intelligence (IJCAI 2022)

1.4 Temporal Modeling

Cooperative Joint Attentive Network for Patient Outcome Prediction on Irregular Multi-Rate Multivariate Health Data.
Qingxiong Tan, Mang Ye, Grace Lai-Hung Wong, Pong Chi Yuen
International Joint Conference on Artificial Intelligence (IJCAI), 2021, pp. 1586–1592.

1.7 Clinical Agent

From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs
Pellegrini, Chantal and Özsoy, Ege and Bani-Harouni, David and Keicher, Matthias and Navab, Nassir
arXiv (2025)
ColaCare: Enhancing Electronic Health Record Modeling Through Large Language Model-Driven Multi-Agent Collaboration
Zixiang Wang, Yinghao Zhu, Huiya Zhao, Xiaochen Zheng, Dehao Sui, Tianlong Wang, Wen Tang, Yasha Wang, Ewen Harrison, Chengwei Pan, Junyi Gao, Liantao Ma
Proceedings of the ACM on Web Conference (WWW) (2025)
CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering
Yuren Mao, Wenyi Xu, Yuyang Qin, Yunjun Gao
arXiv 2025
MedRAX: Medical Reasoning Agent for Chest X-ray
Fallahpour, Adibvafa and Ma, Jun and Munim, Alif and Lyu, Hongwei and Wang, Bo
Forty-Second International Conference on Machine Learning (ICML) 2025
MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications
Jiang, Yixing and Black, Kameron C and Geng, Gloria and Park, Danny and Ng, Andrew Y and Chen, Jonathan H
arXiv (2025)
MedAgent-Pro: Towards Multi-Modal Evidence-Based Medical Diagnosis via Reasoning Agentic Workflow
Wang, Ziyue and Wu, Junde and Low, Chang Han and Jin, Yueming
arXiv (2025)
CoD: Towards an Interpretable Medical Agent Using Chain of Diagnosis
Junying Chen, Chi Gui, Anningzhe Gao, Ke Ji, Xidong Wang, Xiang Wan, Benyou Wang
arXiv preprint arXiv:2407.13301 (2024)
MMedAgent: Learning to Use Medical Tools with Multi-Modal Agent
Binxu Li, Tiankai Yan, Yuanting Pan, Jie Luo, Ruiyang Ji, Jiayuan Ding, Zhe Xu, Shilong Liu, Haoyu Dong, Zihao Lin, Yixin Wang
Findings of the Association for Computational Linguistics (EMNLP-Findings 2024)
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records
Shi, Wenqi and Xu, Ran and Zhuang, Yuchen and Yu, Yue and Zhang, Jieyu and Wu, Hang and Zhu, Yuanda and Ho, Joyce and Yang, Carl and Wang, May Dongmei
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2024)
AgentClinic: A Dynamic Multi-Agent Environment for Healthcare Diagnosis and Treatment Simulations
Samuel Schmidgall, Rojin Ziaei, Carl Harris, Eduardo Reis, Jeffrey Jopling, Michael Moor
Arxiv 2024

others:Benchmark

AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments
Schmidgall, Samuel and Ziaei, Rojin and Harris, Carl and Reis, Eduardo and Jopling, Jeffrey and Moor, Michael
arxiv 2025
ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification
Ziqing Fan, Cheng Liang, Chaoyi Wu, Ya Zhang, Yanfeng Wang, Weidi Xie
Arxiv 2025

1.3 Learning with External Knowledge

1.3.1 Learning with Good Model Initialization

XTab: Cross-table Pretraining for Tabular Transformers. (paper)

Numerical Tuple Extraction from Tables with Pre-training. (paper)
Qingping Yang, Yixuan Cao, Yingming Hu, Jianfeng Li, Nanbo Peng, Ping Luo.
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
Well-tuned Simple Nets Excel on Tabular Datasets. (paper)
Arlind Kadra, Marius Lindauer, Frank Hutter, Josif Grabocka
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)

1.3.1 Learning with Knowledge Graph

High dimensional, tabular deep learning with an auxiliary knowledge graph. (paper)
Camilo Ruiz, Hongyu Ren, Kexin Huang, Jure Leskovec
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
External Knowledge Infusion for Tabular Pre-training Models with Dual-adapters. (paper)
Can Qin, Sungchul Kim, Handong Zhao, Tong Yu, Ryan A. Rossi, Yun Fu.
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)

1.3.2 Learning with Large Language Models

TableRAG: Million-Token Table Understanding with Language Models.
Si-An Chen, Lesly Miculicich, Julian Eisenschlos, Zifeng Wang, Zilong Wang, Yanfei Chen, YASUHISA FUJII, Hsuan-Tien Lin, Chen-Yu Lee, Tomas Pfister.
The 38th Annual Conference on Neural Information Processing Systems (Neurips 2024)
Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning.
Sungwon Han, Jinsung Yoon, Sercan Arik, Tomas Pfister
The 41th International Conference on Machine Learning (ICML 2024)
OpenTab: Advancing Large Language Models as Open-domain Table Reasoners. (paper)
Kezhi Kong, Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Chuan Lei, Christos Faloutsos, Huzefa Rangwala, George Karypis
The 12th International Conference on Learning Representations (ICLR 2024)
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources. (paper)
Xingxuan Li, Ruochen Zhao, Yew Ken Chia, Bosheng Ding, Shafiq Joty, Soujanya Poria, Lidong Bing
The 12th International Conference on Learning Representations (ICLR 2024)
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models.
Cao, Weiwei; Zhang, Jianpeng; Xia, Yingda; Mok, Tony CW; Li, Zi; Ye, Xianghua; Lu, Le; Zheng, Jian; Tang, Yuxing; Zhang, Ling
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2024)
A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities.
Hamamci, Ibrahim Ethem; Er, Sezgin; Almas, Furkan; Simsek, Ayse Gulnihan; Esirgun, Sevval Nil; Dogan, Irem; Dasdelen, Muhammed Furkan; Wittmann, Bastian; Simsar, Enis; Simsar, Mehmet; others
arXiv preprint arXiv:2403.17834 (2024)
Merlin: A Vision Language Foundation Model for 3D Computed Tomography.
Blankemeier, Louis; Cohen, Joseph Paul; Kumar, Ashwin; Van Veen, Dave; Gardezi, Syed Jamal Safdar; Paschali, Magdalini; Chen, Zhihong; Delbrouck, Jean-Benoit; Reis, Eduardo; Truyts, Cesar; others
arXiv preprint arXiv:2406.06512 (2024)
Making Pre-trained Language Models Great on Tabular Prediction. (paper)
Jiahuan Yan, Bo Zheng, Hongxia Xu, Yiheng Zhu, Danny Chen, Jimeng Sun, Jian Wu, Jintai Chen
The 12th International Conference on Learning Representations (ICLR 2024)
UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science. (paper)
Yazheng Yang, Yuqi Wang, Guang Liu, Ledell Wu, Qi Liu
The 12th International Conference on Learning Representations (ICLR 2024)
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding. (paper)
Zilong Wang, Hao Zhang, Chun-Liang Li, Julian Martin Eisenschlos, Vincent Perot, Zifeng Wang, Lesly Miculicich, Yasuhisa Fujii, Jingbo Shang, Chen-Yu Lee, Tomas Pfister
The 12th International Conference on Learning Representations (ICLR 2024)
Parameterizing Context: Unleashing the Power of Parameter-Efficient Fine-Tuning and In-Context Tuning for Continual Table Semantic Parsing. (paper)
Yongrui Chen, Shenyu Zhang, Guilin Qi, Xinnan Guo
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Trompt: Towards a Better Deep Neural Network for Tabular Data. (paper)
Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Ting-Wei Chen, Tien-Hao Chang
The 40th International Conference on Machine Learning (ICML 2023)
Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering.(paper)
Noah Hollmann, Samuel Müller, Frank Hutter
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs.
Zhang, Sheng; Xu, Yanbo; Usuyama, Naoto; Xu, Hanwen; Bagga, Jaspreet; Tinn, Robert; Preston, Sam; Rao, Rajesh; Wei, Mu; Valluri, Naveen; others
arXiv preprint arXiv:2303.00915 (2023)
Learning to exploit temporal structure for biomedical vision-language processing.
Bannur, Shruthi; Hyland, Stephanie; Liu, Qianchu; Perez-Garcia, Fernando; Ilse, Maximilian; Castro, Daniel C; Boecking, Benedikt; Sharma, Harshita; Bouzid, Kenza; Thieme, Anja; others
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2023)
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis.
Wu, Chaoyi; Zhang, Xiaoman; Zhang, Ya; Wang, Yanfeng; Xie, Weidi
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Knowledge-enhanced visual-language pre-training on chest radiology images.
Zhang, Xiaoman; Wu, Chaoyi; Zhang, Ya; Xie, Weidi; Wang, Yanfeng
Nature Communications (2023)
Language Models are Weak Learners. (paper)
Hariharan Manikandan, Yiding Jiang, J Zico Kolter
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
DyRRen: A Dynamic Retriever-Reranker-Generator Model for Numerical Reasoning over Tabular and Textual Data. (paper)
Xiao Li, Yin Zhu, Sichen Liu, Jiangzhou Ju, Yuzhong Qu, Gong Cheng
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions. (paper)
Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions. (paper)
Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
TAPEX: Table Pre-training via Learning a Neural SQL Executor.(paper)
Qian Liu, Bei Chen, Jiaqi Guo, Morteza Ziyadi, Zeqi Lin, Weizhu Chen, Jian-Guang Lou
The 10th International Conference on Learning Representations (ICLR 2022)
Contrastive learning of medical visual representations from paired images and text.
Zhang, Yuhao; Jiang, Hang; Miura, Yasuhide; Manning, Christopher D; Langlotz, Curtis P
Machine Learning for Healthcare Conference (2022)
Making the most of text semantics to improve biomedical vision language processing.
Boecking, Benedikt; Usuyama, Naoto; Bannur, Shruthi; Castro, Daniel C; Schwaighofer, Anton; Hyland, Stephanie; Wetscherek, Maria; Naumann, Tristan; Nori, Aditya; Alvarez-Valle, Javier; others
European Conference on Computer Vision (2022)
Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning.
Tiu, Ekin; Talius, Ellie; Patel, Pujan; Langlotz, Curtis P; Ng, Andrew Y; Rajpurkar, Pranav
Nature Biomedical Engineering (2022)
Joint learning of localized representations from medical images and reports.
Müller, Philip; Kaissis, Georgios; Zou, Congyu; Rueckert, Daniel
European Conference on Computer Vision (2022)

1.4 Causal Representation Learning

TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second. (paper)
Noah Hollmann, Samuel Müller, Katharina Eggensperger, Frank Hutter
The 11th International Conference on Learning Representations (ICLR 2023)
Towards Out-of-Distribution Sequential Event Prediction: A Causal Treatment. (paper)
Chenxiao Yang, Qitian Wu, Qingsong Wen, Zhiqiang Zhou, Liang Sun, Junchi Yan
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation. (paper)
Ioana Bica, Mihaela van der Schaar
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes. (paper)
Zhaozhi Qian, Yao Zhang, Ioana Bica, Angela Mary Wood, Mihaela van der Schaar
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2021)

Branch 2: Downstream Tasks

2.1 Generation

2.1.1 GAN-based Models

Invertible Tabular GANs: Killing Two Birds with One Stone for Tabular Data Synthesis. (paper)
JAEHOON LEE, Jihyeon Hyeong, Jinsung Jeon, Noseong Park, Jihoon Cho
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks. (paper)
Boris van Breugel, Trent Kyono, Jeroen Berrevoets, Mihaela van der Schaar
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)

2.1.2 VAE-based Models

A Learnable Discrete-Prior Fusion Autoencoder with Contrastive Learning for Tabular Data Synthesis. (paper)
Rongchao Zhang, Yiwei Lou, Dexuan Xu, Yongzhi Cao, Hanpin Wang, Yu Huang
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)

2.1.3 Diffusion-based Models

Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees. (paper)
Alexia Jolicoeur, Martineau Kilian Fatras, Tal Kachmana Rangwala, George Karypis
(AISTATS 2024)
Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space.(paper)
Hengrui Zhang, Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Xiao Qin, Christos Faloutsos, Huzefa Rangwala, George Karypis
The 12th International Conference on Learning Representations (ICLR 2024)
A Flexible Generative Model for Heterogeneous Tabular EHR with Missing Modality. (paper)
Huan He, William hao, Yuanzhe Xi, Yong Chen, Bradley Malin, Joyce Ho
The 12th International Conference on Learning Representations (ICLR 2024)
CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis. (paper)
Chaejeong Lee, Jayoung Kim, Noseong Park
The 40th International Conference on Machine Learning (ICML 2023)
TabDDPM: Modelling Tabular Data with Diffusion Models. (paper)
Akim Kotelnikov, Dmitry Baranchuk, Ivan Rubachev, Artem Babenko
The 40th International Conference on Machine Learning (ICML 2023)
Concrete Score Matching: Generalized Score Matching for Discrete Data. (paper)
Chenlin Meng, Kristy Choi, Jiaming Song, Stefano Ermon
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
SOS: Score-based Oversampling Minor Classes for Tabular Data. (paper)
jayoung kim, ChaeJeong Lee,Yehjin Shin, Sewon Park, Minjung Kim, Noseong Park, Jihoon Cho
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)

2.1.4 Transformer-based

ReMasker: Imputing Tabular Data with Masked Autoencoding. (paper)
Tianyu Du, Luca Melis, Ting Wang
The 12th International Conference on Learning Representations (ICLR 2024)
TabMT: Generating tabular data with masked transformers. (paper)
Manbir S Gulati, Paul F Roysdon
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Realtabformer: Generating realistic relational and tabular data using transformers. (paper)
Aivin V. Solatorio, Olivier Dupriez
(Arxiv 2023)
Fata-trans: Field and time-aware transformer for sequential tabular data. (paper)
Dongyu Zhang, Liang Wang, Xin Dai, Shubham Jain, Junpeng Wang, Yujie Fan, Chin-Chia Michael Yeh, Yan Zheng, Zhongfang Zhuang, Wei Zhang
The 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

2.1.5 Large Language Model-based

CuTS: Customizable Tabular Synthetic Data Generation.
Mark Vero, Mislav Balunovic, Martin Vechev
The 41th International Conference on Machine Learning (ICML 2024)
Language-Interfaced Tabular Oversampling via Progressive Imputation and Self-Authentication. (paper) June Yong Yang, Geondo Park, Joowon Kim, Hyeongwon Jang, Eunho Yang
The 12th International Conference on Learning Representations (ICLR 2024)
Language Models are Realistic Tabular Data Generators. (paper)
Vadim Borisov, Kathrin Seßler2, Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci
The 11th International Conference on Learning Representations (ICLR 2023)
Generative Table Pre-training Empowers Models for Tabular Prediction. (paper)
Tianping Zhang, Shaowen Wang, Shuicheng Yan, Jian Li, Qian Liu
(EMNLP 2023)
nBIIG: A Neural BI Insights Generation System for Table Reporting.
Yotam Perlitz, Dafna Sheinwald, Noam Slonim, Michal Shmueli-Scheuer
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023) (paper)

2.1.6 Model-agnostic

How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data.
Mihaela C Stoian, Salijona Dyrmishi, Maxime Cordy, Thomas Lukasiewicz, Eleonora Giunchiglia
The 12th International Conference on Learning Representations (ICLR 2024, interesting!) (paper)

2.2 Anomaly Detection

Beyond Individual Input for Deep Anomaly Detection on Tabular Data.
Hugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Liên DOAN
The 41st International Conference on Machine Learning (ICML 2024).
MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data.
Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou, Xiangchao Wang, Jie Yang
The 12th International Conference on Learning Representations (ICLR 2024).(paper)
SemanticMask: A Contrastive View Design for Anomaly Detection in Tabular Data. (paper)
Shuting Tao, Tongtian Zhu, Hongwei Wang, Xiangming Meng
the 33th International Joint Conference on Artificial Intelligence (IJCAI 2024).

2.3 Transfer Learning

Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images.
Jun-Peng Jiang, Han-Jia Ye, Leye Wang, Yang Yang, Yuan Jiang, De-Chuan Zhan.
The 41st International Conference on Machine Learning (ICML 2024).
CARTE: Pretraining and Transfer for Tabular Learning.
Myung Jun Kim, Leo Grinsztajn, Gael Varoquaux
The 41st International Conference on Machine Learning (ICML 2024).
TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules.
Weijieying Ren, Xiaoting Li, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant Honavar
The 41st International Conference on Machine Learning (ICML 2024).
Towards Cross-Table Masked Pretraining for Web Data Mining.
Chao Ye, Guoshan Lu, Haobo Wang, Liyao Li, Sai Wu, Gang Chen, Junbo Zhao
(WWW 2024).(paper)
MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data. (paper)
Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou, Xiangchao Wang, Jie Yang
The 12th International Conference on Learning Representations (ICLR 2024).
XTab: Cross-table Pretraining for Tabular Transformers. (paper)
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran
The 40th International Conference on Machine Learning (ICML 2023)
TransTab: Learning Transferable Tabular Transformers Across Tables. (paper)
Zifeng Wang, Jimeng Sun.
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
External Knowledge Infusion for Tabular Pre-training Models with Dual-adapters. (paper)
Can Qin, Sungchul Kim, Handong Zhao, Tong Yu, Ryan A. Rossi, YUN FU.
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation.
Ioana Bica, Mihaela van der Schaar
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022) (paper)

2.4 Explanation/Model Assesment

Interpretable Deep Clustering for Tabular Data.
Jonathan Svirsky, Ofir Lindenbaum
The 41st International Conference on Machine Learning (ICML 2024)
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation.
Jacob Si, Wendy Yusi Cheng, Michael Cooper, Rahul G. Krishnan
The 41st International Conference on Machine Learning (ICML 2024)
Do Machine Learning Models Learn Statistical Rules Inferred from Data? (paper)
Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong
The 40th International Conference on Machine Learning (ICML 2023)
An Inductive Bias for Tabular Deep Learning. (paper)
Ege Beyazit, Jonathan Kozaczuk, Bo Li, Vanessa Wallace, Bilal H Fadlallah
The 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
TabCBM: Concept-based Interpretable Neural Networks for Tabular Data. (paper)
Mateo Espinosa Zarlenga, Zohreh Shams, Michael Edward Nelson, Been Kim, and Mateja Jamnik (TMLR 2023)
Tabular Data: Deep Learning is Not All You Need. (paper)
Ravid Shwartz-Ziv, Amitai Armon.
(Information Fusion 2022)
Revisiting Deep Learning Models for Tabular Data. (paper)
Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2021)

2.5: Retrieval

TabR: Tabular Deep Learning Meets Nearest Neighbors.(paper)
Yury Gorishniy, Ivan Rubachev, Nikolay Kartashev, Daniil Shlenskii, Akim Kotelnikov, Artem Babenko
The 12th International Conference on Learning Representations (ICLR 2024)
Dense Representation Learning and Retrieval for Tabular Data Prediction.(paper)
Lei Zheng, Ning Li, Xianyu Chen, Quan Gan, Weinan Zhang
The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

2.5: Efficiency

TinyLUT: Tiny Look-Up Table for Efficient Image Restoration at the Edge.(paper)
Huanan LI, Juntao Guan, Lai Rui, Sijun Ma, Lin Gu, Noperson
The 38th Annual Conference on Neural Information Processing Systems (Neurips 2024)
FHyperFast: Instant Classification for Tabular Data.(paper)
David Bonet; Daniel Mas Montserrat; Xavier Giró-i-Nieto; Alexander Ioannidis
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second. (paper)
Noah Hollmann, Samuel Müller, Katharina Eggensperger, Frank Hutter
The 11th International Conference on Learning Representations (ICLR 2023)
Clustering the Sketch: Dynamic Compression for Embedding Tables. (paper)
Henry Tsang, Thomas Dybdahl Ahle
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Compressing Tabular Data via Latent Variable Estimation. (paper)
Andrea Montanari, Eric Weiner
The 40th International Conference on Machine Learning (ICML 2023)

Branch 3: Application

3.1 Clinical Tabular Data

EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records.
Yeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi.
The 38th Annual Conference on Neural Information Processing Systems (Neurips 2024)
Contrastive Learning for Clinical Outcome Prediction with Partial Data Sources.
Xia, Jonathan Wilson, Benjamin Goldstein, Ricardo Henao.
The 41th International Conference on Machine Learning (ICML 2024)
Safe Exploration in Dose Finding Clinical Trials with Heterogeneous Participants.
Isabel Chien, Wessel Bruinsma, Javier Gonzalez, Richard E Turner.
The 41th International Conference on Machine Learning (ICML 2024)
Weight Predictor Network with Feature Selection for Small Sample Tabular Biomedical Data. (paper)
Andrei Margeloiu, Nikola Simidjievski, Pietro Liò, Mateja Jamnik.
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling.(paper)
Xinlu Zhang, Shiyang Li, Zhiyu Chen, Xifeng Yan, Linda Ruth Petzold
The 40th International Conference on Machine Learning (ICML 2023)
Towards Semi-Structured Automatic ICD Coding via Tree-based Contrastive Learning. (paper)
Chang Lu, Chandan K. Reddy, Ping Wang, Yue Ning
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
MOTOR: A Time-to-Event Foundation Model For Structured Medical Records. (paper)
Ethan Steinberg, Jason Alan Fries, Yizhe Xu, Nigam Shah
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
Locally Sparse Neural Networks for Tabular Biomedical Data. (paper)
Junchen Yang, Ofir Lindenbaum, Yuval Kluger
The 39th International Conference on Machine Learning (ICML 2022)
Evaluating Latent Space Robustness and Uncertainty of EEG-ML Models under Realistic Distribution Shifts.(paper)
Neeraj Wagh, Jionghao Wei, Samarth Rawal, Brent M. Berry, Yogatheesan Varatharajah
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Debiased, Longitudinal and Coordinated Drug Recommendation through Multi-Visit Clinic Records. (paper)
Hongda Sun, Shufang Xie, Shuqi Li, Yuhan Chen, Ji-Rong Wen, Rui Yan
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions. (paper)
Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis. (paper)
Siyi Tang, Jared Dunnmon, Khaled Kamal Saab, Xuan Zhang, Qianying Huang, Florian Dubost, Daniel Rubin, Christopher Lee-Messer
The 10th International Conference on Learning Representations (ICLR 2022)
SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes. (paper)
Zhaozhi Qian, Yao Zhang, Ioana Bica, Angela Mary Wood, Mihaela van der Schaar
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2021)

3.2 Financial Tabular Data

Beyond Pure Text: Summarizing Financial Reports Based on Both Textual and Tabular Data. (paper)
Ziao Wang, Zelin Jiang, Xiaofeng Zhang, Jaehyeon Soon, Jialu Zhang, Wang Xiaoyao, Hongwei Du
The 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

Existing Surveys

Explainable Artificial Intelligence for Tabular Data: A Survey
Sahakyan M, Aung Z, Rahwan T
IEEE Access 2021
The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey
Sauber-Cole, Rick, and Taghi M. Khoshgoftaar
Journal of Big Data 2022
Deep Neural Networks and Tabular Data: A Survey.(paper)
Vadim Borisov, Tobias Leemann, Kathrin Seßler, Johannes Haug, Martin Pawelczyk, Gjergji Kasneci
IEEE transactions on neural networks and learning systems 2022
Synthetic data generation for tabular health records: A systematic review
Hernandez, M., Epelde, G., Alberdi, A., Cilla, R., & Rankin, D.
Neurocomputing 2022
Embeddings for Tabular Data: A Survey. (paper)
Rajat Singh, Srikanta Bedathur
Arxiv 2023
Transformers for Tabular Data Representation: A Survey of Models and Applications. (paper)
Gilbert Badaro, Mohammed Saeed, Paolo Papotti
TACL 2023
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data. (paper)
Wei-Yao Wang1, Wei-Wei Du2,, Derek Xu, Wei Wang, Wen-Chih Peng
Arxiv 2024
Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution
Yucheng Ruan, Xiang Lan, Jingying Ma, Yizhi Dong, Kai He, Mengling Feng
Arvix 2024
Large language models (LLMs) on tabular data: Prediction, generation, and understanding - a survey
Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun (Jane) Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, "SHS", Christos Faloutsos
Arvix 2024

Tools & Libraries

Pytorch Frame: A modular deep learning framework for building neural network models on heterogeneous tabular data. (paper)
PyTorch Tabular: A Framework for Deep Learning with Tabular Data.

Last updated on March 05, 2024. (For problems, contact wjr5337@psu.edu. To add papers, please pull request at our repo)