Deep Learning for Electronic Health Records: Foundations, Challenges, Advances and Future Directions
Builder & Current Maintainer: Weijieying Ren, YuQing Huang, Jingxi Zhu, Zehao Liu,Tianxiang Zhao and Prof. Vasant Honavar.
Paper List
We have summarized the main branches of works for Deep tabular data representation learning, including its downstream tasks and applications. For more details, please refer to our recent survey (paper).
Branch 1: Data-Centric Approaches
Branch 2: Neural Modeling Strategies
2.1 Feature-Aware Modules
2.1.1 Discretization and Binning-Based Methods
-
Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular Domains.
Kyungeun Lee, Ye Seul Sim, Hye-Seung Cho, Moonjung Eo, Suhee Yoon, Sanghyu Yoon, Woohyung Lim
The 41st International Conference on Machine Learning (ICML 2024)
-
TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules.
Weijieying Ren, Xiaoting Li, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant Honavar
The 41st International Conference on Machine Learning (ICML 2024)
-
Byzantine-Robust Learning on Heterogeneous Datasets via Bucketing. (paper)
Sai Praneeth Karimireddy, Lie He, Martin Jaggi
The 10th International Conference on Learning Representations (ICLR 2022)
-
On Embeddings for Numerical Features in Tabular Deep Learning. (paper)
Yury Gorishniy, Ivan Rubachev, Artem Babenko
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2022)
-
Learning Binary Decision Trees by Argmin Differentiation. (paper)
Valentina Zantedeschi, Matt J. Kusner, Vlad Niculae
The 39th International Conference on Machine Learning (ICML 2022)
2.1.2 Kernel-Based Methods
-
Feature Learning for Interpretable, Performant Decision Trees. (paper)
Jack H. Good, Torin Kovach, Kyle Miller, Artur Dubrawski
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
-
Kernel Density Decision Trees. (paper)
Jack H. Good, Kyle Miller, Artur Dubrawski
The 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
-
Heterogeneous Risk Minimization. (paper)
Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen
The 38th International Conference on Machine Learning (ICML 2021)
-
Kernelized Heterogeneous Risk Minimization. (paper)
Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen
The 35th Annual Conference on Neural Information Processing Systems (NeurIPS 2021)
2.2 Model Architecture Design
2.2.1 Tree-based
-
GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data. (paper)
Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt
The 12st International Conference on Learning Representations (ICLR 2024)
-
POETREE: Interpretable Policy Learning with Adaptive Decision Trees. (paper)
Alizée Pace, Alex Chan, Mihaela van der Schaar
The 10th International Conference on Learning Representations (ICLR 2022)
-
Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees. (paper)
Jonathan Brophy, Daniel Lowd
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- Local Contrastive Feature Learning for Tabular Data. (paper)
Zhabiz Gharibshah, Xingquan Zhu.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM 2022)
-
Subgroup Robustness Grows On Trees: An Empirical Baseline Investigation. (paper)
Joshua P Gardner, Zoran Popovi, Ludwig Schmidt
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2022)
2.2.2 Graph-based
-
Time-Aware Medication Recommendation via Intervention of Dynamic Treatment Regimes.
Yishuo Li, Qi Zhang, Wenpeng Lu, Xueping Peng, Weiyu Zhang, Jiasheng Si, Yongshun Gong, Liang Hu
Proceedings of the ACM on Web Conference 2025
-
HI-DR: Exploiting Health Status-Aware Attention and an EHR Graph+ for Effective Medication Recommendation.
Taeri Kim, Jiho Heo, Hyunjoon Kim, Sang-Wook Kim
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2025)
-
Functional Graph Convolutional Networks: A Unified Multi-Task and Multi-Modal Learning Framework to Facilitate Health and Social-Care Insights.
Tobia Boschi, Francesca Bonin, Rodrigo Ordonez-Hurtado, Cécile Rousseau, Alessandra Pascale, John Dinsmore
arXiv preprint arXiv:2403.10158 (2024)
-
Graph transformers on EHRs: Better representation improves downstream performance.
Raphael Poulain, Rahmatollah Beheshti
The 12th International Conference on Learning Representations (ICLR 2024)
-
GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data.
Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt
The 12th International Conference on Learning Representations (ICLR 2024)
-
On Class Distributions Induced by Nearest Neighbor Graphs for Node Classification of Tabular Data. (paper)
Federico Errica
The 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
-
GOGGLE: Generative Modelling for Tabular Data by Learning Relational Structure. (paper)
Tennison Liu, Zhaozhi Qian, Jeroen Berrevoets, Mihaela van der Schaar
The 11th International Conference on Learning Representations (ICLR 2023)
-
HyTrel: Hypergraph-enhanced Tabular Data Representation Learning. (paper)
Pei Chen, Soumajyoti Sarkar, Leonard Lausen, Balasubramaniam Srinivasan, Sheng Zha, Ruihong Huang, George Karypis
The 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
-
T2G-FORMER: Organizing Tabular Features into Relation Graphs Promotes Heterogeneous Feature Interaction. (paper)
Jiahuan Yan, Jintai Chen, Yixuan Wu, Danny Z. Chen, Jian Wu
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
-
Graph-text multi-modal pre-training for medical representation learning.
Sungjin Park, Seongsu Bae, Jiho Kim, Tackeun Kim, Edward Choi
Conference on Health, Inference, and Learning (CHIL 2022)
-
Context-Aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs.
Chang Lu, Tian Han, Yue Ning
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2022)
-
Graph representation learning in biomedicine and healthcare. (paper)
Marinka Li, Kevin Huang, Marinka Zitnik
Nature Biomedical Engineering, 6(12):1186–1197, 2022
-
Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features. (paper)
Jiuhai Chen, Jonas Mueller, Vassilis N. Ioannidis, Soji Adeshina, Yangkun Wang, Tom Goldstein, David Wipf
The 10th International Conference on Learning Representations (ICLR 2022)
-
Collaborative Graph Learning with Auxiliary Text for Temporal Event Prediction in Healthcare.
Chang Lu, Chandan K. Reddy, Prithwish Chakraborty, Samantha Kleinberg, Yue Ning
Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)
-
Collaborative graph learning with auxiliary text for temporal event prediction in healthcare. (paper)
Chang Lu, Chandan K Reddy, Prithwish Chakraborty, Samantha Kleinberg, Yue Ning
arXiv preprint arXiv:2105.07542, 2021
-
Variationally regularized graph-based representation learning for electronic health records.
Weicheng Zhu, Narges Razavian
Proceedings of the Conference on Health, Inference, and Learning (CHIL 2021)
-
Leveraging graph-based hierarchical medical entity embedding for healthcare applications.
Tong Wu, Yunlong Wang, Yue Wang, Emily Zhao, Yilian Yuan
Scientific Reports, 11(1):5858, Nature Publishing Group, 2021
-
Harmonized representation learning on dynamic EHR graphs.
Dongha Lee, Xiaoqian Jiang, Hwanjo Yu
Journal of Biomedical Informatics, 106:103426, Elsevier, 2020
-
Learning the graphical structure of electronic health records with graph convolutional transformer. (paper)
Edward Choi, Zhen Xu, Yujia Li, Michael W Dusenberry, Glenn Flores, Ping Xie, Andrew M Dai
Proceedings of the AAAI Conference on Artificial Intelligence, 2020
-
Pre-training of graph augmented transformers for medication recommendation. (paper)
Junyuan Shang, Tengfei Ma, Cao Xiao, Jimeng Sun
arXiv preprint arXiv:1906.00346, 2019
-
GRAM: graph-based attention model for healthcare representation learning.
Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F Stewart, Jimeng Sun
The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2017)
2.2.3 Rule-based Models
-
ConSequence: Synthesizing Logically Constrained Sequences for Electronic Health Record Generation.
Brandon Theodorou, Shrusti Jain, Cao Xiao, Jimeng Sun.
In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 14, pp. 15355–15363, 2024.
-
TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules.
Weijieying Ren, Xiaoting Li, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant G. Honavar.
In Forty-first International Conference on Machine Learning (ICML), 2024.
-
Do Machine Learning Models Learn Statistical Rules Inferred from Data?
Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong.
In International Conference on Machine Learning (ICML), pp. 25677–25693, 2023. PMLR.
-
Logical Activation Functions: Logit-space Equivalents of Probabilistic Boolean Operators.
Scott Lowe, Robert Earle, Jason d'Eon, Thomas Trappenberg, Sageev Oore.
Advances in Neural Information Processing Systems (NeurIPS), vol. 35, pp. 29733–29747, 2022.
2.2.4 Additive-model-based
-
Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning.
Yi Cheng, Renjun Hu, Haochao Ying, Xing Shi, Jian Wu, Wei Lin
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
-
Towards Hybrid-Grained Feature Interaction Selection for Deep Sparse Network.
Fuyuan Lyu, Xing Tang, Dugang Liu, Chen Ma, Weihong Luo, Liang Chen, Xue Steve Liu, et al.
Advances in Neural Information Processing Systems (NeurIPS 2023)
-
GRAND-SLAMIN’: Interpretable Additive Modeling with Structural Constraints.
Shibal Ibrahim, Gabriel Afriat, Kayhan Behdin, Rahul Mazumder
Advances in Neural Information Processing Systems (NeurIPS 2023)
-
Sparse Interaction Additive Networks via Feature Interaction Detection and Sparse Selection.
James Enouen, Yan Liu
Advances in Neural Information Processing Systems (NeurIPS 2022)
2.2.5 Hierarchical and Structured Temporal Models
-
Graph transformers on EHRs: Better representation improves downstream performance.
Raphael Poulain, Rahmatollah Beheshti
The 12th International Conference on Learning Representations (ICLR 2024)
-
Transformer-Based Deep Learning Model for the Diagnosis of Suspected Lung Cancer in Primary Care Based on Electronic Health Record Data.
Lan Wang, Yonghua Yin, Ben Glampson, Robert Peach, Mauricio Barahona, Brendan C. Delaney, Erik K. Mayer
EBioMedicine (2024)
- Polynomial-based Self-Attention for Table Representation Learning.
Jayoung Kim, Yehjin Shin, Jeongwhan Choi, Hyowon Wi, Noseong Park
The 41th International Conference on Machine Learning (ICML 2024)
- Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning. (paper)
Yi Cheng; Renjun Hu; Haochao Ying; Xing Shi; Jian Wu; Wei Lin
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
-
TransformEHR: Transformer-Based Encoder-Decoder Generative Model to Enhance Prediction of Disease Outcomes Using Electronic Health Records.
Zhichao Yang, Avijit Mitra, Weisong Liu, Dan Berlowitz, Hong Yu
Nature Communications (2023)
- Divide Rows and Conquer Cells: Towards Structure Recognition for Large Tables.(paper)
Huawen Shen, Xiang Gao, Jin Wei, Liang Qiao, Yu Zhou, Qiang Li, Zhanzhan Cheng
the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)
- DIWIFT: Discovering Instance-wise Influential Features for Tabular Data.
Dugang Liu, Pengxiang Cheng, Hong Zhu, Xing Tang, Yanyu Chen, Xiaoting Wang, Weike Pan, Zhong Ming, Xiuqiang He
(TheWebConf 2023) (paper)
-
DL-BERT: A Time-Aware Double-Level BERT-Style Model with Pre-Training for Disease Prediction.
Xianlai Chen, Jiamiao Lin, Ying An
2022 IEEE International Conference on Big Data (Big Data)
-
Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records.
Raphael Poulain, Mehak Gupta, Rahmatollah Beheshti
Machine Learning for Healthcare Conference (MLHC 2022)
-
Med-BERT: Pretrained Contextualized Embeddings on Large-Scale Structured Electronic Health Records for Disease Prediction.
Laila Rasmy, Yang Xiang, Ziqian Xie, Cui Tao, Degui Zhi
NPJ Digital Medicine (2021)
-
RareBERT: Transformer Architecture for Rare Disease Patient Identification Using Administrative Claims.
PKS Prakash, Srinivas Chilukuri, Nikhil Ranade, Shankar Viswanathan
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2021)
-
Transformer-Based Multi-Target Regression on Electronic Health Records for Primordial Prevention of Cardiovascular Disease.
Raphael Poulain, Mehak Gupta, Randi Foraker, Rahmatollah Beheshti
2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
-
Communication Efficient Federated Generalized Tensor Factorization for Collaborative Health Data Analytics.
Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Joyce C. Ho
Proceedings of the Web Conference (WWW 2021)
-
RAPT: Pre-Training of Time-Aware Transformer for Learning Robust Healthcare Representation.
Houxing Ren, Jingyuan Wang, Wayne Xin Zhao, Ning Wu
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)
-
BEHRT: Transformer for Electronic Health Records.
Yikuan Li, Shishir Rao, José Roberto Ayala Solares, Abdelaali Hassaine, Rema Ramakrishnan, Dexter Canoy, Yajie Zhu, Kazem Rahimi, Gholamreza Salimi-Khorshidi
Scientific Reports (2020)
-
HiTANet: Hierarchical Time-Aware Attention Networks for Risk Prediction on Electronic Health Records.
Junyu Luo, Muchao Ye, Cao Xiao, Fenglong Ma
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2020)
-
Attention Is All You Need.
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin
Advances in Neural Information Processing Systems (NeurIPS 2017)
2.3 Temporal Dependency Modeling
2.3.1 Irregular/Asynchronous Sampling
-
Integrating Sequence and Image Modeling in Irregular Medical Time Series Through Self-Supervised Learning.
Chen, Liuqing, Xiao, Shuhong, Ding, Shixian, Hu, Shanhai, Sun, Lingyun
arXiv preprint arXiv:2502.06134 (2025)
-
SepsisCalc: Integrating Clinical Calculators into Early Sepsis Prediction via Dynamic Temporal Graph Construction.
Yin, Changchang, Fu, Shihan, Yao, Bingsheng, Pham, Thai-Hoang, Cao, Weidan, Wang, Dakuo, Caterino, Jeffrey, Zhang, Ping
arXiv preprint arXiv:2501.00190 (2024)
-
Predictive modeling with temporal graphical representation on electronic health records.
Chen, Jiayuan, Yin, Changchang, Wang, Yuanlong, Zhang, Ping
IJCAI: proceedings of the conference (2024)
-
Dna-t: Deformable neighborhood attention transformer for irregular medical time series.
Huang, Jianxuan, Yang, Baoyao, Yin, Kejing, Xu, Jingwen
IEEE Journal of Biomedical and Health Informatics (2024)
-
A Hybrid Approach for Irregular-Time Series Prediction using Electronic Health Records: an Intensive Care Unit Mortality Case Study.
Zhong, Shaojie, Wang, Li Rong, Zhan, Zhuoxuan, Ng, Yih Yng, Fan, Xiuyi
ACM Transactions on Computing for Healthcare (2024)
-
Contiformer: Continuous-time transformer for irregular time series modeling.
Chen, Yuqi, Ren, Kan, Wang, Yansen, Fang, Yuchen, Sun, Weiwei, Li, Dongsheng
Advances in Neural Information Processing Systems (2023)
-
Improving medical predictions by irregular multimodal electronic health records modeling.
Zhang, Xinlu, Li, Shiyang, Chen, Zhiyu, Yan, Xifeng, Petzold, Linda Ruth
International Conference on Machine Learning (2023)
-
Warpformer: A multi-scale modeling approach for irregular clinical time series.
Zhang, Jiawen, Zheng, Shun, Cao, Wei, Bian, Jiang, Li, Jia
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2023)
-
Time series as images: Vision transformer for irregularly sampled time series.
Li, Zekun, Li, Shiyang, Yan, Xifeng
Advances in Neural Information Processing Systems (2023)
-
Self-supervised transformer for sparse and irregularly sampled multivariate clinical time-series.
Tipirneni, Sindhu, Reddy, Chandan K
ACM Transactions on Knowledge Discovery from Data (TKDD) (2022)
-
Cumulative stay-time representation for electronic health records in medical event time prediction.
Katsuki, Takayuki, Miyaguchi, Kohei, Koseki, Akira, Iwamori, Toshiya, Yanagiya, Ryosuke, Suzuki, Atsushi
arXiv preprint arXiv:2204.13451 (2022)
-
Imputation of missing values for electronic health record laboratory data.
Li, Jiang, Yan, Xiaowei S, Chaudhary, Durgesh, Avula, Venkatesh, Mudiganti, Satish, Husby, Hannah, Shahjouei, Shima, Afshar, Ardavan, Stewart, Walter F, Yeasin, Mohammed, others
NPJ digital medicine (2021)
-
Multi-time attention networks for irregularly sampled time series.
Shukla, Satya Narayan, Marlin, Benjamin M
arXiv preprint arXiv:2101.10318 (2021)
-
Graph-guided network for irregularly sampled multivariate time series.
Zhang, Xiang, Zeman, Marko, Tsiligkaridis, Theodoros, Zitnik, Marinka
arXiv preprint arXiv:2110.05357 (2021)
-
Set functions for time series.
Horn, Max, Moor, Michael, Bock, Christian, Rieck, Bastian, Borgwardt, Karsten
International Conference on Machine Learning (2020)
-
Deep mixed effect model using Gaussian processes: a personalized and reliable prediction for healthcare.
Chung, Ingyo, Kim, Saehoon, Lee, Juho, Kim, Kwang Joon, Hwang, Sung Ju, Yang, Eunho
Proceedings of the AAAI conference on artificial intelligence (2020)
-
Interpolation-prediction networks for irregularly sampled time series.
Shukla, Satya Narayan, Marlin, Benjamin M
arXiv preprint arXiv:1909.07782 (2019)
-
Recurrent neural networks for multivariate time series with missing values.
Che, Zhengping, Purushotham, Sanjay, Cho, Kyunghyun, Sontag, David, Liu, Yan
Scientific reports (2018)
2.3.2 Multi-Timescale Dynamics
-
A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories.
Placido, Davide, Yuan, Bo, Hjaltelin, Jessica X, Zheng, Chunlei, Haue, Amalie D, Chmura, Piotr J, Yuan, Chen, Kim, Jihye, Umeton, Renato, Antell, Gregory, others
Nature Medicine (2023)
-
Modelling long-and short-term multi-dimensional patterns in predictive maintenance with accumulative attention.
Shi, Yong, Zhang, Linzi
Reliability Engineering & System Safety (2023)
-
Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting.
Zhou, Tian, Ma, Ziqing, Wen, Qingsong, Wang, Xue, Sun, Liang, Jin, Rong
International conference on machine learning (2022)
-
Clinical risk prediction with temporal probabilistic asymmetric multi-task learning.
Nguyen, A Tuan, Jeong, Hyewon, Yang, Eunho, Hwang, Sung Ju
Proceedings of the AAAI Conference on Artificial Intelligence (2021)
-
Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting.
Wu, Haixu, Xu, Jiehui, Wang, Jianmin, Long, Mingsheng
Advances in neural information processing systems (2021)
-
Adacare: Explainable clinical health status representation learning via scale-adaptive feature extraction and recalibration.
Ma, Liantao, Gao, Junyi, Wang, Yasha, Zhang, Chaohe, Wang, Jiangtao, Ruan, Wenjie, Tang, Wen, Gao, Xin, Ma, Xinyu
Proceedings of the AAAI Conference on Artificial Intelligence (2020)
-
DeepAlerts: deep learning based multi-horizon alerts for clinical deterioration on oncology hospital wards.
Li, Dingwen, Lyons, Patrick G, Lu, Chenyang, Kollef, Marin
Proceedings of the AAAI Conference on Artificial Intelligence (2020)
-
Stagenet: Stage-aware neural networks for health risk prediction.
Gao, Junyi, Xiao, Cao, Wang, Yasha, Tang, Wen, Glass, Lucas M, Sun, Jimeng
Proceedings of the web conference 2020 (2020)
2.3.3 Conditional Clinical Sequences
-
A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories.
Placido, Davide, Yuan, Bo, Hjaltelin, Jessica X, Zheng, Chunlei, Haue, Amalie D, Chmura, Piotr J, Yuan, Chen, Kim, Jihye, Umeton, Renato, Antell, Gregory, others
Nature medicine (2023)
-
Diaformer: Automatic diagnosis via symptoms sequence generation.
Chen, Junying, Li, Dongfang, Chen, Qingcai, Zhou, Wenxiu, Liu, Xin
Proceedings of the AAAI Conference on Artificial Intelligence (2022)
-
A Novel Sequence-to-Subgraph Framework for Diagnosis Classification.
Chen, Jun, Yuan, Quan, Lu, Chao, Huang, Haifeng
IJCAI (2021)
-
Change matters: Medication change prediction with recurrent residual networks.
Yang, Chaoqi, Xiao, Cao, Glass, Lucas, Sun, Jimeng
arXiv preprint arXiv:2105.01876 (2021)
-
Concare: Personalized clinical feature embedding via capturing the healthcare context.
Ma, Liantao, Zhang, Chaohe, Wang, Yasha, Ruan, Wenjie, Wang, Jiangtao, Tang, Wen, Ma, Xinyu, Gao, Xin, Gao, Junyi
Proceedings of the AAAI conference on artificial intelligence (2020)
-
DeepAlerts: deep learning based multi-horizon alerts for clinical deterioration on oncology hospital wards.
Li, Dingwen, Lyons, Patrick G, Lu, Chenyang, Kollef, Marin
Proceedings of the AAAI Conference on Artificial Intelligence (2020)
2.4 Meta-Architectural Strategies
2.4.1 Meta-Adaptive Modeling
2.4.2 Neural Architecture Search (NAS)
-
TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets. (paper)
Chengrun Yang, Gabriel Bender, Hanxiao Liu, Pieter-Jan Kindermans, Madeleine Udell, Yifeng Lu, Quoc V Le, Da Huang
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
Branch 3: Learning-Focused Approaches
3.1 Self-Supervised Learning
- SwitchTab: Switched Autoencoders Are Effective Tabular Learners. (paper)
Jing Wu; Suiyao Chen; Qi Zhao; Renat Sergazinov; Chen Li; Shengjie Liu.
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)
- STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables.
Jaehyun Nam, Jihoon Tack, Kyungmin Lee, Hankook Lee, Jinwoo Shin
The 11th International Conference on Learning Representations (ICLR 2023)
(paper)
- Self-Supervision Enhanced Feature Selection with Correlated Gates. (paper)
Changhee Lee, Fergus Imrie, Mihaela van der Schaar
The 10th International Conference on Learning Representations (ICLR 2022)
- Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption. (paper)
Dara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler
The 10th International Conference on Learning Representations (ICLR 2022)
- Local Contrastive Feature Learning for Tabular Data. (paper)
Zhabiz Gharibshah, Xingquan Zhu.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM 2022)
- CORE: Self- and Semi-supervised Tabular Learning with COnditional REgularizations. (paper)
Xintian Han, Rajesh Ranganath
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
-
Self-Supervised Adversarial Distribution Regularization for Medication Recommendation.
Yanda Wang, Weitong Chen, Dechang Pi, Lin Yue, Sen Wang, Miao Xu
International Joint Conference on Artificial Intelligence (IJCAI 2021)
- SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning. (paper)
Talip Ucar, Ehsan Hajiramezanali, Lindsay Edwards
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
- VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain. (paper)
Jinsung Yoon, Yao Zhang, James Jordon, Mihaela van der Schaar
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2020)
3.2 Clustering-Based Methods
-
Deep Representation Learning for Clustering Longitudinal Survival Data from Electronic Health Records.
Qiu, Jiajun; Hu, Yao; Li, Li; Erzurumluoglu, Abdullah Mesut; Braenne, Ingrid; Whitehurst, Charles; Schmitz, Jochen; Arora, Jatin; Bartholdy, Boris Alexander; Gandhi, Shrey; et al.
Nature Communications (2025)
-
Identification of Predictive Subphenotypes for Clinical Outcomes Using Real World Data and Machine Learning.
Weishen Pan, Deep Hathi, Zhenxing Xu, Qiannan Zhang, Ying Li, Fei Wang
Nature Communications (2025)
-
Data-driven cluster analysis identifies distinct types of metabolic dysfunction-associated steatotic liver disease.
Raverdy, Violeta, Tavaglione, Federica, Chatelain, Estelle, Lassailly, Guillaume, De Vincentis, Antonio, Vespasiani-Gentilucci, Umberto, Qadri, Sami F, Caiazzo, Robert, Verkindt, Helene, Saponaro, Chiara, others
Nature Medicine (2024)
-
ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data.
Jiang, Xiangjian; Margeloiu, Andrei; Simidjievski, Nikola; Jamnik, Mateja.
Proceedings of the 41st International Conference on Machine Learning (ICML) (2024)
-
Clustering Interval-Censored Time-Series for Disease Phenotyping.
Chen, Irene Y.; Krishnan, Rahul G.; Sontag, David.
Proceedings of the 36th AAAI Conference on Artificial Intelligence (2022)
-
Learning of Cluster-based Feature Importance for Electronic Health Record Time-series.
Aguiar, Henrique; Santos, Mauro D.; Watkinson, Peter J.; Zhu, Tingting.
Proceedings of the 39th International Conference on Machine Learning (ICML) (2022)
-
Longitudinal Patient Stratification of Electronic Health Records with Flexible Adjustment for Clinical Outcomes.
Carr, Oliver; Javer, Avelino; Rockenschaub, Patrick; Parsons, Owen; Durichen, Robert.
Proceedings of Machine Learning for Health (2021)
-
CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age.
Kiyasseh, Dani; Zhu, Tingting; Clifton, David A.
Advances in Neural Information Processing Systems (NeurIPS) (2021)
-
Transfer Learning via Optimal Transportation for Integrative Cancer Patient Stratification.
Ziyu Liu, Wei Shao, Jie Zhang, Min Zhang, Kun Huang
International Joint Conference on Artificial Intelligence (IJCAI), 2021, pp. 2760–2766.
-
Deep Representation Learning of Electronic Health Records to Unlock Patient Stratification at Scale.
Landi, Isotta; Glicksberg, Benjamin S.; Lee, Hao-Chih; Cherng, Sarah; Landi, Giulia; Danieletto, Matteo; Dudley, Joel T.; Furlanello, Cesare; Miotto, Riccardo.
npj Digital Medicine (2020)
-
Data-IQ: Characterizing Subgroups with Heterogeneous Outcomes in Tabular Data.
Seedat, Nabeel; Crabbé, Jonathan; Bica, Ioana; van der Schaar, Mihaela.
Proceedings of the 36th Annual Conference on Neural Information Processing Systems (NeurIPS 2022)
-
Temporal Phenotyping Using Deep Predictive Clustering of Disease Progression.
Lee, Changhee; van der Schaar, Mihaela.
International Conference on Machine Learning (2020)
-
ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data.
Xiangjian Jiang, Andrei Margeloiu, Nikola Simidjievski, Mateja Jamnik
The 41th International Conference on Machine Learning (ICML 2024)
-
PTaRL: Prototype-based Tabular Representation Learning via Space Calibration.
Hangting Ye, Wei Fan, Xiaozhuang Song, Shun Zheng, He Zhao, Dan dan Guo, Yi Chang
The 12th International Conference on Learning Representations (ICLR 2024)
(paper)
-
TabR: Tabular Deep Learning Meets Nearest Neighbors. (paper)
Yury Gorishniy, Ivan Rubachev, Nikolay Kartashev, Daniil Shlenskii, Akim Kotelnikov, Artem Babenko
The 12th International Conference on Learning Representations (ICLR 2024).
-
On Class Distributions Induced by Nearest Neighbor Graphs for Node Classification of Tabular Data. (paper)
Federico Errica
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
-
Learning Enhanced Representation for Tabular Data via Neighborhood Propagation. (paper)
Kounianhua Du, Weinan Zhang, Ruiwen Zhou, Yangkun Wang, Xilong Zhao, Jiarui Jin, Quan Gan, Zheng Zhang, David Wipf
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
3.3 Latent Representation Learning
-
XTab: Cross-table Pretraining for Tabular Transformers.
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran
Proceedings of the 40th International Conference on Machine Learning (ICML 2023)
-
Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation.
Bing Xue, Ahmed Sameh Said, Ziqi Xu, Hanyang Liu, Neel Shah, Hanqing Yang, Philip Payne, Chenyang Lu
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023.
-
Numerical Tuple Extraction from Tables with Pre-training.
Qingping Yang, Yixuan Cao, Yingming Hu, Jianfeng Li, Nanbo Peng, Ping Luo
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
-
Inferring multimodal latent topics from electronic health records.
Yue Li, Pratheeksha Nair, Xing Han Lu, Zhi Wen, Yuening Wang, Amir Ardalan Kalantari Dehaghi, Yan Miao, Weiqi Liu, Tamas Ordog, Joanna M Biernacka, others
Nature Communications (2020)
-
Assisting clinical decisions for scarcely available treatment via disentangled latent representation.
Bing Xue, Ahmed Sameh Said, Ziqi Xu, Hanyang Liu, Neel Shah, Hanqing Yang, Philip Payne, Chenyang Lu
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)
-
Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings.
Klim Kireev, Maksym Andriushchenko, Carmela Troncoso, Nicolas Flammarion
Advances in Neural Information Processing Systems (NeurIPS 2023)
-
Revisiting Deep Learning Models for Tabular Data.
Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko
Advances in Neural Information Processing Systems (NeurIPS 2021)
-
ManyDG: Many-domain Generalization for Healthcare Applications.
Chaoqi Yang, M. Brandon Westover, Jimeng Sun
International Conference on Learning Representations (ICLR 2023)
-
BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model.
Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu, Weijian Li, Ammar Gilani, Hsi-Sheng Goan, Han Liu
The 41st International Conference on Machine Learning (ICML 2024)
-
Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings. (paper)
Klim Kireev, Maksym Andriushchenko, Carmela Troncoso, Nicolas Flammarion
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2023)
-
Revisiting Deep Learning Models for Tabular Data. (paper)
Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko.
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
3.4 Causal Representation Learning
3.5 Continual Learning
Branch 4: Learning with External Modalities and Knowledge
4.1 Multi-modal Learning
4.1.1 Cross-model Alignment
4.1.1.1 Global Alignment
- Focus on What Matters: Enhancing Medical Vision-Language Models with Automatic Attention Alignment Tuning.
Chang, Aofei and Huang, Le and Boyd, Alex James and others
arXiv (2025)
- HealNet: Multimodal Fusion for Heterogeneous Biomedical Data.
Hemker, Konstantin; Simidjievski, Nikola; Jamnik, Mateja
Advances in Neural Information Processing Systems (NeurIPS) (2024)
-
Eye-gaze guided multi-modal alignment for medical representation learning
Ma, Chong; Jiang, Hanqi; Chen, Wenting; Li, Yiwei; Wu, Zihao; Yu, Xiaowei; Liu, Zhengliang; Guo, Lei; Zhu, Dajiang; Zhang, Tuo; et al.
Advances in Neural Information Processing Systems (NeurIPS) 2024
-
Multimodal Prototyping for cancer survival prediction
Song, Andrew H and Chen, Richard J and Jaume, Guillaume and Vaidya, Anurag Jayant and Baras, Alexander and Mahmood, Faisal
International Conference on Machine Learning (ICML) 2024
-
Pathasst: A generative foundation AI assistant towards artificial general intelligence of pathology.
Sun, Yuxuan; Zhu, Chenglu; Zheng, Sunyi; Zhang, Kai; Sun, Lin; Shui, Zhongyi; Zhang, Yunlong; Li, Honglin; Yang, Lin
Proceedings of the AAAI Conference on Artificial Intelligence (2024)
-
A visual-language foundation model for computational pathology.
Lu, Ming Y; Chen, Bowen; Williamson, Drew FK; Chen, Richard J; Liang, Ivy; Ding, Tong; Jaume, Guillaume; Odintsov, Igor; Le, Long Phi; Gerber, Georg; others
Nature Medicine (2024)
-
Vision language foundation model for echocardiogram interpretation.
Christensen, Matthew; Vukadinovic, Milos; Yuan, Neal; Ouyang, David
Nature Medicine (2024)
-
A multimodal generative AI copilot for human pathology
Lu, Ming Y and Chen, Bowen and Williamson, Drew FK and Chen, Richard J and Zhao, Melissa and Chow, Aaron K and Ikemura, Kenji and Kim, Ahrong and Pouli, Dimitra and Patel, Ankush and others
Nature 2024
-
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
hu, Kangyu and Xia, Peng and Li, Yun and Zhu, Hongtu and Wang, Sheng and Yao, Huaxiu
ICML 2025
-
Advancing radiograph representation learning with masked record modeling.
Zhou, Hong-Yu; Lian, Chenyu; Wang, Liansheng; Yu, Yizhou
arXiv preprint arXiv:2301.13155 (2023)
-
Biomedclip: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Zhang, Sheng and Xu, Yanbo and Usuyama, Naoto and Xu, Hanwen and Bagga, Jaspreet and Tinn, Robert and Preston, Sam and Rao, Rajesh and Wei, Mu and Valluri, Naveen and others.
Arxiv 2023
-
Pmc-clip: Contrastive language-image pre-training using biomedical documents.
Lin, Weixiong; Zhao, Ziheng; Zhang, Xiaoman; Wu, Chaoyi; Zhang, Ya; Wang, Yanfeng; Xie, Weidi
International Conference on Medical Image Computing and Computer-Assisted Intervention (2023)
-
Imitate: Clinical prior guided hierarchical vision-language pre-training.
Liu, Che; Cheng, Sibo; Shi, Miaojing; Shah, Anand; Bai, Wenjia; Arcucci, Rossella
arXiv preprint arXiv:2310.07355 (2023)
-
Human-Centred Multimodal Deep Learning Models for Chest X-Ray Diagnosis.
Chihcheng Hsieh
International Joint Conference on Artificial Intelligence (IJCAI 2023)
-
Prior: Prototype representation from medical images and reports.
Cheng, Pujin; Lin, Li; Lyu, Junyan; Huang, Yijin; Luo, Wenhan; Tang, Xiaoying
Proceedings of the IEEE/CVF International Conference on Computer Vision (2023)
-
M-flag: Medical vision-language pre-training with frozen language models and latent space geometry optimization.
Liu, Che; Cheng, Sibo; Chen, Chen; Qiao, Mengyun; Zhang, Weitong; Shah, Anand; Bai, Wenjia; Arcucci, Rossella
International Conference on Medical Image Computing and Computer-Assisted Intervention (2023)
- Learning to exploit temporal structure for biomedical vision-language processing.
Bannur, Shruthi and Hyland, Stephanie and Liu, Qianchu and Perez-Garcia, Fernando and Ilse, Maximilian and Castro, Daniel C and Boecking, Benedikt and Sharma, Harshita and Bouzid, Kenza and Thieme, Anja and others
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)
-
A visual language foundation model for pathology image analysis using medical twitter.
Huang, Zhi; Bianchi, Federico; Yuksekgonul, Mert; Montine, Thomas J; Zou, James
Nature Medicine (2023)
- Contrastive learning of medical visual representations from paired images and text.
Zhang, Yuhao and Jiang, Hang and Miura, Yasuhide and Manning, Christopher D and Langlotz, Curtis P
Machine Learning for Healthcare Conference (MLHC 2022)
- Making the most of text semantics to improve biomedical vision--language processing.
Boecking, Benedikt and Usuyama, Naoto and Bannur, Shruthi and Castro, Daniel C and Schwaighofer, Anton and Hyland, Stephanie and Wetscherek, Maria and Naumann, Tristan and Nori, Aditya and Alvarez-Valle, Javier and others
European Conference on Computer Vision (ECCV 2022)
- Integrated multimodal artificial intelligence framework for healthcare applications.
Soenksen, Luis R and Ma, Yu and Zeng, Cynthia and Boussioux, Leonard and Villalobos Carballo, Kimberly and Na, Liangyuan and Wiberg, Holly M and Li, Michael L and Fuentes, Ignacio and Bertsimas, Dimitris
NPJ Digital Medicine 5(1):149 (2022)
- Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning.
Tiu, Ekin and Talius, Ellie and Patel, Pujan and Langlotz, Curtis P and Ng, Andrew Y and Rajpurkar, Pranav
Nature Biomedical Engineering 6(12):1399–1406 (2022)
4.1.1.2 Fine-grained Alignment
4.1.1.2.1 Fine-Grained Contextual Understanding.
- Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology.
Liu, Pei; Ji, Luping; Gou, Jiaxiang; Fu, Bo; Ye, Mao
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
- Prompt as Knowledge Bank: Boost Vision-language Model via Structural Representation for Zero-shot Medical Detection.
Yang, Yuguang; Chen, Tongfei; Huang, Haoyu; Yang, Linlin; Xie, Chunyu; Leng, Dawei; Cao, Xianbin; Zhang, Baochang
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
- HSCR: Hierarchical Self-Contrastive Rewarding for Aligning Medical Vision Language Models.
Jiang, Songtao and Zhang, Yan and Jin, Yeying and Tang, Zhihang and Wu, Yangyang and Feng, Yang and Wu, Jian and Liu, Zuozhu
arXiv preprint arXiv:2506.00805 (2025)
-
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Binghong Wu, Lei Liao, Shu Wei, Yongjie Ye, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang
Advances in Neural Information Processing Systems (NeurIPS) 2024
- Joint learning of localized representations from medical images and reports.
Müller, Philip and Kaissis, Georgios and Zou, Congyu and Rueckert, Daniel
European Conference on Computer Vision (ECCV 2022)
- MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis.
Wu, Chaoyi and Zhang, Xiaoman and Zhang, Ya and Wang, Yanfeng and Xie, Weidi
IEEE/CVF International Conference on Computer Vision (ICCV 2023)
- Knowledge-enhanced visual-language pre-training on chest radiology images.
Zhang, Xiaoman and Wu, Chaoyi and Zhang, Ya and Xie, Weidi and Wang, Yanfeng
Nature Communications 14(1):4542 (2023)
- Imitate: Clinical prior guided hierarchical vision-language pre-training.
Liu, Che and Cheng, Sibo and Shi, Miaojing and Shah, Anand and Bai, Wenjia and Arcucci, Rossella
arXiv preprint arXiv:2310.07355 (2023)
4.1.1.2.2 Extending 2D to 3D Imaging.
- Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models.
Cao, Weiwei and Zhang, Jianpeng and Xia, Yingda and Mok, Tony CW and Li, Zi and Ye, Xianghua and Lu, Le and Zheng, Jian and Tang, Yuxing and Zhang, Ling
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
- A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities.
Hamamci, Ibrahim Ethem and Er, Sezgin and Almas, Furkan and Simsek, Ayse Gulnihan and Esirgun, Sevval Nil and Dogan, Irem and Dasdelen, Muhammed Furkan and Wittmann, Bastian and Simsar, Enis and Simsar, Mehmet and others
arXiv preprint arXiv:2403.17834 (2024)
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models.
Bai, Fan and Du, Yuxin and Huang, Tiejun and Meng, Max Q.-H. and Zhao, Bo
arXiv preprint arXiv:2404.00578 (2024)
- CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios.
Lin, Jingyang and Xia, Yingda and Zhang, Jianpeng and Yan, Ke and Lu, Le and Luo, Jiebo and Zhang, Ling
arXiv preprint arXiv:2404.15272 (2024)
- Merlin: A Vision Language Foundation Model for 3D Computed Tomography.
Blankemeier, Louis and Cohen, Joseph Paul and Kumar, Ashwin and Van Veen, Dave and Gardezi, Syed Jamal Safdar and Paschali, Magdalini and Chen, Zhihong and Delbrouck, Jean-Benoit and Reis, Eduardo and Truyts, Cesar and others
arXiv preprint arXiv:2406.06512 (2024)
-
Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection
Wang, Yuli; Dai, Yuwei; Jones, Craig; Sair, Haris; Shen, Jinglai; Loizou, Nicolas; Hsu, Wen-Chi; Imami, Maliha; Jiao, Zhicheng; Zhang, Paul; et al.
Advances in Neural Information Processing Systems (NeurIPS) 2024
-
Benchmarking encoder-decoder architectures for biplanar X-ray to 3D bone shape reconstruction.
Shakya, Mahesh; Khanal, Bishesh
Advances in Neural Information Processing Systems (NeurIPS) (2023)
4.1.1.2.3 Region-Level Medical LMMs.
- Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment.
Jiang, Yankai; Lei, Wenhui; Zhang, Xiaofan; Zhang, Shaoting
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
- Reinforced Correlation Between Vision and Language for Precise Medical AI Assistant
Wang, Haonan and Mao, Jiaji and Wang, Lehan and Zhang, Qixiang and Elbatel, Marawan and Qin, Yi and Hu, Huijun and Li, Baoxun and Deng, Wenhui and Qin, Weifeng and others
arxiv 2025
- Training medical large vision-language models with abnormal-aware feedback.
Zhou, Yucheng; Song, Lingran; Shen, Jianbing
arXiv preprint arXiv:2501.01377 (2025)
- AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation.
Li, Qingqiu and Cui, Zihang and Bae, Seongsu and Xu, Jilan Mand Yuan, Runtian and Zhang, Yuejie and Feng, Rui and Shen, Quanli and Zhang, Xiaobo and He, Junjun and others
arXiv preprint arXiv:2505.02830 (2025)
- Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding.
Shui, Zhongyi and Zhang, Jianpeng and Cao, Weiwei and Wang, Sinuo and Guo, Ruizhe and Lu, Le and Yang, Lin and Ye, Xianghua and Liang, Tingbo and Zhang, Qi and others
arXiv preprint arXiv:2501.14548 (2025)
- Regiongpt: Towards region understanding vision language model.
Guo, Qiushan and De Mello, Shalini and Yin, Hongxu and Byeon, Wonmin and Cheung, Ka Chun and Yu, Yizhou and Luo, Ping and Liu, Sifei
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
- Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction.
Zhang, Yilan; Xu, Yingxue; Chen, Jianqi; Xie, Fengying; Chen, Hao
The Thirteenth International Conference on Learning Representations (ICLR) (2024)
- Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation.
Chen, Wenting; Shen, Linlin; Lin, Jingyang; Luo, Jiebo; Li, Xiang; Yuan, Yixuan
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL) (2024)
- A refer-and-ground multimodal large language model for biomedicine.
Huang, Xiaoshuang and Huang, Haifeng and Shen, Lingdong and Yang, Yehui and Shang, Fangxin and Liu, Junwei and Liu, Jia
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2024)
- Maira-2: Grounded radiology report generation.
Bannur, Shruthi and Bouzid, Kenza and Castro, Daniel C and Schwaighofer, Anton and Thieme, Anja and Bond-Taylor, Sam and Ilse, Maximilian and Pérez-García, Fernando and Salvatelli, Valentina and Sharma, Harshita and others
arXiv preprint arXiv:2406.04449 (2024)
- Shikra: Unleashing multimodal LLM's referential dialogue magic.
Chen, Keqin and Zhang, Zhao and Zeng, Weili and Zhang, Richong and Zhu, Feng and Zhao, Rui
arXiv preprint arXiv:2306.15195 (2023)
- Gpt4roi: Instruction tuning large language model on region-of-interest.
Zhang, Shilong and Sun, Peize and Chen, Shoufa and Xiao, Min and Shao, Wenqi and Zhang, Wenwei and Liu, Yu and Chen, Kai and Luo, Ping
arXiv preprint arXiv:2307.03601 (2023)
4.1.1.3 Data-Efficient Parallel and Unpaired Alignment.
4.1.1.3.1 Parallel Data Collection.
-
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation.
Lin, Tianwei; Zhang, Wenqiao; Li, Sijing; Yuan, Yuqian; Yu, Binhe; Li, Haoyuan; He, Wanggui; Jiang, Hao; Li, Mengze; Song, Xiaohui; et al.
Proceedings of the 42nd International Conference on Machine Learning (ICML) (2025)
- BiomedCLIP: A multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs.
Zhang, Sheng and Xu, Yanbo and Usuyama, Naoto and Xu, Hanwen and Bagga, Jaspreet and Tinn, Robert and Preston, Sam and Rao, Rajesh and Wei, Mu and Valluri, Naveen and others
arXiv preprint arXiv:2303.00915 (2023)
- LLaVA-Med: Training a large language-and-vision assistant for biomedicine in one day.
Li, Chunyuan and Wong, Cliff and Zhang, Sheng and Usuyama, Naoto and Liu, Haotian and Yang, Jianwei and Naumann, Tristan and Poon, Hoifung and Gao, Jianfeng
Advances in Neural Information Processing Systems (NeurIPS 2023)
- Med-Flamingo: A multimodal medical few-shot learner.
Moor, Michael and Huang, Qian and Wu, Shirley and Yasunaga, Michihiro and Dalmia, Yash and Leskovec, Jure and Zakka, Cyril and Reis, Eduardo Pontes and Rajpurkar, Pranav
Machine Learning for Health (ML4H 2023)
- OpenFlamingo: An open-source framework for training large autoregressive vision-language models.
Awadalla, Anas and Gao, Irena and Gardner, Josh and Hessel, Jack and Hanafy, Yusuf and Zhu, Wanrong and Marathe, Kalyani and Bitton, Yonatan and Gadre, Samir and Sagawa, Shiori and others
arXiv preprint arXiv:2308.01390 (2023)
4.1.1.3.2 Learning from unpaired Data.
- Multimodal patient representation learning with missing modalities and labels.
Wu, Zhenbang; Dadu, Anant; Tustison, Nicholas; Avants, Brian; Nalls, Mike; Sun, Jimeng; Faghri, Faraz
The Twelfth International Conference on Learning Representations (ICLR) (2024)
- PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Xie, Yutong; Chen, Qi; Wang, Sinuo; To, Minh-Son; Lee, Iris; Khoo, Ee Win; Hendy, Kerolos; Koh, Daniel; Xia, Yong; Wu, Qi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
- A medical multimodal large language model for future pandemics.
Liu, Fenglin; Zhu, Tingting; Wu, Xian; Yang, Bang; You, Chenyu; Wang, Chenyang; Lu, Lei; Liu, Zhangdaihong; Zheng, Yefeng; Sun, Xu; et al.
NPJ Digital Medicine 6(1):226 (2023)
- Towards unifying medical vision-and-language pre-training via soft prompts.
Chen, Zhihong; Diao, Shizhe; Wang, Benyou; Li, Guanbin; Wan, Xiang
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023)
- MedCLIP: Contrastive learning from unpaired medical images and text.
Wang, Zifeng; Wu, Zhenbang; Agarwal, Dinesh; Sun, Jimeng
arXiv preprint arXiv:2210.10163 (2022)
- M3Care: Learning with missing modalities in multimodal healthcare data.
Zhang, Chaohe; Chu, Xu; Ma, Liantao; Zhu, Yinghao; Wang, Yasha; Wang, Jiangtao; Zhao, Junfeng
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
4.1.1.3 Data Efficient Parallel Alignment
-
MedCLIP: Contrastive Learning from Unpaired Medical Images and Text.
Zifeng Wang, Zhenbang Wu, Dinesh Agarwal, Jimeng Sun
arXiv preprint arXiv:2210.10163 (2022)
-
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts.
Zhihong Chen, Shizhe Diao, Benyou Wang, Guanbin Li, Xiang Wan
Proceedings of the IEEE/CVF International Conference on Computer Vision (2023)
-
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Yutong Xie, Qi Chen, Sinuo Wang, Minh-Son To, Iris Lee, Ee Win Khoo, Kerolos Hendy, Daniel Koh, Yong Xia, Qi Wu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2024)
-
LVM-Med: Learning large-scale self-supervised vision models for medical imaging via second-order graph matching
MH Nguyen, Duy; Nguyen, Hoang; Diep, Nghiem; Pham, Tan Ngoc; Cao, Tri; Nguyen, Binh; Swoboda, Paul; Ho, Nhat; Albarqouni, Shadi; Xie, Pengtao; et al.
Nature Digital Medicine 2023
4.1.2 Knowledge-Informed Modeling
4.1.3 Temporal and Asynchronous Integration and Modeling
-
Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing.
Bannur, Shruthi; Hyland, Stephanie; Liu, Qianchu; Perez-Garcia, Fernando; Ilse, Maximilian; Castro, Daniel C; Boecking, Benedikt; Sharma, Harshita; Bouzid, Kenza; Thieme, Anja; et al.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
-
Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation.
Yao, Wenfang; Liu, Chen; Yin, Kejing; Cheung, William; Qin, Jing
Advances in Neural Information Processing Systems (NeurIPS) (2024)
- Interpretable Vision-Language Survival Analysis with Ordinal Inductive Bias for Computational Pathology.
Liu, Pei; Ji, Luping; Gou, Jiaxiang; Fu, Bo; Ye, Mao
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
- Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training
Yang, Jinxia and Su, Bing and Zhao, Xin and Wen, Ji-Rong
International Conference on Machine Learning (ICLR) 2024
- Time-to-Event Pretraining for 3D Medical Imaging
Huo, Zepeng and Fries, Jason Alan and Lozano, Alejandro and Valanarasu, Jeya Maria Jose and Steinberg, Ethan and Blankemeier, Louis and Chaudhari, Akshay S and Langlotz, Curtis and Shah, Nigam H
International Conference on Machine Learning (ICLR) 2024
- Addressing asynchronicity in clinical multimodal fusion via individualized chest x-ray generation
Yao, Wenfang and Liu, Chen and Yin, Kejing and Cheung, William and Qin, Jing
Advances in Neural Information Processing Systems (Neurips) 2024
-
Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling
Xinlu Zhang, Shiyang Li, Zhiyu Chen, Xifeng Yan, Linda Petzold
International Conference on Machine Learning (ICML) 2023
4.1.4 Modality-Specific Robustness
4.2 Multi-source Learning
Communication Efficient Federated Generalized Tensor Factorization for Collaborative Health Data Analytics.
Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Joyce C. Ho
Proceedings of the Web Conference 2021, pages 171–182.
4.3 Learning with knowledge Graph
4.4 Learning with External Data Source
4.4.1 External Knowledge via Prompting and Instruction Tuning
- AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation
Qingqiu Li, Zihang Cui, Seongsu Bae, Jilan Xu, Runtian Yuan, Yuejie Zhang, Rui Feng, Quanli Shen, Xiaobo Zhang, Junjun He, Shujun Wang
arxiv 2025
- Prompt as Knowledge Bank: Boost Vision-language Model via Structural Representation for Zero-shot Medical Detection.
Yang, Yuguang; Chen, Tongfei; Huang, Haoyu; Yang, Linlin; Xie, Chunyu; Leng, Dawei; Cao, Xianbin; Zhang, Baochang
The Thirteenth International Conference on Learning Representations (ICLR) (2025)
-
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement
Che Liu, Zhongwei Wan, Cheng Ouyang, Anand Shah, Wenjia Bai, Rossella Arcucci
International Conference on Machine Learning (ICML) 2024
4.4.2 Internalized Knowledge from LLMs.
-
EMERGE: Enhancing Multimodal Electronic Health Records Predictive Modeling with Retrieval-Augmented Generation
Yinghao Zhu, Changyu Ren, Zixiang Wang, Xiaochen Zheng, Shiyun Xie, Junlan Feng, Xi Zhu, Zhoujun Li, Liantao Ma, Chengwei Pan
ACM International Conference on Information and Knowledge Management (CIKM) 2024
- LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
Lee, Suhyeon, Won Jun Kim, Jinho Chang, and Jong Chul Ye
In The Twelfth International Conference on Learning Representations (ICLR 2024)
1.4.3 Case-Based Knowledge from Patient Records
- Retrieve, reason, and refine: Generating accurate and faithful patient instructions
Liu, Fenglin and Yang, Bang and You, Chenyu and Wu, Xian and Ge, Shen and Liu, Zhangdaihong and Sun, Xu and Yang, Yang and Clifton, David
Advances in Neural Information Processing Systems (Neurips) 2022
Branch 5: LLM-Based Modeling and Systems
5.1 Learning with LLMs
5.1.1 Prompt-Based Methods
5.1.2 Pretraining Methods
5.1.3 Fine-Tuning Methods
5.1.4 Retrieval-Augmented Methods
5.2 LLM-Driven Medical Agents
1.2.6 Masking Modeling
-
MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data.
Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou, Xiangchao Wang, Jie Yang
The 12st International Conference on Learning Representations (ICLR 2024).
(paper)
-
Data-Efficient and Interpretable Tabular Anomaly Detection.
Chun-Hao Chang, Jinsung Yoon, Sercan Arik, Madeleine Udell, Tomas Pfister
The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023) (paper)
-
Anomaly detection for tabular data with internal contrastive learning.
Tom Shenkar, Lior Wolf
The 10th International Conference on Learning Representations (ICLR 2022)
(paper)
1.3 Reinforcement Learning
-
"My nose is running." "Are you also coughing?": Building A Medical Diagnosis Agent with Interpretable Inquiry Logics.
Wenge Liu, Yi Cheng, Hao Wang, Jianheng Tang, Yafei Liu, Ruihui Zhao, Wenjie Li, Yefeng Zheng, Xiaodan Liang
International Joint Conference on Artificial Intelligence (IJCAI 2022)
1.4 Temporal Modeling
-
Cooperative Joint Attentive Network for Patient Outcome Prediction on Irregular Multi-Rate Multivariate Health Data.
Qingxiong Tan, Mang Ye, Grace Lai-Hung Wong, Pong Chi Yuen
International Joint Conference on Artificial Intelligence (IJCAI), 2021, pp. 1586–1592.
1.7 Clinical Agent
-
From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs
Pellegrini, Chantal and Özsoy, Ege and Bani-Harouni, David and Keicher, Matthias and Navab, Nassir
arXiv (2025)
-
ColaCare: Enhancing Electronic Health Record Modeling Through Large Language Model-Driven Multi-Agent Collaboration
Zixiang Wang, Yinghao Zhu, Huiya Zhao, Xiaochen Zheng, Dehao Sui, Tianlong Wang, Wen Tang, Yasha Wang, Ewen Harrison, Chengwei Pan, Junyi Gao, Liantao Ma
Proceedings of the ACM on Web Conference (WWW) (2025)
-
CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering
Yuren Mao, Wenyi Xu, Yuyang Qin, Yunjun Gao
arXiv 2025
-
MedRAX: Medical Reasoning Agent for Chest X-ray
Fallahpour, Adibvafa and Ma, Jun and Munim, Alif and Lyu, Hongwei and Wang, Bo
Forty-Second International Conference on Machine Learning (ICML) 2025
-
MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications
Jiang, Yixing and Black, Kameron C and Geng, Gloria and Park, Danny and Ng, Andrew Y and Chen, Jonathan H
arXiv (2025)
-
MedAgent-Pro: Towards Multi-Modal Evidence-Based Medical Diagnosis via Reasoning Agentic Workflow
Wang, Ziyue and Wu, Junde and Low, Chang Han and Jin, Yueming
arXiv (2025)
-
CoD: Towards an Interpretable Medical Agent Using Chain of Diagnosis
Junying Chen, Chi Gui, Anningzhe Gao, Ke Ji, Xidong Wang, Xiang Wan, Benyou Wang
arXiv preprint arXiv:2407.13301 (2024)
-
MMedAgent: Learning to Use Medical Tools with Multi-Modal Agent
Binxu Li, Tiankai Yan, Yuanting Pan, Jie Luo, Ruiyang Ji, Jiayuan Ding, Zhe Xu, Shilong Liu, Haoyu Dong, Zihao Lin, Yixin Wang
Findings of the Association for Computational Linguistics (EMNLP-Findings 2024)
-
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records
Shi, Wenqi and Xu, Ran and Zhuang, Yuchen and Yu, Yue and Zhang, Jieyu and Wu, Hang and Zhu, Yuanda and Ho, Joyce and Yang, Carl and Wang, May Dongmei
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2024)
-
AgentClinic: A Dynamic Multi-Agent Environment for Healthcare Diagnosis and Treatment Simulations
Samuel Schmidgall, Rojin Ziaei, Carl Harris, Eduardo Reis, Jeffrey Jopling, Michael Moor
Arxiv 2024
others:Benchmark
- AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments
Schmidgall, Samuel and Ziaei, Rojin and Harris, Carl and Reis, Eduardo and Jopling, Jeffrey and Moor, Michael
arxiv 2025
- ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification
Ziqing Fan, Cheng Liang, Chaoyi Wu, Ya Zhang, Yanfeng Wang, Weidi Xie
Arxiv 2025
1.3 Learning with External Knowledge
1.3.1 Learning with Good Model Initialization
- XTab: Cross-table Pretraining for Tabular Transformers. (paper)
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran
The 40th International Conference on Machine Learning (ICML 2023)
- Numerical Tuple Extraction from Tables with Pre-training. (paper)
Qingping Yang, Yixuan Cao, Yingming Hu, Jianfeng Li, Nanbo Peng, Ping Luo.
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
- Well-tuned Simple Nets Excel on Tabular Datasets. (paper)
Arlind Kadra, Marius Lindauer, Frank Hutter, Josif Grabocka
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
1.3.1 Learning with Knowledge Graph
-
High dimensional, tabular deep learning with an auxiliary knowledge graph. (paper)
Camilo Ruiz, Hongyu Ren, Kexin Huang, Jure Leskovec
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
- External Knowledge Infusion for Tabular Pre-training Models with Dual-adapters. (paper)
Can Qin, Sungchul Kim, Handong Zhao, Tong Yu, Ryan A. Rossi, Yun Fu.
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
1.3.2 Learning with Large Language Models
- TableRAG: Million-Token Table Understanding with Language Models.
Si-An Chen, Lesly Miculicich, Julian Eisenschlos, Zifeng Wang, Zilong Wang, Yanfei Chen, YASUHISA FUJII, Hsuan-Tien Lin, Chen-Yu Lee, Tomas Pfister.
The 38th Annual Conference on Neural Information Processing Systems (Neurips 2024)
- Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning.
Sungwon Han, Jinsung Yoon, Sercan Arik, Tomas Pfister
The 41th International Conference on Machine Learning (ICML 2024)
- OpenTab: Advancing Large Language Models as Open-domain Table Reasoners. (paper)
Kezhi Kong, Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Chuan Lei, Christos Faloutsos, Huzefa Rangwala, George Karypis
The 12th International Conference on Learning Representations (ICLR 2024)
- Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources. (paper)
Xingxuan Li, Ruochen Zhao, Yew Ken Chia, Bosheng Ding, Shafiq Joty, Soujanya Poria, Lidong Bing
The 12th International Conference on Learning Representations (ICLR 2024)
-
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models.
Cao, Weiwei; Zhang, Jianpeng; Xia, Yingda; Mok, Tony CW; Li, Zi; Ye, Xianghua; Lu, Le; Zheng, Jian; Tang, Yuxing; Zhang, Ling
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2024)
-
A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities.
Hamamci, Ibrahim Ethem; Er, Sezgin; Almas, Furkan; Simsek, Ayse Gulnihan; Esirgun, Sevval Nil; Dogan, Irem; Dasdelen, Muhammed Furkan; Wittmann, Bastian; Simsar, Enis; Simsar, Mehmet; others
arXiv preprint arXiv:2403.17834 (2024)
-
Merlin: A Vision Language Foundation Model for 3D Computed Tomography.
Blankemeier, Louis; Cohen, Joseph Paul; Kumar, Ashwin; Van Veen, Dave; Gardezi, Syed Jamal Safdar; Paschali, Magdalini; Chen, Zhihong; Delbrouck, Jean-Benoit; Reis, Eduardo; Truyts, Cesar; others
arXiv preprint arXiv:2406.06512 (2024)
- Making Pre-trained Language Models Great on Tabular Prediction. (paper)
Jiahuan Yan, Bo Zheng, Hongxia Xu, Yiheng Zhu, Danny Chen, Jimeng Sun, Jian Wu, Jintai Chen
The 12th International Conference on Learning Representations (ICLR 2024)
-
UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science. (paper)
Yazheng Yang, Yuqi Wang, Guang Liu, Ledell Wu, Qi Liu
The 12th International Conference on Learning Representations (ICLR 2024)
- Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding. (paper)
Zilong Wang, Hao Zhang, Chun-Liang Li, Julian Martin Eisenschlos, Vincent Perot, Zifeng Wang, Lesly Miculicich, Yasuhisa Fujii, Jingbo Shang, Chen-Yu Lee, Tomas Pfister
The 12th International Conference on Learning Representations (ICLR 2024)
-
Parameterizing Context: Unleashing the Power of Parameter-Efficient Fine-Tuning and In-Context Tuning for Continual Table Semantic Parsing. (paper)
Yongrui Chen, Shenyu Zhang, Guilin Qi, Xinnan Guo
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
- Trompt: Towards a Better Deep Neural Network for Tabular Data. (paper)
Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Ting-Wei Chen, Tien-Hao Chang
The 40th International Conference on Machine Learning (ICML 2023)
-
Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering.(paper)
Noah Hollmann, Samuel Müller, Frank Hutter
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
-
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs.
Zhang, Sheng; Xu, Yanbo; Usuyama, Naoto; Xu, Hanwen; Bagga, Jaspreet; Tinn, Robert; Preston, Sam; Rao, Rajesh; Wei, Mu; Valluri, Naveen; others
arXiv preprint arXiv:2303.00915 (2023)
-
Learning to exploit temporal structure for biomedical vision-language processing.
Bannur, Shruthi; Hyland, Stephanie; Liu, Qianchu; Perez-Garcia, Fernando; Ilse, Maximilian; Castro, Daniel C; Boecking, Benedikt; Sharma, Harshita; Bouzid, Kenza; Thieme, Anja; others
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2023)
-
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis.
Wu, Chaoyi; Zhang, Xiaoman; Zhang, Ya; Wang, Yanfeng; Xie, Weidi
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
-
Knowledge-enhanced visual-language pre-training on chest radiology images.
Zhang, Xiaoman; Wu, Chaoyi; Zhang, Ya; Xie, Weidi; Wang, Yanfeng
Nature Communications (2023)
-
Language Models are Weak Learners. (paper)
Hariharan Manikandan, Yiding Jiang, J Zico Kolter
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
- DyRRen: A Dynamic Retriever-Reranker-Generator Model for Numerical Reasoning over Tabular and Textual Data. (paper)
Xiao Li, Yin Zhu, Sichen Liu, Jiangzhou Ju, Yuzhong Qu, Gong Cheng
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
- Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions. (paper)
Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions. (paper)
Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
-
TAPEX: Table Pre-training via Learning a Neural SQL Executor.(paper)
Qian Liu, Bei Chen, Jiaqi Guo, Morteza Ziyadi, Zeqi Lin, Weizhu Chen, Jian-Guang Lou
The 10th International Conference on Learning Representations (ICLR 2022)
-
Contrastive learning of medical visual representations from paired images and text.
Zhang, Yuhao; Jiang, Hang; Miura, Yasuhide; Manning, Christopher D; Langlotz, Curtis P
Machine Learning for Healthcare Conference (2022)
-
Making the most of text semantics to improve biomedical vision language processing.
Boecking, Benedikt; Usuyama, Naoto; Bannur, Shruthi; Castro, Daniel C; Schwaighofer, Anton; Hyland, Stephanie; Wetscherek, Maria; Naumann, Tristan; Nori, Aditya; Alvarez-Valle, Javier; others
European Conference on Computer Vision (2022)
-
Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning.
Tiu, Ekin; Talius, Ellie; Patel, Pujan; Langlotz, Curtis P; Ng, Andrew Y; Rajpurkar, Pranav
Nature Biomedical Engineering (2022)
-
Joint learning of localized representations from medical images and reports.
Müller, Philip; Kaissis, Georgios; Zou, Congyu; Rueckert, Daniel
European Conference on Computer Vision (2022)
1.4 Causal Representation Learning
- TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second. (paper)
Noah Hollmann, Samuel Müller, Katharina Eggensperger, Frank Hutter
The 11th International Conference on Learning Representations (ICLR 2023)
- Towards Out-of-Distribution Sequential Event Prediction: A Causal Treatment. (paper)
Chenxiao Yang, Qitian Wu, Qingsong Wen, Zhiqiang Zhou, Liang Sun, Junchi Yan
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation. (paper)
Ioana Bica, Mihaela van der Schaar
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes. (paper)
Zhaozhi Qian, Yao Zhang, Ioana Bica, Angela Mary Wood, Mihaela van der Schaar
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2021)
Branch 2: Downstream Tasks
2.1 Generation
2.1.1 GAN-based Models
- Invertible Tabular GANs: Killing Two Birds with One Stone for Tabular Data Synthesis. (paper)
JAEHOON LEE, Jihyeon Hyeong, Jinsung Jeon, Noseong Park, Jihoon Cho
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
- DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks. (paper)
Boris van Breugel, Trent Kyono, Jeroen Berrevoets, Mihaela van der Schaar
The 35th Annual Conference on Neural Information Processing Systems (Neurips 2021)
2.1.2 VAE-based Models
- A Learnable Discrete-Prior Fusion Autoencoder with Contrastive Learning for Tabular Data Synthesis. (paper)
Rongchao Zhang, Yiwei Lou, Dexuan Xu, Yongzhi Cao, Hanpin Wang, Yu Huang
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
2.1.3 Diffusion-based Models
- Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees. (paper)
Alexia Jolicoeur, Martineau Kilian Fatras, Tal Kachmana Rangwala, George Karypis
(AISTATS 2024)
- Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space.(paper)
Hengrui Zhang, Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Xiao Qin, Christos Faloutsos, Huzefa Rangwala, George Karypis
The 12th International Conference on Learning Representations (ICLR 2024)
- A Flexible Generative Model for Heterogeneous Tabular EHR with Missing Modality. (paper)
Huan He, William hao, Yuanzhe Xi, Yong Chen, Bradley Malin, Joyce Ho
The 12th International Conference on Learning Representations (ICLR 2024)
- CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis. (paper)
Chaejeong Lee, Jayoung Kim, Noseong Park
The 40th International Conference on Machine Learning (ICML 2023)
- TabDDPM: Modelling Tabular Data with Diffusion Models. (paper)
Akim Kotelnikov, Dmitry Baranchuk, Ivan Rubachev, Artem Babenko
The 40th International Conference on Machine Learning (ICML 2023)
- Concrete Score Matching: Generalized Score Matching for Discrete Data. (paper)
Chenlin Meng, Kristy Choi, Jiaming Song, Stefano Ermon
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- SOS: Score-based Oversampling Minor Classes for Tabular Data. (paper)
jayoung kim, ChaeJeong Lee,Yehjin Shin, Sewon Park, Minjung Kim, Noseong Park, Jihoon Cho
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
2.1.4 Transformer-based
-
ReMasker: Imputing Tabular Data with Masked Autoencoding. (paper)
Tianyu Du, Luca Melis, Ting Wang
The 12th International Conference on Learning Representations (ICLR 2024)
-
TabMT: Generating tabular data with masked transformers. (paper)
Manbir S Gulati, Paul F Roysdon
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
-
Realtabformer: Generating realistic relational and tabular data using transformers. (paper)
Aivin V. Solatorio, Olivier Dupriez
(Arxiv 2023)
-
Fata-trans: Field and time-aware transformer for sequential tabular data. (paper)
Dongyu Zhang, Liang Wang, Xin Dai, Shubham Jain, Junpeng Wang, Yujie Fan, Chin-Chia Michael Yeh, Yan Zheng, Zhongfang Zhuang, Wei Zhang
The 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)
2.1.5 Large Language Model-based
-
CuTS: Customizable Tabular Synthetic Data Generation.
Mark Vero, Mislav Balunovic, Martin Vechev
The 41th International Conference on Machine Learning (ICML 2024)
-
Language-Interfaced Tabular Oversampling via Progressive Imputation and Self-Authentication. (paper)
June Yong Yang, Geondo Park, Joowon Kim, Hyeongwon Jang, Eunho Yang
The 12th International Conference on Learning Representations (ICLR 2024)
- Language Models are Realistic Tabular Data Generators. (paper)
Vadim Borisov, Kathrin Seßler2, Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci
The 11th International Conference on Learning Representations (ICLR 2023)
- Generative Table Pre-training Empowers Models for Tabular Prediction. (paper)
Tianping Zhang, Shaowen Wang, Shuicheng Yan, Jian Li, Qian Liu
(EMNLP 2023)
- nBIIG: A Neural BI Insights Generation System for Table Reporting.
Yotam Perlitz, Dafna Sheinwald, Noam Slonim, Michal Shmueli-Scheuer
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023) (paper)
2.1.6 Model-agnostic
-
How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data.
Mihaela C Stoian, Salijona Dyrmishi, Maxime Cordy, Thomas Lukasiewicz, Eleonora Giunchiglia
The 12th International Conference on Learning Representations (ICLR 2024, interesting!) (paper)
2.2 Anomaly Detection
-
Beyond Individual Input for Deep Anomaly Detection on Tabular Data.
Hugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Liên DOAN
The 41st International Conference on Machine Learning (ICML 2024).
-
MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data.
Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou, Xiangchao Wang, Jie Yang
The 12th International Conference on Learning Representations (ICLR 2024).(paper)
-
SemanticMask: A Contrastive View Design for Anomaly Detection in Tabular Data. (paper)
Shuting Tao, Tongtian Zhu, Hongwei Wang, Xiangming Meng
the 33th International Joint Conference on Artificial Intelligence (IJCAI 2024).
2.3 Transfer Learning
-
Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images.
Jun-Peng Jiang, Han-Jia Ye, Leye Wang, Yang Yang, Yuan Jiang, De-Chuan Zhan.
The 41st International Conference on Machine Learning (ICML 2024).
-
CARTE: Pretraining and Transfer for Tabular Learning.
Myung Jun Kim, Leo Grinsztajn, Gael Varoquaux
The 41st International Conference on Machine Learning (ICML 2024).
-
TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules.
Weijieying Ren, Xiaoting Li, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant Honavar
The 41st International Conference on Machine Learning (ICML 2024).
-
Towards Cross-Table Masked Pretraining for Web Data Mining.
Chao Ye, Guoshan Lu, Haobo Wang, Liyao Li, Sai Wu, Gang Chen, Junbo Zhao
(WWW 2024).(paper)
-
MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data. (paper)
Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou, Xiangchao Wang, Jie Yang
The 12th International Conference on Learning Representations (ICLR 2024).
- XTab: Cross-table Pretraining for Tabular Transformers. (paper)
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran
The 40th International Conference on Machine Learning (ICML 2023)
-
TransTab: Learning Transferable Tabular Transformers Across Tables. (paper)
Zifeng Wang, Jimeng Sun.
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- External Knowledge Infusion for Tabular Pre-training Models with Dual-adapters. (paper)
Can Qin, Sungchul Kim, Handong Zhao, Tong Yu, Ryan A. Rossi, YUN FU.
The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)
- Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation.
Ioana Bica, Mihaela van der Schaar
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022) (paper)
2.4 Explanation/Model Assesment
- Interpretable Deep Clustering for Tabular Data.
Jonathan Svirsky, Ofir Lindenbaum
The 41st International Conference on Machine Learning (ICML 2024)
- InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation.
Jacob Si, Wendy Yusi Cheng, Michael Cooper, Rahul G. Krishnan
The 41st International Conference on Machine Learning (ICML 2024)
- Do Machine Learning Models Learn Statistical Rules Inferred from Data? (paper)
Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong
The 40th International Conference on Machine Learning (ICML 2023)
-
An Inductive Bias for Tabular Deep Learning. (paper)
Ege Beyazit, Jonathan Kozaczuk, Bo Li, Vanessa Wallace, Bilal H Fadlallah
The 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
-
TabCBM: Concept-based Interpretable Neural Networks for Tabular Data. (paper)
Mateo Espinosa Zarlenga, Zohreh Shams, Michael Edward Nelson, Been Kim, and Mateja Jamnik (TMLR 2023)
- Tabular Data: Deep Learning is Not All You Need. (paper)
Ravid Shwartz-Ziv, Amitai Armon.
(Information Fusion 2022)
- Revisiting Deep Learning Models for Tabular Data. (paper)
Yury Gorishniy, Ivan Rubachev, Valentin Khrulkov, Artem Babenko
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2021)
2.5: Retrieval
- TabR: Tabular Deep Learning Meets Nearest Neighbors.(paper)
Yury Gorishniy, Ivan Rubachev, Nikolay Kartashev, Daniil Shlenskii, Akim Kotelnikov, Artem Babenko
The 12th International Conference on Learning Representations (ICLR 2024)
- Dense Representation Learning and Retrieval for Tabular Data Prediction.(paper)
Lei Zheng, Ning Li, Xianyu Chen, Quan Gan, Weinan Zhang
The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)
2.5: Efficiency
- TinyLUT: Tiny Look-Up Table for Efficient Image Restoration at the Edge.(paper)
Huanan LI, Juntao Guan, Lai Rui, Sijun Ma, Lin Gu, Noperson
The 38th Annual Conference on Neural Information Processing Systems (Neurips 2024)
- FHyperFast: Instant Classification for Tabular Data.(paper)
David Bonet; Daniel Mas Montserrat; Xavier Giró-i-Nieto; Alexander Ioannidis
The 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
- TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second. (paper)
Noah Hollmann, Samuel Müller, Katharina Eggensperger, Frank Hutter
The 11th International Conference on Learning Representations (ICLR 2023)
-
Clustering the Sketch: Dynamic Compression for Embedding Tables. (paper)
Henry Tsang, Thomas Dybdahl Ahle
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
- Compressing Tabular Data via Latent Variable Estimation. (paper)
Andrea Montanari, Eric Weiner
The 40th International Conference on Machine Learning (ICML 2023)
Branch 3: Application
3.1 Clinical Tabular Data
- EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records.
Yeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi.
The 38th Annual Conference on Neural Information Processing Systems (Neurips 2024)
- Contrastive Learning for Clinical Outcome Prediction with Partial Data Sources.
Xia, Jonathan Wilson, Benjamin Goldstein, Ricardo Henao.
The 41th International Conference on Machine Learning (ICML 2024)
- Safe Exploration in Dose Finding Clinical Trials with Heterogeneous Participants.
Isabel Chien, Wessel Bruinsma, Javier Gonzalez, Richard E Turner.
The 41th International Conference on Machine Learning (ICML 2024)
- Weight Predictor Network with Feature Selection for Small Sample Tabular Biomedical Data. (paper)
Andrei Margeloiu, Nikola Simidjievski, Pietro Liò, Mateja Jamnik.
The 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
- Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling.(paper)
Xinlu Zhang, Shiyang Li, Zhiyu Chen, Xifeng Yan, Linda Ruth Petzold
The 40th International Conference on Machine Learning (ICML 2023)
-
Towards Semi-Structured Automatic ICD Coding via Tree-based Contrastive Learning. (paper)
Chang Lu, Chandan K. Reddy, Ping Wang, Yue Ning
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
-
MOTOR: A Time-to-Event Foundation Model For Structured Medical Records. (paper)
Ethan Steinberg, Jason Alan Fries, Yizhe Xu, Nigam Shah
The 37th Annual Conference on Neural Information Processing Systems (Neurips 2023)
- Locally Sparse Neural Networks for Tabular Biomedical Data. (paper)
Junchen Yang, Ofir Lindenbaum, Yuval Kluger
The 39th International Conference on Machine Learning (ICML 2022)
- Evaluating Latent Space Robustness and Uncertainty of EEG-ML Models under Realistic Distribution Shifts.(paper)
Neeraj Wagh, Jionghao Wei, Samarth Rawal, Brent M. Berry, Yogatheesan Varatharajah
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- Debiased, Longitudinal and Coordinated Drug Recommendation through Multi-Visit Clinic Records. (paper)
Hongda Sun, Shufang Xie, Shuqi Li, Yuhan Chen, Ji-Rong Wen, Rui Yan
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions. (paper)
Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton
The 36th Annual Conference on Neural Information Processing Systems (Neurips 2022)
- Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis. (paper)
Siyi Tang, Jared Dunnmon, Khaled Kamal Saab, Xuan Zhang, Qianying Huang, Florian Dubost, Daniel Rubin, Christopher Lee-Messer
The 10th International Conference on Learning Representations (ICLR 2022)
- SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes. (paper)
Zhaozhi Qian, Yao Zhang, Ioana Bica, Angela Mary Wood, Mihaela van der Schaar
The 34th Annual Conference on Neural Information Processing Systems (Neurips 2021)
3.2 Financial Tabular Data
- Beyond Pure Text: Summarizing Financial Reports Based on Both Textual and Tabular Data. (paper)
Ziao Wang, Zelin Jiang, Xiaofeng Zhang, Jaehyeon Soon, Jialu Zhang, Wang Xiaoyao, Hongwei Du
The 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)
Existing Surveys
-
Explainable Artificial Intelligence for Tabular Data: A Survey
Sahakyan M, Aung Z, Rahwan T
IEEE Access 2021
-
The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey
Sauber-Cole, Rick, and Taghi M. Khoshgoftaar
Journal of Big Data 2022
-
Deep Neural Networks and Tabular Data: A Survey.(paper)
Vadim Borisov, Tobias Leemann, Kathrin Seßler, Johannes Haug, Martin Pawelczyk, Gjergji Kasneci
IEEE transactions on neural networks and learning systems 2022
-
Synthetic data generation for tabular health records: A systematic review
Hernandez, M., Epelde, G., Alberdi, A., Cilla, R., & Rankin, D.
Neurocomputing 2022
-
Embeddings for Tabular Data: A Survey. (paper)
Rajat Singh, Srikanta Bedathur
Arxiv 2023
-
Transformers for Tabular Data Representation: A Survey of Models and Applications. (paper)
Gilbert Badaro, Mohammed Saeed, Paolo Papotti
TACL 2023
-
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data. (paper)
Wei-Yao Wang1, Wei-Wei Du2,, Derek Xu, Wei Wang, Wen-Chih Peng
Arxiv 2024
-
Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution
Yucheng Ruan, Xiang Lan, Jingying Ma, Yizhi Dong, Kai He, Mengling Feng
Arvix 2024
-
Large language models (LLMs) on tabular data: Prediction, generation, and understanding - a survey
Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun (Jane) Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, "SHS", Christos Faloutsos
Arvix 2024
Tools & Libraries
- Pytorch Frame: A modular deep learning framework for building neural network models on heterogeneous tabular data. (paper)
- PyTorch Tabular: A Framework for Deep Learning with Tabular Data.
Last updated on March 05, 2024.
(For problems, contact wjr5337@psu.edu. To add papers, please pull request at
our repo)