Deep Tabular Representation Learning and its Applications: A Survey

Builder & Current Maintainer: Weijieying Ren, YuQing Huang and Tianxiang Zhao, Prof. Vasant Honavar.

Paper List

We have summarized the main branches of works for Deep tabular data representation learning, including its downstream tasks and applications. For more details, please refer to our recent survey (paper).

Branch 1: Deep Tabular Representation Learning

1.1 Heterogeneous Feature Representation Learning

1.1.1 Kernel-Based

1.1.2 Binning-Based Representation Learning

1.1.3 Latent Representation Learning

1.2 Inter-Column Dependency Modeling

1.2.1 Tree-based

1.2.2 Graph-based

1.2.3 Rule-based Method

1.2.4 Transformer-based

1.2.5 Additive-model-based

1.2.6 Masking Modeling

1.2.7 Neural Architecture Search

1.3 Self-supervised Representation Learning

1.4 Clustering-based Representation Learning

1.2 Multi-modality Representation Learning

1.3 Learning with External Knowledge

1.3.1 Learning with Good Model Initialization

1.3.1 Learning with Knowledge Graph

1.3.2 Learning with Large Language Models

1.4 Causal Representation Learning

Branch 2: Downstream Tasks

2.1 Generation

2.1.1 GAN-based Models

2.1.2 VAE-based Models

2.1.3 Diffusion-based Models

2.1.4 Transformer-based

2.1.5 Large Language Model-based

2.1.6 Model-agnostic

2.2 Anomaly Detection

2.3 Transfer Learning

2.4 Explanation/Model Assesment

2.5: Retrieval

2.5: Efficiency

Branch 3: Application

3.1 Clinical Tabular Data

3.2 Financial Tabular Data

Existing Surveys

Tools & Libraries

Last updated on March 05, 2024. (For problems, contact wjr5337@psu.edu. To add papers, please pull request at our repo)