TTVAE: Transformer-based generative modeling for tabular data generation

被引：1

作者：

Wang, Alex X. ^{[1
]}

Nguyen, Binh P. ^{[1
,2
]}

机构：

[1] Victoria Univ Wellington, Sch Math & Stat, Wellington 6012, New Zealand

[2] Ho Chi Minh City Open Univ, Fac Informat Technol, 97 Vo Van Tan,Dist 3, Ho Chi Minh City 70000, Vietnam

来源：

ARTIFICIAL INTELLIGENCE | 2025年 / 340卷

关键词：

Generative AI; Tabular data; Transformer; Latent space interpolation; SMOTE;

D O I：

10.1016/j.artint.2025.104292

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Tabular data synthesis presents unique challenges, with Transformer models remaining underexplored despite the applications of Variational Autoencoders and Generative Adversarial Networks. To address this gap, we propose the Transformer-based Tabular Variational AutoEncoder (TTVAE), leveraging the attention mechanism for capturing complex data distributions. The inclusion of the attention mechanism enables our model to understand complex relationships among heterogeneous features, a task often difficult for traditional methods. TTVAE facilitates the integration of interpolation within the latent space during the data generation process. Specifically, TTVAE is trained once, establishing a low-dimensional representation of real data, and then various latent interpolation methods can efficiently generate synthetic latent points. Through extensive experiments on diverse datasets, TTVAE consistently achieves state-of-the-art performance, highlighting its adaptability across different feature types and data sizes. This innovative approach, empowered by the attention mechanism and the integration of interpolation, addresses the complex challenges of tabular data synthesis, establishing TTVAE as a powerful solution.

引用

页数：17

共 50 条

[41] Ship trajectory prediction using AIS data with TransFormer-based AI
Takahashi, Koya
Zama, Kaito
Hiroi, Noriko F.
2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1302 - 1305
[42] REFERENT: Transformer-based Feedback Generation using Assignment Information for Programming Course
Heo, Jinseok
Jeong, Hohyeon
Choi, Dongwook
Lee, Eunseok
2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING-SOFTWARE ENGINEERING EDUCATION AND TRAINING, ICSE-SEET, 2023, : 308 - 313
[43] Efficient Transformer-Based Compressed Video Modeling via Informative Patch Selection
Suzuki, Tomoyuki
Aoki, Yoshimitsu
SENSORS, 2023, 23 (01)
[44] Knowledge-Enhanced Conversational Recommendation via Transformer-Based Sequential Modeling
Zou, Jie
Sun, Aixin
Long, Cheng
Kanoulas, Evangelos
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (06)
[45] TransMRSR: transformer-based self-distilled generative prior for brain MRI super-resolution
Huang, Shan
Liu, Xiaohong
Tan, Tao
Hu, Menghan
Wei, Xiaoer
Chen, Tingli
Sheng, Bin
VISUAL COMPUTER, 2023, 39 (08): : 3647 - 3659
[46] RFormer: Transformer-Based Generative Adversarial Network for Real Fundus Image Restoration on a New Clinical Benchmark
Deng, Zhuo
Cai, Yuanhao
Chen, Lu
Gong, Zheng
Bao, Qiqi
Yao, Xue
Fang, Dong
Yang, Wenming
Zhang, Shaochong
Ma, Lan
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (09) : 4645 - 4655
[47] Transformer-Based Long Distance Fiber Channel Modeling for Optical OFDM Systems
Zhang, Niuyong
Yang, Hang
Niu, Zekun
Zheng, Lizhuo
Chen, Cao
Xiao, Shilin
Yi, Lilin
JOURNAL OF LIGHTWAVE TECHNOLOGY, 2022, 40 (24) : 7779 - 7789
[48] TransMRSR: transformer-based self-distilled generative prior for brain MRI super-resolution
Shan Huang
Xiaohong Liu
Tao Tan
Menghan Hu
Xiaoer Wei
Tingli Chen
Bin Sheng
The Visual Computer, 2023, 39 : 3647 - 3659
[49] Deterministic Autoencoder using Wasserstein loss for tabular data generation
Wang, Alex X.
Nguyen, Binh P.
NEURAL NETWORKS, 2025, 185
[50] A Deep Learning-Based Pipeline for the Generation of Synthetic Tabular Data
Panfilo, Daniele
Boudewijn, Alexander
Saccani, Sebastiano
Coser, Andrea
Svara, Borut
Chauvenet, Carlo Rossi
Mami, Ciro Antonio
Medvet, Eric
IEEE ACCESS, 2023, 11 : 63306 - 63323

← 1 2 3 4 5 →