TTVAE: Transformer-based generative modeling for tabular data generation

被引:1
|
作者
Wang, Alex X. [1 ]
Nguyen, Binh P. [1 ,2 ]
机构
[1] Victoria Univ Wellington, Sch Math & Stat, Wellington 6012, New Zealand
[2] Ho Chi Minh City Open Univ, Fac Informat Technol, 97 Vo Van Tan,Dist 3, Ho Chi Minh City 70000, Vietnam
关键词
Generative AI; Tabular data; Transformer; Latent space interpolation; SMOTE;
D O I
10.1016/j.artint.2025.104292
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tabular data synthesis presents unique challenges, with Transformer models remaining underexplored despite the applications of Variational Autoencoders and Generative Adversarial Networks. To address this gap, we propose the Transformer-based Tabular Variational AutoEncoder (TTVAE), leveraging the attention mechanism for capturing complex data distributions. The inclusion of the attention mechanism enables our model to understand complex relationships among heterogeneous features, a task often difficult for traditional methods. TTVAE facilitates the integration of interpolation within the latent space during the data generation process. Specifically, TTVAE is trained once, establishing a low-dimensional representation of real data, and then various latent interpolation methods can efficiently generate synthetic latent points. Through extensive experiments on diverse datasets, TTVAE consistently achieves state-of-the-art performance, highlighting its adaptability across different feature types and data sizes. This innovative approach, empowered by the attention mechanism and the integration of interpolation, addresses the complex challenges of tabular data synthesis, establishing TTVAE as a powerful solution.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Ship trajectory prediction using AIS data with TransFormer-based AI
    Takahashi, Koya
    Zama, Kaito
    Hiroi, Noriko F.
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1302 - 1305
  • [42] REFERENT: Transformer-based Feedback Generation using Assignment Information for Programming Course
    Heo, Jinseok
    Jeong, Hohyeon
    Choi, Dongwook
    Lee, Eunseok
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING-SOFTWARE ENGINEERING EDUCATION AND TRAINING, ICSE-SEET, 2023, : 308 - 313
  • [43] Efficient Transformer-Based Compressed Video Modeling via Informative Patch Selection
    Suzuki, Tomoyuki
    Aoki, Yoshimitsu
    SENSORS, 2023, 23 (01)
  • [44] Knowledge-Enhanced Conversational Recommendation via Transformer-Based Sequential Modeling
    Zou, Jie
    Sun, Aixin
    Long, Cheng
    Kanoulas, Evangelos
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (06)
  • [45] TransMRSR: transformer-based self-distilled generative prior for brain MRI super-resolution
    Huang, Shan
    Liu, Xiaohong
    Tan, Tao
    Hu, Menghan
    Wei, Xiaoer
    Chen, Tingli
    Sheng, Bin
    VISUAL COMPUTER, 2023, 39 (08): : 3647 - 3659
  • [46] RFormer: Transformer-Based Generative Adversarial Network for Real Fundus Image Restoration on a New Clinical Benchmark
    Deng, Zhuo
    Cai, Yuanhao
    Chen, Lu
    Gong, Zheng
    Bao, Qiqi
    Yao, Xue
    Fang, Dong
    Yang, Wenming
    Zhang, Shaochong
    Ma, Lan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (09) : 4645 - 4655
  • [47] Transformer-Based Long Distance Fiber Channel Modeling for Optical OFDM Systems
    Zhang, Niuyong
    Yang, Hang
    Niu, Zekun
    Zheng, Lizhuo
    Chen, Cao
    Xiao, Shilin
    Yi, Lilin
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2022, 40 (24) : 7779 - 7789
  • [48] TransMRSR: transformer-based self-distilled generative prior for brain MRI super-resolution
    Shan Huang
    Xiaohong Liu
    Tao Tan
    Menghan Hu
    Xiaoer Wei
    Tingli Chen
    Bin Sheng
    The Visual Computer, 2023, 39 : 3647 - 3659
  • [49] Deterministic Autoencoder using Wasserstein loss for tabular data generation
    Wang, Alex X.
    Nguyen, Binh P.
    NEURAL NETWORKS, 2025, 185
  • [50] A Deep Learning-Based Pipeline for the Generation of Synthetic Tabular Data
    Panfilo, Daniele
    Boudewijn, Alexander
    Saccani, Sebastiano
    Coser, Andrea
    Svara, Borut
    Chauvenet, Carlo Rossi
    Mami, Ciro Antonio
    Medvet, Eric
    IEEE ACCESS, 2023, 11 : 63306 - 63323