75
The digital age has equipped financial institutions with vast amounts of data. Privacy concerns have posed challenges to harnessing this data's full potential. Generation of synthetic data is one of the most promising solutions for allowing analysis of the patterns and trends contained in this data without compromising privacy. Although initial methods for generating synthetic data were basic, emerging generative models have expanded the possibilities. However, generating synthetic data for unique datasets, like bank transaction sequences, remains challenging. These sequences exhibit complex variability driven by the various customer transaction behaviors, distinguishing them from the more predictable patterns in other data types. We propose BankGAN, an innovative conditional tabular GAN architecture designed specifically for synthesizing bank transaction sequences that exhibit non-uniform date patterns. We show that BankGAN outperforms a recurrent neural network (RNN)-based model in achieving superior statistical resemblance to real data. Moreover, it excels at replicating features of periodic transactions, surpassing both the RNN and transformer-based models. BankGAN distinguishes itself by generating privacy-preserving synthetic data without compromising data quality—a stark contrast to the existing models where adding privacy-preserving guarantees typically degrades performance.
Article ID: 2024S7
Month: May
Year: 2024
Address: Online
Venue: The 37th Canadian Conference on Artificial Intelligence
Publisher: Canadian Artificial Intelligence Association
URL: https://caiac.pubpub.org/pub/dnup7l7j