Meta Learning for Imbalanced Big Data Analysis by Using Generative Adversarial Networks
Title | Meta Learning for Imbalanced Big Data Analysis by Using Generative Adversarial Networks |
Publication Type | Conference Paper |
Year of Publication | 2018 |
Authors | Seo, Sanghyun, Jeon, Yongjin, Kim, Juntae |
Conference Name | Proceedings of the 2018 International Conference on Big Data and Computing |
Publisher | ACM |
ISBN Number | 978-1-4503-6426-3 |
Keywords | Generative Adversarial Learning, generative adversarial network, Imbalanced Big Data Analysis, Meta Learning, Metrics, Oversampling, pubcrawl, Resiliency, Scalability |
Abstract | Imbalanced big data means big data where the ratio of a certain class is relatively small compared to other classes. When the machine learning model is trained by using imbalanced big data, the problem with performance drops for the minority class occurs. For this reason, various oversampling methodologies have been proposed, but simple oversampling leads to problem of the overfitting. In this paper, we propose a meta learning methodology for efficient analysis of imbalanced big data. The proposed meta learning methodology uses the meta information of the data generated by the generative model based on Generative Adversarial Networks. It prevents the generative model from becoming too similar to the real data in minority class. Compared to the simple oversampling methodology for analyzing imbalanced big data, it is less likely to cause overfitting. Experimental results show that the proposed method can efficiently analyze imbalanced big data. |
URL | https://dl.acm.org/citation.cfm?doid=3220199.3220205 |
DOI | 10.1145/3220199.3220205 |
Citation Key | seo_meta_2018 |