An ensemble model with hierarchical decomposition and aggregation for highly scalable and robust classification
Title | An ensemble model with hierarchical decomposition and aggregation for highly scalable and robust classification |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Vu, Q. H., Ruta, D., Cen, L. |
Conference Name | 2017 Federated Conference on Computer Science and Information Systems (FedCSIS) |
Date Published | sep |
Keywords | binary classification problem, compositionality, Computational modeling, computer game, computer games, Data models, Decision trees, decomposition, Deep Learning, ensemble model, extreme gradient boosted decision trees, feature extraction, flexible robust scheme, game state information, Hearthstone, highly scalable classification, learning (artificial intelligence), logistic regression, Logistics, machine learning, Metrics, model decomposition, pattern classification, Predictive models, pubcrawl, regression analysis, sub-model integration, sub-model training, Training, Xgboost |
Abstract | This paper introduces an ensemble model that solves the binary classification problem by incorporating the basic Logistic Regression with the two recent advanced paradigms: extreme gradient boosted decision trees (xgboost) and deep learning. To obtain the best result when integrating sub-models, we introduce a solution to split and select sets of features for the sub-model training. In addition to the ensemble model, we propose a flexible robust and highly scalable new scheme for building a composite classifier that tries to simultaneously implement multiple layers of model decomposition and outputs aggregation to maximally reduce both bias and variance (spread) components of classification errors. We demonstrate the power of our ensemble model to solve the problem of predicting the outcome of Hearthstone, a turn-based computer game, based on game state information. Excellent predictive performance of our model has been acknowledged by the second place scored in the final ranking among 188 competing teams. |
URL | https://annals-csis.org/proceedings/2017/drp/564.html |
DOI | 10.15439/2017F564 |
Citation Key | vu_ensemble_2017 |
- highly scalable classification
- Xgboost
- Training
- sub-model training
- sub-model integration
- regression analysis
- pubcrawl
- Predictive models
- pattern classification
- model decomposition
- Metrics
- machine learning
- Logistics
- logistic regression
- learning (artificial intelligence)
- binary classification problem
- Hearthstone
- game state information
- flexible robust scheme
- feature extraction
- extreme gradient boosted decision trees
- ensemble model
- deep learning
- decomposition
- Decision trees
- Data models
- computer games
- computer game
- Computational modeling
- Compositionality