Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Image Classification using Model Ensembling


Affiliations
1 Department of Computer Science, St. Xavier’s College, India
2 Department of Computer Application, RCC Institute of Information Technology, India
3 Department of Computer Science and Engineering, RCC Institute of Information Technology, India
     

   Subscribe/Renew Journal


Classifying images efficiently using various algorithms is very useful now-a-days given that the field of computer vision is growing rapidly. The research work highlighted in this paper focuses on the independent use of various models to classify images and then combining them together to form a better model in terms of performance than each of the individual models. The dataset used consists of 200 classes with 90,000 training images, 10,000 validation images and 10,000 test images. The data preparation step in this work involves resizing the images (data), shuffling them and transforming them into a data generator to provide input to the models. The images were also augmented using two different sets of image transformation effects to get more data for the models to train on. These data were then used to train five different models (one model trained from scratch and four other models using pre-trained weights and transfer learning) independently. The performance of each model was judged by checking two evaluation metrics – f1-score and categorical accuracy. The models were also tried to be fine-tuned to get a better performance, and finally the models were ensembled together to get a better categorical accuracy and f1-score on unseen (validation and test) data.

Keywords

Image Classification, Convolutional Neural Networks, Image Augmentation, Model Ensembling, F1-Score
Subscription Login to verify subscription
User
Notifications
Font Size


  • Image Classification using Model Ensembling

Abstract Views: 306  |  PDF Views: 1

Authors

Debabrata Datta
Department of Computer Science, St. Xavier’s College, India
Anweshan Mukherjee
Department of Computer Science, St. Xavier’s College, India
Soumen Mukherjee
Department of Computer Application, RCC Institute of Information Technology, India
Arup Kr. Bhattacharjee
Department of Computer Science and Engineering, RCC Institute of Information Technology, India
Anal Acharya
Department of Computer Science, St. Xavier’s College, India

Abstract


Classifying images efficiently using various algorithms is very useful now-a-days given that the field of computer vision is growing rapidly. The research work highlighted in this paper focuses on the independent use of various models to classify images and then combining them together to form a better model in terms of performance than each of the individual models. The dataset used consists of 200 classes with 90,000 training images, 10,000 validation images and 10,000 test images. The data preparation step in this work involves resizing the images (data), shuffling them and transforming them into a data generator to provide input to the models. The images were also augmented using two different sets of image transformation effects to get more data for the models to train on. These data were then used to train five different models (one model trained from scratch and four other models using pre-trained weights and transfer learning) independently. The performance of each model was judged by checking two evaluation metrics – f1-score and categorical accuracy. The models were also tried to be fine-tuned to get a better performance, and finally the models were ensembled together to get a better categorical accuracy and f1-score on unseen (validation and test) data.

Keywords


Image Classification, Convolutional Neural Networks, Image Augmentation, Model Ensembling, F1-Score

References