Heterogeneous Ensemble Learning for Context-Aware Image Captioning with Transformers