Arijit Ghosh

I am a (happy) first-year Ph.D. student in Computer Vision at Institut Polytechnique de Paris (IMAGINE Lab) under the supervision of David Picard. My main research interests are multimodal generative models and how well such models generalize. Prior to my Ph.D., I completed my M.Sc. in Artificial Intelligence at University Erlangen-Nürnberg, Germany.

Publications

How far can we go with ImageNet for Text-to-Image generation?

Lucas Degeorge, Arijit Ghosh, Nicolas Dufour, Vicky Kalogeiton, David Picard

* Equal contribution

arXiv, 2025

Recent text-to-image (T2I) generation models have achieved remarkable results by training on billion-scale datasets, following a 'bigger is better' paradigm that prioritizes data quantity over quality. We challenge this established paradigm by demonstrating that strategic data augmentation of small, well-curated datasets can match or outperform models trained on massive web-scraped collections. Using only ImageNet enhanced with well-designed text and image augmentations, we achieve a +2 overall score over SD-XL on GenEval and +5 on DPGBench while using just 1/10th the parameters and 1/1000th the training images. Our results suggest that strategic data augmentation, rather than massive datasets, could offer a more sustainable path forward for T2I generation.

Education

Ph.D. in Computer Vision
Institut Polytechnique de Paris
2024 - 2027

M.Sc. in Artificial Intelligence
University Erlangen-Nürnberg
Grade: 1.30 (Best: 1.0, Worst: 4.0)
2021 - 2024

B.Tech. in Electronics and Communication Engineering
Maulana Abul Kalam Azad National Institute of Technology, West Bengal
Grade: 9.24 (Best: 10.0, Worst: 5.0)
2017 - 2021

Experience

Research Assistant

IMAGINE Lab, IP Paris

Jun 2024 - Nov 2024

Student Research Assistant

IDEA Lab, University Erlangen-Nürnberg

Jan 2022 - May 2024

Student Research Assistant

Fraunhofer IIS, Nürnberg

Mar 2023 - May 2024

Talks

Weakly Supervised Learning

Representation Learning

Teaching

Teaching Assistant

University Erlangen-Nürnberg

Course: Algorithms, Programming and Data Representation