Mahesh's webpage
Computer Science PhD student,
University at Buffalo, The State University of New York
301B Davis Hall
UB North Campus
Amherst, NY, 14260.
I am a Computer Science PhD candidate advised by Dr. David Doermann. Prior to this, I completed my Master's in CS, also at UB, and my B.Tech at Walchand College of Engineering, Shivaji University, India. I developed deep-learning models for filesystems at Veritas Technologies LLC, where I was fortunate to be advised by Anindya Banerjee.
My current research focuses on image/video synthesis via diffusion models, and on visual question answering (VQA) and fairness in Multi-Modal Large Language Models (MLLMs).
Education
- PhD in Computer Science - University at Buffalo, The State University of New York (2023 - Expected 2027)
- Master's in Computer Science - University at Buffalo, The State University of New York (2021 - 2023)
- B.Tech in Information Technology - Walchand College of Engineering, Shivaji University, India (2013 - 2017)
Professional Experience
- Visiting Research Scholar - Johns Hopkins University (June 2026 - August 2026)
  Research Topics: Multi-video understanding.
- Research Assistant & Lab Manager - A2IL Lab, University at Buffalo (2022 - Present)
  Research Topics: Multi-modal generative AI.
- Software Engineer - Veritas Technologies LLC (2017 - 2021)
  Developed deep-learning models for storage filesystems. Reduced execution time of resource-intensive tasks by 56%.
news
| Date | News |
|---|---|
| Jun 01, 2026 | Excited to be joining Johns Hopkins University as a Visiting Research Scholar (June – August 2026), working on multi-video understanding. |
| May 15, 2026 | Two papers accepted to the ACL 2026 Multimodal Augmented Generation Workshop — CRAFT and TRACE. |
| Mar 15, 2026 | One paper accepted as Oral (≤ 8% of submitted papers) at MIDL 2026 — Category-wise Structured Radiology Report Generation with Contrastive Decoding. |
| Feb 27, 2026 | One paper accepted to CVPR 2026 — FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants. |
| Sep 25, 2025 | One paper accepted to NeurIPS 2025 — AutoEdit: Automatic Hyperparameter Tuning for Image Editing. |
latest posts
| Date | Post |
|---|---|
| Nov 28, 2024 | Distance metric for fairness in vision language models |
| Nov 28, 2024 | Quotidian commands |
| Nov 28, 2024 | Diffusion Models Background and Code |
selected publications
- AutoEdit: Automatic Hyperparameter Tuning for Image Editing. In Neural Information Processing Systems, 2025