Mahesh's webpage

Computer Science PhD student,
University at Buffalo, The State University of New York

prof_pic.jpg

301B Davis Hall

UB North Campus

Amherst, NY, 14260.

I am a Computer Science PhD candidate advised by Dr. David Doermann. Prior to this, I completed my Masters in CS, also at UB, and B.Tech at Walchand College of Engineering, Shivaji University, India. I developed deep-learning models for filesystems at Veritas Technologies LLC, where I was fortunate to be advised by Anindya Banerjee.

My current research focuses on image/video synthesis via diffusion models, and VQA and fairness issues of Multi-Modal Large Language Models (MLLMs).

Education

  • PhD in Computer Science - University at Buffalo, The State University of New York (2023 - Expected 2027)
  • Masters in Computer Science - University at Buffalo, The State University of New York (2021 - 2023)
  • B.Tech in Information Technology - Walchand College of Engineering, Shivaji University, India (2013 - 2017)

Professional Experience

  • Johns Hopkins University
    Visiting Research Scholar - Johns Hopkins University (June 2026 - August 2026)

    Research Topics: Multi-video understanding.

  • University at Buffalo
    Research Assistant & Lab Manager - A2IL Lab, University at Buffalo (2022 - Present)

    Research Topics: Multi-modal generative AI.

  • Veritas Technologies
    Software Engineer - Veritas Technologies LLC (2017 - 2021)

    Developed deep-learning models for storage filesystems. Reduced execution time of resource-intensive tasks by 56%.

news

Jun 01, 2026 Excited to be joining Johns Hopkins University as a Visiting Research Scholar (June – August 2026), working on multi-video understanding.
May 15, 2026 Two papers accepted to the ACL 2026 Multimodal Augmented Generation Workshop — CRAFT and TRACE.
Mar 15, 2026 One paper accepted as Oral (≤ 8% of submitted papers) at MIDL 2026 — Category-wise Structured Radiology Report Generation with Contrastive Decoding.
Feb 27, 2026 One paper accepted to CVPR 2026 — FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants.
Sep 25, 2025 One paper accepted to NeurIPS 2025 — AutoEdit: Automatic Hyperparameter Tuning for Image Editing.

latest posts

selected publications

  1. FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants
    Mahesh Bhosale, Abdul Wasi, Shantam Shrivastva, and 5 more authors
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2026
  2. CRAFT: Critic-Refined Adaptive Key-Frame Targeting for Multimodal Video Question Answering
    Mahesh Bhosale, Abdul Wasi, Vishvesh Trivedi, and 3 more authors
    In ACL Multimodal Augmented Generation via MultimodAl Retrieval Workshop , 2026
  3. TRACE: Evidence Grounding-Guided Multi-Video Event Understanding and Claim Generation
    Pengyu Yan, Akhil V S S Gorugantu, Mahesh Bhosale, and 3 more authors
    In ACL Multimodal Augmented Generation via MultimodAl Retrieval Workshop , 2026
  4. PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions
    Mahesh Bhosale, Abdul Wasi, Yuanhao Zhai, and 7 more authors
    In IEEE/CVF International Conference on Computer Vision , 2025
  5. AutoEdit: Automatic Hyperparameter Tuning for Image Editing
    Chau Pham, Quan Dao, Mahesh Bhosale, and 3 more authors
    In Neural Information Processing Systems , 2025