publications
2025
- EMNLP
Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning. In Empirical Methods in Natural Language Processing (EMNLP), 2025
- MM
SIDA: Synthetic Image Driven Zero-shot Domain Adaptation. In ACM Multimedia, 2025
- MM
SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning. In ACM Multimedia, 2025
- CVPR
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2025