publications
2025
- EMNLPSali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video CaptioningIn Empirical Methods in Natural Language Processing EMNLP, 2025
- MMSIDA: Synthetic Image Driven Zero-shot Domain AdaptationIn ACM Multimedia, 2025
- MMSynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image CaptioningIn ACM Multimedia, 2025
- CVPRVerbDiff: Text-Only Diffusion Models with Enhanced Interaction AwarenessIn IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2025