Publications

(2024). Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration. In ArXiv.

Cite Preprint PDF

(2024). Towards Automated Movie Trailer Generation. In CVPR.

Cite Preprint PDF

(2023). Localizing Moments in Long Video via Multimodal Guidance. In ICCV.

Cite Preprint PDF Supplementary Material Code

(2023). Boundary-denoising for video activity localization. In ICLR.

Cite Preprint PDF Code

(2022). Egocentric Video-Language Pretraining. In NeurIPS.

Cite Preprint PDF Code

(2021). MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions. In CVPR.

Cite Preprint PDF Code Video

(2021). VLG-Net: Video-Language Graph Matching Network for Video Grounding. In ICCVW.

Cite Preprint PDF Supplementary Material Code Video

(2019). Finding Moments in Video Collections Using Natural Language. In ArXiv.

Cite Preprint PDF Code Video

(2019). Seq2Seq RNN based Gait Anomaly Detection from Smartphone Acquired Multimodal Motion Data. In ArXiv.

Cite Preprint PDF Code