Skip to main content
Spatio-Temporal Statistics and Data Science
STSDS
Aerospace and Transportation Systems
Main navigation
Home
People
All Profiles
Principal Investigators
Research Scientists
Research Staff
Postdoctoral Fellows
Students
Alumni
Former Members
Events
All Events
Events Calendar
News
Projects
Topics
Courses
Theses
multimodal alignment
Towards Scalable and Efficient Semantic Video Search
Mattia Soldan, Ph.D., Electrical and Computer Engineering
Jul 13, 18:00
-
19:00
B4 L5 R5209
video-language grounding
semantic video retrieval
multimodal alignment
This dissertation advances fine-grained, content-aware video retrieval by developing novel models and frameworks for Video-Language Grounding, enabling accurate alignment between natural language queries and specific temporal segments in unstructured video content.