Bio
I mainly focus on machine learning models and deep learning methods for structured semantic understanding in videos and images (e.g., structured prediction). I believe our world is compositional and that humans don't perceive it as raw pixels. Moreover, structured models offer better generalization and useful inductive biases, which I find critical, especially at the intersection of vision, language, and robotics.
Research Interests:
Machine Learning & Deep Learning: Generative Models, Graph Neural Networks, Self-Supervised Learning.
Vision & Language: Video Understanding, Scene Understanding, Visual Reasoning.
Vision & Robotics: Semantic Understanding, Structured Representation, Transfer Learning.