I am an assistant professor at UMBC studying computer vision and machine learning. Prior to this, I was a postdoctoral research associate with Antonio Torralba at Massachusetts Institute of Technology (MIT). I graduated with a PhD in computer science from the University of California Irvine where I was advised by Deva Ramanan and co-advised by Ramesh Jain. I obtained my M.Sc. in electrical engineering at Sharif University of Technology in Tehran, Iran (map).
![]() |
ISD: Self-Supervised Learning by Iterative Similarity Distillation
|
|||||
![]() |
Explainable Models with Consistent Interpretations
|
|||||
![]() |
CompRess: Self-Supervised Learning by Compressing Representations
|
|||||
![]() |
||||||
![]() |
Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs
|
|||||
![]() |
||||||
![]() |
||||||
![]() |
||||||
![]() |
Boosting Self-Supervised Learning via Knowledge Transfer
|
|||||
![]() |
Canonical correlation analysis of brain prefrontal activity measured by functional near infra-red spectroscopy (fNIRS) during a moral judgment task
|
|||||
![]() |
Cross-Modal Scene Networks
|
|||||
![]() |
Representation Learning by Learning to Count
|
|||||
![]() |
Weakly Supervised Cascaded Convolutional Networks
|
|||||
Automated Detection of Substance Use-Related Social Media Posts Based on Image and Text Analysis
|
||||||
![]() |
Generating Videos with Scene Dynamics
|
|||||
![]() |
Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks
|
|||||
![]() |
Anticipating Visual Representations From Unlabeled Video
|
|||||
![]() |
Learning Aligned Cross-Modal Representations from Weakly Aligned Data
|
|||||
![]() |
||||||
![]() |
DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns
|
|||||
![]() |
Visualizing Object Detection Features
|
|||||
![]() |
Learning Visual Biases from Human Imagination
|
|||||
![]() |
Assessing the Quality of Actions
|
|||||
![]() |
Parsing Videos of Actions with Segmental Grammars
|
|||||
![]() |
Are All Training Examples Equally Valuable?
|
|||||
![]() |
||||||
![]() |
Detecting Activities of Daily Living in First-person Camera Views
|
|||||
![]() |
Steerable Part Models
|
|||||
![]() |
Globally-Optimal Greedy Algorithms for Tracking a Variable Number of Objects
|
|||||
![]() |
A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video
|
|||||
![]() |
TalkMiner: A Lecture Webcast Search Engine
|
|||||
![]() |
Bilinear Classifiers for Visual Recognition
|
|||||
![]() |
Opti-Acoustic Stereo Imaging: On System Calibration and 3-D Target Reconstruction
|
|||||
![]() |
Towards Environment-to-Environment (E2E) Multimedia Communication Systems
|
|||||
![]() |
Opti-Acoustic Stereo Imaging, System Calibration and 3-D Reconstruction
|
|||||
Please check out the list of older publications. |