I am a researcher in Computer Vision. I work on Scene Understanding and Affective Computing.
I am a visiting researcher at the Massachusetts Institute of Technology (CSAIL, MIT), working with Prof. Antonio Torralba. I am also a member of the BCN Perceptual Computing Lab and the Computer Vision Center.
Our work on "Class Activation Map" accepted at CVPR 2016We revisit the global average pooling layer and shed light on how it explicitly enables the convolutional neural network to have remarkable localization ability despite being trained on image-level labels (Project Page).
Scene recognition demoGiven a picture our system predicts the scene category and some other attibutes. It also provides a heatmap that indicates the region of the image that supports the ouputs.
Understanding the representations learned by CNNsWe found that object detectors emerged in a CNN trained for scene recognition. For more information check our paper: B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba. “Object Detectors Emerge in Deep Scene CNNs.” International Conference on Learning Representations (ICLR) oral, 2015. (PDF).
Project page of Places DatabaseYou can download the database and the pretrained network PlacesCNN. More details can be found in our paper: B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. “Learning Deep Features for Scene Recognition using Places Database.” Advances in Neural Information Processing Systems 27 (NIPS), 2014. (PDF).
Universitat Oberta de Catalunya,
Estudis d'Informàtica, Multimèdia i Telecomunicació
Rambla del Poblenou, 156
08018 Barcelona (Spain)