I am a researcher in Computer Vision. I work on Scene Understanding and Affective Computing.
I am a visiting researcher at the Massachusetts Institute of Technology (MIT) Media Lab, working with Prof. Rosalind Picard in the Affective Computing group. I am also a member of the BCN Perceptual Computing Lab and the Computer Vision Center.
From 2012 until 2015, I was a visiting researcher at MIT CSAIL, working with Prof. Antonio Torralba. I did my PhD in Computer Science at the Universitat Autonoma de Barcelona and my BS degree in Mathematics at the Universitat de Barcelona.
Our work on "Class Activation Map" accepted at CVPR 2016
We revisit the global average pooling layer and shed light on how it explicitly enables the convolutional neural network to have remarkable localization ability despite being trained on image-level labels (Project Page).
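As a rough illustration of why global average pooling gives localization, here is a minimal sketch in plain Python. It assumes the last convolutional layer outputs K feature maps and that a single linear layer follows the pooling; the function names `class_activation_map` and `class_score` and the toy values are hypothetical and chosen only for this example, not taken from the released code.

```python
def class_activation_map(feature_maps, class_weights):
    """Weighted sum of the K feature maps for one class.

    feature_maps: list of K maps, each an H x W nested list of floats.
    class_weights: list of K floats (the linear-layer weights for that class).
    Returns an H x W nested list: one activation value per spatial location.
    """
    height = len(feature_maps[0])
    width = len(feature_maps[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for fmap, weight in zip(feature_maps, class_weights):
        for y in range(height):
            for x in range(width):
                cam[y][x] += weight * fmap[y][x]
    return cam


def class_score(feature_maps, class_weights):
    """Class score computed the usual way: average-pool each map, then apply the weights."""
    score = 0.0
    for fmap, weight in zip(feature_maps, class_weights):
        num_positions = len(fmap) * len(fmap[0])
        pooled = sum(sum(row) for row in fmap) / num_positions
        score += weight * pooled
    return score


# Toy input (assumed values): two 2x2 feature maps and the weights for one class.
toy_maps = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 0.0], [0.0, 2.0]],
]
toy_weights = [0.5, -1.0]

cam = class_activation_map(toy_maps, toy_weights)
score = class_score(toy_maps, toy_weights)
cam_mean = sum(sum(row) for row in cam) / 4
```

In this sketch the class score equals the spatial mean of the map, so each location's value can be read as its contribution to the prediction. Upsampling the map to the input image size (not shown) would give a heatmap over the image.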
Scene recognition demo
Given a picture, our system predicts the scene category and some other attributes. It also provides a heatmap that indicates the region of the image that supports the outputs.
Understanding the representations learned by CNNs
We found that object detectors emerged in a CNN trained for scene recognition. For more information check our paper: B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba. “Object Detectors Emerge in Deep Scene CNNs.” International Conference on Learning Representations (ICLR) oral, 2015. (PDF).
Project page of Places Database
You can download the database and the pretrained network PlacesCNN. More details can be found in our paper: B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. “Learning Deep Features for Scene Recognition using Places Database.” Advances in Neural Information Processing Systems 27 (NIPS), 2014. (PDF).
Universitat Oberta de Catalunya,
Estudis d'Informàtica, Multimèdia i Telecomunicació
Rambla del Poblenou, 156
08018 Barcelona (Spain)