Official Code for the paper "Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning". Paper accepted at European Conference on Computer Vision (ECCV), 2022
CMSF extends a prior self-supervised representation learning method(MSF) where a sample is encouraged to be close to not just its augmented version but also to the nearest neighbors of the augmented image. In CMSF, the neighbors are constrained to be from the same semantic category as the input image. Use of constraint provides samples that are far from the target image in the feature space but close in the semantic space. The category labels are present in the supervised set-up and are predicted in the semi- and self-supervised set-ups.
All our experiments use the PyTorch library. We recommend installing the following package versions:
- python=3.7.6
- pytorch=1.4
- torchvision=0.5.0
- faiss-gpu=1.6.1 (required for k-NN evaluation alone)
Instructions for PyTorch installation can be found here.
GPU version of the FAISS (https://github.com/facebookresearch/faiss) package is necessary for k-NN evaluation of trained models. It can be installed using the following command:
pip install faiss-gpu
We provide scripts to run the self-supervised, semi-supervised and fully supervised CMSF experiments. The provided scripts include the default values of hyperparameters used to report the results in the paper. Modify the path to dataset to run the codes.
Self-supservised CMSF
bash run_selfsup.sh
Semi-supervised CMSF
bash run_semisup_fullprecision.sh
The script includes code for training the semi-supervised version and performing the k-NN and linear evaluation on the final checkpoint. Modify the arguments to run only the training / testing codes.
Supservised CMSF
bash run_supervised.sh
- Add multi-crop codes.
- Add results and pretrained models.
If you make use of the code, please cite the following work:
@inproceedings{navaneet2022constrained,
title={Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning},
author={KL Navaneet and Soroush Abbasi Koohpayegani and Ajinkya Tejankar and Kossar Pourahmadi and Akshayvarun Subramanya and Hamed Pirsiavash},
year={2022},
booktitle={European Conference on Computer Vision (ECCV)}
}
This project is under the MIT license.