Mask2Former for Bird's Eye View Representation

The project aims to develop a model that learns a separate query for each vehicle in the bird's eye view (BEV) from multi-camera images. These queries give compact, interpretable vehicle representations for downstream tasks such as 3D object detection, tracking, and motion forecasting.

Learned Representations

Different queries successfully learn to identify vehicles in the scene.
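
To make the idea of per-vehicle queries concrete, the sketch below shows how a Mask2Former-style decoder typically turns query embeddings into masks: each query embedding is dotted with the feature vector of every BEV cell, and a sigmoid converts the result into a per-cell probability. It uses plain Python lists in place of tensors. The function names (`query_masks`, `assign_cells`), the toy values, and the 0.5 threshold are illustrative assumptions, not code from this repository.

```python
import math


def query_masks(queries, bev_features):
    """Return one probability mask per query.

    queries:      list of Q embeddings, each a list of C floats.
    bev_features: H x W grid, each cell a list of C floats.
    """
    masks = []
    for q in queries:
        mask = []
        for row in bev_features:
            # Dot product between the query and each cell's feature vector.
            logits = [sum(qc * fc for qc, fc in zip(q, cell)) for cell in row]
            mask.append([1.0 / (1.0 + math.exp(-z)) for z in logits])
        masks.append(mask)
    return masks


def assign_cells(masks, threshold=0.5):
    """Label each BEV cell with its most confident query, or -1 for background."""
    height, width = len(masks[0]), len(masks[0][0])
    labels = [[-1] * width for _ in range(height)]
    for i in range(height):
        for j in range(width):
            best = max(range(len(masks)), key=lambda k: masks[k][i][j])
            if masks[best][i][j] > threshold:
                labels[i][j] = best
    return labels


# Toy example: 2 queries, a 2x2 BEV grid, 2 feature channels.
queries = [[4.0, -4.0], [-4.0, 4.0]]
bev = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 0.0], [1.0, 0.0]],
]
labels = assign_cells(query_masks(queries, bev))
print(labels)
```

In this toy grid, the cell with an all-zero feature vector scores exactly 0.5 for both queries and is therefore left as background, while the other cells are claimed by whichever query aligns with their features.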

1. Clone the Repository

git clone https://github.com/mrabiabrn/mask2former4bev.git
cd mask2former4bev

2. Set Up the Environment

Create a Conda environment and install the required dependencies:

conda create -n mask2former4bev
conda activate mask2former4bev
conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -r requirements.txt
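
After installation, it can help to confirm that the pinned versions actually ended up in the active environment. The following is a minimal sketch using only the standard library. The helper name `installed_versions` is hypothetical, and it assumes the conda packages register under the distribution names `torch`, `torchvision`, and `torchaudio`.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


# Versions pinned in the conda command above.
expected = {"torch": "2.1.2", "torchvision": "0.16.2", "torchaudio": "2.1.2"}
found = installed_versions(expected)
for name, want in expected.items():
    got = found[name]
    # Builds may carry a local suffix such as "+cu121", so compare by prefix.
    status = "ok" if got is not None and got.startswith(want) else "check"
    print(f"{name}: expected {want}, found {got} [{status}]")
```

A `check` status only means the version differs or the package was not found; it does not by itself indicate that training will fail.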

3. Download Dataset

Download the nuScenes dataset from this link and place it under root/to/nuscenes.
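
A quick sanity check of the download can catch a partial extraction before training starts. The sketch below assumes the standard nuScenes directory layout (`maps`, `samples`, `sweeps`, and a metadata folder such as `v1.0-trainval`). Which split this repository expects is not stated here, so the `version` default and the helper name `missing_nuscenes_dirs` are assumptions.

```python
from pathlib import Path


def missing_nuscenes_dirs(root, version="v1.0-trainval"):
    """Return the expected sub-directories that are absent under `root`."""
    expected = ["maps", "samples", "sweeps", version]
    return [name for name in expected if not (Path(root) / name).is_dir()]


missing = missing_nuscenes_dirs("root/to/nuscenes")
if missing:
    print("Missing directories:", ", ".join(missing))
else:
    print("Dataset layout looks complete.")
```

If the mini split was downloaded instead, passing `version="v1.0-mini"` checks for that metadata folder.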

4. Training

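Launch distributed training with torchrun. Replace <gpus> with the number of GPUs to use, and set --dataset_path to the directory where the dataset was downloaded in step 3:

torchrun --master_port 2245 --nproc_per_node=<gpus> train.py --dataset_path "root/to/dataset"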
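
For context, torchrun starts one worker process per GPU and exports `RANK`, `LOCAL_RANK`, and `WORLD_SIZE` to each of them. The sketch below shows one way a script could read these variables. The helper `read_torchrun_env` is hypothetical, and how train.py actually consumes them is not shown in this README.

```python
import os


def read_torchrun_env(env=None):
    """Read the process-layout variables that torchrun exports to each worker."""
    env = os.environ if env is None else env
    return {
        "rank": int(env.get("RANK", 0)),              # global index of this worker
        "local_rank": int(env.get("LOCAL_RANK", 0)),  # index on this machine
        "world_size": int(env.get("WORLD_SIZE", 1)),  # total number of workers
    }


info = read_torchrun_env()
print(f"worker {info['rank']} of {info['world_size']} (local GPU index {info['local_rank']})")
```

The defaults make the same code fall back to a single-process layout when the script is run directly with `python` rather than through torchrun.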

Acknowledgments

This repository incorporates code from several public works, including SimpleBEV, Mask2Former, and SOLV. Special thanks to the authors of these projects for making their code available.
