Mask-Adapter

The Devil is in the Masks for Open-Vocabulary Segmentation

[Yongkang Li](https://owl-10.github.io/yongkangli/)\*, [Tianheng Cheng](https://scholar.google.com/citations?user=PH8rJHYAAAAJ&hl=zh-CN)\*, [Bin Feng](https://scholar.google.com/citations?user=nRc8u6gAAAAJ&hl=zh-CN), [Wenyu Liu](http://eic.hust.edu.cn/professor/liuwenyu), [Xinggang Wang](https://xwcv.github.io/)📧

Huazhong University of Science and Technology

CVPR 2025

(\* equal contribution, 📧 corresponding author)

[![arxiv paper](https://img.shields.io/badge/CVPR-Paper-blue)](https://openaccess.thecvf.com/content/CVPR2025/papers/Li_Mask-Adapter_The_Devil_is_in_the_Masks_for_Open-Vocabulary_Segmentation_CVPR_2025_paper.pdf) [![arxiv paper](https://img.shields.io/badge/arXiv-Paper-red)](https://arxiv.org/abs/2412.04533) [![checkpoints](https://img.shields.io/badge/HuggingFace-🤗_Weight-orange)](https://huggingface.co/owl10/Mask-Adapter) [![🤗 HuggingFace Demo](https://img.shields.io/badge/Mask_Adapter-🤗_HF_Demo-orange)](https://huggingface.co/spaces/wondervictor/Mask-Adapter)

Highlights

Updates

Getting Started

Models

| Model | Backbone | A-847 | A-150 | PC-459 | PC-59 | PAS-20 | Download |
| --- | --- | --- | --- | --- | --- | --- | --- |
| FC-CLIP | ConvNeXt-L | 14.8 | 34.1 | 18.2 | 58.4 | 95.4 | model |
| FC-CLIP + Mask-Adapter | ConvNeXt-L | 14.1 | 36.6 | 19.3 | 59.7 | 95.5 | model |
| MAFTP-Base | ConvNeXt-B | 13.8 | 34.5 | 18.5 | 57.5 | 95.5 | model |
| MAFTP-Base + Mask-Adapter | ConvNeXt-B | 14.2 | 35.6 | 17.9 | 58.4 | 95.1 | model |
| MAFTP-Large | ConvNeXt-L | 15.5 | 36.3 | 21.2 | 59.5 | 96.4 | model |
| MAFTP-Large + Mask-Adapter | ConvNeXt-L | 16.2 | 38.2 | 22.7 | 60.4 | 95.8 | model |
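
To make the effect of Mask-Adapter easier to read off the rows above, the following minimal sketch computes the per-benchmark difference between each baseline and its "+ Mask-Adapter" counterpart. The scores are copied from the table; the names `BENCHMARKS`, `SCORES`, and `adapter_deltas` are illustrative and not part of this repository.

```python
# Benchmark columns, in the same order as the table above.
BENCHMARKS = ["A-847", "A-150", "PC-459", "PC-59", "PAS-20"]

# model -> (baseline row, "+ Mask-Adapter" row), values copied from the table.
SCORES = {
    "FC-CLIP": ([14.8, 34.1, 18.2, 58.4, 95.4], [14.1, 36.6, 19.3, 59.7, 95.5]),
    "MAFTP-Base": ([13.8, 34.5, 18.5, 57.5, 95.5], [14.2, 35.6, 17.9, 58.4, 95.1]),
    "MAFTP-Large": ([15.5, 36.3, 21.2, 59.5, 96.4], [16.2, 38.2, 22.7, 60.4, 95.8]),
}


def adapter_deltas(scores):
    """Return {model: {benchmark: adapter_score - baseline_score}}."""
    return {
        model: {
            bench: round(with_adapter - baseline, 1)  # table has one decimal
            for bench, baseline, with_adapter in zip(BENCHMARKS, base_row, adapter_row)
        }
        for model, (base_row, adapter_row) in scores.items()
    }


DELTAS = adapter_deltas(SCORES)

if __name__ == "__main__":
    for model, row in DELTAS.items():
        print(model, "  ".join(f"{bench}: {delta:+.1f}" for bench, delta in row.items()))
```

With these numbers, the difference is positive on A-150 and PC-59 for all three baselines, while A-847, PC-459, and PAS-20 each show at least one small decrease.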

Citation

If you find Mask-Adapter useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entry.

@article{li2024maskadapter,
      title={Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation}, 
      author={Yongkang Li and Tianheng Cheng and Bin Feng and Wenyu Liu and Xinggang Wang},
      year={2024},
      eprint={2412.04533},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2412.04533}, 
}

License

All code in this repository is released under the Apache License 2.0.

Acknowledgement

Mask-Adapter is based on the following projects: detectron2, Mask2Former, FC-CLIP and MAFTP. Many thanks for their excellent contributions to the community.