Sule Bai | 白苏乐

Hi! I'm a second-year master's student at Tsinghua University Shenzhen International Graduate School, supervised by Prof. Yansong Tang. Before that, I received my bachelor's degree from the School of Computer Science and Technology at Beijing University of Posts and Telecommunications in 2023.

My current research interests lie in open-vocabulary segmentation, vision-language models, and video understanding.

Email / GitHub / Scholar

Publications

* indicates equal contribution

Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Sule Bai*, Yong Liu*, Yifei Han, Haoji Zhang, Yansong Tang
arXiv Preprint, 2024
[Paper] [Code]

We propose a training-free method that enhances CLIP's segmentation performance through self-calibration without introducing new parameters or relying on additional backbones.

Open-Vocabulary Segmentation with Semantic-Assisted Calibration
Yong Liu*, Sule Bai*, Guanbin Li, Yitong Wang, Yansong Tang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper] [Code]

We propose an open-vocabulary segmentation (OVS) method that calibrates the in-vocabulary and domain-biased embedding space with the generalized contextual prior of CLIP.

Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
Shiyi Zhang*, Sule Bai*, Guangyi Chen, Lei Chen, Jiwen Lu, Junle Wang, Yansong Tang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper] [Code]

We investigate a new problem called narrative action evaluation (NAE) and propose a prompt-guided multimodal interaction framework.

© Sule Bai | Last updated: Mar. 2025