Abstract: CLIP, a foundational vision-language model, has emerged as a powerful tool for open-vocabulary semantic segmentation. While freezing the text encoder preserves its powerful embeddings, ...
Abstract: Power transformers require fast and reliable protection to prevent catastrophic failures. However, collecting labeled fault data in real-world conditions is difficult, as transformers cannot ...