Large-size remote sensing images contain rich geographical information. Efficient and accurate semantic segmentation of these images is of significant importance in various fields. However, the ...
Abstract: Vision-language foundation models (VLMs) have shown great potential in feature transfer and generalization across a wide spectrum of medical-related downstream tasks. However, fine-tuning ...