Visual Basic Component Object Model

Object-Aware Image Augmentation for Audio-Visual Zero-Shot Learning

Abstract: Audio-visual zero-shot learning (ZSL) leverages both video and audio information for model training, aiming to classify new video categories that were not seen during the training. However, ...

GitHub

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Note: This model has been trained for approximately 2.7M steps (batch size = 1) and is still in the training process. I have attached a .ipynb file in the repository. You can refer to it to know how ...

IEEE

MambaEVT: Event Stream based Visual Object Tracking using State Space Model

Abstract: Event camera-based visual tracking has drawn more and more attention in recent years due to the unique imaging principle and advantages of low energy consumption, high dynamic range, and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Object-Aware Image Augmentation for Audio-Visual Zero-Shot Learning

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

MambaEVT: Event Stream based Visual Object Tracking using State Space Model

Trending now