Open-source implementation of WFS-SB, a training-free frame selection framework for long-video understanding with LVLMs. Long videos contain heavy frame redundancy, while Large Vision-Language Models ...
Some results have been hidden because they may be inaccessible to you