A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens Paper • 2604.04913 • Published Apr 6 • 11
Article How I contributed a new model to the Transformers library using Codex Mar 30 • 50
PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders Paper • 2603.25398 • Published Mar 26 • 3