Paper Review: Swin Transformer: Hierarchical Vision Transformer using Shifted Windows


Paper Review: Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

#swintransformer Paper Review: Swin Transformer: Hierarchical Vision Transformer using Shifted Windows - Maksim Zhdanov (xzcodes.github.io) Paper link Code available here (no implementation at the moment of writing this review) An amazing paper from Microsoft Research Asia presents a brand new vision Transformer called Swin Transformer that can serve as a backbone just like usual CNNs in computer vision and Transformers in natural language processing (NLP). There are two main problems with the usage of Transformers for computer vision. - Firstly, existing Transformer-based models have tokens o..........



원문링크 : Paper Review: Swin Transformer: Hierarchical Vision Transformer using Shifted Windows