ICCVW-2023-Papers

Workshop on New Ideas in Vision Transformers

| Title | Repo | Paper | Video |
|---|---|---|---|
| Explaining Through Transformer Input Sampling | GitHub | thecvf | |
| Actor-Agnostic Multi-Label Action Recognition with Multi-Modal Query | GitHub | thecvf, arXiv | YouTube |
| All-Pairs Consistency Learning for Weakly Supervised Semantic Segmentation | | thecvf | |
| Dual-Contrastive Dual-Consistency Dual-Transformer: A Semi-Supervised Approach to Medical Image Segmentation | GitHub | thecvf | YouTube |
| A Hybrid Visual Transformer for Efficient Deep Human Activity Recognition | | thecvf | |
| Which Tokens to Use? Investigating Token Reduction in Vision Transformers | WEB Page, GitHub | thecvf, arXiv | |
| Hierarchical Spatiotemporal Transformers for Video Object Segmentation | | thecvf, arXiv | YouTube |
| IDTransformer: Transformer for Intrinsic Image Decomposition | GitHub Page, GitHub | thecvf | |
| MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers | GitHub | thecvf, arXiv | YouTube |
| Template-Guided Illumination Correction for Document Images with Imperfect Geometric Reconstruction | GitHub Page, GitHub | thecvf | YouTube |
| Spatio-Temporal Convolution-Attention Video Network | | thecvf | |
| TSOSVNet: Teacher-Student Collaborative Knowledge Distillation for Online Signature Verification | | thecvf | YouTube |
| SeMask: Semantically Masked Transformers for Semantic Segmentation | GitHub | thecvf, arXiv | YouTube |
| TransInpaint: Transformer-based Image Inpainting with Context Adaptation | | thecvf | YouTube |
| Interactive Image Segmentation with Cross-Modality Vision Transformers | GitHub | thecvf, arXiv | |
| MOSAIC: Multi-Object Segmented Arbitrary Stylization using CLIP | | thecvf, arXiv | |
| On Moving Object Segmentation from Monocular Video with Transformers | | thecvf | YouTube |
| SCSC: Spatial Cross-Scale Convolution Module to Strengthen Both CNNs and Transformers | | thecvf, arXiv | YouTube |