View chaoyuaw's full-sized avatar

Highlights

  • Pro

Organizations

@TeamCohen

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,884 898 Updated Aug 21, 2024

Code for PointInfinity: Resolution-Invariant Point Diffusion Models

Python 18 1 Updated Jun 19, 2024

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 10,500 919 Updated Aug 9, 2024

Multiview Compressive Coding for 3D Reconstruction

Python 626 47 Updated Jan 20, 2023

Code release for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022

Python 144 11 Updated Nov 30, 2022

PixelNeRF Official Repository

Python 1,389 196 Updated Jun 30, 2024

This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…

Python 637 54 Updated Sep 21, 2024

BARF: Bundle-Adjusting Neural Radiance Fields 🤮 (ICCV 2021 oral)

Python 783 112 Updated Apr 28, 2023

VOLO: Vision Outlooker for Visual Recognition

Jupyter Notebook 923 94 Updated Sep 18, 2022

This repository contains the code for the CVPR 2021 paper "GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields"

Python 1,232 160 Updated Feb 9, 2022
Python 83 10 Updated Mar 4, 2024

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 13,620 2,033 Updated Jul 24, 2024

A deep learning library for video understanding research.

Python 3,285 406 Updated Aug 13, 2024

An end-to-end PyTorch framework for image and video classification

Python 1,589 278 Updated Jun 27, 2024

Official implementation of TMANet.

Python 121 23 Updated Sep 20, 2022

Transformer training code for sequential tasks

Python 609 59 Updated Sep 14, 2021

Official DeiT repository

Python 4,009 550 Updated Mar 15, 2024

A Benchmark for Learned Indexes

C++ 267 58 Updated Apr 27, 2022

Learning Continuous Image Representation with Local Implicit Image Function, in CVPR 2021 (Oral)

Python 1,257 145 Updated Aug 21, 2021

Transformers for Longer Sequences

Python 564 101 Updated Sep 1, 2022

MONeT framework for reducing memory consumption of DNN training

Python 172 19 Updated May 4, 2021

Multi-view Wire Art

C++ 85 10 Updated Jun 27, 2020

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

Python 840 158 Updated Oct 16, 2021

PyTorch implementation of X3D models with Multigrid training.

Python 92 13 Updated Oct 10, 2021

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,177 1,221 Updated Aug 14, 2024

AViD Dataset: Anonymized Videos from Diverse Countries

55 4 Updated Mar 30, 2023

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Python 556 61 Updated Jan 1, 2024
Python 212 36 Updated Jun 12, 2023