- Esper Tech Solutions
- Islamabad
- http://esper.solutions/
Stars
hamzakhalil798 / BIKE
Forked from whwu95/BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
Face recognition SDK for Android with 3D passive face liveness detection (anti-spoofing). Standard Face Recognition SDK. This repo supports the following functionality: face matching, face compare, face …
User activity detection using IMU (Inertial Measurement Unit) sensors and the power of deep learning. The accelerometer data from smart wearables is used for continuous activity detection, which can be…
Anthropometric measurement extraction using a single image
A Naive Bayes spam/ham classifier based on Bayes' Theorem. A set of email subjects is first used to train the classifier, and then a previously unseen email subject is fed in to predict whether it is …
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Implementation of Make-A-Video, a new SOTA text-to-video generator from Meta AI, in PyTorch
Rembg is a tool to remove image backgrounds
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
High-Resolution Image Synthesis with Latent Diffusion Models
A latent text-to-image diffusion model
Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)
Best Practices, code samples, and documentation for Computer Vision.
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Road Damage Detection and Classification with Faster R-CNN (BigData 2018)
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Code for CVPR 2019 paper. BASNet: Boundary-Aware Salient Object Detection