Stars
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
MBTI Personality Test App is a test app with 70 questions to determine your personality type based on MBTI (Myers–Briggs Type Indicator).
PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements. (e.g. MBTI Measurement Agent)
The implementation of "Pedestrian-Aware Panoramic Video Stitching Based on a Structured Camera Array".
Panorama stitching of images or real-time video streams
A full Python implementation of a surround-view system for a real car
Cross-platform ground control station for drones (Android, iOS, Mac OS, Linux, Windows)
ArduPlane, ArduCopter, ArduRover, ArduSub source
Mission Planner Ground Control Station for ArduPilot (C# .NET)
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic…
Commonly used path planning algorithms with animations.
🔥🔥🔥 AI-driven database tool and SQL client. The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, SQLite, H2, ClickHouse, and more.
📻 Terminal/SSH/Telnet/serial port/RDP/VNC/SFTP client (Linux, Mac, Windows)
Multi-purpose serial data visualization & processing program
This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.
PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)
A Stable Diffusion desktop frontend with inpainting, img2img and more!
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
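This project implements Wav2lip video lip synthesis based on SadTalkers. It generates lip shapes driven by speech from a video file, with a configurable enhancement method for the face region that enhances the synthesized lip (face) area and improves the clarity of the generated lips. It uses the DAIN deep-learning frame interpolation algorithm to fill in frames of the generated video, adding motion transitions between frames so that the synthesized lips look smoother, more realistic, and more natural.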
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This repository contains the source code for the paper First Order Motion Model for Image Animation
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
A Library for Advanced Deep Time Series Models.
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)