ChenJunyu2000
  • https://www.cqnu.edu.cn/


Starred repositories

Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views.

Python 2 Updated Aug 26, 2024

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Python 135 13 Updated Aug 22, 2024

Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"

Python 24 3 Updated Mar 28, 2024

Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024

Python 73 8 Updated Jun 11, 2024

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation

Python 76 2 Updated Jan 29, 2024
Python 2 1 Updated Jul 15, 2021

Ambiguity-Aware and High-Order Relation Learning for Multi-Grained Image-Text Alignment

4 Updated Aug 19, 2024

This project aims to retrieve images that match an input text description.

Python 24 3 Updated Aug 24, 2023

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,368 208 Updated Apr 15, 2024
Python 14 Updated Sep 3, 2024

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

392 47 Updated Jul 11, 2024

ESA: External Space Attention Aggregation for Image-Text Retrieval

Python 12 Updated Aug 30, 2024

A Framework of Small-scale Large Multimodal Models

Python 597 53 Updated Sep 10, 2024

A family of highly capable yet efficient large multimodal models

Python 158 15 Updated Aug 23, 2024
Python 361 38 Updated May 1, 2024
Python 57 2 Updated Jun 20, 2024

This repository contains code for the paper "GraDual: Graph-based Dual-modal Representation for Image-Text Matching", published at WACV 2022

Python 8 Updated Sep 13, 2022
Python 11 2 Updated May 3, 2024

Code for the paper "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval", accepted at NeurIPS 2022.

Python 18 Updated Jan 16, 2024

USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024

Python 19 Updated Mar 22, 2024

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

Python 544 113 Updated May 18, 2023

Enhanced Citation Counts Manager for Zotero 7

JavaScript 90 3 Updated Jul 10, 2024

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 697 52 Updated Mar 20, 2024

PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

Python 488 125 Updated Dec 8, 2021

📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”

Python 12 1 Updated Jun 27, 2024

An open source implementation of CLIP.

Python 9,934 959 Updated Aug 19, 2024
Python 15 3 Updated Apr 30, 2022
Python 23 3 Updated May 16, 2023

The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)

Python 170 19 Updated Feb 7, 2022
43 1 Updated Aug 14, 2023