A curated list of papers and resources about multimodal learning on graphs. The models we focus here are mainly large multimodal models and diffusion models.
An awesome repo on large language models (LLMs) on graphs an be found here.
This repo will be continuously updated. Don't forget to star it and keep tuned!
Please cite the paper in Citations if you find the resource helpful for your research. Thanks!
-
VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context.
ICML2024
Yunxin Li, Baotian Hu, Haoyuan Shi, Wei Wang, Longyue Wang, Min Zhang [PDF], 2024.5
-
Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmarkt.
preprint
Evan M. Williams, Kathleen M. Carley [PDF], 2024.5
-
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance.
ICML2024
Guibao Shen, Luozhou Wang, Jiantao Lin, Wenhang Ge, Chaozhe Zhang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Guangyong Chen, Yijun Li, Ying-Cong Chen [PDF], 2024.5
Please cite the following paper if you find the resource helpful for your research.
@article{jin@llmgraph,
title={Large Language Models on Graphs: A Comprehensive Survey},
author={Jin, Bowen and Liu, Gang and Han, Chi and Jiang, Meng and Ji, Heng and Han, Jiawei},
journal={arXiv preprint arXiv:2312.02783},
year={2023}
}