RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning

This repo is Pytorch implemention of
RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning [PPT][appendix]

Yun Zhu*, Jianhao Guo*, Fei Wu, Siliang Tang†

In IJCAI 2022

Overview

This is the first work dedicated to solving non-aligned node-node graph contrastive learning problems. To tackle the non-aligned problem, we introduce a novel graph-based optimal transport algorithm, g-EMD, which does not require explicit node-node correspondence and can fully utilize graph topological and attributive information for non-aligned node-node contrasting. Moreover, to compensate for the possible information loss caused by non-aligned sub-sampling, we propose a nontrivial unsupervised graph adversarial training to improve the diversity of sub-sampling and strengthen the robustness of the model. The overview of our method is depicted as:

Files

   .
    ├── dataset_apis                  # Code process datasets.
    │   ├── topology_dist              # Storing the distance of the shortest path (SPD) between vi and vj.
    │   ├── citeseer.py                # processing for citeseer dataset.
    │   ├── cora.py                    # processing for cora dataset. 
    │   ├── dblp.py                    # processing for dblp dataset.
    │   ├── pubmed.py                  # processing for pubmed dataset. 
    │   ├── cornell.py                 # processing for cornell dataset. 
    │   ├── wisconsin.py               # processing for wisconsin dataset. 
    │   ├── texas.py                   # processing for texas dataset.     
    │   └── ...                        # More datasets will be added.
    │
    ├── adversarial.py                # Code for unsupervised adversarial training.
    ├── augmentation.py               # Code for augmentation.
    ├── config.yaml                   # Configurations for our method.
    ├── eval_utils.py                 # The toolkits for evaluation.
    ├── eval.py                       # Code for evaluation.
    ├── global_var.py                 # Code for storing global variable.
    ├── model.py                      # Code for building up model.
    ├── train.py                      # Training process.
    ├── test_runs.py                  # Reproduce the results reported in our paper
    └── ...

Setup

Recommand you to set up a Python virtual environment with the required dependencies as follows:

conda create -n rosa python==3.9
conda activate rosa 
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install torch-scatter torch-sparse torch-cluster torch-spline-conv torch-geometric -f https://data.pyg.org/whl/torch-1.8.0+cu111.html

Usage

Command for training model on Cora dataset

CUDA_VISIBLE_DEVICES=0 python train.py --dataset=Cora --config=config.yaml --ad=True --rectified=True

For efficient usage, you can run as below:

CUDA_VISIBLE_DEVICES=0 python train.py --dataset=Cora --config=config.yaml --ad=False --rectified=False

Now supported datasets include Cora, Citeseer, Pubmed, DBLP, Cornell, Wisconsin, Texas. More datasets are coming soon!

Command for testing model on Cora dataset
After training, the best checkpoint will be stored in checkpoints\<Dataset>\ dir. Then you can test the checkpoint through this command:

CUDA_VISIBLE_DEVICES=0 python eval.py --dataset=Cora --config=config.yaml

Command for reproducing the results in the paper
Because our results are based on 20 independent experiments. We provide the all(20) pre-trained ckpts of RoSA for Cora and Citeseer. As for other larger datasets, you can contact us by email if you need.
The usage is quite simple, you just run the command below. The ckpts will be downloaded automatically and the pre-trained models will be tested.

CUDA_VISIBLE_DEVICES=0 python test_runs.py --dataset=Cora

After 20 runs, test acc: 84.56 ± 0.67

CUDA_VISIBLE_DEVICES=0 python test_runs.py --dataset=Citeseer

After 20 runs, test acc: 73.5 ± 0.45

Illustration of arguements

--dataset: default Cora, [Cora, Citeseer, Pubmed, DBLP, Cornell, Wisconsin, Texas] can also be choosen
--rectified: defalut False, use rectified cost matrix instead of vanilla cost matrix (if True)
--ad: default False, use unsupervised adversarial training (if True)
--aligned: default False,  use aligned views (if True)

More experiments

We conduct experimetns on other five commonly used datasets with RoSA, the results show in Table 1. RoSA reaches SOTA on these datasets which proves the effectiveness of our method.

Method	Wiki-CS	Amazon-Computers	Amazon-Photo	Coauthor-CS	Coauthor-Physics
DGI	75.35 ± 0.14	83.95 ± 0.47	91.61 ± 0.22	92.15 ± 0.63	94.51 ± 0.52
GMI	74.85 ± 0.08	82.21 ± 0.31	90.68 ± 0.17	OOM	OOM
MVGRL	77.52 ± 0.08	87.52 ± 0.11	91.74 ± 0.07	92.11 ± 0.12	95.33 ± 0.03
GRACE	78.19 ± 0.01	87.46 ± 0.22	92.15 ± 0.24	92.93 ± 0.01	95.26 ± 0.02
GCA	78.35 ± 0.05	88.94 ± 0.15	92.53 ± 0.16	93.10 ± 0.01	95.73 ± 0.03
BGRL	79.36 ± 0.53	89.68 ± 0.31	92.87 ± 0.27	93.21 ± 0.18	95.56 ± 0.12
RoSA	80.11 ± 0.10	90.12 ± 0.26	93.67 ± 0.07	93.23 ± 0.13	95.76 ± 0.09

	Hidden size	Batch size	Learning rate	Walk length	Epochs	tau	p_{e,1}	p_{e,1}	p_{f,1}	p_{f,1}
Wiki-CS	256	256	1e-3	10	500	0.5	0.2	0.3	0.2	0.3
Amazon-Computers	128	256	1e-3	10	500	0.2	0.4	0.5	0.1	0.2
Amazon-Photo	256	256	1e-3	10	500	0.3	0.2	0.3	0.2	0.3
Coauthor-CS	256	128	1e-3	10	100	0.1	0.2	0.3	0.2	0.3
Coauthor-Physics	128	256	1e-3	10	100	0.5	0.2	0.3	0.2	0.3

In additions, we use prelu as activation function and use adamw optimizer with 5e-4 weight decay for all experimetns. The restart ratio of random walking is 0.5.

Citation

If you use this code for you research, please cite our paper.

@inproceedings{ijcai2022-527,
  title     = {RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning},
  author    = {Zhu, Yun and Guo, Jianhao and Wu, Fei and Tang, Siliang},
  booktitle = {Proceedings of the Thirty-First International Joint Conference on
               Artificial Intelligence, {IJCAI-22}},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},
  editor    = {Lud De Raedt},
  pages     = {3795--3801},
  year      = {2022},
  month     = {7},
  note      = {Main Track},
  doi       = {10.24963/ijcai.2022/527},
  url       = {https://doi.org/10.24963/ijcai.2022/527},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning

Overview

Files

Setup

Usage

Illustration of arguements

More experiments

Citation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
assets		assets
dataset_apis		dataset_apis
.gitignore		.gitignore
README.md		README.md
adversarial.py		adversarial.py
augmentation.py		augmentation.py
config.yaml		config.yaml
eval.py		eval.py
eval_utils.py		eval_utils.py
global_var.py		global_var.py
model.py		model.py
test_runs.py		test_runs.py
train.py		train.py

ZhuYun97/RoSA

Folders and files

Latest commit

History

Repository files navigation

RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning

Overview

Files

Setup

Usage

Illustration of arguements

More experiments

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages