[Dataset] Add dataset ogbn-papers100M (#7096)
Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-37.ap-northeast-1.compute.internal>
caojy1998 and Ubuntu authored Feb 20, 2024
1 parent 2df8586 commit 3ced341
Showing 1 changed file with 11 additions and 1 deletion.
12 changes: 11 additions & 1 deletion python/dgl/graphbolt/impl/ondisk_dataset.py
@@ -877,6 +877,16 @@ class BuiltinDataset(OnDiskDataset):
Reverse edges are added to the original graph and duplicated
edges are removed.
+ **ogbn-papers100M**
+ The ogbn-papers100M dataset is a directed graph, representing the citation
+ network between all Computer Science (CS) arXiv papers indexed by MAG.
+ See more details in `ogbn-papers100M
+ <https://ogb.stanford.edu/docs/nodeprop/#ogbn-papers100M>`_.
+ .. note::
+ Reverse edges are added to the original graph and duplicated
+ edges are removed.
**ogbn-products**
The ogbn-products dataset is an undirected and unweighted graph,
representing an Amazon product co-purchasing network. See more details
@@ -916,7 +926,7 @@ class BuiltinDataset(OnDiskDataset):
"ogbn-products",
"ogbn-arxiv",
]
- _large_datasets = ["ogb-lsc-mag240m"]
+ _large_datasets = ["ogb-lsc-mag240m", "ogbn-papers100M"]
_all_datasets = _datasets + _large_datasets

def __init__(self, name: str, root: str = "datasets") -> OnDiskDataset: