A social network of Twitch users which was collected from the public API in Spring 2018. Nodes are Twitch users and edges are mutual follower relationships between them. The graph forms a single strongly connected component without missing attributes. The machine learning tasks related to the graph are count data regression and node classification. There are 6 specific tasks:
- Explicit content streamer identification. - Broadcaster language prediction. - User lifetime estimation. - Churn prediction. - Affiliate status identification. - View count estimation.
Twitch Gamers paper: arxiv.org Twitch Gamers project: Github
| Dataset statistics | |
|---|---|
| Directed | No. |
| Node features | No. |
| Edge features | No. |
| Node labels | Yes. |
| Temporal | No. |
| Nodes | 168,114 |
| Edges | 6,797,557 |
| Density | 0.0005 |
| Transitvity | 0.0184 |
| Possible tasks | |
|---|---|
| Node level regression | |
| Binary node classification | |
| Multionomial node classification | |
| Link prediction | |
| Community detection | |
| Network visualization | |
@misc{rozemberczki2021twitch,
title={Twitch Gamers: a Dataset for Evaluating Proximity Preserving and Structural Role-based Node Embeddings},
author={Benedek Rozemberczki and Rik Sarkar},
year={2021},
eprint={2101.03091},
archivePrefix={arXiv},
primaryClass={cs.SI}
}
| File | Description |
|---|---|
| twitch_gamers.zip | Twitch Gamers Social Network |