A social network of Twitch users, collected from the public API in spring 2018. Nodes are Twitch users and edges are mutual follower relationships between them. The graph is undirected, forms a single connected component, and has no missing attributes. The machine learning tasks related to the graph are count data regression and node classification. There are 6 specific tasks:
- Explicit content streamer identification.
- Broadcaster language prediction.
- User lifetime estimation.
- Churn prediction.
- Affiliate status identification.
- View count estimation.

Twitch Gamers paper: arXiv:2101.03091

Twitch Gamers project: GitHub

Dataset statistics | |
---|---|
Directed | No. |
Node features | No. |
Edge features | No. |
Node labels | Yes. |
Temporal | No. |
Nodes | 168,114 |
Edges | 6,797,557 |
Density | 0.0005 |
Transitivity | 0.0184 |
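
As a quick sanity check on the statistics above, the density of a simple undirected graph with N nodes and E edges is 2E / (N(N - 1)). The minimal sketch below uses only the node and edge counts listed in the table and reproduces the reported density after rounding to four decimals. The function name `undirected_density` is illustrative and not part of any dataset tooling.

```python
def undirected_density(num_nodes, num_edges):
    """Fraction of possible node pairs joined by an edge in a simple undirected graph."""
    possible_pairs = num_nodes * (num_nodes - 1) // 2
    return num_edges / possible_pairs


# Counts taken from the dataset statistics table.
nodes = 168_114
edges = 6_797_557

density = undirected_density(nodes, edges)
mean_degree = 2 * edges / nodes  # each edge contributes to two node degrees

print(f"density = {density:.4f}")
print(f"mean degree = {mean_degree:.1f}")
```

The mean degree is derived from the same two counts and is shown only to give a feel for how sparse the graph is.
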
Possible tasks | |
---|---|
Node level regression | |
Binary node classification | |
Multinomial node classification | |
Link prediction | |
Community detection | |
Network visualization |

@misc{rozemberczki2021twitch,
  title={Twitch Gamers: a Dataset for Evaluating Proximity Preserving and Structural Role-based Node Embeddings},
  author={Benedek Rozemberczki and Rik Sarkar},
  year={2021},
  eprint={2101.03091},
  archivePrefix={arXiv},
  primaryClass={cs.SI}
}

File | Description |
---|---|
twitch_gamers.zip | Twitch Gamers Social Network |
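
The following is a minimal sketch of reading an undirected edge list out of a zip archive such as `twitch_gamers.zip`. The internal layout of the archive is not described on this page, so the member name `edges.csv`, the header row, and the two-integer-column format are all assumptions. The example builds a tiny in-memory archive instead of touching the real download, and `read_edges` is a hypothetical helper name.

```python
import csv
import io
import zipfile


def read_edges(archive, member):
    """Return the undirected edges stored in one CSV member of a zip archive.

    `archive` is a path or file-like object; `member` is the name of a CSV
    file inside it (hypothetical here; check the real archive listing).
    The CSV is assumed to have a header row followed by two integer columns.
    """
    edges = set()
    with zipfile.ZipFile(archive) as zf:
        with zf.open(member) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            reader = csv.reader(text)
            next(reader)  # skip the assumed header row
            for row in reader:
                if len(row) < 2:
                    continue  # ignore blank or malformed lines
                u, v = int(row[0]), int(row[1])
                if u != v:
                    # store each mutual-follow edge once, smaller id first
                    edges.add((min(u, v), max(u, v)))
    return edges


# Tiny in-memory archive standing in for the real download.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as zf:
    zf.writestr("edges.csv", "id_1,id_2\n0,1\n1,0\n1,2\n")
buffer.seek(0)

toy_edges = read_edges(buffer, "edges.csv")
print(sorted(toy_edges))
```

Storing each pair with the smaller id first means a relationship listed in both directions is counted once, which matches the undirected, mutual-follower definition of an edge given above. To use it on the real file, pass the path to the downloaded archive and whichever member name `zipfile.ZipFile(...).namelist()` reports for the edge list.
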