Open positions
Open research positions in SNAP group are available at undergraduate, graduate and postdoctoral levels.

Twitch Gamers Social Network

Dataset information

A social network of Twitch users which was collected from the public API in Spring 2018. Nodes are Twitch users and edges are mutual follower relationships between them. The graph forms a single strongly connected component without missing attributes. The machine learning tasks related to the graph are count data regression and node classification. There are 6 specific tasks:

- Explicit content streamer identification.
- Broadcaster language prediction.
- User lifetime estimation.
- Churn prediction.
- Affiliate status identification.
- View count estimation.

Twitch Gamers paper: arxiv.org
Twitch Gamers project: Github


Dataset statistics
Directed No.
Node features No.
Edge features No.
Node labels Yes.
Temporal No.
Nodes 168,114
Edges 6,797,557
Density 0.0005
Transitvity 0.0184

Possible tasks
Node level regression
Binary node classification
Multionomial node classification
Link prediction
Community detection
Network visualization

Source (citation)

  • B. Rozemberczki and R. Sarkar. Twitch Gamers: a Dataset for Evaluating Proximity Preserving and Structural Role-based Node Embeddings. 2021.
  •         @misc{rozemberczki2021twitch,
                  title={Twitch Gamers: a Dataset for Evaluating Proximity Preserving and Structural Role-based Node Embeddings}, 
                  author={Benedek Rozemberczki and Rik Sarkar},
                  year={2021},
                  eprint={2101.03091},
                  archivePrefix={arXiv},
                  primaryClass={cs.SI}
             }
    
            

    Files

    File Description
    twitch_gamers.zip Twitch Gamers Social Network