The MOOC user action dataset represents the actions taken by users on a popular MOOC platform. The actions are represented as a directed, temporal network. The nodes represent users and course activities (targets), and edges represent the actions by users on the targets. The actions have attributes and timestamps. To protect user privacy, we anonimize the users and timestamps are standardized to start from timestamp 0. The dataset is directed, temporal, and attributed.
Additionally, each action has a binary label, representing whether the user dropped-out of the course after this action, i.e., whether this is last action of the user.
This dataset serves as a recommender system dataset and a dynamic network dataset.
Project website: The dataset have been generated as part of the research project on advanced user modeling and recommender systems. The details of the project can be found here.
Dataset statistics | |
---|---|
Number of users | 7,047 |
Number of targets | 97 |
Number of actions | 411,749 |
Number of positive action labels | 4,066 |
Timestamp | seconds |
@inproceedings{kumar2019predicting, title={Predicting dynamic embedding trajectory in temporal interaction networks}, author={Kumar, Srijan and Zhang, Xikun and Leskovec, Jure}, booktitle={Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery \& Data Mining}, pages={1269--1278}, year={2019}, organization={ACM} }
File | Description |
---|---|
mooc_actions.tsv | Time-ordered sequence of user actions. |
mooc_action_features.tsv | Features associated with each action. |
mooc_action_labels.tsv | Binary label associated with each action, indicating whether the student drops-out after the action. |
where
where
where