• The organizers and their affiliated institutions make no warranties regarding the datasets provided, including but not limited to being correct or complete. They cannot be held liable for providing access to the datasets or the usage of the datasets.
  • Each subset part of the dataset is subject to the license of the source corpus.
  • The access to the dataset is personal. Each user should apply individually.
  • The dataset should only be used for scientific or research purposes. Any other use is explicitly prohibited.
  • Re-identification of the data subjects (learners and/or authors of the texts) is explicitly prohibited.
  • Feeding the data to proprietary machine learning models that retain data for model training is explicitly prohibited.
  • The datasets must not be redistributed or shared in part or full with any third party.
  • If you use any of the subsets provided in the dataset, you agree to cite the associated papers referring to the source corpora.

Contact multigec@svenska.gu.se for more information.