L2_profiles

RJ-funded project (documentation, issues, etc.): https://spraakbanken.gu.se/eng/l2-profiling

Status document (with links): https://docs.google.com/document/d/16JzEDFyDhbV9Updw_QYGHZJXdMbZ1LR3NTKJEsJKuqk/edit#

Several directions are pursued in this project:

Research on vocabulary (receptive versus productive).

This is split into several tasks:

  1. Manual checks of the automatic annotation of two corpora: coursebooks (COCTAILL) and essays (SweLL-pilot)
  2. Manual checks of SenSVALex, a sense-based word list generated from the COCTAILL corpus
    • SVALex (article): https://spraakbanken.gu.se/sites/spraakbanken.gu.se/files/SVALex_LREC_cameraReady.pdf
  3. (planned): Manual checks of SenSweLLex, a sense-based word list generated from the SweLL-pilot corpus
    • SweLLex (article): http://www.ep.liu.se/ecp/130/010/ecp16130010.pdf
  4. Manual lexicographic annotation of SenSVALex (and possibly SenSweLLex) using the LEGATO tool:
    • LEGATO: https://spraakbanken.gu.se/larkalabb/legato
    • Guidelines: https://docs.google.com/document/d/1nZOKf-54FEkjIQFnPUmZZRWqib6y7gpCuKQO-XadeqM/edit#heading=h.5rcsyvi01oc5

Research on grammar (receptive versus productive)

This is split into several tasks:

  1. Passives

  2. Prepositions

  3. Definiteness

  4. Embedded clauses?

Report issues

  1. For LEGATO, use the issue tracker of its GitHub repository: https://github.com/elenavolodina/Legato

  2. For corpus annotation checks and SVALex checks, use the issue tracker of this (current) GitHub repository