ViMELF

A Corpus of Video-Mediated English as a Lingua Franca Conversations

Citation:

ViMELF. 2018. Corpus of Video-Mediated English as a Lingua Franca Conversations. Birkenfeld: Trier University of Applied Sciences. Version 1.0. The CASE project [http://umwelt-campus.de/case].

Compilation: The CASE project, Trier University of Applied Sciences, coordination: Stefan Diemer, Marie-Louise Brunner, Caroline Collet, Selina Schmidt.

Version 1.0, released 15 May 2018

Description:

  • 20 Conversations
  • Conversation length: 744.5 min total, ca. 12.5 hours of conversations
  • Average conversation length: 37.23 min.
  • Words/Tokens: 113670 (plain text), 154472 (annotated version)
  • Participants: 40 (20 SB, 5 FL, 5 HE, 5 ST, 5 SF)
  • Medium: Video both sides: 11, video one side: 3, audio: 6
  • Including sociolinguistic background data

Versions: Four versions of the corpus are available:

  • CASE transcription(as docx, rtf and txt): the basic version produced by manual transcription. CASE transcription conventions (### link) include spoken language features beyond the words, such as prosodic, paralinguistic and non-verbal features.
  • XML version (xml): a version of the annotated CASE transcription encapsulating the original information in a machine-readable form (###) Gee
  • Lexical version (lex): For the lexical version all annotation is removed - this version is produced with XTranscript (Gee 2018)
  • Part-of-speech tagged version (pos): a POS-tagged version of the lexical version, produced with the CLAWS POS tagger.

Access ViMELF

The corpus is freely available for noncommercial research. If you want to obtain access to the transcripts, please send a mail to

case(at)umwelt-campus.de

Please provide the following details:

  • Name
  • Institution
  • Status: (Student / Doctoral Researcher / Faculty / Teacher / Independent Scholar)
  • Research Focus
  • Institutional Website (if any)
  • Mail Address

We will send you a link to the corpus data download site.

Access to ViMELF video data

The ViMELF video files are also available for noncommercial research purposes. In order to obtain a viewing link, please send a mail to

case(at)umwelt-campus.de

and provide the following details:

  • Name
  • Institution
  • Status: (Student / Doctoral Researcher / Faculty / Teacher / Independent Scholar)
  • Research Focus
  • Institutional Website (if any)
  • Mail Address

Project Coordination & Contact

Stefan Diemer & Marie-Louise Brunner

Language & Communication, Trier University of Applied Sciences, Germany

sk(at)umwelt-campus.de

We'd love to hear from you!

Further info, questions etc.: case(at)umwelt-campus.de