TaCoCASE - Transatlantic Component of the CASE project 

Sub-Corpus of the CASE project

  • International video-mediated communication
  • Skype conversations between native speakers (NS) and non-native speakers (NNS) of English

TaCoCASE was compiled under similar conditions as ViMELF. In order to analyze a larger corpus, the conversations from ViMELF and TaCoCASE can be used in combination. 

Citation:

TaCoCASE. 2023. Transatlantic Component of the CASE project. Birkenfeld: Trier University of Applied Sciences. Version 1.0. Collet, Caroline. [http://umwelt-campus.de/case/TaCoCASE].

Compilation: Caroline Collet, Coordination: Caroline Collet & Stefan Diemer

Version 1.0, released September 2023

Description:

  • 15 conversations
  • Conversation length: 650 minutes (= ca. 10.5 hours)
  • Average conversation length: 43 minutes (= 9,483 words)
  • Words / Tokens: 140,003
  • Participants: 26 [8 SB (Germany), 10 BI (Great Britain), 8 BO (USA)]
  • Medium: Video both sides (13x video, 2x audio)
  • Including sociolinguistic background data

Versions: Four versions of the corpus are available: 

  • CASE transcription (as docx and txt): the basic version produced by manual transcription. CASE transcription conventions include spoken language features beyond the words, such as prosodic, paralinguistic and non-verbal features.
  • XML version (xml): a version of the annotated CASE transcription encapsulating the original information in a machine-readable form (Gee 2018)
  • Lexical version (lex): For the lexical version all annotation is removed - this version is produced with XTranscript (Gee 2018) (coming soon)
  • Part-of-speech tagged version (pos): a POS-tagged version of the lexical version, produced with the CLAWS POS tagger. (coming soon)

Access TaCoCASE

The corpus is freely available for noncommercial research. It can be accessed and searched through WebCorp LSE hosted by our partner institution Birmingham City University - access it here (you will need to create a free account).

If you want to obtain access to the full transcripts, you can also send a mail to mail@caroline-collet.de

Please provide the following details:

Name

Institution

Status: (Student / Doctoral Researcher / Faculty / Teacher / Independent Scholar)

Research Focus

Institutional Website (if any)

Mail Address

We will send you a link to the corpus data download site.

Access to TaCoCASE video data

The TaCoCASE video files will soon be available for noncommercial research purposes. In order to obtain a viewing link, please send a mail to mail@caroline-collet.de

and provide the following details:

Name

Institution

Status: (Student / Doctoral Researcher / Faculty / Teacher / Independent Scholar)

Research Focus

Institutional Website (if any)

Mail Address

Project Coordination & Contact

Prof. Dr. Stefan Diemer

Institute for International & Digital Communication, Trier University of Applied Sciences, Germany

case@umwelt-campus.de

back-to-top nach oben