Publication:

The Construction of a Corpus of Spoken Sylheti (2000)

Author(s): Baker PM; Lie MLS; McEnery T; Sebba M

    Abstract: This paper describes the construction of a corpus of spoken Sylheti. The corpus was created to examine difficulties in building spoken language corpora in which features such as code switching are common. Code switching is described here simply as the process of switching from one language to another during the course of an interaction; this description, however, disguises a host of situations, which are examined in the paper. The paper also presents a transliteration scheme for Sylheti based on the Roman alphabet.

      • Journal: Literary and Linguistic Computing
      • Volume: 15
      • Issue: 4
      • Pages: 421-432
      • Publisher: Oxford University Press
      • Publication type: Article
      • Bibliographic status: Published