Penyusunan Korpus Bahasa Daerah
Slides presentation for an online invited talk (in Indonesian) at the workshop on creating regional language corpora (24 October 2024, 16.00 Jakarta). The workshop was organised by the Language and Literature Preservation working group, of the National Language Agency of Indonesia. The talk began with presenting the concept of corpus and the FAIR data principle for wider access to the created corpora, especially for the native speakers of the language, in addition to researchers/academics. Next, one of the case studies presented is the creation of the Contemporary Enggano language corpora and Enggano legacy materials of texts and word lists, including the expected output of Enggano digital and print dictionaries, among others. In addition, cross-linguistic parallel corpora creation from picture stimuli (the SCOPIC project), especially the Balinese corpora, was also discussed. Then, regarding the model for infrastructure, the language data across Australia project was presented. Finally, there is a mention of community-based development of digital infrastructure centring around the Balinese language dictionary and content.
Rajeg, Gede Primahadi Wijaya (2024). Penyusunan Korpus Bahasa Daerah. University of Oxford. Presentation. https://doi.org/10.25446/oxford.27867759
Funding
Lexical resources for Enggano, a threatened language of Indonesia
Arts and Humanities Research Council
Find out more...