University of Oxford
Browse

CLDF dataset derived from von Rosenberg's "De Mentawei-Eilanden en Hunne Bewoners" from 1853 for comparative numeral data

Download (32.91 kB)
dataset
posted on 2025-02-06, 10:07 authored by Gede Primahadi Wijaya RajegGede Primahadi Wijaya Rajeg

Cross-Linguistic Data Format (CLDF) dataset derived from von Rosenberg's "De Mentawei-Eilanden en Hunne Bewoners" from 1853 for the comparative numeral data (p. 434). It is a work in progress and another practice session with CLDF to handle/test multiple languages.

NB: In this first version (v1.0.0), the word forms are still in the original orthography and not yet segmented/tokenised. The next release attempts to include orthography standardisation and segmentation.

Funding

Lexical resources for Enggano, a threatened language of Indonesia

Arts and Humanities Research Council

Find out more...

History