RDF data hosted at the Leipzig Corpora Collection

This server currently hosts a dataset for the Xhosa language based on the Bantu Language Model (BLM). It contains lexical and morphological data and English translations, and is linked to WordNet RDF. The data is based on extensive manual work by J.A. Louw. It was digitized and enhanced by, and under the supervision of, Prof. Sonja Bosch (University of South Africa), and converted from CSV files to this RDF dataset by Thomas Eckart and Bettina Klimek (Leipzig University, Germany).
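
To give a rough idea of what a CSV-to-RDF conversion of lexical data can look like, here is a minimal sketch in Python that turns CSV rows into N-Triples lines. It is not the pipeline used for this dataset. The namespace `http://example.org/xhosa/`, the property `englishTranslation`, the column names `id`, `lemma` and `english`, and the sample row are all assumptions made for illustration; the actual dataset follows the BLM vocabulary, which is not reproduced here.

```python
import csv
import io

# Hypothetical namespace and property, chosen for illustration only.
BASE = "http://example.org/xhosa/"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
TRANSLATION = BASE + "vocab#englishTranslation"  # hypothetical property


def escape_literal(text):
    """Escape backslashes, double quotes and newlines for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def rows_to_ntriples(csv_text):
    """Convert CSV text with columns id, lemma, english (assumed) to N-Triples lines."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Assumes the id column contains only IRI-safe characters.
        subject = f"<{BASE}entry/{row['id']}>"
        lemma = escape_literal(row["lemma"])
        english = escape_literal(row["english"])
        lines.append(f'{subject} <{RDFS_LABEL}> "{lemma}"@xh .')
        lines.append(f'{subject} <{TRANSLATION}> "{english}"@en .')
    return lines


# Illustrative sample row, not taken from the dataset.
SAMPLE = "id,lemma,english\n1,umntu,person\n"

for line in rows_to_ntriples(SAMPLE):
    print(line)
```

Each CSV row yields one triple for the Xhosa lemma (language tag `xh`) and one for its English translation (language tag `en`). A real conversion would also cover the morphological data and the WordNet links described above.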

For more information about the dataset, please refer to: Sonja Bosch, Thomas Eckart, Bettina Klimek, Dirk Goldhahn and Uwe Quasthoff (2018): "Preparation and Usage of Xhosa Lexicographical Data for a Multilingual, Federated Environment", presented at the 11th Edition of the Language Resources and Evaluation Conference (LREC 2018), Miyazaki, Japan (PDF).

Example resources:
Licensed under Creative Commons Attribution 4.0 International (CC BY-SA 4.0)