A corpus-based survey of four electronic swahili–english bilingual dictionaries
View/ Open
Date
2009Author
De Pauw, G
de Schryver, G
Wagacha, P
Type
ArticleMetadata
Show full item recordAbstract
In this article we survey four different electronic bilingual dictionaries for the lan-guage pair Swahili–English. Aided by a data-driven morphological analyzer and part-of-speech tagger, we quantify the coverage of the dictionaries on large monolingual corpora of Swahili. In a second series of experiments, we investigate how applicable the dictionaries are as a tool in the development of a machine translation system, by evaluating bilingual coverage on the parallel SAWA corpus. At the same time we attempt to consolidate the dictionaries into a unified lexico-graphic database and compare the coverage to that of its composite parts.
URI
http://www.ajol.info/index.php/lex/article/view/49134/35479http://erepository.uonbi.ac.ke:8080/xmlui/handle/123456789/37385
Citation
Lexikos 19 (AFRILEX-reeks/series 19: 2009): 340-352Publisher
School of Computing and Informatics, University of Nairobi
Subject
LEXICOGRAPHYEVALUATION
MORPHOLOGY
LEMMATIZATION
PARALLEL CORPORA
MACHINE LEARNING
MACHINE TRANSLATION
SWAHILI (KISWAHILI)
ENGLISH