Show simple item record

dc.contributor.authorDe Pauw, Guy
dc.contributor.authorWagacha, Peter W
dc.contributor.authorAbade, Dorothy Atieno
dc.date.accessioned2013-07-02T14:54:58Z
dc.date.available2013-07-02T14:54:58Z
dc.date.issued02-07-13
dc.identifier.urihttp://hdl.handle.net/11295/44250
dc.description.abstractThis paper describes a proof-of-the-principle experiment in which maximum entropy learning is used for the automatic induction of word classes for the Western Nilotic language of Dholuo. The proposed approach extracts shallow morphological and contextual features for each word of a 300k text corpus of Dholuo. These features provide a layer of linguistic abstraction that enables the extraction of general word classes. We provide a preliminary evaluation of the proposed method in terms of language model perplexity and through a simple case study of the paradigm of the verb stem "somo".en
dc.language.isoenen
dc.titleUnsupervised induction of Dholuo word classes using maximum entropy learningen
dc.typeWorking Paperen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record