• Login
    • Login
    Advanced Search
    View Item 
    •   UoN Digital Repository Home
    • Journal Articles
    • Faculty of Science & Technology (FST)
    • View Item
    •   UoN Digital Repository Home
    • Journal Articles
    • Faculty of Science & Technology (FST)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Automatic construction of a Kiswahili corpus from the World Wide Web

    Thumbnail
    View/Open
    Fulltext.pdf (218.5Kb)
    Date
    2005
    Author
    Miriti, Evans Ak
    Type
    Article
    Language
    en
    Metadata
    Show full item record

    Abstract
    A corpus is a large collection of language data either in written form or spoken form or both. It can be used to construct a language model that is used in many language technology applications. Some of these include speech to text, optical character recognition, machine translation and spell checking. The easiest way to create a text corpus is by putting together electronic text documents. For most languages, getting a huge collection of electronic texts is a time-consuming and challenging task. The monotonous nature of such a task will inevitably lead to much less attention being paid to the errors that might find their way into the text collection. This paper describes the working of an application that was used to build a Kiswahili corpus from the Internet to be used in natural language processing applications.
    URI
    http://profiles.uonbi.ac.ke/eamiriti/publications/automatic-construction-kiswahili-corpus-world-wide-web
    http://erepository.uonbi.ac.ke:8080/xmlui/handle/123456789/50524
    Citation
    K, G, E. M. 2005. Automatic construction of a Kiswahili corpus from the World Wide Web. SPECIAL TOPICS IN COMPUTING AND ICT RESEARCH: Measuring Computing Research Excellence and Vitality. , Kampala: Fountain Publishers
    Publisher
    Centre For Biotechnology & Bioinformatics Publications
    Collections
    • Faculty of Science & Technology (FST) [4284]

    Copyright © 2022 
    University of Nairobi Library
    Contact Us | Send Feedback

     

     

    Useful Links
    UON HomeLibrary HomeKLISC

    Browse

    All of UoN Digital RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Copyright © 2022 
    University of Nairobi Library
    Contact Us | Send Feedback