| English | Arabic | Home | Login |

Thesis

2014

AUTOMATIC KURDISH KURMANJI TEXT CATEGORIZATION

2014-06-18
Text categorization (TC) is a supervised learning technique that uses labeled training Datasets to obtain a category system (classifier) and then immediately categorizes Unlabeled text facts (information) using the using the created classifier. It is an important Information Retrieval (IR) and Data mining (DM) process especially with the fast growth of the number of online text documents present in Kurdish Kurmanji language. Many category methods have been used for the text classification problem and most of the work in this area was conducted for English text. In this thesis, we investigate the application of Support Vector Machine (SVM) and k-Nearest Neighbor method (kNN) in automatic Kurdish Kurmanji TC. Evaluation used an in-house Kurdish Kurmanji TC corpus that includes 705 records independently classified into five categories. The bases of our assessment are the well-known precision, recall and F Measures.
2004

Bitmap Image optimization

2004-05-19
Bachelor project for obtaining a Bachelor's degree

Back