Text categorization (TC) is a supervised learning technique that uses labeled training
Datasets to obtain a category system (classifier) and then immediately categorizes
Unlabeled text facts (information) using the using the created classifier. It is an
important Information Retrieval (IR) and Data mining (DM) process especially with the
fast growth of the number of online text documents present in Kurdish Kurmanji
language. Many category methods have been used for the text classification problem and
most of the work in this area was conducted for English text. In this thesis, we
investigate the application of Support Vector Machine (SVM) and k-Nearest Neighbor
method (kNN) in automatic Kurdish Kurmanji TC. Evaluation used an in-house Kurdish
Kurmanji TC corpus that includes 705 records independently classified into five
categories. The bases of our assessment are the well-known precision, recall and F Measures.
2004
Bitmap Image optimization
2004-05-19
Bachelor project for obtaining a Bachelor's degree