Arabic Text Classification: Text Preprocessing, Term Weighting, and Morphological Analysis - Motaz Saad - 图书 - LAP LAMBERT Academic Publishing - 9783844319576 - 2011年4月3日
如封面与标题不符,以标题为准

Arabic Text Classification: Text Preprocessing, Term Weighting, and Morphological Analysis

价格
元 433
不含税

远程仓调货

预计送达时间 年6月15日 - 年6月25日
添加至iMusic心愿单

Text mining draw more and more attention recently, it has been applied on different domains including web mining, and sentiment analysis. Text preprocessing is an important stage in text mining. The main problems in text mining are structuring text data, and the very high dimensionality of text data. Natural language processing and morphological tools can be employed to reduce the dimensionality of text data. In addition, term weighting schemes can be used to enhance text representation as feature vector. Researches in the field of Arabic text mining are still fairly limited. The work of this book presents and compares the impact of text preprocessing on Arabic text classification using popular text classification algorithms. Text preprocessing includes applying different term weighting schemes, and Arabic morphological analysis (stemming and light stemming). Text Classification algorithms are applied on 7 Arabic corpora. Results show that Light stemming with term pruning is best feature reduction technique; Support Vector Machines and Naïve Bayes variations outperform other algorithms; Weighting schemes impact the performance of distance based classifier.

介质类型 图书     Paperback Book   (平装胶订图书)
已发行 2011年4月3日
ISBN13 9783844319576
出版商 LAP LAMBERT Academic Publishing
页数 172
商品尺寸 226 × 10 × 150 mm   ·   274 g
语言 德语