分享给好友:
Large-scale Arabic Text Classification: an Approach Towards Distributed Data Mining Mohammed M. Abu Tair
Large-scale Arabic Text Classification: an Approach Towards Distributed Data Mining
Mohammed M. Abu Tair
Text classification has become one of the most important techniques in text mining. A number of machine learning algorithms have been introduced to deal with automatic text classification. One of the common classification algorithms is the k-NN algorithm which is known to be one of the best classifiers applied for different languages including Arabic language. However, the k-NN algorithm is of low efficiency because it requires a large amount of computational power. Such a drawback makes it unsuitable to handle a large volume of text documents with high dimensionality and in particular in the Arabic language. This book, therefore, introduces a high performance parallel classifier for large-scale Arabic text that achieves the enhanced level of efficiency, scalability, and accuracy. The parallel classifier based on the sequential k-NN algorithm. We tested the classifier using the OSAC corpus. We studied the performance of the parallel classifier on a multicomputer cluster. The results indicate that the parallel classifier has very good speedups and scalability and is capable of handling large document collections with higher classification results.
| 介质类型 | 图书 Paperback Book (平装胶订图书) |
| 已发行 | 2013年2月25日 |
| ISBN13 | 9783659347665 |
| 出版商 | LAP LAMBERT Academic Publishing |
| 页数 | 128 |
| 商品尺寸 | 150 × 8 × 225 mm · 209 g |
| 语言 | 德语 |
查看Mohammed M. Abu Tair的全部作品 ( 例如 Paperback Book )