FUTURE COMPUTING 2012, The Fourth International Conference on Future Computational Technologies and Applications


Normalized Table Matching Algorithm for Classifying News Articles

Authors:
Taeho Jo

Keywords: Text Categorization; Table based Matching Algorithm

Abstract:
In this research, we propose encoding texts into normalized tables for automatic text categorization. A table based approach was proposed previously, but its categorical scores, which indicate how relevant a text is to a given category, may be overestimated or underestimated depending on the length of the text. To solve this problem, we encode texts into fixed-size tables, define an operation that computes the similarity between two tables as a normalized value, and characterize that operation mathematically. As a result, we are able to compute category scores independently of text length, consider weights from both texts, and expect more stable performance. We empirically validate the proposed approach with respect to performance and stability by comparing it with traditional approaches.
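
The abstract does not give the table format or the similarity formula, so the following Python sketch is only an illustration of the general idea under stated assumptions, not the author's method. It assumes that a table holds a fixed number of the most frequent words with weights normalized to sum to 1, and that similarity is the weight of shared words from both tables divided by the total weight of both tables. The names `encode_table`, `table_similarity`, and `classify`, and the default table size of 5, are hypothetical.

```python
import re
from collections import Counter


def encode_table(text, size=5):
    """Encode a text as a fixed-size table mapping word -> normalized weight.

    Assumption: weights are relative frequencies among the `size` most
    frequent words, so they do not grow with the length of the text.
    """
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(words)
    # Sort by descending count, then alphabetically to break ties deterministically.
    top = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:size]
    total = sum(count for _, count in top)
    return {word: count / total for word, count in top} if total else {}


def table_similarity(a, b):
    """Return a similarity in [0, 1] that uses the weights from both tables.

    Assumption: shared-word weight from both tables over total weight.
    """
    denom = sum(a.values()) + sum(b.values())
    if denom == 0:
        return 0.0
    shared = a.keys() & b.keys()
    return sum(a[w] + b[w] for w in shared) / denom


def classify(text, category_tables, size=5):
    """Score a text against one table per category and pick the best one."""
    table = encode_table(text, size)
    scores = {c: table_similarity(table, t) for c, t in category_tables.items()}
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    # Toy category tables built from made-up example sentences.
    categories = {
        "sports": encode_table("team wins match team scores goal match"),
        "business": encode_table("market stock price market shares stock"),
    }
    label, scores = classify("the team played a match", categories)
    print(label, scores)
```

Because both tables are truncated to the same size and their weights are normalized, repeating a text leaves its table unchanged in this sketch, which is one simple way to make the score independent of text length.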

Pages: 61 to 66

Copyright: Copyright (c) IARIA, 2012

Publication date: July 22, 2012

Published in: conference

ISSN: 2308-3735

ISBN: 978-1-61208-217-2

Location: Nice, France

Dates: from July 22, 2012 to July 27, 2012