An Automatic Topic Identification Algorithm
Abstract
Problem statement: Topic is a stream of words which stands for the content of a text. Knowing the topic of a document can help people to be aware from its content and facilitate their searching process. Approach: This paper proposes an automatic algorithm to identify the topic for a textual document based on the chunks corresponding to each sentences in the document. Results and conclusion: We achieved 86% matching for both total and partial matching in our experimental data sample.
DOI: https://doi.org/10.3844/jcssp.2011.1363.1367
Copyright: © 2011 Hossein Shahsavand Baghdadi and Bali Ranaivo-Malançon. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 3,351 Views
- 3,416 Downloads
- 10 Citations
Download
Keywords
- Web document
- text-document topic
- partial matching
- experimental data
- Identification algorithm
- chen’s algorithm
- syntactic parser
- Adaptive Resonance Theory (ART)
- Maximum Ratio Balance (MRB)