Comparison Of Text Classification Models for Telugu News Articles

  • Unique Paper ID: 146440
  • PageNo: 760-763
  • Abstract:
  • Text classification is become important when the information is increasing rapidly over the internet. This information is in unstructured form and need to be digitized. As these documents are digital form it is necessary for organizing the data by automatically assigning a set of documents into predefined labels based on their content. It mainly depends on the methods that should be used in each phase improves the efficiency of the document classification. In this paper we propose a classification model that supports both the generality and efficiency. It also discusses some of the major issues involved in automatic text classification such as dealing with unstructured text, handling large number of attributes and natural language processing based techniques, dealing with missing metadata and choice of a suitable machine learning technique for training a text classifier. Both are achieved by following the logical sequence of the process of classifying the unstructured text document step by step and efficiency through various methods are proposed. The experimental results over news articles have been validated using statistical measures of accuracy and F-Score. The results have proven that the methods significantly improve the performance.

Copyright & License

Copyright © 2026 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{146440,
        author = {naga Sudha and madhaveelatha },
        title = {Comparison Of  Text Classification Models for  Telugu News Articles},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {4},
        number = {12},
        pages = {760-763},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=146440},
        abstract = {Text classification is become important when the information is increasing rapidly  over the internet. This information is in unstructured form and need to be digitized. As these documents are digital form it is necessary for organizing the data by automatically assigning a set of documents into predefined labels based on their content. It mainly depends on the methods that should be used in each phase improves the efficiency of the document classification. In this paper we propose a classification model that supports both the generality and efficiency. It also discusses some of the major issues involved in automatic text classification such as dealing with unstructured text, handling large number of attributes and natural language processing based techniques, dealing with missing metadata and choice of a suitable machine learning technique for training a text classifier. Both are achieved by following the logical sequence of the process of classifying the unstructured  text document step by step and efficiency through various methods are proposed. The experimental results over news articles have been validated using statistical measures of accuracy and  F-Score. The results have proven that the methods significantly improve the performance.},
        keywords = {Text classification, Logistic regression, Naive Bayes classifier, Support Vector machine, Gradient descent trees.},
        month = {},
        }

Cite This Article

Sudha, N., & madhaveelatha, (). Comparison Of Text Classification Models for Telugu News Articles. International Journal of Innovative Research in Technology (IJIRT), 4(12), 760–763.

Related Articles