Performance Analysis of Learning Models On Medical Documents

  • Unique Paper ID: 146500
  • Volume: 4
  • Issue: 12
  • PageNo: 822-828
  • Abstract:
  • With the exponential growth of online text, Text Classification domain becomes the major field of Natural language Processing and Machine learning. In this context Medical Document Classification is one of the popular research problem to analyze the high dimensionality features of medical data. Our Study considered various learning models and their performances over the medical documents and we considered OSUMED is one of the popular datasets containing MEDLINE documents as multi-labelled documents. Choosing a high accuracy classifier for text classification is still a challenging task for many of the practitioners. Our work aims to find the efficiency in classifiers and comparing the accuracy in classifying medical documents with well-known classifiers Naïve Bayes, Decision Tree, Support Vector Machine (Linear) and Stochastic Gradient Descent (SGDC). The performance of a feature selection method namely Univariate Feature Selection is analyzed using pattern classifiers namely Naïve Bayes, Decision Tree, Support Vector Machine (Linear) and SGDC and the obtained experimental results shows that the combination of Univariate Feature Selector and Support Vector Machines classifier gives more accurate results in most cases than the others.

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{146500,
        author = {Vanitha Guda and Manish Golla and Akhilesh Datta},
        title = {Performance Analysis of Learning Models On Medical Documents},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {4},
        number = {12},
        pages = {822-828},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=146500},
        abstract = {With the exponential growth of online text, Text Classification domain becomes the major field of Natural language Processing and Machine learning. In this context Medical Document Classification is one of the popular research problem to analyze the high dimensionality features of medical data. Our Study considered various learning models and their performances over the medical documents and we considered OSUMED is one of the popular datasets containing MEDLINE documents as multi-labelled documents. Choosing a high accuracy classifier for text classification is still a challenging task for many of the practitioners. Our work aims to find the efficiency in classifiers and comparing the accuracy in classifying medical documents with well-known classifiers Naïve Bayes, Decision Tree, Support Vector Machine (Linear) and Stochastic Gradient Descent (SGDC). The performance of a feature selection method namely Univariate Feature Selection is analyzed using pattern classifiers namely Naïve Bayes, Decision Tree, Support Vector Machine (Linear) and SGDC and the obtained experimental results shows that the combination of Univariate  Feature Selector and Support Vector Machines classifier gives more accurate results in most cases than the others.},
        keywords = {Classifier’s Accuracy, Document classification, Feature Selection, Learning Models, Medical Documents, Text Classification},
        month = {},
        }

Cite This Article

  • ISSN: 2349-6002
  • Volume: 4
  • Issue: 12
  • PageNo: 822-828

Performance Analysis of Learning Models On Medical Documents

Related Articles