Multiple Dataset Problems while Detecting Early Stage Lung Cancer

  • Unique Paper ID: 172781
  • Volume: 11
  • Issue: 9
  • PageNo: 820-826
  • Abstract:
  • Lung cancer is one of the leading causes of cancer-related deaths worldwide, and early detection significantly improves patient survival rates. Deep learning models, such as Convolutional Neural Networks (CNNs), Linear Discriminant Analysis (LDA), Recurrent Neural Networks (RNNs), Autoencoders, and Transformer-based models, can be utilized to automate lung cancer detection from medical imaging. However, a major challenge in developing a robust deep learning model is the variability in imaging data, which arises due to differences in X-ray machines and scanning techniques. This research highlights the impact of dataset variability on lung cancer detection. We utilize the LIDC-IDRI dataset from The Cancer Imaging Archive (TCIA), which contains lung CT scans from multiple imaging sources. The variability in image quality, contrast, and resolution across different machines introduces inconsistencies that hinder effective model training and generalization. This study focuses on analyzing these challenges and discussing potential solutions, such as dataset standardization and domain adaptation techniques, to enhance the reliability of deep learning-based lung cancer detection.

Cite This Article

  • ISSN: 2349-6002
  • Volume: 11
  • Issue: 9
  • PageNo: 820-826

Multiple Dataset Problems while Detecting Early Stage Lung Cancer

Related Articles