Ensemble of Data Augmentation Techniques for Efficient 3 Augmentation in NLP

  • Unique Paper ID: 165489
  • Volume: 11
  • Issue: 1
  • PageNo: 2706-2734
  • Abstract:
  • In the last decade, NLP has made significant advances through machine learning. In many machine learning scenarios, however, there is not enough data available to train a good classifier. Data augmentation can be used to address this problem: it applies transformations to artificially increase the amount of available training data. Because of the discrete nature of linguistic data, the topic remains relatively underexplored despite a large rise in usage. A major goal of DA techniques is to increase the diversity of training data, allowing the model to generalize better to novel test data. This study uses the term "data augmentation" to refer to a broad concept that encompasses techniques for transforming training data. While most text data augmentation research focuses on the long-term aim of developing end-to-end learning solutions, this study focuses on pragmatic, robust, scalable, and easy-to-implement data augmentation techniques comparable to those used in computer vision. Simple but effective data augmentation procedures have been implemented in natural language processing; inspired by such efforts, we construct and compare ensembles of data augmentation techniques for NLP classification. We propose an ensemble of simple yet effective data augmentation techniques. Through experiments on various datasets from Kaggle, we show that ensembling augmentations can boost performance with any text embedding technique, particularly for small training sets. We conclude by carrying out experiments on classification datasets. Based on the results, we conclude that an effective DA approach built from ensembles of data augmentation can help practitioners choose a suitable augmentation technique in different settings.
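
For context, the "simple yet effective" operations the abstract alludes to are typically of the easy-data-augmentation family (synonym replacement, random swap, random deletion). The following Python sketch is purely illustrative: the toy synonym table, operation parameters, and the naive per-sentence ensemble are assumptions for demonstration, not the paper's exact configuration.

import random

# Toy lookup table; a real system might use WordNet or embedding neighbors.
SYNONYMS = {
    "good": ["great", "fine"],
    "movie": ["film"],
    "bad": ["poor", "awful"],
}

def synonym_replacement(tokens, n=1):
    # Replace up to n tokens that have entries in the synonym table.
    tokens = tokens[:]
    candidates = [i for i, t in enumerate(tokens) if t in SYNONYMS]
    for i in random.sample(candidates, min(n, len(candidates))):
        tokens[i] = random.choice(SYNONYMS[tokens[i]])
    return tokens

def random_swap(tokens, n=1):
    # Swap the positions of two randomly chosen tokens, n times.
    tokens = tokens[:]
    for _ in range(n):
        if len(tokens) < 2:
            break
        i, j = random.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens

def random_deletion(tokens, p=0.1):
    # Drop each token with probability p, keeping at least one token.
    kept = [t for t in tokens if random.random() > p]
    return kept if kept else [random.choice(tokens)]

def ensemble_augment(sentence,
                     ops=(synonym_replacement, random_swap, random_deletion)):
    # Return the original sentence plus one augmented copy per operation;
    # the enlarged set is then fed to the downstream text classifier.
    tokens = sentence.split()
    return [sentence] + [" ".join(op(tokens)) for op in ops]

for variant in ensemble_augment("this movie was good"):
    print(variant)

Applied to every labeled example in a small training set, this expands the data several-fold before any text embedding and classification step, which is where the abstract reports the largest gains.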

Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{165489,
        author = {Nandan Parmar},
        title = {Ensemble of Data Augmentation Techniques for Efficient 3 Augmentation in NLP},
        journal = {International Journal of Innovative Research in Technology},
        year = {2025},
        volume = {11},
        number = {1},
        pages = {2706-2734},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=165489},
        abstract = {In the last decade, NLP has made significant advances through machine learning. In many machine learning scenarios, however, there is not enough data available to train a good classifier. Data augmentation can be used to address this problem: it applies transformations to artificially increase the amount of available training data. Because of the discrete nature of linguistic data, the topic remains relatively underexplored despite a large rise in usage. A major goal of DA techniques is to increase the diversity of training data, allowing the model to generalize better to novel test data. This study uses the term "data augmentation" to refer to a broad concept that encompasses techniques for transforming training data. While most text data augmentation research focuses on the long-term aim of developing end-to-end learning solutions, this study focuses on pragmatic, robust, scalable, and easy-to-implement data augmentation techniques comparable to those used in computer vision. Simple but effective data augmentation procedures have been implemented in natural language processing; inspired by such efforts, we construct and compare ensembles of data augmentation techniques for NLP classification. We propose an ensemble of simple yet effective data augmentation techniques. Through experiments on various datasets from Kaggle, we show that ensembling augmentations can boost performance with any text embedding technique, particularly for small training sets. We conclude by carrying out experiments on classification datasets. Based on the results, we conclude that an effective DA approach built from ensembles of data augmentation can help practitioners choose a suitable augmentation technique in different settings.},
        keywords = {Text Data Augmentation, NLP, Class Imbalance, Text Embeddings},
        month = {June},
        }

Cite This Article

  • ISSN: 2349-6002
  • Volume: 11
  • Issue: 1
  • PageNo: 2706-2734

Ensemble of Data Augmentation Techniques for Efficient 3 Augmentation in NLP
