REAL TIME VOICE CLONING

Q: How many days will it take for my paper to be published?

The review time for papers is not fixed. However, if the paper is accepted and the author completes the processing charges formalities, the paper will be published within a few working days.

Q: I would like to receive a hard copy of the journal materials. Are there any additional charges?

You can log in to the author portal and pay 500 INR to receive the hard copy materials.

Q: Do IJIRT provide DOI for published papers?

Yes, we provide a DOI (Digital Object Identifier) upon request. Authors can contact us at editor@ijirt.org after publication to obtain the DOI for their paper.

Q: Do we provide Published Journal Print copy?

No, we do not provide Published Journal Print copy.

Q: Why am I not able to register to IPN?

There might be three potential reasons: 1) You did not verify your mobile number before clicking on the register button. Please click on the Verify sign immediately after entering your mobile number and complete OTP verification. 2) You are an international author. The IPN is enabled only for Indian Authors. 3) You did not tick the Terms & Conditions checkbox. Please make sure to select it before clicking on the register button.

Q: Where can I get a sample publication certificate?

Please login to Author Home, where you will find the sample copy of the publication certificate and confirmation letter.

Q: I made the processing charges payment and the amount is deducted from my bank account, but it is not updated in my Author Home. What should I do?

Please wait for one working day. We will check at our end and update the status. If it is still not updated after one working day, please email your Razorpay Payment ID along with the payment proof snapshot to editor@ijirt.org.

Q: Where can I sign the Copyright and Undertaking Declaration?

To publish your paper in IJIRT, copyright and undertaking is mandatory. You can digitally sign both by logging into Author Home and checking the right-side section.

Q: How many pages are allowed?

There is no page limit. However, to enhance quality and reader experience, we recommend keeping the paper up to 30 pages (single or double column) without extra charges.

Q: How many authors per paper are allowed?

Up to 13 authors are allowed without extra charges.

Sakith Nalluri; A.Rohan Sai; M.Saraswati

REAL TIME VOICE CLONING

Authors: Sakith Nalluri, A.Rohan Sai, M.Saraswati

Unique Paper ID: 151003
Volume: 7
Issue: 11
PageNo: 297-302

Keywords: Text-to-speech synthesis Natural Language Processing Digital Signal Processing.

Abstract:
Recent progress in deep learning has shown impressive results in the area of speech-to-text. For this reason, a deep neural network is usually trained from a single speaker using a corpus of several hours of voice recorded professionally. Giving such a model a new voice is highly expensive, as it needs a new dataset to be collected and the model retrained. A recent research has developed a three-stage pipeline that allows you to clone an unseen voice from just a few seconds of reference speech during practice and without retraining the template. The researchers share strikingly natural-sounding findings. A Text-to-speech synthesizer is an application that converts text into spoken word, by analyzing and processing the text using Natural Language Processing (NLP) and then using Digital Signal Processing (DSP) technology to convert this processed text into synthesized speech representation of the text. Here, we developed a useful text-to-speech synthesizer in the form of a simple application that converts inputted text into synthesized speech and reads out to the user which can then be saved as an mp3. file. The development of a text to speech synthesizer will be of great help to people with visual impairment and make making through large volume of text easier.

Download article

email to a friend

Copyright & License

Copyright © 2026 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{151003,
        author = {Sakith Nalluri and A.Rohan Sai and M.Saraswati},
        title = {REAL TIME VOICE CLONING},
        journal = {International Journal of Innovative Research in Technology},
        year = {},
        volume = {7},
        number = {11},
        pages = {297-302},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=151003},
        abstract = {Recent progress in deep learning has shown impressive results in the area of speech-to-text. For this reason, a deep neural network is usually trained from a single speaker using a corpus of several hours of voice recorded professionally. Giving such a model a new voice is highly expensive, as it needs a new dataset to be collected and the model retrained. A recent research has developed a three-stage pipeline that allows you to clone an unseen voice from just a few seconds of reference speech during practice and without retraining the template. The researchers share strikingly natural-sounding findings. A Text-to-speech synthesizer is an application that converts text into spoken word, by analyzing and processing the text using Natural Language Processing (NLP) and then using Digital Signal Processing (DSP) technology to convert this processed text into synthesized speech representation of the text. Here, we developed a useful text-to-speech synthesizer in the form of a simple application that converts inputted text into synthesized speech and reads out to the user which can then be saved as an mp3. file. The development of a text to speech synthesizer will be of great help to people with visual impairment and make making through large volume of text easier.},
        keywords = {Text-to-speech synthesis, Natural Language Processing, Digital Signal Processing.},
        month = {},
        }

Download .bib

Cite This Article

ISSN: 2349-6002
Volume: 7
Issue: 11
PageNo: 297-302

REAL TIME VOICE CLONING

Available:https://ijirt.org/article?manuscript=151003

18 Nov, 2022

Drug Traceability In Supplies Of Healthcare Products Using Block Chain

09 Nov, 2022

Improving the Efficiency of Regression Test Case Prioritization for Large Scale Workflow SOA

05 Nov, 2022

Driver Sleep and Drowsiness Detection Using Machine Learning

05 Jun, 2022

Secured Network Coding approach for Cloud Storage

28 Mar, 2022

FACIAL EXPRESSION RECOGNITION USING CNN

27 Mar, 2022

MONITORING COVID-19 SOCIAL DISTANCING WITH PERSON DETECTION AND TRACKING VIA FINE-TUNED YOLO V3 AND DEEPSORT TECHNIQUES

12 Mar, 2022

A Fog-Centric Quantum Security for Cloud Storage Scheme with Enhanced Multipath routing architecture

12 Mar, 2022

KEY-DEDUPLICATION WITH ENHANCED SECURITY USING IDENTITY BASED TECHNIQUE

02 Feb, 2022

A DYNAMIC PRIVACY PRESERVING DATA PUBLISHING USING THREE METRICS TECHNIQUE

25 Jan, 2022

IMPROVEMENT OF THE RECONSTRUCTION QUALITY OF MULTI FRAME USING SUPER RESOLUTION METHOD

Impact Factor
8.01 (Year 2024)

An UGC-Compliant International Research Journal

Join Our IPN

IJIRT Partner Network

Submit your research paper and those of your network (friends, colleagues, or peers) through your IPN account, and receive 800 INR for each paper that gets published.

Join Now