OPEN SOURCE AND PROPRIETARY SOFTWARE FOR AUDIO DEEPFAKES AND VOICE CLONING: GROWTH AREAS, PAIN POINTS, FUTURE INFLUENCE

Vitaliy Danylov

doi:10.30525/2592-8813-2024-1-11

OPEN SOURCE AND PROPRIETARY SOFTWARE FOR AUDIO DEEPFAKES AND VOICE CLONING: GROWTH AREAS, PAIN POINTS, FUTURE INFLUENCE

PDF

Published: Apr 18, 2024

https://doi.org/10.30525/2592-8813-2024-1-11

Vitaliy Danylov

https://orcid.org/0009-0004-8919-3378

Abstract

Abstract. The article is dedicated to exploring the rapidly growing field of audio deepfake and voice cloning technologies. It examines the dual pathways of development in open-source and proprietary software, identifying key areas of growth, challenges faced by developers and users, and the potential impact these technologies may have on various sectors. The relevance of this article lies in its timely examination of a rapidly evolving technology that has significant implications for privacy, security, and content authenticity in the digital age. The research results show that audio deepfakes signify a major leap forward in our ability to generate and modify audio recordings, enabling the creation of highly convincing imitations of specific voices. This technology spans three main categories: imitation-based, synthetic-based, and voice cloning, each offering unique applications and introducing distinct challenges. These advancements have opened up new possibilities in fields such as entertainment, customer service, and security but also bring to light serious ethical and security considerations. The imperative for careful oversight and the development of regulatory frameworks to prevent misuse is clear. The ecosystem of audio deepfake technology features a wide range of open-source and proprietary software, each designed to meet specific requirements. Prominent solutions like Resemble. ai, Descript, and CereProc, among others, cater to diverse needs from entertainment to multilingual voice cloning. Alongside these, open-source projects like FaceSwap and Real-Time Voice Cloning offer valuable resources for innovation, emphasizing the importance of responsible usage and ethical development. The trajectory of audio deepfakes is marked by both promising prospects and formidable challenges. The potential for these technologies to revolutionize storytelling, create personalized experiences, and support educational initiatives is immense, facilitated by ongoing advancements in AI and the growth of open-source communities. However, the concerns surrounding ethical use, the demand for computational resources, and the challenge of achieving linguistic diversity underline the need for comprehensive ethical guidelines and sophisticated detection mechanisms. Navigating the future of audio deepfakes will require a balanced approach, weighing their transformative potential against the risks they pose. The practical significance of audio deepfakes extends into research and development, where they can be used to study speech disorders, aid in voice restoration for individuals who have lost their ability to speak and explore new forms of human-computer interaction. As the technology matures and becomes more accessible, its practical applications are expected to expand, potentially transforming how we interact with digital content and each other in virtual environments.

How to Cite

Danylov, V. (2024). OPEN SOURCE AND PROPRIETARY SOFTWARE FOR AUDIO DEEPFAKES AND VOICE CLONING: GROWTH AREAS, PAIN POINTS, FUTURE INFLUENCE. Baltic Journal of Legal and Social Sciences, (1), 105-113. https://doi.org/10.30525/2592-8813-2024-1-11

Article views: 956 | PDF Downloads: 894

References

1. Almutairi, Z. & Elgibreen, H. (2022). A Review of Modern Audio Deepfake Detection Methods: Challenges and Future Directions. Algorithms. 15. 19. 10.3390/a15050155. URL: https://www. researchgate.net/publication/360354997_A_Review_of_Modern_Audio_Deepfake_Detection_ Methods_Challenges_and_Future_Directions
2. Broz, M. (2023). Top 10 Deepfake Voice Software & Online Tools Review. VansMedia. URL: https:// vansmedia.vanceai.com/deepfake-voice-software-and-online-tools-review/
3. Fauve, B. (2023). The Rise of Audio Deepfakes: Implications and Challenges. URL: https://www. linkedin.com/pulse/rise-audio-deepfakes-implications-challenges-benoit-fauve
4. Habr (2023). Создание deepfake видео и синтез речи open-source проект Wunjo AI. URL: https:// habr.com/ru/articles/752910/
5. Hays, E. (2023). Beyond the Horizon: Exploring Future Prospects and Societal Impacts of Deepfake Technology. URL: https://medium.com/@toddkslater/beyond-the-horizon-exploring-future-prospects- and-societal-impacts-of-deepfake-technology-ecc53b51fcb2
6. Kamunya T. (2023). 8 Best Open Source Deepfake Software for Realistic Illusions. URL: https:// geekflare.com/best-open-source-deepfake-software/
7. Khanjani Z., Watson G., Janeja V. (2023) Audio deepfakes: A survey. Frontiers, 5. URL: https:// www.frontiersin.org/articles/10.3389/fdata.2022.1001063/full
8. Kietzmann, J., Mills, A. & Plangger, K. (2020). Deepfakes: perspectives on the future “reality” of advertising and branding. International Journal of Advertising. 40. 1–13. DOI: 10.1080/02650487.2020.1834211.
9. Naitali, A, Ridouani, M, Salahdine, F, Kaabouch, N. (2023) Deepfake Attacks: Generation, Detection, Datasets, Challenges, and Research Directions. Computers, 12(10):216. DOI: https://doi. org/10.3390/computers12100216
10. Whittaker L., Mulcahy R., Letheren K. (2023). Mapping the deepfake landscape for innovation: A multidisciplinary systematic review and future research agenda. Technovation, Volume 125. URL: https://doi.org/10.1016/j.technovation.2023.102784

Issue

No. 1 (2024): Baltic Journal of Legal and Social Sciences

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Abstract

How to Cite

##plugins.themes.bootstrap3.article.details##