BEYOND THE PROMPT: RECONCILING TRAINING DATA INGESTION AND MUSIC COPYRIGHT IN THE GENERATIVE AI ERA

Authors

DOI:

https://doi.org/10.58898/famedia.v1.4

Keywords:

Generative AI, Music Copyright, Algorithmic Black Box, Training Data Attribution, Fair Use, Text and Data Mining (TDM), Intellectual Property

Abstract

This paper examines the unprecedented regulatory restructuring in the global music industry driven by the rapid commercialization of generative artificial intelligence (GenAI) and its fundamental challenge to traditional copyright frameworks. Through a comparative analysis of emerging litigation and divergent legal approaches across the US, EU, and UK, the paper investigates the "algorithmic black box" of AI training data ingestion, the complexities surrounding AI authorship, and the highly debated "fair use" defense. Conclusions indicate that the unauthorized ingestion of copyrighted musical compositions and sound recordings threatens to dilute the market and depreciate foundational cultural capital, exposing systemic inadequacies in current legislation. To prevent irreversible damage to the creative sector, the paper concludes that AI models must not be legally equated to human learners. It recommends implementing scalable technological and regulatory solutions, specifically Training Data Attribution (TDA) coupled with a user-centric blanket licensing frameworks or an opt-in closed-universe database. These mechanisms would ensure fair remuneration for rightsholders, fostering a sustainable ecosystem where algorithmic innovation and human artistry can coexist.

Downloads

Download data is not yet available.

References

Abbott, R. (2016). I think, therefore I invent: Creative computers and the future of patent law. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.2727884

Anthony Justice and 5th Wheel Records, Inc. v. Suno, Inc., 1:25-cv-11739-FDS (United States District Court, District of Massachusetts August 18, 2025). https://www.musicbusinessworldwide.com/files/2025/08/095113130440.pdf

Bathaee, Y. (2018). The artificial intelligence black box and the failure of intent and causation.

Caldwell, M. (2023). What is an “author”?-copyright authorship of AI art through a philosophical lens. Houston Law Review, 61(2). https://houstonlawreview.org/article/92132-what-is-an-author-copyright-authorship-of-ai-art-through-a-philosophical-lens

Choi, W., Koo, J., Cheuk, K. W., Serrà, J., Martínez-Ramírez, M. A., Ikemiya, Y., Murata, N., Takida, Y., Liao, W.-H., & Mitsufuji, Y. (2025). Large-scale training data attribution for music generative models via unlearning (arXiv:2506.18312). arXiv. https://doi.org/10.48550/arXiv.2506.18312 Clark, D. C. (2026). TDM and training AI models: UK Government provides updated statement on copyright and AI. DLA Piper. https://www.dlapiper.com/insights/blogs/mse-today/2026/tdm-and-training-ai-models

Cooke, C. (2026, March 3). No copyright for AI-generated works in US as Supreme Court refuses to intervene in earlier landmark ruling. CMU | the Music Business Explained. https://completemusicupdate.com/no-copyright-for-ai-generated-works-in-us-as-supreme-court-refuses-to-intervene-in-earlier-landmark-ruling/ Copyright, Designs and Patents Act, Pub. L. No. C. 48, UK Public General Acts (1988). https://www.legislation.gov.uk/ukpga/1988/48/contents

Craig, C. J., & Kerr, I. R. (2019). The death of the AI author. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3374951 Dredge, S. (2026, April 15). Sample creators will get payouts from Splice’s new GenAI tools. Music Ally. https://musically.com/2026/04/15/sample-creators-will-get-payouts-from-splices-new-genai-tools/

Epstein, M. M. (2025). Artificial intelligence and music mash-ups: Monetizing an opt-in closed universe database to preserve royalties and credit for composer and sound recording rights holders.

Fritz, J. (2025). Understanding authorship in artificial intelligence-assisted works. Journal of Intellectual Property Law & Practice, 20(5), 354–364. https://doi.org/10.1093/jiplp/jpae119

Hugenholtz, P. B. (2024). Copyright and the expression engine: Idea and expression in AI-assisted creations (SSRN Scholarly Paper No. 4982516). Social Science Research Network. https://doi.org/10.2139/ssrn.4982516

Johnson, D. R., & Post, D. (1996). Law and borders: The rise of law in cyberspace. Stanford Law Review, 48(5), 1367–1402. https://doi.org/10.2307/1229390 Kogon v. Google, LLC, 1:26-cv-02582 (N.D. Ill. March 6, 2026).

Kumarage, P., & Saarela, M. (2025). Explainability in generative AI: An umbrella review of current techniques, limitations, and future directions. Miller, A. R. (1993). Copyright protection for computer programs, databases, and computer-generated works: Is anything new since CONTU? Harvard Law Review, 106(5), 977. https://doi.org/10.2307/1341682

Muikku, J. (2017). Pro rata and user centric distribution models: A comparative study. Digital Media Finland. https://www.fim-musicians.org/wp-content/uploads/prorata-vs-user-centric-models-study-2018.pdf Royalty Exchange. (2025, March 5). Why transparency matters in music royalties | royalty exchange. https://royaltyexchange.com/blog/why-transparency-matters-in-music-royalties

Shumailov, I., Shumaylov, Z., Zhao, Y., Gal, Y., Papernot, N., & Anderson, R. (2024). The curse of recursion: Training on generated data makes models forget (arXiv:2305.17493). arXiv. https://doi.org/10.48550/arXiv.2305.17493

Stassen, M. (2025, November 11). GEMA wins landmark ruling against OpenAI over ChatGPT’s use of song lyrics. Music Business Worldwide. https://www.musicbusinessworldwide.com/gema-wins-landmark-ruling-against-openai-over-chatgpts-use-of-song-lyrics/

Stassen, M. (2026, April 16). Splice launches AI tools that compensate sample creators when their sounds are used. Music Business Worldwide. https://www.musicbusinessworldwide.com/splice-launches-ai-tools-that-compensate-sample-creators-when-their-sounds-are-used/ Stephen Thaler, an Individual, Appellant V. Shira Perlmutter, in Her Official Capacity as Register of Copyrights and Director of the United States Copyright Office and U.s. Copyright Office, Appellees, No. 23-5233 (United States Court of Appeals for the District of Columbia Circuit March 18, 2025).

Stout, K., Sperry, B., & Ramakrishnan, S. (2026). Generative AI: when fair use becomes unfair competition. 32(4).

Svantesson, D. J. B. (2017). Solving the Internet Jurisdiction Puzzle. Oxford University Press USA - OSO. Tencer, D. (2024, August 5). As Suno and Udio admit training AI with unlicensed music, record industry says: ‘There’s nothing fair about stealing an artist’s life’s work.’ Music Business Worldwide. https://www.musicbusinessworldwide.com/as-suno-and-udio-admit-training-ai-with-unlicensed-music-record-industry-says-theres-nothing-fair-about-stealing-an-artists-lifes-work/

Throsby, C. D. (2001). Economics and culture (Reprint., transferred to digital print). Cambridge Univ. Press.

Torrance, A. W., & Tomlinson, B. (2023). Training Is Everything: Artificial Intelligence, Copyright, and Fair Training (SSRN Scholarly Paper No. 4437680). Social Science Research Network. https://papers.ssrn.com/abstract=4437680 UK copyright and AI report: The ‘opt-out’ is dead, but what comes next? (2026, March 26). https://www.reedsmith.com/articles/uk-copyright-and-ai-report-the-opt-out-is-dead-but-what-comes-next/

UMG Recordings, Inc., et al. v. Suno, Inc. and John Does 1-10, 1:24-cv-11611-FDS (United States District Court, District of Massachusetts August 1, 2024). https://www.musicbusinessworldwide.com/files/2024/08/SUNO-response-to-copyright-suit.pdf U.S. Copyright Office. (2021, January 28).

Compendium of U.S. Copyright Office Practices | U.S. Copyright Office. https://www.copyright.gov/comp3/

Voss, A. (2026). Report on copyright and generative artificial intelligence – opportunities and challenges (INI Report Nos. A10-0019/2026). European Parliament, Committee on Legal Affairs. https://www.europarl.europa.eu/doceo/document/A-10-2026-0019_EN.html

Published

2026-07-02

How to Cite

BEYOND THE PROMPT: RECONCILING TRAINING DATA INGESTION AND MUSIC COPYRIGHT IN THE GENERATIVE AI ERA. (2026). Social Informatics Journal, 5(1), 35-45. https://doi.org/10.58898/famedia.v1.4

Similar Articles

1-10 of 34

You may also start an advanced similarity search for this article.