Artificial Intelligence Training and Copyright Law: Where to Draw the Line

Author information:

Alena Yarmak
PhD researcher, European University Institute, Florence, Italy
ORCID ID: https://orcid.org/0000-0002-3733-2864

Maksym Popov
Partner, Mentors Law Firm, Kyiv, Ukraine

Keywords: artificial intelligence, copyright, AI training, fair use, intellectual property law, theories of intellectual property

Download PDF
Обкладинка. 8-й випуск

Abstract

This article explores the issue of training artificial intelligence (AI) in the context of copyright law and compares it to human learning. The authors emphasize that such comparisons must be approached critically, considering the specific nature of both processes. Not every use of copyrighted works in AI training constitutes an infringement; however, AI’s capacity for large- scale memorization and reproduction of protected works necessitates a clear distinction between permissible and impermissible uses. The article proposes a classification of such uses into “negative learning,” “subconscious memorization,” and “verbatim reproduction,” highlighting the legal significance of each category. Special attention is paid to the comparative analysis of regulatory approaches in the United States and the European Union, grounded in dominant theories of intellectual property: the utilitarian and welfare theories in the U.S. versus the personhood theory in the EU. This analysis helps explain the divergence in AI regulation across legal systems. Given Ukraine’s legal tradition rooted in the continental model, the country is encouraged to continue aligning its regulatory path with EU law. The authors argue that there is no need to develop new intellectual property theories to address the challenges posed by AI. Instead, existing frameworks remain effective if applied complementarily: welfare theory should guide early-stage model development, while personhood theory should inform fair compensation mechanisms for creators whose works are used during training. The article also supports the adoption of an output-based remuneration model as a viable solution to balance the interests of authors and AI developers

How to Cite

In accordance with DSTU 8302:2015:
Ярмак А., Попов М. Навчання штучного інтелекту та авторське право: де провести межу? Цивілістична платформа. 2026. № 1 (8). С. 52-88. https://doi.org/10.69724/2786-8834-2026-8-1-52-88

According to the international style of APA:
Yarmak, A. Popov, M (2026) Artificial Intelligence Training and Copyright Law: Where to Draw the Line. С. P. Journal, 1 (8). https://doi.org/10.69724/2786-8834-2026-8-1-52-88 [in Ukrainian].

References

  1. A-BA-BA-HA-LA-MA-HA Publishing House. (2025). Woman from a dream: Anthology of English gothic short stories. https://store.ababahalamaha.com.ua/zhinka-zi-snu [in Ukrainian] (accessed June 26, 2025).
  2. Abbamonte, G. B. (2024). The application of the copyright TDM exceptions and transparency requirements in the AI Act to the training of generative AI. European Intellectual Property Review, 46, 479–489. https://search.informit.org/doi/10.3316/informit.T2024082300022791259107858
  3. Authors Guild v. Google, Inc. (2015). 804 F.3d 202 (2d Cir.). https://www.law.berkeley.edu/wp-content/uploads/2016/05/Authors-Guild-v-Google-804_F.3d_202.pdf
  4. Bartz v. Anthropic PBC. (2025). No. C 24-05417 WHA (N.D. Cal.).
  5. Bordoloi, S. K. (2025, January 13). The great data famine: How AI ate the internet (and what’s next). Sify. https://www.sify.com/ai-analytics/the-great-data-famine-how-ai-ate-the-internet-and-whats-next/
  6. Bradford, A. (2024). The false choice between digital regulation and innovation. Northwestern University Law Review, 119, 377–452. https://scholarlycommons.law.northwestern.edu/nulr/vol119/iss2/3
  7. Cabinet of Ministers of Ukraine. (2025). On approval of the action plan for implementing the recommendations of the European Commission presented in the Ukraine Progress Report within the 2024 EU Enlargement Package (Order No. 300-r). https://zakon.rada.gov.ua/laws/show/300-2025-%D1%80#Text  [in Ukrainian]
  8. Chander, A. (2014). How law made Silicon Valley. Emory Law Journal, 63, 639–694. https://scholarlycommons.law.emory.edu/elj/vol63/iss3/3
  9. Charlesworth, J. (2025). Generative AI’s illusory case for fair use. Vanderbilt Journal of Entertainment and Technology Law, 27, 323–356. http://dx.doi.org/10.2139/ssrn.4924997
  10. Cooper, A. F., & Grimmelmann, J. (2025, September 1). The files are in the computer: On copyright, memorization, and generative AI. arXiv. https://doi.org/10.48550/arXiv.2404.12590
  11. Cooper, A. F., et al. (2025, September 17). Extracting memorized pieces of (copyrighted) books from open-weight language models. arXiv. https://doi.org/10.48550/arXiv.2505.12546
  12. Court of Justice of the European Union. (2009). Case C-5/08, Infopaq International A/S v. Danske Dagblades Forening (Judgment of the Fourth Chamber, ECLI:EU:C:2009:465) https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex:62008CJ0005
  13. Court of Justice of the European Union. (2011). Case C-145/10, Eva-Maria Painer v. Standard VerlagsGmbH and Others (Judgment of the Third Chamber).
  14. Court of Justice of the European Union. (2025). Case C-250/25, summary of the request for a preliminary ruling pursuant to Article 98(1) of the Rules of Procedure (Budapest Környéki Törvényszék, Hungary).
  15. de la Durantaye, K. (2023, August 10). “Garbage in, garbage out” – Die Regulierung generativer KI durch Urheberrecht. SSRN. https://papers.ssrn.com/abstract=4571908
  16. Digital Millennium Copyright Act, Pub. L. No. 105–304, 112 Stat. 2860 (1998).
  17. Doe v. GitHub, Inc., No. 22-cv-06823-JST (N.D. Cal. May 11, 2023).
  18. Dornis, T. W. (2024, October 19). The training of generative AI is not text and data mining. SSRN. http://dx.doi.org/10.2139/ssrn.4993782 (дата звернення 25.12.2025)
  19. Dusollier, S., et al. (2025). Copyright and generative AI: Opinion. JIPITEC. Journal of Intellectual Property, Information Technology and E-Commerce Law. 16. https://www.jipitec.eu/jipitec/article/view/424
  20. European Commission. (2024). The Draghi report on EU competitiveness. https://commission.europa.eu/topics/eu-competitiveness/draghi-report_en
  21. European Digital Rights (EDRi). (n.d.). Copyright: Challenges of the digital environment.https://www.edri.org/files/paper07_copyright.pdf
  22. European Parliament & Council of the European Union. (2000). Directive 2000/31/EC of 8 June 2000 on certain legal aspects of information society services, in particular electronic commerce, in the Internal Market. Official Journal of the European Communities. 2000. L 178. P. 1–16. URL: https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32000L0031
  23. European Parliament & Council of the European Union. (2001). Directive 2001/29/EC of the European Parliament and of the Council of 22 May 2001 on the harmonisation of certain aspects of copyright and related rights in the information society. Official Journal of the European Communities, L 167, 10–19. https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32001L0029
  24. European Parliament & Council of the European Union. (2019). Directive (EU) 2019/790 of 17 April 2019 on copyright and related rights in the Digital Single Market and amending Directives 96/9/EC and 2001/29/EC (Text with EEA relevance). Official Journal of the European Union.
  25. European Parliament & Council of the European Union. (2024). Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 laying down harmonised rules on artificial intelligence (Artificial Intelligence Act). Official Journal of the European Union, L. https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32024R1689
  26. Evans, G. (2025, March 28). Hayao Miyazaki’s AI criticism resurfaces amid Studio Ghibli trend. The Independent. https://www.independent.co.uk/arts-entertainment/films/news/hayao-miyazaki-studio-ghibli-ai-trend-b2723358.html
  27. Fisher, F. Inc. v. Dillingham. (1924). Fred Fisher, Inc. v. Dillingham, 298 F. 145 (S.D.N.Y.). https://case-law.vlex.com/vid/fred-fisher-inc-v-894155988
  28. Fisher, W. W. (2001). Theories of intellectual property. Cambridge University Press. https://cyber.harvard.edu/property00/iptheory.html
  29. Fisher, W. W. (2004). Promises to keep: Technology, law, and the future of entertainment. Stanford University Press.
  30. Garicano, L., Holmström, B., & Petit, N. (2025, November 10). The constitution of innovation: A new European renaissance. The Constitution of Innovation. https://constitutionofinnovation.eu
  31. Gervais, D. (2023, December 4). Licensing of copyright-protected material for AI makes business sense. Copyright Clearance Center. https://www.copyright.com/wp-content/uploads/2023/12/Gervais-Licensing-of-Copyright-Protected-Material-for-AI-Makes-Business-Sense.pdf
  32. Getty Images (US), Inc., et al. v. Stability AI Ltd. (2025). Getty Images (US), Inc. and others v. Stability AI Ltd. [2025] EWHC (Ch).
  33. Ginsburg, J. C. (2025). AI inputs, fair use and the US Copyright Office report. Journal of Intellectual Property Law & Practice, 20, 521–529. https://doi.org/10.1093/jiplp/jpaf046
  34. Ginsburg, J. C. (2025, March 7). Humanist copyright. SSRN. https://dx.doi.org/10.2139/ssrn.5170170
  35. Glotov, S. (2025) Copyright v. artificial Intelligence: Machine learning is not a Copyright use of a Work. С. P. Journal, 1 (4). https://doi.org/10.69724/2786-8834-2025-4-1-72-89 [in Ukrainian].
  36. Gour, H. (2025, September 27). Bartz v. Anthropic: All you need to know about the largest copyright settlement in history. The Leaflet. https://theleaflet.in/digital-rights/law-and-technology/bartz-v-anthropic-all-you-need-to-know-about-the-largest-copyright-settlement-in-history
  37. Harvard Law Review. (2025, February 11). Recovering personality in copyright’s originality inquiry. https://harvardlawreview.org/?p=17018
  38. Hathaway, O. A. (2000, August 1). Path dependence in the law: The course and pattern of legal change in a common law system. SSRN. https://papers.ssrn.com/abstract=239332
  39. Henderson, E. A. (2024). Intellectual property liability for businesses in the age of AI: What new liabilities businesses using AI could face and the possible methods of self-protection. Michigan Business & Entrepreneurial Law Review, 13(1), 62–94. https://repository.law.umich.edu/mbelr/vol13/iss1/4
  40. High Commercial Court of Ukraine. (2020, May 14). Judgment in case No. 58/505. https://reyestr.court.gov.ua/Review/89321305  [in Ukrainian]
  41. Horbova, N. (2015). Theories of intellectual property: Toward problem formulation (Working paper). Zaporizhzhia National University. http://elar.tsatu.edu.ua/handle/123456789/1642 [in Ukrainian]
  42. Kadrey v. Meta Platforms, Inc., No. 23-cv-03417-VC (N.D. Cal. 2025).
  43. Kokhanovska, O. (2011). Main theories of intellectual property law and their influence on the development of modern legislation in Law Ukraine , 5, 52–59. [in Ukrainian]
  44. Landgericht Hamburg (Hamburg District Court). (2024). LAION v. Robert Kneschke, Case No. 310 O.22723.
  45. Landgericht München I (Munich Regional Court). (2025, November 11). GEMA v. OpenAI.
  46. Lee, E. (2025, February 1). Fair use and the origin of AI training. SSRN. https://dx.doi.org/10.2139/ssrn.5253011
  47. Lemley, Mark A., & Casey, B. (2020). Fair learning. SSRN. https://dx.doi.org/10.2139/ssrn.3528447
  48. Lessig, L. (1999). The law of the horse: What cyberlaw might teach. Harvard Law Review, 113, 501–549. https://doi.org/10.2307/1342331
  49. Locke, J. (1689/1980). Second treatise of government (D. Gowan & C. Greif, Trans.). Hackett Publishing Company.
  50. Los Angeles Times. (2023, May 11). Column: Your boss wants AI to replace you. The writers’ strike shows how to fight back. https://www.latimes.com/business/technology/story/2023-05-11/column-the-writers-strike-is-only-the-beginning-a-rebellion-against-ai-is-underway
  51. Lutes, B. A. (Ed.). (2025). Identifying the economic implications of artificial intelligence for copyright policy: Context and direction for economic research. U.S. Copyright Office.
  52. Metz, R. (2024, January 8). OpenAI accuses New York Times of “not telling the full story” about copyright while admitting “rare bug” spits out exact quotes. Fortune. https://fortune.com/2024/01/08/openai-blog-post-new-york-times-lawsuit-not-full-story-copyright/
  53. Mezei, P. (2025, August 5). Memorization and generative AI – A persistent issue with copyright consequences? SSRN. http://dx.doi.org/10.2139/ssrn.5404367
  54. National Intellectual Property Office of Ukraine. (2025, December 30). AI and copyright: International experience for authors. https://nipo.gov.ua/ai-tdm-ta-ap-mizhnarodni-pidkhody/ [in Ukrainian]
  55. Nematzadeh, A., Ruder, S., & Yogatama, D. (2020). Memory in human and artificial language processing systems. In Proceedings of the Bridging AI and Cognitive Science Workshop at ICLR 2020.https://baicsworkshop.github.io/pdf/BAICS_22.pdf
  56. On Copyright and Related Rights. Law of Ukraine (2022, December 1). No. 2811-IX (as amended by Law No. 4585-IX of August 21, 2025; current version as of December 12, 2025). https://zakon.rada.gov.ua/laws/show/2811-20#Text [in Ukrainian]
  57. Peukert, A. (2025). Regulating IP exclusion/inclusion on a global scale: The example of copyright vs. AI training. SSRN. https://papers.ssrn.com/abstract=4905400
  58. Peukert, C. (2025, December 3). The economics of copyright and AI: Empirical evidence and optimal policy. (In-depth analysis requested by the Committee on Legal Affairs (JURI) European Parliament.https://www.europarl.europa.eu/thinktank/en/document/IUST_STU(2025)778859
  59. Quintais, J. P. (2025). Copyright, the AI Act and extraterritoriality (Policy brief). The Lisbon Council.https://lisboncouncil.net/wp-content/uploads/2025/06/The-Lisbon-Council_Copyright-the-AI-Act-and-Extraterritoriality_Joao-Pedro-Quintais_June-2025_final.pdf
  60. Rademeyer, M., & Selvadurai, N. (2026). Out from the shadows: Developing effective copyright laws for AI training datasets and shadow libraries. Journal of Intellectual Property Law & Practice, 21(1), 22–35. https://doi.org/10.1093/jiplp/jpaf072
  61. Roeder, M. A. (1940). The doctrine of moral right: A study in the law of artists, authors and creators. Harvard Law Review, 53, 554–587.
  62. Rosati, E. (2025). Copyright Exceptions and Fair Use Defences for AI Training Done for “Research” and “Learning,” or the Inescapable Licensing Horizon. European Journal of Risk Regulation, 1–22. DOI: https://doi.org/10.1017/err.2025.10035
  63. Rosati, E. (2025, September 16). The future of the movie industry in the wake of generative AI: A perspective under EU and UK copyright law. SSRN. https://papers.ssrn.com/abstract=5493786
  64. Senftleben, M. (2022). Compliance of national TDM rules with international copyright law – An overrated nonissue? SSRN. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4134651
  65. Senftleben, M. (2025). Removing copyright obstacles to AI training while ensuring author remuneration. Chicago-Kent Law Review, 100, 1–42. https://scholarship.kentlaw.iit.edu/cklawreview/vol100/iss1/5/
  66. Shevchenko, A. I. (2023). Strategy for the development of artificial intelligence in Ukraine. IPSHI. https://doi.org/10.15407/development_strategy_2023 [in Ukrainian]
  67. Shtefan, A. S. (2017). Copyright and related rights: Features of legal protection, exercise, and enforcement.Interservis. [in Ukrainian]
  68. Shumailov, I., et al. (2024). AI models collapse when trained on recursively generated data. Nature, 631, 755–758. https://www.nature.com/articles/s41586-024-07566-y
  69. Söbbing, T., & Schwarz, A. (2025). Einführung in das Recht der künstlichen Intelligenz. Disserta. [in German]
  70. Spasybo-Fateyeva, I. V. (Ed.). (2018). Kharkiv civil law school: Exercise and protection of intellectual property rights. Pravo. [in Russian]
  71. Supreme Court of the United States. (1954). Mazer v. Stein, 347 U.S. 201.
  72. Supreme Court of the United States. (1984). Sony Corp. of America v. Universal City Studios, Inc., 464 U.S. 417 (No. 81–1687).
  73. Supreme Court of the United States. (1985). Harper & Row, Publishers, Inc. v. Nation Enterprises, 471 U.S. 539 (No. 83–1632). https://supreme.justia.com/cases/federal/us/471/539
  74. Supreme Court of the United States. (2021). Google LLC v. Oracle America Inc., 593 U.S. 1 (No. 18–956).
  75. Supreme Court of Ukraine, Civil Cassation Court. (2024, January 17). Judgment in case No. 308/7570/18-st. https://reyestr.court.gov.ua/Review/117473137 [in Ukrainian]
  76. Tarasiuk, A. (2024). Legal grounds for the use of copyright works in machine learning. Theory and practice of intellectual property. 2, 73–82. https://doi.org/10.33731/22024.305506 [in Ukrainian]
  77. The Village Ukraine. (2025, January 29). “A-ba-ba-ha-la-ma-ha” made a cover using AI. Then came discussions, accusations, and promo codes. https://www.village.com.ua/village/culture/culture-situation/359337-laquo-em-a-ba-ba-ga-la-ma-ga-em-raquo-zrobila-obkladynku-za-dopomogoyu-shi-dali-buli-diskusiyi-zvinuvachennya-ta-promokodi [in Ukrainian]
  78. The Walt Disney Company. (2025, December 11). The Walt Disney Company and OpenAI reach agreement to bring Disney characters to Sora. https://thewaltdisneycompany.com/disney-openai-sora-agreement/
  79. Thomson Reuters Enterprise Centre GmbH v. Ross Intelligence Inc., No. 1:20-cv-613-SB (D. Del. 2025). https://law.justia.com/cases/federal/district-courts/delaware/dedce/1:2020cv00613/72109/804/
  80. Tong, A., et al. (2024, February 22). Exclusive: Reddit in AI content licensing deal with Google. Reuters. https://www.reuters.com/technology/reddit-ai-content-licensing-deal-with-google-sources-say-2024-02-22/
  81. Tribunal de grande instance de Paris. (2008, August 6). Métropole Télévision v. Wizzgo, Case No. 08/56275.
  82. Tribunal de grande instance de Paris. (2008, November 10). NT1 v. Wizzgo, Case No. 08/58864.
  83. Tribunal de grande instance de Paris. (2008, November 6). France 2 v. Wizzgo, Case No. 08/58349.
  84. Tribunal de grande instance de Paris. (2008, November 6). TF1 v. Wizzgo, Case No. 08/58348.
  85. United States Constitution. (1787). Article I, Section 8, Clause 8.
  86. United States Court of Appeals for the Ninth Circuit. (1992). Sega Enterprises Ltd. v. Accolade, Inc., 977 F.2d 1510. https://law.justia.com/cases/federal/appellate-courts/F2/977/1510/305345/
  87. United States Court of Appeals for the Ninth Circuit. (2000). Three Boys Music Corp. v. Bolton, №55150, 212 F.3d 477.
  88. United States Court of Appeals for the Ninth Circuit. (2007). Perfect 10, Inc. v. Amazon.com, Inc., 508 F.3d 1146. https://cdn.ca9.uscourts.gov/datastore/opinions/2007/12/03/0655405.pdf
  89. United States District Court for the District of Delaware. (2025). Thomson Reuters Enterprise Centre GMBH v. Ross Intelligence Inc, No. 1:20-cv-613-SB.
  90. United States District Court for the District of Massachusetts. (2024). UMG Recordings, Inc. v. Suno, Inc., No. 1:24-cv-11611.
  91. Valamontes, A. Quantifying the financial and legal impact of unauthorized ai training on independent music creators (2025). Report No 1. DOI: 10.13140/RG.2.2.14239.27044
  92. Vashchynets, I. I. (2006). Civil-law protection of copyright in the context of information technology development [PhD thesis, V. M. Koretsky Institute of state and law NAS of Ukraine]. [in Ukrainian]
  93. Yarova, M. (2025, January 23). Uanet is outraged over the “A-ba-ba-ha-la-ma-ha” book cover created by AI. Scroll Media. https://scroll.media/2025/01/23/skandal-ai-a-ba-ba-ga-la-ma-ga/ [in Ukrainian]
  94. Young, S. (2025). DeepSeek, ChatGPT, and the global fight for technological supremacy. Harvard Law School. https://hls.harvard.edu/today/deepseek-chatgpt-and-the-global-fight-for-technological-supremacy/