Generative AI as a historical source: source criticism, citation integrity, and the jagged frontier of digital history

Iryna A. Selyshcheva

doi:10.55056/cte.1438

Authors

Iryna A. Selyshcheva Kryvyi Rih State Pedagogical University https://orcid.org/0000-0002-4841-6449

DOI:

https://doi.org/10.55056/cte.1438

Keywords:

digital history, generative artificial intelligence, large language models, source criticism, hallucination and citation integrity, handwritten text recognition, AI literacy in history education

Abstract

Between the public release of ChatGPT in late 2022 and 2026, generative artificial intelligence (AI) moved from a computational novelty to a structural feature of historical scholarship, reshaping how primary sources are transcribed, described, analysed, and communicated. This article argues that the decisive methodological shift is not the automation of existing tasks but the arrival of a new kind of object for the historian's craft: the large language model (LLM) itself, which must be read as a historical source rather than trusted as a neutral instrument. Drawing on peer-reviewed evaluations, professional-society guidance, primary legal filings, and documented failure cases, the article develops three connected claims. First, generative models are best understood as an algorithmic cartography of the digitised record whose jagged frontier of competence maps which pasts have been absorbed into training data and which remain silent. Second, the same architecture that enables transcription of damaged manuscripts and large-scale corpus analysis also produces hallucinations and fabricated citations at rates incompatible with the evidentiary standards of the discipline; recent audits and accountability cases in scholarship, government, and the courts illustrate the stakes. Third, the responsible integration of these tools depends on extending traditional source criticism to the model, on non-negotiable verification of every reference, on cryptographic provenance and Indigenous data-governance frameworks, and on assessment redesign rather than prohibition. The article synthesises evidence across document analysis, public history, and pedagogy to propose a programme for a critically literate, symbiotic historical scholarship.

Downloads

Download data is not yet available.

Abstract views: 35 / PDF views: 7

References

Alenichev, A., Shaffer, J.D., Kingori, P., Grietens, K.P., Muldoon, J. and Rocher, L., 2026. ‘We can see a savage’: a case study of the colonial gaze in generative AI algorithms. AI & SOCIETY, 41(4), pp.3413–3435. Available from: https://doi.org/10.1007/s00146-025-02685-0. DOI: https://doi.org/10.1007/s00146-025-02685-0

American Alliance of Museums, 2024. AI Adolescence. Museum. Available from: https://www.aam-us.org/2024/01/16/ai-adolescence-in-museums/.

American Historical Association, 2025. AHA Publishes Guiding Principles for Artificial Intelligence in History Education. AHA News. Available from: https://www.historians.org/news/aha-publishes-guiding-principles-for-artificial-intelligence-in-history-education/.

American Historical Association, Ad Hoc Committee on Artificial Intelligence in History Education, 2025. Guiding Principles for Artificial Intelligence in History Education. American Historical Association. Approved by the AHA Council, 29 July 2025. Available from: https://www.historians.org/resource/guiding-principles-for-artificial-intelligence-in-history-education/.

American Historical Review, 2024. AHR Call for Proposals: AI in Historical Perspectives. AHR History Lab. Rolling call through 30 December 2026. Available from: https://www.historians.org/news-publications/american-historical-review/how-to-submit/ai-in-historical-perspectives/.

Ansari, S., 2026. Compound Deception in Elite Peer Review: A Failure Mode Taxonomy of 100 Fabricated Citations at NeurIPS 2025. Available from: https://doi.org/10.48550/arXiv.2602.05930.

Antonelli, P., Reas, C., Anadol, R. and Kuo, M., 2021. Modern Dream: How Refik Anadol Is Using Machine Learning and NFTs to Interpret MoMA’s Collection. Available from: https://www.moma.org/magazine/articles/658.

Bender, E.M., Gebru, T., McMillan-Major, A. and Shmitchell, S., 2021. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. New York, NY, USA: Association for Computing Machinery, FAccT ’21, p.610–623. Available from: https://doi.org/10.1145/3442188.3445922. DOI: https://doi.org/10.1145/3442188.3445922

Black, A., 2025. The ChatGPT Exam: Critiquing Generative AI to Assess Learning. Teaching History: A Journal of Methods, 49(1), p.34–41. Available from: https://doi.org/10.33043/gg58bfzzg. DOI: https://doi.org/10.33043/gg58bfzzg

B Dikow, R., DiPietro, C., G Trizna, M., BredenbeckCorp, H., G Bursell, M., B Ekwealor, J.T., J Hodel, R.G., Lopez, N., B Mattingly, W.J., Munro, J., M Naples, R., Oubre, C., Robarge, D., Snyder, S., L Spillane, J., Tomerlin, M.J., J Villanueva, L. and E White, A., 2023. Developing responsible AI practices at the Smithsonian Institution. Research Ideas and Outcomes, 9, p.e113334. Available from: https://doi.org/10.3897/rio.9.e113334. DOI: https://doi.org/10.3897/rio.9.e113334

Carroll, S.R., Garba, I., Figueroa-Rodríguez, O.L., Holbrook, J., Lovett, R., Materechera, S., Parsons, M., Raseroka, K., Rodriguez-Lonebear, D., Rowe, R., Sara, R., Walker, J.D., Anderson, J. and Hudson, M., 2020. The CARE Principles for Indigenous Data Governance. Data Science Journal, 19(1), p.43. Available from: https://doi.org/10.5334/dsj-2020-043. DOI: https://doi.org/10.5334/dsj-2020-043

Chesney, R. and Citron, D.K., 2019. Deep Fakes: A Looming Challenge for Privacy, Democracy, and National Security. California Law Review, 107(6), pp.1753–1820. Available from: https://doi.org/10.15779/Z38RV0D15J. DOI: https://doi.org/10.2139/ssrn.3213954

Clayton, D., Altink, H. and Wilson, E., 2025. Piloting Responsible and Effective Use of Generative AI in Undergraduate History Teaching. Historical Transactions (Royal Historical Society). 16 July 2025. Available from: https://blog.royalhistsoc.org/2025/07/16/piloting-responsible-and-effective-use-of-generative-ai-in-undergraduate-history-teaching/.

Coalition for Content Provenance and Authenticity, 2025. C2PA | Verifying Media Content Sources. Available from: https://c2pa.org/.

Dell’Acqua, F., McFowland, E., Mollick, E., Lifshitz, H., Kellogg, K.C., Rajendran, S., Krayer, L., Candelon, F. and Lakhani, K.R., 2026. Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of Artificial Intelligence on Knowledge Worker Productivity and Quality. Organization Science, 37(2), pp.403–423. Available from: https://doi.org/10.1287/orsc.2025.21838. DOI: https://doi.org/10.1287/orsc.2025.21838

Draxler, C., Heuvel, H. van den, Hessen, A. van, Ircing, P. and Lehečka, J., 2024. Speech Technology Services for Oral History Research. In: I. Anuradha, M. Wynne, F. Frontini and A. Plum, eds. Proceedings of the First Workshop on Holocaust Testimonies as Language Resources (HTRes) @ LREC-COLING 2024. Torino, Italia: ELRA and ICCL, pp.38–43. Available from: https://aclanthology.org/2024.htres-1.6/. DOI: https://doi.org/10.63317/2osuy23hwynf

England and Wales High Court, 2025. Ayinde -v- London Borough of Haringey, and Al-Haroun -v- Qatar National Bank. [2025] EWHC 1383 (Admin), judgment of 6 June 2025 (Dame Victoria Sharp P. and Johnson J.). Available from: https://www.judiciary.uk/judgments/ayinde-v-london-borough-of-haringey-and-al-haroun-v-qatar-national-bank/.

Europeana, 2023. AI in relation to GLAMs Task Force: Report and recommendations. Europeana Pro. Available from: https://pro.europeana.eu/files/Europeana_Professional/Europeana_Network/Europeana_Network_Task_Forces/Final_reports/AI%20in%20relation%20to%20GLAMs%20Task%20Force%20Report.pdf.

Ghiriti, A., Göderle, W. and Kern, R., 2024. Exploring the Capabilities of GPT4-Vision as OCR Engine. In: A. Antonacopoulos, A. Hinze, B. Piwowarski, M. Coustaty, G.M. Di Nunzio, F. Gelati and N. Vanderschantz, eds. Linking Theory and Practice of Digital Libraries (TPDL 2024). Cham: Springer Nature Switzerland, Lecture Notes in Computer Science, vol. 15178. Available from: https://doi.org/10.1007/978-3-031-72440-4_1. DOI: https://doi.org/10.1007/978-3-031-72440-4_1

Gillis, B., 2025. A Disciplinary Approach to Generative AI in the History Classroom. Perspectives on History (American Historical Association). 24 September 2025. Available from: https://www.historians.org/perspectives-article/a-disciplinary-approach-to-generative-ai-in-the-history-classroom/.

Grigoli, L.R., 2023. Townhouse Notes: Ghosts in the Machine. Perspectives on History (American Historical Association). Available from: https://www.historians.org/perspectives-article/townhouse-notes-ghosts-in-the-machine-march-2023/.

Grynbaum, M.M. and Mac, R., 2023. The Times Sues OpenAI and Microsoft Over A.I. Use of Copyrighted Work. The New York Times, 27 December 2023. Available from: https://www.nytimes.com/2023/12/27/business/media/new-york-times-open-ai-microsoft-lawsuit.html.

Guldi, J., 2023. The Dangerous Art of Text Mining: A Methodology for Digital History. Cambridge: Cambridge University Press. Available from: https://doi.org/10.1017/9781009263016. DOI: https://doi.org/10.1017/9781009263016

Guldi, J., 2024. The Revolution in Text Mining for Historical Analysis is Here. The American Historical Review, 129(2), pp.519–543. Available from: https://doi.org/10.1093/ahr/rhae163. DOI: https://doi.org/10.1093/ahr/rhae163

Henley, A., Bruckner, L., Jacobs, H., Jansen, M., Nunez, B., Rodriguez, R. and Wilson, M., 2024. On the Books: Jim Crow and Algorithms of Resistance, a Collections as Data Case Study. Journal on Computing and Cultural Heritage, 16(4). Available from: https://doi.org/10.1145/3631128. DOI: https://doi.org/10.1145/3631128

Hutchinson, D., 2024. Mapping the Latent Past: Assessing Large Language Models as Digital Tools through Source Criticism. Journal of Digital History, 3(1). Available from: https://doi.org/10.1515/JDH-2023-0018. DOI: https://doi.org/10.1515/jdh-2023-0018

Jackson, S., 2023. Don’t Stop Worrying or Learn to Love AI: A Plea for Caution. Perspectives on History (American Historical Association). 6 November 2023. Available from: https://www.historians.org/perspectives-article/dont-stop-worrying-or-learn-to-love-ai-a-plea-for-caution-november-2023/.

Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., Ishii, E., Bang, Y.J., Madotto, A. and Fung, P., 2023. Survey of Hallucination in Natural Language Generation. ACM Computing Surveys, 55(12), p.248. Available from: https://doi.org/10.1145/3571730. DOI: https://doi.org/10.1145/3571730

Journal of Digital History, 2025. AI & history (Issue n.8). Journal of Digital History (De Gruyter / C2 DH Luxembourg). Available from: https://www.journalofdigitalhistory.org/en/articles.

Kim, S., Baudru, J., Ryckbosch, W., Bersini, H. and Ginis, V., 2025. Early evidence of how LLMs outperform traditional systems on OCR/HTR tasks for historical records. Available from: https://doi.org/10.48550/arXiv.2501.11623.

Lee, M. and Hsu, J.H.P., 2024. An Evaluation of GPT-4V for Transcribing the Urban Renewal Hand-Written Collection. ADHO Digital Humanities Conference 2024 (DH2024), Arlington, Virginia. Available from: https://doi.org/10.48550/arXiv.2409.09090.

Lee, T.B. and Grimmelmann, J., 2024. Why The New York Times might win its copyright lawsuit against OpenAI. Ars Technica. Available from: https://arstechnica.com/tech-policy/2024/02/why-the-new-york-times-might-win-its-copyright-lawsuit-against-openai/.

Leslie, D., 2025. From Future Shock to the Vico Effect: Generative AI and the Return of History. Harvard Data Science Review, (Special Issue 5). Https://hdsr.mitpress.mit.edu/pub/bcp7n3bs. DOI: https://doi.org/10.1162/99608f92.e6f531e6

Levchenko, M.A., 2025. Evaluating LLMs for Historical Document OCR: A Methodological Framework for Digital Humanities. In: I.N. Arachchige, F. Frontini, R. Mitkov and P. Rayson, eds. Proceedings of the First Workshop on Natural Language Processing and Language Models for Digital Humanities. Varna, Bulgaria: INCOMA Ltd., Shoumen, Bulgaria, pp.75–85. Available from: https://aclanthology.org/2025.lm4dh-1.7/. DOI: https://doi.org/10.26615/978-954-452-106-6-007

Liang, W., Yuksekgonul, M., Mao, Y., Wu, E. and Zou, J., 2023. GPT detectors are biased against non-native English writers. Patterns, 4(7), p.100779. Available from: https://doi.org/10.1016/j.patter.2023.100779. DOI: https://doi.org/10.1016/j.patter.2023.100779

Lye, C.Y., 2025. Towards AI-Resilient Assessment: Applying Design Thinking in Assessment Redesign. SIG-AILTA. Available from: https://sigailta.com/2025/09/09/towards-ai-resilient-assessment-applying-design-thinking-in-assessment-redesign/.

Morena, D., 2018. IRIS+ Part One: Designing + Coding a Museum AI. American Alliance of Museums. Available from: https://www.aam-us.org/2018/06/12/iris-part-one-designing-coding-a-museum-ai/.

Paoli, N., 2025. Deloitte was caught using AI in $290,000 report to help the Australian government crack down on welfare after a researcher flagged hallucinations. Fortune. 7 October 2025. Available from: https://fortune.com/2025/10/07/deloitte-ai-australia-government-report-hallucinations-technology-290000-refund/.

Rao, N. and O’Riordan, S., 2024. Increasing Accessibility of Audiovisual Content Using Whisper. (A LYRASIS Catalyst Fund Research Report). LYRASIS. Available from: https://doi.org/10.48609/na33-1y19.

Royal Historical Society, 2025. Generative AI, History and Historians: A Reading Guide. Historical Transactions (Royal Historical Society). Available from: https://blog.royalhistsoc.org/2025/10/02/generative-ai-history-and-historians-a-reading-guide/.

Sakai, Y., Kamigaito, H. and Watanabe, T., 2026. HalluCitation Matters: Revealing the Impact of Hallucinated References with 300 Hallucinated Papers in ACL Conferences. Available from: https://doi.org/10.48550/arXiv.2601.18724.

SEGD-Society for Experiential Graphic Design, 2023. MIT Museum. Available from: https://segd.org/projects/mit-museum/.

Shaffi, S., 2023. ‘It’s the opposite of art’: why illustrators are furious about AI. The Guardian, 23 January 2023. Available from: https://www.theguardian.com/technology/2023/jan/23/ai-generated-art-future-museums.

The Metropolitan Museum of Art, 2026. Open Access at The Met. Available from: https://www.metmuseum.org/about-the-met/policies-and-documents/open-access.

The New York Times Company, 2023. The New York Times Company Plaintiff v. Microsoft Corporation, OpenAI, Inc., OpenAI LP, OpenAI GP LLC, OpenAI LLC, OpenAI OpCo LLC, OpenAI Global LLC, OAI Corporation, LLC, and OpenAI Holdings, LLC, Defendants. U.S. District Court, Southern District of New York, No. 1:23-cv-11195-SHS. Complaint filed 27 December 2023; First Amended Complaint 12 August 2024. Available from: https://nytco-assets.nytimes.com/2023/12/NYT_Complaint_Dec2023.pdf.

Trowbridge, D., 2024. “Historians On”: AI in Teaching and Research. AHA Podcast, recorded at the 2024 AHA Annual Meeting, San Francisco. Available from: https://www.historians.org/podcast/historians-on-ai-in-teaching-and-research/.

Valleriani, M. and Gruber, D., 2025. Artificial Intelligence (AI) and Historical Research (Special Issue). Histories. Available from: https://www.mdpi.com/journal/histories/special_issues/S5JI978200.

Walters, W.H. and Wilder, E.I., 2023. Fabrication and errors in the bibliographic citations generated by ChatGPT. Scientific Reports, 13(1), p.14045. Available from: https://doi.org/10.1038/s41598-023-41032-5. DOI: https://doi.org/10.1038/s41598-023-41032-5

Wikipedia, 2026. WikiProject AI Cleanup. Available from: https://en.wikipedia.org/wiki/Wikipedia:WikiProject_AI_Cleanup.

Wollin-Giering, S., Hoffmann, M., Höfting, J. and Ventzke, C., 2024. Automatic Transcription of English and German Qualitative Interviews. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 25(1). Available from: https://doi.org/10.17169/fqs-25.1.4129.

Xu, Z., Qiu, Y., Sun, L., Miao, F., Wu, F., Li, X., Wang, X., Lu, H., Zhang, Z., Hu, Y., Li, J., Jin, L., Zhang, F., Luo, R., Liu, X., Li, Y. and Liu, J., 2026. GhostCite: A Large-Scale Analysis of Citation Validity in the Age of Large Language Models. Available from: https://doi.org/10.48550/arXiv.2602.06718.

Yale Poorvu Center for Teaching and Learning, 2026. AI-Resilient Assessment. Yale University. Available from: https://poorvucenter.yale.edu/teaching/teaching-resource-library/ai-guidance-for-teachers/ai-course-assignment-design/resilient.