Responsible AI Practice in Libraries and Archives

A Review of the Literature

Authors

DOI:

https://doi.org/10.5860/ital.v43i3.17245

Keywords:

ethics, responsible ai, artificial intelligence

Abstract

Artificial intelligence (AI) has the potential to positively impact library and archives collections and services—enhancing reference, instruction, metadata creation, recommendations, and more. However, AI also has ethical implications. This paper presents an extensive literature and review analysis that examines AI projects implemented in library and archives settings, asking the following research questions: RQ1: How is artificial intelligence being used in libraries and archives practice? RQ2: What ethical concerns are being identified and addressed during AI implementation in libraries and archives? The results of this literature review show that AI implementation is growing in libraries and archives and that practitioners are using AI for increasingly varied purposes. We found that AI implementation was most common in large, academic libraries. Materials used in AI projects usually involved digitized and born digital text and images, though materials also ranged to include web archives, electronic theses and dissertations (ETDs), and maps. AI was most often used for metadata extraction and reference and research services. Just over half of the papers included in the literature review mentioned ethics or values related issues in their discussions of AI implementation in libraries and archives, and only one-third of all resources discussed ethical issues beyond technical issues of accuracy and human-in-the-loop. Case studies relating to AI in libraries and archives are on the rise, and we expect subsequent discussions of relevant ethics and values to follow suit, particularly growing in the areas of cost considerations, transparency, reliability, policy and guidelines, bias, social justice, user communities, privacy, consent, accessibility, and access. As AI comes into more common usage, it will benefit the library and archives professions to not only consider ethics when implementing local projects, but to publicly discuss these ethical considerations in shared documentation and publications.

Author Biographies

Sara Mannheimer, Montana State University

Sara Mannheimer is Data Librarian at Montana State University.

Natalie Bond, University of Montana

Natalie Bond is Head of Reference and Instruction and Government Information Librarian at University of Montana.

Scott W. H. Young, Montana State University

Scott W. H. Young is User Experience & Assessment Librarian at Montana State University.

Hannah Scates Kettler, Iowa State University

Hannah Scates Kettler is Associate University Librarian for Academic Services at Iowa State University.

Addison Marcus, Montana State University

Addison Marcus is Data Science Graduate Research Assistant at Montana State University.

Sally K. Slipher, Montana State University

Sally K. Slipher is Research Staff Statistician at Montana State University.

Jason A. Clark, Montana State University

Jason A. Clark is Head, Research Analytics, Optimization, and Data Services at Montana State University.

Yasmeen Shorish, James Madison University

Yasmeen Shorish is Director of Scholarly Communication Strategies at James Madison University.

Doralyn Rossmann, Montana State University

Doralyn Rossmann is Dean of the Library at Montana State University.

Bonnie Sheehey

Bonnie Sheehey is Assistant Professor of Philosophy at Montana State University.

References

Aaron Trehub and Ali Krzton, “Chapter 12. Using IBM Watson for Discovery and Research Support: A Library-Industry Partnership at Auburn University,” in The Rise of AI: Implications and Applications of Artificial Intelligence in Academic Libraries, ed. Sandy Hervieux and Amanda Wheatley, ACRL Publications in Librarianship 78 (Chicago: Association of College and Research Libraries, 2022).

American Council on Education, “Carnegie Classification of Institutions of Higher Education,” https://carnegieclassifications.acenet.edu/.

American Council on Education, “Size & Setting Carnegie Classification,” https://carnegieclassifications.acenet.edu/carnegie-classification/classification-methodology/size-setting-classification/.

Andromeda Yelton, “Chapter 2. HAMLET: Neural-Net-Powered Prototypes for Library Discovery,” Library Technology Reports 55, no. 1 (2018): 10–15.

Ben Lee, “Newspaper Navigator” (web page, Library of Congress Labs, 2020), https://web.archive.org/web/20240824031755/https://labs.loc.gov/work/experiments/newspaper-navigator/.

Benjamin Lee, “Compounded Mediation: A Data Archaeology of the Newspaper Navigator Dataset” (Knowledge Commons, 2020), https://doi.org/10.17613/k9gt-6685.

Bipasha Banerjee et al., “Applications of Data Analysis on Scholarly Long Documents,” in IEEE International Conference on Big Data (IEEE, 2022): 2473–81, https://doi.org/10.1109/BigData55660.2022.10020935.

Charlene Chou and Tony Chu, “An Analysis of BERT (NLP) for Assisted Subject Indexing for Project Gutenberg,” Cataloging & Classification Quarterly 60, no. 8 (2022): 807–35, https://doi.org/10.1080/01639374.2022.2138666.

Charlie Harper, Anne Kumer, Shelby Stuart, and Evan Meszaros, “Chapter 8. AI-Informed Approaches to Metadata Tagging for Improved Resource Discovery,” in The Rise of AI: Implications and Applications of Artificial Intelligence in Academic Libraries, ed. Sandy Hervieux and Amanda Wheatley, ACRL Publications in Librarianship 78 (Chicago: Association of College and Research Libraries, 2022).

Clara Turp and Lucy Kiester, “Ethics of AI Algorithms: Looking into PubMed’s ‘Best Match’ Algorithm” (presentation at Code4Lib, online, 2021), https://2021.code4lib.org/talks/Ethics-of-AI-algorithms-looking-into-PubMeds-Best-Match-Algorithm.

Eileen Jakeway, Lauren Algee, Laurie Allen, Meghan Ferriter, Jaime Mears, Abigail Potter, and Kate Zwaard, Machine Learning + Libraries Summit Event Summary (Library of Congress Labs – Digital Strategy Directorate, February 13, 2020), https://labs.loc.gov/static/labs/meta/ML-Event-Summary-Final-2020-02-13.pdf.

Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, and Chulwoo Pack, “Digital Libraries, Intelligent Data Analytics, and Augmented Description: A Demonstration Project” (DigitalCommons@University of Nebraska–Lincoln, 2020): 3.

Ellen Prokop, X. Y. Han, Vardan Papyan, David L. Donoho, and C. Richard Johnson, “AI and the Digitized Photoarchive: Promoting Access and Discoverability,” Art Documentation : Bulletin of the Art Libraries Society of North America 40, no. 1 (2021): 1–20, https://doi.org/10.1086/714604.

Emad Khazraee and Susan Winter, “Library Knowledge Extensions (KNEXT): Data Analytics to Support Innovation Communities,” IMLS grant proposal, 2017, http://www.imls.gov/grants/awarded/lg-71-17-0124-17.

J. Schneider, C. Adams, S. DeBauche, R. Echols, C. McKean, J. Moran, and D. Waugh, “Appraising, Processing, and Providing Access to Email in Contemporary Literary Archives,” Archives and Manuscripts 47, no. 3 (2019): 305–26, https://doi.org/10.1080/01576895.2019.1622138.

James Powell, Kari Sentz, Elizabeth Moyer, and Martin Klein, “Teaching AI When to Care about Gender,” The Code4Lib Journal, no. 54 (2022), https://journal.code4lib.org/articles/16718.

Jeremy Walker and Jason Coleman, “Using Machine Learning to Predict Chat Difficulty,” College & Research Libraries 82, no. 5 (2021): 683–707, https://doi.org/10.5860/crl.82.5.683.

Jim Hahn and Courtney McDonald, “Account-Based Recommenders in Open Discovery Environments,” Digital Library Perspectives 34, no. 1 (2018): 70–76, https://doi.org/10.1108/DLP-07-2017-0022.

Jim Hahn, “Semi-Automated Methods for BIBFRAME Work Entity Description,” Cataloging & Classification Quarterly 59, no. 8 (2021): 853–67, https://doi.org/10.1080/01639374.2021.2014011.

Jon W. Dunn, Ying Feng, Juliet L. Hardesty, Brian Wheeler, Maria Whitaker, Thomas Whittaker, Shawn Averkamp, et al., “Audiovisual Metadata Platform Pilot Development (AMPPD) Final Project Report” (IU ScholarWorks, 2021), https://hdl.handle.net/2022/26989.

Kathy Charmaz, Constructing Grounded Theory, 2nd ed., Introducing Qualitative Methods (London: Sage, 2014).

Kevin W. Walker and Zhehan Jiang, “Application of Adaptive Boosting (AdaBoost) in Demand-Driven Acquisition (DDA) Prediction: A Machine-Learning Approach,” Journal of Academic Librarianship 45, no. 3 (2019): 203–12, https://doi.org/10.1016/j.acalib.2019.02.013.

Kirsten Strigel Carter, Abby Gondek, William Underwood, Teddy Randby, and Richard Marciano, “Using AI and ML to Optimize Information Discovery in Under-Utilized, Holocaust-Related Records,” AI & Society 37, no. 3 (2022): 837–58, https://doi.org/10.1007/s00146-021-01368-w.

Klaus Krippendorff, Content Analysis: An Introduction to Its Methodology (SAGE Publications, 2012).

Marissa Friedman, Cameron Ford, Mary Elings Pi, Vijay Singh, and Tracey Tan, “Using AI/Machine Learning to Extract Data from Japanese American Confinement Records,” in IEEE International Conference on Big Data (IEEE, 2021): 2210–19, https://doi.org/10.1109/BigData52589.2021.9672076.

Matt Enis, “Library of Congress Trains Machine Learning with Crowdsourcing,” Library Journal 147, no. 2 (2022): 8–11.

Matthew Lincoln, Julia Corrin, Emily Davis, and Scott B. Weingart, “CAMPI: Computer-Aided Metadata Generation for Photo Archives Initiative” (preprint), Carnegie Mellon University, 2020, https://doi.org/10.1184/R1/12791807.v2.

Matthew Reidsma, “Auditing Algorithms in Commercial Discovery Tools” (presentation at Code4Lib, Washington DC, 2018), https://2018.code4lib.org/talks/auditing-algorithms-in-commercial-discovery-tools.

Matthew Short, “Text Mining and Subject Analysis for Fiction; or, Using Machine Learning and Information Extraction to Assign Subject Headings to Dime Novels,” Cataloging & Classification Quarterly 57, no. 5 (2019): 315–36, https://doi.org/10.1080/01639374.2019.1653413.

Michelle Ehrenpreis and J. DeLooper, “Implementing a Chatbot on a Library Website,” Journal of Web Librarianship 16, no. 2 (2022): 120–42, https://doi.org/10.1080/19322909.2022.2060893.

Peter Organisciak et al., “Examining Patterns of Text Reuse in Digitized Text Collections,” in ACM/IEEE Joint Conference on Digital Libraries (Champaign, 2019): 361–62, https://doi.org/10.1109/JCDL.2019.00071.

Richard Shrake and Scott Ziegler, “You’ll Probably Like This One, Too: Using Circulation Data to Automate Recommendations in a Special Collections Library” (presentation at Code4Lib, Los Angeles, 2017), https://2017.code4lib.org/talks/Youll-probably-like-this-one-too-Using-circulation-data-to-automate-recommendations-in-a-special-collections-library.

Ryan Schuerkamp, Jared Barrett, Amber Bales, Alia Wegner, and Philippe J. Giabbanelli, “Enabling New Interactions with Library Digital Collections: Automatic Gender Recognition in Historical Postcards via Deep Learning,” Journal of Academic Librarianship 49, no. 4 (2023): 102736, https://doi.org/10.1016/j.acalib.2023.102736.

Sara Mannheimer, Addison Marcus, Sally K. Slipher, Natalie Bond, Scott W. H. Young, Hannah Scates Kettler, Jason Clark, Yasmeen Shorish, Doralyn Rossmann, and Bonnie Sheehey, “Data for: Responsible AI Practice in Libraries and Archives: A Review of the Literature,” Qualitative Data Repository, QDR Main Collection (2024), https://doi.org/10.5064/F6QME490.

Sharesly Rodriguez and Christina Mune, “Uncoding Library Chatbots: Deploying a New Virtual Reference Tool at the San Jose State University Library,” Reference Services Review 50, no. 3/4 (2022): 392–405, https://doi.org/10.1108/RSR-05-2022-0020.

US Census Bureau, “Regions and Divisions,” updated October 8, 2021, https://www.census.gov/programs-surveys/economic-census/guidance-geographies/levels.html#par_textimage_34.

Downloads

Published

2024-09-23

How to Cite

Mannheimer, S., Bond, N., Young, S. W. H., Kettler, H. S., Marcus, A., Slipher, S. K., … Sheehey, B. (2024). Responsible AI Practice in Libraries and Archives: A Review of the Literature. Information Technology and Libraries, 43(3). https://doi.org/10.5860/ital.v43i3.17245

Issue

Section

Articles