Show simple item record

dc.contributor.authorLowagie, Hannes
dc.contributor.authorVan Woensel, Julie
dc.date2024-10-07
dc.date.accessioned2024-10-08T12:50:21Z
dc.date.available2024-10-08T12:50:21Z
dc.identifier.urihttps://orfeo.belnet.be/handle/internal/13459
dc.descriptionThis paper details the National Library of Belgium’s (KBR) exploration of automating the subject indexing process for their extensive collection using Python scripts. The initial exploration involved creating a reference dataset and automating the classification process using MARCXML files. The focus is on demonstrating the practicality, adaptability, and user-friendliness of the Python-based solution. The authors introduce their unique approach, emphasizing the semantically significant words in subject determination. The paper outlines the Python workflow, from creating the reference dataset to generating enriched bibliographic records. Criteria for an optimal workflow, including ease of creation and maintenance of the dataset, transparency, and correctness of suggestions, are discussed. The paper highlights the promising results of the Python-powered approach, showcasing two specific scripts that create a reference dataset and automate subject indexing. The flexibility and user-friendliness of the Python solution are emphasized, making it a compelling choice for libraries seeking efficient and maintainable solutions for subject indexing projects.en_US
dc.languageengen_US
dc.titleSimplifying Subject Indexing: A Python-Powered Approach in KBR, the National Library of Belgiumen_US
dc.typeArticleen_US
dc.subject.frascatiComputer and information sciencesen_US
dc.audienceScientificen_US
dc.subject.freePythonen_US
dc.subject.freeSubject Indexingen_US
dc.subject.freeCataloguingen_US
dc.source.titleCode4lib Journalen_US
dc.source.issue59en_US
Orfeo.peerreviewedYesen_US


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record