dc.contributor.author: Di Pretoro, Emmanuel
dc.contributor.author: De Roock, Edwin
dc.contributor.author: Fremout, Wim
dc.contributor.author: Buelinckx, Erik
dc.contributor.author: Buyle, Stephanie
dc.contributor.author: Van der Stede, Véronique
dc.date: 2021
dc.date.accessioned: 2021-06-15T10:20:32Z
dc.date.available: 2021-06-15T10:20:32Z
dc.identifier.citation: Emmanuel Di Pretoro, Edwin De Roock, Wim Fremout, Erik Buelinckx, Stephanie Buyle & Véronique Van der Stede, 'Optimizing Elasticsearch search experience using a thesaurus', in: Code4Lib Journal, 51 (2021-06-14), online, URL: https://journal.code4lib.org/articles/15749 (accessed 15/06/2021) [en_US]
dc.identifier.issn: 1940-5758
dc.identifier.uri: https://orfeo.belnet.be/handle/internal/7874
dc.description: The Belgian Art Links and Tools (BALaT) (http://balat.kikirpa.be/) is the continuously expanding online documentary platform of the Royal Institute for Cultural Heritage (KIK-IRPA), Brussels (Belgium). BALaT contains over 750,000 images from KIK-IRPA’s unique collection of photo negatives on the cultural heritage of Belgium, as well as the library catalogue, PDFs of articles from KIK-IRPA’s Bulletin and other publications, an extensive authority list of persons and institutions, and several specialized thematic websites. Each of these collections is multilingual, as Belgium has three official languages. All of them are interlinked to give the user easy access to freely available information on Belgian cultural heritage. In recent years, KIK-IRPA has been working on a detailed and inclusive data management plan. Through this plan, a new project, HESCIDA (Heritage Science Data Archive), will upgrade BALaT to BALaT+, enabling access to searchable registries of KIK-IRPA datasets and data interoperability. BALaT+ will be a building block of DIGILAB, one of the future pillars of the European Research Infrastructure for Heritage Science (E-RIHS), which will provide online access to scientific data concerning tangible heritage, following the FAIR principles (Findable, Accessible, Interoperable, Reusable). It will include and enable access to searchable registries of specialized digital resources (datasets, reference collections, thesauri, ontologies, etc.). In the context of this project, Elasticsearch has been chosen as the technology powering the search component of BALaT+. An essential requirement of the BALaT+ search functionality is support for linguistic equivalences: a query for a term in French should also return the results containing the equivalent term in Dutch. Another important feature is a mechanism to broaden the search to more specific terminology: a term like “furniture” could also match records containing chairs, tables, etc. This article explains how a thesaurus developed in-house at KIK-IRPA was used to obtain these functionalities, from the processing of that thesaurus to the production of the configuration needed by Elasticsearch. [en_US]
dc.language: eng [en_US]
dc.publisher: Code4Lib Journal [en_US]
dc.title: Optimizing Elasticsearch search experience using a thesaurus [en_US]
dc.type: Article [en_US]
dc.subject.frascati: Computer and information sciences [en_US]
dc.audience: Scientific [en_US]
dc.source.title: Code4Lib Journal [en_US]
Orfeo.peerreviewed: Yes [en_US]
dc.identifier.url: https://journal.code4lib.org


Files in this item

There are no files associated with this item.

