Karolinska Institutet
Browse

Multilingual query expansion in the SveMed plus bibliographic database: A case study

Download (943.42 kB)
journal contribution
posted on 2024-10-25, 12:34 authored by Ylva GavelYlva Gavel, Per-Olov Andersson
SveMed+ is a bibliographic database covering Scandinavian medical journals. It is produced by the University Library of Karolinska Institutet in Sweden. The bibliographic references are indexed with terms from the Medical Subject Headings (MeSH) thesaurus. The MeSH has been translated into several languages, including Swedish, making it suitable as the basis for multilingual tools in the medical field. The data structure of SveMed+ closely mimics that of PubMed/MEDLINE. Users of PubMed/MEDLINE and similar databases typically expect retrieval features that are not readily available off-the-shelf. The SveMed+ interface is based on a free text search engine (Solr) and a relational database management system (Microsoft SQL Server) containing the bibliographic database and a multilingual thesaurus database. The thesaurus database contains medical terms in three different languages and information about relationships between the terms. A combined approach involving the Solr free text index, the bibliographic database and the thesaurus database allowed the implementation of functionality such as automatic multilingual query expansion, faceting and hierarchical explode searches. The present paper describes how this was done in practice.

History

File version

  • Accepted manuscript

Publication status

Published

Sub type

Article

Journal

JOURNAL OF INFORMATION SCIENCE

ISSN

0165-5515

eISSN

1741-6485

Volume

40

Issue

3

Language

  • eng

Usage metrics

    Articles

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC