Medizinische Universität Graz - Research portal

Logo MUG Resarch Portal

Selected Publication:

SHR Neuro Cancer Cardio Lipid Metab Microb

Daumke, P; Marku, K; Poprat, M; Schulz, S; Klar, R.
Biomedical information retrieval across languages.
Med Inform Internet Med. 2007; 32(2):131-147 Doi: 10.1080/14639230701197587
Web of Science PubMed FullText FullText_MUG

 

Co-authors Med Uni Graz
Schulz Stefan
Altmetrics:

Dimensions Citations:

Plum Analytics:

Scite (citation analytics):

Abstract:
This work presents a new dictionary-based approach to biomedical cross-language information retrieval (CLIR) that addresses many of the general and domain-specific challenges in current CLIR research. Our method is based on a multilingual lexicon that was generated partly manually and partly automatically, and currently covers six European languages. It contains morphologically meaningful word fragments, termed subwords. Using subwords instead of entire words significantly reduces the number of lexical entries necessary to sufficiently cover a specific language and domain. Mediation between queries and documents is based on these subwords as well as on lists of word-n-grams that are generated from large monolingual corpora and constitute possible translation units. The translations are then sent to a standard Internet search engine. This process makes our approach an effective tool for searching the biomedical content of the World Wide Web in different languages. We evaluate this approach using the OHSUMED corpus, a large medical document collection, within a cross-language retrieval setting.
Find related publications in this database (using NLM MeSH Indexing)
Algorithms -
Humans -
Information Storage and Retrieval - methods
Information Systems - organization and administration
Multilingualism -
Semantics -
Translating -

Find related publications in this database (Keywords)
medical Web search
cross-language information retrieval
query translation
multilingual lexicon
morphological analysis
© Med Uni GrazImprint