Selected Publication:
SHR
Neuro
Cancer
Cardio
Lipid
Metab
Microb
Daumke, P; Marku, K; Poprat, M; Schulz, S; Klar, R.
Biomedical information retrieval across languages.
Med Inform Internet Med. 2007; 32(2):131-147
Doi: 10.1080/14639230701197587
Web of Science
PubMed
FullText
FullText_MUG
- Co-authors Med Uni Graz
-
Schulz Stefan
- Altmetrics:
- Dimensions Citations:
- Plum Analytics:
- Scite (citation analytics):
- Abstract:
- This work presents a new dictionary-based approach to biomedical cross-language information retrieval (CLIR) that addresses many of the general and domain-specific challenges in current CLIR research. Our method is based on a multilingual lexicon that was generated partly manually and partly automatically, and currently covers six European languages. It contains morphologically meaningful word fragments, termed subwords. Using subwords instead of entire words significantly reduces the number of lexical entries necessary to sufficiently cover a specific language and domain. Mediation between queries and documents is based on these subwords as well as on lists of word-n-grams that are generated from large monolingual corpora and constitute possible translation units. The translations are then sent to a standard Internet search engine. This process makes our approach an effective tool for searching the biomedical content of the World Wide Web in different languages. We evaluate this approach using the OHSUMED corpus, a large medical document collection, within a cross-language retrieval setting.
- Find related publications in this database (using NLM MeSH Indexing)
-
Algorithms -
-
Humans -
-
Information Storage and Retrieval - methods
-
Information Systems - organization and administration
-
Multilingualism -
-
Semantics -
-
Translating -
- Find related publications in this database (Keywords)
-
medical Web search
-
cross-language information retrieval
-
query translation
-
multilingual lexicon
-
morphological analysis