Using Stanbol to enhance medical data by exploring Linked Data

Cytogenetics Lab, Institute of Mother and Child,  in cooperation with CompBio group at the University of Warsaw, presents a demo of a research & clinical tool IMID2py now empowered with Apache Stanbol.

Linked data from life sciences is a substantial part of the LOD cloud [1], it contains most interlinks. With the use of Apache Stanbol components: Entityhub, Contenthub, Enhancer, we proposed a solution which allows our users geneticists to:

  • annotate medical content (i.e. a result from genetic experiment) with relevant data
  • formulate various queries, use several provided enhancers, search indexed LOD entities with VIE auto-complete
  • search among tagged annotations, with facets provided by Stanbol Contenthub

Tree of enhancements
In our solution, user creates a tree of enhancements for her/his content: this is a small part of a LOD cloud which users find relevant.  A user is able to search for enhancements thanks to Entityhub with pre-indexed linked data from large open databases: UNIPROT, PubMed, eHealth.

Linked Open Data (LOD) exploration
Users explore Linked Data by asking semi-automatically generated queries, and by reviewing results returned by Stanbol Entityhub. We’ve provided several enhancers based on Stanbol Entityhub query language. Finding out from our users what is most useful, and providing them with useful tools is what we’re trying to achieve.

IMID2py at the Cytogenetic Lab
IMID2py is used at the cytogenetic lab to review aCGH (array comparative genomic hybridization) results from our patients. A geneticists job is to assist doctors in stating clinical diagnosis. However, since genetics is a very rapidly developing field, part of the job is to perform research on difficult, unknown, cases.

By allowing users to easily document their research path in the tree of enhancements, and later search among, and create reports of their findings, we try to enable reasonable use of constantly-growing Linked Data.

Further development will provide more enhancers and Enhancement Chains, abstraction of  available enhancers to facilitate more thorough, more automatic research and reporting.

Demo Quick Tour & Screencasts
You’ll find the demo quick tour on the demo site.

You can watch video screencasts from the demo here.

Using Apache Stanbol components

  • Entityhub provides indexed Linked Data, finds entities and performs more complicated queries.
  • Contenthub stores content enhancements, labels them, and provides faceted search.
  • VIE Autocomplete provides auto-complete search among indexed Linked Data.

Indexing Linked Data

  • For the purpose of this demo UNIPROT linked data was indexed using Apache Stanbol tools.
    • UNIPROT RDF release, which also contains following Linked Data: Gene Ontology terms, PubMed abstracts, GeneID references, and more.
    • UNIPROT linked data is easily accessible through Apache Stanbol Entityhub RESTful API
  • Another Linked Data set  available in the demo is eHealth Apache Stanbol demo set.

Python APIs to Stanbol RESTful services



Comments are closed.