Entity Extraction for Human Resource Management powered by Apache Stanbol

At the IKS Workshop in Salzburg in June 2012 we presented a short overview of the CELI solutions recently integrated into the Apache Stanbol framework. CELI is an Italian company that specializes in software for the automatic processing of electronic texts. CELI’s products and solutions emphasize language understanding and put recent research results into practice. The video runs about 15 minutes and covers the following topics:

CELI enhancement engines integrated in Apache Stanbol (web service based):

  • Language Identifier (bg, it, fr, da, de, en, es, pt, pl, hu, nl, sv, no, ro, ru)
  • Morphological Analysis (it, da, de, ro, ru, pl)
  • Named Entity Recognition (it, fr)
  • DBpedia type classifier (it, fr, en, de, es, nl, pl)

How to obtain a license for the CELI engines (research, commercial, trial, ...)
Entity Extraction for Human Resource Management: an experimental component for extracting semantic entities and concepts from CV documents. The engine can detect information about a candidate’s skills, education level, work experience and contact details.
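
As a rough illustration of how text is usually submitted to a Stanbol instance for enhancement, the sketch below builds a plain-text POST request against the Stanbol enhancer REST endpoint. It is not taken from the video. The base URL (a local install on port 8080), the helper names build_enhance_request and enhance, and the sample text are assumptions made for this example, and no particular chain name for the CELI engines is implied.

```python
from typing import Optional
from urllib.request import Request, urlopen

# Assumption: a Stanbol launcher running locally on its usual port.
STANBOL_BASE_URL = "http://localhost:8080"


def build_enhance_request(
    text: str,
    base_url: str = STANBOL_BASE_URL,
    chain: Optional[str] = None,
) -> Request:
    """Build (but do not send) a POST request for the Stanbol enhancer.

    With chain=None the default enhancement chain is addressed; otherwise
    the request targets /enhancer/chain/<chain>. Chain names depend on
    how the Stanbol instance is configured.
    """
    path = "/enhancer" if chain is None else f"/enhancer/chain/{chain}"
    return Request(
        url=base_url.rstrip("/") + path,
        data=text.encode("utf-8"),
        headers={
            "Content-Type": "text/plain; charset=utf-8",
            # Ask for the enhancement results serialized as Turtle.
            "Accept": "text/turtle",
        },
        method="POST",
    )


def enhance(text: str, base_url: str = STANBOL_BASE_URL,
            chain: Optional[str] = None) -> str:
    """Send the text to Stanbol and return the raw RDF response body."""
    with urlopen(build_enhance_request(text, base_url, chain)) as response:
        return response.read().decode("utf-8")
```

Calling enhance("Alessio lavora a Torino.") against a running instance would return the enhancement graph as Turtle. Which annotations appear in it depends on the engines enabled on that instance and on the license in use.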


We would like to thank the organizers for inviting us to present our solutions at the IKS workshop!

Author: ABosca

Alessio Bosca received the M.S. degree and the Ph.D. degree in computer science (Semantic Web Technologies) from the Polytechnic University, Torino, Italy, in 2002 and 2006 respectively. His research interests include Knowledge Representation Systems, Service Oriented Architectures and Human Language Technologies. During his postdoctoral work he collaborated on the "SPICE" EU project, working on the composition of Semantic Web Services starting from informal user requests expressed in natural language. In September 2007 he moved to the R&D department of CELI; since then he has been involved in several EU projects (CACAO, EuropeanaConnect, Organic.Lingua), where he developed technical expertise in the fields of Semantic Content Management, Search Engines, Digital Libraries and Cross-Language Information Retrieval.
