Data Engineering for Large Language Models
I am an active researcher in NLP (natural language processing), GIR (geographic information retrieval), QA (question answering), semantic information retrieval, and AI (artificial intelligence). Specialties: Bayesian networks and influence diagrams, Rete and production rules, semantic networks; neural networks; semantic tagging and annotation; metonymy and idiomatic expressions; parallel macros, SPLASH-2 benchmarks
FernUniversität in Hagen
Dr. rer. nat., Computer Science
January 1, 1999 – January 1, 2006
Carl von Ossietzky University of Oldenburg
Dipl. Inform., Informatics, Psychology
January 1, 1991 – January 1, 1999
Wing Operations Centre HTG64 Ahlhorn
1. Flugbetriebsspezialist (Flight Operations Specialist)
January 1, 1990 – January 1, 1991
Gymnasium Marianum Meppen
Abitur, English/Mathematics
January 1, 1981 – January 1, 1990
Fraunhofer IAIS
Team Lead Data Engineering
March 1, 2023 – Present
St Augustin · Hybrid
Norcom
Team Lead Data Science
July 1, 2022 – Present
Munich, Bavaria, Germany
Drooms
Team Lead Machine Learning
June 1, 2019 – July 1, 2022
Frankfurt Rhine-Main Metropolitan Area
Wayfair
Senior Manager Product International
October 1, 2017 – February 1, 2018
Berlin Metropolitan Area
Teckro
VP Machine Learning
December 1, 2016 – August 1, 2017
Limerick Metropolitan Area
Elsevier
Technology Research Director at Elsevier Labs
July 1, 2016 – November 1, 2016
The Randstad, Netherlands
Elsevier
Lead Data Scientist
January 1, 2016 – July 1, 2016
The Randstad, Netherlands
Elsevier
NLP Expert
December 1, 2014 – December 1, 2015
The Randstad, Netherlands
Dublin City University/CNGL
Research Fellow
January 1, 2012 – December 1, 2014
Dublin City University/CNGL
Research Scientist
October 1, 2008 – February 1, 2012
FernUniversität in Hagen
Research Assistant (PostDoc)
January 1, 2006 – September 1, 2008
Hagen, North Rhine-Westphalia, Germany
OFFIS
Scientific Assistant
December 1, 1995 – March 1, 1997
Oldenburg, Germany
FernUniversität in Hagen
Teaching Assistant
April 1, 1995 – September 1, 1999
Hagen, Germany
SuPRiM - Supply chain Risk and Performance Management
November 1, 2014 – Present
Enterprise Ireland funded industry collaboration to assess potential risks in the supply chain.
Uonevu
November 1, 2013 – Present
Uonevu is a cloud-based platform that detects offensive language and negative stereotyping and blocks cases of cyberbullying in social media messages. Stereotypes are collected via crowdsourcing (https://cyberbullying.herokuapp.com/). Media coverage:
- http://www.irishexaminer.com/ireland/people-power-platform-to-block-cyberbullying-269468.html
- http://www.irishexaminer.com/ireland/dcu-seeks-publics-help-to-tackle-subtle-cyberbullying-285366.html
- http://www.thesundaytimes.co.uk/sto/news/ireland/article1409522.ece?CMP=OTH-gnws-standard-2014_05_10
- http://www.thecollegeview.com/2014/10/01/dcu-researchers-find-new-methods-to-tackle-subtle-cyberbullying/
Ngramatic
March 1, 2013 – Present
Ngramatic is a commercial project from CNGL @ DCU providing fine-grained, scalable and multilingual content classification and summarisation services to a range of sectors (e.g. Finance, Security, Software) via a cloud-based application.
Critical Data Auditor
January 1, 2013 – December 1, 2013
Enterprise Ireland funded innovation partnership project to investigate means to improve analysis, clustering, and visualization of potentially sensitive information in large document repositories.
CNGL
November 1, 2012 – Present
Centre for Global Intelligent Content
Labjam
August 1, 2011 – Present
Labjam is a research management platform facilitating assessment and reporting, performance benchmarking, reduction of administrative overhead, and enhanced visibility of research activities and outputs. Labjam supports:
- tracking and reporting of scientific, social and economic impact factors.
- benchmarking of research groups against comparable international peers.
- reducing the administrative overhead and red tape associated with large-scale collaborative R&D.
- enhancing visibility into the activities of research colleagues and students.
Khresmoi
January 1, 2010 – January 1, 2014
A multilingual, multimodal search and access system for biomedical information and documents. The system allows access to biomedical data:
- from many sources,
- analyzing and indexing multi-dimensional (2D, 3D) medical images,
- with improved search capabilities due to the integration of technologies to link the texts and images to facts in a knowledge base,
- in a multilingual environment,
- providing trustable results at a level of understandability adapted to the users.
KHRESMOI combined multiple data sources and knowledge derived from various heterogeneous knowledge sources. This includes text sources such as online journals and books, and trusted websites; and image sources, including images from journals and images from Picture Archiving and Communication Systems (PACS) at radiology departments.
CNGL
October 1, 2008 – Present
Centre for Next Generation Localisation
Verified Peer Reviewer
Publons
June 24, 2026 – Present
Cultural Fit Analysis
The candidate has a strong background in both academic research and industry, working for various companies and universities. The project diversity, ranging from supply chain risk management to cyberbullying detection and biomedical information systems, indicates adaptability and a broad interest in applying NLP across different domains. The consistent focus on 'Language Processing' and 'Programming Languages' throughout their career aligns well with an NLP Engineer role. However, the lack of specific project technologies listed makes it difficult to assess the breadth of their practical toolset and how well it aligns with modern industry practices beyond the conceptual level. The long tenure in research and leadership roles suggests a preference for impactful, potentially long-term projects and a structured environment.
Soft Skills & Operational Fit
The candidate's extensive experience in team leadership roles (Team Lead Data Engineering, Team Lead Data Science, Team Lead Machine Learning, VP Machine Learning, Senior Manager Product International, Research Fellow, Research Scientist) suggests strong leadership, project management, and potentially good communication and collaboration skills. The involvement in various research projects and academic roles also indicates a strong problem-solving aptitude and a continuous learning mindset. However, without specific psychometric test results or interview data, a definitive assessment of stress handling or direct team collaboration style is not possible.