High-quality, rich and meaningful data are crucial to successfully implementing Artificial Intelligence (AI) and Big Data Analytics (BDA) solutions. Delivering required data to feed into AI and BDA models is costly, difficult, and often limited in data and skill availability. It is well known that up to 80% of the effort spent in AI and BDA projects is dedicated to ensuring data is fit for purpose. Activities are required to discover, understand, select, clean, transform, and integrate data from a variety of sources in such a way that data can be fed into the modelling phase. Such activities result in enriched data, eventually improving the quality of downstream BDA and AI applications. The data enrichment process is implemented by specifying, deploying, and executing data enrichment pipelines over data that can be structured, semi-structured and unstructured, in large amounts, and from static or streaming sources. While techniques exist to cover different enrichment operations such as data cleaning, linking, feature extraction, classification and semantic annotation, etc., the lack of comprehensive approaches and established tools dedicated to data enrichment makes the definition, implementation, and operation of enrichment pipelines difficult for too many organizations willing to improve their BDA and AI applications.
The overall vision of the enRichMyData project is to create a novel paradigm for building rich, high-quality, valuable, and FAIR-compliant datasets to feed downstream BDA and AI applications in the context of data-sharing ecosystems, such as data spaces. The paradigm facilitates the specification and execution of data enrichment pipelines, focusing on supporting various data enrichment operations. enRichMyData makes this easily accessible to a wide set of large and small organizations that encounter difficulties in delivering suitable data to feed their BDA and AI solutions due to the lack of usable tools/expertise for the cost-effective management of data enrichment pipelines.
News & Events

From Development to Deployment: enRichMyData Plenary in Naples
The enRichMyData consortium gathered on 22-23 May 2025, in Naples, Italy for a focused and collaborative plenary meeting, hosted by our partners at EXPERT.AI. Building on the discussions from the previous plenary, partners continued to advance key topics, including: Ongoing demonstrations and testing of the enRichMyData tools and Business Case (BC) pipelines Refinement of strategies

enRichMyData at DataWeek 2025: Showcasing Tools for Scalable Data Enrichment
The enRichMyData project will host an interactive session at DataWeek 2025, taking place in Athens, Greece, on May 28, 2025, from 13:30 to 14:30 in Room 1 (Σ010). The session, titled “Data Enrichment for Industry: Unlocking the Potential of Reference Data,” will provide participants with a hands-on opportunity to explore cutting-edge tools that simplify and

enRichMyData Co-Organises the 6th Edition of the Data Science and AI International Summer School
The enRichMyData project is proud to announce its role as a co-organiser of the upcoming 6th edition of the Data Science and AI International Summer School, which will take place in the scenic mountain town of Predeal, Romania, from July 19 to 27, 2025. Hosted by Bucharest Business School (BBS @ ASE) and supported by

enRichMyData Partners Collaborate in Eindhoven to Drive Project Forward
The enRichMyData partners convened at the Philips offices in Eindhoven, the Netherlands, for an engaging plenary meeting. The focus of the gathering was to test the advanced tools and services developed within the project and to chart the course for upcoming activities, including plans for the exploitation of the enRichMyData toolbox. With just eight months

enRichMyData at ISWC 2024: Driving Innovation in Semantic Technologies
The 23rd International Semantic Web Conference (ISWC) once again highlighted its role as the premier global venue for advancing semantic web and knowledge graph technologies. These innovations, essential for fostering interoperability and streamlining data enrichment, are closely aligned with the mission of the enRichMyData project. Naturally, our teams played a prominent role in this year’s

enRichMyData Toolbox Version 2 Released
The enRichMyData Toolbox Version 2 is designed to handle even the most complex data enrichment scenarios. Useful for data scientists or engineers, this open-source toolbox provides everything needed to design, execute, and optimize data enrichment pipelines with ease. The toolbox brings together a collection of interoperable tools and services that can be seamlessly combined and
Social Media
Consortium
The enRichMyData project is coordinated by SINTEF (Norway), one of European’s largest independent research organisations. The project partners include companies such as Philips (The Netherlands) and Bosch (Germany), dedicated to engineering and manufacturing; Speed Network (Estonia), a provider of procurement data; JOT Internet Media (Spain), a digital marketing company; CS Group (Romania), a software service company; Expert AI (Italy), a technology company specializing in natural language understanding; and Ontotext (Bulgaria), a semantic technology company. They will have the full support of the research partners that, in addition to SINTEF, include the University of Milano Bicocca (Italy), Jozef Stefan Institute (Slovenia), University of Copenhagen (Denmark), GATE Institute (Bulgaria), and BGRIMM Technology Group (China).