High-quality, rich and meaningful data are crucial to successfully implementing Artificial Intelligence (AI) and Big Data Analytics (BDA) solutions. Delivering required data to feed into AI and BDA models is costly, difficult, and often limited in data and skill availability. It is well known that up to 80% of the effort spent in AI and BDA projects is dedicated to ensuring data is fit for purpose. Activities are required to discover, understand, select, clean, transform, and integrate data from a variety of sources in such a way that data can be fed into the modelling phase. Such activities result in enriched data, eventually improving the quality of downstream BDA and AI applications. The data enrichment process is implemented by specifying, deploying, and executing data enrichment pipelines over data that can be structured, semi-structured and unstructured, in large amounts, and from static or streaming sources. While techniques exist to cover different enrichment operations such as data cleaning, linking, feature extraction, classification and semantic annotation, etc., the lack of comprehensive approaches and established tools dedicated to data enrichment makes the definition, implementation, and operation of enrichment pipelines difficult for too many organizations willing to improve their BDA and AI applications.
The overall vision of the enRichMyData project is to create a novel paradigm for building rich, high-quality, valuable, and FAIR-compliant datasets to feed downstream BDA and AI applications in the context of data-sharing ecosystems, such as data spaces. The paradigm facilitates the specification and execution of data enrichment pipelines, focusing on supporting various data enrichment operations. enRichMyData makes this easily accessible to a wide set of large and small organizations that encounter difficulties in delivering suitable data to feed their BDA and AI solutions due to the lack of usable tools/expertise for the cost-effective management of data enrichment pipelines.
News & Events
enRichMyData, INTEND, and Dynabic Projects to Host “Generative AI for Data and Knowledge Engineering” Session at BDVAF 2024
The enRichMyData project, in collaboration with the INTEND and Dynabic projects, is set to host an insightful session titled “Generative AI for Data and Knowledge Engineering” at the Big Data Value Association Forum (BDVAF) 2024 in Budapest, taking place from 2-4 October. This session will explore the transformative potential of generative AI, particularly large foundational
EnRichMyData Project Convenes in Madrid for Plenary Meeting
The enRichMyData project partners gathered in Madrid for a 3-day highly productive plenary meeting, marking a significant milestone in the collaborative effort to enhance data integration and utilization tools. They discussed the integration of tools from the enRichMyData toolbox and synchronization with the business case release plans. One of the primary focuses of the meeting
enRichMyData presented a tutorial at the Semantic Web Conference
A tutorial about the semantic data enrichment was presented as part of the tutorial programme of the Fabrics of Knowledge: Knowledge Graphs and Generative AI Conference, on 26-30 May 2024, in Hersonissos, Crete, Greece. This tutorial introduces the topic of semantic data enrichment, covering theoretical and practical considerations. In particular, it explains the role that
International Data Science Summer School 2024
enRichMydata is collaborating with DataCloud, Graph-Massivizer Project, UPCAST, and InterTwino projects as well as with GATE Institute and the Academia de Studii Economice din București in the organization of the 5th edition of the Data Science International Summer School in Predeal, Romania between 20-28 July 2024. This intensive learning experience with lots of networking is managed
Navigating the Future of Data Monetisation
enRichMyData project was presented in the Data Monetization session on Day 1 at the Data Spaces Symposium 2024. The symposium took place on March 12-14, 2024, at Darmstadtium (Darmstadt, Germany), Frankfurt Region. Organized by the Data Spaces Support Centre (DSSC) and the Data Spaces Business Alliance (DSBA), this symposium is a pivotal gathering for those
Data Monetisation Session at Data Space Symposium 2024
The Data Monetization session, organised by UPCAST, enRichMyData and Graph-Massivizer Projects, is part of Day 1 at the Data Spaces Symposium 2024. The symposium will take place March 12-14, 2024, in Darmstadt, Germany. The session will focus on how organisations can extract value from their data in the evolving digital landscape. Various approaches, leveraging advanced
Social Media
Consortium
The enRichMyData project is coordinated by SINTEF (Norway), one of European’s largest independent research organisations. The project partners include companies such as Philips (The Netherlands) and Bosch (Germany), dedicated to engineering and manufacturing; Speed Network (Estonia), a provider of procurement data; JOT Internet Media (Spain), a digital marketing company; CS Group (Romania), a software service company; Expert AI (Italy), a technology company specializing in natural language understanding; and Ontotext (Bulgaria), a semantic technology company. They will have the full support of the research partners that, in addition to SINTEF, include the University of Milano Bicocca (Italy), Jozef Stefan Institute (Slovenia), University of Copenhagen (Denmark), GATE Institute (Bulgaria), and BGRIMM Technology Group (China).