The ten steps of information integration
by Friedhelm Reydt.
Information integration usually consists of ten (abstracted) steps:
1. Job specification
For which object or process within an organisation is data completeness required?
Example: In order to give customers a price indication with the help of a product calculator, all the necessary data must be available. The job specification describes the information needs of the addressees as well as the target state to be achieved.
2. Data identification
If the data for the calculator is incomplete, the product will be offered on the market either too expensively or too cheaply; both can damage the company's market positioning. Without going into the method of recursive data identification here, this phase covers the complete localisation of all data sources that exist across the organisation's different areas of activity and are needed to fulfil the information requirements of the addressed target group. Both formal and informal sources come into question, technical as well as non-technical, and technical data can be structured or unstructured. The goal of data identification is information completeness. All identified sources are recorded in an information map.
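The information map can be pictured as a simple inventory of identified sources. A minimal sketch, with hypothetical source names, that also answers the follow-up question of which sources still need transformation:

```python
# A minimal sketch of an information map: each identified source is
# recorded with its kind and whether it is already structured, so
# completeness can be checked against the job specification.
# All source names here are hypothetical.
information_map = {
    "crm_db":         {"kind": "technical",     "structured": True},
    "price_list_xls": {"kind": "technical",     "structured": True},
    "sales_notes":    {"kind": "technical",     "structured": False},
    "expert_knowhow": {"kind": "non-technical", "structured": False},
}

def unstructured_sources(info_map):
    """Return the sources that still need transformation into structured data."""
    return sorted(name for name, meta in info_map.items()
                  if not meta["structured"])
```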
3. Data extraction
To ensure information completeness, data from the identified technical source systems is continuously passed on for data transformation. This should happen regularly and automatically. Non-technical and unstructured data (e.g. Word files from document management systems) is transformed into structured technical data.
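Turning unstructured text into structured records might look like the following sketch. It assumes a hypothetical line format of "product: price" in the extracted document text; a real pipeline would read the source files themselves.

```python
import re

def extract_price_lines(raw_text):
    """Pull hypothetical 'product: price' lines out of unstructured
    text and return them as structured records."""
    records = []
    for line in raw_text.splitlines():
        m = re.match(r"\s*(?P<product>[\w ]+):\s*(?P<price>\d+(?:\.\d+)?)", line)
        if m:
            records.append({"product": m.group("product").strip(),
                            "price": float(m.group("price"))})
    return records
```

Lines that do not match the expected pattern are simply skipped, which is one reason extraction should run regularly rather than once.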
4. Data transformation
All data is converted into a common format and structure and collected in a temporary database.
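Mapping source-specific records onto one shared schema could be sketched like this. The source names and field names are hypothetical; the point is that every record leaves this step with the same keys.

```python
def to_common_format(record, source):
    """Map a source-specific record onto the shared schema.
    Source and field names are hypothetical."""
    if source == "crm":
        return {"customer": record["cust_name"],
                "item": record["svc"],
                "price": float(record["amount"])}
    if source == "shop":
        return {"customer": record["buyer"],
                "item": record["article"],
                "price": float(record["price_eur"])}
    raise ValueError(f"unknown source: {source}")

# The temporary database is stood in for by a plain list here.
staging = [to_common_format(r, s) for s, r in [
    ("crm",  {"cust_name": "Acme", "svc": "Support", "amount": "120"}),
    ("shop", {"buyer": "Acme", "article": "Licence", "price_eur": 300.0}),
]]
```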
5. Data cleansing
Inconsistent, duplicate and incomplete records are eliminated from the temporary database. A rule set is defined in advance for this purpose. Data cleansing can be automated, manual, or both.
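A two-rule version of such a rule set, as a sketch: drop records with missing values, then drop exact duplicates.

```python
def cleanse(rows):
    """Apply a minimal rule set to staging rows:
    rule 1 - drop rows with missing values,
    rule 2 - drop exact duplicates (first occurrence wins)."""
    seen, clean = set(), []
    for row in rows:
        if any(v in (None, "") for v in row.values()):
            continue                      # rule 1: missing value
        key = tuple(sorted(row.items()))
        if key in seen:
            continue                      # rule 2: duplicate
        seen.add(key)
        clean.append(row)
    return clean
```

Real rule sets also cover inconsistencies (e.g. negative prices), which fit the same pattern of one check per rule.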
6. Data reconciliation
To uphold the single-version-of-the-truth principle, semantic differences between data sources must be identified and eliminated. This is likewise done with the help of a rule set.
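Semantic reconciliation rules can be as simple as a synonym table that renames equivalent field names onto the agreed vocabulary. The terms below are illustrative:

```python
# Rule set: map source-specific terms onto the canonical vocabulary
# (single version of the truth). Terms are illustrative.
SYNONYMS = {"client": "customer", "buyer": "customer",
            "fee": "price", "cost": "price"}

def reconcile_keys(row):
    """Rename semantically equivalent field names to the canonical one."""
    return {SYNONYMS.get(k, k): v for k, v in row.items()}
```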
7. Data enrichment
To improve their quality and completeness, the data in our temporary database can be enriched or supplemented with additional information (metadata).
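A common form of enrichment is attaching provenance metadata, i.e. where a record came from and when it was loaded. A sketch with hypothetical field names:

```python
from datetime import datetime, timezone

def enrich(row, source):
    """Attach provenance metadata without touching the payload.
    The underscore-prefixed field names are a hypothetical convention."""
    out = dict(row)
    out["_source"] = source
    out["_loaded_at"] = datetime.now(timezone.utc).isoformat()
    return out
```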
8. Data storage
Once the data in our temporary database has been completely processed, it is formally added to the central database (repository). With this step, the extraction, transformation and loading (ETL) process is considered complete.
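The load step can be sketched with an in-memory SQLite database standing in for the central repository; table and column names are hypothetical.

```python
import sqlite3

def load(rows):
    """Move cleansed staging rows into the central repository.
    An in-memory SQLite database stands in for it here."""
    repo = sqlite3.connect(":memory:")
    repo.execute("CREATE TABLE offers (customer TEXT, item TEXT, price REAL)")
    repo.executemany("INSERT INTO offers VALUES (:customer, :item, :price)", rows)
    repo.commit()
    return repo

repo = load([{"customer": "Acme", "item": "Support", "price": 120.0}])
```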
9. Data linking
The data in the central database can be used to meet the information needs of different target groups inside and outside the organisation. The data required for this purpose can be linked virtually, both logically and mathematically, and combined into target-group-specific information sets. In this way, new information is created from previously independent data that would not have been available without information integration.
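A sketch of such a linkage, with hypothetical data: a logical join of orders to customers, plus a mathematical derivation (a net price) that neither source held on its own.

```python
# Hypothetical independent data sets in the central repository.
customers = {"C1": {"name": "Acme", "discount": 0.10}}
orders = [{"customer_id": "C1", "item": "Licence", "price": 300.0}]

def link(orders, customers):
    """Logically join orders to customers and mathematically derive
    the net price - new information from previously independent data."""
    info_set = []
    for o in orders:
        c = customers[o["customer_id"]]
        info_set.append({"customer": c["name"],
                         "item": o["item"],
                         "net_price": round(o["price"] * (1 - c["discount"]), 2)})
    return info_set
```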
10. Data dissemination
Target-group-specific information sets are made available to downstream systems via standard interfaces: they are published at a gateway, and the target system retrieves them from there. Examples of such systems are a mobile app, a web frontend acting as a configurator that needs daily updated data to calculate offers, or a simple database. Importantly, data dissemination does not preclude writing data back to the original source systems.
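One common standard interface is a JSON payload that a configurator frontend fetches from the gateway. A minimal sketch; the envelope fields are hypothetical:

```python
import json

def publish(info_set):
    """Serialise a target-group-specific information set as JSON,
    the kind of payload a configurator frontend would fetch.
    The 'version' envelope field is a hypothetical convention."""
    return json.dumps({"version": 1, "records": info_set}, sort_keys=True)
```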
To be continued.
Excursus
Data models vs. information sets
The information systems of the departments are based on specific data models, which can consist of components such as customer name, service item and the corresponding price for an invoice. Such information systems may or may not communicate with each other along the value chain.
If process-relevant systems do not communicate with each other, even though the data generated in system A is needed in the operational context of system B, a media break occurs within the digital value chain, which in the worst case remains undetected or must be remedied with the help of a manual process step. A media break always indicates a possible source of error that can lead to data being falsified.
To resolve the dilemma of insufficient data reconciliation, tools such as Information Integration Platforms are used: they map higher-level information sets and clean up missing, erroneous or redundant data. An information set can therefore be composed of attributes drawn from different data models.
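The distinction can be made concrete in a short sketch: two department-specific data models (field names hypothetical) contribute attributes to one higher-level information set.

```python
def information_set(invoice, crm_contact):
    """Compose one information set from attributes of two department
    data models; all field names here are hypothetical."""
    return {
        "customer": crm_contact["company"],   # from the CRM data model
        "service":  invoice["service_item"],  # from the billing data model
        "price":    invoice["price"],         # from the billing data model
    }
```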