We consolidate data from legacy systems, SaaS platforms, and spreadsheets into a single validated master dataset using an ETL pipeline and a rules engine. Every step is traceable: logging, validation rules, and reports are built in. Compliance is demonstrable. Data is immediately available via APIs for webshops, dashboards, and partners.
Solution
ETL Process
Client
AkzoNobel

Data was spread across legacy systems, SaaS platforms, and spreadsheets. Manual entry and misaligned update cycles created inconsistencies and missing fields. It was unclear which data was current and which was complete. Without a single reliable source, meeting internal and regulatory requirements took more time than it should have. Audits required full traceability, but change history was not centrally available. Departments and regions maintained their own definitions and lists. Corrections in one system did not carry over to others. That eroded trust in reports and made it risky to push data to webshops, dashboards, and partners.
Data was spread across multiple systems in different formats, with inconsistencies and missing fields throughout. There was no single source of truth. Teams spent significant time on manual checks and coordination rather than on actual work.
Audits required full traceability, but change history was not centrally available. Departments and regions maintained their own definitions. Corrections did not automatically carry over to other systems.
Without validated master data, pushing information to webshops, dashboards, and external partners introduced unnecessary risk. What was needed: an approach that collects, validates, and consolidates data into reliable master data, with full traceability through logging and reports.
An ETL pipeline collects data from legacy systems, SaaS platforms, and spreadsheets and transforms it into one unified structure. The pipeline runs event-driven where possible, batch-based where needed.
The rules engine validates every record for completeness, consistency, and compliance with business and regulatory requirements. Only records that pass all checks move forward.
Accepted records form a single master dataset. Key matching and priority rules resolve duplicates and conflicts. Every item has one current, reliable representation.
Records that fail validation are not silently blocked. The pipeline generates a control report with the exact fields and root causes, along with an audit trail that covers data origin and change history. Teams can make targeted fixes without touching the rest of the chain.
The master dataset is accessible through well-documented APIs. Webshops and dashboards read directly from this source. Where relevant, we feed external applications as well, so everyone works from the same validated data.
Caching limits source load and reduces latency. Fallbacks prevent downtime when a source becomes unavailable. Partial runs keep the pipeline moving when part of the chain is temporarily offline. Monitoring and alerting continuously track performance and compliance.

The organization has one validated master dataset that meets business and regulatory requirements. Audit trails and control reports make compliance demonstrable at any time.

Teams no longer validate or correct data in separate files. Control reports show exactly where improvements are needed, so fixes are targeted and efficient.

Decisions are based on current, consistent data. Dashboards provide direct insight without delays from manual checks.

Validated data flows automatically via APIs to webshops, dashboards, and external applications. Everyone works from the same source of truth.

The ETL pipeline and rules engine are built to grow. Adding new systems or data streams does not require rebuilding what is already in place.


From fragmented, error-prone data to a reliable, compliant master dataset. The ETL pipeline and rules engine replaced manual work with automated validation and logging. Reports, web shops, and dashboards all draw from a single source of truth. The foundation is in place for more efficient processes and future expansion.