GovCompass
Knowledge base

Art. 26.4 EU AI Act: input data quality for deployers

By Michel Venniker· · Aligned with the consolidated EU AI Act, including the 2026 Omnibus amendments.

Art. 26.4 requires deployers of high-risk AI to ensure that input data is relevant and sufficiently representative for the system's intended purpose. The deployer is responsible for data quality in operation, even though the provider sets the specifications under Art. 10.

Updated: June 2026

Introduction: data quality as compliance obligation

Art. 26.4 obliges deployers to "take appropriate technical and organisational measures to ensure that data used to operate and monitor high-risk AI systems is relevant, sufficiently complete, and fit for purpose, and has appropriate levels of accuracy, robustnessrobustnessA system's ability to perform reliably under realistic conditions including noise, edge cases and adversarial pressure — the engineering core of the safety-and-reliability principle.Open full entry →, and security."

This obligation acknowledges a fundamental truth about AI systems: garbage in, garbage out. An AI systemAI systemA machine-based system that, for explicit or implicit objectives, infers from input how to generate outputs — predictions, content, recommendations or decisions — that can influence physical or virtual environments. The OECD-style definition followed by the EU AI Act.Open full entry → operating on degraded, biased, or incomplete input data will produce degraded, biased, or incomplete outputs, regardless of how well the model was built. The deployerdeployerAn organization using an AI system under its own authority in its activities — carrying operator duties: use per instructions, oversight, input relevance, monitoring, notices.Open full entry → bears legal responsibility for ensuring the data environment the AI operates in meets quality standards.

What data does Art. 26.4 cover?

Art. 26.4 applies to all data that enters the AI system during deployment:

  • Operational input data: The data the system processes to produce outputs (e.g. applicant CVs in a screening system, transaction data in a fraud detection system)
  • Reference data: Baseline data the system compares against (e.g. historical customer profiles)
  • Feedback data: Data fed back to the system to improve or adjust performance during deployment
  • Monitoring data: Data used to assess system performance

Training data quality is primarily a providerproviderThe actor who develops an AI system (or has it developed) and places it on the market or into service under its own name — carrying manufacturer-style duties: design controls, documentation, conformity.Open full entry → obligation under Art. 10. However, if a deployer fine-tunes or retrains a model, they assume provider-like obligations for that data.

The four quality dimensions

1. relevance

Input data must be relevant to the decision the AI system is making. Using demographic proxies that have no causal relationship to the predicted outcome is a relevance failure, and may also implicate GDPR data minimisationdata minimisationProcessing only data that is adequate, relevant and necessary — in ML, implemented through pseudonymisation, feature selection, synthetic data and privacy-enhancing techniques.Open full entry → requirements.

2. completeness

Data must be sufficiently complete for the system to make valid predictions. Many AI systems perform significantly worse on incomplete data, but do not always signal this clearly. Deployers must understand their system's minimum data requirements (documented in the provider's instructions) and have processes to handle incomplete data submissions.

3. accuracy and robustness

Data must accurately represent the real-world situation. Stale data, data entry errors, and data transformation errors all degrade accuracy. For high-risk AI, deployers should have input validation processes that catch common data quality errors before they reach the model.

4. security

Input data must be protected against unauthorised access and manipulation. For high-risk AI systems, data poisoningdata poisoningAn attack that corrupts training data so the model learns attacker-chosen behaviour; a core adversarial-ML threat to the data pipeline.Open full entry →, the deliberate injection of manipulated data to influence AI outputs, is a significant security threat that deployers must address in their information security framework.

Practical measures

  • Automated data validation rules that flag incomplete or out-of-range inputs
  • Regular data quality audits comparing input distributions against expected baselines
  • Data quality SLAs in supplier contracts for data that comes from external sources
  • Drift detection: monitoring whether input data distributions change over time in ways that may degrade model performance
  • Documentation of data quality incidents and remediation actions

Compliance checklist

  1. Have you documented the minimum data quality requirements for each high-risk AI system (from provider instructions)?
  2. Is there an automated validation process for input data before it reaches the AI system?
  3. Is there a process for handling incomplete or low-quality input data?
  4. Are data quality incidents logged and tracked?
  5. Is input data security addressed in your information security framework?
  6. Do you monitor for data drift that could affect AI system performance?
Legal referencesArt. 26

More on Privacy

Art. 10 EU AI Act: data and data governance for high-risk AI

Reference

Art. 10 requires that the training, validation, and testing data for high-risk AI systems meets quality criteria: relevant, sufficiently representative, and as free of errors and complete as possible for the intended purpose. It also requires documented data governance practices covering collection, preparation, bias examination, and gap mitigation, and it permits the limited processing of special-category data where strictly necessary to detect and correct bias, under safeguards.

Art. 26.9 EU AI Act: DPIA obligation for high-risk AI

Reference

Art. 26.9 links the EU AI Act to the GDPR: where a data protection impact assessment (DPIA) is required under GDPR Art. 35, deployers of high-risk AI must use the information from the provider's documentation to support that assessment.

Control-level compliance: the EU AI Act as an instrumented system

Analysis

Control-level compliance means satisfying the EU AI Act through engineered, evidenced controls rather than policy documents. The technical articles translate directly into system controls: immutable logs (Art. 12, 19), a kill switch (Art. 14(4)(e)), data masking before the model (Art. 10), configurable block policies (Art. 26), risk scoring and incident reporting within deadline (Art. 9, 73), and workspace isolation with role-based access (Art. 14, 26). Compliance at this level is an instrumented system, not a policy as PDF.

EU AI Act and GDPR: how do the two regulations relate?

Guide

The EU AI Act and the GDPR create overlapping but distinct obligations for AI systems that process personal data. They align on data quality, impact assessments, transparency, and individual rights, but differ in scope, accountability roles, and incident-reporting timelines, so the efficient approach is integrated compliance, such as a combined DPIA/FRIA.

More on Safety & reliability

Art. 14 EU AI Act: designing high-risk AI for human oversight

Reference

Art. 14 requires providers to design and build high-risk AI systems so that they can be effectively overseen by humans during use. The system must let an overseer understand its capabilities and limits, watch for anomalies, resist automation bias, correctly interpret outputs, decide not to use the system, and intervene or stop it through a kill switch (Art. 14(4)(e)). It is the design obligation that makes the deployer oversight duty of Art. 26.2 possible.

Art. 26.5 EU AI Act: post-market monitoring for deployers

Reference

Art. 26.5 requires deployers of high-risk AI to monitor the system's operation against the provider's instructions and to report risks and serious incidents. Monitoring is the early-warning mechanism that connects to incident reporting under Art. 73.

Art. 5 EU AI Act: all 8 prohibited AI practices explained

Reference

Art. 5 lists the eight prohibited AI practices, including subliminal manipulation, exploitation of vulnerable groups, social scoring, and untargeted facial-recognition scraping. These prohibitions are absolute, apply to every organisation regardless of size, and have been in force since 2 February 2025.

Art. 6 EU AI Act: how to classify a high-risk AI system

Reference

Art. 6 sets out how to classify a high-risk AI system: a system is high-risk if it is a safety component of a product under Annex I, or falls within one of the Annex III use cases. Misclassification is itself a violation, and the responsibility rests with the organisation, not the supplier.