SAP S/4HANA DATA ARCHIVAL

    SAP S/4HANA Data Archival — Off HANA In-Memory, Onto Cloud Cold Storage

    Move ACDOCA, BKPF/BSEG, LFA1/KNA1, MARA, and the rest of your closed-period S/4HANA data to queryable cloud archive. German HGB/AO 10-year retention, SOX, IFRS, BaFin, FDA compliant. SQL/REST query interface. 60–85% lower TCO than keeping data on HANA.

    60–85%
    TCO reduction vs HANA in-memory
    10 yr
    German HGB/AO retention covered
    < 1 sec
    Typical auditor query latency
    100%
    Schema preservation in archive

    Why SAP S/4HANA data archival matters — the HANA in-memory cost problem

    HANA's strength is its weakness: every TB of active data sits in RAM, licensed per GB, costing money whether you query it daily or once a year.

    S/4HANA customers carry an average of 8–12 years of accumulated transactional history in their HANA databases — BKPF/BSEG/ACDOCA grow at 100–500M rows per year in mid-size customers, larger at enterprise scale. Every byte of that history is held in HANA in-memory storage, attracting per-GB licence cost and infrastructure RAM. RISE with SAP makes this visible as a managed-service-unit (MSU) line item that escalates 8–12% annually. The blunt economic fact: 80% of your HANA data is queried less than once a month, but you pay 100% of the in-memory price for it.

    SAP S/4HANA data archival flips the economics. Closed-period transactional history — typically anything older than 24–36 months — moves to cloud object storage in Parquet, partitioned by fiscal year and company code, with a SQL/REST query interface that auditors find more responsive than the original HANA database. HANA in-memory footprint shrinks by 40–70%; the corresponding HANA licence consumption is released; RISE MSU consumption drops; and the archive remains fully queryable for German HGB/AO 10-year retention, SOX 7-year, FDA full-lifecycle, and BaFin financial-services retention requirements.

    The same archival path also works as a migration-prep step. Customers moving from S/4HANA to Oracle Fusion typically archive 80% of their historical S/4HANA data to cold storage before cutover — only operational open balances need to land in Fusion. Result: smaller migration scope, faster cutover, and a permanent archive that survives the eventual S/4HANA decommission.

    Compliance regimes Syntra archive satisfies

    1
    German HGB §257 / AO §147
    10-year retention for accounting records, invoices, tax-relevant documents. Signed, timestamped immutable archive with full audit log.
    2
    SOX / IRS / HMRC
    7-year US, 6-year UK retention for financial records. Pre-built auditor extracts: GL detail, AP voucher detail, AR aging.
    3
    BaFin (German banking)
    Sector-specific retention for regulated financial services, often 10 years. Role partitioning for tax authority and regulatory access.
    4
    FDA 21 CFR Part 11
    Pharma manufacturing records, batch genealogy, equipment records — full product lifecycle plus statute of limitations, often 20+ years.

    What goes into a complete SAP S/4HANA data archive

    Not just data — everything an auditor, regulator, or tax authority might ever ask about your closed-period S/4HANA history.

    📒

    Financial transactional history

    ACDOCA universal journal, BKPF/BSEG legacy journal, BSIK/BSAK AP, BSID/BSAD AR — full closed-period detail with original account-assignment context (cost centre, profit centre, WBS, internal order, functional area) preserved.

    👤

    Vendor & customer master history

    LFA1/LFB1 vendor master with full audit trail, KNA1/KNB1 customer master, payment terms history, withholding-tax setup history, dunning history. GDPR-aware masking on PII fields, role-controlled unmasking for tax-authority queries.

    📦

    Materials & inventory

    MARA/MARC material master across all plants, MBEW valuation history, MSEG movement history, MCHB batch records (FDA-relevant for pharma), MKPF document headers. Multi-plant inventory positions preserved as-of any historical date.

    🛠️

    Customisation catalog

    Every ABAP report (TRDIR), CDS view extension, BAdI implementation, IDoc partner profile, SAP CAP service — source code, version history, business purpose. Post-decommission 'what did the system do?' evidence.

    📊

    Reports & queries

    ABAP report definitions, CDS view definitions, SAP Query / Quick Viewer definitions, Fiori app catalog snapshots — preserved as artifacts so a future auditor can see the exact report that originally signed off a closed period.

    🔐

    Security snapshot

    Role and authorisation-object snapshot at decommission date, segregation-of-duties rule evaluation, organisational authorisation context — the evidence layer auditors need to validate access controls during the retention window.

    The SAP S/4HANA data archival process — five stages

    Whether you're archiving in preparation for Fusion migration, or after, or just to control HANA licence cost on a live S/4HANA system, the workflow is the same.

    1

    Scope & Retention Design — Week 1

    Inventory S/4HANA modules in use, classify data domains (FI, CO, MM, SD, PP, EAM), map each domain to applicable retention regime (German HGB/AO, SOX, IFRS, BaFin, FDA). Output: a per-domain retention policy signed by compliance, finance, and operations.

    2

    Extract & Stage — Weeks 2–5

    Pre-built HANA extractors (or CDS view consumers for RISE) pull every in-scope SAP table — full history, not just open items. Output staged to cloud object storage as Parquet, partitioned by fiscal year, period, company code (BUKRS). Hash-signed at row level.

    3

    Customisation & Report Inventory — Weeks 3–6

    Discovery engine crawls TRDIR (ABAP repository), DDLDEPENDENCY (CDS view catalog), V_EXIT_ACT (BAdI registry), IDoc partner directory, Fiori app catalog. Catalog stored alongside data with same retention policy.

    4

    Build Query Interface — Weeks 5–7

    SQL (JDBC/ODBC) and REST endpoints provisioned. Role-based access configured per domain. Pre-built auditor extracts (GL detail, AP aging, payroll register, asset register, batch genealogy for pharma) materialised. Sensitive-field masking applied.

    5

    Validate & Decommission — Weeks 7–9

    Sample auditor queries run against archive vs live S/4HANA to validate parity. Sign-off pack issued. Archive becomes system of record for retention. For full decommission: S/4HANA moves to read-only, HANA licence consumption is released, then the source instance is decommissioned.

    What customers save by archiving instead of holding everything in HANA

    The economics of SAP S/4HANA data archival versus keeping cold data on HANA in-memory storage.

    💰

    HANA in-memory licence

    HANA Enterprise Edition is licensed per GB of in-memory data; mid-size customers spend £500K–£2M/year. Shrinking the active dataset by 50% via archival typically releases £250K–£1M/year in licence consumption — directly recoverable at next renewal.

    ☁️

    RISE with SAP MSU

    RISE customers see HANA cost expressed as managed-service units (MSU), tied to active data volume. Archive shrinks the MSU draw, with RISE customers reporting 30–50% reduction in annual subscription escalation at the next renewal cycle.

    🖥️

    Infrastructure & RAM

    Self-hosted S/4HANA on-prem typically runs on HANA-certified appliances or TDI infrastructure with multi-TB RAM. Each TB of in-memory data costs £30K–£80K/year in hardware refresh and DC space. Cold archive runs on object storage at pennies per GB-month.

    👥

    Basis & DB ops overhead

    Larger HANA databases mean longer backup windows, more complex HSR setup, longer index rebuild times, more delta-merge contention. Halving active data halves the operational burden — typically 0.5–1.0 FTE per year of Basis time recovered.

    🔁

    Patch & upgrade cycles

    HANA patches, S/4HANA support packs, and SP-stack upgrades take longer and carry more risk on larger databases. Archived data doesn't participate in patching — only the live HANA portion does.

    📉

    Risk & attack surface

    Every TB of operational data is a potential breach exposure. Archive moves the exposure to encrypted, audited, role-restricted object storage — safer for the data and more defensible to the CISO and to regulators (BaFin, FCA, ICO).

    Frequently asked questions

    What is SAP S/4HANA data archival and why is it different from SAP's classic Data Archiving (ADK)?+

    SAP S/4HANA data archival is the process of moving closed-period transactional data, historical postings, and time-expired master records out of the live S/4HANA HANA database into a long-term cloud archive. SAP's classic Data Archiving (ADK, archive development kit, transaction SARA) does something similar but stays within the SAP world: data goes to an ADK file format readable by SAP only. Syntra ETL's S/4HANA data archival is cloud-native: archive lives in queryable Parquet on cloud object storage, with SQL/REST access that any BI, audit, or downstream tool can consume — without keeping an S/4HANA instance alive to read it. For customers planning eventual migration to Oracle Fusion, this is critical: the archive survives the source decommission.

    Why archive SAP S/4HANA data given HANA's in-memory architecture is supposedly 'no limits'?+

    HANA is fast at any volume in principle, but it's not free at any volume. HANA in-memory storage is licensed per active-data GB and consumes RAM proportionally — every TB of data carries direct licence cost plus the infrastructure to hold it in memory. RISE with SAP charges similarly on a managed-service-unit basis tied to active data footprint. By moving closed-period and rarely-accessed data to cold archive (Parquet on S3, GCS or Azure Blob), customers shrink the HANA in-memory footprint by 40–70%, releasing licence consumption proportionally. The archive is queryable in seconds via SQL/REST when needed — you keep the data, you stop paying HANA in-memory rates to hold it.

    How long should we retain SAP S/4HANA data?+

    Retention varies by jurisdiction and domain. German HGB §257 and AO §147 mandate 10 years for accounting records, invoices, and documents related to taxation — the strictest mainstream commercial retention regime. UK Companies Act and HMRC: 6 years. US SOX and IRS: 7 years for financial records. EU GDPR adds the inverse pressure: personal data must NOT be retained beyond business necessity. For sector-specific: FDA 21 CFR Part 11 for pharma manufacturing records (typically full product lifecycle + statute of limitations, often 20+ years); BaFin for German financial services (varies by record type, often 10 years); MiFID II for investment services (5–7 years). Syntra ETL's archival policies are configurable per data domain so each class meets its retention rule without paying to keep everything everywhere.

    Does Syntra ETL preserve SAP S/4HANA data structure during archival?+

    Yes. The S/4HANA data archival path preserves the source table schema — column names, datatypes, ABAP-dictionary-defined structure — in the archive. Archived data lives as Parquet with embedded schema, plus a JSON Schema sidecar for tooling. The archive query interface lets auditors run familiar SELECT statements against BKPF, BSEG, ACDOCA, LFA1, KNA1, MARA and so on, returning results that match what they would have seen in the live S/4HANA system — including all account-assignment dimensions, document-level context, and material/customer/vendor relationships. No schema flattening, no shape changes — the archive looks like S/4HANA to the auditor.

    What happens to ABAP customisations and CDS view extensions during archival?+

    Code itself isn't archived as data — it's preserved as a customisation inventory artifact. Syntra ETL's discovery engine catalogs every ABAP report (TRDIR), CDS view extension (DDLDEPENDENCY), BAdI implementation (V_EXIT_ACT), IDoc partner profile, and SAP Cloud Application Programming service in the source system. The catalog includes source code, version history, last-modified date, and (where derivable) the business purpose. This serves two needs: post-decommission audit evidence ('what did our S/4HANA system actually do?') and migration support ('what custom logic did we need to re-implement in Fusion?'). The catalog is signed, timestamped, and retained per the same retention rules as the data.

    Can the SAP S/4HANA data archive replace our HANA instance entirely?+

    Yes, for organisations that have migrated to a new ERP (typically Oracle Fusion) and no longer need operational S/4HANA. The Syntra archive holds the complete data, the customisation catalog, the security-model snapshot, and the reporting library — everything an auditor would otherwise demand a live S/4HANA instance for. Customers save £500K–£3M+ per year in HANA licensing, RISE with SAP subscription, Basis support, infrastructure, and patch-management overhead. The archive query interface satisfies German HGB/AO, SOX, IFRS, BaFin, and FDA access requirements for the full retention window, with sub-second query response and signed, timestamped evidence packs.

    How is access to the SAP S/4HANA archive secured?+

    Access is role-based with mandatory audit logging. Every query against the archive is logged with user, timestamp, query text, rows returned, and data classification accessed. Archive data is encrypted at rest with KMS-managed keys and in transit with TLS 1.3. Sensitive fields (employee SSN, customer bank account, vendor tax ID, salary data in PA0008) are masked by default and require explicit role permission to unmask. For German customers under BaFin or HGB regimes, the archive supports separate role partitioning so tax-authority access (Finanzamt) can be granted without exposing operational finance data, and FDA-regulated pharma customers get separate quality-assurance role partitioning for batch and equipment records.

    How fast is query performance against archived SAP S/4HANA data?+

    Sub-second for typical auditor queries (a single fiscal period for a single company code, a single AP invoice lookup, a vendor history for a single supplier). Multi-period or multi-year aggregation queries against billion-row BSEG history typically return in 5–30 seconds depending on partitioning. Archive data is stored in Parquet partitioned by fiscal year, period, company code (BUKRS), and document type, and the query engine uses column pruning and partition pruning to minimise scan. For year-end audits, customers commonly pre-materialise an auditor-facing dataset (trial balance per BUKRS, AP aging, fixed-asset register, IFRS vs local-GAAP reconciliation) for instant access during the audit window.

    Ready to plan SAP S/4HANA data archival?

    30-minute discovery call. We'll walk through your S/4HANA modules, HANA footprint, retention requirements (HGB, SOX, BaFin, FDA), and decommission timeline — and quantify the TCO reduction you'd see in year one.