Question 1

What is a SAP SuccessFactors cloud archive?

Accepted Answer

A sap successfactors cloud archive is a Parquet-on-object-storage archive of your SuccessFactors HXM data — PerPerson, PerEmployment, EmpJob, EmpCompensation, FormHeader, JobReq, learning history, MDF custom objects, Foundation Objects — held in your own cloud account (AWS S3 / Azure Blob / GCS / OCI Object Storage) with tiered storage (hot/warm/cold), queryable directly via Athena / Synapse Serverless / BigQuery External Tables / Snowflake External Tables, with hash-signed manifests for audit. It replaces the cost and operational burden of keeping the SuccessFactors tenant subscription live solely to hold historical HR data while satisfying every downstream need — ex-employee lookups, works-council audits, GDPR DSARs, SOX HR-control evidence, pension calculations.

Question 2

Why move SuccessFactors data to a cloud archive instead of keeping the tenant live?

Accepted Answer

Cost, control and longevity. Cost: a 20,000-employee SF tenant subscription across EC + Performance + Comp + Recruiting + Learning typically runs $7–14M/year at $30–60 PEPM combined; keeping it live just for historical access burns that budget on data that nobody is actively transacting against. A sap successfactors cloud archive holding the same data on tiered object storage costs single-digit-thousands per year for the same 20k-employee footprint. Control: the archive sits in your own cloud account, in your chosen region, with your IAM and encryption — not in SAP's multi-tenant control plane. Longevity: the archive is in open Parquet format that any SQL engine can read for the next 20+ years, independent of SAP's roadmap, SF's bi-annual upgrade cycle and any future SAP HXM rebranding.

Question 3

How is a SuccessFactors cloud archive different from raw OData exports to flat files?

Accepted Answer

Three things. (1) Format and queryability — Parquet with columnar compression and predicate pushdown lets SQL engines scan billions of rows in seconds for a few cents; flat-file JSON or CSV exports are essentially write-only and require a re-load to anywhere queryable. (2) Schema governance — the archive carries the SF logical entity model (Worker, Assignment, Salary, Form, JobReq) with type-stable columns evolved as SF entities change across the bi-annual upgrade cycle; raw OData exports embed every entity-version change in the file structure, so a 2018 export and a 2026 export have incompatible shape. (3) Provenance and audit — the archive is hash-signed at file and manifest level with extraction timestamp, OAuth token, RBP context and source-row count; flat-file exports have none of that and fail any forensic-grade audit.

Question 4

What's the typical storage profile of a sap successfactors cloud archive?

Accepted Answer

For a 20,000-employee tenant with 10 years of effective-dated history, the archive typically sits in the 50–300 GB range across all entities — modest by cloud-storage standards. PerPerson + PerEmployment + EmpJob + EmpCompensation effective-dated history is the bulk (often 5–8M version rows), followed by FormHeader (every performance form ever issued), then JobReq + Application (recruiting history), then learning history (completion records). MDF custom objects vary wildly — some tenants have very rich MDFs that dominate, others have almost none. Tiered storage strategy: hot (S3 Standard / Hot Blob / GCS Standard) for the last 12 months at ~$0.023/GB/month, warm (S3 Standard-IA / Cool Blob / GCS Nearline) for 1–7 years at ~$0.0125/GB/month, cold (S3 Glacier / Archive Blob / GCS Archive) for 7+ years at ~$0.004/GB/month. Total annual storage cost for 20k-employee 10-year archive: single-digit-thousands of dollars.

Question 5

How does the cloud archive get refreshed from SuccessFactors?

Accepted Answer

Two patterns. Pattern one (post-migration archive) — single full extract at SF tenant decommissioning time, then the archive is read-only forever. Pattern two (ongoing co-existence archive) — initial full extract, then daily or near-real-time incremental loads via Syntra ETL's watermark-based OData modified-since extractors while the SF tenant remains live for some active use. Both patterns produce the same hash-signed Parquet output. The choice depends on whether the SF tenant is being fully retired (pattern one) or kept live for a long-tail of active functionality with the archive providing analytics and long-term storage (pattern two). Most SF cloud archive deployments end up on pattern one within 12–24 months of full Fusion HCM cutover.

Question 6

Can we query the SuccessFactors cloud archive directly without re-loading anywhere?

Accepted Answer

Yes — that is the central design principle. The archive is in open Parquet format, partitioned by legal employer and effective fiscal year, and registered as external tables in your query engine of choice: Athena (AWS), Synapse Serverless (Azure), BigQuery External Tables (GCP), Snowflake External Tables (any cloud), or Oracle ADW Object Storage external tables (OCI). Queries hit the Parquet directly with predicate pushdown and column projection — typical 'show me a worker's full job history as of 14 March 2019' query scans a few MB of one date-partition file and returns in under a second for a few cents of compute. No re-loading to a warehouse, no separate ETL pipeline, no shadow copies of the archive.

Question 7

How does the cloud archive handle SuccessFactors RBP and access control?

Accepted Answer

RBP (Role-Based Permissions — SF's permission roles + permission groups model) is captured at extraction time and converted to a logical access-control model attached to the archive. When archive queries are issued through the consumer portals (ex-employee self-service, HR audit, works council, GDPR DSAR), the same RBP-equivalent filtering is enforced at the query layer using cloud-native IAM, column-level masking (Snowflake masking policies, BigQuery column-level security, Athena Lake Formation), and row-level security. The result: the same access-control posture you had in the live SF tenant, enforced against the archive, without keeping the SF tenant subscription active. Every query is logged for GDPR Article 30 RoPA and SOX audit trail.

Question 8

Is a sap successfactors cloud archive compliant with EU GDPR and German Betriebsverfassungsgesetz?

Accepted Answer

Yes, with the right deployment posture. GDPR compliance requires: data minimization (the archive holds only data with documented retention basis — ex-employees beyond retention purged on schedule), right of access (DSAR responder UI indexed by national identifier returns every record in minutes), right to erasure (forget-me workflow removes subject records from Parquet using copy-on-write delta partitions while preserving the audit trail), processing record (every access logged for Article 30 RoPA). German Betriebsverfassungsgesetz (works council law) compliance adds: works-council representative access portal, statutory headcount filings, gender-pay-gap historical analysis, 10+ year retention for some records. The archive is typically deployed in EU-region object storage (S3 eu-central-1, Azure Germany West Central, GCS europe-west3, OCI Frankfurt) so data never leaves the EU.

SAP SuccessFactors Cloud Archive — Parquet, Tiered, Queryable

What a sap successfactors cloud archive actually is

What the cloud archive holds

What makes the sap successfactors cloud archive different from raw exports

Parquet columnar format

Partitioned by LE + fiscal year

Hash-signed manifests

Tiered storage rotation

External-table query model

RBP-equivalent access control

Standing up a sap successfactors cloud archive — the deployment

Cloud target selection — Days 1–3

Full SF extraction — Days 3–10

External-table registration — Days 10–12

RBP & access-control mapping — Days 12–15

Tiered storage lifecycle — Days 15–16

Parallel-run & cutover — Weeks 4–8

The query engines that read the SuccessFactors cloud archive

AWS Athena

Azure Synapse Serverless

BigQuery External Tables

Snowflake External Tables

Oracle ADW External Tables

Power BI / Tableau / Looker

Frequently asked questions

Plan your sap successfactors cloud archive deployment