Question 1

What is the Sage People cloud archive and how does it differ from data archival?

Accepted Answer

The Sage People cloud archive is a specific deployment shape of the Syntra ETL archive: a queryable, cost-tiered object-storage estate that holds extracted Sage People data as columnar Parquet files in your own AWS S3, Azure Blob, or Google Cloud Storage account. Data archival is the broader business activity — what you retain, for how long, under which regulation. The Sage People cloud archive is the technical product that delivers that activity at cents-per-GB economics, while keeping queries fast enough to satisfy an HMRC subject access request, an ICO data-subject request, or an internal audit lookup against Worker__c and Salary__c history without ever waking the original Salesforce org. Most customers run both: the policies and retention schedules sit in the archival programme, and the cloud archive is where the bytes physically live.

Question 2

Where exactly does the Sage People cloud archive store data?

Accepted Answer

Inside your own cloud tenancy — not Syntra's. The Sage People cloud archive deploys into your AWS account (S3 + Athena + Glue Data Catalog), your Azure subscription (Blob Storage + Synapse Serverless + Purview), or your Google Cloud project (Cloud Storage + BigQuery external tables). Syntra ETL writes Parquet partitions, registers them in your catalog, and grants your IAM roles read access. We never custody the data, never hold a copy on Syntra-owned infrastructure, and never charge you a per-GB storage fee — your cloud provider invoices you directly. For UK-resident Sage People customers worried about data sovereignty post-Brexit, the cloud archive can be pinned to UK regions (London for AWS, UK South for Azure) with explicit residency controls.

Question 3

How is the Sage People cloud archive tiered for cost?

Accepted Answer

Tiering follows access frequency. Hot tier (last 13 months of Worker__c, Salary__c, Leave_Request__c activity) lives on S3 Standard / Azure Hot Blob / GCS Standard — queries return in seconds. Warm tier (months 14–36) sits on S3 Standard-IA / Azure Cool / GCS Nearline — queries return in tens of seconds, storage cost drops ~40%. Cold tier (years 4–7) moves to S3 Glacier Instant Retrieval / Azure Cool-to-Archive lifecycle / GCS Coldline — queries return in minutes, storage cost drops 80%+. Frozen tier (year 8+, regulatory minimum retention) moves to S3 Glacier Deep Archive / Azure Archive / GCS Archive — sub-cents per GB-month with a 12-hour retrieval window suitable for once-a-decade HMRC investigations. The Sage People cloud archive moves partitions automatically based on lifecycle rules you set per object and per business unit.

Question 4

What kinds of queries does the Sage People cloud archive support?

Accepted Answer

Anything you'd ask of the live Sage People org, against historical data, expressed in standard SQL. AWS Athena, Azure Synapse Serverless, and BigQuery all read the Parquet directly using their respective SQL dialects. Typical queries: 'show all salary changes for employee EMP-12345 in 2021' (subject access request); 'list every UK worker who was on SMP in tax year 2022–23' (HMRC inquiry); 'export the full Leave_Request__c history for the Manchester office, FY2019–FY2023' (litigation hold); 'rebuild headcount as at 31 March 2022 for the audit working paper' (statutory audit). Because Parquet is columnar, a query touching three columns scans 3% of the bytes — costs and latencies are dramatically lower than a Salesforce SOQL query against the same logical data.

Question 5

How does the Sage People cloud archive handle UK GDPR and DPA 2018 deletion rights?

Accepted Answer

Right-to-erasure requests under UK GDPR Art. 17 and DPA 2018 are honoured at the row level. The Syntra ETL retention engine flags a worker record across every partition that contains it (Worker__c, Employment_Record__c, Salary__c, Leave_Request__c, Performance_Review__c, plus any custom-object extensions). On scheduled retention runs, the engine rewrites the affected Parquet partitions excluding the flagged rows, replaces them atomically, and updates the catalog. The deletion is permanent and verifiable — there is no soft-delete tombstone left behind. A signed deletion certificate (subject identifier, partitions affected, byte counts before/after, SHA-256 hashes) is emitted as evidence for the ICO and for the data subject. The Sage People cloud archive maintains a separate, ICO-compliant exception register for records subject to legal hold, where erasure is suspended pending resolution.

Question 6

How does the Sage People cloud archive compare to keeping the Salesforce org alive read-only?

Accepted Answer

Keeping a retired Sage People org alive in read-only mode still requires Salesforce Platform licences (typically £100–£250 per user per month for the Integration / Read-only SKUs, multiplied by the user count that needs HR-history access), plus annual sandbox refresh costs, plus any third-party AppExchange app subscriptions that were used inside Sage People. For a 2,000-employee customer with ~50 HR/payroll/audit users who need historical access, the read-only org route runs £60–150k/year indefinitely. The Sage People cloud archive replaces that with a £2–6k/year object-storage bill plus £8–15k/year for query catalog and Syntra ETL platform — a 5–10× reduction sustained across the entire retention horizon. ROI typically pays back the migration project cost in 18–30 months on archive savings alone.

Question 7

How does cutover from live Sage People to the cloud archive work?

Accepted Answer

Cutover is staged. Stage 1: full historical extract of every Sage People custom object (Worker__c, Employment_Record__c, Salary__c, Leave_Request__c, Position__c, plus your custom-object extensions) into the cloud archive, with row-level reconciliation against the live org. Stage 2: incremental delta capture via SystemModstamp watermarks while the org is still live, replayed nightly into the archive so it stays current. Stage 3: cut. The live Sage People org is either decommissioned outright (if you've migrated to a new HCM) or reduced to read-only with the licence count dropped to the minimum required for the runout period. Stage 4: ongoing operation — the Sage People cloud archive is the durable record; the live org (if any) is treated as transient. Most customers complete stages 1–3 in 6–10 weeks for a 2,000-employee org.

Question 8

Can the Sage People cloud archive feed downstream systems and reports?

Accepted Answer

Yes. The Sage People cloud archive is a first-class data source for any BI tool that reads Parquet, Athena, Synapse, or BigQuery — Tableau, Power BI, Looker, Qlik, ThoughtSpot, plus any custom Python/R notebook. Common downstream patterns: workforce analytics dashboards that need 5+ years of headcount and attrition trend (impossible to keep performant inside Sage People itself); finance reconciliation reports that join archived salary history to the GL; HRIS audit reports for SOC 2 / ISAE 3402 evidence collection; ML feature stores that need historical worker attributes for talent-prediction models. The Sage People cloud archive is also the standard source for downstream Oracle Fusion HCM if a future migration happens — eliminating a second extraction from a long-since-decommissioned Salesforce org.

Sage People Cloud Archive — Queryable Parquet, Cents per GB

Why the Sage People cloud archive beats every other retention option

What the Sage People cloud archive delivers

The six architectural choices that make the Sage People cloud archive different

Your cloud, your data

Open Parquet, not proprietary

Athena / Synapse / BigQuery SQL

Hot → warm → cold → frozen tiering

Row-level UK GDPR erasure

UK regulatory lifecycle defaults

Stand up the Sage People cloud archive in 6–10 weeks

Cloud account provisioning — Week 1

Sage People extract scope sign-off — Week 1–2

Pilot extract & query validation — Week 2–4

Full historical extract — Week 4–7

Incremental delta replay — Week 6–8

Cutover + lifecycle activation — Week 8–10

What the Sage People cloud archive does that read-only Salesforce can't

5–10× lower TCO

Faster audit queries

No vendor lock-in

Scales without licence drama

Better data-sovereignty story

Lower carbon footprint

Frequently asked questions

Ready to scope your Sage People cloud archive?