Productised cloud archive for Epic downstream finance, HCM, SCM data plus retired legacy clinical records. Parquet on tiered object storage (AWS / OCI / Azure / GCP / on-prem S3), HIPAA-grade encryption, SQL queryable, per-state retention engine. ~80% storage cost reduction.
Building an Epic archive in-house is a 6–12 month project. The productised epic systems cloud archive brings extraction, transformation, storage, access and retention rules pre-built, tested and updated.
Healthcare retention obligations are long and complex. Adult medical records 5–10 years in most states. Pediatric records 25–30 years. HIPAA accounting-of-disclosures 6 years. IRS billing records 7 years. Medicare cost reports 10 years. False Claims Act 6 years. State public records where applicable. Joint Commission medical staff records often permanent. Storing all of that in active Epic Clarity is technically possible but expensive — and it slows down operational reporting users who only need the recent 2–3 years.
The disciplined answer is a cloud archive. The disciplined approach to a cloud archive is to not build it from scratch — five sub-systems (extraction, transformation, storage, access, retention) each non-trivial, plus ongoing maintenance as state retention rules evolve and Epic versions change. The Epic Systems cloud archive product brings all five pre-built, tested across healthcare deployments, and continuously updated as the regulatory landscape shifts.
Architecture: Parquet (columnar, vendor-neutral, compression typically 4–6x) on tiered cloud object storage (or on-prem S3-compatible storage if your compliance posture requires it). Hot tier for recent records, warm for operational lookback, cold for true long-tail. SQL queryable via the standard analytical query engine of your cloud (Athena on AWS, OCI Data Catalog/BigQuery-style on OCI, Synapse on Azure, BigQuery on GCP). HIPAA-grade encryption end to end with customer-controlled keys. Role-based access with HIPAA logging baked in. Retention engine that runs auto-expiry with proof-of-destruction. That's the product.
Each one comes pre-built and tested — and gets updated as the regulatory + Epic landscape evolves.
AWS, OCI, Azure, GCP, on-prem S3-compatible. Pick what matches your existing cloud strategy. Hybrid deployments supported (e.g., OCI hot + on-prem cold mirror).
Automatic hot → warm → cold movement based on age + access pattern. ~80% cost reduction over a 10-year horizon versus single-tier.
Athena, Trino, Presto, BigQuery, Snowflake, Synapse — query Parquet directly. No restore-and-rummage, no proprietary access layer.
Encryption, KMS, role-based access, accounting-of-disclosures logging — all out of the box. BAA with cloud provider. SOC 2 controls on the archive.
Per-state rules pre-built for adult, pediatric, billing, public records. Continuously maintained. Auto-expiry with proof-of-destruction signed by privacy officer.
Archive exposed to Oracle OTBI and BI Publisher. Unified live + historical dashboards span Fusion + archive seamlessly.
A repeatable workflow from cloud architecture through tiered storage, retention engine and OTBI integration.
Cloud platform selection (AWS / OCI / Azure / GCP / on-prem). KMS strategy. Network architecture (private endpoints, no public exposure of PHI). BAA executed with cloud provider where applicable.
Object storage buckets provisioned per tier. Lifecycle rules configured for automated hot → warm → cold movement. Encryption keys provisioned in KMS. Access policies templated by role.
Clarity-certified extractors configured to push directly to archive Parquet. Legacy system extractors (for retiring Lawson/McKesson/etc.) configured. Cogito reconciliation harness activated.
Per-state retention rules mapped to record classes. Legal entity → state mapping. Auto-expiry schedule set. Proof-of-destruction workflow tested. Privacy officer sign-off.
Role-based access wired (finance, HR, clinical, audit, ex-employee self-serve). HIPAA logging activated. Self-serve UI for common queries. SQL query endpoint for ad-hoc. Break-glass workflow tested.
Archive exposed to Oracle OTBI and BI Publisher. Unified live + historical dashboards built. Active Clarity retention trimmed to operational window. Legacy systems decommissioned. Annual retention review scheduled.
The same archive architecture deploys on every major cloud and on-prem S3-compatible storage. Pick what matches your existing strategy.
S3 (standard / intelligent tiering / Glacier / Glacier Deep Archive) with AWS KMS. Athena for SQL queries. HIPAA-eligible services list respected. BAA in place.
OCI Object Storage (standard / infrequent access / archive) with OCI Vault. Co-located with Oracle Fusion for low-latency integration. OCI HIPAA-eligible services.
Blob Storage (hot / cool / archive) with Azure Key Vault. Synapse for SQL queries. Azure HIPAA BAA included with enterprise agreements.
Cloud Storage (standard / nearline / coldline / archive) with Cloud KMS. BigQuery for SQL queries. Google Cloud HIPAA BAA.
MinIO, NetApp StorageGRID, Dell ECS, Pure FlashBlade. For organizations with strict on-prem PHI residency requirements. Same architecture, same access layer.
Common pattern: OCI hot tier co-located with Oracle Fusion + on-prem cold-tier mirror for compliance. Or AWS primary + Azure DR. Architecture supports any topology.
The Syntra ETL Epic Systems cloud archive is a productised long-term archive layer for Epic downstream finance, HCM, SCM and retired legacy clinical data. Architecture: Parquet (columnar, vendor-neutral) on tiered cloud object storage (AWS S3, OCI Object Storage, Azure Blob, Google Cloud Storage) with hot/warm/cold tiers; HIPAA-grade encryption (AES-256 at rest, TLS 1.3 in transit, KMS-managed keys); SQL queryable via Athena/Trino/BigQuery/Snowflake/Synapse; role-based access with full HIPAA accounting-of-disclosures logging; per-state retention rule engine with auto-expiry and proof-of-destruction. Designed for multi-decade retention at sustainable cost — ~80% storage cost reduction versus operational Clarity through tiering.
Building an archive in-house is a 6–12 month project that has to solve five things: extraction (Clarity-certified, partitioned, reconciled against Cogito); transformation (Parquet schema, retention metadata, indexing); storage (tiering, encryption, KMS integration); access (role-based, HIPAA-logged, self-serve UI); and retention rules (per state, per record class, with auto-expiry and proof-of-destruction). Every one of those is a non-trivial sub-project. The Epic Systems cloud archive product brings all five pre-built, tested across healthcare deployments and updated as state retention rules evolve. Customers typically save 6–9 months of in-house build time and ~$1–2M of equivalent consulting cost.
All major cloud providers and on-prem object storage. AWS (S3 standard / intelligent tiering / Glacier / Glacier Deep Archive with KMS encryption); Oracle Cloud Infrastructure (Object Storage standard / infrequent access / archive with OCI Vault); Microsoft Azure (Blob Storage hot / cool / archive with Azure Key Vault); Google Cloud Storage (standard / nearline / coldline / archive with Cloud KMS); on-prem S3-compatible (MinIO, NetApp StorageGRID, Dell ECS, Pure FlashBlade). For HIPAA the cloud provider's HIPAA-eligible service list is respected and BAAs are in place. Multi-cloud and hybrid deployments are supported — common pattern is OCI Object Storage co-located with Oracle Fusion plus an on-prem cold-tier mirror.
Yes. Architecture is designed for HIPAA compliance from the ground up: encryption at rest (AES-256) and in transit (TLS 1.3); KMS-managed keys with customer-controlled key rotation; role-based access enforced at the storage layer; full audit logging of every read access with actor, timestamp, justification (satisfies §164.312(b) audit control and §164.528 accounting-of-disclosures); business associate agreement with cloud provider; SOC 2 Type II controls on the archive layer itself. State retention rules pre-built for all 50 states + DC + territories: adult medical records, pediatric records, billing records, fraud-exposure window, public records (for public hospitals). Retention triggers automated deletion with auditable proof-of-destruction.
Tiering is rule-based on age + access pattern. Default rules (configurable per record class): records less than 12 months old stay on hot tier (SSD-backed object storage, sub-second query); 12 months to 7 years on warm tier (standard object storage, sub-minute query); 7+ years on cold tier (archive-tier storage like S3 Glacier Deep Archive, with restore window before query). Access pattern can promote records back to a warmer tier — a 5-year-old record that suddenly gets queried 100x for an audit moves to warm tier automatically. The Epic Systems cloud archive tier engine produces ~80% storage cost reduction versus single-tier active Clarity over a 10-year horizon.
Initial bulk load (one-time historical backfill) typically runs 12–48 hours for a regional health system's multi-TB downstream finance/HCM/SCM history. Multi-hospital academic medical centres with 15+ years of history can run 4–7 days. Loads are parallelised by service area + fiscal period, throttled to respect source-system limits (Clarity SQL Server CPU/IO; legacy system rate limits). Ongoing delta loads (post initial backfill, capturing daily new records) run sub-hour every night. Every load emits a hash-signed manifest. The Epic Systems cloud archive is queryable as soon as the first partition lands — incremental availability, not all-or-nothing.
Yes — through two patterns. Pattern one (OTBI / BI Publisher): the archive is exposed as a queryable source to Oracle OTBI for ad-hoc analytics and BI Publisher for pixel-perfect reports. Same dashboard can show live Fusion data + archive historical data as a unified time series. Pattern two (OIC integration): Oracle Integration Cloud pulls specific archive records on demand into Fusion for processes that need historical context (vendor 1099 substantiation, ex-employee W-2 reissue triggering payroll re-run, audit walkthrough preparation). Both patterns work — choose based on whether the use case is interactive reporting or process integration. Both honour HIPAA access controls.
Highly favourable versus active Clarity. For a typical 8-hospital regional system carrying 15 years of downstream finance + HCM + SCM data: total archive footprint ~20–60 TB; annual storage cost $50–200K depending on tier distribution and access pattern; annual licensing for the archive product itself in the $100–300K range depending on volume + access. Versus active Clarity storage + license for the same window: often $500K–1M+/year. Net savings typically $300K–800K/year, plus avoided legacy-system licensing for the systems being decommissioned ahead of Epic consolidation (often another $200K–1M+/year). Multi-year ROI is strong and gets better as the retention horizon extends.
Book a 30-minute architecture call. Walk through your cloud strategy, retention obligations, current Clarity footprint and legacy systems in scope. We'll show a concrete deployment on your preferred cloud — and the multi-year cost projection versus active Clarity.