Question 1

What is a Majesco / Sapiens data extraction tool?

Accepted Answer

A Majesco / Sapiens data extraction tool is the technology layer that pulls structured policy, billing and claims data, plus unstructured attachments, out of Majesco P&C/L&A Core Suite or Sapiens IDITSuite/CoreSuite/ALIS without disrupting live insurance operations. For Majesco Cloud Platform the primary path is Majesco Data Lake (Majesco's bulk Parquet export to a customer-controlled cloud bucket), supplemented by REST APIs for incremental deltas and SOAP for legacy integration patterns. For Sapiens IDITSuite SaaS the path is IDIT data services plus REST APIs. For legacy on-prem installations, extraction runs over JDBC against the Oracle or SQL Server backend with stored-procedure-aware metadata. Syntra ETL's Majesco / Sapiens data extraction tool ships pre-built extractors for all three paths, so you don't spend three months building extraction pipelines from scratch.

Question 2

How does Syntra ETL's Majesco / Sapiens data extraction tool work with Majesco Data Lake?

Accepted Answer

Majesco Data Lake is Majesco's mechanism for bulk data extraction from Majesco Cloud Platform. It publishes Parquet files of the full P&C Core Suite data model (policies, risks, coverages, transactions, claims, exposures, reserves, payments, reinsurance) to a customer-controlled cloud storage bucket on a scheduled cadence. Syntra ETL's Majesco / Sapiens data extraction tool consumes Data Lake Parquet directly, validates its schema against the live Majesco metadata, surfaces any custom-product columns added through the source-control workflow, and routes the data into the downstream Fusion pipeline or long-term archive. There is no bespoke Data Lake scaffolding to build and no manual schema mapping: configure the scope and run.
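The schema-validation step can be pictured with a minimal sketch. It assumes the expected schema (from metadata) and the schema read from a Parquet file are both available as plain column-to-type mappings; the column names, type strings and the diff_schema helper are hypothetical illustrations, not Syntra ETL's or Majesco's actual API.

```python
from dataclasses import dataclass, field


@dataclass
class SchemaDiff:
    missing: list = field(default_factory=list)        # expected columns absent from the file
    custom: list = field(default_factory=list)         # columns in the file that metadata does not list
    type_changes: dict = field(default_factory=dict)   # column -> (expected type, actual type)

    @property
    def ok(self) -> bool:
        # Extra (custom) columns are surfaced, not treated as failures.
        return not self.missing and not self.type_changes


def diff_schema(expected: dict, actual: dict) -> SchemaDiff:
    """Compare an expected column->type mapping with the one found in a file."""
    diff = SchemaDiff()
    for column, expected_type in expected.items():
        if column not in actual:
            diff.missing.append(column)
        elif actual[column] != expected_type:
            diff.type_changes[column] = (expected_type, actual[column])
    diff.missing.sort()
    diff.custom = sorted(c for c in actual if c not in expected)
    return diff


# Hypothetical schemas for illustration only.
expected = {
    "policy_id": "string",
    "effective_date": "date32",
    "written_premium": "decimal(18,2)",
}
actual = {
    "policy_id": "string",
    "effective_date": "date32",
    "written_premium": "double",
    "x_custom_peril": "string",
}

result = diff_schema(expected, actual)
```

In this sketch a custom column does not fail validation; only a missing column or a changed type does, which matches the idea of surfacing carrier-specific extensions rather than rejecting them.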

Question 3

How does the tool handle Sapiens IDIT data services?

Accepted Answer

Sapiens IDIT (IDITSuite SaaS) exposes data services for bulk extraction of the full P&C insurance data model, plus REST APIs for incremental deltas. Sapiens ALIS for L&A exposes equivalent data services covering policy and policyholder records, premium recognition, surrender values, dividend processing and NAIC #797 replacement records. The Majesco / Sapiens data extraction tool ships pre-built profiles for IDIT data services, ALIS data services and Sapiens REST APIs, with OAuth2 authentication and modified-since watermark patterns for incremental syncs. The same downstream transformation engine consumes Sapiens data alongside Majesco data, so a single carrier running both platforms gets a consolidated archive and Fusion integration.
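A modified-since watermark can be sketched in a few lines. This is only an illustration of the general pattern, assuming each record carries a modified_at timestamp; the record layout and the pull_delta helper are hypothetical and do not reflect the Sapiens REST API.

```python
from datetime import datetime, timezone


def pull_delta(records, watermark):
    """Return records modified after the watermark, plus the advanced watermark."""
    changed = [r for r in records if r["modified_at"] > watermark]
    # If nothing changed, the watermark stays where it was.
    new_watermark = max((r["modified_at"] for r in changed), default=watermark)
    return changed, new_watermark


def utc(year, month, day, hour=0):
    return datetime(year, month, day, hour, tzinfo=timezone.utc)


# Hypothetical source records for illustration only.
source = [
    {"policy_id": "P-1", "modified_at": utc(2024, 1, 1)},
    {"policy_id": "P-2", "modified_at": utc(2024, 1, 3)},
    {"policy_id": "P-3", "modified_at": utc(2024, 1, 5, 12)},
]

first_batch, watermark = pull_delta(source, utc(2024, 1, 2))
second_batch, watermark_after = pull_delta(source, watermark)
```

The strict greater-than comparison avoids re-reading the last record, at the cost of missing a record that shares the exact watermark timestamp; a real sync would typically overlap the window slightly and de-duplicate by key.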

Question 4

Can the tool extract from legacy on-prem Majesco and Sapiens installations?

Accepted Answer

Yes. This matters because most mid-market insurers still run legacy on-prem Majesco or Sapiens for at least some lines while they migrate to Cloud. The on-prem extractor uses read-only JDBC against the Oracle or SQL Server backend (typically pointing at a read replica to avoid any impact on production), with stored-procedure-aware metadata that decodes custom rating rules, typecodes and product-model extensions. The tool ships pre-built table catalogs for Majesco's classic STG (Staging) and CIM (Core Integration Model) tables and for Sapiens IDIT's IDITNAME/IDITPOL/IDITCLM table families. The same downstream pipeline emits FBDI or archive Parquet whether the source was Cloud or on-prem JDBC, so a hybrid estate needs no separate handling.

Question 5

What data domains does the Majesco / Sapiens data extraction tool cover?

Accepted Answer

Full P&C and L&A footprint.

Policy: products, coverages, policies, risks, endorsements, transactions, written/earned/unearned premium ledger, rating outputs, underwriter overrides, custom rating rule outputs.

Billing: bills, invoices, receipts, disbursements, commission statements, agency settlements, NSF/write-offs, premium-payment-plan history.

Claims: claims, exposures, reserves (case + IBNR), indemnity payments, expense payments, recovery payments, SIU flags, litigation flags, attachments.

Reinsurance: treaty definitions, layer attachments, facultative placements, cession history, bordereaux extracts.

L&A specific (Sapiens ALIS): policy-policyholder, deferred premium recognition, cash values, surrender history, dividends, replacement records (NAIC #797).

Configuration: product catalog, Rate Manager / RuleXpress rules, BPM workflow definitions, integration BPMs.

Question 6

How does the tool handle multi-TB underwriting and claims attachments?

Accepted Answer

Attachments are the largest data volume in any insurance extract. Claims attachments (police reports, medical records, repair estimates, SIU dossiers, recorded statements), underwriting files (loss-control reports, application supplements, MVR/CLUE reports) and L&A medical exam records (paramedical exams, blood/urine results, attending physician statements) routinely run multi-TB across 20+ years of retention. The Majesco / Sapiens data extraction tool streams attachments through Majesco Data Lake, Sapiens IDIT data services and REST APIs in parallel (typically 25 concurrent connections, respecting rate limits). It preserves the source attachment-id as a cross-reference, hash-signs each file, and stages the files to cloud object storage with state-retention partitioning, so HIPAA, state-commissioner and reinsurance audits can be answered without re-extracting from a decommissioned system.
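The parallel, hash-per-file pattern described above might look roughly like the following sketch. It assumes a bounded thread pool and a SHA-256 digest per attachment; fetch_attachment is a hypothetical stand-in for a real download call, and the cap of 25 workers simply reuses the figure from the text.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

MAX_CONNECTIONS = 25  # concurrency cap taken from the answer above


def fetch_attachment(attachment_id: str) -> bytes:
    # Hypothetical stand-in: a real extractor would download from the source system here.
    return f"content of {attachment_id}".encode("utf-8")


def stage_attachment(attachment_id: str) -> dict:
    """Fetch one attachment and record its source id, digest and size."""
    data = fetch_attachment(attachment_id)
    return {
        "attachment_id": attachment_id,                 # source id kept as cross-reference
        "sha256": hashlib.sha256(data).hexdigest(),     # content digest for later verification
        "size_bytes": len(data),
    }


def stage_all(attachment_ids):
    # The pool never runs more than MAX_CONNECTIONS fetches at once;
    # map() returns results in the same order as the input ids.
    with ThreadPoolExecutor(max_workers=MAX_CONNECTIONS) as pool:
        return list(pool.map(stage_attachment, attachment_ids))


index = stage_all([f"ATT-{n:04d}" for n in range(100)])
```

A plain digest only proves the file has not changed since staging. If "hash-signs" is meant in the stricter sense, the digest would additionally be signed with a key the auditor trusts.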

Question 7

How is scheduling and incremental extract handled?

Accepted Answer

Three modes.

Initial bulk: a one-shot full extract of historical Majesco/Sapiens data, sized to the data volume and run during off-peak windows. A typical first pass over 10 years of policy and claim history takes 3-10 days, depending on volume and rate limits.

Scheduled incremental: a Data Lake / IDIT / REST delta pull on a daily, hourly or 15-minute cadence, using a modified-since watermark per domain.

On-demand: ad-hoc extracts triggered by state-commissioner data calls, reinsurance audits or SIU investigations.

All three modes use the same extractor configuration, log to the same audit trail and feed the same downstream pipeline.

Question 8

What output formats does the Majesco / Sapiens data extraction tool produce?

Accepted Answer

Multi-target.

Parquet (snappy-compressed, partitioned by state, LOB and fiscal year) for the long-term archive and analytics.

FBDI ZIPs (Journal Import, AP Invoice Import, Receipt Import, Supplier Import, Customer Import) for direct Oracle Fusion loading.

REST API JSON payloads for incremental real-time syncs into Fusion.

CSV and Avro for downstream BI and data-warehouse loads.

Hash-signed manifest files for every output describing record counts, sum totals (premium, paid-loss, ceded amounts) and reconciliation keys so auditors can verify the extract without trusting the tool.
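A hash-signed manifest can be illustrated with a small sketch. It assumes an HMAC-SHA256 signature over a canonical JSON body; the field names, the sample rows and the build_manifest and verify_manifest helpers are hypothetical and are not the tool's actual manifest format.

```python
import hashlib
import hmac
import json
from decimal import Decimal


def _canonical(body: dict) -> bytes:
    # Stable serialization so the same body always yields the same signature.
    return json.dumps(body, sort_keys=True, separators=(",", ":")).encode("utf-8")


def build_manifest(rows, payload: bytes, signing_key: bytes) -> dict:
    """Summarize an extract (counts, control totals, payload digest) and sign the summary."""
    body = {
        "record_count": len(rows),
        "totals": {
            # Decimal keeps currency sums exact; totals are stored as strings.
            "premium": str(sum((Decimal(r["premium"]) for r in rows), Decimal("0"))),
            "paid_loss": str(sum((Decimal(r["paid_loss"]) for r in rows), Decimal("0"))),
        },
        "payload_sha256": hashlib.sha256(payload).hexdigest(),
    }
    signature = hmac.new(signing_key, _canonical(body), hashlib.sha256).hexdigest()
    return {**body, "signature": signature}


def verify_manifest(manifest: dict, signing_key: bytes) -> bool:
    body = {k: v for k, v in manifest.items() if k != "signature"}
    expected = hmac.new(signing_key, _canonical(body), hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, manifest["signature"])


# Hypothetical extract rows and key for illustration only.
rows = [
    {"premium": "1200.50", "paid_loss": "0.00"},
    {"premium": "830.25", "paid_loss": "415.10"},
]
key = b"demo-signing-key"
manifest = build_manifest(rows, b"raw extract bytes", key)
tampered = {**manifest, "record_count": 3}
```

HMAC requires the verifier to hold the same secret key as the producer. For auditors who should not have to trust the tool at all, an asymmetric signature would be the more natural choice; HMAC is used here only because it is available in the standard library.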

Majesco / Sapiens Data Extraction Tool — Data Lake + IDIT + REST + JDBC

Why pick a purpose-built Majesco / Sapiens data extraction tool over a one-off pipeline

Why a purpose-built tool wins

What the Majesco / Sapiens data extraction tool actually does: six core capabilities

Majesco Data Lake

Sapiens IDIT + ALIS data services

REST API incremental sync

Legacy on-prem JDBC

Attachment streaming

Signed manifest output

How the Majesco / Sapiens data extraction tool runs, end to end

Source profile setup — Day 1

Schema discovery — Day 1-2

Bulk extract (initial) — Day 2-10

Incremental sync (scheduled) — Day 10 onward

Downstream routing — Day 10 onward

Reconciliation & manifest — Every extract

Output destinations — one extract, many targets

Oracle Fusion FBDI

Compliance archive

Real-time REST sync

BI / data warehouse

Reinsurance bordereaux

SIU & legal hold

