Dark Mode

Home

Data Categories

Synthetic Data for AI & Machine Learning

Zalingo Synthetic Finance — Premium Evaluation Kit - 1M Rows

ALITA Therapeutics Ltd

Licensed LLM Data Provider

£1999

Zalingo Synthetic Finance — Premium Evaluation Kit - 1M Rows

Name: Zalingo Synthetic Finance — Premium Evaluation Kit - 1M Rows
Creator: ALITA Therapeutics Ltd
Published: 2025-09-08T19:21:58.856Z
License: https://docs.opendatabay.com/ai-training-and-model-development-licenses/general-ai-training-and-fine-tuning-data-license

Synthetic Tabular Data

Tags and Keywords

Synthetic

Data

Finance

Fraud

Detection

Chargebacks

Authorization

Card-not-present

Cross-border

Fx

Fees

Risk

Scoring

Benchmark

Parquet

Notebooks

Rows

Pii-safe

Anonymised

Time

Series

1m

Onemillion

1million

Zalingo Synthetic Finance — Premium Evaluation Kit - 1M Rows Dataset on Opendatabay data marketplace

"No reviews yet"

£1,999

About

Zalingo Synthetic Finance — Premium Evaluation Kit (Fraud • Authorization • Cross-Border) — 1M Rows + Notebooks

A premium, end-to-end evaluation kit for payments analytics and risk modelling. You get ~1,000,000 privacy-safe synthetic transactions spanning card-not-present (CNP) fraud, authorization outcomes, and cross-border/FX fees, plus Jupyter notebooks and data dictionaries—so teams can benchmark pipelines and models quickly without handling real cardholder data (no PII).

Scaling up? After purchase, message us about enterprise bundles (tens of millions of rows) and weekly/daily refresh subscriptions delivered via S3/API.

What’s Inside

Data (Parquet, Snappy): ~1,000,000 rows, partitioned by date/merchant/MCC; includes labels & precomputed features.
Notebooks: EDA & quality, feature engineering (velocity/geo/device/merchant/graph-lite), and baseline models with ROC/PR & cost curves.
Docs & Schema: Data dictionary, label policy, quick-start guide, JSON schema examples.

Key Fields (representative)

Core: transaction_id, account_id, ts_utc, amount, currency, channel (pos|ecom|atm|transfer), mcc, merchant_id, merchant_country, device_fingerprint, ip_country, user_agent.
Authorization: three_ds_result, avs_result, cvv_result, auth_result, decline_reason_code.
Velocity: txn_ct_15m/1h/24h/7d, amount_sum_1h/24h, unique_merchant_ct_7d.
Geo/Behavior: distance_km_billing_shipping, first_time_merchant_flag, recurring_flag, coupon_used.
Cross-Border & FX: is_cross_border, home_currency, fx_rate_used, fx_markup_bps, fee_total.
Merchant/MCC: merchant_risk_tier, mcc_group, merchant_country_risk_index.
Graph-lite: shared_device_ct_7d, shared_ip_ct_7d, hub_account_flag (synthetic).
Labels & Scores: fraud_label (0/1), chargeback_flag (0/1), risk_score_0_1, chargeback_reason_code.

Distribution

Format: ZIP with /data (Parquet), /notebooks, /docs, /schema.
Volume: ~1,000,000 rows, 25–45 columns, multi-part Parquet.
Approx Size: 60–150 MB zipped (category-dependent).
Partitioning: by event_date / merchant_id / mcc_group for efficient reads.

Usage

Fraud & chargeback modelling (baselines, cost curves, feature ablations).
Authorization optimisation (AVS/3DS policy experiments, threshold tuning).
Cross-border/FX (markup benchmarking, margin sensitivity).
Pipeline QA & MLOps (schema contracts, drift monitors, dashboards).
Education & enablement (hands-on exercises without compliance hurdles).

Coverage

Geographic: Multi-country synthetic coverage (ISO codes).
Time Range: Recent multi-year synthetic window with weekly/seasonal patterns.
PII: None — fully synthetic; not re-identifiable.

Who Can Use It

Risk/Data Science, Payments/FinOps, Product/Analytics, Vendors/SIs for demos and validation.

Notes / Disclaimers

Not real cardholder data. Not for production credit decisions.
Rates/labels/fees are synthetic calibrated distributions and do not reflect any specific issuer/acquirer/PSP.

Evaluation License (Non-Production, Internal Use Only — 90 Days) Buyer is granted a non-exclusive, non-transferable license to use the data and included assets solely for internal evaluation, prototyping, and testing for 90 days from purchase. No production use, external distribution, resale, sublicensing, or sharing beyond Buyer’s employees and on-site contractors under NDA. Derived models/features may be retained for internal research; production deployment requires a separate enterprise license. All materials are provided “as is” without warranties; liability limited to the amount paid.

Listing Stats

VIEWS

DELIVERY

INSTANT DOWNLOAD

LISTED

08/09/2025

UPDATED

12/09/2025

REGION

GLOBAL

QUALITY

5 / 5

£1,999

Download Dataset in ZIP Format

Recommended Datasets

Loading recommendations...