Opendatabay APP
Data provider Zalingo Data Refinery banner image on Opendatabay marketplace

Zalingo Data Refinery

Verified Icon

Verified Data Provider

Get In touch with Zalingo Data Refinery

Details

Location

128 City Road, London, EC1V2NX

Joined

11/09/2025

Response time

Instant

Twitter

Not Provided

LinkedIn
https://www.l...

About

Zalingo AI (ALITA Therapeutics Ltd) — Synthetic Data Refinery (UK) PII-safe, high-fidelity synthetic datasets for rapid analytics, ML prototyping, and pipeline validation — without access hurdles.

What we offer

  • Multi-industry coverage: Finance, Healthcare, Manufacturing, Retail, Technology, User Behaviour.
  • Clean schemas & docs: stable columns, data dictionaries, preview samples.
  • Fast time-to-value: parquet-first, query-friendly partitions, optional notebooks in premium kits.
  • Ethics & privacy: 100% synthetic; no PII/PHI, not derived from real individuals or any single facility.

Tiers & Pricing

  • Core Samples — £249 100k-row general samples per industry. Great for schema checks, dashboards, and proof-of-concepts.
  • Focused Samples — £749 100k-row, use-case–specific datasets with labels/signals (e.g., CNP Fraud & Chargebacks, Readmission & LOS, PdM & Downtime RCA, Price Elasticity & Promo Uplift, SaaS Churn & Retention, Engagement & Conversion Signals).
  • Premium Evaluation Kits — £1,999 ~1,000,000 rows + Jupyter notebooks + data dictionary & schema for end-to-end evaluation. (Available for Finance, Healthcare, Manufacturing, Retail, Technology (SaaS), User Behaviour; others on request.)

Sector Catalog (examples)

  • Finance: card/e-com transactions, auth outcomes, cross-border/FX fields, fraud/chargeback labels, velocity & device/IP features.
  • Healthcare: encounters with ICD/CPT-like codes, meds, labs, vitals, utilization; labels for 30-day readmission and LOS.
  • Manufacturing: machine telemetry (temp/vibration/rpm/power), production & downtime events, PdM/RUL, OEE, scrap/quality, energy.
  • Retail: basket & item-level POS/e-com, promos/markdowns, elasticity & uplift signals, loyalty/returns.
  • Technology (SaaS): product usage & API telemetry, features, errors/latency, plan tiers, churn/retention labels.
  • User Behaviour: web/mobile/omni events with attribution, funnels/cohorts, engagement metrics, propensity & CLV proxies.

Delivery & Format

  • Instant download ZIPs with Parquet (Snappy) and optional CSV previews.
  • Partitions by date/entity for efficient reads (works with Pandas/Polars/Spark).
  • Refresh cadence: Monthly by default; Weekly/Daily available via subscription (S3/API).

Licensing

  • Core/Focused: Proprietary — internal use, no redistribution/resale.
  • Premium Kits: Evaluation License (90-day, non-production); upgrade to enterprise for production use and wider rights. (License texts included inside each listing.)

Support & Quality

  • Response: within 1 business day.

  • Fixes: material schema/data issues within 5 business days.

  • Upgrade credit: move from sample/focused to enterprise within 60 days and receive credit.

  • Disclaimers:

    • Healthcare: not real patient data; not for clinical decision-making.
    • Finance: not real cardholder data; not for production credit decisions.
    • General: synthetic, calibrated distributions; not representative of any specific company or facility; no targeting of individuals.

How to choose

  • Just evaluating pipelines? Start with a Core Sample (£249).
  • Have a clear use-case? Pick the Focused Sample (£749) that matches it.
  • Need end-to-end benchmarking? Select a Premium Evaluation Kit (£1,999) with notebooks & docs.

Questions or custom needs (schema tweaks, languages, extra labels, delivery via S3/API)? Message us via the marketplace or email joshua@alita-therapeutics.com.

Statistics

Items

21

Total Downloads

0

Total Dataset Views

21

Data Products

Explore data collections and datasets from Zalingo Data Refinery