Zalingo Data Refinery

Verified Data Provider
Details
Location: 128 City Road, London, EC1V 2NX
Joined: 11/09/2025
Response time: Instant
About
Zalingo AI (ALITA Therapeutics Ltd): Synthetic Data Refinery (UK). PII-safe, high-fidelity synthetic datasets for rapid analytics, ML prototyping, and pipeline validation, without access hurdles.
What we offer
- Multi-industry coverage: Finance, Healthcare, Manufacturing, Retail, Technology, User Behaviour.
- Clean schemas & docs: stable columns, data dictionaries, preview samples.
- Fast time-to-value: Parquet-first files, query-friendly partitions, optional notebooks in premium kits.
- Ethics & privacy: 100% synthetic; no PII/PHI, not derived from real individuals or any single facility.
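As an illustration of how a preview sample might be checked against a data dictionary, here is a minimal sketch using only the Python standard library. The column names, the `DATA_DICTIONARY` mapping, the inline preview text, and the `check_preview` helper are all hypothetical; the actual schemas are documented in each listing.

```python
import csv
import io

# Hypothetical data dictionary: column name -> parser used to validate values.
DATA_DICTIONARY = {
    "transaction_id": str,
    "amount": float,
    "is_fraud": int,
}

# Hypothetical CSV preview content; a real preview would be read from disk.
PREVIEW_CSV = """transaction_id,amount,is_fraud
t-001,19.99,0
t-002,250.00,1
"""


def check_preview(csv_text, dictionary):
    """Return a list of human-readable schema problems (empty if none)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames or []
    problems = []
    missing = [c for c in dictionary if c not in columns]
    extra = [c for c in columns if c not in dictionary]
    if missing:
        problems.append(f"missing columns: {missing}")
    if extra:
        problems.append(f"unexpected columns: {extra}")
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        for column, parse in dictionary.items():
            if column in missing:
                continue
            try:
                parse(row[column])
            except (TypeError, ValueError):
                problems.append(f"line {line_no}: bad value in {column!r}")
    return problems


problems = check_preview(PREVIEW_CSV, DATA_DICTIONARY)
print(problems or "preview matches the data dictionary")
```

The same idea carries over to the Parquet files: compare the column names and types reported by the reader against the data dictionary before building anything on top of them.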
Tiers & Pricing
- Core Samples (£249): 100k-row general samples per industry. Great for schema checks, dashboards, and proof-of-concepts.
- Focused Samples (£749): 100k-row, use-case-specific datasets with labels/signals (e.g., CNP Fraud & Chargebacks, Readmission & LOS, PdM & Downtime RCA, Price Elasticity & Promo Uplift, SaaS Churn & Retention, Engagement & Conversion Signals).
- Premium Evaluation Kits (£1,999): ~1,000,000 rows plus Jupyter notebooks, a data dictionary, and a schema for end-to-end evaluation. Available for Finance, Healthcare, Manufacturing, Retail, Technology (SaaS), and User Behaviour; others on request.
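For a quick comparison of the tiers, the arithmetic below works out the price per 1,000 rows from the figures listed above. It is a back-of-the-envelope sketch only: the Premium row count is approximate (~1,000,000), the notebooks and documentation bundled with Premium kits are not valued separately, and the `TIERS` table and `price_per_thousand_rows` helper are illustrative names.

```python
# Prices (GBP) and row counts as stated in the listing.
TIERS = {
    "Core Sample": (249, 100_000),
    "Focused Sample": (749, 100_000),
    "Premium Evaluation Kit": (1_999, 1_000_000),  # row count is approximate
}


def price_per_thousand_rows(price_gbp, rows):
    """Price in GBP for each 1,000 rows of a tier."""
    return price_gbp / (rows / 1_000)


for name, (price, rows) in TIERS.items():
    print(f"{name}: £{price_per_thousand_rows(price, rows):.2f} per 1,000 rows")
```

Per-row price is only one lens: Focused Samples cost more per row because they carry use-case-specific labels and signals, so the comparison is most useful between tiers that serve the same purpose.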
Sector Catalog (examples)
- Finance: card/e-com transactions, auth outcomes, cross-border/FX fields, fraud/chargeback labels, velocity & device/IP features.
- Healthcare: encounters with ICD/CPT-like codes, meds, labs, vitals, utilization; labels for 30-day readmission and LOS.
- Manufacturing: machine telemetry (temp/vibration/rpm/power), production & downtime events, PdM/RUL, OEE, scrap/quality, energy.
- Retail: basket & item-level POS/e-com, promos/markdowns, elasticity & uplift signals, loyalty/returns.
- Technology (SaaS): product usage & API telemetry, features, errors/latency, plan tiers, churn/retention labels.
- User Behaviour: web/mobile/omni events with attribution, funnels/cohorts, engagement metrics, propensity & CLV proxies.
Delivery & Format
- Instant download ZIPs with Parquet (Snappy) and optional CSV previews.
- Partitions by date/entity for efficient reads (works with Pandas/Polars/Spark).
- Refresh cadence: Monthly by default; Weekly/Daily available via subscription (S3/API).
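As a rough sketch of why date partitions make reads efficient, the snippet below selects only the partition files that fall inside a date window before any file is opened. It assumes a Hive-style layout (`date=YYYY-MM-DD/part-*.parquet`); the listing does not specify the actual directory naming, so the layout, the `select_partitions` helper, and the example paths are all hypothetical.

```python
from datetime import date
from pathlib import PurePosixPath


def select_partitions(paths, start, end):
    """Return the Parquet paths whose `date=` partition lies in [start, end].

    Assumes a Hive-style layout such as `date=2025-01-31/part-0.parquet`
    (an assumption, not confirmed by the listing).
    """
    selected = []
    for raw in paths:
        path = PurePosixPath(raw)
        for part in path.parts:
            if part.startswith("date="):
                day = date.fromisoformat(part[len("date="):])
                if start <= day <= end:
                    selected.append(raw)
                break  # only one date partition level is expected
    return selected


# Hypothetical file listing from an unpacked ZIP.
example_paths = [
    "finance/date=2025-01-30/part-0.parquet",
    "finance/date=2025-01-31/part-0.parquet",
    "finance/date=2025-02-01/part-0.parquet",
]

january = select_partitions(example_paths, date(2025, 1, 1), date(2025, 1, 31))
print(january)
```

The resulting list can then be handed to a Parquet reader of choice (Pandas, Polars, or Spark), so only the matching files are read.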
Licensing
- Core/Focused: Proprietary — internal use, no redistribution/resale.
- Premium Kits: Evaluation License (90-day, non-production); upgrade to enterprise for production use and wider rights. (License texts included inside each listing.)
Support & Quality
- Response: within 1 business day.
- Fixes: material schema/data issues within 5 business days.
- Upgrade credit: move from sample/focused to enterprise within 60 days and receive credit.
- Disclaimers:
  - Healthcare: not real patient data; not for clinical decision-making.
  - Finance: not real cardholder data; not for production credit decisions.
  - General: synthetic, calibrated distributions; not representative of any specific company or facility; no targeting of individuals.
How to choose
- Just evaluating pipelines? Start with a Core Sample (£249).
- Have a clear use-case? Pick the Focused Sample (£749) that matches it.
- Need end-to-end benchmarking? Select a Premium Evaluation Kit (£1,999) with notebooks & docs.
Questions or custom needs (schema tweaks, languages, extra labels, delivery via S3/API)? Message us via the marketplace or email joshua@alita-therapeutics.com.
Statistics
Items: 21
Total Downloads: 0
Total Dataset Views: 21