Synthetic Colon Cancer Patient Treatment Dataset
Patient Health Records & Digital Health
Related Searches
Trusted By




"No reviews yet"
£19.99
About
The Synthetic Colon Cancer Treatment Dataset has been created for educational and research purposes to support the analysis of treatment responses, patient characteristics, and survival outcomes in colon cancer. This synthetic, anonymized dataset provides rich details on clinical interventions, pathological findings, and patient survival statuses.
Dataset Features
- ID: Unique identifier for each patient.
- Study: Study or trial group identifier.
- RX (Treatment): Type of treatment received (e.g., Lev+5FU, Obs).
- Sex: Biological sex of the patient (M/F).
- Age: Age of the patient at diagnosis (in years).
- Obstruct: Indicates if the tumor caused bowel obstruction (Yes/No).
- Perfor: Indicates if the tumor caused bowel perforation (Yes/No).
- Adhere: Indicates tumor adhesion to surrounding structures (Yes/No).
- Nodes: Number of positive lymph nodes.
- Status: Vital status of the patient (Alive/Dead).
- Differ (Differentiation): Histological differentiation of the tumor (e.g., Moderate, Poor).
- Extent: Extent of disease spread (e.g., Distant, Regional).
- Surg: Surgical intervention status (e.g., Surgery, No Surgery).
- Node4: Binary flag indicating whether 4 or more nodes are involved (Yes/No).
- Time: Survival time in days.
- Etype (Event Type): Classification of death or survival outcome (e.g., Cancer Death, Other Death).
Distribution

Usage
This dataset can be used for the following applications:
- Treatment Outcome Research: Analyze the relationship between treatment types and patient survival.
- Predictive Modeling: Develop survival prediction models based on clinical and pathological features.
- Clinical Decision Support: Evaluate which combinations of factors influence prognosis, aiding evidence-based treatment planning.
- Educational Purposes: Enable students and researchers to explore real-world-like oncology datasets with survival outcomes.
Coverage
The dataset is fully anonymized and designed to replicate the complexity of real-world colon cancer cases. It is suitable for both statistical modeling and machine learning applications, particularly in survival analysis, treatment efficacy, and decision-making support tools.
License
CC0 (Public Domain)
Who Can Use It
- Medical Researchers and Oncologists: To evaluate treatment effects and prognostic indicators.
- Data Scientists: To build and test survival analysis models and classification algorithms.
- Healthcare Educators and Students: As a comprehensive dataset for studying outcomes-based oncology data.