Opendatabay APP

Numeric Captcha Recognition Set

Data Science and Analytics

Tags and Keywords

Captcha

Recognition

Vision

Deep

Images

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Numeric Captcha Recognition Set Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

A collection of captcha images paired with their corresponding 6-digit numeric solutions. This resource is designed specifically to aid in the development and evaluation of machine learning algorithms focused on accurate captcha recognition and complex image-to-text tasks. The data is structured for immediate use in training, testing, and validating models.

Columns

  • image_path: Provides the relative file path necessary to locate each individual captcha image file. All paths are unique and valid.
  • solution: The corresponding 6-digit number displayed within the captcha image, serving as the required ground truth label for training and evaluation processes.

Distribution

The overall collection contains 10,000 records. The image files are supported by a CSV file named captcha_data.csv, which has a size of approximately 432.7 kB. The data is pre-partitioned into three distinct subsets to facilitate model development: 6,000 images are designated for training, 2,000 images are reserved for testing model performance, and 2,000 images are intended for validation during the training process.

Usage

Ideal for training models focused on computer vision, deep learning, and transfer learning applications. Specific use cases include building robust systems for optical character recognition (OCR) on distorted text, and evaluating performance in image-to-text classification tasks involving numerical security features.

Coverage

The scope is strictly focused on simulated digital security challenges, specifically 6-digit numeric captchas. The content lacks geographic or time range limitations, as it consists of synthetically generated image data. Data availability is complete for all 10,000 records, ensuring a balanced input for machine learning models.

License

CC0: Public Domain

Who Can Use It

Machine learning engineers developing classification and recognition models; data scientists researching computer vision techniques; academic researchers studying robustness in text recognition systems under noisy image environments.

Dataset Name Suggestions

  • Numeric Captcha Recognition Set
  • 6-Digit Image-to-Text Dataset
  • Captcha Solutions for ML
  • Deep Learning Captcha Images

Attributes

Original Data Source: Numeric Captcha Recognition Set

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

15/11/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in ZIP Format