1,000+ Hours of Nigerian English Interview Videos for AI Training
Foundation Model Datasets
Tags and Keywords
Trusted By




"No reviews yet"
£6,000
About
More Than 1,000 Hours (Growing Daily) of Fully-Consented Real Online Job Interview Video in Nigerian English | AI Training Data | Video+Audio + Timestamp-Aligned Transcripts | Question/Context Descriptions | Global Coverage
For full data, please contact hello@princep.io or visit website.
A large-scale dataset of 1,000+ hours (growing daily) of real online job interview video in Nigerian English, featuring natural, non-scripted interview responses from a highly diverse speaker base across many regions and accent groups — including Africa, Southeast Asia, South Asia, Latin America, North America, Europe, and more.
All participants explicitly opt in (fully consented) for their data to be used for AI training and shared under controlled licensing.
This is a living dataset: new recordings are added daily, so total hours increase over time (current total: 10K+ hours as of Jan 2026).
Each clip is primarily single-speaker (candidate-focused), making it highly valuable for training models on authentic, real-world monologue speech in interview conditions.
Unlike staged or scripted recordings, this dataset captures authentic interview behavior — spontaneous phrasing, pauses, disfluencies, turn-taking, and real-world device variability. The video modality adds valuable capture signals that improve model generalization to production environments.
Key Features
1) Accent Diversity at Scale (Underrepresented Accents Included)
Designed for real-world robustness with broad English accent coverage:
- Diverse regional accents and speech patterns
- Variation in pronunciation, rhythm, speech speed, and vocabulary
- Strong coverage of accents often missing from public datasets (Africa, SEA, South Asia, LatAm)
2) Real Interview Video (Non-Scripted, Natural Speech)
All sessions come from genuine interview-style Q&A, capturing:
- Natural disfluencies (hesitations, self-corrections, fillers)
- Realistic interview pacing and tone
- Authentic response structure under real interview conditions
3) AI-Ready Packaging (Video + Transcript + Context)
Each session can include synchronized assets such as:
- Video with embedded audio (online interview capture)
- Timestamp-aligned transcripts (segment/sentence level)
- Question prompts + context descriptors (question type/category)
- Technical and quality metadata (duration, device/channel signals)
Supports tasks including:
- Audiovisual speech recognition
- Lip/audio alignment research
- Speech modeling
- Multimodal conversational understanding
4) Real-World Capture Conditions
Video reflects realistic online interview environments:
- Mobile and desktop capture
- Consumer-grade device cameras and microphones
- Mostly indoor environments with natural lighting/background variation
- VoIP-style audio characteristics and device differences
5) Fully-Consented & Commercial-Ready
- Explicit opt-in consent for AI training and controlled dataset sharing
- Packaged for smooth integration into ML pipelines and enterprise procurement workflows
6) Continuously Expanding Library (Daily Updates)
- New recordings added every day
- Dataset grows over time
- Current total: 10K+ hours (Jan 2026)
- Updated releases available upon request
Use Cases
- Multimodal AI training (video + audio)
- Audiovisual speech recognition and robustness benchmarking
- ASR / speech-to-text with diverse accents
- Multimodal evaluation under real capture conditions
- Accent robustness testing across regions
Delivery Format (Typical)
- Video files (with embedded audio)
- Timestamp-aligned transcripts
- Question/context descriptors + metadata schema
- Documentation (data card, release manifest, usage notes)
Loading...
£6,000
Download Dataset in Unknown Format
Recommended Datasets
Loading recommendations...
