Market Signal Solutions
Licensed LLM Data Provider
Details
Location
14 Rue des Pins
Joined
04/04/2026
Response time
Instant
About
Who We Are
MarketSignal Solutions is an alternative data provider based in Quebec, Canada, founded by Keven Dube. We build AI-powered financial feature datasets designed for quantitative researchers, algorithmic traders, and systematic portfolio managers.
Our mission is straightforward: deliver production-grade, ML-ready feature matrices that combine real news sentiment with market data — so quants can focus on modeling, not data engineering.
What We Produce
1,442 engineered features per 15-minute batch, 96 batches per day, 24/7 coverage. Built on three fused data layers:
- GDELT Layer (~1,403 cols) — GKG content analysis, geopolitical events, media mentions, 1,256 GCAM sentiment dimensions, entity detection, sector analysis, sentiment velocity, macro themes
- AI Layer (18 cols) — Google Gemini 2.5 Flash sentiment scoring, impact/novelty/controversy scores, narrative classification, binary event flags, AI reasoning
- Price Layer (21 cols) — Multi-source intraday prices from Polygon, Twelve Data, and Yahoo Finance with cross-source consensus metrics
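The listing does not say which cross-source consensus metrics the Price Layer contains. As a minimal sketch of the idea, the snippet below summarizes one 15-minute bar across the three sources using a median price, a source count, and a high-low spread in basis points. The function name, the metric names, and the sample quotes are all hypothetical.

```python
from statistics import median
from typing import Dict, Optional


def consensus(prices: Dict[str, Optional[float]]) -> dict:
    """Summarize one bar across sources (hypothetical metric set, not the vendor's)."""
    available = [p for p in prices.values() if p is not None]
    if not available:
        return {"consensus_price": None, "source_count": 0, "spread_bps": None}
    mid = median(available)
    # Distance between the highest and lowest quote, in basis points of the median.
    spread_bps = (max(available) - min(available)) / mid * 10_000
    return {
        "consensus_price": mid,
        "source_count": len(available),
        "spread_bps": spread_bps,
    }


# Made-up quotes: one source has no bar for this interval.
quotes = {"polygon": 100.00, "twelve_data": 100.10, "yahoo": None}
result = consensus(quotes)
print(result)
```

A median is used here because it tolerates one outlying source better than a mean would. Whether the dataset uses a median, a mean, or something else is not stated.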
Coverage
- Magnificent 7: NVDA, AAPL, TSLA, AMZN, META, MSFT, GOOG
- Time span: January 2026 onward, updated monthly
- Resolution: 15-minute batches, 96 per day, 24/7
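The figure of 96 batches per day follows from 24 × 60 / 15. A short sketch of the expected timestamp grid for one day is below. It assumes batches are aligned to UTC quarter-hours, which the listing does not confirm, and `batch_starts` is a hypothetical helper name.

```python
from datetime import datetime, timedelta, timezone

BATCH_MINUTES = 15


def batch_starts(day: datetime) -> list:
    """Return the start time of every 15-minute batch in the given day."""
    start = day.replace(hour=0, minute=0, second=0, microsecond=0)
    batches_per_day = 24 * 60 // BATCH_MINUTES
    return [start + timedelta(minutes=BATCH_MINUTES * i) for i in range(batches_per_day)]


# Assumption: UTC alignment; the date is arbitrary.
grid = batch_starts(datetime(2026, 1, 5, tzinfo=timezone.utc))
print(len(grid), grid[0].isoformat(), grid[-1].isoformat())
```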
Our Data Pipeline
Dedicated infrastructure processes GDELT global news data every 15 minutes:
- GDELT Ingestion — Three data streams (GKG, Events, Mentions) downloaded and parsed every 15 minutes from the GDELT global feed
- Content Analysis — 1,256 GCAM dimensions applied to each article batch — emotional, thematic, and linguistic scoring across validated lexicons
- AI Enrichment — Google Gemini 2.5 Flash (temperature=0, deterministic) analyzes each batch — 18 structured features per ticker including narrative classification and reasoning
- Price Collection — Multi-source intraday prices from Polygon, Twelve Data, and Yahoo Finance, cross-validated daily post-market
- Assembly — GDELT + AI + Price layers merged into unified 1,442-column rows per 15-minute batch
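The assembly step can be pictured as a join of the three layers on a shared key. The sketch below assumes that key is the pair (batch start, ticker) and that only batches present in all three layers are kept. Neither detail is stated in the listing, and the column names are placeholders generated to match the published layer widths.

```python
# Placeholder column names sized to the published widths (1,403 + 18 + 21 = 1,442).
GDELT_COLS = [f"gdelt_{i}" for i in range(1403)]
AI_COLS = [f"ai_{i}" for i in range(18)]
PRICE_COLS = [f"price_{i}" for i in range(21)]


def assemble(gdelt: dict, ai: dict, price: dict) -> dict:
    """Inner-join three {key: {column: value}} layers on their shared keys."""
    rows = {}
    for key in gdelt.keys() & ai.keys() & price.keys():
        row = {}
        row.update(gdelt[key])
        row.update(ai[key])
        row.update(price[key])
        rows[key] = row
    return rows


key = ("2026-01-05T00:00:00Z", "NVDA")
orphan = ("2026-01-05T00:15:00Z", "NVDA")  # present in the GDELT layer only
gdelt = {key: dict.fromkeys(GDELT_COLS, 0.0), orphan: dict.fromkeys(GDELT_COLS, 0.0)}
ai = {key: dict.fromkeys(AI_COLS, 0.0)}
price = {key: dict.fromkeys(PRICE_COLS, 0.0)}

merged = assemble(gdelt, ai, price)
print(len(merged), len(merged[key]))
```

An inner join drops any batch that is missing from one layer. A pipeline with 24/7 coverage may instead keep such rows and mark the missing layer as empty, for example outside market hours.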
Data integrity is non-negotiable. Every dataset passes automated quality audits before publication: column count validation, date continuity, sentiment coverage, and more.
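Two of the audits named above, column count validation and date continuity, can be sketched in a few lines. This is an illustration only, not the vendor's audit code. The input shape, a time-sorted list of (batch start, column count) pairs, is an assumption.

```python
from datetime import datetime, timedelta, timezone

EXPECTED_COLUMNS = 1442  # width stated in the listing
BATCH = timedelta(minutes=15)


def audit(batches: list) -> list:
    """Return human-readable problems; an empty list means both checks passed."""
    problems = []
    # Column count validation: every batch must have the expected width.
    for stamp, width in batches:
        if width != EXPECTED_COLUMNS:
            problems.append(
                f"{stamp.isoformat()}: expected {EXPECTED_COLUMNS} columns, found {width}"
            )
    # Date continuity: consecutive batches must be exactly 15 minutes apart.
    for (prev, _), (cur, _) in zip(batches, batches[1:]):
        if cur - prev != BATCH:
            problems.append(f"gap or overlap between {prev.isoformat()} and {cur.isoformat()}")
    return problems


t0 = datetime(2026, 1, 5, tzinfo=timezone.utc)
# Made-up input: the second batch is one column short and the third slot is missing.
sample = [(t0, 1442), (t0 + BATCH, 1441), (t0 + 3 * BATCH, 1442)]
issues = audit(sample)
for line in issues:
    print(line)
```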
Why Choose MarketSignal Solutions
Real Sentiment, Not Synthetic. Every sentiment score traces back to actual financial articles analyzed by AI. We process the full text, not just headlines. This is fundamentally different from LLM-generated synthetic sentiment datasets flooding the market.
Unmatched Feature Depth. 1,442 features per 15-minute observation — the deepest intraday sentiment feature space available for the Magnificent 7. No other provider combines GDELT's 1,256 GCAM dimensions with AI scoring and multi-source price data at this resolution.
Production-Grade Feature Engineering. Geopolitical event scoring, sentiment velocity, macro theme detection, entity analysis, sector-level decomposition. These are the features institutional quants actually use.
ML-Ready Format. Clean CSV files. Missing values handled consistently. Drop-in ready for any ML pipeline: scikit-learn, PyTorch, XGBoost, LightGBM.
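As a rough illustration of the drop-in claim, the sketch below reads a CSV into a plain numeric matrix with the standard library. The column names and sample rows are invented, and it assumes that an empty field marks a missing value. The listing does not document the actual missing-value convention.

```python
import csv
import io
import math

# Invented two-row sample; real files are described as 1,442 columns wide.
SAMPLE = """batch_start,ticker,ai_sentiment,price_close
2026-01-05T00:00:00Z,NVDA,0.42,187.5
2026-01-05T00:15:00Z,NVDA,,187.9
"""


def load_features(handle, key_cols=("batch_start", "ticker")):
    """Split a CSV into row keys, feature names, and a float matrix."""
    reader = csv.DictReader(handle)
    feature_names = [c for c in reader.fieldnames if c not in key_cols]
    keys, matrix = [], []
    for row in reader:
        keys.append(tuple(row[c] for c in key_cols))
        # Assumption: an empty field means "missing" and becomes NaN.
        matrix.append(
            [float(row[c]) if row[c] != "" else math.nan for c in feature_names]
        )
    return keys, feature_names, matrix


keys, feature_names, matrix = load_features(io.StringIO(SAMPLE))
print(feature_names, matrix)
```

The resulting list of lists can be passed to `numpy.asarray` or used directly by most estimators. Any non-numeric columns, such as narrative labels or reasoning text, would need separate encoding before this conversion.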
Point-in-Time Correct. Every feature is computed with only information that was available at the time of the batch. No look-ahead bias.
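One way to picture the point-in-time rule is as a filter on publication time: a batch may only use articles published before that batch closes. The sketch below assumes that reading, with the 15-minute window closing at batch start plus 15 minutes. The field names and sample articles are hypothetical.

```python
from datetime import datetime, timedelta, timezone

BATCH = timedelta(minutes=15)


def visible_articles(articles: list, batch_start: datetime) -> list:
    """Keep only articles published strictly before the batch closes."""
    batch_end = batch_start + BATCH
    return [a for a in articles if a["published"] < batch_end]


t0 = datetime(2026, 1, 5, 14, 30, tzinfo=timezone.utc)
articles = [
    {"id": "a", "published": t0 + timedelta(minutes=5)},   # inside this batch
    {"id": "b", "published": t0 + timedelta(minutes=15)},  # belongs to the next batch
    {"id": "c", "published": t0 - timedelta(hours=1)},     # earlier, still visible
]
kept = [a["id"] for a in visible_articles(articles, t0)]
print(kept)
```

The same cutoff would have to apply to any rolling feature, such as sentiment velocity, so that its window never extends past the batch end.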
Products & Pricing
£99.99 per monthly release, per ticker. Each purchase includes all cumulative data from January 2026 through the current month. No subscription: buy only the months you need. Available per ticker: NVDA, AAPL, TSLA, AMZN, META, MSFT, GOOG.
Data Sources & Compliance
All data inputs are sourced from publicly available or licensed sources:
- GDELT Project — Global news articles (free public API)
- Yahoo Finance — Price data (public API)
- Polygon — Intraday price data (licensed API)
- Twelve Data — Intraday price data (licensed API)
- Google Gemini AI — Sentiment analysis (licensed API)
No copyrighted article text is included in any deliverable. Our datasets contain only numerical features derived from article analysis. The raw articles are processed in our pipeline but never distributed.
Disclaimer
These datasets are provided for quantitative research and educational purposes only. They do not constitute investment advice, trading recommendations, or solicitation to buy or sell any security. Past patterns in the data do not guarantee future results. Users are solely responsible for their own investment decisions and should consult qualified financial professionals before trading.
MarketSignal Solutions, Quebec, Canada
Website: marketsignal.solutions

