Opendatabay APP

Global Central Bank Discourse Dataset

Government & Civic Records

Tags and Keywords

Investing

Government

Nlp

Text

Speeches

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Global Central Bank Discourse Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset offers a valuable collection of speeches from senior central bankers, sourced from various influential central banks globally. Central banks are pivotal institutions that determine monetary policy, significantly influencing financial markets. Consequently, their speeches are closely observed by market participants. This free dataset provides rich textual data for analysis, covering a period from 1997 until 2022, with some records dating back to 1900. It is deemed high quality with a rating of 5 out of 5.

Columns

  • speech reference: A unique identifier for each speech.
  • country: Specifies the country or economic area to which the central bank or speaker belongs.
  • date: The exact date the speech was delivered.
  • title: The official title of the speech.
  • author: The name of the central banker who delivered the speech.
  • is_gov: A binary indicator (0 or 1) denoting whether the speaker is a central bank governor.
  • text: The full textual content of the speech.

Distribution

The data files are typically in CSV format. The collection spans from 1900-01-01 to 2022-11-10. While the primary corpus runs from 1997 to 2022, the full range of records includes:
  • 1 record from 01/01/1900 to 04/15/1912.
  • 209 records from 01/01/1986 to 04/15/1998.
  • 3,470 records from 04/15/1998 to 07/28/2010.
  • 4,041 records from 07/28/2010 to 11/10/2022. Geographically, the euro area represents 30% of the speeches, the United States accounts for 20%, and other regions make up 49% of the dataset. Information regarding whether a speaker is a governor (is_gov column) is available for 7,692 unique entries, with approximately 2,681 instances indicating a governor and 5,040 instances indicating a non-governor.

Usage

This dataset is ideal for:
  • Monetary policy analysis: Gaining insights into central bank strategies and economic perspectives.
  • Financial market prediction: Developing models that incorporate central bank communications.
  • Natural Language Processing (NLP): Training language models on economic and financial text.
  • Academic research: Studying the evolution of central bank communication and its impact.
  • Sentiment analysis: Assessing the tone and outlook conveyed in official speeches.

Coverage

  • Geographic Scope: Global, with speeches from influential central banks worldwide. Notable representation from the euro area and the United States.
  • Time Range: The main corpus covers 1997 to 2022, with some records extending back to 1900 and up to November 2022.
  • Speaker Scope: Speeches from senior central bankers, including those holding the position of governor.

License

CC BY-NC-SA

Who Can Use It

This dataset is particularly useful for:
  • Financial Analysts: To inform investment strategies and market forecasts.
  • Economists: For researching macroeconomic trends and policy effectiveness.
  • Data Scientists & NLP Practitioners: For building and refining models related to economic language and financial sentiment.
  • Academics & Researchers: For studies on central banking, public policy, and economic communication.
  • Policymakers: To understand global monetary policy discourse.

Dataset Name Suggestions

  • Central Bank Speeches: A Global Collection (1997-2022)
  • Global Central Bank Discourse Dataset
  • Monetary Policy Speeches Corpus
  • Central Banker Communications Archive

Attributes

Original Data Source: Central Bank Speeches

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

17/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free