Downloads
Each dataset is available as a labeled Stata .dta and its source file.
⇩ Download all data (ZIP) | stata_codebook.do
| Dataset | Grain | Rows × columns | Stata | Source |
|---|---|---|---|---|
| main_data | country-year | 10,092 × 92 | main_data.dta | main_data.dta |
| WDICountry | country / aggregate | 263 × 30 | WDICountry.dta | WDICountry.csv |
Run stata_codebook.do in Stata once to attach long-form per-variable notes to the .dta files.
Load directly in code
Every file loads straight from GitHub (raw URLs). Swap the file name to load any dataset.
Stata
* Stata 14+ : `use` reads an https URL directly
global BASE "https://raw.githubusercontent.com/cmg777/starter-academic-v501/master/content/post/stata_convergence2/data/"
use "${BASE}main_data.dta", clear
describe
notes
Python
!pip install -q pyreadstat
import pandas as pd
BASE = "https://raw.githubusercontent.com/cmg777/starter-academic-v501/master/content/post/stata_convergence2/data/"
df = pd.read_stata(BASE + "main_data.dta")
# load every dataset at once
files = ["main_data", "WDICountry"]
data = {f: pd.read_stata(BASE + f + ".dta") for f in files}
# pyreadstat (richest metadata) reads LOCAL files -> download first
import pyreadstat, urllib.request
urllib.request.urlretrieve(BASE + "main_data.dta", "main_data.dta")
df, meta = pyreadstat.read_dta("main_data.dta")
Copy and paste this snippet into a Google Colab notebook: https://colab.research.google.com/notebooks/empty.ipynb
R
# R : haven::read_dta auto-downloads an https URL
library(haven)
BASE <- "https://raw.githubusercontent.com/cmg777/starter-academic-v501/master/content/post/stata_convergence2/data/"
df <- read_dta(paste0(BASE, "main_data.dta"))
Overview & sources
Companion data for a hands-on Stata tutorial that reproduces the main findings of Kremer, Willis & You (2021), “Converging to Convergence.” The analysis documents how the cross-country relationship between economic growth and initial income reversed — from divergence in the 1960s to unconditional convergence by the 2000s — and explains the shift through the convergence of growth correlates. It estimates beta-convergence (growth on initial log income) and sigma-convergence (the cross-country spread of log income), year-interacted rolling beta regressions, income-quartile and regional decompositions, and the omitted-variable-bias (OVB) identity that decomposes the gap between unconditional and conditional convergence as β − β* = δ × λ. The data are real: the authors’ replication panel combines Penn World Table 10.0 GDP with 50+ institutional, policy, geographic, and cultural correlates. The analysis is purely descriptive — it documents correlations and trends, not causal effects.
main_data is the analysis panel — one row per country × year, an unbalanced panel of ~160 GDP-reporting countries (174 country codes in all) over 1960–2017, carrying GDP, the 10-year forward growth rate, and the full set of growth correlates. WDICountry is the World Bank country-metadata lookup — one row per country/aggregate — merged onto the panel by ISO3 country code to attach the World Bank region used in the regional decomposition (Section 6.2).
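The merge described above can be sketched without any dependencies. This is an illustration only, not the tutorial's own code: the rows are made up, `attach_region` is a hypothetical helper, and only the column names (`code`, `country_code`, `region`) come from the variable tables. With pandas, the equivalent would be a left `merge` with `left_on="code"` and `right_on="country_code"`.

```python
# Toy rows standing in for main_data (panel) and WDICountry (lookup).
# Column names follow the variable tables; the values are made up.
panel = [
    {"code": "BRA", "year": 1990, "loggdp": 8.9},
    {"code": "BRA", "year": 2000, "loggdp": 9.1},
    {"code": "KEN", "year": 1990, "loggdp": 7.6},
    {"code": "XXX", "year": 1990, "loggdp": 8.0},  # code absent from the lookup
]
lookup = [
    {"country_code": "BRA", "region": "Latin America & Caribbean"},
    {"country_code": "KEN", "region": "Sub-Saharan Africa"},
    {"country_code": "WLD", "region": ""},  # aggregate row: blank region
]


def attach_region(panel_rows, lookup_rows):
    """Left-join region onto the panel by ISO3 code (many-to-one)."""
    region_by_code = {
        row["country_code"]: row["region"]
        for row in lookup_rows
        if row["region"]  # skip aggregates, whose region is blank
    }
    return [
        {**row, "region": region_by_code.get(row["code"])}  # None if unmatched
        for row in panel_rows
    ]


merged = attach_region(panel, lookup)
unmatched = sorted({r["code"] for r in merged if r["region"] is None})
print(unmatched)
```

Listing the unmatched codes after the join is a cheap check that no panel country silently loses its region before the regional decomposition.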
Data sources
| Source | Provides | Reference / URL |
|---|---|---|
| Kremer, Willis & You (2021) | Replicated study; the replication panel (main_data) and its variable set | Kremer, M., Willis, J., & You, Y. (2021). Converging to Convergence. NBER Working Paper 29484. https://www.nber.org/papers/w29484 |
| Penn World Table 10.0 | Real GDP per capita levels and the 10-year forward growth rate (loggdp, loggdp_growth_10) | Feenstra, R. C., Inklaar, R., & Timmer, M. P. (2015). The Next Generation of the Penn World Table. American Economic Review, 105(10), 3150-3182. https://www.rug.nl/ggdc/productivity/pwt/ |
| World Bank WDI | Most growth correlates (investment, inflation, schooling, credit, taxes, openness) and the country/region lookup (WDICountry) | World Bank, World Development Indicators. https://databank.worldbank.org/source/world-development-indicators |
| Other correlate sources | Institutions, geography and culture correlates merged into the panel | Polity IV (democracy); Freedom House (political rights/civil liberties); Worldwide Governance Indicators; Heritage Foundation Index of Economic Freedom; Hofstede VSM (culture); Acemoglu-Johnson-Robinson settler-mortality data; La Porta et al. (legal origin); Barro-Lee (schooling). |
| Method references | Convergence concepts and estimators | Barro (1991); Barro & Sala-i-Martin (1992); Patel, Sandefur & Subramanian (2021); Durlauf, Johnson & Temple (2005). |
Cite this data
Please cite this dataset as follows.
APA
Mendez, C. (2026). Converging to Convergence: Understanding the Main Ideas of the Convergence Literature [Data set]. https://carlos-mendez.org/post/stata_convergence2/
Kremer, M., Willis, J., & You, Y. (2021). Converging to Convergence. NBER Working Paper 29484.
Barro, R., & Sala-i-Martin, X. (1992). Convergence. Journal of Political Economy, 100(2), 223-251.
Feenstra, R. C., Inklaar, R., & Timmer, M. P. (2015). The Next Generation of the Penn World Table. American Economic Review, 105(10), 3150-3182.
BibTeX
@misc{mendez2026stataconvergence2,
author = {Mendez, Carlos},
title = {Converging to Convergence: Understanding the Main Ideas of the Convergence Literature},
year = {2026},
howpublished = {\url{https://carlos-mendez.org/post/stata_convergence2/}},
note = {Data set}
}
@techreport{kremer2021converging,
author = {Kremer, Michael and Willis, Jack and You, Yang},
title = {Converging to Convergence},
institution = {National Bureau of Economic Research},
type = {NBER Working Paper}, number = {29484}, year = {2021}
}
@article{barro1992convergence,
author = {Barro, Robert J. and Sala-i-Martin, Xavier},
title = {Convergence},
journal = {Journal of Political Economy},
volume = {100}, number = {2}, pages = {223--251}, year = {1992}
}
@article{feenstra2015pwt,
author = {Feenstra, Robert C. and Inklaar, Robert and Timmer, Marcel P.},
title = {The Next Generation of the Penn World Table},
journal = {American Economic Review},
volume = {105}, number = {10}, pages = {3150--3182}, year = {2015}
}
Variable explorer (all 122 variables)
| Variable | Type | Label | Definition | Units | In files | Source |
|---|---|---|---|---|---|---|
| FH_civil_liberties | continuous | Freedom House civil liberties (1-7) | Freedom House civil-liberties rating (1 = most free, 7 = least free). | 1-7 rating | main_data | Freedom House |
| FH_political_rights | continuous | Freedom House political rights (1-7) | Freedom House political-rights rating (1 = most free, 7 = least free); a short-run institutional correlate. | 1-7 rating | main_data | Freedom House |
| VSM_individualism | continuous | Individualism Index (Hofstede) | Hofstede culture dimension: individualism vs. collectivism. | 0-100 index | main_data | Hofstede VSM |
| VSM_indulgence | continuous | Long-Term Orientation Index (Hofstede) | Hofstede culture dimension. NOTE: the embedded .dta labels for VSM_indulgence and VSM_longterm are swapped at source; this column is the long-term-orientation dimension. | 0-100 index | main_data | Hofstede VSM |
| VSM_longterm | continuous | Indulgence vs. Restraint Index (Hofstede) | Hofstede culture dimension. NOTE: the embedded .dta labels for VSM_longterm and VSM_indulgence are swapped at source; this column is the indulgence/restraint dimension. | 0-100 index | main_data | Hofstede VSM |
| VSM_masculinity | continuous | Masculinity Index (Hofstede) | Hofstede culture dimension: masculinity vs. femininity. | 0-100 index | main_data | Hofstede VSM |
| VSM_power_dist | continuous | Power Distance Index (Hofstede) | Hofstede culture dimension: acceptance of unequal power distribution. | 0-100 index | main_data | Hofstede VSM |
| VSM_uncertain_avoid | continuous | Uncertainty Avoidance Index (Hofstede) | Hofstede culture dimension: tolerance for uncertainty/ambiguity. | 0-100 index | main_data | Hofstede VSM |
| WDI_GDP_pc | continuous | GDP per capita (WDI, level) | GDP per capita level from World Bank WDI. | US$ | main_data | World Bank WDI |
| WDI_credit | continuous | Domestic credit, alt. (% of GDP) | Alternative WDI credit series used in the multivariate convergence set. (Embedded .dta label is a copy artifact.) | % of GDP | main_data | World Bank WDI |
| WDI_primary | continuous | Primary enrollment (% gross) | Gross primary-school enrollment ratio. | % gross | main_data | World Bank WDI |
| WDI_secondary | continuous | Secondary enrollment (% gross) | Gross secondary-school enrollment ratio. | % gross | main_data | World Bank WDI |
| WDI_tertiary | continuous | Tertiary enrollment (% gross) | Gross tertiary-school enrollment ratio. | % gross | main_data | World Bank WDI |
| WGI_control_corruption | continuous | Control of corruption (WGI) | Worldwide Governance Indicators: control-of-corruption estimate. | z-score (~ -2.5 to 2.5) | main_data | World Bank WGI |
| WGI_gov_effectiveness | continuous | Government effectiveness (WGI) | Worldwide Governance Indicators: government-effectiveness estimate. | z-score (~ -2.5 to 2.5) | main_data | World Bank WGI |
| WGI_pol_stability | continuous | Political stability & absence of violence (WGI) | Worldwide Governance Indicators: political-stability / absence-of-violence estimate. | z-score (~ -2.5 to 2.5) | main_data | World Bank WGI |
| WGI_regulatory_quality | continuous | Regulatory quality (WGI) | Worldwide Governance Indicators: regulatory-quality estimate. | z-score (~ -2.5 to 2.5) | main_data | World Bank WGI |
| WGI_rule_law | continuous | Rule of law (WGI) | Worldwide Governance Indicators: rule-of-law estimate. | z-score (~ -2.5 to 2.5) | main_data | World Bank WGI |
| alternative_conversion_factor | identifier | Alternative conversion factor | Periods using an alternative currency-conversion factor. | string | WDICountry | World Bank WDI |
| autoc | continuous | Institutionalized autocracy (0-10) | Polity autocracy sub-index. Special codes -66/-77/-88 denote non-scores, not values. | 0-10 (+ Polity special codes) | main_data | Polity IV |
| avelf | continuous | Ethno-linguistic fractionalization | Index of ethno-linguistic fractionalization (probability two random people differ). | 0-1 index | main_data | Fractionalization data |
| avexpr | continuous | Protection against expropriation risk | Average index of protection against expropriation risk (institutional quality). | 0-10 index | main_data | Acemoglu, Johnson & Robinson (2001) |
| balance_of_payments_manual_in_us | identifier | Balance of Payments Manual in use | IMF BPM edition used for balance-of-payments data. | category | WDICountry | World Bank WDI |
| barrolee2060 | continuous | Average years of schooling (Barro-Lee) | Mean years of schooling, population aged 20-60; the human-capital Solow fundamental. | years | main_data | Barro-Lee |
| businessfreedom | continuous | Business freedom score | Heritage Foundation business-freedom component. | 0-100 | main_data | Heritage Foundation |
| code | identifier | Country code (ISO3) | Three-letter ISO country code; the panel identifier and the merge key to WDICountry. | ISO3 string | main_data | Replication panel / WDI |
| country | identifier | Country name | Country name corresponding to the ISO3 code. | string | main_data | Replication panel |
| country_code | identifier | Country code (ISO3) | Three-letter ISO country code; the lookup key (matches main_data code). | ISO3 string | WDICountry | World Bank WDI |
| country_id | identifier | Numeric country id | Numeric panel id (Stata egen group(code)); used as the panel/cluster unit. | integer id | main_data | Derived (this study) |
| credit | continuous | Domestic credit by financial sector (% of GDP) | Domestic credit provided by the financial sector, share of GDP; financial-development correlate. | % of GDP | main_data | World Bank WDI |
| currency_unit | identifier | Currency unit | Name of the national currency. | string | WDICountry | World Bank WDI |
| democ | continuous | Institutionalized democracy (0-10) | Polity democracy sub-index. Special codes -66 (interruption), -77 (interregnum), -88 (transition) denote non-scores, not values. | 0-10 (+ Polity special codes) | main_data | Polity IV |
| edes1975 | continuous | European descent, 1975 (%) | Percent of population of European descent in 1975. | % | main_data | Historical sources |
| edugap | continuous | Education gender gap | Gap in schooling between men and women (used in the multivariate convergence set). | years | main_data | Barro-Lee (derived) |
| euro1900 | continuous | European settlers, 1900 (share) | Share of European settlers around 1900. | 0-1 share | main_data | Historical sources |
| external_debt_reporting_status | identifier | External debt reporting status | Status of the economy's external-debt reporting. | category | WDICountry | World Bank WDI |
| financialfreedom | continuous | Financial freedom score | Heritage Foundation financial-freedom component. | 0-100 | main_data | Heritage Foundation |
| fuel_exports | continuous | Fuel exports (% of merchandise exports) | Fuel exports as a share of merchandise exports. | % of exports | main_data | World Bank WDI |
| fuel_mineral_exports | continuous | Fuel + ore & metals exports (% of merch. exports) | Combined fuel and ores/metals exports as a share of merchandise exports. | % of exports | main_data | World Bank WDI |
| g7089n | continuous | Growth in neighboring countries (1970-89) | Average growth of neighboring countries, 1970-1989 (spatial spillover control). | % per year | main_data | Replication panel (derived) |
| goldm | dummy | Natural resources - gold (dummy) | Indicator for significant gold endowment. | 0/1 | main_data | Geological resource data |
| gov_spending | continuous | Government consumption (% of GDP) | General government final consumption expenditure as a share of GDP. | % of GDP | main_data | World Bank WDI |
| government_accounting_concept | identifier | Government accounting concept | Budgetary vs. consolidated central-government accounting concept. | category | WDICountry | World Bank WDI |
| governmentintegrity | continuous | Government integrity score | Heritage Foundation government-integrity component. | 0-100 | main_data | Heritage Foundation |
| igov_rpppPercentofGDP | continuous | Public investment (% of GDP) | General-government investment as a share of GDP (IMF igov_rppp). | % of GDP | main_data | IMF Investment & Capital Stock dataset |
| imf_data_dissemination_standard | identifier | IMF data dissemination standard | IMF data-dissemination subscription (SDDS/GDDS/e-GDDS). | category | WDICountry | World Bank WDI |
| inc_tax | continuous | Taxes on income/profits/capital gains (% of revenue) | Share of government revenue from taxes on income, profits and capital gains. | % of revenue | main_data | World Bank WDI |
| income_group | identifier | World Bank income group | WDI income classification (Low / Lower-middle / Upper-middle / High income). | category | WDICountry | World Bank WDI |
| inflation | continuous | Inflation (GDP deflator / CPI, annual %) | Annual inflation rate; a short-run macro-policy correlate. (Embedded .dta label is a copy artifact; the variable is inflation.) | % per year | main_data | World Bank WDI |
| investment | continuous | Gross capital formation (% of GDP) | Investment share of GDP; a Solow fundamental (steady-state determinant). | % of GDP | main_data | World Bank WDI |
| investmentfreedom | continuous | Investment freedom score | Heritage Foundation investment-freedom component. | 0-100 | main_data | Heritage Foundation |
| ipriv_rpppPercentofGDP | continuous | Private investment, IMF (% of GDP) | Private investment as a share of GDP (IMF ipriv_rppp). | % of GDP | main_data | IMF Investment & Capital Stock dataset |
| iron | dummy | Natural resources - iron (dummy) | Indicator for significant iron endowment. | 0/1 | main_data | Geological resource data |
| iso2_code | identifier | ISO 2-letter code | Two-letter ISO country code. | ISO2 string | WDICountry | World Bank WDI |
| laborparti | continuous | Labor force participation rate (%) | Total labor-force participation rate, population 15+ (modeled ILO estimate). | % of 15+ population | main_data | World Bank WDI / ILO |
| landlock | dummy | Landlocked (dummy) | 1 if the country is landlocked, else 0. | 0/1 | main_data | Geographic data |
| lat_abst | continuous | Absolute latitude (0-1) | Absolute latitude of the country, scaled 0-1 (distance from the equator). | 0-1 (\|lat\|/90) | main_data | Geographic data |
| latest_agricultural_census | identifier | Latest agricultural census | Year(s) of the latest agricultural census. | year (string) | WDICountry | World Bank WDI |
| latest_household_survey | identifier | Latest household survey | Latest household survey (type and year). | string | WDICountry | World Bank WDI |
| latest_industrial_data | continuous | Latest industrial data (year) | Year of the latest industrial data. | year | WDICountry | World Bank WDI |
| latest_population_census | identifier | Latest population census | Year of the latest population census. | year (string) | WDICountry | World Bank WDI |
| latest_trade_data | continuous | Latest trade data (year) | Year of the latest trade data. | year | WDICountry | World Bank WDI |
| legor_fr | dummy | Legal origin - France (dummy) | 1 if the country's legal system is of French civil-law origin. | 0/1 | main_data | La Porta et al. (legal origin) |
| legor_ge | dummy | Legal origin - Germany (dummy) | 1 if the country's legal system is of German civil-law origin. | 0/1 | main_data | La Porta et al. (legal origin) |
| legor_sc | dummy | Legal origin - Scandinavia (dummy) | 1 if the country's legal system is of Scandinavian origin. | 0/1 | main_data | La Porta et al. (legal origin) |
| legor_so | dummy | Legal origin - Socialist (dummy) | 1 if the country's legal system is of socialist origin. | 0/1 | main_data | La Porta et al. (legal origin) |
| legor_uk | dummy | Legal origin - UK (dummy) | 1 if the country's legal system is of British (common-law) origin. | 0/1 | main_data | La Porta et al. (legal origin) |
| lending_category | identifier | World Bank lending category | IDA/IBRD/Blend lending category. | category | WDICountry | World Bank WDI |
| logcgdpo | continuous | Log real GDP (output, current PPP) | Log real GDP per capita, PWT output-side series at current PPPs (cgdpo). | log US$ | main_data | Penn World Table 10.0 |
| logem4 | continuous | Log settler mortality | Log of early-settler mortality (Acemoglu-Johnson-Robinson institutions instrument). | log deaths per 1,000 | main_data | Acemoglu, Johnson & Robinson (2001) |
| loggdp | continuous | Log GDP per capita (PWT) | Natural log of real GDP per capita from Penn World Table 10.0; the initial-income regressor in convergence regressions. | log US$ (PPP) | main_data | Penn World Table 10.0 |
| loggdp_WDI | continuous | Log GDP per capita (WDI) | Natural log of GDP per capita from World Bank WDI (alternative income series). | log US$ | main_data | World Bank WDI |
| loggdp_growth_10 | continuous | 10-year forward GDP-pc growth (annualized %) | Forward-looking 10-year annualized growth rate of GDP per capita; the main outcome variable. | % per year | main_data | Penn World Table 10.0 (derived) |
| loggdp_pw | continuous | Log GDP per capita (PWT variant) | Alternative log GDP-per-capita measure used in robustness checks. | log US$ | main_data | Penn World Table 10.0 |
| logrgdpe | continuous | Log real GDP (expenditure side) | Log real GDP per capita, PWT expenditure-side series (rgdpe). | log US$ | main_data | Penn World Table 10.0 |
| logrgdpna | continuous | Log real GDP (national accounts) | Log real GDP per capita, PWT national-accounts growth series (rgdpna); base for the 10-year growth rate. | log US$ | main_data | Penn World Table 10.0 |
| logrgdpo | continuous | Log real GDP (output side) | Log real GDP per capita, PWT output-side series (rgdpo). | log US$ | main_data | Penn World Table 10.0 |
| logrgdpo_91 | continuous | Log real GDP (output side, 1991 base) | Log real GDP per capita, PWT output-side series anchored to a 1991 base. | log US$ | main_data | Penn World Table 10.0 |
| long_name | identifier | Official (long) country name | Full official name of the economy. | string | WDICountry | World Bank WDI |
| lt100km | continuous | Population within 100km of coast | Share of population living within 100km of the coast. | 0-1 share | main_data | Geographic data |
| lvaw_garriga | continuous | Central bank independence (Garriga, weighted) | Central-bank-independence index, Garriga (2016), weighted version. | 0-1 index | main_data | Garriga (2016) |
| meantemp | continuous | Mean temperature | Country mean temperature (long-run geographic correlate). | degrees Celsius | main_data | Geographic data |
| miliexp | continuous | Military expenditure (% of GDP) | Military spending as a share of GDP. | % of GDP | main_data | World Bank WDI |
| mineral_exports | continuous | Ores & metals exports (% of merchandise exports) | Ores and metals exports as a share of merchandise exports. | % of exports | main_data | World Bank WDI |
| national_accounts_base_year | identifier | National accounts base year | Base year of the national-accounts series. | year (string) | WDICountry | World Bank WDI |
| national_accounts_reference_year | continuous | National accounts reference year | Reference year for the national-accounts series. | year | WDICountry | World Bank WDI |
| oilres | continuous | Natural resources - oil reserves | Oil-reserves endowment measure. | reserves measure | main_data | Geological resource data |
| other_groups | identifier | Other groups | Other World Bank groupings the economy belongs to (e.g. HIPC, EMU). | category | WDICountry | World Bank WDI |
| overallscore | continuous | Index of Economic Freedom (overall) | Heritage Foundation overall economic-freedom score. | 0-100 | main_data | Heritage Foundation |
| polity2 | continuous | Polity2 democracy score (-10..10) | Combined Polity score; the headline democracy correlate in the OVB worked example. Interpolated/analysis-ready (no special missing codes). | -10 (autocracy) to +10 (democracy) | main_data | Polity IV |
| population | continuous | Population, total | Total population. | persons | main_data | World Bank WDI |
| population_1900 | continuous | Population in 1900 (thousands) | Mid-year population around 1900 (historical control). | thousands | main_data | Historical demographic sources |
| population_growth | continuous | Population growth (annual %) | Annual rate of total-population growth; a Solow fundamental. | % per year | main_data | World Bank WDI |
| ppp_survey_year | continuous | PPP survey year | Year of the latest ICP purchasing-power-parity survey. | year | WDICountry | World Bank WDI |
| pri_inv | continuous | Private investment (% of GDP) | Private investment share of GDP (one of the 10 correlates in the fixed multivariate sample). | % of GDP | main_data | IMF Investment & Capital Stock dataset |
| propertyrights | continuous | Property rights score | Heritage Foundation property-rights component. | 0-100 | main_data | Heritage Foundation |
| region | identifier | World Bank region | WDI geographic region; the merged key used to build the four-way region_group (Section 6.2). Blank for aggregate rows. | region name | WDICountry | World Bank WDI |
| resource_rents | continuous | Total natural-resource rents (% of GDP) | Sum of natural-resource rents as a share of GDP (cleaning threshold variable). | % of GDP | main_data | World Bank WDI |
| sev_tax | continuous | Taxes on goods and services (% of revenue) | Share of government revenue from taxes on goods and services. | % of revenue | main_data | World Bank WDI |
| short_name | identifier | Short country name | Short display name of the economy. | string | WDICountry | World Bank WDI |
| silv | dummy | Natural resources - silver (dummy) | Indicator for significant silver endowment. | 0/1 | main_data | Geological resource data |
| sna_price_valuation | identifier | SNA price valuation | System-of-National-Accounts price-valuation basis. | category | WDICountry | World Bank WDI |
| source_of_most_recent_income_and | identifier | Source of most recent income/expenditure data | Survey source of the most recent income and expenditure data. | string | WDICountry | World Bank WDI |
| special_notes | identifier | Special notes | Free-text WDI notes about the economy. | string | WDICountry | World Bank WDI |
| system_of_national_accounts | identifier | System of National Accounts | SNA standard followed (e.g. 1993 SNA, 2008 SNA). | category | WDICountry | World Bank WDI |
| system_of_trade | identifier | System of trade | General vs. special trade-recording system. | category | WDICountry | World Bank WDI |
| table_name | identifier | Table country name | Name as used in WDI tables (drop-if-empty filter in the merge). | string | WDICountry | World Bank WDI |
| tariff_all_ew | continuous | Tariff rate, simple mean, all products (%) | Applied tariff rate, simple (equal-weighted) mean across all products. | % | main_data | World Bank WDI |
| tariff_all_vw | continuous | Tariff rate, weighted mean, all products (%) | Applied tariff rate, trade-weighted mean across all products. | % | main_data | World Bank WDI |
| taxburden | continuous | Tax burden score | Heritage Foundation tax-burden component. | 0-100 | main_data | Heritage Foundation |
| temp1 | continuous | Temperature indicator 1 | First of five temperature-zone indicators (geographic control). | 0-1 share | main_data | Geographic data |
| temp2 | continuous | Temperature indicator 2 | Second of five temperature-zone indicators. | 0-1 share | main_data | Geographic data |
| temp3 | continuous | Temperature indicator 3 | Third of five temperature-zone indicators. | 0-1 share | main_data | Geographic data |
| temp4 | continuous | Temperature indicator 4 | Fourth of five temperature-zone indicators. | 0-1 share | main_data | Geographic data |
| temp5 | continuous | Temperature indicator 5 | Fifth of five temperature-zone indicators. | 0-1 share | main_data | Geographic data |
| totalscore | continuous | Composite institutional score | Composite total score used as an auxiliary institutional measure. | score | main_data | Replication panel |
| tropics | continuous | Tropical climate share | Share of land area in tropical climate zones. | 0-1 share | main_data | Geographic data |
| vital_registration_complete | identifier | Vital registration complete | Whether vital (births/deaths) registration is complete. | Yes/blank | WDICountry | World Bank WDI |
| wb_2_code | identifier | World Bank 2-letter code | World Bank internal two-letter economy code. | string | WDICountry | World Bank WDI |
| working_population_prop | continuous | Working-age population share (15-64, %) | Population aged 15-64 as a share of total. | % of total | main_data | World Bank WDI |
| year | year | Calendar year | Annual time index, 1960-2017. | year | main_data | Replication panel |
| zinc | dummy | Natural resources - zinc (dummy) | Indicator for significant zinc endowment. | 0/1 | main_data | Geological resource data |
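The table notes that `democ` and `autoc` carry the Polity special codes -66, -77 and -88, which are non-scores rather than values. A minimal sketch of how one might recode them to missing before averaging or regressing follows. The helper names and the sample values are hypothetical, and nothing here is taken from the tutorial's own cleaning code.

```python
import math

# Polity special codes: interruption, interregnum, transition.
POLITY_SPECIAL_CODES = {-66, -77, -88}


def clean_polity(values):
    """Replace Polity special codes with NaN so they do not enter averages."""
    return [math.nan if v in POLITY_SPECIAL_CODES else float(v) for v in values]


def mean_ignoring_nan(values):
    """Mean of the non-missing entries (NaN if none remain)."""
    valid = [v for v in values if not math.isnan(v)]
    return sum(valid) / len(valid) if valid else math.nan


democ_raw = [8, -88, 10, -66, 0, -77, 6]  # made-up democ values
democ = clean_polity(democ_raw)

print(sum(democ_raw) / len(democ_raw))  # distorted by the special codes
print(mean_ignoring_nan(democ))         # mean of the genuine 0-10 scores
```

According to the table, `polity2` is already analysis-ready, so this step would only matter when working with the two sub-indices.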
Construction & formulas
Convergence is measured on real GDP per capita. Let y_it be country i's
GDP per capita in year t and g_{i,t→t+10} its 10-year forward annualized
growth rate (the outcome loggdp_growth_10).
- Beta-convergence: g_{i,t→t+10} = α + β·log(y_it) + ε. A negative β means poorer countries grow faster (convergence).
- Trend in beta (year-interacted): g = β_t·log(y_it) + μ_t + ε, one β_t per year, with year fixed effects μ_t.
- Sigma-convergence: σ_t = SD_i[ log(y_it) ], the cross-country standard deviation of log income each year; a narrowing series is sigma-convergence.
- Conditional convergence: g = α + β*·log(y_it) + λ·X_it + ε, where X is a growth correlate and β* is the convergence slope holding it fixed.
- Correlate-income slope: X_it = ν + δ·log(y_it) + u, which measures how much more of the correlate richer countries have.
- OVB identity: β − β* = δ × λ. The gap between unconditional and conditional convergence equals the product of the correlate-income slope and the growth-regression slope (an exact algebraic identity).
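Because the OVB identity is exact in any sample, it can be checked numerically. The sketch below uses synthetic data rather than the replication panel. The helpers `slope` and `residuals` are hypothetical, and the conditional coefficients are obtained by partialling out the other regressor (Frisch-Waugh-Lovell), which is one of several equivalent ways to fit the two-regressor model.

```python
import random
from statistics import fmean


def slope(x, y):
    """OLS slope from regressing y on x with an intercept."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx


def residuals(x, y):
    """Residuals from regressing y on x with an intercept."""
    b = slope(x, y)
    a = fmean(y) - b * fmean(x)
    return [yi - a - b * xi for xi, yi in zip(x, y)]


# Synthetic cross-section (made-up numbers, not the replication panel).
random.seed(0)
n = 200
log_y = [random.gauss(9.0, 1.0) for _ in range(n)]       # initial log income
X = [0.5 * ly + random.gauss(0.0, 1.0) for ly in log_y]  # growth correlate
g = [2.0 - 0.3 * ly + 0.8 * x + random.gauss(0.0, 1.0)   # growth outcome
     for ly, x in zip(log_y, X)]

beta = slope(log_y, g)   # unconditional convergence slope
delta = slope(log_y, X)  # correlate-income slope
# Frisch-Waugh-Lovell: partial out the other regressor to recover each
# coefficient of the two-regressor (conditional) regression.
beta_star = slope(residuals(X, log_y), g)  # conditional slope on log income
lam = slope(residuals(log_y, X), g)        # slope on the correlate

gap = beta - beta_star
print(f"beta - beta* = {gap:.6f}, delta * lambda = {delta * lam:.6f}")
```

The two printed numbers agree up to floating-point error regardless of the seed, which is what "exact algebraic identity" means in practice.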
Speed and half-life of convergence follow from β: the implied convergence
speed is λ_conv = −ln(1 + β)/T and the half-life is
ln(2)/λ_conv (here T = 10 years). Income variables are in natural
logs (loggdp from PWT real GDP per capita); correlates are normalized by their 1985
cross-country standard deviation in the OVB exercises so coefficients are comparable across variables.
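As a worked illustration of the speed and half-life formulas, the sketch below evaluates them for a hypothetical β. It assumes β is expressed as the coefficient on cumulative T-year log growth, so that 1 + β is the share of the initial gap remaining after T years. With an annualized outcome, β would need rescaling first, and the text above does not specify that rescaling. The function names are illustrative.

```python
import math


def convergence_speed(beta, T=10):
    """lambda_conv = -ln(1 + beta) / T, defined here for -1 < beta < 0."""
    if not -1.0 < beta < 0.0:
        raise ValueError("beta must lie in (-1, 0) for a positive, finite speed")
    return -math.log1p(beta) / T


def half_life(beta, T=10):
    """Years needed to close half of the gap: ln(2) / lambda_conv."""
    return math.log(2) / convergence_speed(beta, T)


beta = -0.10  # hypothetical slope on cumulative 10-year log growth
speed = convergence_speed(beta)
print(round(speed, 4), round(half_life(beta), 1))
```

With these inputs the speed is about 1.05% per year and the half-life is roughly 66 years. A non-negative β raises an error because no convergence speed is implied.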
The datasets
Switch datasets with the tabs. Each shows the full variable dictionary plus a sortable statistics table with mini distributions and data coverage.
Variable dictionary
| Variable | Label | Definition | Construction | Units | Source | Coverage |
|---|---|---|---|---|---|---|
code identifier | Country code (ISO3) | Three-letter ISO country code; the panel identifier and the merge key to WDICountry. | ISO3 string | Replication panel / WDI | 174 codes | |
legor_uk dummy | Legal origin - UK (dummy) | 1 if the country's legal system is of British (common-law) origin. | 0/1 | La Porta et al. (legal origin) | ||
legor_fr dummy | Legal origin - France (dummy) | 1 if the country's legal system is of French civil-law origin. | 0/1 | La Porta et al. (legal origin) | ||
legor_ge dummy | Legal origin - Germany (dummy) | 1 if the country's legal system is of German civil-law origin. | 0/1 | La Porta et al. (legal origin) | ||
legor_sc dummy | Legal origin - Scandinavia (dummy) | 1 if the country's legal system is of Scandinavian origin. | 0/1 | La Porta et al. (legal origin) | ||
legor_so dummy | Legal origin - Socialist (dummy) | 1 if the country's legal system is of socialist origin. | 0/1 | La Porta et al. (legal origin) | ||
euro1900 continuous | European settlers, 1900 (share) | Share of European settlers around 1900. | 0-1 share | Historical sources | ||
logem4 continuous | Log settler mortality | Log of early-settler mortality (Acemoglu-Johnson-Robinson institutions instrument). | log deaths per 1,000 | Acemoglu, Johnson & Robinson (2001) | ||
meantemp continuous | Mean temperature | Country mean temperature (long-run geographic correlate). | degrees Celsius | Geographic data | ||
lt100km continuous | Population within 100km of coast | Share of population living within 100km of the coast. | 0-1 share | Geographic data | ||
avelf continuous | Ethno-linguistic fractionalization | Index of ethno-linguistic fractionalization (probability two random people differ). | 0-1 index | Fractionalization data | ||
temp1 continuous | Temperature indicator 1 | First of five temperature-zone indicators (geographic control). | 0-1 share | Geographic data | ||
temp2 continuous | Temperature indicator 2 | Second of five temperature-zone indicators. | 0-1 share | Geographic data | ||
temp3 continuous | Temperature indicator 3 | Third of five temperature-zone indicators. | 0-1 share | Geographic data | ||
temp4 continuous | Temperature indicator 4 | Fourth of five temperature-zone indicators. | 0-1 share | Geographic data | ||
temp5 continuous | Temperature indicator 5 | Fifth of five temperature-zone indicators. | 0-1 share | Geographic data | ||
edes1975 continuous | European descent, 1975 (%) | Percent of population of European descent in 1975. | % | Historical sources | ||
landlock dummy | Landlocked (dummy) | 1 if the country is landlocked, else 0. | 0/1 | Geographic data | ||
goldm continuous | Natural resources - gold | Gold endowment measure; values range from 0 to 47 in the data, so it is not a 0/1 indicator. | endowment measure | Geological resource data | ||
iron continuous | Natural resources - iron | Iron endowment measure; values range from 0 to 16 in the data, so it is not a 0/1 indicator. | endowment measure | Geological resource data | ||
silv continuous | Natural resources - silver | Silver endowment measure; values range from 0 to 13 in the data, so it is not a 0/1 indicator. | endowment measure | Geological resource data | ||
zinc continuous | Natural resources - zinc | Zinc endowment measure; values range from 0 to 15 in the data, so it is not a 0/1 indicator. | endowment measure | Geological resource data | ||
oilres continuous | Natural resources - oil reserves | Oil-reserves endowment measure. | reserves measure | Geological resource data | ||
lat_abst continuous | Absolute latitude (0-1) | Absolute latitude of the country, scaled 0-1 (distance from the equator). | 0-1 (|lat|/90) | Geographic data | ||
avexpr continuous | Protection against expropriation risk | Average index of protection against expropriation risk (institutional quality). | 0-10 index | Acemoglu, Johnson & Robinson (2001) | ||
tropics continuous | Tropical climate share | Share of land area in tropical climate zones; only the values 0 and 1 occur in the data. | 0-1 share | Geographic data | ||
g7089n continuous | Growth in neighboring countries (1970-89) | Average growth of neighboring countries, 1970-1989 (spatial spillover control). | % per year | Replication panel (derived) | ||
VSM_power_dist continuous | Power Distance Index (Hofstede) | Hofstede culture dimension: acceptance of unequal power distribution. | 0-100 index | Hofstede VSM | ||
VSM_individualism continuous | Individualism Index (Hofstede) | Hofstede culture dimension: individualism vs. collectivism. | 0-100 index | Hofstede VSM | ||
VSM_masculinity continuous | Masculinity Index (Hofstede) | Hofstede culture dimension: masculinity vs. femininity. | 0-100 index | Hofstede VSM | ||
VSM_uncertain_avoid continuous | Uncertainty Avoidance Index (Hofstede) | Hofstede culture dimension: tolerance for uncertainty/ambiguity. | 0-100 index | Hofstede VSM | ||
VSM_indulgence continuous | Long-Term Orientation Index (Hofstede) | Hofstede culture dimension. NOTE: the embedded .dta labels for VSM_indulgence and VSM_longterm are swapped at source; this column is the long-term-orientation dimension. | 0-100 index | Hofstede VSM | ||
VSM_longterm continuous | Indulgence vs. Restraint Index (Hofstede) | Hofstede culture dimension. NOTE: the embedded .dta labels for VSM_longterm and VSM_indulgence are swapped at source; this column is the indulgence/restraint dimension. | 0-100 index | Hofstede VSM | ||
population_1900 continuous | Population in 1900 (thousands) | Mid-year population around 1900 (historical control). | thousands | Historical demographic sources | ||
year year | Calendar year | Annual time index, 1960-2017. | year | Replication panel | ||
country identifier | Country name | Country name corresponding to the ISO3 code. | string | Replication panel | ||
inc_tax continuous | Taxes on income/profits/capital gains (% of revenue) | Share of government revenue from taxes on income, profits and capital gains. | % of revenue | World Bank WDI | ||
sev_tax continuous | Taxes on goods and services (% of revenue) | Share of government revenue from taxes on goods and services. | % of revenue | World Bank WDI | ||
credit continuous | Domestic credit by financial sector (% of GDP) | Domestic credit provided by the financial sector, share of GDP; financial-development correlate. | % of GDP | World Bank WDI | ||
WDI_credit continuous | Domestic credit, alt. (% of GDP) | Alternative WDI credit series used in the multivariate convergence set. (Embedded .dta label is a copy artifact.) | % of GDP | World Bank WDI | ||
WDI_GDP_pc continuous | GDP per capita (WDI, level) | GDP per capita level from World Bank WDI. | US$ | World Bank WDI | ||
miliexp continuous | Military expenditure (% of GDP) | Military spending as a share of GDP. | % of GDP | World Bank WDI | ||
gov_spending continuous | Government consumption (% of GDP) | General government final consumption expenditure as a share of GDP. | % of GDP | World Bank WDI | ||
WDI_secondary continuous | Secondary enrollment (% gross) | Gross secondary-school enrollment ratio. | % gross | World Bank WDI | ||
WDI_primary continuous | Primary enrollment (% gross) | Gross primary-school enrollment ratio. | % gross | World Bank WDI | ||
WDI_tertiary continuous | Tertiary enrollment (% gross) | Gross tertiary-school enrollment ratio. | % gross | World Bank WDI | ||
laborparti continuous | Labor force participation rate (%) | Total labor-force participation rate, population 15+ (modeled ILO estimate). | % of 15+ population | World Bank WDI / ILO | ||
tariff_all_ew continuous | Tariff rate, simple mean, all products (%) | Applied tariff rate, simple (equal-weighted) mean across all products. | % | World Bank WDI | ||
inflation continuous | Inflation (GDP deflator / CPI, annual %) | Annual inflation rate; a short-run macro-policy correlate. (Embedded .dta label is a copy artifact; the variable is inflation.) | % per year | World Bank WDI | ||
tariff_all_vw continuous | Tariff rate, weighted mean, all products (%) | Applied tariff rate, trade-weighted mean across all products. | % | World Bank WDI | ||
investment continuous | Gross capital formation (% of GDP) | Investment share of GDP; a Solow fundamental (steady-state determinant). | % of GDP | World Bank WDI | ||
working_population_prop continuous | Working-age population share (15-64, %) | Population aged 15-64 as a share of total. | % of total | World Bank WDI | ||
population_growth continuous | Population growth (annual %) | Annual rate of total-population growth; a Solow fundamental. | % per year | World Bank WDI | ||
population continuous | Population, total | Total population. | persons | World Bank WDI | ||
resource_rents continuous | Total natural-resource rents (% of GDP) | Sum of natural-resource rents as a share of GDP (cleaning threshold variable). | % of GDP | World Bank WDI | ||
fuel_exports continuous | Fuel exports (% of merchandise exports) | Fuel exports as a share of merchandise exports. | % of exports | World Bank WDI | ||
mineral_exports continuous | Ores & metals exports (% of merchandise exports) | Ores and metals exports as a share of merchandise exports. | % of exports | World Bank WDI | ||
fuel_mineral_exports continuous | Fuel + ore & metals exports (% of merch. exports) | Combined fuel and ores/metals exports as a share of merchandise exports. | % of exports | World Bank WDI | ||
lvaw_garriga continuous | Central bank independence (Garriga, weighted) | Central-bank-independence index, Garriga (2016), weighted version. | 0-1 index | Garriga (2016) | ||
barrolee2060 continuous | Average years of schooling (Barro-Lee) | Mean years of schooling, population aged 20-60; the human-capital Solow fundamental. | (sum) w_yr_sch from the Barro-Lee dataset. | years | Barro-Lee | |
edugap continuous | Education gender gap | Gap in schooling between men and women (used in the multivariate convergence set). | years | Barro-Lee (derived) | ||
overallscore continuous | Index of Economic Freedom (overall) | Heritage Foundation overall economic-freedom score. | 0-100 | Heritage Foundation | ||
propertyrights continuous | Property rights score | Heritage Foundation property-rights component. | 0-100 | Heritage Foundation | ||
governmentintegrity continuous | Government integrity score | Heritage Foundation government-integrity component. | 0-100 | Heritage Foundation | ||
taxburden continuous | Tax burden score | Heritage Foundation tax-burden component. | 0-100 | Heritage Foundation | ||
businessfreedom continuous | Business freedom score | Heritage Foundation business-freedom component. | 0-100 | Heritage Foundation | ||
investmentfreedom continuous | Investment freedom score | Heritage Foundation investment-freedom component. | 0-100 | Heritage Foundation | ||
financialfreedom continuous | Financial freedom score | Heritage Foundation financial-freedom component. | 0-100 | Heritage Foundation | ||
WGI_gov_effectiveness continuous | Government effectiveness (WGI) | Worldwide Governance Indicators: government-effectiveness estimate. | z-score (~ -2.5 to 2.5) | World Bank WGI | ||
WGI_regulatory_quality continuous | Regulatory quality (WGI) | Worldwide Governance Indicators: regulatory-quality estimate. | z-score (~ -2.5 to 2.5) | World Bank WGI | ||
WGI_rule_law continuous | Rule of law (WGI) | Worldwide Governance Indicators: rule-of-law estimate. | z-score (~ -2.5 to 2.5) | World Bank WGI | ||
WGI_control_corruption continuous | Control of corruption (WGI) | Worldwide Governance Indicators: control-of-corruption estimate. | z-score (~ -2.5 to 2.5) | World Bank WGI | ||
WGI_pol_stability continuous | Political stability & absence of violence (WGI) | Worldwide Governance Indicators: political-stability / absence-of-violence estimate. | z-score (~ -2.5 to 2.5) | World Bank WGI | ||
igov_rpppPercentofGDP continuous | Public investment (% of GDP) | General-government investment as a share of GDP (IMF igov_rppp). | % of GDP | IMF Investment & Capital Stock dataset | ||
ipriv_rpppPercentofGDP continuous | Private investment, IMF (% of GDP) | Private investment as a share of GDP (IMF ipriv_rppp). | % of GDP | IMF Investment & Capital Stock dataset | ||
pri_inv continuous | Private investment (share of GDP) | Private investment share of GDP (one of the 10 correlates in the fixed multivariate sample); stored as a 0-1 fraction (observed range 0.015-0.973), not in percent. | 0-1 share | IMF Investment & Capital Stock dataset | ||
democ continuous | Institutionalized democracy (0-10) | Polity democracy sub-index. Special codes -66 (interruption), -77 (interregnum), -88 (transition) denote non-scores, not values. | 0-10 (+ Polity special codes) | Polity IV | ||
autoc continuous | Institutionalized autocracy (0-10) | Polity autocracy sub-index. Special codes -66/-77/-88 denote non-scores, not values. | 0-10 (+ Polity special codes) | Polity IV | ||
polity2 continuous | Polity2 democracy score (-10..10) | Combined Polity score; the headline democracy correlate in the OVB worked example. Interpolated/analysis-ready (no special missing codes). | -10 (autocracy) to +10 (democracy) | Polity IV | ||
FH_political_rights continuous | Freedom House political rights (1-7) | Freedom House political-rights rating (1 = most free, 7 = least free); a short-run institutional correlate. | 1-7 rating | Freedom House | ||
FH_civil_liberties continuous | Freedom House civil liberties (1-7) | Freedom House civil-liberties rating (1 = most free, 7 = least free). | 1-7 rating | Freedom House | ||
totalscore continuous | Composite institutional score | Composite total score used as an auxiliary institutional measure. | score | Replication panel | ||
logrgdpna continuous | Log real GDP (national accounts) | Log real GDP per capita, PWT national-accounts growth series (rgdpna); base for the 10-year growth rate. | log US$ | Penn World Table 10.0 | ||
logcgdpo continuous | Log real GDP (output, current PPP) | Log real GDP per capita, PWT output-side series at current PPPs (cgdpo). | log US$ | Penn World Table 10.0 | ||
logrgdpo continuous | Log real GDP (output side) | Log real GDP per capita, PWT output-side series (rgdpo). | log US$ | Penn World Table 10.0 | ||
logrgdpe continuous | Log real GDP (expenditure side) | Log real GDP per capita, PWT expenditure-side series (rgdpe). | log US$ | Penn World Table 10.0 | ||
logrgdpo_91 continuous | Log real GDP (output side, 1991 base) | Log real GDP per capita, PWT output-side series anchored to a 1991 base. | log US$ | Penn World Table 10.0 | ||
country_id identifier | Numeric country id | Numeric panel id (Stata egen group(code)); used as the panel/cluster unit. | group(code) | integer id | Derived (this study) | |
loggdp_WDI continuous | Log GDP per capita (WDI) | Natural log of GDP per capita from World Bank WDI (alternative income series). | log US$ | World Bank WDI | ||
loggdp continuous | Log GDP per capita (PWT) | Natural log of real GDP per capita from Penn World Table 10.0; the initial-income regressor in convergence regressions. | log(GDPpc), PWT real GDP per capita. Documented as the national-accounts series (logrgdpna), but its summary statistics coincide with those of the output-side series logrgdpo. | log US$ (PPP) | Penn World Table 10.0 | |
loggdp_pw continuous | Log GDP per capita (PWT variant) | Alternative log GDP-per-capita measure used in robustness checks. | log US$ | Penn World Table 10.0 | ||
loggdp_growth_10 continuous | 10-year forward GDP-pc growth (annualized %) | Forward-looking 10-year annualized growth rate of GDP per capita; the main outcome variable. | 100 x (F10.logrgdpna - logrgdpna)/10 over a 10-year horizon (forward-looking; last usable year ~2007). | % per year | Penn World Table 10.0 (derived) |
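The construction of loggdp_growth_10 given in the last row can be sketched in pandas. This is a minimal illustration, not the authors' code: the helper name `forward_growth` and the toy panel are hypothetical, and the only assumption about the real data is that it has `code`, `year`, and `logrgdpna` columns. The lead is matched on calendar year (as Stata's `F10.` operator does on a declared panel), so a gap in the panel produces a missing value rather than a wrong lead.

```python
import numpy as np
import pandas as pd


def forward_growth(df, horizon=10, value="logrgdpna"):
    """Annualized forward growth in percent: 100 * (y[t+h] - y[t]) / h.

    Hypothetical helper. The lead is matched on calendar year, not row
    position, so missing years give missing growth instead of a wrong lead.
    """
    lead = df[["code", "year", value]].copy()
    lead["year"] = lead["year"] - horizon            # align year t+h onto year t
    lead = lead.rename(columns={value: "lead_value"})
    out = df.merge(lead, on=["code", "year"], how="left")
    out[f"growth_{horizon}"] = 100 * (out["lead_value"] - out[value]) / horizon
    return out.drop(columns="lead_value")


# Synthetic toy panel (not the real data): log income rises by 0.02 per year.
years = np.arange(1960, 2018)
toy = pd.DataFrame({
    "code": "AAA",
    "year": years,
    "logrgdpna": 8.0 + 0.02 * (years - 1960),
})
toy = forward_growth(toy)
print(toy.loc[toy["year"].isin([1960, 2007, 2008]), ["year", "growth_10"]])
```

With data running through 2017, the growth column is defined up to 2007 and missing from 2008 onward, which is why the usable sample ends there.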
Distribution & statistics
| Variable | Coverage | N | Distinct | Min | Mean | Median | Max | SD |
|---|---|---|---|---|---|---|---|---|
| code | 100% | 10,092 | 174 | — | — | — | — | — |
| legor_uk | 100% | 10,092 | 3 | — | — | — | — | — |
| legor_fr | 100% | 10,092 | 3 | — | — | — | — | — |
| legor_ge | 100% | 10,092 | 3 | — | — | — | — | — |
| legor_sc | 100% | 10,092 | 3 | — | — | — | — | — |
| legor_so | 100% | 10,092 | 3 | — | — | — | — | — |
| euro1900 | 84% | 8,526 | 26 | 0 | 31.24 | 2.70 | 100.0 | 42.42 |
| logem4 | 49% | 4,988 | 44 | 0.936 | 4.58 | 4.44 | 7.99 | 1.29 |
| meantemp | 34% | 3,480 | 56 | -0.200 | 23.13 | 24.47 | 29.30 | 4.92 |
| lt100km | 35% | 3,538 | 48 | 0 | 0.374 | 0.239 | 1.00 | 0.352 |
| avelf | 74% | 7,482 | 125 | 0 | 0.357 | 0.270 | 1.00 | 0.305 |
| temp1 | 87% | 8,816 | 32 | -4.00 | 18.87 | 20.00 | 32.00 | 8.20 |
| temp2 | 87% | 8,816 | 34 | -6.00 | 24.16 | 26.00 | 40.00 | 9.24 |
| temp3 | 87% | 8,816 | 22 | 23.00 | 38.14 | 38.00 | 48.00 | 4.92 |
| temp4 | 87% | 8,816 | 45 | -44.00 | -2.57 | 0 | 20.00 | 16.69 |
| temp5 | 87% | 8,816 | 25 | 1.00 | 14.00 | 14.50 | 26.00 | 6.61 |
| edes1975 | 86% | 8,642 | 21 | 0 | 33.02 | 1.00 | 100.0 | 43.71 |
| landlock | 89% | 8,990 | 2 | 0 | 0.200 | 0 | 1.00 | 0.400 |
| goldm | 87% | 8,816 | 6 | 0 | 0.441 | 0 | 47.00 | 3.91 |
| iron | 87% | 8,816 | 10 | 0 | 0.323 | 0 | 16.00 | 1.63 |
| silv | 87% | 8,816 | 4 | 0 | 0.362 | 0 | 13.00 | 1.99 |
| zinc | 87% | 8,816 | 10 | 0 | 0.507 | 0 | 15.00 | 2.06 |
| oilres | 84% | 8,526 | 63 | 0 | 117,084 | 0 | 4,500,000 | 523,284 |
| lat_abst | 89% | 8,990 | 94 | 0 | 0.294 | 0.256 | 0.722 | 0.192 |
| avexpr | 66% | 6,670 | 76 | 3.00 | 7.12 | 7.05 | 10.00 | 1.73 |
| tropics | 52% | 5,220 | 2 | 0 | 0.644 | 1.00 | 1.00 | 0.479 |
| g7089n | 51% | 5,162 | 88 | -3.36 | 1.47 | 1.76 | 6.78 | 1.72 |
| VSM_power_dist | 100% | 10,092 | 49 | — | — | — | — | — |
| VSM_individualism | 100% | 10,092 | 50 | — | — | — | — | — |
| VSM_masculinity | 100% | 10,092 | 47 | — | — | — | — | — |
| VSM_uncertain_avoid | 100% | 10,092 | 47 | — | — | — | — | — |
| VSM_indulgence | 51% | 5,104 | 83 | 0 | 46.04 | 46.35 | 100.0 | 23.66 |
| VSM_longterm | 51% | 5,104 | 82 | 0 | 44.87 | 42.86 | 100.0 | 22.02 |
| population_1900 | 34% | 3,480 | 60 | 215.0 | 20,899 | 3,999.0 | 400,000 | 62,507 |
| year | 100% | 10,092 | 58 | 1960 | 1988.5 | 1988 | 2017 | 16.74 |
| country | 100% | 10,092 | 174 | — | — | — | — | — |
| inc_tax | 36% | 3,588 | 3,587 | -1.35 | 24.36 | 21.28 | 79.54 | 14.27 |
| sev_tax | 35% | 3,581 | 3,579 | 0.024 | 30.19 | 31.20 | 91.29 | 13.58 |
| credit | 68% | 6,843 | 6,843 | -114.7 | 49.38 | 34.56 | 347.5 | 48.71 |
| WDI_credit | 68% | 6,877 | 6,877 | 0.005 | 36.67 | 23.26 | 312.0 | 37.40 |
| WDI_GDP_pc | 78% | 7,869 | 7,869 | 131.6 | 9,963.2 | 3,209.5 | 113,682 | 15,213 |
| miliexp | 67% | 6,717 | 6,583 | 0 | 2.71 | 1.94 | 48.52 | 2.97 |
| gov_spending | 73% | 7,319 | 7,314 | 0 | 15.46 | 14.73 | 135.8 | 6.59 |
| WDI_secondary | 55% | 5,587 | 5,583 | 0 | 63.15 | 68.41 | 163.9 | 34.26 |
| WDI_primary | 65% | 6,546 | 6,544 | 2.83 | 95.65 | 100.7 | 173.8 | 22.54 |
| WDI_tertiary | 50% | 5,070 | 5,058 | 0 | 23.84 | 17.23 | 126.4 | 23.17 |
| laborparti | 48% | 4,844 | 4,510 | 36.95 | 62.43 | 61.97 | 90.63 | 10.23 |
| tariff_all_ew | 31% | 3,149 | 1,309 | 0 | 8.54 | 6.36 | 105.4 | 7.26 |
| inflation | 78% | 7,887 | 7,696 | -98.70 | 11.42 | 5.81 | 100.0 | 19.33 |
| tariff_all_vw | 31% | 3,149 | 1,280 | 0 | 6.92 | 4.50 | 421.5 | 10.38 |
| investment | 72% | 7,284 | 7,242 | -5.74 | 22.83 | 22.38 | 95.32 | 8.64 |
| working_population_prop | 99% | 9,968 | 9,968 | 45.63 | 58.89 | 57.97 | 85.87 | 6.97 |
| population_growth | 99% | 10,019 | 10,017 | -10.96 | 1.83 | 1.87 | 17.70 | 1.49 |
| population | 99% | 10,023 | 10,010 | 63,699 | 29,505,564 | 5,881,435 | 1,386,395,000 | 111,532,185 |
| resource_rents | 68% | 6,895 | 6,723 | 0 | 6.74 | 2.64 | 74.40 | 9.82 |
| fuel_exports | 59% | 5,935 | 5,772 | 0 | 15.04 | 2.87 | 722.8 | 30.26 |
| mineral_exports | 61% | 6,147 | 6,132 | 0 | 8.95 | 2.83 | 158.5 | 16.08 |
| fuel_mineral_exports | 58% | 5,891 | 5,877 | 0 | 23.98 | 10.21 | 788.4 | 33.48 |
| lvaw_garriga | 53% | 5,322 | 329 | 0.017 | 0.475 | 0.452 | 0.979 | 0.191 |
| barrolee2060 | 74% | 7,480 | 1,496 | 0.013 | 6.16 | 6.03 | 13.56 | 3.46 |
| edugap | 74% | 7,480 | 1,496 | -2.19 | 0.816 | 0.700 | 4.15 | 0.946 |
| overallscore | 34% | 3,477 | 523 | 21.40 | 59.50 | 59.30 | 89.40 | 10.69 |
| propertyrights | 35% | 3,493 | 126 | 0 | 47.66 | 50.00 | 97.10 | 23.95 |
| governmentintegrity | 35% | 3,505 | 277 | 0 | 40.24 | 33.00 | 100.0 | 22.64 |
| taxburden | 35% | 3,491 | 604 | 29.80 | 72.44 | 74.40 | 100.0 | 14.24 |
| businessfreedom | 35% | 3,500 | 620 | 10.00 | 64.13 | 65.10 | 100.0 | 15.54 |
| investmentfreedom | 100% | 10,092 | 21 | — | — | — | — | — |
| financialfreedom | 100% | 10,092 | 11 | — | — | — | — | — |
| WGI_gov_effectiveness | 37% | 3,738 | 3,671 | -2.48 | -0.044 | -0.244 | 2.44 | 0.996 |
| WGI_regulatory_quality | 37% | 3,739 | 3,717 | -2.65 | -0.028 | -0.180 | 2.26 | 0.985 |
| WGI_rule_law | 37% | 3,763 | 3,763 | -2.61 | -0.105 | -0.331 | 2.10 | 1.000 |
| WGI_control_corruption | 37% | 3,758 | 3,752 | -1.87 | -0.071 | -0.340 | 2.47 | 1.01 |
| WGI_pol_stability | 37% | 3,743 | 3,706 | -3.31 | -0.136 | -0.051 | 1.76 | 0.977 |
| igov_rpppPercentofGDP | 87% | 8,735 | 8,726 | 2.59e-04 | 16.88 | 1.32 | 2,536.7 | 94.42 |
| ipriv_rpppPercentofGDP | 87% | 8,735 | 8,725 | 0.002 | 48.07 | 3.74 | 5,132.6 | 195.1 |
| pri_inv | 87% | 8,736 | 8,732 | 0.015 | 0.640 | 0.670 | 0.973 | 0.184 |
| democ | 100% | 10,092 | 15 | — | — | — | — | — |
| autoc | 100% | 10,092 | 15 | — | — | — | — | — |
| polity2 | 100% | 10,092 | 22 | — | — | — | — | — |
| FH_political_rights | 100% | 10,092 | 8 | — | — | — | — | — |
| FH_civil_liberties | 100% | 10,092 | 8 | — | — | — | — | — |
| totalscore | 100% | 10,092 | 97 | — | — | — | — | — |
| logrgdpna | 83% | 8,328 | 8,323 | 5.51 | 8.79 | 8.81 | 12.23 | 1.23 |
| logcgdpo | 83% | 8,328 | 8,320 | 5.09 | 8.69 | 8.63 | 12.67 | 1.19 |
| logrgdpo | 83% | 8,328 | 8,325 | 5.37 | 8.71 | 8.67 | 12.62 | 1.19 |
| logrgdpe | 83% | 8,328 | 8,322 | 5.94 | 8.68 | 8.63 | 12.56 | 1.20 |
| logrgdpo_91 | 82% | 8,280 | 8,273 | 4.88 | 8.60 | 8.56 | 12.46 | 1.18 |
| country_id | 100% | 10,092 | 174 | — | — | — | — | — |
| loggdp_WDI | 78% | 7,869 | 7,864 | 4.88 | 8.15 | 8.07 | 11.64 | 1.52 |
| loggdp | 83% | 8,328 | 8,325 | 5.37 | 8.71 | 8.67 | 12.62 | 1.19 |
| loggdp_pw | 82% | 8,308 | 8,304 | 5.94 | 9.25 | 9.21 | 13.07 | 1.10 |
| loggdp_growth_10 | 68% | 6,888 | 6,871 | -12.34 | 1.96 | 1.98 | 22.13 | 2.79 |
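The columns of this table can be recomputed after loading a file, which is a quick way to check a local copy against the page. The sketch below is a hedged illustration: the helper name `summarize` and the toy frame are hypothetical, and it assumes that Coverage is the non-missing share of rows, Distinct counts unique non-missing values, and SD is the sample standard deviation. The page does not state how its figures were computed.

```python
import pandas as pd


def summarize(df):
    """Per-column coverage and summary statistics (hypothetical helper)."""
    rows = []
    for col in df.columns:
        s = df[col]
        row = {
            "variable": col,
            "coverage_pct": int(round(100 * float(s.notna().mean()))),
            "n": int(s.notna().sum()),
            "distinct": int(s.nunique()),        # unique non-missing values
        }
        if pd.api.types.is_numeric_dtype(s):
            # Numeric columns only; string columns keep missing statistics.
            row.update(min=s.min(), mean=s.mean(), median=s.median(),
                       max=s.max(), sd=s.std())  # sample SD (ddof=1)
        rows.append(row)
    return pd.DataFrame(rows).set_index("variable")


# Synthetic toy frame (not the real data).
toy = pd.DataFrame({
    "code": ["AAA", "AAA", "BBB", "BBB"],
    "x": [1.0, None, 3.0, 5.0],
})
stats = summarize(toy)
print(stats)
```

Applied to the real panel, the same call would be `summarize(df)` on the frame returned by `pd.read_stata`.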
Variable dictionary
| Variable | Label | Definition | Construction | Units | Source | Coverage |
|---|---|---|---|---|---|---|
country_code identifier | Country code (ISO3) | Three-letter ISO country code; the lookup key (matches main_data code). | ISO3 string | World Bank WDI | 263 rows | |
short_name identifier | Short country name | Short display name of the economy. | string | World Bank WDI | ||
table_name identifier | Table country name | Name as used in WDI tables (drop-if-empty filter in the merge). | string | World Bank WDI | ||
long_name identifier | Official (long) country name | Full official name of the economy. | string | World Bank WDI | ||
iso2_code identifier | ISO 2-letter code | Two-letter ISO country code. | ISO2 string | World Bank WDI | ||
currency_unit identifier | Currency unit | Name of the national currency. | string | World Bank WDI | ||
special_notes identifier | Special notes | Free-text WDI notes about the economy. | string | World Bank WDI | ||
region identifier | World Bank region | WDI geographic region; the merged key used to build the four-way region_group (Section 6.2). Blank for aggregate rows. | region name | World Bank WDI | ||
income_group identifier | World Bank income group | WDI income classification (Low / Lower-middle / Upper-middle / High income). | category | World Bank WDI | ||
wb_2_code identifier | World Bank 2-letter code | World Bank internal two-letter economy code. | string | World Bank WDI | ||
national_accounts_base_year identifier | National accounts base year | Base year of the national-accounts series. | year (string) | World Bank WDI | ||
national_accounts_reference_year continuous | National accounts reference year | Reference year for the national-accounts series. | year | World Bank WDI | ||
sna_price_valuation identifier | SNA price valuation | System-of-National-Accounts price-valuation basis. | category | World Bank WDI | ||
lending_category identifier | World Bank lending category | IDA/IBRD/Blend lending category. | category | World Bank WDI | ||
other_groups identifier | Other groups | Other World Bank groupings the economy belongs to (e.g. HIPC, EMU). | category | World Bank WDI | ||
system_of_national_accounts identifier | System of National Accounts | SNA standard followed (e.g. 1993 SNA, 2008 SNA). | category | World Bank WDI | ||
alternative_conversion_factor identifier | Alternative conversion factor | Periods using an alternative currency-conversion factor. | string | World Bank WDI | ||
ppp_survey_year continuous | PPP survey year | Year of the latest ICP purchasing-power-parity survey. | year | World Bank WDI | ||
balance_of_payments_manual_in_us identifier | Balance of Payments Manual in use | IMF BPM edition used for balance-of-payments data. | category | World Bank WDI | ||
external_debt_reporting_status identifier | External debt reporting status | Status of the economy's external-debt reporting. | category | World Bank WDI | ||
system_of_trade identifier | System of trade | General vs. special trade-recording system. | category | World Bank WDI | ||
government_accounting_concept identifier | Government accounting concept | Budgetary vs. consolidated central-government accounting concept. | category | World Bank WDI | ||
imf_data_dissemination_standard identifier | IMF data dissemination standard | IMF data-dissemination subscription (SDDS/GDDS/e-GDDS). | category | World Bank WDI | ||
latest_population_census identifier | Latest population census | Year of the latest population census. | year (string) | World Bank WDI | ||
latest_household_survey identifier | Latest household survey | Latest household survey (type and year). | string | World Bank WDI | ||
source_of_most_recent_income_and identifier | Source of most recent income/expenditure data | Survey source of the most recent income and expenditure data. | string | World Bank WDI | ||
vital_registration_complete identifier | Vital registration complete | Whether vital (births/deaths) registration is complete. | Yes/blank | World Bank WDI | ||
latest_agricultural_census identifier | Latest agricultural census | Year(s) of the latest agricultural census. | year (string) | World Bank WDI | ||
latest_industrial_data continuous | Latest industrial data (year) | Year of the latest industrial data. | year | World Bank WDI | ||
latest_trade_data continuous | Latest trade data (year) | Year of the latest trade data. | year | World Bank WDI |
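The dictionary above describes country_code as the lookup key for main_data's code and notes that region is blank for aggregate rows. A minimal pandas sketch of that lookup follows. The two frames are synthetic stand-ins, and the choice to filter on a blank region is an assumption for illustration; the tutorial's own merge may filter differently (the dictionary mentions a drop-if-empty filter on table_name).

```python
import pandas as pd

# Synthetic stand-ins for the two files (not the real data).
main = pd.DataFrame({"code": ["ARG", "ARG", "KEN"], "year": [1960, 1961, 1960]})
wdi = pd.DataFrame({
    "country_code": ["ARG", "KEN", "WLD"],
    "table_name": ["Argentina", "Kenya", "World"],
    "region": ["Latin America & Caribbean", "Sub-Saharan Africa", ""],
    "income_group": ["Upper middle income", "Lower middle income", ""],
})

# Aggregate rows (here WLD) have a blank region; keep only country rows.
region = wdi["region"].fillna("").str.strip()
countries = wdi.loc[region != "", ["country_code", "region", "income_group"]]

merged = main.merge(
    countries,
    left_on="code",
    right_on="country_code",
    how="left",
    validate="many_to_one",   # each panel code should match at most one WDI row
    indicator=True,           # adds a _merge column to audit unmatched codes
)
print(merged["_merge"].value_counts())
```

Keeping `indicator=True` makes it easy to list panel codes that find no WDI match before building any regional grouping.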
Distribution & statistics
| Variable | Coverage | N | Distinct | Min | Mean | Median | Max | SD |
|---|---|---|---|---|---|---|---|---|
| country_code | 100% | 263 | 263 | — | — | — | — | — |
| short_name | 100% | 263 | 263 | — | — | — | — | — |
| table_name | 100% | 263 | 263 | — | — | — | — | — |
| long_name | 100% | 263 | 263 | — | — | — | — | — |
| iso2_code | 99% | 261 | 261 | — | — | — | — | — |
| currency_unit | 83% | 217 | 150 | — | — | — | — | — |
| special_notes | 36% | 94 | 80 | — | — | — | — | — |
| region | 83% | 217 | 7 | — | — | — | — | — |
| income_group | 83% | 217 | 4 | — | — | — | — | — |
| wb_2_code | 100% | 262 | 262 | — | — | — | — | — |
| national_accounts_base_year | 80% | 210 | 33 | — | — | — | — | — |
| national_accounts_reference_year | 27% | 70 | 14 | 2000.0 | 2012.8 | 2015.0 | 2019.0 | 3.83 |
| sna_price_valuation | 80% | 211 | 2 | — | — | — | — | — |
| lending_category | 55% | 144 | 3 | — | — | — | — | — |
| other_groups | 22% | 59 | 2 | — | — | — | — | — |
| system_of_national_accounts | 79% | 208 | 3 | — | — | — | — | — |
| alternative_conversion_factor | 18% | 47 | 32 | — | — | — | — | — |
| ppp_survey_year | 0% | 0 | 0 | — | — | — | — | — |
| balance_of_payments_manual_in_us | 76% | 200 | 2 | — | — | — | — | — |
| external_debt_reporting_status | 45% | 119 | 3 | — | — | — | — | — |
| system_of_trade | 77% | 203 | 2 | — | — | — | — | — |
| government_accounting_concept | 60% | 158 | 2 | — | — | — | — | — |
| imf_data_dissemination_standard | 71% | 186 | 3 | — | — | — | — | — |
| latest_population_census | 81% | 214 | 38 | — | — | — | — | — |
| latest_household_survey | 58% | 152 | 75 | — | — | — | — | — |
| source_of_most_recent_income_and | 64% | 168 | 68 | — | — | — | — | — |
| vital_registration_complete | 46% | 120 | 2 | — | — | — | — | — |
| latest_agricultural_census | 49% | 128 | 30 | — | — | — | — | — |
| latest_industrial_data | 56% | 147 | 33 | 1973.0 | 2007.0 | 2012.0 | 2014.0 | 9.42 |
| latest_trade_data | 94% | 246 | 4 | 2014.0 | 2016.8 | 2017.0 | 2017.0 | 0.467 |
Known limitations & caveats
- Descriptive, not causal. All regressions document cross-country correlations and trends; the OVB decomposition partitions observed correlations and does not establish causal effects.
- Unbalanced panel. The sample grows from 109 GDP-reporting countries in 1960 to ~160 by 1990 (decolonization, then post-Soviet states); changing composition can mechanically affect estimates.
- Pre-2008 endpoint. The 10-year growth rate is forward-looking and the panel ends in 2017, so the last usable base year is 2007. Growth windows that start in 2008 or later (the post-GFC recovery, COVID-19) are not covered.
- Small samples for some correlates. Culture (Hofstede VSM) and tariff variables have fewer than ~60 observations, limiting precision; coverage varies widely across the panel's 92 variables (from 31% to 100% in the statistics table).
- Polity coded missingness. The Polity series (democ, autoc) use special codes (−66 interruption, −77 interregnum, −88 transition); treat these as missing, not as scores. polity2 is the interpolated, analysis-ready version on a −10 to +10 scale.
- Normalization sensitivity. In the OVB exercises every correlate is divided by its 1985 standard deviation; the magnitudes of δ and λ (not their signs) depend on this choice.
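The last two caveats can be illustrated with a short sketch on synthetic data (none of the numbers come from the real panel). It first recodes the Polity special codes to missing, then checks the identity β − β* = δ × λ numerically. It assumes the usual reading of the symbols: β is the slope of growth on initial income alone, β* and λ come from the regression that adds the correlate, and δ is the slope of the correlate on initial income. The helper names `slopes` and `ovb_terms` are hypothetical, and the sample standard deviation of the synthetic correlate stands in for the 1985 standard deviation.

```python
import numpy as np
import pandas as pd

# 1) Polity special codes: recode -66/-77/-88 to missing before analysis.
democ = pd.Series([7, -66, 10, -88, 0, -77], name="democ")   # synthetic values
democ_clean = democ.where(~democ.isin([-66, -77, -88]))
print(democ_clean.tolist())

# 2) OVB identity on synthetic data.
rng = np.random.default_rng(0)
n = 200
loggdp = rng.normal(8.5, 1.2, n)                     # initial log income
x = 2.0 * (loggdp - 8.5) + rng.normal(0, 3, n)       # synthetic correlate
growth = 2.0 - 0.3 * loggdp + 0.1 * x + rng.normal(0, 1, n)


def slopes(y, *regressors):
    """OLS slope coefficients (intercept fitted, then dropped)."""
    X = np.column_stack([np.ones(len(y)), *regressors])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[1:]


def ovb_terms(correlate):
    (beta,) = slopes(growth, loggdp)                     # unconditional slope
    beta_star, lam = slopes(growth, loggdp, correlate)   # conditional slope, lambda
    (delta,) = slopes(correlate, loggdp)                 # correlate on income
    return beta, beta_star, delta, lam


sd = x.std(ddof=1)            # stand-in for the 1985 standard deviation
raw = ovb_terms(x)
scaled = ovb_terms(x / sd)

for name, (beta, beta_star, delta, lam) in {"raw": raw, "scaled": scaled}.items():
    print(f"{name}: beta-beta*={beta - beta_star:.4f} delta={delta:.4f} "
          f"lambda={lam:.4f} delta*lambda={delta * lam:.4f}")
```

Dividing the correlate by a constant divides δ by that constant and multiplies λ by it, so the individual magnitudes change while the product δ × λ and the gap β − β* stay the same.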