Department of Labor Logo United States Department of Labor
Dot gov

The .gov means it's official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Article
June 2023

Two plus two really does equal four: simulating official BLS gasoline price measures

Gasoline prices are a major contributor to inflation and have exhibited significant volatility over the last 3 years. Several price series of the U.S. Bureau of Labor Statistics (BLS) track changes in gas prices throughout the supply chain from oil extraction to gas stations. This article uses novel statistical methods that simulate these BLS price measures along the gasoline supply chain to demonstrate their internal statistical consistency and to better understand the impact of gas station markups on gasoline inflation. The statistical results of this article show that BLS price measures are accurately simulated by statistically backing-out each index from its respective supply-chain counterparts. Additionally, this article shows that gas station markups may have had a modest inflationary impact over the course of the COVID-19 pandemic.

Gasoline prices are a major contributor to inflation and have recently exhibited significant volatility.1 Several price series of the U.S. Bureau of Labor Statistics (BLS) track changes in gas prices throughout the supply chain from petroleum extraction to gas stations.2 The role of profits in driving inflation has been a closely followed issue in 2022 and 2023, and some research indicates that profits drove inflation in these years.3 BLS research on other industries, such as the automotive industry, shows that intermediaries in the supply chain can have an impact on consumer pricing, and that the impact of these intermediaries can be identified by examining the implicit statistical relationship between the BLS price indexes for each respective portion of the supply chain.4 The prices and indexes for wholesalers, retailer intermediaries, and final consumers should have statistically strong and economically meaningful relationships in industries in which quality change is low (this occurs with gasoline and other homogenous commodities). This article uses novel statistical methods that simulate the BLS gasoline price measures along the supply chain to demonstrate the internal statistical consistency of these methods, to show the validity of the methods, and to better understand the impact of gas station markups on gasoline inflation.5 Overall, gas station markups had volatile and varying impacts and pressures on gasoline price changes during the COVID-19 pandemic. There were three periods of price changes:

  • March 2020 through April 2020, a period of rapid and strong inflationary pressure from gas station markup increases
  • May 2020 through January 2021, a period of modest deflationary pressure from gas station markup decreases
  • January 2021 through May 2023, a period of modest and steady inflationary impact from gas station markup increases

Theory and industry background

It is possible to simulate BLS price indexes by statistically isolating producer commodity, producer margins, or consumer commodity indexes from each of their respective supply chain counterparts. For example, Kevin M. Camp, Michael Havlin, and Sara Stanley show that the BLS margins index for dealership services in the auto industry can be approximated by the residual between the Producer Price Index (PPI) for completed vehicles and the Consumer Price Index (CPI) for completed vehicles.6 Camp, Havlin, and Stanley show that the residual between commodity indexes for completed vehicles generally tracked the dealership profit-margins index of the PPI. However, it is unclear in Camp, Havlin, and Stanley what is the true statistical relationship between the residual and the official margins index. Additionally, Michael Havlin introduces some additional methods showing that the CPI for completed vehicles can be simulated with an input price index that includes dealership markups.7

Although the methods used in the automotive market worked reasonably well, they are likely to work much better in a homogenous industry. There are several factors that could prevent the margins-simulation method from working perfectly for heterogenous goods like automobiles and other manufactured goods. First, a strong statistical correlation between approximated and official margins indexes may not be possible to simulate because BLS quality adjusts the commodity PPIs and CPIs but does not quality adjust the margins indexes. Gaps between the simulated approximation index and the official margins index could partially represent the isolated quality-adjustment differences between each commodity index and other technical differences. Second, time lags on either side of the retailing intermediaries could prevent a strong correlation between the approximated and the official margins indexes. Finally, one must add the regression residuals to the other regression parameters, namely, the intercept and beta coefficient, to complete the margins-simulation process suggested in Camp, Havlin, and Stanley.8

Industries are excellent test cases for examining the implicit theoretical interdependencies of the commodity and margins indexes in the CPI and PPI. A good test case is an industry that has a high degree of product overlap in the intermediate indexes, does not have a substantial amount of product-quality change, does not have large time lags in the supply chain, and has a high degree of product-flow overlap between supply-chain participants.

Also, industries that produce homogenous commodities, such as gasoline, are excellent test cases for examining the conditions and characteristics that may support the use of the methods developed in Havlin and Camp, Havlin, and Stanley. These methods can be tested with fewer confounding variables (such as quality change, sample-size differences, product-composition differences, and product-flow differences) in analyses of highly homogeneous gasoline indexes with large sample sizes. Nevertheless, technical differences between price-relative calculations, such as those involving geometric means and Laspeyres indexes, may continue to cause differences. Because of assumptions regarding consumer behavior, the CPI uses geometric means that tend to weaken the impact of large changes. Because the PPI does not use geometric means, large price changes may have a larger impact on PPIs than on CPIs. To test the methods developed in Havlin and Camp, Havlin, and Stanley, this article focuses on the CPI for gasoline, the PPI for gasoline, the PPI for automotive fuels and lubricants retailing (PPI for gasoline margins), CPI data on average gasoline prices, and Energy Information Administration (EIA) data on wholesale gasoline prices.

The regression model from Camp, Havlin, and Stanley suggests how the error term ϵt, plus the model intercept β0, and plus the beta coefficient β1 of a regression model fitted on the index levels Gasoline CPIt and Gasoline PPIt should estimate the PPI for gasoline margins for each month after rebasing to the same index levels.9 The gasoline margins PPI short-term price relative can also be simulated by inflating the CPI data on average gasoline prices by the CPI for gasoline each month and then subtracting from that figure the EIA data on wholesale gasoline prices inflated each month by the PPI for gasoline. These short-term price relatives can then be used to reconstruct the PPI for gasoline margins. (See appendix.)

Furthermore, Havlin demonstrates that the interdependencies of the PPI for a particular good and the PPI margin for that good should closely correlate—if not equal—the CPI for that same good. The gasoline input price index including markups is a weighted composite of the PPI for gasoline and the PPI for gasoline margins (demonstrated in the appendix). This index is conceptually similar to the composite input price indexes demonstrated by Jayson Pollock and Jonathan C. Weinhagen.10 Also, the gasoline input price index including markups is an application of the model used by Havlin.

The accuracy of the gasoline input price index including markups can be determined by regressing the index against the CPI for gasoline. This method is demonstrated by Don A. Fast and Susan E. Fleck.11 In this regression, because margins are explicitly incorporated into the model with a weighted-interaction term, the residual term between the CPI for gasoline and the gasoline input price index including markups should equal the cumulative impact of technical differences between the PPI for gasoline and the CPI for gasoline (this equality is similar to geometric means and sample-frame differences).

Both Fast and Fleck and Don Fast, Susan E. Fleck, and Dominic A. Smith provide a useful roadmap for assessing the performance of indexes designed to simulate official government statistics.12 Various test statistics are used by these researchers to assess the performance of simulations of margins and commodity indexes described above and shown below. The correlation coefficient, beta coefficients, and p-values from regressions on the percent changes of indexes (short-term price relatives) are used to assess short-term correlations. Longer term correlations are tested by graphical analysis of the index levels (long-term price relatives).

Data analysis and results

A comparison of the commodity and margins indexes in the gasoline industry shows their implicit relationships. These relationships are alluded to by Camp et al. and demonstrated by Havlin.13 Chart 1 shows that the PPI for gasoline margins increases when the PPI for gasoline falls at a faster rate than the CPI for gasoline. Similarly, when the PPI for gasoline rises at a faster rate than the CPI for gasoline, then the PPI for gasoline margins increases. For example, because of pandemic-related volatility, producer gas prices fell by 64.9 percent from January 2020 through April 2020, while consumer prices only fell by 25.6 percent. As a result of producer prices falling 153.6 percent more than consumer prices, the PPI for gasoline margins increased by 88.4 percent over the same period. Similarly, from April 2020 through July 2020, consumer prices increased by 15.1 percent while producer prices increased by 115.0 percent; as a result, the PPI for gasoline margins decreased by 28.3 percent over the same period.

Chart 2 and table 1 show the results of different methods for simulating the gasoline margins PPI. The first method, an enhancement of Camp, Havlin, and Stanley’s method, adds the residual of an ordinary least squares (OLS) model between the index levels of the PPI for gasoline and the CPI for gasoline to the intercept and the beta coefficient of the model and is then rebased. The intercept and beta coefficient represent the average gas station markup, and the residual represents deviations from that markup. Adding the three parameters together provides an index estimate of the markup. The second method, first demonstrated by Havlin,14 takes the CPI data for the average price of gasoline (starting from May 2019) and the EIA wholesale price (starting from May 2019) and then inflates each series by the CPI and PPI monthly percent changes for gasoline. Next, this sum is subtracted from the inflated difference for each month to create an index.15 Graphically, the results for the two methods are nearly identical, so observing both their levels in chart 2 is not possible without the use of dashes. Two very different methods yielding nearly identical results (both graphically and statistically correlated with the PPI for gasoline margins) demonstrate the effectiveness of the simulation methods. While graphical analysis is useful, statistical analysis is also important for evaluating the effectiveness of the simulation methods because nonstationarity in index levels can overstate the impact of outlier short-term changes.16

Table 1. Margins correlations
Method/parameterCorrelation coefficientBeta of STR OLSP-value of beta STR OLSRSME

Method 1: sum of regression parameters

0.700.890.000.09

Method 2: inflating average prices

0.700.900.000.09

Note: STR = short-term price relative. OLS = ordinary least squares. RSME = root-mean-square error.

Source: Author's calculations based on data from the U.S. Bureau of Labor Statistics and the Energy Information Administration.

Table 1 shows the test statistics used to compare the simulated indexes with the official PPI for gasoline margins. The results of the comparisons show that the performance metrics far exceed what some economists characterize as a “good” fit.17 BLS economists classified a number of simulated BLS indexes by using various statistical thresholds.18 The results of creating a price index by summing the parameters of the OLS model with the CPI and PPI commodity indexes exceed the statistical thresholds used to determine “good” fits established in the academic literature. The theoretical analysis discussed above, the graphical results in chart 2, and the test statistics in table 1 provide robust evidence demonstrating the internal consistency of the BLS measures. Also, these analyses show that BLS price indexes can be simulated with the algebraic and statistical methods used in this article.

It is also possible to simulate the CPI for gasoline by using the PPI for gasoline and the PPI for gasoline margins. The gasoline input price index including markups, introduced in this article, has a higher correlation with the official CPI than with the PPI for gasoline. The OLS regression can be viewed as a model with a variably weighted interaction term as the independent variable and the CPI as the dependent variable. The weights in the gasoline input price index including markups are defined as the margin in the prior period, and the initial margin is assumed to be 6.7 percent.19 Chart 3 graphically compares the CPI for gasoline and the PPI for gasoline with the gasoline input price index including markups.

Table 2 shows the test statistics used to compare the gasoline input price index including markups with the CPI for gasoline. The correlation statistic for the gasoline input price index including markups is appreciably higher than the correlation statistic for the official PPI for gasoline. These correlation statistics exceed the threshold that BLS-domain-hosted articles consider are “good” fits.20 The theoretical analysis discussed above, the graphical results in chart 3, and the test statistics in table 2 provide robust evidence demonstrating the internal consistency of these BLS measures. Again, these results show that BLS price indexes can be simulated with the algebraic and statistical methods used in this article.

Table 2. Correlation with CPI
Correlation with CPI for gasolineCorrelation coefficientBeta of STR OLSP-value of STR OLSRMSE

Gasoline input price index including markups

0.940.820.000.02

PPI for gasoline

0.790.350.000.04

Note: STR = short-term price relative; OLS = ordinary least squares; RSME = root-mean-square error; CPI = Consumer Price Index; PPI = Producer Price Index.

Source: Author's calculations based on data from the U.S. Bureau of Labor Statistics and the Energy Information Administration.

Overall, the results of this article show that, because margins were not constant, gas stations had inflationary and deflationary roles in gas-price changes over the 3-year study period. All three of the index methods, that for the official PPI for gasoline margins and those for the two simulations, show the same trends in price changes of gasoline margins. Gas station margins contributed substantially to inflationary pressures early in the pandemic, from January 2020 through April 2020, but then contributed substantially to deflationary pressures from April 2020 through July 2020. More noteworthy, there had been a steady but modest inflationary impact from gas station margins from January 2021 through May 2023. This result can be observed in the steady increase of all three simulated gas station margins. All three margins exhibited similar volatility and similar trends.

Overview and discussion

The models suggested by Camp, Havlin, and Stanley and by Havlin work well when applied to the homogenous and timely gasoline indexes.21 The results documented in this article demonstrate the internal consistency of the simulated indexes and show that index simulation and estimation methods may be applied to other homogeneous industries. Researchers may try this approach in simulating price changes in other industries, attempt a vector autoregression, or even attempt a simulation of hypothetical margins indexes based on multidecade commodity indexes that predate the official margins indexes published by BLS. These results are of interest to the general public because they help explain how gas prices at the pump may be affected by gas station profits. Over the past 3 years, the impact of gas station margins on gasoline prices has been volatile. This impact of gas station margins was sometimes inflationary and at other times deflationary.

 

Appendix

 

Margin simulation method 1:

,

where β0 and β1 capture the average gas station markup over time period t, and ϵt represents upward or downward deviations from the average markup. Therefore, in time t,

.

 

 

Margin simulation method 2:

 

 

where AGTPt–1 and AGTPt–2 are the average gasoline transaction prices in, respectively, periods t–1 and t–2; AGWPt–1 and AGWPt–2 are the average EIA wholesale-gasoline prices in, respectively, periods t–1 and t–2; CPI STR Gt and CPI STR Gt–1  are the CPI short-term price relatives (STRs) for gasoline in, respectively, periods t and t–1; and PPT STR Gt and PPI STR Gt–1 are the PPI STRs for gasoline in, respectively, periods t and t–1.

 

The values of AGTPt–i and AGWPt–i are calculated as follows:

 

The initial base-period values AGTP and AGWP are calculated as follows:

 

 

 

The values of WT and Wt are calculated as follows:

 

The initial base-period value W is calculated as follows:

 

 

 

 

 

The Gasoline CPIt is calculated as follows:

,

 

where β0 and β1 represent the average difference between the indexes in period t, and ϵt represents deviations from that average. Because profit margins have now been controlled for, the average difference arguably represents the cumulative technical difference between the PPI and CPI measurement methods.

 

Suggested citation:

Michael Havlin, "Two plus two really does equal four: simulating official BLS gasoline price measures," Monthly Labor Review, U.S. Bureau of Labor Statistics, June 2023, https://doi.org/10.21916/mlr.2023.14

Notes


1 Readers interested in the relative contribution figures for the different commodities in the Consumer Price Index (CPI) can refer to “Measuring price change in the CPI: motor fuel,” Consumer Price Index (U.S Bureau of Labor Statistics, February 2023), https://www.bls.gov/cpi/factsheets/motor-fuel.htm; and Kevin M. Camp, David Mead, Stephen B. Reed, Christopher Sitter, and Derek Wasilewski, "From the barrel to the pump: the impact of the COVID-19 pandemic on prices for petroleum products," Monthly Labor Review, U.S. Bureau Labor Statistics, October 2020, https://doi.org/10.21916/mlr.2020.24.

2  Camp, Mead, Reed, Sitter, and Wasilewski, "From the barrel to the pump.”

3 Andrew Glover, José Mustre-del-Río, and Alice von Ende-Becker, “How much have corporate profits contributed to recent inflation?,” Economic Review (Federal Reserve Bank of Kansas City, first quarter 2023), https://www.kansascityfed.org/Economic%20Review/documents/9329/EconomicReviewV108N1GloverMustredelRiovonEndeBecker.pdf.

4 Kevin M. Camp, Michael Havlin, and Sara Stanley, "Automotive dealerships 2007–19: profit-margin compression and product innovation," Monthly Labor Review, U.S. Bureau of Labor Statistics, October 2022, https://doi.org/10.21916/mlr.2022.26.

5 Each of the three kinds of indexes, CPIs for physical goods, producer price indexes (PPIs) for physical goods, and PPIs for retail trade, measures certain goods and services at different stages of the supply chain with the scope of one index beginning where the other ends. Being able to simulate an index by using the other two indexes shows that each index is tracking the transactions and prices that each index is ostensibly intended for.

6 Camp, Havlin, and Stanley, "Automotive dealerships 2007–19.”

7 Michael Havlin, “Automotive dealerships 2019–22: dealer markup increases drive new-vehicle consumer inflation,” Monthly Labor Review, U.S. Bureau of Labor Statistics, April 2023, https://doi.org/10.21916/mlr.2023.7.

8 Camp, Havlin, and Stanley, "Automotive dealerships 2007–19.”

9 Ibid.

10 Jayson Pollock and Jonathan C. Weinhagen, "A new BLS satellite series of net inputs to industry price indexes: methodology and uses," Monthly Labor Review, U.S. Bureau of Labor Statistics, September 2020, https://doi.org/10.21916/mlr.2020.22.

11 Don A. Fast and Susan E. Fleck, “Unit values for import and export price indexes: a proof of concept,” in Big Data for 21st Century Economic Statistics, edited by Katharine G. Abraham, Ron S. Jarmin, Brian Moyer, and Matthew D. Shapiro, pp. 275-296, National Bureau of Economic Research and University of Chicago Press, 2022, https://www.bls.gov/mxp/data/unit-values-import-export-price-indexes.pdf.

12 Fast and Fleck, “Unit values for import and export price indexes;” and Don Fast, Susan E. Fleck, and Dominic A. Smith, “Unit value indexes for exports—new developments using administrative trade data,” Journal of Official Statistics, March 2022, https://doi.org/10.2478/jos-2022-0005.

13 Camp, Havlin, and Stanley, "Automotive dealerships 2007–19;” and Havlin, “Automotive dealerships 2019–22.”

14 Havlin, “Automotive dealerships 2019–22.”

15 Ibid.

16 Nonstationarity in time-series data means that the data have a nonrandom trend that results in the data trending in a direction. Put differently, nonstationarity means that a current value has a non-random relationship with the prior value. This can make graphical analysis misleading because two lines going in the same direction could merely be doing so by coincidence of nonstationarity. To ensure that there truly is a statistical relationship, tests should be run to determine nonstationarity.

17 A “good” fit is a subjective measure defined in Fast, Fleck, and Smith, “Unit value indexes for exports.” A “good” fit is evaluated through an informal assessment of various statistical tests and graphical correlations. The models used in this article outperform the average performance of the “good” fit indexes demonstrated by Fast, Fleck, and Smith.

18 Fast and Fleck, “Unit values for import and export price indexes;” and Fast, Fleck, and Smith, “Unit value indexes for exports.”

19 This is the difference between the Energy Information Administration (EIA) Dealer Tank Wagon Sales (DTW) price and the average consumer price from EIA and U.S. Bureau of Labor Statistics data in May 2019.

20 Fast and Fleck, “Unit values for import and export price indexes;” and Fast, Fleck, and Smith, “Unit value indexes for exports.”

21 Camp, Havlin, and Stanley, "Automotive dealerships 2007–19;” and Havlin, “Automotive dealerships 2019–22.”

article image
About the Author

Michael Havlin
mhavlin@fmc.gov

Michael Havlin is an economist at the U.S. Federal Maritime Commission.

close or Esc Key