Department of Labor Logo United States Department of Labor
Dot gov

The .gov means it's official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Consumer Price Index

BLS handling of missing data to produce 2025 consumer spending estimates and use in price indexes products

NOTE

BLS has engaged the National Association for Business Economics (NABE) and the Committee on National Statistics (CNSTAT) to conduct expert panels regarding approaches to handling missing data to produce 2025 consumer spending estimates. This paper was prepared as background material for the expert panels.

Executive summary

From October 1, 2025, to November 12, 2025, a lapse in government appropriations halted data collection activities for the Consumer Expenditure Surveys (CE). Although the government received funding again on November 13, 2025, data collection for the surveys did not resume until December 5, 2025. As a result, no data were collected in October and November 2025.

The Bureau of Labor Statistics (BLS) uses CE data to publish estimates of expenditures and income by consumer demographic characteristics. Data are published at various item and geographic levels. BLS also provides microdata prepared for public use.

BLS uses CE data to calculate the Consumer Price Index (CPI). The CPI measures the change in the cost of living by measuring the change over time in the prices paid by urban consumers for a market basket of consumer goods and services. BLS uses CE data to determine the weights of different categories of goods and services in the CPI as well as to construct samples of places and consumer products for the price survey.

Missing data in October and November 2025 affects the quality of CE and CPI estimates. To mitigate the impact on the quality of estimates, BLS is considering intervention options. If an intervention is needed, there are three broad intervention options: weight adjustments, unit-level modeling, and domain estimation. BLS will assess these intervention options against three criteria: mitigating the impact on estimate accuracy, maintaining the statistical integrity of calculations, and ensuring timeliness of publications.

The following summarizes the BLS current assessment of these options based on simulations and operational considerations. However, these findings are not intended to presuppose a preferred approach. Rather, they are provided to inform the public’s assessment of the tradeoffs involved. BLS seeks input on the relative importance of accuracy, integrity, and timeliness, the appropriateness of the methods considered, and whether alternative or hybrid approaches should be pursued.

Based on data simulations using historical data, BLS determines weight adjustments will perform well in mitigating the impact on estimate accuracy under the scenarios considered. Unit-level modeling or domain estimation show potential for further gains in accuracy, but would require additional development, testing, and validation. These requirements may affect the ability to meet current publication timelines.

Contents

Overview

To understand the impact of the missing data, it is helpful to begin with an overview of the surveys, methods, and products. This section includes an overview of the Consumer Expenditure Surveys (CE) and the Consumer Price Index (CPI).

Overview of the Consumer Expenditure Surveys

The CE are nationwide household surveys designed to collect expenditure data that measure how U.S. households spend their money. The CE consist of two separate surveys, a quarterly Interview Survey and a weekly Diary Survey. The U.S. Census Bureau collects the data for both surveys under contract with BLS and has a target population of the U.S. civilian noninstitutional population, which we refer to as the “U.S. population.”

The Interview Survey collects data on major or recurring expenditures, such as rent, mortgage, and car payments. Its design is a rotating panel survey, where a consumer unit (CU) completes one interview every 3 months, for 4 quarters.[1] Each quarterly interview captures expenditures in a 3-month recall, so expenditure data are recorded directly after the quarterly expenditures occur.

The Diary Survey is designed to collect data on smaller or frequently purchased items, such as groceries and apparel, that would be difficult to recall for a 3-month period. Its design is a 2-week reporting period, where a CU completes one diary per week reporting all their expenditures for that week. The expenditures occurring in each of these weekly diaries are assigned the month that the diary-keeping period began.

Each responding CU is weighted to represent a portion of the U.S. population. The weight for each responding CU comprises a base weight (which is the inverse of its probability of selection), a nonresponse adjustment factor, and a calibration adjustment.[2] Weights are assigned to CUs on a quarterly basis, and the weights sum to the total U.S. population each quarter. Using the weights, the BLS estimates average annual expenditure per CU at varying levels of detail (e.g., the highest level of aggregation is total expenditures and the lowest level of aggregation is a category such as “eggs”).

BLS calculates annual expenditure estimates using a standard weighted average formula, adjusting for the use of 4 quarters of data (i.e., in the formula, quarterly population weights are divided by 4). To create calendar-year estimates based on the date of expenditure from the 3-month recall in the Interview Survey, further adjustments are made to factor in the number of months that are in scope for the calendar year. This adjustment is not necessary for the Diary Survey because all expenditures are assigned to the months within the calendar year. However, because expenditures are based on weekly data, they are multiplied by 52 to create annual estimates.

For expenditures captured in both surveys, the best source (Interview or Diary Survey) for each expenditure category is selected to create integrated annual total estimates for the CE’s average annual expenditure tables[3]. In 2024, there were 189 Universal Classification Codes (UCCs, the lowest level of expenditure category detail estimated in the CE) sourced from the Diary Survey, making up 19.8 percent of total expenditures in 2024 ($15,565 of $78,535). 392 UCCs sourced from the Interview Survey, making up the remaining 80.2 percent.[4]

BLS calculates aggregated measures of spending and income for publication in tables and databases. Data are produced at various levels of item and geographic detail, demographic groups, and across multiple time spans.

A feature of the CE data is the rich set of demographic characteristics collected with expenditure and income information. BLS publishes statistics for 17 demographic groups (like housing tenure), resulting in estimates for over 100 demographic characteristics (like homeowners with and without mortgages and renters). Providing greater demographic detail, BLS produces tables for eight combinations of demographic groups (like housing tenure by region) which results in additional detail for 240 demographic characteristics (like renters in the Midwest). The extensiveness of demographic characteristics complicates the missing data problem.

The public-use microdata files (PUMD) are also published annually with the CE data release. While the tables provide detail, the PUMD allows flexibility to generate data for demographic characteristics of interest. The PUMD are a subset of microdata files with additional perturbations applied for disclosure avoidance.

Overview of Consumer Price Index

The CPI is a measure of the average change over time in the prices paid by urban consumers for a market basket of consumer goods and services. Household spending weights are used to average the price changes in component goods and services across the market basket. BLS calculates spending weights using CE microdata.

BLS publishes many measures of consumer price change, for different items, geographic areas, and populations. BLS calculates indexes for 7,776 item-area combinations (basic indexes). BLS annually updates population-specific market baskets using two demographic groups: All Urban Consumers (CPI-U) and Wage Earners and Clerical Workers (CPI-W).

BLS produces another measure of inflation, the Chained CPI-U (C-CPI-U). In contrast to the CPI-U, the C-CPI-U is designed to capture consumer substitution that occurs across items, due to changing relative prices of the items. Historically, the average annual increase in the C-CPI-U is about 0.2 percentage points lower than the CPI-U.[5] The C-CPI-U uses the Törnqvist price index formula to aggregate the basic indexes into the all-items, U.S. City average estimate. The Törnqvist formula requires an expenditure estimate for both months over which price change is measured. The use of monthly spending weights is a key feature of the C-CPI-U, in contrast to the CPI-U and CPI-W, products which are estimated with annual spending weights.

The CE microdata required to produce monthly spending weights are not available to BLS in real-time.[6] Hence, BLS publishes a preliminary version of the C-CPI-U in real-time using the constant elasticity of substitution price index formula and annual expenditure weights that are available at the time of publication.[7] BLS subsequently revises this preliminary version with a final version 10-12 months later when the requisite monthly expenditure data become available. For example, the January 2025 C-CPI-U was released as a preliminary estimate with January 2025 CPI-U indexes in February 2025. Later, BLS released the final version of the January 2025 C-CPI-U in February 2026.

BLS uses CE microdata to produce monthly and annual weights. Index aggregation requires 7,776 spending weights to calculate the CPI-U, CPI-W, and the C-CPI-U. BLS uses smoothing processes to reduce volatility in those weights. To smooth annual weights, BLS uses 4 quarters of data to composite estimate the weight for an item in a basic area with a broader geographic area definition.[8]

BLS uses a different process to smooth monthly weights. The monthly weights that enter the Törnqvist in calculation of the Chained CPI derive from area allocations of national item month expenditure estimates. Sample sizes in CE are not sufficiently large to produce reliable estimates of the monthly expenditure for all 7,776 basic index cells without use of a variance reduction technique. BLS uses CE data from 11 months prior to the reference month to define smoothed geographic allocation fractions for each item (namely, the average share each area contributed to national expenditures on a given item in the 1-year period ending in the reference month).[9]

While BLS also uses CE microdata to select a sample of establishments and item categories for many categories, the focus of this report is the CPI weights.

Impact of missing consumer expenditure data

The 2025 lapse in appropriations had different impacts on the two Consumer Expenditure Surveys. In turn, the missing CE data have downstream impacts on the CPI annual and monthly weights. This section will describe the effects of the missing data for each survey and the resulting publishable estimates.

Interview Survey impact

As described earlier, the Interview Survey relies upon CUs to recall purchases for the previous 3 months. Because interviewing did not occur in October and November 2025, the amount of collected expenditure data for July, August, September, and October 2025 was affected. Furthermore, the effect was not evenly distributed between the months. Relative to the survey design, 33 percent fewer expenditures are expected in July and October 2025, whereas 66 percent fewer expenditures are expected in August and September (table 1).

Table 1: Impact on reported expenditures by expenditure and collection months (Interview Survey)
Expenditure Month Collection month Impact on Number of Reported Expenditures

Jan

Feb, Mar, Apr No impact

Feb

Mar, Apr, May No impact

Mar

Apr, May, Jun No impact

Apr

May, Jun, Jul No impact

May

Jun, Jul, Aug No impact

Jun

Jul, Aug, Sep No impact

Jul

Aug, Sep, Oct* 33 percent reduction

Aug

Sep, Oct*, Nov* 66 percent reduction

Sep

Oct*, Nov*, Dec 66 percent reduction

Oct

Nov*, Dec, Jan 33 percent reduction

Nov

Dec, Jan, Feb No impact

NOTE: * = Missed collection month

Diary Survey impact

The Diary Survey is a 2-week survey where two weekly diaries are placed with respondents, then picked up 2 weeks later. With the lapse in appropriations, diaries could not be placed in October and November and as a result, there are no expenditure data for October or November collected in the Diary Survey. Because the Diary Survey is a one-time collection, rather than the panel design of the Interview Survey, the sampled addresses for these months were never interviewed.

An additional impact of the lapse in appropriations was that some September diaries that were placed later in the month were not collected in October. Because not all September diaries were completed, the response rate was lower in September than usual (30 percent in September 2025 compared with 43 percent in September 2024). BLS could address lower response rates using existing methods of nonresponse weighting for this month.

The impact of the lapse in appropriations on the CE Diary Survey data collection is illustrated below (table 2). Months January through August and December are unaffected by the missing data, there were reduced response rates in September, and there were no responses in October and November.

Table 2: Impact on reported expenditures by expenditure month (Diary Survey)
Expenditure month Impact on number of reported responses

January - August

No impact

September

Reduced response rate

October

No responses

November

No responses

December

No impact

Calibration weighting for CE is a quarterly process that occurs separately for each survey. This process calibrates the nonresponse-adjusted base weight, which produces CU full-sample weights and additional replicate weights. The calibration weighting process ensures that estimates produced with the full-sample weights are consistent with current U.S. population totals, which are updated quarterly. The current U.S. population groupings are for age, race, household tenure (owner or renter), division of the country, urbanicity (urban or rural), and ethnicity (Hispanic or non-Hispanic). Although this process occurs separately for each survey, both surveys use the same quarterly population counts to ensure each survey’s weights are consistent with population total estimates.

During a typical quarter, calibration weighting calculates CE population weights that sum to a quarterly U.S. population total. As the fourth quarter 2025 was an atypical quarter due to the missing October and November data, the calibration process required adjustments. BLS is adjusting the calibration processes so the responding households in December 2025 would be calibrated to itself only, by using one-third of the quarterly population totals, and not to the full quarterly population totals. In sum, BLS procedures to adjust for missing data were insufficient to address the extent of missing data due to the lapse in appropriations and adjustments were needed.

Consumer Price Index impact

Missing CE data in October and November 2025 affects spending weights for the CPI. As an annual practice, each January, BLS updates spending weights used to calculate the CPI-U and CPI-W to reflect consumer spending. These updated spending weights help ensure that the CPI weights reflect the latest data available on consumer spending behavior. With the release of January 2027 indexes, BLS plans to update spending weights used to calculate the CPI-U and CPI-W to reflect consumer spending in 2025. Since consumer expenditure data from July through November 2025 are affected, adjustments are needed to calculate annual spending weights for the CPI.

With the release of July 2026 indexes, BLS plans to finalize the C-CPI-U for the third quarter 2025 (July, August, and September). With the release of October 2026 indexes, BLS plans to finalize the C-CPI-U for the fourth quarter of 2025 (November, December). BLS did not publish October 2025 preliminary C-CPI-U values due to the lapse in appropriations.[10] This was because there was missing price data. Therefore, the October 2025 final C-CPI-U index will not be published, due to the lack of both price and expenditure data.

Published Estimates impact

If BLS did not adjust for the missing data, annual consumer spending estimates would not be reflective of a full year of spending, and the CPI spending weights would be incomplete. As described above, there are no October and November data to calibrate CE weights to known totals. Similarly, there are no CUs interviewed whose weights can be adjusted using a non-interview adjustment factor. The result is biased estimates of spending.

Measures of price change would also be biased. Annualized spending shares for the CPI-U and CPI-W would reflect less than a full year of spending. Monthly spending shares for July, August, and September would reflect lower spending on items sourced from the Interview Survey. Monthly spending shares for October and November would reflect no spending on items sourced from the Diary Survey.

BLS simulated the effects of missing data by replicating the 2025 conditions in datasets for 2022 through 2024. Portions of the Interview Survey data were removed for the unit-level modeling and weight adjustment methods. All Interview Survey data for June through October were removed for the domain estimation method. Data for items sourced from the Diary Survey were removed in the months of October and November.

To demonstrate the impacts of missing data, chart 1 shows the monthly relative importance of owner’s equivalent rent in the CPI market basket using simulated data in 2024 compared with published data. The simulated weight of owners’ equivalent rent is lower than the published value in July, August, and September. Conversely, the simulated weight is higher than the published value in October and November.

Based on simulated data from 2022-2024, the impact on spending results in upwardly biased measures of inflation. Table 3 displays the impact on the annualized change to the final Chained CPI-U simulating the missing data in the last three years. On average, inflation would have measured 0.58 percentage points higher than the published measure.

Table 3: Final C-CPI-U, July – December annualized percent change, published and missing data simulation, 2022-2024
Year Published Missing Data Simulation Absolute Difference (Published – Missing Data)

2022

0.51 1.28 0.77

2023

0.49 0.93 0.44

2024

0.57 1.1 0.53

The PUMD will also reflect missing data. Data users will need to develop a method to adjust for missing data to suit their purposes.

Criteria for options

BLS developed criteria against which options can be evaluated, to provide accurate, objective, relevant, timely, and accessible statistics. BLS applies statistical data quality principles provided in guidance from the Office of Management and Budget (for example, OMB Statistical Policy Directives) and the National Research Council’s Principles and Practices for a Federal Statistical Agency[11]. In addition, BLS established a common foundation of data quality considerations using the Federal Committee on Statistical Methodology’s (FCSM) data quality framework[12]. From these principles, BLS determined criteria to evaluate options for addressing the missing data. The selected option should mitigate the impact on estimate accuracy in the face of missing data, while maintaining the integrity of calculations. There can be a trade-off between the accuracy and integrity of an option and the BLS ability to ensure the timeliness of publications. These three criteria (accuracy, integrity, and timeliness) must be weighed together.

Mitigating the impact on estimate accuracy

Accuracy is defined as the closeness of an estimate from a statistical product to its true value. While the true value is not known, data simulations can approximate the impact. Comparisons with other sources can also be helpful to estimate bias relative to the true value. The measures BLS are considering when assessing accuracy is average annual expenditure mean (all consumer units) and the 12-month change in the final C-CPI-U (All items, U.S. city average). When comparing options to address missing data, BLS aims to minimize bias and maximize precision of these target estimates. For context of typical measures of bias, a 2003 study of bias in the CPI estimated the effects of quality change and introduction of new goods had the largest effect, around 0.3 percentage points per year.[13] While BLS has made methodological improvements since then, this figure gives a sense of scale.

Furthermore, to maintain the accuracy of the CPI-U and CPI-W, the annual weight update should not be delayed. There is generally an upward bias in Laspeyres price indexes as the age of spending weights increases relative to the price index reference period[14].

Maintain integrity of calculations

The FCSM data quality framework includes integrity as a dimension of quality change, and is composed of scientific integrity, credibility, computer and physical security, and confidentiality.[15] To maintain scientific integrity and credibility, it is imperative BLS produce estimates without error given a defined methodology. BLS rigorously tests all software systems, methods, and procedures used to produce official statistics. All presented options are interventions in the existing production systems, since the extensiveness of the missing October and November 2025 CE data is unprecedented. As such, BLS must extensively test the selected option to ensure quality assurance standards are met. The further a method deviates from existing production systems, the more testing and quality assurance is required, increasing the resource cost and time to implement. Fewer intervention points require relatively less testing.

Ensure timeliness of publications

Timeliness is defined as the length of time between the reference period of the consumer experience and the publication date of the statistical product. Table 4 lists the scheduled publication dates for statistical products affected by the missing CE data. BLS strives to maintain all publication dates since statistical policy directive three requires timely release of data.[16] Maintaining the publication dates in table 4 constrains the options available to BLS. BLS could continue researching options so that a broader set is available should missing data occur in the future. 

Table 4: Scheduled publication date for statistical products impacted by the missing CE data
Statistical Product Publication Date

Final C-CPI-U, July-September 2025

August 12, 2026

Consumer spending 2025

October 29, 2026

Final C-CPI-U, October-December 2025

November 10, 2026

CPI-U and CPI-W, January 2027

February 2027 (date TBD)

Options considered

BLS is considering three broad options to account for the missing data: weight adjustments, unit-level modeling, and domain estimation. Weight adjustments for complete observations can compensate for the missing months of data collection. Alternatively, it is possible to consider model-based estimation of missing expenditures either at the unit-level (such as eggs for a CU) or domain level (such as eggs for total consumers). Models require predictor variables, and so model-based estimation is only as successful as the usefulness of known predictor variables.[17] Ultimately, as the complexity of a model increases to better mitigate impacts to estimate accuracy, BLS must balance against criteria to maintain integrity of calculations (i.e., to get it right) and ensure timeliness of publications.

Weight adjustments

Since data were collected for all expenditure reference months in the Interview Survey, a weight adjustment can compensate for the missing data. Weight adjustments would be applied to the estimates after the calibration weights are constructed. Furthermore, weight adjustments would be applied to microdata to produce BLS outputs, and not to PUMD.

For the Interview Survey, the weight of CUs can be adjusted by multiplying by 3 or 3/2 to adjust for 2/3 or 1/3 data missing, respectively. This option assumes that the responding CUs interviewed in August, September, December, and January report the month of expenditures accurately. If respondents do not report the months of expenditures accurately, this option could exacerbate the effect of that error. For example, if respondents are more likely to assign an expenditure to the most recent month in the recall period, using only interviews conducted in December to estimate September expenditures could lead to underreporting.

For the Diary Survey, weights for responding CUs can be adjusted to compensate for the missing data in October and November. Under the premise of temporal, 1-month autocorrelation, a reasonable weight adjustment for Diary Survey is to double the weights of CUs that completed diaries in September and December to adjust for missing October and November data. This option assumes that the September and December expenditures reported by CUs are the most like expenditures that would have been reported in October and November. There is an increased risk that seasonal patterns that would have occurred for some expenditures in October and November will not be captured in the mean estimate using the Diary Survey weight adjustments. Due to the lower sample size for the Diary Survey, the proposed adjustments to the data will likely increase the variances and standard errors of expenditures.

These weight adjustments for the Interview and Diary Surveys require additional assumptions. The first assumption is that the observed and missing data are identically distributed for all consumer expenditures of interest in completed and missing months. The groups that receive weight adjustments are meaningfully defined based on socio-demographic population characteristics and/or the spatial adjacency of the geographic areas for which data are collected. Also, the rotating panels of respondents assigned for each month of data collection must represent the same population within each domain.

While weight adjustments could incorporate modeled information based on predictors, BLS proposes one set of weights for all expenditure estimates of interest. While this addresses integrity and timeliness criteria (expediency in producing all required estimates using a single set of weights), it comes at the potential cost of accuracy by neglecting possible temporal differences of consumer expenditures due to seasonality and trends. This may introduce estimation bias.

Another outcome of the weight adjustment approach is the potential for increased variance. Direct sampling variance estimation will depend on the inflated weights for the observed data. If the initial survey weights are variable for the observed data, the inflated weights will be even more variable making the inflated estimates more inefficient (i.e., higher variance). This is mitigated by weighting calibration.

Effect of weight adjustments

At the time of this report (May 2026), processed 2025 calendar year data were not fully available to use in validating methods. To best approximate the effect of the missing data and subsequently the effectiveness of the options, data were simulated. Weight adjustments were applied to the simulated 2024 data to compare differences in means, standard errors (SEs), and relative standard errors (RSEs) to the published 2024 calendar year.

Overall, the simulated 2024 data did not have much, if any, effect on demographics, as the changes between these datasets were negligible. For example, the published data population count (in number of consumer units) was about 135.8 million. In the simulated data, after employing the adjustments, this same population count rises from about 135.8 million to 136.0 million. Similarly, the percentages for which various demographic groups account (e.g., housing tenure, race, and ethnicity) ranged between a difference of -0.2 and 0.2 percentage points. This is not surprising since the weights are calibrated to select population targets, including housing tenure, race, and ethnicity.

Unlike demographics, there are more meaningful differences in the expenditures between the simulated and published data. Chart 2 shows the percent differences between means at the UCC level calculated with the simulated 2024 data compared with the published 2024 data, grouped by level of difference. The change in means resemble a symmetric and unimodal distribution, with the most noticeable decline in means at 100.0 percent for digital video players and recorders, and the most noticeable increase in means of 133.8 percent for new motorcycles – two expenditures with a relatively low percentage of CUs reporting them each year (in production data, 0.01 and 0.04 percent of consumer units reported these expenditures, respectively).

Chart 3 shows the percent differences between standard errors (SE) calculated for the production and simulated datasets. [18] Changes in SEs are more pronounced, which is expected because of lower sample size resulting from the missing data. However, an increase to the SE was not consistently observed for all expenditures, nor are all the decreases inconsequential. The observed SE changes ranged from 221.3 percent, for septic tank cleaning (renter) to -90.0 percent, for men’s costumes.[19] SEs that were lower in the simulated data suggest that these expenditures were more variable in October and November 2024 than in the rest of 2024. If this peculiarity of the data is common for these months, then we could expect a similar pattern for the 2025 data; however, by definition, there will not be any way to measure the true effect of the missing data in 2025 without known expenditures available for comparison.

Chart 4 shows the percent differences between the relative standard errors (RSE) calculated for the production and simulated datasets. Changes in RSEs are similar in magnitude to those of SEs, not surprising given that RSEs are the ratios of SEs to means, and means changed little compared to SEs. Men’s Accessories saw the most noticeable decline in RSEs, at -50.5 percent. Conversely, material for patio, walk, fence, drive, masonry, brick, and stucco had the most noticeable increase in RSE, at 200.4 percent.

BLS considers mean estimates with RSEs of 25 percent or more to be unreliable and flags those estimates in published consumer expenditure tables to inform data users of the high RSE estimates. Therefore, one notable comparison to make between the production and simulated datasets is the number of expenditures with RSEs that exceed this threshold. In the production dataset, there were 144 expenditures with RSEs greater than 25 percent at the most granular publication level. In the simulated dataset, 157 expenditures have RSEs greater than 25 percent. Notably, some expenditures had RSEs greater than 25 percent in the production dataset that were less than 25 percent in the simulated dataset.

The above charts and discussion summarize the differences at the most granular level of detail available in the CE data products (estimates at the UCC level), which are more variable than the estimates at higher levels of aggregation the CE produces for the news releases. The higher level of aggregation mitigates some of the effects seen at the lower level of detail.

Table 5 shows the same information as Charts 2, 3, and 4, but at aggregated estimates to simulate higher level results. For example, instead of looking at the statistics for Fresh milk, all types; Butter and margarine; Cheese; Ice cream and related products; and Other dairy and related products each individually, the total expenditure estimates are summed as “Dairy products” and the calculations for SEs and RSEs are based on the summed calculations. Data at higher levels of aggregation have more reports and are less variable. As a result, the comparisons between the production dataset and simulated dataset at a higher level of aggregation have a narrower range of differences than the comparisons at the lowest (UCC) level.

 Table 5: 2024 simulation impact at aggregate levels, percent differences in means, standard errors, and relative standard errors
Item Mean SE RSE

Average annual expenditures

-.54 12.26 11.76

Food

-.60 25.64 25.11

Food at home

-.45 23.68 23.37

Cereals and bakery products*

-.77 28.42 27.31

Meats, poultry, fish, and eggs*

-.91 18.62 17.56

Dairy products*

-.47 16.67 16.28

Fruits and vegetables*

.53 22.51 23.30

Other food at home*

-.46 25.77 24.92

Food away from home

-.83 16.67 15.19

Alcoholic beverages

2.46 12.64 15.35

Housing

-.39 4.42 4.25

Owned dwellings

-.34 7.41 6.75

Rented dwellings

-1.12 11.11 10.11

Other lodging

2.25 3.68 6.17

Apparel and services

-2.18 6.42 3.44

Transportation

-.94 23.12 21.61

Vehicle purchases (net outlay)

-2.34 20.53 17.68

Gasoline and other fuels

.33 3.82 4.63

Other vehicle expenses

-.45 1.55 .79

Vehicle insurance

.19 6.98 7.60

Public and other transportation

.87 37.62 38.50

Healthcare

-.12 12.50 11.84

Health insurance

.06 8.33 8.64

Medical services

.90 21.40 22.42

Drugs

-.98 12.60 11.75

Medical supplies

-6.48 -34.35 -38.61

Entertainment

1.34 35.09 37.42

Personal care products and services

-.80 -1.06 -1.99

Reading

.11 -8.72 -8.76

Education

-2.46 8.15 4.98

Tobacco products and smoking supplies

-2.83 8.93 5.59

Miscellaneous

1.46 -6.26 -4.93

Cash contributions

-2.60 -1.46 -4.04

Personal insurance and pensions

-.53 7.10 6.42

Life and other personal insurance

-3.15 -.82 -3.96

Retirement, pensions, and Social Security

-.37 8.93 8.77

Contributions to retirement plans

-.90 5.23 4.27

Deductions for Social Security

-.01 16.24 15.97

NOTE: * = Diary item.

Comparing the results of the simulated 2024 data with the published 2024 data is a useful tool to anticipate the likely nature of the impacts to the 2025 data. However, it is not possible to know the full effect for the 2025 calendar year data until they are received and processed.

Given the simulation results for annual consumer expenditure data, BLS expects the weight adjustments to mitigate the impacts of missing data on the annual spending weights for the CPI-U and CPI-W. The simulated results for annual mean expenditures indicate a low risk of bias in the fixed quantity weights used in the Laspeyres formula.

The weight adjustments can also be applied to mitigate the impact of missing data on the monthly spending weights for the final C-CPI-U. At a monthly level, the Interview Survey adjustment results in a complete estimate of spending for every month. The mechanics of adjusting September and December weights will not achieve complete estimates of spending in October and November for items sourced from the Diary Survey. To achieve the spirit of the weight adjustment given the mechanics of the Törnqvist formula, BLS would instead copy the September expenditure data as October and copy the December expenditure data as November.

Based on simulated data from 2022-2024, the weight-adjustment method mitigates the impact of the missing data on measures of inflation. Table 6 displays the impact on the annualized change to the final C-CPI-U if similar data were missing in the last 3 years. On average, there would have been no change in inflation compared to the published measure (average difference is 0 percentage points).

Table 6: Final C-CPI-U, July – December annualized percent change, published and weight adjustment data simulation, 2022-2024
Year Published Weight Adjustment Data Simulation Absolute Difference (Published – Missing Data)

2022

0.51 0.44 0.07

2023

0.49 0.5 0.01

2024

0.57 0.63 0.06

Although there is little overall change on an annual basis, the results of the C-CPI-U data simulation vary by month. Chart 5 displays differences in the 1-month percent change of the published C-CPI-U and the simulated index results using the weight adjustment approach.

Unit-level modeling

An alternative to weight adjustments is estimation. Model-based estimation of missing respondent expenditures could help mitigate the impact on estimate accuracy and reduce variance. The extent of improvement depends on the usefulness of the predictors available, and appropriateness of the model. Custom-built models require an extensive investment and are not feasible without significant delay to publications (a year or more). BLS could leverage readily available software packages for large-scale, mixed-type data imputation. Extensive testing and quality assurance would still be required to maintain the integrity of estimates.

Unit-level is defined as monthly spending for every CU. This is the finest level of aggregation – monthly spending at the CU level, aggregated into roughly 700 UCCs. An advantage to unit-level modeling, is once monthly estimates of spending at the CU/UCC level is complete, all other estimates can be generated without further intervention.

Estimation at the CU level requires that there are some predictors known about the sampled unit. For the interview survey, approximately 75 percent of the sampled units were in the sample in a previous panel with some portion having a complete interview. These sampled units do have information associated with their consumer unit. For the sampled units that did not complete at least one previous panel or were new to the sample in October or November, BLS is limited to information associated with the survey frame – specifically, only information related to the geographic location of the unit (zip code, census block).

Unlike the Interview Survey, the Diary Survey is not a panel, so the only known predictors for the sampled unit are those variables available from the survey frame.

BLS has worked with Census in the past to link sample addresses to administrative records (e.g., federal tax information, property tax data); however, these linking projects take time and often have some limitations in their applications making them infeasible for the 2025 publication timelines. This is an area that could be explored further under a longer timeframe to increase the available predictors for model-based estimation.

Modeling CU-level expenditures could leverage temporal correlations. A simple approach that could be accomplished in less time than more sophisticated approaches is to fully replace impacted 2025 data with data from those months in 2024 with an inflation factor to account for price changes. While this might be a reasonable estimation approach at aggregate levels, this becomes questionable at the household level for several reasons. Household level purchasing decisions are heavily influenced by demographics and current economic conditions. Modeling shifts in demographic distribution shifts and changing economic conditions introduce complexities.

Effect of unit-level modeling

To demonstrate unit-level modeling, BLS simulated a simple method. As with the earlier simulations, data were removed to mimic the missing data. Expenditures from the same months in the prior year adjusted for inflation serve as the estimated expenditures in the current year. Based on simulated data from 2022-2024, the unit-level estimation produces higher levels of inflation, on average an increase of 0.07 percentage point per year (table 7).

Table 7: Final C-CPI-U, July – December annualized percent change, published and unit-level, 2022-2024
Year Published Unit-level modeling data simulation Absolute difference (Published –Data Simulation)

2022

0.51 0.61 0.1

2023

0.49 0.53 0.05

2024

0.57 0.5 0.07

The effect of unit-level modeling on C-CPI-U indexes varies by month. Chart 6 displays the difference in 1-month percent changes between the published C-CPI-U and the index calculated using monthly weight data simulated using unit-level modeling.

While this approach can produce estimates of inflation that are reasonably accurate (though typically not as accurate as the weight adjustment approach, and with an apparent positive bias), its use could produce undesirable effects when applied to the separate task of producing the array of expenditure estimates BLS publishes across a variety of demographic groups.

Domain Estimation

In contrast to modeling at the CU level, BLS also considered statistical modeling at aggregate levels. That is, conducting domain estimation for CE publication cells (such as renters in the Midwest) or CPI basic index cells (such as eggs in Boston). The average expenditure for a defined domain in October and November could be modeled based on any number of predictors. This approach is beneficial because it could better address any seasonality concerns by leveraging temporal correlations of estimates for consecutive months (provided a long time series is available) and socio-demographic covariates to produce smoothed estimates for the missing months. This approach could better mitigate the impact of missing data on estimate accuracy, and improve the estimate efficiency (i.e., lower variance) for the collected months.

Although this approach is promising, due to the large number of demographic characteristics for which BLS produces CE publications, it quickly becomes massive in scale. Additionally, the estimates are not independent (for example, demographic cross-tabulations must all sum to the same totals). Modeling multiple series of domain estimates together has the advantage of within-model calibration. Modeling the series separately is easier to scale by all the cross-classification of socio-demographic characteristics, but after-model calibration would be needed, and would add variance (limiting the usefulness of the model). To reasonably estimate aggregate-level defined domains, BLS would have to reduce the scope of its current publications and limit the level of detail published and subsequent output. Like unit-level modeling, this is an area that could be explored further under a longer timeframe.

To demonstrate domain estimation, BLS simulated two univariate methods: linear interpolation and Auto Regressive Integrated Moving Average (ARIMA). In each case, missing data were simulated as described earlier.

Linear interpolation uses the two boundary months with data surrounding the missing months and creates a straight line connecting the boundary points to compute values for the missing data. Linear interpolation was applied at the item-U.S. level by CE survey source. Interview Survey data for the months July through October were replaced by interpolation using boundary data from June and November. For Diary Survey data, interpolation was applied to the months of October and November using data from September and December. Data for items sourced from both Diary and Interview Surveys data were interpolated from July through November based on June and December data.

The ARIMA method is a simple standardized time-series model. For the ARIMA simulation, BLS utilized the X-13ARIMA-SEATS application with an (011) (011) model and automatic outlier detection selected to mitigate additive outliers and level shifts with a critical threshold of 5.0.  

BLS also considered a multivariate forecast model, which was in development for the purpose of improving the accuracy of preliminary estimates of the C-CPI-U. The model predicts the C-CPI-U by forecasting monthly expenditures at the item-U.S. level and allocating them to the 32 areas in the geographic structure by fixing all allocation fractions to their last known values.

More specifically, national item expenditures, Ei,tUS , are modeled together while using known national item price indices, Pi,tUS. This basic structure allows expenditures on different items to be interdependent and to be influenced by the real-time state of item prices, both natural expectations. The vector time series model used in this approach is called Echo State Networks (ESNs), a type of Recurrent Neural Network model. The ESNs have been modified and augmented to account for features of the problem:

  1. The goal is a weighted aggregate

  2. The relative scale of the errors varies considerably across items

  3. The model incorporates both exogenous and endogenous inputs

  4. Seasonality varies among items. The suite of ESN fitting and forecasting functions were built in-house to include these structural features.

Recurrent Neural Networks are nonlinear time series models that learn relationships between inputs and outputs (a.k.a. responses) through the states of a large set of intermediary variables (“neurons''). ESNs are a type of Recurrent Neural Network in which the internal network couplings are sparse and randomly drawn. Couplings between inputs and neurons are also randomly drawn, leaving only couplings between neurons and outputs to be estimated. This represents a dramatic computational simplification, as once internal and input-to-neuron couplings are fixed, output couplings may be estimated through (fast) regularized linear regression.

ESNs effectively trade the problem of fixing the individual entries in the three coupling matrices simultaneously with that of choosing reasonable values for a comparatively small set of scalar parameters that control general features of the network.

Let t=1, 2, …. T  label time points for which there are observations of an N-vector response of interest, yt, believed to be related to p-vector inputs, xt.

Equation 1: Basic form for an Echo State Network

Equation 1: Basic form for an Echo State Network

where ht is an m-vector containing the state of the neurons at time t, U, V, and W are matrices of sizes m by p, N by m, and m by m, respectively, and g(·) is the “neuron activation function" applied element-wise. The BLS model uses g(s) = tanh(s), a common choice. ρW is the spectral radius of the random matrix W.α and ν are tuneable scalars that lie between 0 and 1. Their values are set through cross-validation, along with scalars that control probability distributions for random matrices W and U, and those related to the regularization. (Ultimately predictions for the responses are averaged over an ensemble of ESNs generated by repeatedly drawing W and U pairs from their respective distributions, and learning the matrix V for each draw).

As noted, this method has been in development for the purpose of improving preliminary estimation of the C-CPI-U (preliminary estimates are necessary due to the 4-quarter time-lag monthly expenditure data are subject to). During preliminary estimation of the C-CPI-U for month t, one is missing all monthly expenditure data for the prior 10, 11, or 12 months depending on which month of the quarter the reference month t is but is in possession of all requisite item-area price indices. In the context of the 2025 lapse in appropriations, there is missing or incomplete expenditure data for 5 months. The forecast method may be applied to this problem by simply shortening the horizon to 5 months.

ESNs are flexible models that can learn complex temporal dynamics after a relatively simple training process. Certain choices related to this application to the C-CPI-U, such as the particular partition of items and relative penalization across items in the estimation of output weights are important to performance. The prototype models incorporate an item’s typical share of expenditures in the training period and its sampling error (which vary considerably across items) for this task. This aspect of the method is in development. Example results using several 6-month time periods in 2022, 2023, and 2024 as forecast intervals are given for prototypes to give a sense of performance.

The echo-state network research is also helpful to evaluate the appropriate level at which to define the domain. In this research it has been observed that although the allocation fractions are not static, empirically the dynamics at the level of the chained index are predominantly driven by the evolution in national-item expenditure estimates (in the study period analyzed). BLS is taking a time series modeling approach that aims to forecast monthly item expenditures at the national level and subsequently reconstruct a prediction of the C-CPI-U by distributing the forecast national item expenditures to the 32 areas using the last known set of item-area allocation fractions (i.e., those defined at the last time-point in the training period).

Effect of Domain Estimation

As discussed above, modeling all consumer spending estimates published on the BLS website would create challenges maintaining the integrity and timeliness of the estimates. However, to demonstrate the approach, BLS modeled some estimates that are sourced from the Diary Survey where there is a greater risk of seasonal patterns in the data not being reflected with the weight adjustment. Table 8 shows the effect of different domain estimation techniques on mean annual expenditures for a few selected items.

Table 8: Select CE Estimates, mean annual expenditures, published and domain estimation data simulation, 2024
Select Expenditure Item Source Published Linear- interpolation simulation ARIMA data simulation Weight adjustment

Poultry

Diary 267.37 262.81 276.24 263.32

Beef

Diary 408.24 392.74 385.75 393.5

Men’s Apparel

Diary and Interview 374.26 347.32 428.98 352.21

Boys’ Apparel

Diary 92.35 85.46 138.39 85.63

Women’s Apparel

Diary 631.89  629.71  661.70 630.93

Girls’ Apparel

Diary 111.78 109.72 105.65 109.93

Infants’ Apparel

Diary 77.32 70.99 88.29 71.12

Footwear

Diary 461.15  466.17 530.87 467.07

Toys, games, arts and crafts, and tricycles

Interview 171.96  174.22 208.89 174.56

Utilities, fuels, and public services (excluding septic cleaning)

Interview 4,727.81  4,716.32 4788.93 4,747.98

Based on simulated data from 2022-2024, the domain estimation method mitigates the impact of the missing data on measures of inflation. Table 9 displays the impact on the annualized change to the final C-CPI-U if similar data were missing in the last 4 years. Results are shown for three techniques to estimate item-U.S. monthly expenditures, allocated to the area level for computation in the Törnqvist index. On average, table 9 shows that ESN performed closest to the published data, while ARIMA and interpolation had  larger differences. It is important to note that interpolation had 3 years of comparisons while ARIMA and ESN only had 2 years. As with other simulations, the effect varies by month (chart 7).

Table 9: Final C-CPI-U, July – December annualized percent change, published and aggregate-level estimation data simulation, 2022-2024
Year Published Linear-interpolation data simulation ARIMA data simulation Echo state network simulation

2022

0.51 0.51 N/A* N/A*

2023

0.49 0.65 0.46 0.53

2024

0.57 0.52 0.49 0.57

Average difference

N/A 0.07 0.06 0.02

NOTE: * = There is insufficient historical data to produce estimates for 2022

A weakness of using forecasting methods (whether ESN, ARIMA or otherwise) to fill the missing estimates item-level U.S. estimates is that they do not make use of data ‘on the other side of the gap,’ nor any of the partial household data collected during the affected months. At least one path to improvement is apparent: item-U.S. level forecasts can be benchmarked to the levels on the right side of the gap, once the expenditure estimates derived from data collected under normal operations become available.

That is, the forecast horizon can be taken to be 1 month longer than the actual interval over which item expenditure estimates were affected by the shutdown (in this case one month longer than the 5-month period, July through November 2025). A simple way to benchmark each item’s forecast to their actual December 2025 estimates once they become available would be to allocate the difference in level between the forecast and actual December values linearly over the preceding 5 months. For example, if the discrepancy between the forecast December expenditure for item i is ΔEiUS, one-sixth of the difference can be added to the expenditure estimate forecast for the first month, two-sixths to the value forecast for the second month, and so forth. By construction, the forecast values for December will adjust to exactly coincide with the actual known levels.

Conclusion

The lapse in government  appropriations resulted in missing consumer expenditure data in October and November 2025 which affects the quality of CE and CPI estimates. BLS is considering three broad intervention options to mitigate the impact of missing data on the quality of CE and CPI estimates. Each option represents a host of possible methods. The extent to which the integrity of calculations and timeliness of publications can be maintained depends on the specific method adopted. The greater the method complexity, the greater the risk to both integrity and timeliness.

BLS is providing this summary of options so that data users can be informed about the tradeoffs of each option. BLS research continues while obtaining feedback. Once a method is selected, BLS will inform users of the methods implemented on the BLS public website.

Table 10: Summary of Options by Evaluation Criteria
Option Accuracy Integrity Timeliness

Weight adjustment

Data simulations produced the most accurate average annual expenditure mean (all consumer units) and annualized 12-month change in the final C-CPI-U (All items, U.S. city average) The weight adjustments described can be implemented with least intervention in and deviation from the production systems. All publication dates can be met.

Domain estimation – unit level

Extensive research is needed to model unit-level (CU). There are few predictor variables available to model Diary Survey household characteristics. The methods will be developed outside the production system. Extensive testing and validation of the methods is needed. It is straightforward to insert the modeled data into production systems. It is unlikely publication timelines can be met. Off-the-shelf models will produce quickest results. However, research into predictor variables could be time consuming.

Domain estimation – aggregate level

Aggregate level domain estimation can mitigate the impact to data quality due to the missing data. Additional research is needed to fine-tune estimation methods and gain precision. The methods will be developed outside the production system. Extensive testing and validation of the methods is needed. Because of multiple points of intervention in the production system, inserting the modeled data is less efficient at the aggregate level than at the unit-level. It may be feasible to meet publication timelines if no additional research is conducted, the selected method is relatively simple to perform and test, and the scope of consumer spending estimates is reduced significantly. Publication deadlines would be extended depending on the extent of additional research, method complexity, and scope of consumer spending estimates.

Last Modified Date: May 1, 2026



[1]A consumer unit (CU) is defined as a group of individuals within a household that are related by blood or legal arrangement or who share major expenses. See “Consumer unit,” Glossary (U.S. Bureau of Labor Statistics).

[3]There are several criteria used to determine the best source. If there are enough observations in both surveys, the survey is selected that generates the highest mean. Additional consideration is given to seasonality. For more details see the Creech and Steinberg, “CE Source Selection for Publication Tables”, 2011 Consumer Expenditure Survey Anthology.

[4]Counts of UCCs based on the 2024 published tables and the “Source Selection file” that can be found described and linked on the PUMD Documentation page.

[5]The frequently asked questions for the C-CPI-U documents the difference between the CPI-U and C-CPI-U is on average 0.2 percentage points per year, though there is variation in any given year.

[6]That is, with a 1-month lag.

[7]The formula used to calculated preliminary versions of the C-CPI-U is described in Klick (2018).

[8]The BLS Handbook of Methods describes the CPI-U and CPI-W weight smoothing process in the section “Index Calculation.”

[9]The BLS Handbook of Methods describes the C-CPI-U weight smoothing process in the section “Estimation of monthly expenditures at the basic level.”

[10]The BLS website contains information about the impacts of the lapse in appropriations on the CPI.

[12]The Federal Committee on Statistical Methodology created a data quality framework.

[13]Lebow, David E. and Jeremy B. Rudd (2003). “Measurement Error in the Consumer Price Index: Where Do We Stand?” Journal of Economic Literature. Vol. 41, No 1, pp 159-201.

[14]For example, Kurtzon (2018) finds nearly all the difference between the CPI-U and the final C-CPI-U is due to weighting differences.

[15]The FCSM data quality framework includes scientific integrity and credibility as a quality dimension. 

[16]According to statistical policy directive 3, "Changes in the release schedule may occur only if special, unforeseen circumstances arise. The releasing agency must announce and fully explain any schedule changes as soon as it has determined they are unavoidable.“

[17]Useful predictor variables can also be leveraged to improve weight adjustments. In other words, the more information BLS has about the missing data, the better any method can adjust for it.

[18]CE standard errors are calculated using replicate weights and the Balanced Repeated Replication method. More information can be found in the BLS handbook of methods.

[19]Not considering the expenditure of digital video players and recorders that had no reports in the simulated data, thus no standard error.