Department of Labor Logo United States Department of Labor
Dot gov

The .gov means it's official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Article
February 2023

Incorporating medical claims data in the Consumer Price Index

In this article, we examine the possibility of including medical insurance claims data in the calculation of medical price indexes. Our analysis indicates that indexes supplemented with claims data more accurately reflect changes in prices for physicians’ services and outpatient hospital services. In addition, we identify both advantages and disadvantages of using claims data in index construction.

The use of medical insurance claims data in the construction of medical price indexes presents many opportunities and challenges for the U.S. Bureau of Labor Statistics (BLS). This article describes a new methodology for incorporating claims data into traditionally collected data for the Consumer Price Index (CPI) for physicians’ services and the CPI for outpatient hospital services. We address the current state of CPI data collection for medical care and describe the claims data, their sampling and representativeness, the methodologies used to construct price indexes with the data, our initial results, and the next steps for this project.

Medical insurance claims data, which are collected from bills submitted by healthcare providers to a patient’s insurance provider, can help address issues with survey response rates, while also reducing index volatility. Because of their large amount, claims data capture more types of medical procedures and involve faster updating of the quantities of those procedures.

We have constructed price indexes by using a national medical claims dataset. We compare these indexes with the CPIs for physicians’ services and outpatient hospital services at the U.S. city-average level. Some of the practical issues we consider in the comparison include the following:

(1)   Claims data introduce a time lag that can affect when price changes appear in an index.

(2)   Weighting involves issues related to the use of both claims data and traditionally collected data.

(3)   There is substantial variation in prices in the claims data even after controlling for provider, insurer, and service.

Our results indicate that supplementing the medical price indexes with claims data will improve index accuracy by increasing the number of observations.

Background

In 2004, BLS examined the use of claims data in index calculation, determining that the benefits of such use were not sufficient to justify the high cost of purchasing the data. This determination held even when the practical issues related to using claims data in the production of official price statistics could be addressed.1 Since 2004, however, growing challenges to the traditional collection of price data for medical services—challenges that risk introducing bias in the official statistics—have led BLS to reconsider the use of claims data. Respondent cooperation among medical providers has declined, and the response rates for the household surveys used to select providers have fallen. (See chart 1.)

Declining response rates negatively affect the medical price indexes in two ways. First, because fewer responses mean the indexes are based on less data, lower response rates can cause greater variance in index estimates. When a sample is smaller, price movements of specific quotes in that sample have a greater impact on index values, and the price change in the sample becomes a less reliable indicator of the true price change in the full universe of prices. So, a smaller sample size makes it harder to distinguish real price trends from sample noise.

Besides increasing sample variability, declining response rates may change the sample in such a way that the overall sample of collected price data becomes less representative. For example, if response rates were to fall for one physician payer type (self-pay, private insurance, or Medicare) more than for other payer types,2 the overall index for physicians’ services would be biased against the one payer type for which response rates suffered disproportionately.

In 2018, in response to the problems posed by lower response rates, the CPI program started a new project to study whether claims data can be used to improve the accuracy of medical price indexes. In phase I of the project, we analyzed data from a single insurer, for a single city.3 The data had a 2-month lag. Considering the results of our analysis, we reached the following conclusions:

(1)   Even with a lag, the CPIs for physicians’ services and outpatient hospital services can be improved by supplementing traditionally collected data with alternative data. As the lag increases, the accuracy of indexes calculated with supplemented data may decrease.

(2)   There are discrepancies between geographic area-level data and national-level data with respect to the acceptable length of a lag.4

For phase II of this project, BLS purchased medical claims data that cover all geographic areas of the CPI for a 4-year period. The data have a 3-month lag, which is 1 month longer than the lag of the data analyzed during phase I. Unlike the latter data, which are for a single insurer, the data for phase II include multiple health insurance providers, which are masked by the data provider.

During phase II, we determined that more research was needed before we could confidently propose incorporating inpatient hospital data in the official estimation of the CPI. Because of patient characteristics or other factors that cannot be controlled for in using claims data, inpatient hospital prices can change drastically for the same service, resulting in high variation in prices even after controlling for several variables. For this reason, this article focuses only on the results for physicians’ services and outpatient hospital services.5

Medical care services in the CPI

Currently, BLS collects price data directly from providers in a resource-intensive collection process. Data collectors often encounter barriers to collection such as busy respondents, gatekeepers, and confidentiality concerns, and these barriers limit cooperation with and access to the doctor or other respondent who can provide the data we need. Hospitals usually require data-release approval from high-level officials (such as chief financial officers or chief operating officers), and data collectors may have to contact a third party that handles the hospital’s billing to gather the necessary price data. These barriers prolong the process of data collection, requiring more time and effort than is necessary to collect other data items, and may still result in a refusal.

Providers of physicians’ services are selected randomly on the basis of results from the BLS Consumer Expenditure Surveys, and providers of hospital services are sampled from a dataset of the American Hospital Association. CPI data collectors work with the selected provider to identify a specific service on the basis of the billing history of that provider and record the service’s price-determining characteristics (such as insurance provider and plan type). In subsequent months, the provider is asked to provide a price for the same service, holding the characteristics of the original bill constant. The price is defined as the total amount paid (or expected to be paid) to the provider, including payments from both the patient and the insurance provider.

For a given geographic area and a given item category (called an entry-level item (ELI)), the collected price data are used to form a price relative that measures the one-period change in price for that item–area combination. For this research, we focus on the ELIs for physicians’ services and the outpatient portion of the ELI for hospital services.6 We calculate the price relative for the ELI as follows:

where  and  are the prices of good or service j in, respectively, period t and the base period; and  is the item weight in the base period. The item weight is proportional to the share of expenditures on good or service j in the base period.7

The existing BLS methodology for collecting medical price data has some important advantages over the use of claims data. Under the existing methodology, the characteristics of each service are held constant in price collection and the data are published in real time. In addition, the data are intentionally designed to represent the universe of medical goods and services consumed by households.8 These features are generally not found in claims data, which represent only a subset of private insurance claims and are available only after a lag.

Data

BLS purchased data, beginning in 2015, from a national health insurance aggregator. The data are a convenience sample of procedures performed by healthcare providers and reimbursed by insurers. The data cover all CPI geographic areas.9

For each CPI geographic area, we select a sample of 200 physician National Provider Identifiers (NPIs) and 10 hospital locations. The sample is selected by using probability proportion to size (PPS) based on expenditure. Medical services are defined by using Current Procedural Terminology (CPT) codes, and each service is broken out by the insurance company that pays for the service. Ten medical services are sampled via PPS for each physician, and 100 services are sampled at each outpatient hospital. This procedure results in the collection of up to 2,000 unique observations for physicians’ services and 1,000 unique observations for outpatient hospital services per area, per month. For physicians’ services, we also control for the most common place of service associated with a given NPI/insurer/CPT combination.

Each combination of medical provider, insurance provider, and medical service (and place of service for physicians’ services) can be thought of as a single observation. Each observation is supplied with an average price, which is calculated across all transactions at the outlet for the specified medical service and health insurance provider. The average prices are used to calculate the unit-value indexes.10 Included in the monthly data is a quantity value that reveals the number of transactions used in creating the average price. The average price and quantity values are updated each month over a 2-year pricing period, for a total of 24 months.

A limitation of a unit-value index is that it is estimated under the assumption that the average price is calculated across homogeneous goods. In reality, considerable price variation may exist at the level of provider, insurance company, or service. This variation is driven by unobserved factors, and changes in the mix of those factors will be measured as a price change in a unit-value index. Current CPI methodology is designed to capture price change by holding all else equal, so having price changes driven by unobserved factors is not ideal.

Using claims data has several additional drawbacks. If a provider does not bill any claims for a given service in a given month, we must impute the price for that service. Insurance claims data do not represent the entire eligible CPI market basket because they do not include Medicare Part B or out-of-pocket payments. As a result, claims data cannot serve as a full replacement for data collection, lessening the potential cost savings.

One clear advantage of claims data over traditionally collected data is their vastly larger sample size. Because of this advantage, BLS can potentially produce medical price indexes at a finer level (regional medical price indexes, physician indexes by specialty, etc.).

Background on CPT codes

CPT codes were created by the American Medical Association (AMA) to describe medical procedures and standardize reporting.11 Currently, these codes are used by providers in medical claims processing and in data collection and analysis. There are four categories of CPT codes. Category I codes correspond to an established procedure or service and are grouped into subcategories based on procedure type and anatomy. The three additional categories of codes—Category II, Category III, and Proprietary Laboratory Analyses codes—are used to, respectively, measure performance, label new procedures, and describe proprietary laboratory procedures. CPT codes are updated three times a year by an editorial panel of clinical and industry experts. The codes are updated frequently so that emerging technologies and treatments can be added to the coding set in a timely manner.

Although CPT codes cover most physician services, they do not capture some services, as well as drugs and supplies that can be billed separately. These drugs, supplies, and additional services are covered by Healthcare Common Procedure Coding System (HCPCS) level II codes, which are maintained by the Centers for Medicare and Medicaid Services.12 CPT and HCPCS codes are used to control for specific services performed by physicians and outpatient providers and are captured in the medical claims data described earlier.

Comparison data

BLS purchased a second medical claims dataset that could be compared with the primary claims dataset. This second dataset, hereafter referred to as “comparison data,” is a fully separate dataset purchased from a data provider that is unrelated to the primary claims data (hereafter referred to as “medical claims data”). The comparison data have over 100 million claim lines for both physicians’ services and hospital services. BLS purchased 4 years of data, covering the period from January 2016 to March 2020. The items in the comparison data are defined similarly to those in the medical claims dataset—by identifying a particular combination of medical service and provider. However, unlike the medical claims data, the comparison data do not involve controlling for the insurer.

Each observation in the comparison data is associated with a monthly average price and the quantity of claims used to calculate the average price. The comparison data have a larger sample size than the medical claims data, but they have a much greater time lag (roughly 9 months). Because of this lag, the comparison data are not suitable for use in production, but their robust sample size makes them appropriate for comparison.

Sampling methodology

The medical claims data contain a sample of data rather than a complete data frame. We develop a sampling methodology that the data provider uses to select observations. We use PPS without replacement to select medical providers, and we use total revenue during the previous 12 months as a measure of revenue size in our probability calculations. Before BLS receives the medical claims data, the data provider removes claim-level outliers.13

Sample rotation

The medical claims sample is fully rotated every 2 years, with 25 percent of geographic areas rotated every 6 months. This rotation is more frequent than the 4-year sample rotation currently used in the CPI. Increasing the rotation frequency aims to pick up new services and CPT code changes, because we are unable to crosswalk these codes as new code sets are released. The more frequent sample rotation is an improvement over current procedure, which is limited by the cost of initiating a new sample.

The month preceding the rotation month is a link month used to create relatives for the rotation month. For example, in July, the data provider would provide new outlets and observations in the rotating geographic areas, along with average prices corresponding to April. The data provider would also provide prices for the new outlets and observations corresponding to March, the link month, so that BLS would have a March-to-April relative to use in April.

Sample representativeness

We compared the sample distribution of the medical claims data with the distributions of the comparison data and, if available, data from other third-party sources. The distributions of outlets and services for the medical claims data, the comparison data, and the data from third-party sources are close. The fact that two large data sources such as the medical claims data and the comparison data show similar distributions across provider types and services may suggest that these sources reasonably represent the market. It could also indicate that both datasets source information from overlapping data providers. CPI data seem to oversample the more common categories (as illustrated in the next two tables), which indicates that integrating claims data offers potential advantages.

Table 1 shows the distributions of outlet specialties for the medical claims data, the comparison data, and the CPI data. These distributions are compared with data from the American Association of Medical Colleges (AAMC) Physician Specialty Data Report.14 The columns for the medical claims data, the comparison data, and the CPI data show the percentage of observations in each physician specialty. The columns for the medical claims data and the comparison data cover the entirety of 2016 and 2017. The columns for the AAMC data indicate the percentage of active physicians in each specialty. The AAMC data are for December 2015 and December 2017, and the CPI data are for July 2016 and July 2017. As noted in table 1, the categorical divisions among the medical claims data, the comparison data, the AAMC data, and the CPI data are slightly different. Some of the divergence among the data sources could be driven by different specialty mappings.

Table 1. Distribution of physician specialties, by data source, 2015–17 (in percent)
Category2015 AAMC data2016 medical claims data2016 comparison data2016 CPI data2017 AAMC data2017 medical claims data2017 comparison data2017 CPI data

Allergy/immunology

0.52.11.00.40.52.11.10.3

Anesthesiology

5.10.76.80.15.00.76.80.2

Cardiovascular disease

2.71.72.4[1]2.61.82.5[1]

Dermatology

1.52.62.71.71.42.72.81.7

Emergency medicine

4.8[1]4.4[1]5.0[1]4.6[1]

Endocrinology and metabolism

0.7[1]0.9[1]0.8[1]1.0[1]

Family practice

13.69.69.528.513.48.99.426.8

Gastroenterology

1.71.82.4[1]1.71.72.4[1]

Internal medicine

13.36.67.130.613.06.27.434.4

Medical doctor—MD (NEC)

[1][1]2.8[1][1][1]1.9[1]

Neurology

1.52.1[1]3.61.52.1[1]3.6

Obstetrics and gynecology

5.14.17.71.65.04.57.72.6

Oncology—hematology

1.69.2[1][1]1.79.5[1][1]

Oncology—radiation

0.62.0[1]1.90.61.7[1]1.0

Other facility (NEC)

[1][1]2.0[1][1][1]2.5[1]

Other specialty

[1][1][1]6.5[1][1][1]4.3

Otolaryngology 

1.21.52.20.71.11.52.10.8

Pathology

[1]9.02.90.1[1]6.82.70.0

Pediatric medicine

6.95.27.99.06.76.28.29.1

Plastic/maxillofacial surgery

0.9[1][1]3.20.9[1][1]2.5

Psychiatry

4.40.71.71.44.30.91.92.9

Radiology

3.311.16.22.03.211.65.70.9

Rheumatology

0.63.2[1][1]0.63.6[1][1]

Single or multispecialty clinic or group practice

[1]6.42.7[1][1]6.32.3[1]

Surgery—general

2.90.7[1]2.82.80.6[1]2.2

Surgery—orthopedic

2.42.06.03.12.62.06.04.6

Urgent care facility

[1][1]1.1[1][1][1]1.5[1]

Urology

1.20.91.20.41.20.91.20.1

[1] Category does not exist for corresponding data source.

Note: The AAMC data are for December 2015 and December 2017, and the CPI data are for July 2016 and July 2017. Data in columns may not add up to 100 percent because smaller categories are omitted for brevity. AAMC = Association of American Medical Colleges; CPI = Consumer Price Index; NEC = not elsewhere classified.

Source: U.S. Bureau of Labor Statistics, Association of American Medical Colleges, and national health insurance aggregator.

The distributions for the AAMC data, the comparison data, and the medical claims data are close, except for measures of expenditure in the fields of pathology, anesthesiology, and radiology. According to these distributions, the CPI oversamples specialties in family practice and internal medicine, the largest categories, and undersamples specialties in radiology, pathology, and obstetrics and gynecology. The small deviations across the medical claims data, the comparison data, and the AAMC data suggest that the comparison data and the medical claims data may reasonably represent physicians’ services in terms of outlet specialty.

Physicians’ services and outpatient hospital services are identified by using CPT codes. The AMA manages CPT codes and groups similar codes into larger CPT subcategories. Table 2 shows the distribution of CPT subcategories by expenditure weight. Relative to the medical claims data and the comparison data, the CPI oversamples evaluation and management (E/M) services. This leads to the CPI undersampling the other categories. E/M services also represent the largest category of services in the comparison data and the medical claims data. The category is rather broad, covering everything from routine office visits to psychiatric care. BLS dependence on respondent cooperation may be the cause of this oversampling of E/M services. To minimize respondent burden, field staff often price a common or simple service, and this service is likely to fall under the E/M category.

Table 2. Distribution of physicians' services, by expenditure weight and data source, 2016 and 2017 (in percent)
CPT subcategory20162017
Medical claims dataComparison dataCPI dataMedical claims dataComparison dataCPI data

Evaluation and management

33.737.072.534.737.175.1

Anesthesia

0.36.50.00.26.40.0

Surgery

7.520.49.17.319.88.8

Radiology

14.07.84.514.67.73.9

Pathology and laboratory medicine

10.55.54.78.15.64.3

Medicine

9.412.57.29.212.46.2

HCPCS level 2

24.610.32.125.810.61.6

Note: CPT = Current Procedural Terminology; CPI = Consumer Price Index; HCPCS = Healthcare Common Procedure Coding System.

Source: U.S. Bureau of Labor Statistics, American Medical Association, and Centers for Medicare and Medicaid Services.

The expenditure weight for comparison data in surgery is higher than the corresponding weights for the other two data sources shown in table 2. The expenditure weight for HCPCS medical claims is likely overstated in our analysis. Working with the data provider, we found that our sampling procedure oversamples HCPCS level II codes, and we are currently exploring possible solutions to this oversampling.

Table 3 shows the distribution of outpatient hospital services by expenditure weight in the medical claims data and the comparison data. Here, a large expenditure difference is observed between the two data sources for surgery codes. Again, this difference is due, in part, to the oversampling of HCPCS codes. Apart from the surgery and HCPCS codes, the two data sources generally align with each other.

Table 3. Distribution of outpatient hospital services, by expenditure weight and data source, 2016 and 2017 (in percent)
CPT subcategory20162017
Medical claims dataComparison dataMedical claims dataComparison data

Evaluation and management

16.110.516.211.0

Anesthesia

0.00.10.00.1

Surgery

9.528.49.728.1

Radiology

23.918.823.118.3

Pathology and laboratory medicine

7.08.97.38.7

Medicine

13.813.213.813.2

HCPCS level 2

29.620.130.020.6

Note: CPT = Current Procedural Terminology; HCPCS = Healthcare Common Procedure Coding System.

Source: U.S. Bureau of Labor Statistics, American Medical Association, and Centers for Medicare and Medicaid Services.

Index methodology

In the research stage of this project, we experimented with several index methodologies,15 including methodologies for calculating a mix of chained and unchained indexes and superlative indexes. (See also appendix.) Our research primarily focused on two methodologies. The first methodology involves the calculation of a chained Törnqvist index:

where  and  are the prices for observation i in, respectively, period t and period t – 1; and  and  are the expenditure shares for observation i in, respectively, period t and period t – 1.

The second methodology involves the calculation of an unchained Lowe index:

where  is the average calendar-year quantity for observation i. We construct two versions of the Lowe index. The first version uses the base-year quantities for the entire 2-year period during which an observation is in the sample. We also construct an annually chained Lowe index that uses the average monthly quantity from the prior year. We refer to this methodology as a “mixed-quantity” methodology, because it uses the sampling-period quantities in the first year of pricing and an average of first-year quantities in the second year of pricing.

Using claims data in the calculation of a price index presents several challenges. There can be substantial variation in prices for a given service. Changes in the average price for a given provider–service combination can be caused by redrawing a sample from the same underlying price distribution rather than by a shift in the actual price distribution. This problem is particularly pronounced when the number of claims used to calculate an average price is small, because some additional price-determining characteristics (e.g., patient age or health) are either not observed or not controlled for in our methodology. The problem cannot be solved simply by controlling for additional variables, because doing so would increase the likelihood of the data containing no claims for a given provider–service combination in a month.

Although some theoretical work has addressed the effect of price dispersion on price indexes, this work assumes that an identical product is being sold at different prices because of market frictions.16 This formulation differs from the case of medical care, where differences in prices for a given provider–service combination are due to unobserved quality differences. For example, the average price of a given service in a given month can increase because the patients who seek that service during that month happen to be sicker. In cases in which “big data” (e.g., scanner data) are used to calculate price indexes, products are defined at a very granular level, and the calculation of average price does not involve averaging over potentially different products (as is the case with medical care).

If there are no claims for a sampled item in a given month, a price relative cannot be calculated. In this case, we impute the missing current-period price by using the aggregated price relative for the corresponding combination of ELI, geographic area, and data source in the given month. This procedure is consistent with standard CPI methodology.

BLS additionally removes outliers within the medical claims data at the average-price level and the price-relative level. At the average-price level, we set a minimum price of $1 for physicians’ services and outpatient hospital services, removing any observations below this price level. After an average price is removed, it is imputed. At the price-relative level, we calculate the average price change and remove any observations with a price relative of ±3 standard deviations.

Initial results

In this section, we first review unlagged indexes that show price changes based on when services were delivered rather than on when the data were received. In analyzing differences in the results, we examine various statistics to determine the proximity of the medical claims indexes and the CPI to the comparison index (i.e., the index calculated with comparison data). These statistics include the total index change from start month to end month, the annualized change during that period, the sum of squared errors (SSE) between the index being analyzed and the comparison index, and the SSE of the index relative between the index being analyzed and the comparison index.

In assessing index differences, it is important to analyze the SSE scores for both the index relative (how close the indexes are on a month-to-month basis) and the index value (how close the indexes are throughout the entire period). A one-time level shift in the index value of one index may result in a persistent difference between that index and other indexes. The SSE based on the index relative does not contain persistent differences when a level shift occurs.

Below, we begin by reviewing the index results for private insurance indexes, which are experimental price indexes created by using only private insurance claims data. Whenever possible, we use a CPI for private insurance rather than the published CPI that includes all payers. Then, we combine the private insurance indexes with CPI self-pay and Medicare data to create all-payer indexes.

Private-insurance-only indexes

Chart 2a shows the private insurance indexes for physicians’ services. These indexes are created only with private insurance data. As shown in the chart, the comparison index increases steadily over time, whereas the CPI for private insurance first increases quickly and then decreases, showing smaller price changes than the comparison index from 2018 to the end of the period. The mixed-quantity, Törnqvist, and Lowe indexes all track closely with one another and with the comparison index, although they grow more slowly in later years. However, the CPI for private insurance moves differently from the other private insurance indexes. This could be due to the brief rise observed at the beginning of the series,17 as well as to other payer types imputing prices within the private-insurance-only index.18

For the private-insurance-only indexes for physicians’ services, the annualized increase for the comparison index is higher (by 0.3 percentage point to 0.4 percentage point) than the increases for the other medical claims indexes. (See table 4a.) The comparison index increases at an annualized rate of 1.4 percent, compared with 1.1 percent for the medical claims mixed-quantity index. The medical claims unchained Lowe index has the smallest SSE score in terms of index value, and the medical claims chained Törnqvist index has the smallest SSE score in terms of index relative.

Table 4a. Index comparison for different sources and methods, private-insurance-only indexes for physicians’ services, January 2016–March 2020
Index by source and methodTotal change (percent)Annualized change (percent)SSE index valueSSE index relative (×104)

Comparison unchained Lowe

5.91.4[1][1]

Consumer Price Index

2.50.627753

Medical claims unchained Lowe

4.71.12442

Medical claims mixed quantity

4.81.12949

Medical claims chained Törnqvist

4.31.04340

[1] SSE scores are calculated relative to the comparison index.

Note: SSE = sum of squared errors.

Source: U.S. Bureau of Labor Statistics.

Chart 2b shows the published all-payer CPI for outpatient hospital services and the private-insurance-only comparison and medical claims indexes for the same services category. As seen in the chart, all indexes track closely with one another. (Outliers are removed from the insurer’s data; there are no outliers in the CPI or comparison data.)

Table 4b presents metrics for index total change, annualized change, and SSE scores for outpatient hospital services. Consistent with the patterns observed in chart 2b, these metrics indicate that the four noncomparison indexes perform fairly similarly to the comparison index. The all-payer CPI and the mixed-quantity index end the period at a higher point than the comparison index and have higher SSE scores, whereas the Törnqvist index and the unchained Lowe index match more closely the comparison index.

Table 4b. Index comparison for different sources and methods, all-payer Consumer Price Index and private-insurance-only indexes for outpatient hospital services, January 2016–March 2020
Index by source and methodTotal change (percent)Annualized change (percent)SSE index valueSSE index relative (×104)

Comparison unchained Lowe

13.63.9[1][1]

Consumer Price Index (all payers)

16.33.637539

Medical claims unchained Lowe

14.53.29442

Medical claims mixed quantity

17.63.927466

Medical claims chained Törnqvist

14.53.27648

[1] SSE scores are calculated relative to the comparison index.

Note: SSE = sum of squared errors.

Source: U.S. Bureau of Labor Statistics.

All-payer indexes

In this section, we construct all-payer price indexes by combining the private-insurance-only indexes with the CPIs for physicians’ services and outpatient hospital services for self-pay and Medicare payers. For example, by combining the medical claims unchained Lowe index for outpatient hospital services with the self-pay and Medicare index for outpatient hospital services, we create a hypothetical all-payer CPI that is supplemented with alternative data. The medical claims mixed-quantity index for outpatient hospital services is also separately combined with the CPI for outpatient hospital services. This model assumes that the medical claims data are representative of the private insurance market as a whole.

All-payer indexes are then compared with the comparison index and the CPIs. The comparison index also is combined with the self-pay and Medicare index in order to create a comparison all-payer index. The medical claims indexes and the CPIs are combined by using payer-type market-share data at the area level. Market shares are determined by using Small Area Health Insurance Estimates data from the U.S. Census Bureau. When the indexes are combined with data for the other payer types from the CPI (including self-pay and Medicare Part B data), the differences between them decrease. This is because the Medicare and self-pay portions of the indexes are based on the same data.

Chart 3a shows the comparison all-payer index and the medical claims all-payer indexes for physicians’ services. The all-payer CPI is the same as the official CPI. Throughout the period, there are only small differences between the comparison index and the medical claims indexes. From July 2017 onward, after exhibiting an initial rise, the all-payer CPI also tracks closely with the medical claims indexes, although slight gaps occur in two periods—one between March 2019 and September 2019, and another between September 2019 and February 2020. Once we lag the medical claims indexes (the CPI remains unlagged), these gaps shrink.

The difference in annualized changes between the all-payer indexes (0.0 percentage point to 0.2 percentage point) is smaller than the difference between the private-insurance-only indexes examined in the previous section. (See table 5a.) The medical claims mixed-quantity index has the smallest SSE score in terms of index value, and the medical claims chained Törnqvist index has the smallest SSE score in terms of index relative.

Table 5a. Index comparison for different sources and methods, all-payer indexes for physicians’ services, January 2016–March 2020
Index by source and methodTotal change (percent)Annualized change (percent)SSE index valueSSE index relative (×104)

Comparison unchained Lowe

5.31.2[1][1]

Consumer Price Index

4.21.08451

Medical claims unchained Lowe

4.71.11222

Medical claims mixed quantity

5.01.1925

Medical claims chained Törnqvist

4.41.01821

[1] SSE scores are calculated relative to the comparison index.

Note: SSE = sum of squared errors.

Source: U.S. Bureau of Labor Statistics.

Chart 3b shows the comparison all-payer index and the medical claims all-payer indexes for outpatient hospital services. Again, the indexes trend similarly, ending the period at nearly the same point, although the mixed-quantity index ends at a slightly higher point than the other indexes.

Again, in the case of outpatient hospital services, the metrics for index total change, annualized change, and SSE scores are similar. (See table 5b.) The Törnqvist index for outpatient hospital services has the smallest SSE score in terms of index value, and the CPI has the lowest SSE score in terms of index relative.

Table 5b. Index comparison for different sources and methods, all-payer indexes for outpatient hospital services, January 2016–March 2020
Index by source and methodTotal change (percent)Annualized change (percent)SSE index valueSSE index relative (×104)

Comparison unchained Lowe

16.93.7[1][1]

Consumer Price Index

16.33.612023

Medical claims unchained Lowe

17.83.95225

Medical claims mixed quantity

20.34.418238

Medical claims chained Törnqvist

17.63.94226

[1] SSE scores are calculated relative to the comparison index.

Note: SSE = sum of squared errors.

Source: U.S. Bureau of Labor Statistics.

Lag analysis

A key characteristic of alternative medical claims data is their lag. Any alternative data used to supplement traditionally collected CPI data require processing by a vendor. Claims data are based on fully adjudicated claims, which can take several months to finalize. This processing results in data that are lagged by several months before they can be delivered to BLS. Given current BLS methodology for calculating CPIs for physicians’ services and outpatient hospital services, this lag is a weakness of using alternative data. However, the CPI program currently uses lagged data for other items. In our market research, we found multiple sources of data with a lag ranging from 2 months to 9 months. This lag is counted from the date of service of the medical claim.

The claims data used in this project have a 3-month lag, so data for a doctor’s office visit with a date of service in, say, January would not be delivered to BLS until April. BLS would use these January data to calculate April indexes, showing December-to-January price relatives in the April index, which would then be published in May. This section examines the behavior of hypothetical all-payer CPIs that are based on medical claims data with a 3-month lag. These data are combined with unlagged CPI self-pay and Medicare data. The resulting all-payer indexes best represent how lagged claims data would look if used in the calculation of official CPIs.

The charts and tables presented below are based on medical claims data, comparison data, and CPI data at the national level. Chart 4a presents the CPI, the lagged medical claims indexes, and the comparison index for physicians’ services. As seen in the chart, the medical claims indexes and the comparison index trend similarly, with no major jumps. After exhibiting an initial rise, the CPI for physicians’ services runs below the comparison index and increases steadily toward the end of the period. This pattern may be due to the fact that, from April 2018 onward, BLS began using data from the Medical Expenditure Panel Survey to adjust the weights of payer types included in the CPI for physicians’ services. The resulting index better reflects current market weights by increasing the share of payments covered by private insurance relative to the shares of payments made by uninsured people and payments covered by Medicare Part B.

The comparison index grows at an annualized rate of 1.2 percent, which is slightly faster than the 0.9- to 1.0-percent rate for the medical claims indexes. (See table 6a.) The CPI for physicians’ services has an annualized growth rate of 0.9 percent, which is between the growth rate of the comparison index and the growth rates of the medical claims indexes.

Table 6a. Index comparison for different sources and methods, all-payer indexes for physicians’ services, medical claims data with a 3-month lag, April 2016–March 2020
Index by source and methodTotal change (percent)Annualized change (percent)SSE index valueSSE index relative (×104)

Comparison unchained Lowe

4.91.2[1][1]

Consumer Price Index

3.60.98149

Medical claims unchained Lowe (lagged)

3.91.01922

Medical claims mixed quantity (lagged)

4.01.01722

Medical claims chained Törnqvist (lagged)

3.50.92921

[1] SSE scores are calculated relative to the comparison index.

Note: SSE = sum of squared errors.

Source: U.S. Bureau of Labor Statistics.

Chart 4b shows the CPI, the lagged medical claims indexes, and the comparison index for outpatient hospital services. Again, the indexes trend similarly, although the mixed-quantity index has the highest value at the end of the period. The mixed-quantity index is followed closely by the comparison index, the Törnqvist index, and the CPI. This pattern indicates that a 3-month lag may not be problematic.

Table 6b shows that the lagged mixed-quantity index has total and annualized changes that are among the closest to those of the comparison index. The Törnqvist index has the lowest SSE score in terms of index value, and the CPI has the lowest SSE score in terms of index relative.

Table 6b. Index comparison for different sources and methods, all-payer indexes for outpatient hospital services, medical claims data with a 3-month lag, April 2016–March 2020
Index by source and methodTotal change (percent)Annualized change (percent)SSE index valueSSE index relative (×104)

Comparison unchained Lowe

17.54.1[1][1]

Consumer Price Index

16.03.813722

Medical claims unchained Lowe (lagged)

13.93.348339

Medical claims mixed quantity (lagged)

18.84.416850

Medical claims chained Törnqvist (lagged)

16.33.812936

[1] SSE scores are calculated relative to the comparison index.

Note: SSE = sum of squared errors.

Source: U.S. Bureau of Labor Statistics.

Conclusion and next steps

The results of our analysis show that the use of medical claims data in price-index construction produces indexes that more accurately track price changes in medical care than the currently produced CPIs for physicians’ services and outpatient hospital services. However, incorporating medical claims data in the CPI presents both advantages and disadvantages.

In terms of advantages, medical claims data provide significantly more observations, capture more types of services, and, compared with traditionally collected field data, involve faster updating of procedure quantities. The amount of claims data is larger than that of traditionally collected data by an order of magnitude, providing more procedure codes and more data for each code. Finally, by providing the monthly quantities of each observation, claims data allow us to update weights much faster and to choose among several options in selecting price-index formulas.

The main drawbacks of using medical claims data include the data’s lag, the convenience sampling involved in their collection, and the delay in incorporating new procedure codes. However, we believe that the medical claims data we receive address CPI’s needs successfully.

On the basis of our analysis, BLS has drawn the following conclusions:

(1)   Even with a lag, the medical claims data can improve the CPI by supplementing traditionally collected data. Using medical claims data will reduce the amount of volatility in medical price indexes, address the problem of declining response rates, and produce consistent and accurate results.

(2)   Medical claims data provide more observations for the CPI for medical care services and its component indexes, increasing index accuracy. The indexes capturing medical claims data are generally closer to the comparison index in terms of both index values and index relatives.

(3)   The three formulas for calculating price indexes based on medical claims data produce similar results, and choosing one formula over another may not result in a substantially different index. For this reason, BLS should choose the index formula that most efficiently and accurately replicates CPI’s current formula. Although we were able to use medical claims data to calculate a superlative index with a formula that uses real-time weights, we believe that, for consistency in index-formula use, the production of medical price indexes should be based on current production methodology. Lower level substitution is not considered to be a major issue for medical goods and services. There is also a concern how a superlative index will reflect seasonal changes in the types of services performed.

BLS has decided to supplement traditionally collected data with medical claims data in the calculation of price indexes for physicians’ services and outpatient hospital services. The claims data will be incorporated into CPI calculations once BLS information technology systems are modified to accommodate the data. BLS will post an announcement on its website before implementation. BLS plans to continue research into the potential use of medical claims data in improving the inpatient hospital services component of the CPI market basket.

Appendix

Some consider the Törnqvist index to be the most accurate index for measuring price change. According to this view, an unlagged Törnqvist index would be the most accurate representation of price changes within the medical claims data. Because the unlagged Törnqvist index is similar to the lagged mixed-quantity index, the latter, which is our production index of choice, is a good approximation of price changes in the real market.

Chart A-1 shows the lagged mixed-quantity index and the unlagged Törnqvist index for physicians’ services. As seen in the chart, the two indexes trend similarly. There are some notable time lags whereby a rise in the unlagged Törnqvist index is followed by a similar rise in the lagged mixed-quantity index 3 months later. The SSE comparing the mixed-quantity index with the Törnqvist index was calculated for both index values and monthly index relatives. The SSE index value is 11.6 and the SSE index relative is 2.9.

Chart A-2 shows the lagged mixed-quantity index and the unlagged Törnqvist index for outpatient hospital services. Although the two indexes track closely with each other, their relative movements show some notable time-lag artifacts. For example, the Törnqvist index exhibits a rise in January 2019, and a similar rise does not appear until April 2019 for the mixed-quantity index. In this case, the SSE index value is 70.5 and the SSE index relative is 28.8.

Suggested citation:

John Bieler, Caleb Cho, JD Gayer, Brett Matsumoto, Brian T. Parker, and Daniel Wang, "Incorporating medical claims data in the Consumer Price Index," Monthly Labor Review, U.S. Bureau of Labor Statistics, February 2023, https://doi.org/10.21916/mlr.2023.3

Notes


1 Xue Song, William D. Marder, Robert Houchens, Jonathan E. Conklin, and Ralph Bradley, “Can a disease-based price index improve the estimation of the medical Consumer Price Index?” in W. Erwin Diewert, John S. Greenlees, and Charles R. Hulten, eds., Price index concepts and measurement (University of Chicago Press, 2009), pp. 329–372.

2 The term “payer type” refers to the person, organization, or entity that pays for the care administered by a healthcare provider.

3 John Bieler, Caleb Cho, Brett Matsumoto, Brian Parker, and Daniel Wang, “Using insurance claims data in the medical price indexes,” paper presented at the 2020 American Economic Association annual meeting, January 1–5, 2020, https://www.aeaweb.org/conference/2020/preliminary/paper/7i7sn2D3.

4 A longer lag is more noticeable at the geographic-area level. The U.S. Bureau of Labor Statistics (BLS) uses lagged data for other items such as rent.

5 For this project, we focused on physicians’ services and outpatient hospital services because they have the highest relative importance of all items in the medical price indexes and because they exhibit the lowest response rates. Prescription drugs also have a high relative importance but do not exhibit the same low response rates.

6 The BLS Consumer Price Index (CPI) program is still researching the use of medical claims data for inpatient hospital services.

7 See “Consumer Price Index,” Handbook of Methods (U.S. Bureau of Labor Statistics), https://www.bls.gov/opub/hom/cpi/pdf/cpi.pdf.

8 The data may not be representative if there is nonrandom nonresponse either in the household survey or by providers.

9 Before 2018, there were 87 geographic areas covered in the CPI. In 2018, a geographic revision of the CPI reduced the number of areas to 75 and changed the composition of the area sample. The data conform to the CPI geographic revision and cover the same 75 areas from 2018 onward. 

10 For a discussion of unit-value indexes and claims data, see Accounting for health and health care: approaches to measuring the sources and costs of their improvement (National Research Council, 2010).

11 CPT is a registered trademark of the American Medical Association.

12 Dental codes maintained by the American Dental Association are an exception.

13 Outliers are defined as observations falling above the threshold of +4 median absolute deviations or below the lower bound threshold of 30 percent of Medicare rates.

14 Only data for “patient care” were used. Other categories are “teaching” and “research” (see https://www.aamc.org/data-reports/workforce/interactive-data/active-physicians-largest-specialties-2017).

15 These methodologies involved the calculation of chained Paasche indexes; geometric means; Fisher, Laspeyres, and Lowe indexes; and chained and unchained quantity shares.

16 See, for example, Michael R. Baye, “Price dispersion and functional price indices,” Econometrica, vol. 53, no. 1, January 1985, pp. 217–224; and Marshall Reinsdorf, “The effect of price dispersion on cost of living indexes,” International Economic Review, vol. 35, no. 1, February 1994, pp. 137–149.

17 The CPI for physicians’ services also exhibits an initial rise in the case of all-payer and lagged medical claims indexes. This rise can be attributed, in part, to changes across several areas for both self-pay and private insurance prices.

18 During the brief rise in the series, missing insurance observations in the index for physicians’ services would impute off of all-payer types within the geographic area. It is possible that the CPI for private insurance is being driven by price movements for the self-pay and Medicare payer types.

article image
About the Author

John Bieler
cpia@bls.gov

John Bieler is an economist in the Office of Prices and Living Conditions, U.S. Bureau of Labor Statistics.

Caleb Cho

Caleb Cho is an economist in the Office of Prices and Living Conditions, U.S. Bureau of Labor Statistics.

JD Gayer

JD Gayer is an economist in the Office of Prices and Living Conditions, U.S. Bureau of Labor Statistics.

Brett Matsumoto

Brett Matsumoto is a research economist in the Office of Prices and Living Conditions, U.S. Bureau of Labor Statistics.

Brian T. Parker

Brian T. Parker is a supervisory economist in the Office of Prices and Living Conditions, U.S. Bureau of Labor Statistics.

Daniel Wang

Daniel Wang was formerly an economist in the Office of Prices and Living Conditions, U.S. Bureau of Labor Statistics.

close or Esc Key