An official website of the United States government
The U.S. Import and Export Price Indexes are modified Laspeyres indexes that aggregate import and export price data. The data are gathered mostly from sampled companies using trade dollar value weights produced by the Census Bureau. (For more information on data sources and the sampling process, see the data sources and design sections.)
For merchandise goods, the U.S. Import and Export Price Indexes are calculated using multiple classification systems for aggregation of data. The classification systems are the U.S. harmonized tariff schedule for imports and schedule B statistical classification for exports (harmonized), the North American Industrial Classification System (NAICS), and the end use classification system developed by the Bureau of Economic Analysis. For each of the classification groups, aggregation to the all import price index results in the same measure regardless of the classification structure. The same is true for exports. Only one top-level goods import and one top-level goods export price index are published and use the end use classification. For more information on how these classification systems relate to the U.S. Import and Export Price Indexes, see the concepts section.
For the harmonized system (HS), the NAICS indexes, and the end-use indexes, the most detailed calculated-index levels are the four-digit, the six-digit, and the five-digit levels of aggregation, respectively. Larger digits signify a greater level of detail.
The aggregation example in exhibit 3 shows an example of how an imported item (rivet), shown at the bottom of the exhibit, is mapped up through weight group and classification groups through the three classification systems. A similar mechanism occurs for exports. Note that not all calculated indexes and index levels are published.
Exhibit 3.
Index weights
The mapping structure and index weights are based upon a 2-year lagged classification structure and related fixed nominal dollar value of trade, such that the weights used to calculate price indexes starting with the January release use the most recent final annual foreign trade statistics collected by the Census Bureau.
Individual item weights are not available in the sample frame from which they are chosen. Instead, the weights are derived from the company weights based on trade weights within the 10-digit harmonized classification group from which the items are selected. For more detail on classification groups, see the design section. Company weights within a classification group are based on dollar value and the probability of selection. The company weight is distributed evenly among the sampled items in the classification group. Items may represent multiple items in the same sample in an effort to reduce company burden. So if similar items are sampled, only one may be collected where the item would retain the weight of each of the sampled items. In that case, the item weight within the sample would be multiplied by the number of items that the item represents. Items may also be sampled in multiple samples in which case the weights for each sample are summed. In effect, the item represents two or more items, one selected in each sample.
The individual item price changes are aggregated into progressively more generalized groupings—defined as weight groups, classification groups, and strata—until an overall price change is calculated for all imports and all exports. The weight group is a single company within a classification group, where the weight is derived by summing the company’s item weights. The weight groups are used to aggregate indexes to the classification group level. Classification groups are weighted by the 2-year lagged trade dollar values and are aggregated to the most detailed strata of each classification system for calculation and publication purposes.
For air transportation services, IPP calculates import and export air freight and air passenger price indexes using the balance of payments concept of payments and receipts between U.S. and foreign residents. In addition, inbound and outbound air freight price indexes are directional and measure the transportation of freight between the United States and foreign countries. At the lowest level of aggregation, items are mapped by locality using the U.S. Department of Transportation’s world area codes. At the most aggregated level, each air service measure is aggregated to an all-world price index for imports (exports) and inbound (outbound) separately. (For more information, see the concepts section.)
IPP calculates the U.S. Import and Export Price Indexes using a Lowe, or modified fixed-quantity Laspeyres formula, with trade dollar values or their estimates as weights, which are updated annually on a 2-year lagged basis. Laspeyres indexes measure price change from a base period to the current period using the base period weights. In a Lowe price index, the weight reference period may precede the price reference period. The Laspeyres price index is a special case of the Lowe index where the weight reference period must match the price reference period. The IPP second modification to the Laspeyres formula is the use of a chained index instead of a fixed-base index. With chaining, the current-month index is derived from the index value and price movement from the previous month rather than a base period. The index values from each month can then be chained back to the base period.
Below is the derivation of the modified fixed-quantity Laspeyres formula used to calculate all the U.S. Import and Export Price Indexes.
where
To maintain an uninterrupted time series, an imputed or estimated price is entered when there are missing price data. Observations could be missing due to erratic reporting on the part of respondents, late data, or strong seasonality in patterns of trade.
Exhibit 4 shows an example of how IPP deals with missing prices. The ellipses represent a value which cannot be computed due to the missing value for item 2 in period 1. The items are used to derive the index values or long-term relatives (LTR) for the next level of aggregation, the company weight group.
Items | Element | Period 0 | Period 1 | Period 2 |
---|---|---|---|---|
Item 1 |
Price | $10 | $10 | $10 |
STR | 1 | 1 | 1 | |
LTR | 100 | 100 | 100 | |
Item 2 |
Price | $20 | … | $30 |
STR | 1 | … | … | |
LTR | 100 | … | 150 | |
Item 3 |
Price | $5 | $10 | $5 |
STR | 1 | 2 | 0.5 | |
LTR | 100 | 200 | 100 | |
Note: STR stands for short-term relative. LTR stands for long-term relative. Source: U.S. Bureau of Labor Statistics, International Price Program. |
The missing price for item 2 in period 1, as indicated by the X in exhibit 2, would adversely affect the index because neither the period 1 nor period 2 short-term relative (STR) can be derived for item 2. In order to continue the series, values need to be imputed. There are 2 methods used to impute missing data if a respondent does not report a price.
The first method is cell mean imputation, generally used for items in industries where prices are relatively homogenous, such as raw materials or intermediate goods. Normally, the STR from an item’s weight group would be used. However, if none of the items in the weight group have usable price data, then the STR would come from the classification group, the most detailed level harmonized stratum the classification group is mapped to, or potentially a more aggregated level in the HS structure. The HS structure is used when imputing prices. The HS structure is a better match for price change than industry classification or end use because HS is based on product type.
When a suitable STR is found, it is used to impute a price for the item in the missing period. Consequently, if usable price data are received for a later period, the series can be continued by using the last imputed price as the denominator in computing an STR for the item.
Assuming all three items are evenly weighted in this example, the other items in the weight group rise by 50 percent. The imputed value of the missing price would be derived by taking the period 0 price of $20 and multiplying that by the average change for the weight group of 50 percent to get an imputed period 1 price of $30 as seen below in exhibit 5 (item 2, period 1).
Items | Element | Period 0 | Period 1 | Period 2 |
---|---|---|---|---|
Item 1 |
Price | $10 | $10 | $10 |
STR | 1 | 1 | 1 | |
LTR | 100 | 100 | 100 | |
Item 2 |
Price | $20 | $30 | $30 |
STR | 1 | 1.5 | 1 | |
LTR | 100 | 150 | 150 | |
Item 3 |
Price | $5 | $10 | $5 |
STR | 1 | 2 | 0.5 | |
LTR | 100 | 200 | 100 | |
Note: STR stands for short-term relative. LTR stands for long-term relative. Source: U.S. Bureau of Labor Statistics, International Price Program. |
The overall STR and LTR for the weight group can now be calculated for both period 1 and period 2.
The second type of imputation used for the Import and Export Price Indexes is called linear interpolation. The method uses the price for the item in periods before and after the period or periods where the price is not reported to linearly interpolate the middle value or values. This type of imputation is most commonly used for industries characterized by heterogeneous items that are based more on company price factors rather than industry factors. Finished goods would generally fall into this category of items. Applying linear interpolation to the example in exhibit 4, the value of the missing price for the item in period 1 would be the midpoint of the item prices in period 0 and period 2. That example is illustrated in exhibit 6.
Items | Element | Period 0 | Period 1 | Period 2 |
---|---|---|---|---|
Item 1 |
Price | $10 | $10 | $10 |
STR | 1 | 1 | 1 | |
LTR | 100 | 100 | 100 | |
Item 2 |
Price | $20 | $25 | $30 |
STR | 1 | 1.25 | 1.2 | |
LTR | 100 | 125 | 150 | |
Item 3 |
Price | $5 | $10 | $5 |
STR | 1 | 2 | 0.5 | |
LTR | 100 | 200 | 100 | |
Note: STR stands for short-term relative. LTR stands for long-term relative. Source: U.S. Bureau of Labor Statistics, International Price Program. |
Again, once the missing price for item 2 in period 1 is imputed, the STR and LTR for the weight group can be derived.
With linear interpolation, in period 1, before the period 2 price is known, the item would initially be imputed using the cell mean methodology and then would be revised using the linear interpolation method once the next actual price was reported. The linear interpolation method is preferred because it uses the price for the same item. However, because the next available price is not present in many cases, the fallback is to use the cell mean imputation methodology. The linear interpolation method is available because the import and export price indexes are revised in each of the three months after the indexes are first released. If no new price is received during the revision period, then the imputation is performed using the cell mean methodology.
The method for starting a price series in the Import and Export Price Indexes is known as initialization. This happens the month before the first price is received at initiation if that price is within the revision period when the item is initiated, and if not, the initialization happens the month before the first subsequent price is initiated. The technique is similar to the cell mean imputation technique used for missing prices. With initialization, the weight group LTR (if one is available) is used to start the series for an item. The regular cell mean imputation method is then used to fill in the missing STR.
Updates or modifications of a surveyed item sometimes affect the price or quality of the item. For example, a computer might be improved by adding more memory. The U.S. Import and Export Prices Indexes should reflect the pure price change in the index, not price changes because of changes in the quality of the item. If the effect of the modification on price can be reliably estimated, a quality-adjusted price is used. If the value of the modification cannot be reliably estimated, the item is no longer considered to be the same item, and a new item is substituted into the survey.
Industry analysts evaluate the magnitude and impact of the modification to each item in the survey when the description is changed by the respondent. The respondent may be consulted in making this determination. If the modification does not change the class, general function, or purpose of the item and the respondent can quantify the change in the item description or modification, then a ratio of the previous and current price is set for that period. The intent is to adjust the new price in terms of the previous price, excluding the quality change. Thus, the item’s original price and subsequent prices will consistently reflect the original unmodified item, and the price change that is recorded for the item in each period will be solely the change in the market price. If the modification does change the item’s class, general function, or purpose, or the value of the change cannot be quantified, then the item is deemed to be a new product. Then, item substitution is done by an industry analyst, which is discussed below, in the section on substitution procedures and practices.
For computers, hedonic modeling is used to estimate the value of the quality change when this information is not available from the company. The term “hedonics” applies to any regression analysis that breaks out the price of a product into separate price-determining characteristics. Many products can be modeled as a bundle of product characteristics, and a quality change represents a change in the quality of one or more of the characteristics. The cost of the quality change is estimated by deriving implicit prices for each characteristic, whenever the quality of any of the characteristics is altered.
The hedonic models are ordinary least squares (OLS) regressions that estimate implicit prices for various price-determining characteristics. The regression uses many prices of many computers with many characteristics, and the independent variable regression coefficients are the estimated price change for a one-unit change in each price-determining characteristic. When a price change occurs for a computer and is due to a change in the characteristics of the model, the coefficient of the characteristic in the hedonic model is multiplied by the change in the characteristic to calculate the quality adjustment. This is the value of the quality adjustment, or VQA.
The general functional form of the hedonic model is shown in the equation below.
where
the price of good i,
the constant term in the OLS regression,
characteristic j for good i,
the coefficient on characteristic j, and
a random error term that is assumed to have a normal distribution with a mean of 0 and a standard deviation equal to .
The constant term represents the base price. The coefficients are the implicit prices of the various characteristics. Price-determining characteristics may be modeled as continuous, discrete, or dummy variables.
Once the value of the quality adjustment is calculated, a link price is derived by taking the price of the new product in the current period and subtracting the value of the quality adjustment. Procedurally, the link price represents the value of the previous product in the current period (period 1). The period 1 STR is calculated as the percent change of the link price in period 1 to the price of the previous product in period 0. In period 2, the short-term relative is the change in the market price of the updated product from period 1 to period 2. Recall that only the price change is used in calculating price indexes. The different price levels between the previous product and the updated product are not used to calculate the price change. The advantage to this method is that the link price is being set in the period during which the quality adjustment takes place.
Item substitution is the replacement of a previously traded item with a new item from the same company and within the same commodity classification group. The previously priced item is discontinued because the item is no longer traded, or a change in item description is not quantifiable. In the month that the one-to-one replacement occurs, there is no short-term price ratio for either the discontinued or the new item. The new item will get a price ratio the next month once two prices are collected.
The U.S. Import and Export Price Indexes are reweighted annually due to changing patterns of trade. The weights used in the Lasperyes formula are the trade weights reported by the Census Bureau. These weights have a 2-year lag when annual trade data become available and applied to the indexes. For example, the weights for the indexes in 2019 are based on the import and export trade weights from the 2017 calendar year.
The best way to illustrate the need for annual reweighting is to give an example of the changing dollar values between weight years. In 2013, the petroleum area consisted of 12.6 percent of the total import trade dollar value in the import price indexes. By 2014, petroleum fell to 6.9 percent of the total import trade dollar value for the import price indexes, exhibiting a relatively sizable drop in value. If the indexes had not been reweighted on an annual basis, petroleum would have been overrepresented in the overall import price indexes.
To represent the least-biased weight of the Import and Export Price Indexes based on the trade value, the indexes need to be rebased to 100 for the weight year. The reason is that the relative weights or importance over time is based both on the fixed trade weights and the relative movement of the indexes as prices change. Indexes that fall at a relatively faster pace than other indexes have a smaller index value and will have less relative importance in the indexes.
Rebasing the published indexes on an annual basis rather than holding the base period constant makes the indexes less user friendly for those who track the index value from month to month. Indexes continuously published since 2000 have a fixed base period set equal to 2000 = 100. For indexes published subsequent to 2000, the index is based at 100 in the initial month of publication. In order to adjust the base year for usability, indexes are first rebased to the weight year. They are then calculated to update the index values for the current month and three revision periods. Next, the indexes are subsequently rebased back to 2000 = 100 or to the first publication month = 100 for indexes published after 2000.
None of the import and export price indexes are seasonally adjusted. International trade happens worldwide with most items available regardless of the time of year. There is insufficient evidence for seasonality in import and export prices, although there is potentially seasonality in quantities which annual weights do not account for on a monthly basis. An effective method for eliminating seasonality is to look at the rolling 12-month percent changes rather than monthly percentage changes.
Sampling variability results whenever a sample is used rather than the complete universe. Currently, standard errors are published on an annual basis for the all-import and all-export price indexes. (See Variance Estimates for Price Changes in the Import and Export Price Indexes January–December 2018). For each month in the previous calendar year, the standard error is available for the 1-month, 3-month, and 12-month percent changes. The standard error, the square root of the estimated variance, is a common measure used to derive confidence intervals for percent changes in the U.S. Import and Export Price Indexes. Confidence intervals can be used to determine if an index change is significantly different than zero.
There are different types of errors that are introduced when calculating the U.S. Import and Export Price Indexes. One way to look at measurements of error is the difference between sampling error and nonsampling error. Sampling error is the error resulting from drawing a sample of imported and exported items to and from the United States, rather than using the entire universe of trade.
Nonsampling error can take a number of different forms. One form is misspecification error, which takes place if the universe of data from which the sample is being drawn does not correctly measure the actual population. This type of error could result if the Census Bureau makes mistakes in measuring the trade dollar value statistics that BLS uses to construct their U.S. Import and Export Price Indexes. A second type of nonsampling error is nonresponse error. Each month, a subset of the items sampled do not have prices reported. This type of error results if the respondents and nonrespondents do not represent a similar cross section of the total universe. Another form of nonsampling error can be introduced from misreported prices.
One issue when deriving an estimate from a sample is the potential trade-off between variance and bias. Variance is a measure of how much the estimates derived from numerous samples differ from the true value of the estimate. Bias results when the expected value of the estimate is either higher or lower than the true value of what is being estimated. An estimate could have a high sampling variance and still be unbiased if the expected value of the estimate is equal to the true value. Likewise, an estimate may have small variation over numerous samples, yet be biased if the expected value of the estimate deviates from the true value.
BLS strives to minimize both sampling and nonsampling error as much as possible. Sampling error is reduced by maintaining as many prices as possible to support an index, given resource and company burden constraints. Nonsampling error is reduced by subjecting the data to careful review using automated checks and a staff of professional economists, as well as by employing methods to estimate missing observations.
To derive the standard error, a modified bootstrap method, applying rescaled sampling weights, is used to produce 150 replicate index set estimates from 150 simulated item set samples. Item set replicates are constructed according to IPP’s 3-stage sample design. At both of the first two stages of sampling, it is possible for a selection to be either a certainty selection (i.e., the probability of selection is greater than the iteratively calculated sampling interval) or a probability selection. The replicate resampling method takes this into consideration by first partitioning the selected items within each sampling stratum m into those items that resulted from certainty establishment selections and those items resulting from probability establishment selections. The item set resulting from establishment certainty selections is further partitioned into two item sets: sampling classification group certainty selections and sampling classification group probability selections.
Thus, the set of all sampled items S is the union of these three partitions over all sampling stratum m
where
N = the number of sampling strata, ,
p = 1 for items selected from probability establishments,
p = 2 for items selected from probability sampling classification groups within certainty establishments, and
p = 3 for items selected from certainty sampling classification groups within certainty establishments.
Each bootstrap sampling, b, selects units within each partition of each sampling stratum as follows:
,
where
the number of units originally sampled in partition p of sampling stratum m.
Bootstrap item weights are then calculated as
where
the bth replicate item weight for item i, within establishment j and sampling stratum partition ,
the standard item weight for item i, within establishment j and sampling stratum partition , and
the number of times establishment j within partition p of sampling stratum m, is selected in bootstrap sample b
In the rare instances that, a simple random sample of items within that establishment is selected. If only one item exists under this establishment singleton, that item is chosen with certainty.
For each of the 150 bootstrap samples, chained indexes of the desired length are calculated at all levels of aggregation using these modified item weights, original probabilities of selection, trade dollar values, and collected price data. For variance estimates, the variance is calculated across replicate percent change values for all published indexes as
where
is the full sample estimate.