Handbook of Methods > International Price Program

Handbook of Methods International Price Program Design

International Price Program: Design

The purpose of the U.S. Import and Export Price Indexes survey and sample design is to provide a representative and unbiased measure of price change in each published index. For the majority of heterogeneous product categories and a limited number of homogeneous product categories, the Bureau of Labor Statistics (BLS) conducts the U.S. Import and Export Price Indexes survey to collect prices of individual items that represent detailed product categories. For most homogeneous product categories, price data come from alternative data sources. (See data sources.) Merchandise goods product categories and services product categories are sampled separately.

The sample frame of merchandise goods

The universe of all merchandise goods trade is available from transaction records collected by the U.S. government from importers and exporters for regulatory purposes. This information is shared securely with official statistical agencies to calculate economic measures. Importers are required to file an entry summary for shipments with the U.S. Customs and Border Protection (CBP) using the Automated Commercial Environment (ACE), which is an electronic data collection system. Exporters report merchandise that is exported to countries other than Canada to the Census Bureau using the Automated Export System, the export component of ACE. The Canadian Border Services Agency collects information on U.S. export shipments to Canada. Import and export transactions are provided to the Census Bureau on a monthly basis, so the Census Bureau can calculate and publish import and export monthly trade statistics. The trade records provide information on the company, the dollar value and volume of transactions, and the 10-digit classification of the goods being traded based on the Harmonized System (HS). BLS receives the import and export records from the Census Bureau to serve as the import and export sampling frames.

For the entirety of merchandise goods trade, the trade that is representative of the sampled target population of heterogeneous products forms the sample frame for imports and exports separately.

To ensure the most up-to-date sets of sampled items, BLS draws an import sample and an export sample separately, and on a biannual basis, with an import sample being selected in a given year and an export sample being selected the following year, for full coverage of the identified target population over 2 years.

Each sample frame comprises all records for import or export trade data for a 12-month calendar year period. At any given time, two to three samples are contributing to the market basket of items priced to measure import and export prices. To mitigate loss of coverage of representative items in published indexes, the 2-year rotation of each sample may not be adhered to in the case of limited staffing resources. The scheduling of the samples is intended to provide some overlap so that the more recent sample reaches peak initiation at about the same time the earlier sample begins to taper off.

Establishments

For all imports and for exports to countries other than Canada, an establishment is defined by the establishment identifier number (EIN), an 11-digit number assigned by the Internal Revenue Service and recorded on the transactions for those shipments. In some instances, multiple establishments may be located at the same address. For Canada, export establishments are defined by the combination of the name and zip code of the exporter because data received from the Canadian Border Services Agency do not include U.S. EINs. A company may consist of one or many establishments.

Stratification

A sampling stratum is typically defined at the four-digit level of product detail of the HS category, and this is the lowest level of publication of price indexes for the HS product classification system. Stratifying by the detailed commodity level proportionally accounts for the dollar value of trade and allows the survey to more accurately represent important subsections of trade than if a sample were selected from a nonstratified frame. The samples are selected independently within each detailed commodity stratum, which contributes to reduction of the sampling error of the index estimates derived from the prices collected from the sample.

There is a trade dollar value below which a four-digit HS product area is not publishable because the number of quotes, which is set in the budget process, would be insufficient to support the representativeness of the price index. The threshold is calculated based on the total number of budgeted field initiation visits, the average number of quotes per establishment that is sent to the field, the total dollar value of the frame being sampled, and the minimum number of quotes to be priced per stratum to maintain publishability. The total number of budgeted field initiation visits multiplied by the average number of quotes per establishment determines the expected number of quotes to be fielded. The total dollar value of the frame is divided by the expected number of quotes to be fielded, and this result is multiplied by the minimum number of quotes to be priced per stratum to maintain publishability to determine the threshold.

The 10-digit HS product categories are the building blocks to sample and to calculate price indexes for different product and industry classifications. Using the 10-digit categories provides consistency across time and for North American Industry Classification System (NAICS) and Bureau of Economic Analysis (BEA) end-use concordances. These product categories are called classification groups for sampling and calculation purposes. In most cases, classification groups are the 10-digit HS level of aggregation.

The sample process

The sample selection process is carried out in three stages to efficiently identify large traders and ensure that as many price indexes can be published at as great a detail as possible by identifying specific import and export items to price over time. The following subsections discuss the three stages.

Selecting the establishments

The first stage of sampling begins with setting the sample frame. The sample frame comprises 12 months of import or export transactions for all sampled strata using their respective frame sources. The first stage of the sampling process relies on three constraints:

an upper bound of item burden for an establishment within a sampling stratum,
an upper bound of establishments to be fielded for the sample based on the total number of budgeted field initiation visits, and
the expected number of items per establishment to be collected.

Using the aforementioned constraints, BLS begins allocating the sample. First, BLS establishes some upper and lower boundaries. BLS establishes an upper bound of the total number of items to be collected for the sample working from the total staff capacity based on the average number of hours spent collecting an average sample unit. This total can be found by multiplying the expected number of items per establishment by the total number of establishments in the sampling frame. The total number of establishments in the sampling frame is adjusted to meet the staff capacity to collect the data. Next, BLS allocates the total number of items across all sampled strata based on each stratum’s dollar value of trade. The lower boundaries ensure that there is sufficient item coverage at the classification group-level for the three classification systems.

BLS distributes the number of establishments by strata based on the distribution of allocated items and subject to the upper bound of item burden for an establishment. Readjustment of item and establishment allocation occurs throughout the process to meet the minimum number of items and establishments required to satisfy publication standards that protect confidentiality and ensure index quality.

After the allocation process is completed, establishments are selected independently within sampling strata, which represent product areas at the most detailed level of publishable Harmonized System indexes. Because the size of trade for establishments varies greatly, selection is based on probability proportionate to size (PPS), where size is defined as dollar value traded. The approach reduces sampling error compared with equal probabilities of selection. Some establishments are selected with 100 percent probability; these are the large, frequent traders in a stratum. The rest of the establishments have probabilities of selection that are proportional to the value of their trade in the stratum. These comparatively smaller traders are designated as uncertainties. An establishment can be selected in more than one stratum and with different probabilities of selection; that is, each stratum is independently sampled.

Some sampling strata have low dollar values of trade, and the corresponding price index may not be publishable because the stratum does not meet the dollar threshold for publication. Even so, sampled establishments in these strata are weighted accordingly and aggregated to a higher level to ensure representativeness of overall trade for all-goods import and export price indexes.

Upon selection of the first-stage sample, the sampled establishments are refined by survey economists. The economists use historical survey data and other reference materials to validate the establishment’s name and address. In addition, economists determine if an establishment should be included in the survey based on one of two criteria: the company historically provides price data or is new to the survey. During the first-stage sampling process, multiple establishments with the same company name and address are combined into one entity representing the unique name and address for data collection, known as the collection unit. Also, establishments that are part of the same company but located at different addresses are combined into a collection unit if survey history or reference-material research suggest data collection should be at one centralized location for the entire company.

Selecting classification groups

The second stage of sampling is carried out in three steps: 1) capping burdens for collection units when necessary; 2) allocating items to selected establishments; and 3) selecting classification groups per establishment. The sampling unit is the classification group within the establishment. The first stage of sampling is determining which establishments BLS will attempt to request data from. The second stage is to determine what groupings of items by establishment BLS will ask about.

A cap is put on respondent burden because BLS seeks to minimize the burden on data providers while maximizing the number of prices collected. Setting a cap is primarily done to control an establishment’s item burden for those situations in which the establishment was sampled in multiple strata, in addition to those cases in which multiple establishments were combined into one collection unit during refinement. If either or both of these instances occur and result in the collection unit burden exceeding the upper limit, the item burden within each stratum of each establishment constituting the collection group is proportionately reduced. This prevents the overall burden for the collection unit from exceeding the upper limit.

After capping collection-unit burdens, second-stage sample allocation initially distributes the item burden assigned to a sampled establishment within a stratum to the corresponding classification groups by proportional dollar value. The classification group item allocations then undergo a series of readjustments across and within all sampled establishments that trade in these classification groups through an iterative process known as raking. The raking process involves redistributing items across classification groups to meet minimum item requirements. The process also ensures each classification group is allocated enough items across establishments to support publishability across the three primary classification-system strata (BEA end-use, HS, and NAICS) to which they contribute. At the same time, this maintains the item burden that was assigned to each of the sampled establishments in the stratum.

In the second stage of sampling, classification groups are selected with replacement within the sampled establishments. The process uses PPS methodology in which the measure of size is the expected number of items to be selected within each classification group for the establishment. The PPS method may result in a given establishment’s classification group being assigned a number of items, as items are selected multiple times based upon the relative proportion of the classification group’s value within the establishment.

Both raking and selection with replacement are performed to control respondent burden while maximizing the quality of price indexes by ensuring sufficient item coverage for the publication of indexes in the three classification systems.

Further sample refinement occurs when survey economists conduct a final review of the sampled classification groups and counts of items per classification group within the collection units, making any adjustments necessary. The collection units are then assigned to the regions where the data collection occurs and the field economist contacts the company to carry out the initial interview and conduct the price survey.

Third stage of sampling

The third stage of sample selection occurs at the first interview with the company respondent. During that interview, the respondent provides unique items within each sampled classification group. The respondent indicates a dollar share, weight, or importance to establish how representative an item is of a company’s trade. The respondent-identified weights are then used to draw a weighted random selection of items. In cases where the company is unable to rank importance, items are selected randomly.

Once the items are established, the same unique items are subsequently priced over time by the respondent. Sampled items are priced for approximately 5 years until the items are replaced by a fresh sample. Generally, each index spans prices collected from two to three samples. See the calculation section for specifics on how indexes are calculated.

For more information on the methods used to collect prices for sampled items, see data sources.

Services

U.S. Import and Export Price Indexes are calculated for air passenger travel and air freight transportation.

Sampling does not occur for the air passenger fares indexes. The price data from a commercial data source provide representative coverage of U.S. and foreign carriers and country market fares. The air passenger fares universe consists of fares to and from the United States and U.S. territories, and to and from foreign countries serviced by U.S. and foreign airlines. In addition, the Export Air Passenger Fares Index includes foreign-to-foreign fares when foreign residents are flying on U.S. airline carriers.

The universe for the air freight price indexes is aggregated transaction data published by the Bureau of Transportation Statistics, U.S. Department of Transportation (DOT). Freight is tendered by the company that needs items shipped to an airline for transportation, excluding mail and personal baggage; the service measured includes shipment from airport to airport only, omitting any ground transport or port service costs, which are classified as different types of services. Data cover both U.S. and foreign air carriers, with at least one point of service in the United States or one of its territories. The records are reported in the DOT T-100 International Market file and include route-specific information: origin and destination airports, air carrier names and nationalities, and the amount of cargo transported.

Air freight services are fully resampled approximately every 5 years. For the air freight price indexes sample, BLS uses probability sampling methods to select a two-stage sample of company-routes from the T-100 file that is representative of international air freight transportation. A company route is composed of an air carrier transporting freight internationally between a specific origin and destination. The sampling frame is stratified by world region, and in the first stage of sampling, companies are selected within those regions. Routes serviced by the sampled companies are selected in the second stage of sampling.

Last Modified Date: March 26, 2025