Skip to main content

United States of America

Expanding the family of U.S. CPIs, Thesia Garner (U.S. Bureau of Labor Statistics)

In recent years, there has been increased interest in going beyond headline measures of inflation to better describe the experiences of households. The CPI for All Urban Consumers (CPI-U) targets the inflation experience of over 90 percent of households in the United States, but it may not reflect the inflation experience of an individual household or group of households. This presentation describes two ongoing research efforts at the Bureau of Labor Statistics to expand its offerings of consumer price indexes in ways that allow for a richer description of household experiences.

Languages and translations
English

UNITED NATIONS

ECONOMIC COMMISSION FOR EUROPE

CONFERENCE OF EUROPEAN STATISTICIANS

Group of Experts on Measuring Poverty and Inequality

28-29 November 2023

Workshop on Harmonization of Poverty Statistics to Measure

SDG 1 and 10

27 November 2023

Title of contribution Expanding the family of U.S. CPIs

Author Name(s) William JOHNSON, Joshua KLICK, Paul LIEGEY, Robert MARTIN, Anya

STOCKBURGER,

Presenter Name Thesia GARNER

Presenter Organization U.S. Bureau of Labor Statistics

Topic Inflation and its impact on poverty and inequality

Summary:

In recent years, there has been increased interest in going beyond headline measures of inflation to

better describe the experiences of households. The CPI for All Urban Consumers (CPI-U) targets the

inflation experience of over 90 percent of households in the United States, but it may not reflect the

inflation experience of an individual household or group of households. This presentation describes

two ongoing research efforts at the Bureau of Labor Statistics to expand its offerings of consumer price

indexes in ways that allow for a richer description of household experiences. First, in response to

increasing user demand, we construct consumer price indexes for different groups along the income

distribution. From 2006 to 2023, lower income households generally faced larger inflation rates than

higher income households, and the gap is highest when measured using the Chained CPI, which is a

closer approximation to a cost-of-living index. We explore how different budget items contribute to

this gap, as well as how it changes over time. Second, we estimate a family of price indexes known as

Household Cost Indexes (HCI), which aim to measure the average inflation experiences of households

as they purchase consumer goods and services. These differ from the usual CPIs in two main respects.

First, the upper-level aggregation of the HCIs weights households equally, unlike most headline CPIs

which implicitly give more weight to higher-expenditure households. Second, the HCIs use the

payments approach to value owner-occupied housing services explicitly using household outlays. In

contrast, the U.S. CPIs use rental equivalence. The HCI for all urban consumers has an average 12-

month change of 1.51% over December 2011 to December 2021, compared to 1.86% for the CPI-U.

Roughly 95% of the difference is due to the payments approach.

Please select your preferred contribution (you may select both options):

☒ Presentation

☐ Paper (to be submitted by 20 October)

Household Cost Indexes: Prototype Methods and Results, US

We estimate a family of price indexes known as Household Cost Indexes (HCI) using U.S. data. HCIs aim to measure the average inflation experiences of households as they purchase goods and services for consumption, and similar indexes are produced in the United Kingdom and New Zealand. These differ from the Bureau of Labor Statistics’ headline Consumer Price Index (CPI) products in two main respects. First, the upper-level aggregation of the HCIs weights households equally, unlike most headline CPIs which implicitly give more weight to higherexpenditure households.

Languages and translations
English

1

Household Cost Indexes: Prototype Methods and

Results1

Robert S. Martin, Joshua Klick, William Johnson, Paul Liegey2

June 1, 2023

CONFERENCE PAPER/PRELIMINARY

Abstract

We estimate a family of price indexes known as Household Cost Indexes (HCI) using U.S.

data. HCIs aim to measure the average inflation experiences of households as they purchase

goods and services for consumption, and similar indexes are produced in the United Kingdom

and New Zealand. These differ from the Bureau of Labor Statistics’ headline Consumer Price

Index (CPI) products in two main respects. First, the upper-level aggregation of the HCIs weights

households equally, unlike most headline CPIs which implicitly give more weight to higher-

expenditure households. Second, the HCIs use the payments approach to value owner-occupied

housing services explicitly using household outlays. In contrast, the U.S. CPIs use rental

equivalence. The HCI for all urban consumers has an average 12-month change of 1.51% over

December 2011 to December 2021, compared to 1.86% for the CPI-U. The bulk of the

difference is due to the payments approach.

Key Words: Price index; inflation; democratic aggregation; payments approach

JEL Codes: C43, E31

1 We thank Anya Stockburger, Robert Cage, Thesia I. Garner, and many others at the Bureau of Labor Statistics for helpful comments and guidance. 2 Division of Price and Index Number Research (Martin), Division of Consumer Price Indexes (Klick, Liegey), Division of Price Statistical Methods (Johnson), Bureau of Labor Statistics, 2 Massachusetts Ave., NE, Washington, DC 20212, USA. Emails: [email protected], [email protected], [email protected], [email protected]

2

1. Introduction

This article estimates Household Cost Indexes (HCIs) using U.S. data. Similar price

indexes are already produced in the United Kingdom (Office for National Statistics, 2017) and

New Zealand (Statistics New Zealand, 2020). HCIs measure the change in cash outflows

required, on average, for households to access the goods and services they purchase at a

constant quality. Like the headline and subpopulation Consumer Price Indexes (CPIs) produced

by the Bureau of Labor Statistics (BLS), the HCIs aim to capture price change for consumer

goods and services. However, the HCIs differ in two important methodological respects from

the CPIs. First, the upper-level aggregation of the HCIs weights households equally, whereas the

CPI market baskets implicitly give higher weight to higher-expenditure households.3 Second,

the HCIs use the payments approach to value services from owner-occupied housing, using

outlays on mortgage interest, property taxes, and the full reported value of insurance,

appliances, maintenance and repairs (i.e., what the household pays and when they pay it). The

CPIs, in contrast, use an implicit measure of owner-occupied housing consumption called rental

equivalence, and all other goods and services are valued using acquisition prices and

expenditures (i.e., when the household acquired or took possession of the good). For HCIs in

principle, the payments approach should be applied more broadly, but this paper focuses only

on owner-occupied housing. We are ignoring household outlays for the purchase of vehicles

and other durable goods and instead are including the full acquisition expenditures for these

regardless of financing; including these in an HCI is left for a future study.

3 Households are still weighted by their sampling weight so that averages represent the population.

3

We compute an HCI for the urban U.S. population covering the period December 2011

to December 2021. The HCI is based on the Lowe (modified Laspeyres) formula using average

annual household weights with about a two-year lag. From December 2012 to December 2021,

we find an average twelve-month inflation rate of 1.51 percent for the HCI-U, compared to 1.86

for the CPI-U and 1.73 for the Chained CPI-U. We find that empirical differences between the

HCIs and CPIs are primarily due to the HCI’s use of the payments approach, which we estimate

subtracts 0.39 percentage points per year on average relative to an index that uses rental

equivalence. This difference reflects both a lower weight for owner-occupied housing in the HCI

as well as lower inflation in explicit housing costs when compared to owner’s equivalent rent. In

contrast, we estimate that equal household weighting increases the index only about 0.05

percentage points per year on average compared to an index which uses the standard

expenditure weighting, but otherwise uses the same methodology as the HCI.

CPIs are used in a wide variety of economic applications—as an overall macroeconomic

indicator, to deflate national accounts, to adjust marginal tax rates, and measure changes in the

cost-of-living representative of the entire economy. In such applications, measuring the change

in purchasing power of the average dollar of expenditure using an implicit consumption

concept like owner equivalent rent may be appropriate. In other cases, such comparing the

economic conditions of population subgroups, a measure tied to explicit outlays may be

attractive. One index cannot usually satisfy all needs, and in this sense the HCIs can provide

useful complimentary information about the average household inflation experience.

4

2. Literature Review

Current BLS CPI methodology is based on market-level expenditure weights and the

rental equivalence approach to owner-occupied housing (Bureau of Labor Statistics, 2020).

Household-weighted aggregation and the payments approach differ substantially from current

BLS CPI methodology, though neither is new to the price index literature. Astin and Leyland

(2015) propose using these methods to better capture the inflation experiences of households.

They argue such a measurement is more credible for indexing monetary values, while a

traditional CPI is superior for macroeconomic analysis and inflation targeting. Based in part on

their research, the Office of National Statistics developed a set of HCIs for the United Kingdom

(Office for National Statistics, 2017). Statistics New Zealand publishes a similar set of indexes

called the Household Living-Costs Price Indexes. Research on a similar set of indexes for the U.S.

began with Cage, et. al. (2018).

Household-weighted aggregation (also known as democratic aggregation) has been

considered at least since Prais (1958). The topic has been developed and reviewed in Pollak

(1989), National Research Council (2002), International Labor Organization (2004, Chapter 18),

Ley (2005), and Martin (2022), among others. Spending patterns differ across the distribution of

total expenditure. To the extent that these differences coincide with expenditure categories

that have higher or lower inflation than average, a household-weighted index will differ from a

traditional expenditure-weighted one. Equally weighted indexes have been studied with U.S.

data in Kokoski (2000) and Hobijn, et. al. (2009). The latter is notable for statistically matching

the interview and diary components of the Consumer Expenditure Survey (CE), and we follow

5

many aspects of its approach. Our paper also builds on work from Cage, et. al. (2018) and

Martin (2022), the latter of which finds that household-weighted aggregation adds about 0.08

percentage points per year to inflation measured by a Lowe-type CPI from December 2001 to

June 2021.

The payments approach to owner-occupied housing has been discussed at least since

the 1989 version of the International Labor Organization (ILO) CPI manual (as cited by

Goodhart, 2001), and much of our initial approach follows the 2004 version (International Labor

Organization, 2004, Chapter 10). The payments approach to owner-occupied housing focuses

on the month-to-month outlays by households rather than an upfront purchase price (the

acquisition approach) or the implicit consumption value (the use approach).4 In addition to the

HCIs for the United Kingdom and New Zealand, the payments approach is also used in the CPI

for Ireland (Central Statistics Office, 2016). Mortgage interest is also included in the housing

component of the CPI for Canada (Statistics Canada, 2019), and was a part of the U.S. CPI

housing component prior to 1983 (Gillingham and Lane, 1982). Diewert and Nakamura (2009)

contains a conceptual comparison of the payments approach against other methods like the

user cost approach and rental equivalence, while Garner and Verbrugge (2009) compare

methods empirically using the CE.

Astin and Leyland (2015) argue that the payments approach is superior for comparing

household inflation experiences and escalating payments. They make the case that because

rental equivalence is not tied to explicit outlays, an index which includes it as a large

4 Rental equivalence and user cost are both flavors of the use approach.

6

component may be less tethered to the actual price movements that affect household budgets.

For some subpopulations, there can be large differences between implicit rents and explicit

cash flows. For instance, in Cage et. al. (2018), the subpopulation of households which receives

at least 50% of its before-tax income from Social Security has higher relative expenditures on

shelter (35-39%) when measured using rental equivalence than the overall urban population

(32%), but lower relative expenditures when measured using payments (16-23%). This is

because these households are disproportionately likely to be owner-occupiers without

mortgages, meaning their explicit housing outlays are limited to items like property taxes,

insurance, and maintenance.

Astin and Leyland (2015), as well as ILO (2003) advocate such an index for escalation

purposes, but this position is not universally held. Diewert and Shimzu (2021) argue “it is not an

index that can measure household consumption of the services of durable goods because it

focuses on the immediate costs associated with the purchase of durable goods and ignores

possible future benefits of these purchases.” The payments approach has also been criticized in

Goodhart (2001), Poole, Ptacek, and Verbrugge (2005), and elsewhere on the basis that it

doesn’t reflect consumption in an economic sense. We agree that a flow-of-service method like

rental equivalence is more appropriate for a macro-focused CPI or a representative consumer’s

cost-of-living index (See, e.g., Diewert 1976). However, we study the HCIs as complementary

series intended to capture explicit outlays of households rather than the implicit consumption

prices (in an economic theoretic sense) reflected in a traditional CPI, though initially the

distinction is limited to owner-occupied housing. The objective of our paper is primarily to

compare owner-occupied housing and household aggregation methods.

7

3. Methods and Data

Our methods for this paper are preliminary and based on utilizing existing BLS surveys or

publicly available data sources. Like the CPIs, the HCIs are constructed in two stages. First, basic

indexes are constructed for item-area strata (e.g., coffee in Washington, DC). These are then

aggregated using expenditure weights from the CE. As our initial version only applies the

payments approach to owner-occupied housing, the elementary indexes and underlying

household expenditures used in upper-level aggregation are largely the same. See Bureau of

Labor Statistics (2020) for more details. For housing, the owner equivalent rent elementary

indexes are replaced with indexes for property taxes, mortgage interest, and property

management services. In addition, we use the full reported value of household expenditures on

household appliances, maintenance and repair, and insurance when constructing upper-level

aggregation weights. Finally, we estimate equally weighted averages of household expenditure

shares based on matched CE Interview and Diary data and use these in the second-stage

aggregation.

3.A. Payments Approach Item Structure and Elementary Indexes

The payments approach for owner occupied housing reflects the housing-related cash

outflows of households. Compared to the CPI, the HCI item structure excludes owner’s

equivalent rent and includes three additional expenditure classes—property taxes, mortgage

interest, and other primary residence expenses. The payments approach also removes several

adjustments CPI makes to other category weights, which we discuss more later in this section.

Within property taxes and mortgage interest, we create new elementary item indexes

8

representing primary residences. These also serve as proxies for secondary residences. In the

CPI, the price index for owner’s equivalent rent of primary residences (numbered “01”) also

serves as the proxy for the unpriced item (numbered “09”) representing secondary residences.

A further item classification (see

9

Table 1 for details) for other primary residence expenses consists of ground rent,

parking, and property management services. This category comprises less than one half of one

percent of the overall index weight, and we provisionally measure its price change using the

producer price index for final demand property management services as a proxy. Finally, our

objective, where possible, is to limit expenditures to those pertaining to primary residences and

vacation homes and exclude investment properties.

The rest of this section details the construction of the property tax and mortgage

interest payment indexes. We follow what is (to our knowledge) international practice by

including the interest component of mortgage payments (excluding second mortgages or home

equity lines of credit) and excluding the portion that goes toward principal reduction (and by

this reasoning down payments and cash purchases). From the 2004 ILO manual, only the

interest portion is considered a pure cash outflow; the principal portion immediately shows up

on the household’s balance sheet as an increase in assets, so it may be considered more like an

investment with a potential future return (International Labor Organization 2004, Chapter 10).

This view is not universal (see Astin and Leyland, 2015). However, including mortgage principal

presents additional technical challenges.5

Also following international practice, the mortgage interest and property tax payments

indexes derive conceptually from two sources of potential change: a rate (an interest rate or an

effective property tax rate) and the base to which the rate is applied (the debt level or the

5 The most straightforward method to estimate the proportional impact of changing interest rates on mortgage principal payments would involve plugging in aggregate (i.e., average) interest rates into a nonlinear function. In the sense of measuring a change in average payments across households, the potential bias of such a plug-in procedure from Jensen’s Inequality is unknown.

10

dwelling value). Changes in rates alone do not capture changes in purchasing power

(International Labor Organization 2004, Chapter 10). Some users could be concerned about

allowing the effects of home prices given these could be associated with (eventual) financial

returns to households. In our view, there is a tradeoff between representing the explicit outlays

of households and controlling for investment using economic theory. Indeed, as noted by

Poole, Ptacek, and Verbrugge (2005), adjusting housing payments to account for investment

results in the user cost approach, which is another implicit housing cost concept. Empirically,

Garner and Verbrugge (2009) show that user costs can differ greatly from explicit payments.6

Our initial strategy, following international practice, aims to exclude the investment aspect of

housing ownership by excluding mortgage principal. Appendix A shows the decision to

indirectly include home prices is significantly inflationary for the housing payments indexes and

suggests the decision to exclude mortgage principal is somewhat deflationary.

Finally, our preliminary results compute a single set of payments approach elementary

item indexes representing the U.S. urban population. We leave it to future research to extend

these methods to create elementary indexes by CPI geographic areas.

3.A.1. Mortgage Interest Payment Index

The mortgage interest payments index measures the proportional change in the interest

payment amount that would occur holding fixed the financing conditions—such as the loan

term and proportion of principal remaining. We aim to follow the recommendations in the

2004 ILO manual (Chapter 10), which is to use both a representative basket of interest rates

6 Garner and Verbrugge (2009) also find that user cost measures based on different underlying assumptions can differ greatly from each other and from implicit rents.

11

and a debt index, which holds “constant the age of the debt” between index periods

(International Labor Organization 2004, Chapter 10). Payments in each period are determined

by transactions occurring at many previous points in time, as mortgage loans are long-term

contracts. Consequently, our index is based on weighted averages of interest rates and house

prices corresponding to loans or debt of different ages. A fixed-basket approach has the

advantage of being feasible with aggregate interest rate and house price data, but the

disadvantage of not being micro-founded.7

Similar to Canada (Statistics Canada, 2019), we define the index as the product of a debt

index (which is influenced by home prices) and an interest rate index which compare payments

in the comparison period 𝑡 against the reference period 𝑠.8 The index is based on the model of

a thirty-year fixed rate mortgage, which dominates the U.S. market (about 75% of existing loans

as reported in the CE).9 It is written:

𝑃𝑀𝐼𝑃 = 𝑃𝐷𝑃𝑟 , (1)

where 𝑃𝐷 is the debt index and 𝑃𝑟 is the interest rate index. They are written

𝑃𝐷 = ∏ 𝐻

𝑡−𝑗

𝜓𝑏𝑗�̅� 𝑗=0

∏ 𝐻 𝑠−𝑗

𝜓𝑏𝑗�̅� 𝑗=0

(2)

and

7 We considered such a micro-founded approach which could, for example, average proportional changes in rates actually paid by households between the reference and comparison periods without fixing the loan age. Such an approach may be more appropriate for the U.S. market, which is dominated by 30-year fixed rate mortgages. However, basing such an approach on CE interest rate microdata misses any variation which occurs when a consumer unit moves from one house to another since consumer units are not followed. 8 While our debt index is similar to the housing component of Canada’s mortgage interest index, their interest rate component is based on unit value-like averages using administrative banking data. 9 We ignore preferential treatment of mortgage interest in the tax code.

12

𝑃𝑟 = ∏ 𝑟

𝑡−𝑗

𝜑𝑏𝑗𝜃−1 𝑗=0

∏ 𝑟 𝑠−𝑗

𝜑𝑏𝑗𝜃−1 𝑗=0

. (3)

The indexes measure change from period 𝑠 to period 𝑡 by weighting past home prices (relative

to a common base) and interest rates according to the relative importance of loans or debt

initiated in those months to the index periods 𝑡 and 𝑠.10

In these expressions, 𝐻𝜏 is a home price index for month 𝜏, 𝑟𝜏 is an average interest rate

for month 𝜏, 𝜓𝑏𝑗 is the population-weighted proportion of mortgagor-month observations with

debt of age 𝑗 (measured as the number of months since the property was acquired), and 𝜑𝑏𝑗 is

the population-weighted proportion of mortgagor-month observations with current loans of

age 𝑗 (measured as the number of months since the first payment) during the reference period

𝑏. The 𝜓 and 𝜑 parameters differ due to refinances. We use the proportion of mortgagors

(rather than the proportion of debt, which is closer to what Statistics Canada uses) in keeping

with the equal-weighting objective of the HCI. The parameter 𝜃 equals 360 to reflect the

number of potential payments in a thirty-year loan, while �̅� is set higher to allow for acquisition

periods to be earlier on refinanced properties. While not well bounded in theory, we set �̅�

equal to 408 to accommodate the beginning of our house price indexes in January 1975. This

covers about 97.5% of observations in our sample. We evaluate adjacent months 𝑡 and 𝑠. We

set 𝑏 as the fourth quarterly lag of the quarter containing month 𝑡. This reflects a realistic

production constraint for using CE data to construct the weights while keeping them as current

as possible. We use CE microdata on mortgage expenses and keep those observations with 30-

10 While the product of two geometric means with identical weights could be written as one geometric mean, writing the index as a product of two components makes for convenient discussion and analysis.

13

year fixed rate first mortgages on primary residences. We drop loan records that likely pertain

to non-housing expenditures (second mortgages and home equity lines of credit).

We use monthly averages of the weekly 30-year fixed mortgage rate averages from the

Freddie Mac Primary Mortgage Market Survey (PMMS), which are available only for the U.S.

market. We also use the Federal Housing Finance Agency’s (FHFA) All Transactions House Price

Index. This index is quarterly, and we interpolate monthly values using the natural spline in

SAS’s PROC EXPAND. The FHFA’s purchase only house price index is monthly and superior

conceptually for a debt index representing past home purchases. However, this series only goes

back to 1991, and would not be long enough to cover all loan ages in our sample.

3.A.2. Property Tax Payment Index

The property tax payment index measures the change in average property tax payments

for households. Our proposed method attempts to hold the aggregate quality of the housing

stock constant and uses annual data from the CE.11 Let 𝑋𝑠,𝑡 and 𝑉𝑠,𝑡 denote proportional growth

in population aggregates for property tax payments and owner-occupied housing unit values

between years 𝑠 and 𝑡, and let 𝐻𝑠,𝑡 be a constant-quality home price index between years 𝑠 and

𝑡. We use timeseries representing the entire U.S. and leave it for future research to extend the

method to geographic areas, which require more granular tax data than we currently have. We

compute the following:

𝑃𝑃𝑇𝑃 = 𝑋𝑠,𝑡 𝑉𝑠,𝑡

𝐻𝑠,𝑡. (4)

11 The CE asks homeowners the annual property taxes owed on their primary residence and adjusts these amounts if the property is partly used as a business. The CE also asks the consumer unit to estimate the market value of their primary residence. Investigating potentially more timely sources of property tax data is a task for future research.

14

Our method is similar to that of Statistics Canada and the Office for National Statistics,

which compute unit value indexes, or ratios of average property tax payments, though they do

so for different geographic areas. Let 𝑁𝑠,𝑡 be the growth in the number of owner-occupied

housing units between 𝑠 and 𝑡. A similar approach we explored with CE data computes

𝑃𝑃𝑇𝑈𝑉 = 𝑋𝑠,𝑡 𝑁𝑠,𝑡

. (5)

where we use the number of owner-occupier consumer units to proxy for the number of

owner-occupied housing units.12 Equation (4) is equal to equation (5) divided by (𝑉𝑠,𝑡/𝑁𝑠,𝑡)/𝐻𝑠,𝑡

which is the growth in average home values deflated by the constant-quality home price index.

We interpret this ratio as a measure of change in dwelling quality which is relevant under the

assumption that the total housing market valuations 𝑉𝑠,𝑡 and the house price indexes 𝐻𝑠,𝑡

approximate changes in value and price as would be measured by tax assessors. We found that

the long-term trends of Eq. (4) and (5) were very similar. As in Canada and the U.K., we do not

attempt to control for potential differences in quality of municipal services.

Our preliminary efforts use annual property tax aggregates from the CE, as the survey

asks about annual tax obligations rather than monthly payments. The monthly expenditure

microdata include these figures divided by 12. We find that that using Equations (4) and (5) on

this average monthly data leads to substantial short-term sampling variation. For this reason,

we compute the property tax index at an annual frequency and interpolate monthly values

12 In the CE, consumer units are equivalent to households in the vast majority of cases but are defined by joint economic decision making rather than residence or familiar relationships.

15

using a spline function. Statistics Canada and the Office for National Statistics, for instance,

update their property tax indexes once per year. The CE is not the ideal source for property tax

and housing value data, as data for a calendar year are released about nine months after that

year ends. For this reason, this paper’s analysis only covers through the end of 2021. Finding

timelier and larger samples using alternative data is an objective for future research.

3.B. Upper-level Aggregation

As in the CPI, we use CE data to derive upper-level aggregation weights, with some

important differences. As shown in

16

Table 1, the set of eligible elementary item strata now includes property taxes and mortgage

interest and excludes owner equivalent rent. The property tax and mortgage interest weight

are derived from the monthly expenditures on those items as collected by the CE. In addition,

we use the full reported values of expenditures on items like maintenance and repair,

homeowner’s insurance, appliances, and household furnishings. Under the rental equivalence

approach, these items are scaled down for owner-occupiers to reflect the likelihood of a renter

making the same purchase. Table 2 compares average housing-related relative importance

across consumer units in different subpopulations —by housing tenure, an indicator for being a

wage earner or clerical worker (as in the CPI-W), and an indicator for being elderly (age greater

than or equal to 62, as in the R-CPI-E)13—both under the payments approach and rental

equivalence. In general, housing payments make up a smaller share of overall spending under

the payments approach than under rental equivalence. For the urban population, for instance,

housing under the payments approach amounts to 34.3% of the market basket on average,

versus 42.9% on average under rental equivalence. Interestingly, patterns of spending across

some subpopulations differ by housing approach. For instance, under rental equivalence, the

average share going to housing among the elderly is relatively high at 46.8%. Under the

payments approach, however, the elderly have a high proportion going to insurance,

appliances, maintenance, and repairs (“other housing”), but relatively less going to mortgage

interest, resulting in a total housing weight of 34.1%, slightly less than the overall urban

population (34.3%).

13 Consumer units were classified according to their reported demographic in their last interview in the sample.

17

Table 1: Weights for Select Housing Items for the HCI Subsample in 2019

Payments Rental

Equivalence Code Description $ Bil. % RI* $ Bil. % RI*

HC01 Owner’s Equivalent Rent of Primary Residence NA NA 1,144.36 22.40 HC09 Unsampled Own. Equiv. Rent of Second. Res. NA NA 56.29 0.75 HD01 Tenants’ and Household Insurance 38.02 1.01 17.24 0.38 HH01 Floor Coverings 8.29 0.18 2.54 0.05 HK01 Major Appliances 17.05 0.39 2.38 0.06 HK09 Other Appliances 0.08 0.00 0.07 0.00 HM01 Tools, Hardware, and Supplies 17.23 0.43 11.67 0.26 HM09 Unsamp. Tools, Hardw., Outdoor Equip, Supp. 58.44 1.31 9.35 0.20 HP04 Repair of Household Items 46.52 0.83 4.14 0.08 HP09 Unsampled Household Operations 10.69 0.23 4.29 0.07 HR01 Property Tax of Primary Residence 199.70 4.51 NA NA HR09 Property Tax of Secondary Residence 8.61 0.16 NA NA HS01 Mortgage Interest of Primary Residence 211.64 4.26 NA NA HS09 Mortgage Interest of Secondary Residence 4.55 0.08 NA NA HT01 Other Owner Payments for Primary Residence 14.10 0.42 NA NA HT09 Other Owner Payments for Secondary Res. 1.29 0.02 NA NA * Average (equally weighted) relative importance across consumer units.

Table 2: Average Household Relative Importance for Housing by Subpopulation (percent)

Category Urban Wage- earner Elderly

Own. w/ Mortgage

Own. w/o Mortgage Renter

Payments Approach Rent 9.2 13.0 6.3 0.1 0.2 31.8 Property Tax 4.7 4.2 5.7 6.1 7.0 0.2 Mortgage Interest 4.3 5.2 2.7 10.2 0.2 0.1 Other Housing 16.0 14.8 19.4 16.9 22.0 8.8 Total Housing 34.3 37.2 34.1 33.2 29.5 40.9 Rental Equivalence Approach Rent 9.2 13.0 6.3 0.1 0.2 31.7 Owner’s Equiv. Rent 23.1 20.9 29.4 31.1 33.9 0.8 Other Housing 10.6 10.4 11.1 10.9 12.0 8.7 Total Housing 42.9 44.3 46.8 42.1 46.1 41.2 Note: Cells show average December 2020 relative importance (2019 reference period weights price-updated to December 2020 values) across households meeting the HCI sample requirement. While expenditures cover a year, consumer units are classified according by attribute from their last collection quarter.

18

Our upper-level aggregation uses the Lowe formula, and same as the CPI (as of January

2023) the quantity weights pertain to annual expenditure reference periods which are updated

each year. The household-weighted aggregation starts from the CE Interview sample, as

consumer units contribute up to one year of data and the Interview comprises most eligible

expenditures. Eligible expenditures from the Diary survey are imputed to the Interview sample

using a matching procedure based on Hobijn, et. al. (2009), which is described further later in

this section and similar to that used in Martin (2022). The procedure matches eligible Diary

consumer units to an Interview consumer unit based on demographic characteristics that are

predictive of total expenditure. The second-stage aggregation is then based on the Lowe

formula with lagged expenditure weights.

𝑃𝐻𝐶𝐼 = ∑ ∑�̅�𝑎,𝑖,𝑣,𝑏𝑃𝑎,𝑖,𝑡,𝑣

𝑖∈ℐ𝑎∈𝒜

(6)

�̅�𝑎,𝑖{𝑣,𝑏} = (

𝐻𝑎,𝑏

𝐻𝑏 )𝐻𝑎,𝑏

−1 ∑ 𝜔ℎ

ℎ∈ℋ𝑎,𝑏

𝑠𝑖,𝑣,𝑏,ℎ

(7)

𝐻𝑎 = ∑ 𝜔ℎ

ℎ∈ℋ𝑎,𝑏

, 𝐻𝑏 = ∑ ∑ 𝜔ℎ

ℎ∈ℋ𝑎,𝑏𝑎∈𝒜

,

(8)

where 𝑎 indexes the geographic area, 𝑖 the item stratum, 𝑣 the index pivot month, 𝑏 the weight

reference period, and ℎ the consumer unit. The set of areas is 𝒜, the set of items ℐ, and the

set of consumer units in area 𝑎 during period 𝑏 is ℋ𝑎,𝑏. The elementary index between pivot

month 𝑣 and period 𝑡 for item 𝑖 in area 𝑎 is given by 𝑃𝑎,𝑖,𝑡,𝑣. The associated household-weighted

expenditure shares are 𝑠�̅�,𝑖,𝑣,𝑏. These are equally (with respect to the population) weighted

averages of individual consumer unit annual expenditure shares 𝑠𝑖,𝑣,𝑏,ℎ, with 𝜔ℎbeing

household ℎ’s sampling weight. The weight reference period 𝑏 is the calendar year two years

19

prior to the calendar year containing month 𝑡, and the expenditure shares 𝑠𝑖,𝑣,𝑏,ℎ are price-

updated to represent period 𝑣 values using the ratio of the elementary index in month 𝑣 to its

average over period 𝑏.

Consumer units participate in the CE for up to four collection quarters, providing up to

twelve months of expenditures. Because participation is on a rolling basis and there is unit

nonresponse and occasional attrition, the number of observations exactly lining up with a single

calendar year is relatively small, often only a few hundred. Therefore, for the HCI, we define a

“reference year” sample differently than does either the CE or CPI. We assign a consumer unit

to a reference year 𝑏 if its last month of expenditure occurred during year 𝑏. So that each ℎ’s

expenditure basket reflects a whole year, we include only observations which completed all

four quarterly interviews, even if some of their expenditures occurred in the prior calendar

year. For the 2019 reference year, for instance, (used for indexes in 2021), we include

consumer units with at least one month occurring in 2019, meaning we include some

observations whose sample tenure started as early February 2018. With the four-quarter

requirement, this amounts to a sample of 3,063 unique consumer units (12,252 collection

quarters) representing our 2019 reference year. In comparison, 11,740 unique consumer units

(comprising 22,957 collection quarters) in the CE have expenditures recorded for the calendar

year 2019.14 For index subgroup definitions, we use consumer unit characteristics from their

final collection quarter.

14 These sample sizes were calculated by counting the number of unique FAMID (or the consumer-unit specific portion of the FAMID) for a given expenditure reference period.

20

As discussed in Martin (2022), including observations with periods less than one year

can distort household-weighted indexes due to greater variability in total expenditures and

lower average expenditure shares for less frequently purchased items. However, there is a

potential trade-off with the four-quarter requirement due to representativity. Table 3 shows

differences in the relative frequencies of a few consumer unit demographics. For the 2019

reference year, the HCI subsample has a greater proportion of owners and elderly than the full

sample of urban consumer units. At the same time, Table 2 shows there are differences in the

average expenditure shares on housing-related payments across these groups, suggesting

potential consequences for price indexes. For instance, the elderly spend relatively more on

property taxes than on mortgage interest, reflecting that they are disproportionately owners

without mortgages.

Table 3: Frequency of Consumer Unit Characteristics by Sample in 2019 (percent)

All Urban HCI Subsample

Owner with mortgage 37.3 41.4 Owner without mortgage 23.6 29.1 Renter 39.2 29.6 Wage earner 27.0 25.3 Elderly 30.8 37.7

Nevertheless, we find little evidence of a sample selection bias stemming from our HCI

eligibility criteria, at least over during sample period. Table 4 shows (comparing columns 2 and

3) the impact of using the CE subsample on major group-level weights is small relative to the

effect of using the payments approach or household aggregation. Additionally, we find

(Appendix C) that the sample selection impact on an expenditure-weighted version of the HCI-U

(corresponding to column 4 of Table 4) is minimal, about 0.01 percentage points per year.

21

Furthermore, our results show a CPI-like index calculated from these subsamples (with Diary

expenditures imputed as described in the next subsection), corresponding to column 3 of Table

4 closely matches the published CPI-U. These together imply our results are driven by the

payments approach and household-weighted aggregation, and not the reference period or CE

subsample. Our current method makes no adjustments to the CE sampling weights, which we

leave to future research. Such adjustments may be more important with more recent data than

our sample period, particularly with recent surges in mortgage interest rates.

There are a few other differences between our research indexes and official CPI

methods. Since the HCI is based on consumer unit-specific shares, which must be weakly

positive, we censor negative annual expenditures at zero.15 We also make some small item-

structure changes to simplify calculations using historical data. Finally, we omit weight-

smoothing procedures used in the CPI, including composite estimation for the item-area

weights, which are designed to lower their sampling variance across geographic areas. Our all

items, all areas CPI-U replications closely match the published indexes even without these

procedures, and our prototype procedure only estimates property tax and mortgage interest at

the national level. We leave it to future research to extend weight-smoothing procedures to the

HCIs.

Figure 1 below shows the December 2020 relative importance by major expenditure

group and select housing categories and compares them with the published shares for the CPI-

U. The HCI shares correspond to the 2019 weight reference year, while for the CPI they

15 This affects items RC01 “Sports Vehicles, Including Bicycles”, TA02 “Used Cars and Trucks”, and TA09 “Unsampled New and Used Motor Vehicles.” The CPI counts returns or sales as negative expenditures.

22

correspond to the 2017-18 reference period. Table 4 tracks the change in relative importance

by major group as different HCI elements are activated. The effects of the payments approach

and household-weighted aggregation on the relative weights are significant, but sometimes

have offsetting effects. For instance, the overall housing weight in the HCI is smaller than the

CPI, as property tax, mortgage interest, and the increase in other housing outlays amounts to

less than the decrease due to the exclusion of OER. By itself, this decrease in housing weight

increases the weight allocated to other categories, like medical and recreation. At the same

time, however, household-weighted aggregation shifts weight toward households with lower

total expenditures, further increasing the relative importance of rent and food while decreasing

that of transportation.

Figure 1: December 2020 Relative Importance for HCI-U and CPI-U

Panel a: HCI-U (2019 weights)

Panel b: CPI-U (2017-18 weights)

20.2%

9.2%

4.7%

4.3%

16.0%3.1%

14.2%

11.1%

6.6%

6.8% 3.8%

Food & Bev. Housing: Rent

Housing: Prop. Tax Housing: Mortgage

Housing: Other Apparel

Transportation Medical

Recreation Educ. & Comm.

Other

15.2%

7.9%

24.3%

10.3% 2.7%

15.2%

8.9%

5.8%

6.8% 3.2%

Food & Bev. Housing: Rent

Housing: OER Housing: Other

Apparel Transportation

Medical Recreation

Educ. & Comm. Other

23

Table 4: December 2020 Relative Importance for Different Index Types (percent)

Major Group CPI-U (2) (3) (4) HCI-U

Food and Beverages 15.16 15.68 15.60 17.96 20.16 Housing 42.39 41.84 42.13 33.34 34.26 Apparel 2.66 2.70 2.67 3.07 3.15 Transportation 15.16 15.43 14.60 16.80 14.23 Medical 8.87 8.79 9.18 10.58 11.09 Recreation 5.80 5.80 6.16 7.08 6.59 Education and Comm. 6.81 6.72 6.57 7.61 6.76 Other 3.16 3.04 3.09 3.56 3.76 Methods* Reference Period 2017-18 2018-19 2019** 2019** 2019** CE Sample Full Full 4-quarter 4-quarter 4-quarter Aggregation Expenditure Expenditure Expenditure Expenditure Household Owner Occ. Housing REQ*** REQ*** REQ*** Payments Payments * Columns 2-5 also reflect other methodology changes and simplifications described in text. ** Under our sample eligibility criteria, this includes spending back to February 2018. *** REQ = Rental Equivalence

3.B.1. Interview-Diary Matching Procedure

As mentioned, the basis of our household average expenditure weights is the CE

Interview sample, which covers about three-quarters of the expenditure basket as traditionally

sourced by the CPI. We implement a statistical matching procedure based on Hobijn et al.

(2009) to impute the remaining proportion which CPI sources from the Diary.16 Similar

observations from the Diary sample provide the remaining expenditure data for each Interview

consumer unit, according to a model of expenditures as a function of demographic

characteristics. The dependent variable is expenditures on items which HCI (and the CPI)

sources from the Diary, but for which the Interview either collects the same item or has more

16 Garner, et. al. (2022) and Martin (2022) also use matching processes based on Hobijn, et. al. (2009).

24

aggregate data.17 The model is a convenient way of combining many characteristics according

to which linear combination most strongly predicts expenditures. We then use the predicted

values to form measures of distance between an Interview recipient and its potential Diary

donors. For our main results, the only attribute guaranteed to match between donor and

recipient is quintile group membership based on the distribution of annual before-tax income.18

For our results on housing tenure subpopulations, we also guarantee this attribute matches.

The matching procedure is many-to-one, as we draw four donor Diaries for each Interview in

each month with replacement. The procedure is implemented separately by month so that

weekly Diary donors are evenly distributed temporally over the recipient Interview’s sample

tenure. Due to the sample selection criteria outlined earlier, for reference year 2019, for

example, that means we are running monthly regressions from February 2018 to December

2019. The stratification and model estimation are done on the full Interview sample, not just

the four-quarter subsample.

First, we stratify both Interview and Diary consumer unit samples for the reference

period by the sample quintiles of annual before-tax income. For each month 𝑡 and quintile

grouping 𝑞, we use the Interview sample to estimate the regression

𝑦ℎ𝑡 = 𝒙ℎ𝑡𝜷𝑞𝑡 + 𝑢ℎ𝑡,

(9)

17 From Martin (2022), Table A2, these amount to about 80% of Diary-sourced expenditures in 2019. Alternatively, it might seem attractive to use the Diary sample to estimate Diary expenditures as a function of demographic characteristics, as we intend to impute these expenditures for the Interview sample. However, we find that characteristics explain relatively little variation in Diary expenditures, perhaps due to the short (week-long) recall period. 18 The Diary samples are small enough that conditioning on multiple characteristics quickly leads to empty cells. See Hobijn, et al. (2009) for more discussion.

25

where 𝑦ℎ𝑡 is logged expenditure of consumer unit h. The term 𝑢ℎ𝑡 is an error term, and 𝒙ℎ𝑡

include Census region, urban/rural, age, race, sex, and education of the reference person,

consumer unit size, the log of annual before-tax income (if positive), and an indicator for

whether income was negative.19 We use the least squares estimator weighted by the CE

sampling weight, finlwt21. Over the sample period, R-squared values for the quintile and

month-specific regressions averaged 0.17, while income quintile itself explained about 0.31 of

the variation in the dependent variable.

Let �̂�𝑞𝑡 be the slope estimate for quintile 𝑞 in month 𝑡. As household characteristics are

available and comparably defined in both surveys, we calculate predicted values �̂�ℎ𝑡 = 𝒙ℎ𝑡�̂�𝑞𝑡

for each Diary and Interview observation. For a given Interview observation ℎ and Diary

observation 𝑘, the distance metric is defined as

𝛿𝑡(ℎ, 𝑘) = |�̂�ℎ𝑡 − �̂�𝑘𝑡|.

(10)

Within each month and income quintile, we calculate 𝛿𝑡(ℎ, 𝑘) for all {ℎ, 𝑘} pairs. Then for each

Interview observation ℎ, we randomly select (with replacement) four 𝑘 from the twenty

smallest 𝛿𝑡(ℎ, 𝑘) out of all the Diary observations from the same month and income quintile.

The random component is intended to ensure a more even distribution of matches across Diary

observations. The detailed set of expenditures of the donor Diary is then assigned to the

recipient Interview. As one donor Diary is intended to represent one quarter of one month of

expenditure, but Diaries correspond to a one-week recall period, the donor Diary expenditures

19 These demographic variables technically pertain to the collection quarter or some other reference period, so we implicitly assume they represent the associated reference months. For the matching regressions, we allow a consumer unit’s attributes to vary by collection quarter.

26

are scaled by 13/12. This process is repeated for each Interview observation, for each month it

is in the sample.20 Since the Interview sample is much larger than the Diary on a per-month

basis, each Diary is matched with several Interviews. Further analysis of the matching

procedure is in Appendix B.

4. Results

We find the HCI-U follows similar patterns of acceleration and deceleration as the CPI-U,

but it has significantly lower average rates of growth during our sample period. The average 12-

month change in the HCI-U averages 1.51% versus 1.86% for the CPI-U, as shown in

Table 5.

20 In the CPI, diary expenditures are multiplied by 13 to account for the difference in recall periods between weekly diaries and quarterly interviews. The scaling in our procedure is analogous in that an interview is matched with a total of 12 diaries each quarter, and with the scaling these also represent 13 weeks.

27

Figure 2 plots the index levels, showing markedly different trends between the CPI-U

and HCI-U from 2012-2020. The two indexes increased at a similar rate in 2021, averaging 4.6-

4.7% year-over-year growth throughout the year.

Table 5 includes an index (U-EW-REQ) which uses expenditure weighting and the rental

equivalence approach but uses our CE subsample and processing methods. It also includes a

comparable series (U-EW-PAY) which instead uses the payments approach but uses

expenditure weighting as in the CPI. Comparisons of these indexes and the HCI-U show the

difference in trends and average growth reflects primarily the impact of the payments

approach. U-EW-PAY averages about 0.39 percentage points per year less than U-EW-REQ, and

in a single year (2016) averages 0.74 percentage points lower. In 2021, the impact of the

payments approach is to add 0.15 percentage points to the average 12-month percent change,

reflecting increasing home prices and interest rates. In 2022, we also expect this effect to be

positive and much larger in magnitude due to the large increase in mortgage interest rates. In

contrast, comparing HCI-U to U-EW-PAY shows the household-weighted aggregation adding

28

only slight amount to the overall average 12-month percent change (0.05%), but yearly average

differences are as high as 0.16 percentage points in 2017. In 2021, household-weighted

aggregation lowers HCI-U by 0.1 percentage points on average.

Figure 2: HCI-U and CPI-U Index Levels

Table 5: Average 12-month Percent Changes by Year, HCI and CPI

1

1.05

1.1

1.15

1.2

1.25

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

lowe-u (ew, req) lowe-u (ew, pay) cpi-u hci-u

29

Year HCI-U CPI-U U-EW-

REQ U-EW-

PAY HCI-OM HCI-ONM HCI-RNT

2013 0.99% 1.47% 1.43% 0.86% 0.52% 1.22% 1.57% 2014 1.41% 1.62% 1.63% 1.27% 1.02% 1.65% 1.77% 2015 -0.44% 0.12% 0.15% -0.44% -0.88% -0.52% 0.27% 2016 0.56% 1.26% 1.24% 0.51% 0.10% 0.55% 1.19% 2017 1.76% 2.13% 2.13% 1.60% 1.41% 1.79% 2.24% 2018 2.36% 2.44% 2.42% 2.33% 2.32% 2.23% 2.52% 2019 1.39% 1.81% 1.81% 1.43% 1.30% 1.02% 1.80% 2020 0.93% 1.24% 1.21% 0.84% 0.65% 0.89% 1.31% 2021 4.62% 4.69% 4.58% 4.73% 4.54% 4.95% 4.44%

Average 1.51% 1.86% 1.84% 1.46% 1.22% 1.53% 1.90% Notes: U signifies urban population. U-EW-REQ is a CPI-like replication using the HCI sample and simplified expenditure processing methods, but expenditure-weighting and rental equivalence. Similarly, U-EW-PAY uses expenditure-weighting, but the payments approach. “OM” is owners with a mortgage, “ONM” is owners without a mortgage, and “RNT” is renters.

Figure 3 describes further how the actual outlays for owner-occupiers are associated

with lower inflation than would be implied by rental equivalence. Over the sample period, the

official index for owner’s equivalent rent increases 33.8% cumulatively, while our sub-aggregate

for owner’s payments (combining property tax, mortgage interest, and other owner payments)

increased only 11.5%. Within owner’s payments, the two major components, the trend in the

property tax index is similar to owner’s equivalent rent for most of the sample period.

However, the mortgage interest index trends flat, not yet picking up the sharp increases in

interest rates occurring in 2022 after our sample period ends.21 We also note that evolution of

the mortgage interest index is smoother than current average mortgage interest rates (from

21 Our analysis is constrained by sourcing property tax payments from the CE, which as of June 2023 are only available through the first half of 2022. The average 12-month change for the mortgage interest index is 8.2% in 2022. Using the first half of 2022 property tax burden (X/V) as a crude forecast, we find an average change in the owner’s payments index of 10.0% in 2022 (versus 5.7% for owner’s equivalent rent), and an average change in the HCI-U of 8.7% (versus 8% for the CPI-U).

30

the Freddie Mac PMMS), because the index is averaging over 30 years of past mortgage rates in

order to reflect current payments.

Figure 3: Owner’s Equivalent Rent vs. Owner’s Payments

Finally, we further illustrate the treatment of owned housing outlays by estimating HCI’s

for three subpopulations, owners with a mortgage (OM), owners without a mortgage (ONM),

and renters (RNT). We define these using the housing tenure value reported by the consumer

unit in their final interview. The final three columns of

0

1

2

3

4

5

6

0.9

0.95

1

1.05

1.1

1.15

1.2

1.25

1.3

1.35

1.4

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

Owner's Equiv. Rent (HC) Owner's Payments (HR, HS, HT)

Property Tax (HR) Mortgage Interest (HS)

Other Owner Payments (HT) 30-yr fix. rate (r. axis, %, PMMS)

31

Table 5 show the average 12-month percent changes, while Figure 4 plots the index levels. HCI-

RNT has average inflation of 1.9% and is closest to the CPI-U. While there may be overall weight

differences between the urban population and the subpopulation of renters, the evolution of

owner’s equivalent rent is close enough to the evolution of actual rent that this result is not

surprising. In contrast, the HCI inflation for owners is significantly lower, averaging 1.53% per

year for those without a mortgage and 1.22% per year for those with a mortgage. As with the

urban indexes, the relative rankings are not the same year to year. For instance, owners

without mortgages had the highest average inflation in 2021, 4.95%, versus 4.54% for owners

with a mortgage and 4.44% for renters.

Figure 4: HCIs for Housing Tenure Subpopulations

4.A. Alternative Treatments of Owner Payments for Housing

As discussed in Section 3.A, we follow international practice in excluding mortgage

principal and basing mortgage interest and property tax index changes on two sources: a

1

1.05

1.1

1.15

1.2

1.25

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

hci-om hci-onm hci-rnt

32

change in a rate (the interest rate or the effective property tax rate), and the change in a

monetary base (the debt level and the housing value). The appendix, including Figure 5 and

Figure 6, explore the sensitivity of the indexes to these decisions. Including mortgage principal

would raise the owner’s payments subindex (combining mortgage interest, property tax, and

other payments as in Figure 3) by 0.8 percentage points per year. Combined with the associated

weight increase to mortgages, this would result in an all-items HCI-U that is higher by 0.10

percentage points per year. The effect of home prices would be more substantial, lowering the

owner’s payments index by 4.0 percentage points per year and the all-items HCI-U by 0.38

percentage points per year.

5. Conclusions and Future Research

Our results show the HCI differs substantially from the CPI because it uses the payments

approach for owner occupied housing, and slightly because it weights households equally in its

upper-level aggregation. The payments approach tracks the actual outlays of homeowners,

which over our sample period of 2012 to 2021 have escalated at a lower trend than (imputed)

owner’s equivalent rent, resulting in lower inflation as measured by the HCI than as measured

by the CPI. We do not argue that the payments approach is superior from the standpoint of

measuring the cost-of-living as an economic theoretic concept or for use in monetary policy.

Rather, by reflecting the explicit outlays of owners, we show the HCI offers a measurement of

the household inflation experience which is empirically different than the CPI.

Future research could focus on many areas. Our measures of price change for mortgage

and property tax payments use only national-level data. A natural next step would be to extend

33

these to subnational geographic areas, if relevant and feasible. Further down the road,

exploring mortgage microdata of the sort described by Bhutta, et. al. (2020) could be

informative on different experiences of subpopulations, to the extent that long enough

histories can be obtained to account for the long lives of mortgage loans. More timely and

granular property tax data would also improve the HCI. In addition, in principle, the payments

approach could be extended to any durable good where payment occurs over a long

timeframe, with automobiles in particular being a high priority. Martin (2022) suggests treating

automobiles under an approach consistent with the target of the index (payments, in our case)

is critical if higher-frequency household weights are to be taken seriously, such as for a monthly

weighted superlative like the C-CPI-U. Custom sampling weights should also be created to

account for demographic differences for the four-quarter sample of consumer units used for

the HCIs, but further analysis may also be warranted related to weight frequency and

subsample selection. With the payments approach weighting of automobiles, for instance,

perhaps infrequent purchase issue discussed in Martin (2022) is less salient. Finally, the impact

household-weighted aggregation on the all-items index’s sampling variation or the potential of

weight-smoothing techniques have yet to be explored.

References

Astin, J., & Leyland, J. (2015). Towards a Household Inflation Index: Compiling a consumer price index

with public credibility. Royal Statistical Society. Retrieved November 20, 2020, from

https://rss.org.uk/RSS/media/News-and-

publications/Publications/Reports%20and%20guides/Astin-Leyland-HII-paper-Apr-2015.pdf

Bhutta, N., Fuster, A., & Hizmo, A. (2020). Paying Too Much? Price Dispersion in the US Mortgage

Market. Washington, DC: Board of Governors of the Federal Reserve System.

doi:https://doi.org/10.17016/FEDS.2020.062

34

Bureau of Labor Statistics. (2020). The Consumer Price Index. In Handbook of Methods. Washington, DC.

Retrieved from https://www.bls.gov/opub/hom/cpi/home.htm

Central Statistics Office. (2016). Consumer Price Index: Introduction of Updated Series (Base: December

2016=100). Cork: Central Statistics Office. Retrieved from

https://www.cso.ie/en/media/csoie/methods/consumerpriceindex/CPI_-

_introduction_to_series_2016.pdf

Diewert, W. E. (1976). Exact and Superlative Index Numbers. Journal of Econometrics, 4(2), 115-145.

doi:10.1016/0304-4076(76)90009-9

Diewert, W. E., & Nakamura, A. O. (2009). Accounting for Housing in a CPI. Philadelphia: Federal Reserve

Bank of Philadelphia. Retrieved from https://www.philadelphiafed.org/-

/media/frbp/assets/working-papers/2009/wp09-4.pdf

Diewert, W. E., & Shimizu, C. (2021). Chapter 10: The Treatment of Durable Goods and Housing. In

Consumer Price Index: Theory (Draft). Washington, D.C.: International Monetary Fund. Retrieved

from https://www.imf.org/en/Data/Statistics/cpi-manual#companion

Federal Housing Finance Agency. (2021). House Price Index Datasets. Retrieved from

https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index-Datasets.aspx

Freddie Mac. (2022). Primary Mortgage Market Survey - About. Retrieved April 29, 2022, from Primary

Mortgage Market Survey: https://www.freddiemac.com/pmms/about-pmms

Freddie Mac. (2023). Primary Mortgage Market Survey - Archive. Retrieved March 17, 2023, from

Primary Mortgage Market Survey: https://www.freddiemac.com/pmms/pmms_archives

Garner, T. I., & Verbrugge, R. (2009). Reconciling user costs and rental equivalence: Evidence from the

US consumer expenditure survey. Journal of Housing Economics, 18(3), 172-192.

doi:10.1016/j.jhe.2009.07.001

Gillingham, R., & Lane, W. (1982). Changing the treatment of shelter costs for homeowners in the CPI.

Monthly Labor Review, 9-14. Retrieved from

https://www.bls.gov/opub/mlr/1982/06/art2full.pdf

Goodhart, C. (2001). What Weight Should be Given to Asset Prices in the Measurement of Inflation? The

Economic Journal, F335-F356. doi:10.1111/1468-0297.00634

International Labor Organization. (2004). Consumer Price Index Manual: Theory and Practice. (P. Hill,

Ed.) Geneva: International Labor Organization. Retrieved from

https://www.ilo.org/wcmsp5/groups/public/---dgreports/---

stat/documents/presentation/wcms_331153.pdf

International Labour Organization. (2003). Resolution concerning consumer price indices. Resolution of

the Seventeenth International Conference of Labor Statisticians. Geneva. Retrieved from

http://ilo.org/wcmsp5/groups/public/---dgreports/---

stat/documents/normativeinstrument/wcms_087521.pdf

35

Office for National Statistics. (2017). Household Costs Indices: Methodology. Office for National

Statistics. Retrieved from

https://www.ons.gov.uk/economy/inflationandpriceindices/methodologies/householdcostsindi

cesmethodology

Office for National Statistics. (2019). Consumer Prices Indices Technical Manual. Office for National

Statistics. Retrieved from

https://www.ons.gov.uk/economy/inflationandpriceindices/methodologies/consumerpricesindi

cestechnicalmanual2019

Poole, R., Ptacek, F., & Verbrugge, R. (2005). Treatment of Owner-Occupied Housing in the CPI.

Washington, DC: Bureau of Labor Statistics. Retrieved from

https://www.bls.gov/advisory/fesacp1120905.pdf

Prais, S. J. (1959). Whose cost of living? The Review of Economic Studies, 126-134. doi:10.2307/2296170

Statistics Canada. (2019). The Canadian Consumer Price Index Reference Paper. Statistics Canada.

Retrieved from https://www150.statcan.gc.ca/n1/pub/62-553-x/62-553-x2019001-eng.htm

Statistics New Zealand. (2020). Household living-costs price indexes (HLPIs) data dictionary (Version 33).

Wellington: Statistics New Zealand. Retrieved November 23, 2022, from

https://datainfoplus.stats.govt.nz/Item/nz.govt.stats/a46a6353-947a-4062-89e7-

c6faef4fece1/?_ga=2.96280540.1570432553.1669226241-1704970333.1669226240

36

Appendix

A. Alternative Mortgage Interest and Property Tax Indexes

The mortgage payments index which includes mortgage principal replaces the interest

rate component, Eq. (3), with the following representing change in full mortgage payments

between months 𝑠 and 𝑡:

𝑃𝑓 =

∏ [𝑓(𝑟𝑡−𝑗, 𝜃 − 𝑗)] 𝜑𝑏𝑗𝜃−1

𝑗=0

∏ [𝑟𝑠−𝑗 , 𝜃 − 𝑗)] 𝜑𝑏𝑗𝜃−1

𝑗=0

. (11)

where 𝑓(𝑟, 𝜔) = 𝑟𝑅𝜔 (𝑅𝜔 − 1)⁄ , 𝜔 > 1, where 𝑅 = 1 + 𝑟. The function 𝑓 represents the

fixed mortgage payment as a proportion of the current debt amount. In this expression, the

interest rate 𝑟 is the annualized rate divided by 12 so that it corresponds to one month. Note,

when estimated using aggregate data, even if 𝑟𝑡−𝑗 equals an average interest rate across

households with loans of age 𝑗, the amount 𝑓(𝑟𝑡−𝑗, 𝜃 − 𝑗) cannot be interpreted as an average

mortgage payment ratio across households due to Jensen’s inequality. The relationship

between 𝑓(𝑟𝑡−𝑗, 𝜃 − 𝑗) and a true household average is unknown (at least to the authors) but

using such an average in a price index would require microdata tracking individual mortgagors

across loan changes including refinances (which we can observe in the CE) and new loans

(which we often do not observe due to address-based sampling). The mortgage payment

indexes without home prices remove the debt index component, Eq. (2), while the property tax

index without home prices is just the effective tax rate component, 𝑋𝑠,𝑡 𝑉𝑠,𝑡⁄ from Eq. (4).

37

Figure 5 plots the different Owner’s Payment subindexes (combining mortgage interest,

property taxes, etc., as in Figure 3) and compares them again against owner’s equivalent rent.

Adding mortgage principal increases the owner’s payments index by about 0.8 percentage

points per year when home prices are included, and about 1 percentage point per year when

home prices are excluded. Given the strong upward trend of home prices over the past several

decades, removing their lowers the payments index by 4.0 percentage points per year when

mortgage principal is excluded and by 6.6 percentage points per year when mortgage principal

is included, resulting in downward trends. Figure 6 tracks these payments indexes changes on

the all-items HCI-U, accounting for changes in both the elementary indexes and the aggregation

weights. The overall effect of mortgage principal is modest, adding 0.10 or 0.03 percentage

points per year depending on whether house prices are included. Home prices themselves have

a larger impact on the all-items index, decreasing it by either 0.38 or 0.45 percentage points per

year depending on whether mortgage principal is included.

Figure 5: Alternative Versions of Owner’s Payments

0.7

0.8

0.9

1

1.1

1.2

1.3

1.4

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

Owner's Equiv. Rent (HC) Paym. (HR, HS, HT) Paym. (with principal) Paym. (no home prices) Paym. (with principal, no home prices)

38

Figure 6: HCI-U Under Alternative Versions of Owner’s Payments

B. Interview-Diary Matching Details

We base our household-averaged weights on the CE Interview sample but use a

statistical matching procedure to assign sets of weekly Diary expenditures to each Interview

consumer unit. Our procedure is similar in spirit to that of Hobijn, et. al. (2009), though that

paper models expenditure change (implied by a consumer-unit specific price index) rather than

expenditure levels. Modeling expenditure changes is attractive given the ultimate use of the

matched dataset for price indexes, but Martin (2022) finds demographics explain much less of

the variation in expenditure changes. We limit the dependent variable to categories collected in

both the Interview and the Diary to ensure that the correlations picked up by the model are

relevant to the expenditures we ultimately wish to impute. Over the sample period, R-squared

values for the quintile and month-specific regressions averaged 0.17, while income quintile

itself explained about 0.31 of the variation in the dependent variable. Figure 7 below plots the

average regression R-squared for each quintile, where the averaging is over the 23 months used

1

1.05

1.1

1.15

1.2

1.25

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

cpi-u hci-u hci-u (with principal) hci-u (no home prices) hci-u (with principal, no home prices)

39

for each reference period. The figure shows that average R-squared for the income quintiles are

fairly stable over time, averaging about 0.23 for the 1st quintile, 0.17 for the second quintile,

0.13 for the third quintile, 0.11 for the fourth quintile, and 0.15 for the fifth quintile. The fits

(conditional on income quintile) are not particularly strong, which motivates matching an actual

diary’s expenditure set to an interview consumer unit rather than using regression fitted values.

Figure 7: Average R-Squared by Reference Period and Income Quintile

The rest of this section presents figures comparing the imputed weekly diary

expenditures to the actual. Figure 8 shows average imputed weekly expenditures for the

reference period track the actual averages well over time, always falling within 1% of the true

averages. Figure 9 compares average weekly Diary expenditures over time by major group. For

food and beverages, which is by far the largest category sourced from the Diary, the imputed

averages fall within 1% of the actual averages, and they fall within 10% for all other categories.

Figure 10 compares the deciles of weekly imputed Diary expenditures to those of the actual

0

0.05

0.1

0.15

0.2

0.25

2010 2011 2012 2010 2013 2014 2015 2016 2010 2017 2018 2019

IQ1 IQ2 IQ3 IQ4 IQ5

40

Diary expenditures for the 2019 reference period (results are similar for other periods). The two

marginal distributions line up well—the imputed deciles are within a few dollars of the actual

deciles.

Figure 8: Actual and Imputed Average Weekly Diary Expenditures by Reference Period

220

230

240

250

260

270

280

290

300

310

2010 2011 2012 2013 2014 2015 2016 2017 2018 2019

actual imputed

41

Figure 9: Average Weekly Diary Expenditures by Reference Period and Major Group

Panel a: Food and Beverages

Panel b: Housing

Panel c: Apparel

Panel d: Transportation

Panel e: Medical

Panel f: Recreation

Panel g: Education and Communication

Panel h: Other

0

50

100

150

200

actual imputed

0

10

20

30

40

actual imputed

0

10

20

30

40

actual imputed

0

5

10

15

20

25

30

actual imputed

0

2

4

6

8

actual imputed

0

10

20

30

40

actual imputed

0

1

2

3

4

actual imputed

0

5

10

15

actual imputed

42

Figure 10: Deciles of Actual and Imputed Weekly Diary Expenditures for 2019 Reference Year

In terms of joint distributions, the matching procedure also does a good job at

replicating average diary expenditures by several demographic characteristics, as shown in

Figure 11 for 2019. Not surprisingly, because income quintile is conditioned on, the procedure

replicates average expenditures by income quintile quite well. The procedure also does well

replicating average differences by housing tenure, age categories, Census region, presence of

children, and education categories, even though these characteristics are not explicitly

conditioned on in the matching process. In these cases, the match quality is being driven by the

correlation between these characteristics and income, as well as the extent to which similarity

in these characteristics across surveys is predictive of expenditures, and so leading to lower

distance between similarly attributed observations.

0

100

200

300

400

500

600

700

1 2 3 4 5 6 7 8 9

actual imputed

43

Figure 11: Average Weekly Diary Expenditures by Attribute, 2019 Reference Period

Panel a: Income Quintile

Panel b: Housing Tenure

Panel c: Age

Panel d: Presence of Children

Panel e: Census Region

Panel f: Education

0

100

200

300

400

500

600

1 2 3 4 5

actual imputed

0

100

200

300

400

Own w/ Mort.

Own w/o Mort.

Renter No cash rent

Student

actual imputed

0

50

100

150

200

250

300

350

<=61 >61

actual imputed

0

100

200

300

400

No kids Kids

actual imputed

260

270

280

290

300

310

320

330

NE MW S W

actual imputed

0

100

200

300

400

< H.S. H.S. & Some Coll.

>= Bachelors

actual imputed

44

C. All-items Indexes Using Different CE subsamples

Figure 12: Twelve-month inflation of CPI and indexes using payments approach by subsample

As a check of our sample requirement that consumer units contributing to the HCI have

four quarters of data in the CE survey, we compare all-items indexes (all using the payments

approach) with this eligibility requirement against all-items indexes without. For this

comparison, we examine expenditure-weighted aggregates across households, as equally

weighted aggregates can be sensitive to weight frequency and overall dispersion in total

expenditures (Ley, 2005; Martin, 2022). We consider both the full CE sample for the reference

year, as well as for the full CE sample for the biennial period ending in the reference year, as

our HCI subsample also includes four-quarter households who entered the CE in the year prior

to the reference year. Figure 12 plots the twelve-month percent changes of these indexes as

well as the CPI-U for reference. Over this period, average inflation of the CPI-U is 1.86% per

-0.02

-0.01

0

0.01

0.02

0.03

0.04

0.05

0.06

0.07

0.08 D

ec -1

2

M ay

-1 3

O ct

-1 3

M ar

-1 4

A u

g- 1

4

Ja n

-1 5

Ju n

-1 5

N o

v- 1

5

A p

r- 1

6

Se p

-1 6

Fe b

-1 7

Ju l-

1 7

D ec

-1 7

M ay

-1 8

O ct

-1 8

M ar

-1 9

A u

g- 1

9

Ja n

-2 0

Ju n

-2 0

N o

v- 2

0

A p

r- 2

1

Se p

-2 1

lowe-u (ew, pay, 4Q) lowe-u (ew, pay, full-be)

lowe-u (ew, pay, full-a) cpi-u

45

year. The payments approach index using the four-quarter sample averaged 1.46%, while the

indexes using the full annual and biennial samples averaged 1.47% and 1.46%, respectively.

Figure 13: Twelve-month inflation of HCI and indexes using payments approach by subsample

Figure 13 repeats the analysis in Figure 12, but compares the HCI-U and comparable

household-weighted indexes using the full annual or biennial CE samples. The HCI-U averaged

1.51% year-over-year, while the index using the full annual and full biennial samples averaged

1.50% and 1.51%, respectively, though larger differences occurred in 2021. Here, index

differences could reflect sample selection effects, but also likely reflect the mixed frequencies

of household weights underlying the full-sample indexes, as some consumer units have only a

few months or quarters of expenditure due to normal sample rotations and unit nonresponse.

Higher frequency expenditure shares tend to give less weight to less frequently purchased

items and more weight to more frequently purchased items (Martin, 2022). We do not want to

capture this latter effect because, in the case of the HCI’s, it is an artifact of using CPI weights

-0.02

-0.01

0

0.01

0.02

0.03

0.04

0.05

0.06

0.07

0.08

D ec

-1 2

M ay

-1 3

O ct

-1 3

M ar

-1 4

A u

g- 1

4

Ja n

-1 5

Ju n

-1 5

N o

v- 1

5

A p

r- 1

6

Se p

-1 6

Fe b

-1 7

Ju l-

1 7

D ec

-1 7

M ay

-1 8

O ct

-1 8

M ar

-1 9

A u

g- 1

9

Ja n

-2 0

Ju n

-2 0

N o

v- 2

0

A p

r- 2

1

Se p

-2 1

hci-u lowe-u (hw, pay, full-be) lowe-u (hw, pay, full-a)

46

for automobiles, which are measured by full purchase price at the time of acquisition, rather

than ongoing monthly payments. In 2021, when HCI-U (over the four-quarter sample) has

slightly higher inflation than the two full sample indexes. In 2021, vehicle price inflation was

high relative to the average inflation across all items, and the comparison in the figure is

consistent with the full-sample indexes giving too little weight to vehicles. A payments

approach for vehicles should mitigate this effect in the full samples.

Impact of Weight Timeliness on the US CPI

Languages and translations
English

1 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Impact of Weight Timeliness on the US CPI

Anya Stockburger, Joshua Klick, Chris Miller, Jessie Park

Bureau of Labor Statistics

Meeting of the Group of Experts on CPIs

June 9, 2023

2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

BLS Consumer Price Indexes

100

110

120

130

140

150

160

170

180

190

CPI-U Final C-CPI-U

3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Motivation: Weighting Improvements Product Release Lag

(difference between index reference period and publication)

Weight Lag (difference between weight and index reference period)

Improvement Goals

CPI-U ~10 days Biennial: 12-60 months

Annual: 12-36 months

Reduce weight lag

Final Chained CPI-U

~10 days + 10-12 months

1-2 months Reduce release lag Increase visibility

4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Outline

Motivation

Impact of weight timeliness

Reducing weight lag in CPI-U

Future research

5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Weight changes over time Cumulative percentage change in annual spending shares

-80%

-60%

-40%

-20%

0%

20%

40%

60%

2000 2005 2010 2015 2020

Pe rc

en t C

ha ng

e

Year

Apparel Education and Communication Food and Beverages Other Goods and Services Housing Medical Care Recreation Transportation

6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Airline Fares Monthly Expenditure Weights

0.0%

0.2%

0.4%

0.6%

0.8%

1.0%

1.2%

1.4%

1.6%

1.8%

7 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Impact of chain drift - Tornqvist

Cage, Williams, Church 2021 “Chain Drift in the Chained Consumer Price Index: 1999-2017”

8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Impact of chain drift – monthly chained Laspyeres

9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Historical Impact of Timely Weights

-0.40%

-0.20%

0.00%

0.20%

0.40%

0.60%

0.80%

1.00%

2001 2006 2011 2016 2021

Difference in 12-month Percent Change CPI-U less Final C-CPI-U

10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Recent Impact of Timely Weights

-0.30%

-0.20%

-0.10%

0.00%

0.10%

0.20%

0.30%

0.40%

0.50%

0.60%

0.70% Ja

n M

ar M

ay Ju l

Se p

No v

Ja n

M ar

M ay Ju

l Se

p No

v Ja

n M

ar M

ay Ju l

Se p

No v

Ja n

M ar

M ay Ju

l Se

p No

v Ja

n M

ar M

ay

2018 2019 2020 2021 2022

Difference in 12-month Percent Change CPI-U – Final C-CPI-U

11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Items contributing to negative substitution bias

-0.02 -0.015 -0.01 -0.005 0 0.005 0.01 0.015 0.02

Full-service meals and snacks

Limited-service meals and snacks

Admissions

College tuition and fees

Airline fare

Personal computers

Toys

New vehicles

Motor vehicle insurance

Owner's Equivalent Rent

Upper-Level Substitution Bias Contribution - December 2021

12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Negative substitution bias: price and weight change

Full-service meals and snacks

Limited-service meals and snacks

Admissions

College tuition and fees

Personal computers

Toys

New vehicles

Motor vehicle insurance

Owner's Equivalent Rent

-100%

-50%

0%

50%

100%

150%

-20.00% -15.00% -10.00% -5.00% 0.00% 5.00% 10.00%W ei

gh t C

ha ng

e

Price Change

Atypical consumer substitution

13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Items contributing to positive substitution bias

-0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5

Used cars and trucks

Gasoline

New vehicles

Full-service meals and snacks

Limited-service meals and snacks

Food at employee sites and schools

Clocks, lamps, and décor

Jewelry

Motor vehicle insurance

Owner's equivalent rent

Upper-Level Substitution Bias Contribution - December 2021

14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Positive substitution bias: price and weight change

Used cars and trucks

GasolineNew vehicles

Full-service meals and snacks

Limited-service meals and snacksFood at employee

sites and schools

Clocks, lamps, and décor

Motor vehicle insurance

Owner's equivalent rent

-50%

-30%

-10%

10%

30%

50%

70%

90%

110% -60.00% -40.00% -20.00% 0.00% 20.00% 40.00% 60.00%

W ei

gh t c

ha ng

e

Price change

Typical consumer

substitution

15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Improvements to Weight Timeliness

Weight update frequency

Annualized 12-month percent change (2002-2020)

Upper-level substitution bias

Biennial 2.06 0.24

Annual 2.03 0.21

Quarterly 1.95 0.13

Tornqvist (monthly)

1.82 -

 Chain drift None at aggregate levels

for annual or quarterly Issue for lower-level

quarterly

 Research papers Annual: Klick 2021 Quarterly: Klick, Park

2022

16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Implement annual weight update

 Implemented annual weight updates with January 2023 indexes More relevant weights (replace 2019/2020 with

2021 expenditure data) See website for more information on 2022 and

2023 weight updates

17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Weight changes in 2023 Apparel Food, Alcohol Away

& Haircuts

Recreation &

Transportation

Lodging Away

Food, Alcohol at Home Other & Housing Goods

Hospital Services & Medicinal Drugs

18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

What’s next – improve timeliness of the C-CPI?

 Medium-term: 6-month lag to publish final C-CPI Survey protocol (placement dates) Processing efficiencies (auto-coding, monthly processing,

streamlined outlier review) Design changes (survey recall length)

 Long-term: real-time capture of expenditure information Funding for pilot test included in FY24 President’s budget

request

19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Additional References  Greenlees, John and Williams, Elliot (2009). Reconsideration

of weighting and updating procedures in the US CPI. BLS working paper 431.

 Kurtzon, Greg (2018). How much does formula vs. chaining matter for a cost-of-living index? The CPI-U vs. the C-CPI-U. BLS working paper 498.

 Klick, Josh (2021). Measuring price change during economic downturns. Beyond the Numbers Vol. 10 No. 13.

 Matsumoto, Brett (2022). The impact of changing consumer expenditure patterns at the onset of the COVID-19 pandemic on measures of consumer inflation. Monthly Labor Review.

Contact Information

20 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Anya Stockburger Chief, Branch of Revision Methodology

Division of Consumer Price Indexes www.bls.gov/cpi

[email protected]

  • Impact of Weight Timeliness on the US CPI
  • BLS Consumer Price Indexes
  • Motivation: Weighting Improvements
  • Outline
  • Weight changes over time�Cumulative percentage change in annual spending shares
  • Airline Fares�Monthly Expenditure Weights
  • Impact of chain drift - Tornqvist
  • Impact of chain drift – monthly chained Laspyeres
  • Historical Impact of Timely Weights�
  • Recent Impact of Timely Weights
  • Items contributing to negative substitution bias
  • Negative substitution bias: price and weight change
  • Items contributing to positive substitution bias
  • Positive substitution bias: price and weight change
  • Improvements to Weight Timeliness
  • Implement annual weight update
  • Weight changes in 2023
  • What’s next – improve timeliness of the C-CPI?
  • Additional References
  • Contact Information

Expanding the family of US Consumer Price Indexes

Languages and translations
English

1 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Expanding the family of US Consumer Price Indexes

Anya Stockburger, Bill Johnson, Joshua Klick, Paul Liegey, Robert Martin,

Bureau of Labor Statistics

Meeting of the Group of Experts on CPIs

June 8, 2023

2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

CPI Family of Indexes

CPI-U Chained CPI-U CPI-W

R-CPI-E

Chained R-CPI-Income

Household Cost Index

R-CPI-IncomeResearch Indexes

Production Indexes

3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Outline

Motivation

Income-based indexes

Household Cost Indexes

Next steps

4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Motivation – Increased need for data granularity

 Committee on National Statistics recommendation  Federal Reserve Bank interest  Office of Management and Budget, Bureau of

Economic Analysis, and other government interest  General user interest (major media)  Publications: Initial working paper, Spotlight on

Statistics

5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

CPI by Income Methodology

$12,000

$118,000

0

20,000

40,000

60,000

80,000

100,000

120,000

140,000

Q1 Q5

Median Equivalized Income (Interview Survey - 2021)

Expenditure weights Group CE respondents into weighted ranking of equivalized income quintiles

Prices/rents All lower-level data the same (prices, outlets, rents)

Index aggregation Lowe, Tornqvist aggregation from lowest-level basic indexes

6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income

0% 5% 10% 15% 20% 25% 30%

Rent

Food at home

Motor fuel

Owner's equivalent rent

Vehicles and maintenance

Food away from home

Recreation

Q1 U Q5

7 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Annualized Inflation Gap Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 -

December 2022

2.60

2.54

2.47

2.41

2.33

2.43

2.1

2.2

2.3

2.4

2.5

2.6

2.7

Q1 Q2 Q3 Q4 Q5

Income Quintiles Urban

8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Inflation Gap Variation Lowest income quintile – Highest income quintile

Annual 12-month percent change December 2006 – December 2022

-1.0%

-0.5%

0.0%

0.5%

1.0%

1.5%

Equivalized income Unadjusted income

9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Items contributing to inflation gap (2022) -5 0 5 10 15

Rent primary residence

Gasoline (all types)

Electricity

Utility (piped) gas service

Cigarettes

Commercial Health Insurance

Owners' rent primary residence

Lodging away from home

Airline fare

New vehicles

10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Limitations and Future Improvements

 Lower-level price heterogeneity High levels of aggregation under-estimate inflation gap

(Jaravel 2019) Re-weighting housing prices shows little impact (Larsen

and Molloy 2021)

 BLS future research Further investigate housing adjustments Re-weighting alternative data (gasoline, new vehicles) Interested in a scanner data program (CNSTAT

recommendation), but funding…

11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Household Cost Index

 Inspired by Office for National Statistics and Statistics New Zealand

 Definition: Measure the change in cash outflows required, on average, for households to access the goods and services they consume

 Methodology:  Household-weighted (democratic) aggregation,  Payments-approach to owner-occupied housing  Urban population

12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Household-weighted (Democratic) Aggregation

 Create household-level expenditure shares Consumer Expenditure Surveys (Diary and Interview)

sample different households Eligible expenditures from the Diary survey imputed to the

Interview sample using a matching procedure based on Hobijn, et. al. (2009)

 Aggregation Aggregate across items/areas first for each household

using Lowe formula with lagged expenditure weights Average equally across households

13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Payments Approach – Mortgage Interest Payment

 Weights Consumer Expenditure Survey

 Prices Mortgage interest payment index =

Debt index * Interest rate index Data sources: • Federal Housing Finance Agency’s All Transactions House Price

Index • Freddie Mac Primary Mortgage Market Survey

14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Payments Approach – Property Tax Payments

 Weights Consumer Expenditure Survey

 Prices Property Tax Payment Index =

Total property tax payments * Constant quality Total housing stock value home price index

Data source: CE

15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

HCI – Relative Importance December 2020

Major Group CPI-U HCI-U Food and Beverages 15.2 20.1 Housing 42.4 34.3 Apparel 2.7 3.1 Transportation 15.2 14.3 Medical 8.9 11.1 Recreation 5.8 6.6 Education and Comm. 6.8 6.7 Other 3.2 3.7

16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

HCI – Index Results Average 12-month % change CPI-U 1.86% HCI (Payments Approach + Household-weighted Aggregation)

1.51%

HCI-U (Payments Approach Only) 1.46%

17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Explaining the HCI results

18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Limitations - HCI

 Household-weighted aggregation Infrequent purchases (challenge especially with

Tornqvist) Include in HCI given small impact?

 Payments approach Investigate a microdata approach for mortgage

interest index Investigate including mortgage principal

19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

What’s next?

Improve methodology

Income-group specific lower- level indexes

Next step for HCI research?

Stakeholder outreach

Group of Experts BLS advisory committees Federal Committee on Statistical Methodology

Publish regular updates

R-CPI-Income C-CPI-Income

HCI?

Contact Information

20 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Anya Stockburger Chief, Branch of Revision Methodology

Division of Consumer Price Indexes www.bls.gov/cpi

[email protected]

  • Expanding the family of US Consumer Price Indexes
  • CPI Family of Indexes
  • Outline
  • Motivation – Increased need for data granularity
  • CPI by Income Methodology
  • Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income
  • Annualized Inflation Gap�Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 - December 2022
  • Inflation Gap Variation�Lowest income quintile – Highest income quintile�Annual 12-month percent change�December 2006 – December 2022
  • Items contributing to inflation gap (2022)�
  • Limitations and Future Improvements
  • Household Cost Index
  • Household-weighted (Democratic) Aggregation
  • Payments Approach – Mortgage Interest Payment
  • Payments Approach – Property Tax Payments
  • HCI – Relative Importance�December 2020
  • HCI – Index Results
  • Explaining the HCI results
  • Limitations - HCI
  • What’s next?
  • Contact Information

United States Inflation Experience across the Income Distribution

The Bureau of Labor Statistics (BLS) produces the Consumer Price Index (CPI) as a measure of price change faced by consumers. The CPI for All Urban Consumers (CPI-U) targets the inflation experience of nearly all consumers in the United States which may not reflect the inflation experience of an individual household or group of households. Increasingly there is user demand for CPIs across the income distribution. This paper builds on the authors’ prior research by modifying the cohort definition and extending the period of analysis.

Languages and translations
English

United States Inflation Experience across the Income Distribution Joshua Klick, Anya Stockburger

WORKING DRAFT

Prepared for the Group of Experts on Consumer Price Indices UNECE

Geneva, June 2023

Abstract The Bureau of Labor Statistics (BLS) produces the Consumer Price Index (CPI) as a measure of price change faced by consumers. The CPI for All Urban Consumers (CPI-U) targets the inflation experience of nearly all consumers in the United States which may not reflect the inflation experience of an individual household or group of households. Increasingly there is user demand for CPIs across the income distribution. This paper builds on the authors’ prior research by modifying the cohort definition and extending the period of analysis. From 2006-2022, lower income households generally faced larger inflation rates than higher income households. The short-term gap between lower and higher income household’s inflation rates changes when the cohort definition accounts for varying family sizes.

JEL Codes: C43, E31

Executive Summary The Bureau of Labor Statistics (BLS) produces the Consumer Price Index (CPI) as a measure of price change faced by consumers. The CPI for All Urban Consumers (CPI-U) targets the inflation experience of urban consumers which covers over 90 percent of the total population of the United States. This broad- based coverage may not reflect the inflation experience of an individual household or groups households1. Increasingly there is user demand for CPIs across the income distribution. These indexes paint a full picture of inflation for users interested in the state of the economy. Other users demand an index limited to lower income households for escalation purposes2.

This paper builds on the authors’ prior research by modifying the cohort definition3. The prior analysis defined income quartile (four) cohorts based on unadjusted total household before tax income. This analysis defines income quintiles (five) cohorts based on equivalized (household size-adjusted) total family income. Adjusting household size is standard practice in income inequality literature. By adjusting household income to a single-member equivalent, income levels are more comparable across households. For example, an $80,000 household income does not convey the same level of resources available to a 4-person family as it does a single-person household.

While this adjustment did not impact the long-term results, there are several notable short-term differences. Lowest income households almost always faced larger inflation rates than highest income households during the study period, however there are several spans when the opposite occurred. This anomalous result occurred more frequently and during different months for the adjusted indexes than the unadjusted indexes.

This paper extends the period of analysis to December 2022 for the CPI indexes. The 12-month change in the CPI-U for All Items was 1.9 percent in December 2018, the last month included in the prior analysis. Since mid-2021, inflation accelerated to a peak of 9.1 percent in June 2022. The average annual inflation rate from December 2005 to December 2022 was largest for the lowest income quintile and smallest for the highest income quintile. The gap in inflation rates between lowest and highest income households was 0.27 percentage points per year.

Background and Issues The BLS publishes consumer price indexes for subgroups of the target urban population. The CPI for Wage Earners and Clerical Workers (CPI-W) became a subgroup index in 1978 when the BLS adopted an urban population target and began calculating the CPI-U. In 1988, the BLS introduced a research series measuring price change for older Americans, the CPI-E. Research conducted by BLS on inflation rates for

1 While the population target is the urban population, the measurement unit is households rather than persons. 2 The BLS is researching a new index product for escalation purposes. Research in this paper describes a low- income subgroup definition that could be applied to the new index product. 3 Klick, Stockburger, “Experimental CPI for Higher and Lower Income Households,” March 2021, BLS working paper 537

low-income consumers began in the 1990s4. Prior research is briefly summarized in an earlier working paper published in March 20215.

Interest in income-based inflation measures continues. In June 2021, an Interagency Technical Working Group convened by the Office of Management and Budget issued a report recommending the BLS produce a new consumer price index to be used in the calculation of the U.S. Official Poverty Measure. The group recommended a low-income Chained CPI. In April 2022, The National Academy of Sciences issued a report recommending development of price indexes by income group6.

In the author’s March 2021 working paper, we outline numerous caveats and limitations with the current methodology to calculate subgroup indexes. Other researchers have shown using the same underlying microdata to calculate indexes for both target population and subgroups underestimates the gap in inflation rates between highest and lowest income households7. The methodological improvements presented in this paper do not account for consumer heterogeneity at lower levels of index aggregation, and so the same caveats and limitations from the March 2021 working paper apply.

Methodology Income Cohort Definition The March 2021 working paper describes the index methodology and data sources in detail. We use data collected in the Consumer Expenditure Surveys (CE) from 2004 to 20218. We estimate expenditures on the full market basket of items using integrated data from the Diary and Interview surveys. We use elementary price indexes, for example Bananas in Boston, that form the foundation of Consumer Price Index aggregation from 2006 to 2022. We derive implicit quantities for the modified Laspeyres formula indexes from biennial expenditures lagged two to three years. For example, we use expenditures from 2019 and 2020 to weight modified Laspeyres indexes in 2022. We refer the reader to the March 2021 working paper for additional information on index methods and formulas.

This paper improves the income group cohort definition. First, we employ a household weighted ranking to distribute the sample weights relatively equally across quintiles. Previously, we used an unweighted income ranking that did not reflect an equal distribution of household weights across quartiles. The BLS calibrates CE sample weights to the Current Population Survey to control for several demographic characteristics such as age, race, owner or renter, geography, and Hispanic ethnicity.9 Weighting

4 Thesia Garner, David Johnson, and Mary Kokoski 1996, “An experimental Consumer Price Index for the poor” https://www.bls.gov/opub/mlr/1996/09/art5full.pdf 5 Klick, Stockburger, “Experimental CPI for Higher and Lower Income Households,” March 2021, BLS working paper 537 6 National Academies of Sciences, Engineering, and Medicine. 2022. Modernizing the Consumer Price Index for the 21st Century. Washington, DC: The National Academies Press. https://doi.org/10.17226/26485. 7 Many examples include Broda and Romalis (2009), Broda, Leibtag, and Weinstein (2009), Agente and Lee (2017), Jaravel (2017), and Kaplan and Schulhofer-Wohl (2017). 8 BLS began imputing missing values of income in 2004, and income data from 2003 are not comparable. To initialize this research, we used a single year of expenditures in 2004 to calculate spending shares used in index calculation for 2006 and 2007. The remaining spending shares use two years of expenditures, consistent with CPI-U methodology. 9 See CE Handbook of Methods, Calculation Methodology https://www.bls.gov/opub/hom/cex/calculation.htm#calculation-methodology

methods also control for subsampling, and a non-interview adjustment that controls for geography, household size, number of contacts, and average gross income for a household’s zip code. The use of sample weights reflects known urban population totals, particularly relevant when comparing owners and renters, so that the weights are equivalent across quintiles, and are comparable to CE’s weighted ranking of the total population.10,11 An inherent benefit to this approach is that weights are relatively evenly distributed across defined quantiles. CE processes this income ranking variable for the total population. Therefore, urban and rural population differences across the CE quintiles (the rural proportion is higher for lower quintiles) provide motivation for CPI to calculate a weighted income distribution so that weights are distributed relatively equally for the urban population. This improvement did not substantively change the results at the All-Items US City Average level.

Second, we divide the CE respondents into quintiles of equivalized income, rather than quartiles as in the prior analysis. We determined that the proportion of quintile households is comparable to the wage- earner population (W) as summarized in Figure 1. Additionally, coverage of item-area weight cells for consumer price index estimation was sufficient to calculate five income groups rather than four as described in the results section. More detailed income groups provide greater granularity to data users and facilitate comparisons of lowest, median, and highest quintiles.

Figure 1. Household respondent summary from 2021 collection quarter 4

Count Proportion relative to U (Percentage) U W E Q1 Q2 Q3 Q4 Q5

Interview 4,515 21.4 36.5 20.2 20.2 19.8 19.9 19.9 Diary 2,694 22.8 38.5 20.3 20.9 19.5 18.8 20.5

Third, we equivalize household income to account for differing family sizes. There is a long literature using equivalence scales to adjust household income to account for different characteristics across households12.

Household size and composition varies across respondents. Equivalized income defined as income divided by the square root of family size, adjusts income to make this comparable across households, as a better measure of household economies of scale.13 The first and fourth quintile maximum income cut points from the Diary and Interview are greater than the corresponding maximum equivalized income

10 See CE Table 1101. Quintiles of income before taxes https://www.bls.gov/cex/tables/calendar-year/mean-item- share-average-standard-error/cu-income-quintiles-before-taxes-2021.pdf 11 For CE income distribution methodology see https://www.bls.gov/cex/csxguide.pdf. CE creates a before tax income ranking variable as a distribution over the interval (0,1] so that weights are relatively equally distributed across defined quantiles. The income ranking variable is created by sorting by income and a random number, used to break ties for CUs reporting the same income, in ascending order for each collection quarter and survey source. The total sum of FINLWT serves as the denominator, and cumulative sum of FINLWT21 serves as the numerator to create the distribution that ranges from greater than 0 to less than 1, to 7 decimal places of precision. 12 Angela Daley, Thesia Garner, Shelley Phipps, Eva Sierminska, “Differences Across Place and Time in Household Expenditure Patterns: Implications for the Estimation of Equivalence Scales,” BLS Working Paper, 2020 https://www.bls.gov/osmr/research-papers/2020/pdf/ec200010.pdf 13 https://www.brookings.edu/blog/up-front/2019/04/17/whats-in-an-equivalence-scale

summarized in Figure 2. The median income and equivalized income for the third quintile is equivalent to the urban population. The median equivalized income is less steep from the first to fifth quintile than median income reflecting the improved comparability across households as displayed in Figure 3.

Figure 2: Household maximum income and equivalized income summary from 2021 collection quarter 4 (in terms of Thousands)

0 20 40 60 80 100 120 140

Q1

Q2

Q3

Q4

Diary Income Interview Income Diary Equivalized Income Interview Equivalized Income

Figure 3. Household median income and equivalized income summary from 2021 collection quarter 4 (in terms of Thousands)

The household weighted ranking described above is used to evaluate equivalized income quintiles (E1:E5) and non-equivalized income quintiles (N1:N5). The counts of households for CPI weighted income quintiles can be compared to those same households for other income definitions to highlight the degree of similarity between subpopulation definitions as summarized in Figure 4. Overlap is the proportion of same households relative to the respective CPI weighted non-equivalized income ranking quintiles (N1:N5). The All group represents sum of the 5 quintiles. When the urban portion of CE total population income weighted ranking is compared to non-equivalized income rankings, there is a high degree of overlap ranging from 94% to 100%. When the equivalized income groups are compared to the non-equivalized income rankings the degree of overlap ranges from 53% to 83% highlighting definitional differences of household income, and potential differences for weighting these respective indexes.

0 20 40 60 80 100 120 140 160 180 200

U

W

E

Q1

Q2

Q3

Q4

Q5

Diary Income Interview Income Diary Equivalized Income Interview Equivalized Income

Figure 4: Proportion of counts of same households relative to non-equivalized income across quintile definitions (Percentage)

D I All Q1 Q2 Q3 Q4 Q5 All Q1 Q2 Q3 Q4 Q5 CE 97 98 97 95 97 100 96 97 95 94 96 100 CPI-(E1:E5) 67 80 62 54 55 83 67 82 62 53 55 82

An additional improvement is the smoothing of expenditure cells comparable to production weight processing. The CE collected survey data are subject to sampling error across geography and unreliable for index estimation, particularly relevant for subpopulation quintiles. The CPI smooths basic item area cell weights to reduce variance across geography. Local area annual weights are composite estimated with more stable broader level of geography (self-representing-regions and non-self-representing- regions). The composite estimate weight is between 0 and 1 and is based on minimizing the mean squared error between the local area versus broader geography.14 The impact of smoothing is described below as expenditure weight cell coverage as the proportion of missing basic item area cells.

With an improved definition of income incorporating population weights and equivalization, we considered how to divide households into quintiles. The BLS produces consumer expenditure estimates by income quintile. Those income quintiles are defined by cut points that are rarely adjusted15. To produce a time-series consistent definition of income groups for index estimation of 243 items by 32 geographic area cells, we chose to define income quintiles that shift to include a fifth of CE households in each group rather than defining cut-points that would need to be revised over time.

We also considered defining income quintile groupings by geography such that CE respondents are classified into income quintiles within a city (Primary Sampling Unit, PSU) selected for inclusion in the CPI. We ultimately concluded a nationally defined income distribution was preferred to represent all households as a single distribution for a national level index, that is methodologically consistent with Bureau of Economic Analysis (BEA) Personal Consumption Expenditures (PCE) and BLS PCE income quintile products.16 Area stratification of the income distribution has a minimal impact to national level indexes and changes the overarching definition/purpose of the product. A limitation of this method is that subnational indexes are not feasible because the weights are not equivalent across quintiles. We will continue research geographic considerations for weighting CPIs by income.

14 For details see https://www.bls.gov/osmr/research-papers/1999/pdf/st990050.pdf 15 For example, the nominal income bounds of the lowest income quintile were less than $3,000 from 1960-1983, and less than $5,000 to present. Historically, the income definitions are subject to change based in part on inflation particularly relevant beginning 2021. These weights are not equivalent across groups limiting distributional comparisons. Also, households are not equivalized based on the number of people within a consumer unit resulting in dissimilar measures of income groups. Income as a standalone variable is not sufficient for weighting subpopulation indexes. 16 See BEA Measuring Inequality in the National Accounts https://www.bea.gov/system/files/papers/measuring- inequality-in-the-national-accounts_0.pdf, and BLS Distribution of U.S. Personal Consumption Expenditures Using Consumer Expenditure Surveys Data: Methods and Supplementary Results https://www.bls.gov/cex/pce-ce- distributions.htm

Income Cohort Demographic Characteristics In addition to income and expenditures, the CE Surveys collect a variety of demographic information about survey respondents. In this section, we present the demographic differences between income quintiles. By construction, the average household size and number of children is more consistent across income quintiles after equivalizing income. Other demographic differences give further context for the expenditure share differences presented in the next section.

Using household size to equivalize income results in more consistent household sizes and number of children across income quintiles (figure 5). Without accounting for household size, more single person households and households without children are included in the lowest income quintile. Conversely, fewer single person households and households without children are included in the highest income quintile. When adjusting for household size, more families with children are included in the lowest income quintile and less families with children are included in the highest income quintile. These changes are consistent across income quintiles, and we include only the first- and fifth-income quintiles to simplify the presentation of results.

Figure 5: Average Family Size and Number of Children, Urban and by Income Quintile, 2020

Urban Q1 Q5 Unadjusted Equivalized Unadjusted Equivalized Family Size 1 person 30% 63% 45% 7% 20% 2 people 33% 22% 25% 34% 40% 3 or more people 37% 15% 30% 59% 40% Number of Children None 62% 80% 66% 43% 61% 1-2 children 30% 16% 24% 46% 34% 3 or more children 8% 4% 10% 11% 5%

This data confirms the importance of equivalizing income to adjust for varying household sizes. The expenditure pattern differences between unadjusted income quintiles are reflective more of household composition differences. By standardizing household sizes, the expenditure pattern differences presented in the next section are more reflective of income differences.

Households grouped by income quintile have different rates of home ownership, working status, and educational attainment (figure 6). Households earning the lowest quintile of income are more likely to rent their home and not work for pay than higher income households. Of the households with retired members, 65% report incomes that fall in the first and second quintile. The large number of retired individuals in the lower income quintiles explains why more than half of households earning income in the first and second quintiles own their home with no mortgage. Higher income households are more likely to own their home with an outstanding mortgage. Higher income households are also more likely to hold advanced degrees.

Figure 6: Housing tenure, working status, and educational attainment by population

Urban Q1 Q5 Housing Tenure

Owner with a mortgage 41% 18% 63% Owner with no mortgage 25% 29% 20% Renter 34% 53% 17%

Working status

Not working (due to disability or taking care of family)

9% 23% 3%

Not working (retired) 21% 34% 6% Working 70% 43% 91%

Educational Attainment

Less than high school degree 8% 18% 1% High school degree or some college

41% 55% 17%

Advanced degree 51% 28% 81%

Data Inputs CPI Basic Item-Area Expenditure Weight coverage The above household coverage analysis indicates that each of the quintiles has approximately the same number of households as the wage earner subpopulation. The expenditure weight coverage measures the proportion of missing item area cells used to weight basic indexes for 2nd stage estimation. When price change occurs, weighting basic indexes accurately relative to the All-Items US level is imperative to construct aggregate indexes. Coverage is measured as the proportion of item-area cells less than $1 as missing. There are 32 areas cells multiplied by each item series. There are 243 basic item area indexes that can be divided into priced item series and non-sampled item series. The non-sampled series are subject to infrequent number of expenditures reported and the price movement is based on aggregate priced series. Coverage of overall results are distorted when combining the non-sampled items and priced items. An additional adjustment occurs for health insurance which are excluded from this data quality metric. We display results in figure 7.

The urban population collected proportion of missing overall is 3.6% versus priced items is 0.5%; smoothing reduces the proportion of priced items missing to 0.0%. The wage earner collected proportion missing of priced items is 7.1%, and smoothing reduces this proportion to 0.6%. The lowest income quintile collected proportion missing of priced items is 13.7%, and smoothing reduces this proportion to 0.0%. The highest income quintile collected proportion of missing priced items is 4.6%, and smoothing reduces this proportion to 0.0%. Smoothing therefore has a larger impact on the lowest income quintile and improves weighting coverage for index estimation.

Figure 7: 2021 Reference year expenditure weight basic cell coverage as proportion missing (percentage)

Collected Smoothed # Items U W Q1 Q5 # Items U W Q1 Q5 Overall 209 3.6 13.1 19.9 10.0 225 0.4 1.4 0.4 0.4 Non-sampled 26 25.2 55.4 63.3 48.3 26 3.8 7.7 3.8 3.8 Priced 183 0.5 7.1 13.7 4.6 199 0.0 0.6 0.0 0.0

Income Quintile Spending Weights We produce price indexes, which use spending weights to calculate an average price change. While the spending weights for the urban population reflect average spending, they may not reflect spending of any individual household or groups of households. Spending weights vary across the income distribution. Overall, households earning the lowest quintile of income devote a larger share of their spending on essential goods and services. Households earning the highest quintile of income allocate a larger share of their spending on recreational and leisure goods and services. Figure 8 shows a snapshot of these spending differences in 2019-2020 for select categories. We present more categories in the appendix. We used spending weights constructed from these data to calculate indexes in 2022.

Figure 8: Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income

These spending weights reflect the differences between quintiles of equivalized income. There are a few notable shifts in these spending weights from unadjusted income we used in an earlier analysis. The

0.0

5.0

10.0

15.0

20.0

25.0

30.0

Q1 Q2 Q3 Q4 Q5

share of spending on owner’s equivalent rent by households classified in the first quintile of equivalized income is 2 percentage points lower than households classified in the first quintile of unadjusted income. The households shifting out of the first income quintile after adjusting for household size are more likely to be retired and own their own homes without a mortgage. The households shifting into the first income quintile after adjusting for household size are more likely to rent their homes or own their homes with a mortgage. Although homeowners without mortgages pay less out of pocket to live in their home than other households, the owners’ equivalent rent approach to owned housing imputes an implicit rent. For retirees, who are more likely to own their home without a mortgage than other households, owner’s equivalent rent constitutes a large share of their spending weights. This is evidenced by the spending shares for the CPI-E population, nearly 60 percent of whom are retired. The net effect of households shifting into and out of the first income quintile is a reduction in spending on shelter services.

We also observe a notable shift in transportation spending weights from unadjusted income we used in an earlier analysis. Spending on all vehicle-related categories (new and used vehicles, motor fuels, and vehicle insurance) is 1.1 percentage points higher for the households categorized in the first quintile of equivalized income relative to their unadjusted counterparts. Again, retirees are likely the cause of this shift. Households included in the CPI-W population typically spend a larger share of their budget on vehicle-related expenses than urban households. The wage-earner and clerical worker population includes very few retirees (4 percent). With more households with members who are working included in the lowest quintile of equivalized income, they dedicate more of their budget to vehicle-related expenses.

Price Analysis As noted in the methodology section, the BLS calculates price indexes for different populations by applying varying spending weights to the same set of underlying basic price indexes. That is, when averaging price changes across all items, the price change for rent has a greater impact on overall price change for lowest-income versus highest-income households. If prices changed at the same rate for all item categories, there would be no difference in inflation rates by population. In this section we present price changes by item category which will explain overall index differences.

In figure 9, we show the price change relative to the spending share differences between first and fifth quintile income groups for select components of the CPI. The y-axis shows price change from January 2020 to December 2021 and shows new and used motor vehicles and motor fuel had the largest price increases during that period (nearly 27%). The x-axis shows the ratio of spending shares (first quintile divided by fifth quintile) using 2019-2020 spending shares. Compared to fifth income quintiles households, first income quintile households spent four times their budget share on rent and a quarter their budget share on lodging away from home.

Figure 9: Price change and spending share scatterplot, first and fifth quintile (price change January 2020-December 2021, spending shares 2019-2020 ratio Q1/Q5)

Results In this section we present indexes by income quintile. Overall, the trends we observe in previous analysis continued in 2019-2022. Lowest-income households tend to experience larger inflation rates than highest-income households. In this section we present index results and further analyze periods that defy the overall trend.

Overall Index Results Lowest-income households tend to experience larger inflation rates than highest-income households. We show the annualized inflation rates over the period in Figure 10. Lowest-income households faced inflation rates that were on average 0.27 percentage points larger than highest-income households every year over this period. Cumulatively, the inflation gap is 5.18% over 17 years.

Rent of primary residence(HA)

Tobacco and smoking products(GA)

Pork(FD)

Poultry(FF) Energy services(HF)

Beef and veal(FC)

Telephone services(ED)

Motor fuel(TB) New and used motor vehicles(TA)

Lodging away from home(HB)

-10%

-5%

0%

5%

10%

15%

20%

25%

30% 0 1 2 3 4 5

Price Change

Ratio of Spending Shares (Q1/Q5)

Figure 10: Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 - December 2022

Variation of income inflation gap over time On average from 2006 through 2022, lowest-income households faced larger inflation rates than highest-income households. At its peak in August 2008, the gap in inflation rates was 1.37 percentage points. At its trough in February 2016, the gap in inflation was reversed with highest-income households facing inflation rates 0.31 percentage points larger than lowest-income households. The long-term gap in the average inflation rate is the same whether classifying households using equivalized income or unadjusted income. In the short-term, the magnitude and direction of the inflation gap differs depending on the income definition we use to classify households. We show the difference in annual 12- month percent change between lowest- and highest-income households in Figure 12.

2.6

2.54

2.47

2.41

2.33

2.43

2.15

2.2

2.25

2.3

2.35

2.4

2.45

2.5

2.55

2.6

2.65

Q1 Q2 Q3 Q4 Q5

Income Quintiles Urban

Figure 12: Annual 12-month percent change in CPI, difference between lowest and highest income quintile, December 2006 – December 2022

Variation in inflation gap by item category Over the period studied, lower income households faced larger inflation rates than higher income households aggregated across all items in the market basket. Which item categories drive this difference? In this section, we present inflation rates by eight broad classifications called major groups. In the next section we decompose the contribution of each of these major groups to the overall inflation gap between lowest and highest income households.

Figure 13 displays inflation rates (annualized 12-month change) for each major group and the inflation gap between lowest and highest income households. For Apparel and Medical Care, highest income households faced larger inflation than lowest income households. The inflation gap was the smallest for Education and Communication and Food and Beverages. Lowest income households faced larger inflation rates than highest income households for Other Goods and Services, Housing, and Transportation. How much each major group contributed to the overall inflation gap depends on the spending shares. We present contribution information in the next section.

-1.00%

-0.50%

0.00%

0.50%

1.00%

1.50% 20

06 12

20 07

06 20

07 12

20 08

06 20

08 12

20 09

06 20

09 12

20 10

06 20

10 12

20 11

06 20

11 12

20 12

06 20

12 12

20 13

06 20

13 12

20 14

06 20

14 12

20 15

06 20

15 12

20 16

06 20

16 12

20 17

06 20

17 12

20 18

06 20

18 12

20 19

06 20

19 12

20 20

06 20

20 12

20 21

06 20

21 12

20 22

06 20

22 12

Equivalized income Unadjusted income

Figure 13: Inflation gap by CPI major group; Annualized inflation rate, Lowe Formula, December 2005 - December 2022

Item Category - Major Group

Urban Lowest Income Quintile (Q1)

Highest Income Quintile (Q5)

Inflation Gap (Q1-Q5)

Apparel 0.35 0.25 0.42 -0.17 Education and Communication

1.34 1.63 1.67 -0.04

Food and Beverages 2.89 2.90 2.89 0.01 Other Goods and Services

2.90 3.38 2.43 0.95

Housing 2.66 2.85 2.53 0.32 Medical Care 3.07 2.89 3.16 -0.27 Recreation 1.13 1.18 1.14 0.04 Transportation 2.33 2.52 2.23 0.29

To interpret these results, recall our methodology adjusts spending shares on item categories such as women’s dresses, men’s pants, and children’s clothing to reflect the shopping behavior of households in each income quintile. Price change at the major group level reflects different averages of price change across those item categories. Highest income households faced larger apparel inflation than lowest income households because they spent a larger share on item categories whose prices were rising faster than average (or smaller shares on item categories whose prices were falling or rising slower than average). These results do not indicate any differences in shopping behaviors below the item strata level.

The BLS calculates the CPI including all goods and services purchased by consumers. For some uses of the CPI, users prefer calculating a CPI over a smaller set of goods and services. The BLS calculates a CPI Less Food and Energy index which some users refer to as “core” inflation since it excludes some volatile item categories. Another subset calculation is the CPI for Food, Clothing, Shelter, and Utilities (FCSUti). The BLS uses this index to calculate a research poverty measure and can be considered an “essentials” index17. We show inflation gap results for these indexes in Figure 14.

17 The Supplemental Poverty Measure website explains the SPM methodology and use of the FCSUti CPI. https://www.bls.gov/pir/spmhome.htm

Figure 14: Inflation gap for “core” and “essentials” items; Annualized inflation rate, Lowe Formula, December 2005 - December 2022

Special Aggregation Index

Urban Lowest Income Quintile (Q1)

Highest Income Quintile (Q5)

Inflation Gap (Q1- Q5)

All Items 2.43 2.60 2.33 0.27 “Core” All Items Less Food and Energy (X)

2.35 2.57 2.23 0.34

“Essentials” Food, Shelter, Clothing, and Utilities (FCSUti)

2.63 2.71 2.60 0.11

Excluding food and energy, the inflation gap is wider between lowest and highest income households than when those categories are included. The inflation gap is less for the “essentials” index.

Which items explain the inflation gap? In the previous section, we showed the variability in the inflation gap below the All-Items level. To understand how different components of the market basket contribute to the All-Items inflation gap, we need a measure that incorporates relative weights across item categories. These measures are called contributions and effects.18 If the price change for an item category was unchanged instead of the value measured, then the effect is the resulting change in the all-items price change. The contribution scales items effects relative to the all-items price change.

These contributions and effects can be extended to explain inflation rates for income groups. In figure 15, we display the top and bottom three contributing items to the urban, first income quintile, and fifth income quintile populations. For example, inflation in owner’s equivalent rent was the largest contributor of any single item stratum for the urban, lowest income quintile, and highest income quintile populations inflation rate in 2022. If owner’s equivalent rent had not changed in 2022, the all- items price change would have been 0.5 percentage points lower for the urban population, 1.2 percentage points smaller for the lowest income quintile, and 1.4 percentage points smaller for the highest income quintile.

Figure 15: 2022 Year over year item ranking of contribution and effect (percentage) for U, Q1, and Q5

U: All Items 8.0 Q1: All Items 8.3 Q5: All Items 7.7 Rank Item Effect Contribution Item Effect Contribution Item Effect Contribution 1 HC01 1.3 16.6 HC01 1.2 15.2 HC01 1.4 18 2 TB01 1.1 13.9 TB01 1.2 14.5 TB01 0.9 11 3 TA02 0.5 6.0 HA01 0.9 10.8 TA01 0.5 7.0 209 ED03 -0.0 -0.1 ED03 -0.0 -0.1 ED03 -0.0 -0.1 210 RA01 -0.0 -0.2 RA01 -0.0 -0.2 RA01 -0.0 -0.2 211 EE04 -0.1 -0.7 EE04 -0.1 -0.7 EE04 -0.0 -0.6

18 See Footnote 1 https://www.bls.gov/news.release/cpi.t07.htm

Gasoline is the second largest contributor to inflation for all three populations in 2022. The third ranked items differ across populations: used vehicles for the urban population, rent for lowest income households, and new vehicles for highest income households.19 The bottom three ranked items display the small negative effects and contributions.

Since owner’s equivalent rent is an important contributor to all populations, what explains the inflation gap between lowest and highest income quintiles? To home in on this question, we redefine the contribution and effect measure to identify the item categories that most contribute to widening the inflation gap (positive effect) and narrowing the inflation gap (negative effect). Formulas used for this analysis are described in Appendix 2. The 2022 year over year change inflation gap is 0.5%, with a positive effect of 2.1% and a negative effect of -1.6%.

We show in figure 16 the item categories contributing most to the positive effect (greater than 0) and the negative effect (less than 0). Rent, gasoline, and electricity had the largest contributions to the positive effect. The lowest income quintile of households spends more of their budget share on these categories than the highest income quintile of households. New vehicles and airline fares had the largest contributions to the negative effect. The highest income quintile of households spends more of their budget share on these categories than the lowest income quintile of households.

19 See Appendix 7 Consumer Price Index items by publication level for item definitions https://www.bls.gov/cpi/additional-resources/index-publication-level.htm

Figure 16. 2022 Year over year inflation gap (Q1-Q5) CPI contributions to All-Items (percentage)

Summary/Conclusion/Future analysis/Next Steps BLS produces different measures of inflation used to assess the health of the American economy. With this research, we add additional measures of consumer inflation across the distribution of household income. This paper builds on the authors’ prior research by modifying the cohort definition and extending the period of analysis to 2022. As with earlier periods studied, lower income households generally faced larger inflation rates than higher income households through 2022. The long-term inflation gap between lowest and highest income households is unaffected by the cohort definition changes to better account for varying household sizes (however short-term differences in the inflation gap emerge).

The inflation gap is the result of differences in spending shares across households. Prices for rent, gasoline, electricity, new vehicles, and owner’s equivalent rent rose faster than average in 2022. The impact of rent, gasoline, and electricity spending share differences generated larger inflation measures for lowest income households. The larger spending shares highest income households dedicated to new vehicles and owner’s equivalent rent had a moderating impact on the inflation gap. Modifying the set of item categories over which inflation measures are calculated changes the spending share differences across households, leading to differences in inflation gap measures.

-5 0 5 10 15

Rent primary residence(HA01) Gasoline (all types)(TB01)

Electricity(HF01) Utility (piped) gas service(HF02)

Cigarettes(GA01) Motor vehicle insurance(TE01)

Limited service meals/snacks(FV02) Juices and drinks(FN03)

Cable & satellite tv/radio(RA02) Chicken(FF01)

Club membership (RB02) Child care & nursery school(EB03)

Owners' rent secondary res.(HC09) Leased cars and trucks(TA03)

Full service meals and snacks(FV01) Commercial Health Insurance(ME01)

Owners' rent primary residence(HC01) Lodging away from home(HB02)

Airline fare(TG01) New vehicles(TA01)

Throughout this paper, we have identified potential areas for future research. Perhaps most importantly, we recognize the importance of capturing price change differences at the lower level by income quintile. As other researchers have demonstrated, there may be considerable heterogeneity in the prices paid and unique items purchased that can have an impact on the overall measure of inflation. Previous research has found little difference in rent inflation by income group. We are interested in exploring this finding further and the impact of rent subsidies that have a larger impact on the lowest quintile of households.

We define cohorts by income quintile but recognize there may be other cohorts better suited for different uses. For example, cohorts defined by expenditure can lead to reclassification of some households into different quintiles (some lowest quintile households would fall into the highest quintile of expenditure, for example). There could also be different geographic stratifications that would be helpful (below the national level). Furthermore, there could be measures of wealth that are more useful for categorizing households.

Finally, this research is limited to the income group cohort definition. BLS is developing another product, Household Cost Indexes, that could be calculated for different subgroups. It would also be useful to develop confidence intervals and standard errors to identify statistically significant differences between inflation measures for different populations.

Appendix 1 Snapshot of spending weights by population, 2019-2020 biennial expenditure weight, equivalized income

Item Category Urban Q1 Q2 Q3 Q4 Q5 Food and beverages 14.2 14.9 14.6 14.2 14.5 13.5 Alcoholic beverages 0.9 0.5 0.6 0.8 1 1.3 Food away from home 5.1 4.5 4.9 5 5.2 5.4 Food at home 8.1 9.9 9.1 8.5 8.3 6.8 Housing 42.9 46.6 44.6 42.1 41.3 42.5 Owner’s equivalent rent 24.7 21.4 23.3 22.9 24.5 27.6 Rent 7.6 14.3 11 9.1 6.5 3.7 Fuels and utilities 4.5 5.9 5.4 4.8 4.4 3.5 Household furnishings and operations 4.8 4.1 4 4.3 4.7 5.6

Lodging away from home 0.9 0.5 0.5 0.6 0.8 1.6 Apparel 2.7 2.5 2.5 2.4 2.7 2.9 Transportation 16.5 13.9 15.3 17.5 17.6 16.5 Motor fuels 3 3.2 3.4 3.5 3.3 2.4 Public transportation 0.9 0.6 0.6 0.7 0.8 1.3 Vehicle purchase and maintenance and repair 9.4 7.1 8 9.8 10.1 10.2

Vehicle insurance 2.5 2.6 2.9 2.9 2.8 2 Medical care 8.8 7.9 10 9.7 9.1 7.9 Health insurance, retained earnings 0.8 0.5 0.7 0.9 0.9 0.8 Professional services 3.8 3.5 4.3 4.1 3.9 3.5 Recreation 5.3 4.3 4.2 4.7 5.5 6.4 Education and communication 6.9 6.8 5.7 6.4 6.7 7.8 Education 2.8 2.4 1.3 1.9 2.4 4.4 Communication 4.1 4.5 4.4 4.5 4.3 3.4 Other goods and services 2.8 3.1 3 2.9 2.7 2.6

Snapshot of spending weights by population, 2019-2020 biennial expenditure weight, unadjusted income

Item Category Urban Q1 Q2 Q3 Q4 Q5 Food and beverages 14.2 14.1 14.2 14.4 14.5 13.8 Alcoholic beverages 0.9 0.6 0.7 0.8 0.9 1.2 Food away from home 5.1 4.4 4.7 5.2 5.1 5.5 Food at home 8.1 9.2 8.8 8.4 8.4 7.2 Housing 42.9 48.4 45.4 43.2 40.8 41.4 Owner’s equivalent rent 24.7 23.4 22.9 23.2 23.8 27.2 Rent 7.6 14.3 12 9.7 6.7 3.2 Fuels and utilities 4.5 5.7 5.4 4.9 4.4 3.6 Household furnishings and operations 4.8 4.1 4.1 4.5 4.7 5.4

Lodging away from home 0.9 0.5 0.5 0.6 0.8 1.5 Apparel 2.7 2.3 2.3 2.6 2.7 3 Transportation 16.5 12.8 15.3 17.1 18.1 16.7 Motor fuels 3 2.8 3.3 3.5 3.4 2.6 Public transportation 0.9 0.7 0.6 0.7 0.8 1.3 Vehicle purchase and maintenance and repair 9.4 6.6 8.1 9.4 10.5 10.2

Vehicle insurance 2.5 2.3 2.8 2.9 2.8 2.1 Medical care 8.8 8.8 9.6 9.3 9.2 8 Health insurance, retained earnings 0.8 0.5 0.7 0.8 0.9 0.8 Professional services 3.8 3.9 4.1 3.9 4 3.5 Recreation 5.3 4 4.6 4.6 5.3 6.4 Education and communication 6.9 6.5 5.6 5.9 6.6 8.1 Education 2.8 2.3 1.2 1.4 2.3 4.7 Communication 4.1 4.2 4.4 4.5 4.4 3.5 Other goods and services 2.8 3 3.1 2.9 2.7 2.6

Appendix 2 Subpopulation difference of effects and contribution formulas

Effects: When pivot month the same across 12-month average (odd index years):

&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d461;&#x1d461;−&#x1d45b;&#x1d45b;→&#x1d461;&#x1d461;;&#x1d44e;&#x1d44e;,&#x1d456;&#x1d456;→&#x1d456;&#x1d456;,&#x1d434;&#x1d434; &#x1d45d;&#x1d45d; = �

(&#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d461;&#x1d461;,&#x1d434;&#x1d434;,&#x1d456;&#x1d456; − &#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d461;&#x1d461;−12,&#x1d434;&#x1d434;,&#x1d456;&#x1d456;) (&#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d461;&#x1d461;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; − &#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d461;&#x1d461;−12,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c;)

∗ (&#x1d43c;&#x1d43c;&#x1d43c;&#x1d43c;���&#x1d461;&#x1d461;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; − &#x1d43c;&#x1d43c;&#x1d43c;&#x1d43c;���,&#x1d461;&#x1d461;−&#x1d45b;&#x1d45b;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c;)

&#x1d43c;&#x1d43c;&#x1d43c;&#x1d43c;���&#x1d461;&#x1d461;−&#x1d45b;&#x1d45b;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; ∗ 100�

When pivot month is revised across 12 month average (even index years): &#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d461;&#x1d461;−12→&#x1d461;&#x1d461;;&#x1d44e;&#x1d44e;,&#x1d456;&#x1d456;→&#x1d456;&#x1d456;,&#x1d434;&#x1d434;

&#x1d45d;&#x1d45d; =

⎣ ⎢ ⎢ ⎢ ⎡�&#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d438;&#x1d438;&#x1d461;&#x1d461;,&#x1d434;&#x1d434;,&#x1d456;&#x1d456; − &#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d438;&#x1d438;&#x1d461;&#x1d461;−&#x1d45b;&#x1d45b;,&#x1d434;&#x1d434;,&#x1d456;&#x1d456; ∗ �

&#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d463;&#x1d463;,&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d45b;&#x1d45b;&#x1d434;&#x1d434;&#x1d434;&#x1d434;,&#x1d434;&#x1d434;,&#x1d456;&#x1d456; &#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d463;&#x1d463;,&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;,&#x1d434;&#x1d434;,&#x1d456;&#x1d456;

��

�&#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d438;&#x1d438;&#x1d461;&#x1d461;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; − &#x1d438;&#x1d438;&#x1d450;&#x1d450;&#x1d438;&#x1d438;&#x1d461;&#x1d461;−&#x1d45b;&#x1d45b;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; ∗ � &#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d463;&#x1d463;,&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d45b;&#x1d45b;&#x1d434;&#x1d434;&#x1d434;&#x1d434;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; &#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d463;&#x1d463;,&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c;

�� ∗ �

&#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d461;&#x1d461;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c;

&#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d461;&#x1d461;−&#x1d45b;&#x1d45b;,&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c; ∗ &#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d463;&#x1d463;,&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c;,&#x1d434;&#x1d434;

&#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d463;&#x1d463;,&#x1d434;&#x1d434;&#x1d434;&#x1d434;&#x1d45b;&#x1d45b;&#x1d434;&#x1d434;&#x1d434;&#x1d434;,&#x1d43c;&#x1d43c;,&#x1d434;&#x1d434; − 1� ∗ 100

⎦ ⎥ ⎥ ⎥ ⎤

&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456; &#x1d444;&#x1d444;1,&#x1d444;&#x1d444;5 = &#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456;

&#x1d444;&#x1d444;1 − &#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456; &#x1d444;&#x1d444;5 (Normalized based on absolute value).

Contributions: For an individual population contribution the terms highlighted in gray are removed. For subpopulation contribution difference the absolute value of item &#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456;

&#x1d444;&#x1d444;1,&#x1d444;&#x1d444;5 is evaluated relative to the sum representing the subpopulation proportional effect. The sum of subpopulation proportional effects equals 100%. Positive subpopulation item effects represent items where Q1>Q5. Negative subpopulation item effects represent items where Q1<Q5.

&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d438;&#x1d438;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446; &#x1d446;&#x1d446;&#x1d45d;&#x1d45d;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d45d;&#x1d45d;&#x1d438;&#x1d438;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446; &#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456; &#x1d444;&#x1d444;1,&#x1d444;&#x1d444;5 =

�&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456; &#x1d444;&#x1d444;1,&#x1d444;&#x1d444;5�

∑�&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d438;&#x1d456;&#x1d456; &#x1d444;&#x1d444;1,&#x1d444;&#x1d444;5�

t = current period CW= cost weight p = Q1 or Q5 A = aggregate All US i = lower-level item t –12 = period 12 months prior

AWnew = new aggregation weight

AWnew = new aggregation weight v = pivot month index I = aggregate All items

  • Abstract
  • Executive Summary
  • Background and Issues
  • Methodology
    • Income Cohort Definition
    • Income Cohort Demographic Characteristics
  • Data Inputs
    • CPI Basic Item-Area Expenditure Weight coverage
    • Income Quintile Spending Weights
    • Price Analysis
  • Results
    • Overall Index Results
    • Variation of income inflation gap over time
    • Variation in inflation gap by item category
    • Which items explain the inflation gap?
  • Summary/Conclusion/Future analysis/Next Steps
  • Appendix 1
  • Appendix 2

Impact of Weight Timeliness on the US CPI

Languages and translations
English

1 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Impact of Weight Timeliness on the US CPI

Anya Stockburger, Joshua Klick, Chris Miller, Jessie Park

Bureau of Labor Statistics

Meeting of the Group of Experts on CPIs

June 9, 2023

2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

BLS Consumer Price Indexes

100

110

120

130

140

150

160

170

180

190

CPI-U Final C-CPI-U Initial C-CPI-U

3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Motivation: Weighting Improvements Product Release Lag

(difference between index reference period and publication)

Weight Lag (difference between weight and index reference period)

Improvement Goals

CPI-U ~10 days 2-4 years 2 years

Reduce weight lag

Final Chained CPI-U

9-12 months None Reduce release lag Increase visibility

Initial Chained CPI-U

~10 days 2-4 years 2 years

Reduce revision size Increase visibility

4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Outline

Motivation

Impact of weight timeliness

Reducing weight lag in CPI-U

Future research

5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Annual Weight Changes

6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Airline Fares Monthly Expenditure Weights

_202004, 0.021%

_202107, 0.931%

0.0%

0.2%

0.4%

0.6%

0.8%

1.0%

1.2%

1.4%

1.6%

1.8%

_201201 _201301 _201401 _201501 _201601 _201701 _201801 _201901 _202001 _202101

7 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Impact of chain drift

Cage, Williams, Church 2021 “Chain Drift in the Chained Consumer Price Index: 1999-2017”

8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov -0.40%

-0.20%

0.00%

0.20%

0.40%

0.60%

0.80%

1.00%

2001200220032004200520062007200820092010201120122013201420152016201720182019202020212022

Difference in 12-month Percent Change CPI-U less Final C-CPI-U

Historical Impact of Timely Weights

9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Recent Impact of Timely Weights

-0.30%

-0.20%

-0.10%

0.00%

0.10%

0.20%

0.30%

0.40%

0.50%

0.60%

0.70% Ja

n M

ar M

ay Ju l

Se p

No v

Ja n

M ar

M ay Ju

l Se

p No

v Ja

n M

ar M

ay Ju l

Se p

No v

Ja n

M ar

M ay Ju

l Se

p No

v Ja

n M

ar M

ay

2018 2019 2020 2021 2022

Difference in 12-month Percent Change CPI-U – Final C-CPI-U

10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Snapshot #1: December 2020 -0.02 -0.015 -0.01 -0.005 0 0.005 0.01 0.015 0.02

Full-service meals and snacks

Limited-service meals and snacks

Admissions

College tuition and fees

Airline fare**

Personal computers

Toys

New vehicles

Motor vehicle insurance

Owner's Equivalent Rent

Upper-Level Substitution Bias Contribution - December 2021 (-0.13%)

11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

2020 Price and Weight Change

Full-service meals and snacks

Limited-service meals and snacks

Admissions

College tuition and fees

Airline fare**

Personal computers

Toys

New vehicles

Motor vehicle insurance

Owner's Equivalent Rent

-100%

-50%

0%

50%

100%

150%

-20.00% -15.00% -10.00% -5.00% 0.00% 5.00% 10.00%W ei

gh t C

ha ng

e

Price Change

12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Snapshot #2: December 2021 -0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5

Used cars and trucks

Gasoline

New vehicles

Full-service meals and snacks

Limited-service meals and snacks

Food at employee sites and schools

Clocks, lamps, and décor

Jewelry**

Motor vehicle insurance

Owner's equivalent rent

Upper-Level Substitution Bias Contribution - December 2021 (0.54%)

13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

2021 Price and Weight Change

Used cars and trucks

GasolineNew vehicles

Full-service meals and snacks

Limited-service meals and snacksFood at employee

sites and schools

Clocks, lamps, and décor

Jewelry**

Motor vehicle insurance

Owner's equivalent rent

-50%

-30%

-10%

10%

30%

50%

70%

90%

110% -60.00% -40.00% -20.00% 0.00% 20.00% 40.00% 60.00%

W ei

gh t c

ha ng

e

Price change

14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Improvements to Weight Timeliness

 Implemented annual weight updates with January 2023 indexes Reduced historical upper-level substitution bias by

0.03 percentage points per year More relevant weights (replace 2019/2020 with

2021 expenditure data) See website for more information on 2022 and

2023 weight updates

15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Weight Changes in 2023 Apparel Food, Alcohol Away

& Haircuts

Recreation &

Transportation

Lodging Away

Food, Alcohol at Home Other & Housing Goods

Hospital Services & Medicinal Drugs

16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

What’s next – quarterly weight updates?

 Reduces aggregate upper-level substitution bias

 Issues at sub- aggregate level Chain drift Outlier impact

 Klick, Park 2022

Weight update frequency

Annualized 12-month percent change (2002-2020)

Upper-level substitution bias

Biennial 2.06 0.24

Annual 2.03 0.21

Quarterly 1.95 0.13

Tornqvist (monthly)

1.82 -

17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

What’s next – improve timeliness of the C-CPI?

 Medium-term: 6-month lag to publish final C-CPI Survey protocol (placement dates) Processing efficiencies (auto-coding, monthly processing,

streamlined outlier review) Design changes (survey recall length)

 Long-term: real-time capture of expenditure information Funding for pilot test included in FY24 President’s budget

request

Contact Information

18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Anya Stockburger Chief, Branch of Revision Methodology

Division of Consumer Price Indexes www.bls.gov/cpi

[email protected]

  • Impact of Weight Timeliness on the US CPI
  • BLS Consumer Price Indexes
  • Motivation: Weighting Improvements
  • Outline
  • Annual Weight Changes
  • Airline Fares�Monthly Expenditure Weights
  • Impact of chain drift
  • Historical Impact of Timely Weights
  • Recent Impact of Timely Weights
  • Snapshot #1: December 2020
  • 2020 Price and Weight Change
  • Snapshot #2: December 2021
  • 2021 Price and Weight Change
  • Improvements to Weight Timeliness
  • Weight Changes in 2023
  • What’s next – quarterly weight updates?
  • What’s next – improve timeliness of the C-CPI?
  • Contact Information

Hedonic price estimates for new vehicles: When do rotations lead to drift? United States

Languages and translations
English

1

Hedonic price estimates for new vehicles: When do rotations lead to drift?

Brendan K. Williams

May 2023

WORKING DRAFT

Prepared for the Group of Experts on Consumer Price Indices

UNECE, Geneva, June 2023

Abstract

Using a transaction level dataset on car purchases, we document the empirical relationship between standard hedonic index methods, including hedonic imputation and time dummy estimates, and the matched-model approach. We extend this analysis to investigate the effects of product cycles on hedonic estimates and the potential for coefficient “drift” that may result in biased price indexes. We distinguish between these effects and conventional “chain drift” in the context of bilateral, pooled, similarity linking, and multilateral index approaches. We map transaction records to additional sources to incorporate more detailed vehicle attributes and performance metrics. We introduce a new method of similarity linking specific to hedonic regression. Our results offer guidance on hedonic index construction methods and incorporating alternative data for industries into official price statistics.

JEL Codes: C43, E31

__________________________________________________________ Brendan K. Williams is a Senior Economist in the Branch of Consumer Prices at the Bureau of Labor Statistics (BLS). This research arose out of an earlier project with Leonard Nakamura, Ryan Michaels, and Erik Sager. Thank you to Bill Thompson and Nicole Shepler for their comments and review.

2

1. Introduction

The availability of transaction and scanner data has created a shift in focus on price index research, with methods related to chain drift receiving a great deal of attention. In many cases, downward drift may be generated by a product cycle and not the conventional “chain drift” mechanism, product cycle effects have received relatively little attention in the literature. We conduct our empirical analysis on new vehicle sales data, which have documented product cycle effects related to intertemporal price discrimination and price changes being introduced simultaneously with product updates. In a traditional, fixed-sample consumer price index, item replacement is used to address issues that result from this drift and quality adjustment emerges as a necessity to address the quality bias that may ensue. Hedonic methods are associated with measuring technological improvements and have negative price index impacts as a result. However, when hedonic imputation methods are used on data with a product cycle, we often see positive effects when hedonic imputation allows improved measurement of long-run price change.

The relationships between product cycles and quality change, and how they relate to various price index construction methods have not been well investigated. We use a dataset of new vehicle sales records to compare several, standard approaches to price index construction to analyze the empirical effects of product cycles. New vehicles were the subject of the very first hedonic analysis and have continued to be a subject of interest in the literature. We revisit some earlier hedonic models of vehicles and combine these specifications with more recent hedonic index methods.

When using transaction data product matching and grouping or hedonic imputation may be used to address drift. Multilateral methods play a role in offsetting product cycle drift, but mainly as a means of time aggregating hedonic imputations and addressing drift they may induce. We find that multilateral methods applied without hedonic imputation or product matching do not address product cycle drift. Within just the past few years, similarity linking methods have shown promise for dealing with chain drift as an alternative to GEKS-type multilaterals (Diewert, 2021). In an apparent first, we combine similarity linking methods with hedonic imputation. Similarity linking methods entail finding similar time periods for bilateral price index comparisons. Several different methods have been proposed to quantify “similarity” in practice. We introduce a new method where Chow test statistics are used with hedonic regression estimates to assess relative similarity between time periods.

We begin with an explanation of our data. We receive transaction level data from J.D. Power. These data are currently in estimation for the U.S. CPI based on the methodology in Williams and Sager (2019). We match these sales observations to more detailed specification information, including measures of vehicle performance, from Wards. The addition of this information allows us to produce more detailed hedonic models and reproduce specifications of historical interest.

We move to a discussion of previous work on hedonic estimates for vehicles. We conduct our empirical analysis on new vehicle sales. New vehicles have a long history in hedonic research going back to the seminal papers introducing and popularizing hedonic methods. We revisit these earlier model specifications and evaluate them in terms of more recent hedonic price index methods.

Next, we review evidence for product cycles and the intuition behind their effect on price indexes.

3

We then review the existing, established methodologies for using hedonic, multilateral, and similarity linking methods. We detail the use of our novel similarity linking method based on hedonic imputation.

Much of this paper is devoted to documenting the behavior of various standard hedonic and other price index methods on a single data source with well-documented product cycle behavior. We show that matched model price indexes tend to show implausibly large decreases. Hedonic estimates tend to show less of a decline. Our results are consistent with several other papers where matched model indexes are downwardly biased. We find that hedonic imputation methods may address product cycle issues by allowing long-run price comparisons to be made over a long time horizon.

2. Data Transaction records with pricing data from J.D. Power were linked to specification data from Wards. The two sources were not consistent with each other, especially when identifying trim and packages. An algorithmic-assisted process was used to link records between the two sources. Concatenations of the model, trim, and package fields for both sources were created and compared. One-to-one matches were assumed to be correct. When a J.D. Power record matched several potential Wards vehicles, records were prioritized based on the highest number of words in common followed by the minimum string distance. These matches were then manually reviewed. The Wards data did not cover all observations in the J.D. Power data with certain trims and even a few models were omitted.

Our Wards data contain specification information from the 2005 to 2019 model year. 2020 model year vehicles began sales in 2019, meaning our specification information only covers our transaction data through 2018. Data for the 2019 model year also does not include mileage estimates (presumably these were not available at the time when we received this data from Wards), which limits model specifications that include fuel efficiency to the index through 2017.

This paper will focus on the pricing of passenger cars (defined as vehicles with listed body types of sedans, convertibles, hatchbacks, coupes, and wagons). Truck and van vehicle configurations (such as cabin type, bed length, and van height) can add variation to price, and our data does not indicate these specifications consistently. Moreover, many of these configurations are intended for commercial use and outside the scope of a Consumer Price Index.

Unlike much of the research published by BLS, the indexes created here are not intended as candidates for production use in the U.S. CPI. We have made many simplifications that would not be used in a production series (such as estimating a single-stage price index at the national level rather estimating area level indexes and performing an aggregation).

3. Background on New Vehicle Hedonic Research New vehicle pricing has been the focus of landmark hedonic1 methods papers including the foundational research in Court (1939) and the popularization of hedonics following Griliches (1961)—and more recently in the broader demand estimation literature with Berry, Levinsohn, and Pakes (1995). These early hedonic papers focused on tangible aspects of vehicles and their performance with Court

1 Court first used the term “hedonic” and attributed the name to a suggestion from Alexander Sachs. Court’s paper is typically described as the beginning of hedonic, and it seems to have laid theoretical foundations, but other papers in agriculture proceeded it in using features as predictors of price. See Colwell and Dilmore (1999).

4

proposing a three-variable specification of weight, wheelbase, and horsepower. Griliches (1961) added dummy variables for V8 engines, hardtops, transmission, compact body type, and power brakes and steering. Triplett (1969) followed this specification but combined power brakes and steering and also proposed a truncated model. Cowling and Cubbin (1972) introduced several other variables including vehicle fuel efficiency.

Of these historical specifications, we can most directly reproduce the specifications used in Court (1939) and Ohta and Griliches (1976). Power brakes have long been standard on almost all vehicles sold in the United States and only a few models are available with manual steering (even manual transmission has become uncommon). The simple, three-variable model in Court remains directly applicable even to modern vehicles. The other models are less applicable as options like power brakes and steering are now nearly universally standard and while others no longer exist—namely, the pillarless “hardtop,” which has not been sold since the 1970s. Omitting hardtop as an obsolete feature, the Ohta and Griliches specification is producible given our data and has the advantage of accounting for vehicle make. Since “make” is generally indicative of the level finishings in a vehicle, including make gives a rough control on interior quality (an aspect generally otherwise omitted in our data and the papers discussed below).

Table 1: Comparison of historical model specifications

Court (1939)

Griliches (1961)

Triplett (1969)

Triplett Trunc. (1969)

Cowling & Cubbin (1972)

Ohta & Griliches (1976)

Weight x x x x

x Wheelbase x length/wheelbase

Horsepower x x x

x x Length

length/wheelbase x

x x

V8

x x

x Hardtop

x x

x

Transmission

x X Comb.

Power brakes

x Comb. Comb. x

Power steering

x Comb. Comb.

Compact

x x x

Over4Gears

x Luxury

x

PassengerArea

x Efficiency

x

Make

Indicator variables

5

4. Product Cycle We refer to regular patterns in product entry and exit and their effects price and quantity measurement as “product cycles.” For cars, we focus on two elements of the product life cycle: price declines over a single product iteration driven by intertemporal price discrimination and the tendency for price change to be associated with model updates. Both of these elements of the vehicle product cycle lead to potential bias in estimating price change. Taking a simple matched model approach with product entry and exit through overlap would result in persistent downward index movement since price discrimination leads to a strong tendency for price discounts over a single iteration. Similarly, if sellers update their pricing strategies with new product entry, a matched model index with overlap would not reflect the change between pricing regimes. While these product cycle effects pertain to the new vehicle industry, other item categories also exhibit product cycle behavior that result in similar measurement issues.

Aizcorbe, et al. (2010) and Williams and Sager (2019) document evidence for intertemporal price discrimination related to consumer heterogeneity over the product cycle. Chained price comparisons that reflect the price change across variants fail to offset product life cycle effects. Williams and Sager (2019) found multilateral indexes without linking across product versions failed to counter downward drift and proposed a year-over-year, model-on-model measurement for the trend price in order to avoid the effects of price discrimination and account for price change with product updates.

Reinsdorf, et al. (1996) noted that sellers often introduced prices alongside new models. If indexes only show price change for the same version of an item (and overlap old and new products as they enter and exit the market), price change between regimes, which is the most important in the long run, will be omitted. In cases when sellers update their price strategy or schedules for inflation at the time of changing product offerings, omitting this price change will result in downward bias. This issue is addressed in traditional fixed sample surveys by showing price change between new and replacement items. Williams (2021) finds that the effects of item replacement and class-mean imputation, which is motivated by the need to capture and impute price change across products updates, are very large— larger than estimated quality bias in the index. Moreover, the need to correct for quality bias during these comparisons is ultimately motivated by the need to measure the price change across updates.

In a scanner data context, product matching and grouping have often been used. However, results can be sensitive to the producers used to map products together. Moreover, the timing and other aspects of the item replacement and dynamic weighting further complicate translating “item replacement” methods to scanner data.

Another approach is to use hedonic estimation. Looking at apparel data, Greenlees and McClelland (2010) found a similar pattern that we see in the vehicle market where prices decline strongly in within version price change. Their results varied greatly depending on the specific technique of hedonic index construction used. Multilateral methods did not address product cycle effects as “the relentless downward march of prices completely overwhelm the chain drift issue.” Below we investigate various approaches to hedonic index construction and how they relate to product cycles.

6

5. Hedonic Methods Hedonic methods basically predict a product’s price as a function of its attributes. We can revisit the early model in Court to serve as an example.

&#x1d43f;&#x1d43f;&#x1d43f;&#x1d43f;(&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;) = &#x1d6fc;&#x1d6fc; + &#x1d6fd;&#x1d6fd;1 × &#x1d44a;&#x1d44a;ℎ&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d443;&#x1d443; + &#x1d6fd;&#x1d6fd;2 ×&#x1d44a;&#x1d44a;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d44a;&#x1d44a;ℎ&#x1d461;&#x1d461; + &#x1d6fd;&#x1d6fd;3 × &#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d443;&#x1d443;&#x1d452;&#x1d452;&#x1d443;&#x1d443;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d443;&#x1d443;&#x1d443;&#x1d443;

Using linear regression, Court estimated the coefficient values (for wheelbase in inches, weight in hundredweight) in a joint time period regression for 1925 to 1930:

&#x1d43f;&#x1d43f;&#x1d43f;&#x1d43f;(&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;) = 4.1256 + 0.0161 ×&#x1d44a;&#x1d44a;ℎ&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d452;&#x1d443;&#x1d443; + 0.0461 × &#x1d44a;&#x1d44a;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d44a;&#x1d44a;ℎ&#x1d461;&#x1d461; + −0.0003 × &#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d443;&#x1d443;&#x1d452;&#x1d452;&#x1d443;&#x1d443;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d43b;&#x1d443;&#x1d443;&#x1d443;&#x1d443;

To estimate the price of a Model T in 1925 we can enter in the specification values for the Model T:2

&#x1d43f;&#x1d43f;&#x1d43f;&#x1d43f;(&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;) = 4.1256 + 0.0161 × 100 + 0.0461 × 12 + −0.0003 × 20

This estimates the price of a Model as $535.29 in 1925.

We can perform the same exercise using a model estimated on modern data and estimate what a new Model T would cost in 2019 as

&#x1d43f;&#x1d43f;&#x1d43f;&#x1d43f;(&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;&#x1d443;) = 9.9 + −0.0149 × 100 + 0.0004 × 1200 + 0.0029 × 20

an estimate of $7673.06.

Hedonic methods are often suggested to address selection bias related to the immediate entry and exit of a product. While the literature generally expects hedonic indexes or hedonic adjustment to have downward impact on indexes, BLS research has found that hedonic adjustment has small or even upward effects shows that, were the BLS to omit the item replacement process entirely, indexes would generally be substantially lower. Previous research has found that these adjustments of have little impact on the U.S. CPI (Brown and Stockburger, 2006; Johnson, et al., 2006; Williams, 2021). Williams (2021) finds that product cycle effects are much larger than estimates of quality bias. Here, we focus on hedonic imputation as a means of calculating long-run price change in order to address these product cycle effects.

Model We continue in the vein of the previous model discussed above in focusing on vehicle performance attributes and basic elements of vehicle size. In addition to the horsepower, we also have data on torque, and mileage broken out into city and highway estimates. Automotive engineers face basic tradeoffs in terms of power, weight, and efficiency. We create a highly interacted model to allow parameter estimates to account for the underlying relationships between these variables and better fit our data. We produce the Court and Ohta and Griliches models for their ease of interpretation and historical interest. The Ohta and Griliches specification is used as a benchmark for several comparisons in this paper.

2 Court originally estimated with weight as “hundred weight,” so we use 12 instead of 1200.

7

Hedonic Imputation Indexes Hedonic imputation (HI) is often cited as the preferred approach to hedonic indexes (Diewert, 2019). In a hedonic imputation index, a hedonic regression is estimated on each period. The prices for the sets of goods in other periods are then estimated. Following the example above, taking a Model T imputed price from 1925, $535.29, and the imputed price of the Model T in 2019 gives us a Laspeyres price index increase of 1422% which is not far off from the overall CPI change of 1455% for the same period.

Silver and Heravi (2007) find them preferable to adjacent period time dummy hedonics (TDH). Unlike time product dummy (TPD) and TDH indexes, the dependent variable of price is not restricted to the natural log transformation. In TPD and TDH approaches, the dependent variable must be in the form of a natural logarithm to allow the time dummy to be interpreted as a proportional change in price.

Here we focus on full imputation, HI indexes where both omitted and observed are replaced with the predicted value produced by a hedonic regression for the corresponding period. Imputed “missing” observations comprise of an imputed price with a zero-value quantity and expenditure weight.

In our approach, all observations with the same set of values for a given specification are grouped together into one unit. For a detailed specification, this is equivalent or nearly equivalent to defining a unit by product identifier. In less detailed specifications, such as the Court model, a unit consists of transactions from multiple product identifiers.

Time Dummy Hedonic Time dummy hedonic regressions constrain coefficients to have the same value over time. If the underlying parameter shifts between periods, the residual will be correlated with time period. The time dummy variable will then capture the difference.

In a time dummy model estimated on pooled dataset, the coefficient data is pooled over a long-time period. Another approach is to use an adjacent period TDH where a series of regressions is estimated on data pairs of adjacent periods with a dummy variable indicating the later period. These time dummy variables can be accumulated into a chained multiperiod index. The adjacent period index allows the

The time dummy hedonic equation is

ln&#x1d43b;&#x1d43b;&#x1d456;&#x1d456;&#x1d461;&#x1d461; = &#x1d6fc;&#x1d6fc; + �&#x1d6ff;&#x1d6ff;&#x1d461;&#x1d461; &#x1d447;&#x1d447;

&#x1d461;&#x1d461;=1

&#x1d437;&#x1d437;&#x1d461;&#x1d461; + �&#x1d6fd;&#x1d6fd;&#x1d458;&#x1d458;&#x1d467;&#x1d467;&#x1d456;&#x1d456;&#x1d458;&#x1d458;

&#x1d43e;&#x1d43e;

&#x1d458;&#x1d458;=1

Given the close relationship between a TPD and matched model, and a TPD as a “fully” interacted TDH, we should be wary that a TDH is susceptible to the same issues we see in the matched model.

Time-Product Dummy The Time-Product Dummy variable assigns a “dummy” or “indicator” variable to each “product” or model where each product is identified by a model number or a particular set of features.

Following the representation in de Haan et al. (2021), the time-product-dummy equation is:

ln &#x1d43b;&#x1d43b;&#x1d456;&#x1d456;&#x1d461;&#x1d461; = &#x1d6fc;&#x1d6fc; + �&#x1d6ff;&#x1d6ff;&#x1d461;&#x1d461; &#x1d447;&#x1d447;

&#x1d461;&#x1d461;=1

&#x1d437;&#x1d437;&#x1d461;&#x1d461; + �&#x1d6fe;&#x1d6fe;&#x1d456;&#x1d456;&#x1d437;&#x1d437;&#x1d456;&#x1d456;

&#x1d447;&#x1d447;

&#x1d461;&#x1d461;=1

8

The TPD can accommodate additional fixed effects specific to a product. Any omitted variables from a hedonic specification, would contribute to a product specific fixed effect. TPD could also accommodate differences in coefficient values (e.g., differences in how a given feature contributes to the price of a car versus a truck). The TPD is the equivalent of a flexible, data-driven hedonic—Krsinich (2016) describes the TPD as a fully interacted TDH. However, it fails in the case of hedonic adjustment’s reason for being, product entry and exit. The time-product dummy approach approximates a matched model index and only trivially includes “unmatched” observations. The TPD produces a similar index to the geometric matched model (Aizcorbe, 2014).

For a “good” hedonic specification, we would expect that the fixed effects for a product i would be approximately equal to the coefficient effects from the hedonic model. As de Haan points, the TPD is a special case of the TDH where:

&#x1d6fe;&#x1d6fe;&#x1d456;&#x1d456; = �&#x1d6fd;&#x1d6fd;&#x1d458;&#x1d458;&#x1d467;&#x1d467;&#x1d456;&#x1d456;&#x1d458;&#x1d458;

&#x1d43e;&#x1d43e;

&#x1d458;&#x1d458;=1

Given the close relationship between a TPD and matched model, and a TPD as a “fully” interacted TDH, we should be wary that a TDH is susceptible to the same issues we see in the matched model.

Multilaterals with Hedonic Imputation Ivancic, Diewert, and Fox (2011) introduced the GEKS formula and, more generally, sparked interest in multilateral approaches to address chain drift in price indexes. Chain drift can broadly be defined as divergence between the chained and fixed-base versions of a price index. The literature on chain drift focuses on the “stock” economic explanation for chain drift.3 Here, consumers buy a product at price p0 with a frequency of quantity q0. The product goes on sale and quantity increases dramatically to q1 and price decreases to p1. Consumers stock up on the product the sale price, p1, and satiate their demand over a longer time horizon than the measurement period. As such, even though the price returns to p0 in period 2 (p2=p0), quantity is significantly lower than the original amount demanded at the same price (q2<q0). Many price indexes, including the generally preferred superlative indexes, will show this as a permanent price decrease since the weight on the price increase is not symmetric with the weight on the price decrease. (Other indexes, such as the Jevons, an unweighted geometric index, would not show a permanent decrease). Other factors may lead to a divergence between chained and fixed-base index results especially when product turnover requires methods for product matching or grouping.

These methods have been combined with hedonic imputation in research beginning with de Haan and Krsinich (2014). De Haan and Daalmans (2019) discuss “single imputation,” where only missing prices are imputed, and “double imputation” where a missing price and the observed price that corresponds to it in a price relative are imputed. They note that the double imputation may mitigate the effects of omitted variable bias.

Similarity Linking with Hedonic Imputation Similarity linking has recently gained attention as a means of addressing chain drift in price indexes. Like multilateral methods, similarity linking first arose in the context of spatial price measurement but has

3 See Diewert (2021).

9

been translated to intertemporal price indexes. As noted in Modernizing the Consumer Price Index for the 21st Century (National Academies, 2022), similarity linking has two advantages over multilateral indexes: first, the indexes satisfy the multiperiod identity test, and, second, are fully transitive, unlike rolling window extensions of multilateral indexes. The National Academies’ report also suggests that hedonic imputation could be combined with similarity, and we explore that recommendation.

Similarity linking methods create chained price indexes where each period’s price relative is a bilateral comparison between the given period and the prior period determined to be most “similar.” The intuition being that price comparisons between periods with similar consumption patterns and weight distributions will reduce drift. The question arises of how to quantify “similarity.”

Proposed methods include using the dissimilarity in predicted product shares between periods and the relative dispersion of the Laspeyres and Fisher indexes.

We introduce a method where the period specific hedonic regressions themselves are used to determine the similarity between periods using the test for regression model similarity proposed in Chow (1960). To determine the most similar link for a month t, we run Chow tests between t and each preceding period and take the period with the minimum Chow statistic as the link. Following the same process as other similarity linking procedures, index level for t, &#x1d43c;&#x1d43c;&#x1d461;&#x1d461;, is calculated based on the bilateral price index, P, and most similar period to t, &#x1d461;&#x1d461;&#x1d45a;&#x1d45a;&#x1d456;&#x1d456;&#x1d45a;&#x1d45a;:

&#x1d43c;&#x1d43c;&#x1d461;&#x1d461; = &#x1d43c;&#x1d43c;&#x1d461;&#x1d461;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a; × &#x1d443;&#x1d443;(&#x1d43b;&#x1d43b;&#x1d461;&#x1d461; ,&#x1d45e;&#x1d45e;&#x1d461;&#x1d461;,&#x1d43b;&#x1d43b;&#x1d461;&#x1d461;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a; , &#x1d45e;&#x1d45e;&#x1d461;&#x1d461;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;&#x1d45a;)

The Chow statistic measures how well a model estimated on a combination of two samples compares with models fitted on the samples individually. The Chow test consists of taking two sets of data—in our case, time period t and t-a—and estimating three regressions: one for each period and one where the data is combined into a single pool. The Chow statistic is then calculated based on the sum of squared errors from each regression, SSE, number of observations from each sample, N, and number of parameter estimates, k. These values produce the F-distributed Chow statistic:

&#x1d439;&#x1d439; = (&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d45a;&#x1d45a;&#x1d436;&#x1d436;&#x1d436;&#x1d436; − (&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461; − &#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461;−&#x1d44e;&#x1d44e;))/&#x1d458;&#x1d458;

(&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461; − &#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461;−&#x1d44e;&#x1d44e;)/(&#x1d441;&#x1d441;&#x1d461;&#x1d461; + &#x1d441;&#x1d441;&#x1d461;&#x1d461;−&#x1d44e;&#x1d44e; − 2&#x1d458;&#x1d458;)

We modify the typical Chow test to include a dummy variable for time in the combined regression, which, for t and t-1, would be equivalent to an adjacent period time dummy regression (this allows time periods to match based on similar coefficients even with aggregate price change). The time period with the lowest Chow statistic, &#x1d461;&#x1d461;&#x1d45a;&#x1d45a;&#x1d456;&#x1d456;&#x1d45a;&#x1d45a;, is determined to be the most similar regression model to t and is selected as the link.

In addition to numerical advantages, similarity linking has significantly reduced computational requirements compared to multilateral indexes. Each new period of data is directly compared to each preceding period once and then an index is calculated. This means given w periods in an index window, w comparisons must be made to update a similarity index with w-1 similarity comparisons and one index calculation. This is a substantial reduction from GEKS-type indexes which require index comparisons on the order of w2 for an w-length window.

10

6. Matched Model Indexes and Product Definition The “matched model” index is the standard approach to measuring price change. Individual “models” are identified by either an indicator (for example, UPC or GTIN) or a set of specification values. The price for the same good is then compared from period-to-period. In a traditional, fixed sample, survey-based price index, product cycle effects are dealt with by comparing the price of a discontinued product with the price of a similar, successor product. When a replacement product is not considered comparable, the difference between the two items may be imputed. The item replacement and related-imputation process can have extremely large effects on a price index (See Williams, 2021).

In a scanner data context, the direct relationship between an exiting and entering good does exist as it does in a fixed sample survey. Products may be allowed to come and go from calculations as they enter and exit the market or remain on the market without any recorded sales. When “matched model” allows goods to fluidly enter and exit calculations, the “maximum overlap” approach to product turnover is used. However, this omits price change that may be introduced with model updates and allows for bias from product cycle effects. When working with scanner data, some researchers use the concept of a “product relaunch” to link old and new products together. Similarly, “product grouping” can be used so that multiple products can be grouped together and treated as one.

Here we investigate product definition in terms of aggregating transactions to a given model specification level. For example, following the Court specification, all observations that are all 180 inches long, 160 horsepower, and 2000 pounds would all be aggregated together to form a mean price and total quantity used in regression and matched model price index estimates. Once again, we use the specifications from Court, Ohta and Griliches, and our own specification. For a given specification, transactions are aggregated into an arithmetic mean price across all transactions meeting a given combination of variables and the total number of transactions as the quantity (with expenditure implied by the product of the mean price and total quantity). These indexes constitute a matched model index (without imputation) where a unique set of variable values for a given model specification constitutes a product definition.

7. Results We reproduced the specifications used in Court (1939) and Ohta and Griliches (1976) for various forms of hedonic indexes namely pooled, adjacent period, and single period. Regression results for pooled version of the Court and Ohta and Griliches specifications are presented below in tables 2 and 3, respectively. The dummy variables for month have been excluded from both and nameplate has been excluded in table 3. The pooled result for our interacted specification is in the appendix. Results were similar to adjacent and single period coefficient estimates. Variables were generally significant and had the expected sign with the exception of “length” and “wheelbase” where these were negative (except for wheelbase in the interacted model).

11

Table 2: Regression results for the pooled Court model

Estimate Std. Error t value Pr(>|t|)

(Intercept) 1.06E+01 1.57E-02 677.231 < 2e-16 *** Wheelbase -2.84E-02 1.66E-04 -171.311 < 2e-16 *** Weight 6.31E-04 2.35E-06 268.856 < 2e-16 *** Horsepower 2.28E-03 7.41E-06 308.125 < 2e-16 *** MONTHS - - - - - Multiple R-squared: 0.6887 Adjusted R-squared: 0.6884

Table 3: Regression results for the pooled Ohta & Griliches model

Estimate Std. Error t value Pr(>|t|)

(Intercept) 9.75E+00 1.31E-02 742.181 < 2e-16 *** Length -5.54E-03 6.38E-05 -86.912 < 2e-16 *** Weight 3.50E-04 1.98E-06 176.808 < 2e-16 *** Horsepower 1.12E-03 6.37E-06 176.371 < 2e-16 *** Cylinders 4 6.49E-02 8.02E-03 8.096 5.69E-16 *** Cylinders 5 -2.46E-02 8.63E-03 -2.854 0.004313 ** Cylinders 6 1.73E-01 8.20E-03 21.034 < 2e-16 *** Cylinders 8 4.29E-01 8.47E-03 50.681 < 2e-16 *** Cylinders 10 9.59E-01 1.05E-02 91.744 < 2e-16 *** Cylinders 12 7.56E-01 1.28E-02 58.97 < 2e-16 *** NAMEPLATE - - - - - MONTHS - - - - - Multiple R-squared: 0.8717 Adjusted R-squared: 0.8715

Matched Model Index Results When using basic, matched model methods indexes drifted downward substantially. The results align with expectations from basic cost-of-living index theory: The Laspeyres and Paasche form upper and lower bounds (respectively), and the Törnqvist and Fisher are essentially equivalent. Applying multilateral methods does not address drift. This reinforces the finding from Williams and Sager (2019) that the index declines resulted from product cycles pricing patterns not weight-driven “chain drift.” Interestingly, the multilateral indexes with longer window lengths showed more of a decline than shorter window lengths. The opposite of what we see in the hedonic imputation indexes. The final period chained bilateral Törnqvist and Fisher indexes are in between the 13- and 24-month multilateral indexes, suggesting little overall effect. Since no matches are being made across different versions of a product, longer windows for multilateral indexes will capture more sales for very old cars. This suggests extending the window will only worsen “drift.” Extending the window length to 36 months resulted in bilateral comparisons with no matched observations.

12

The prevailing expectation may be that hedonic estimates would show lower indexes than conventional matched. To the contrary, we see that, of all the methodologies, matched model indexes produce the largest declines. This is consistent with similar research including Greenlees and McClelland (2010) and de Haan and Daalmans (2019). Matched model, maximum overlap price indexes show price change only for the same item so constant quality is maintained. These indexes also allow products to enter and exit calculations. They do not exhibit “quality bias” in the sense that price comparisons are made between goods of differing quality, which is often the motivation behind applying hedonic methods. However, the indexes are still subject to selection bias and product life cycle effects.

Product Grouping and Multilateral Indexes As an alternative to hedonic imputation, cross-version price change can be measured by aggregating products with the same set of specification values and treating them as one product. As new iterations are introduced. Using the Court and Ohta and Griliches specifications to group products leads to indexes that decline much less than the matched model index based on a product identifier. Moreover, the application of multilateral formulas reduces the declines further. These indexes still do not represent plausible estimates for price change. For the decline of one product to be offset, it must have another

40

50

60

70

80

90

100

Ja n-

07 M

ay -0

7 Se

p- 07

Ja n-

08 M

ay -0

8 Se

p- 08

Ja n-

09 M

ay -0

9 Se

p- 09

Ja n-

10 M

ay -1

0 Se

p- 10

Ja n-

11 M

ay -1

1 Se

p- 11

Ja n-

12 M

ay -1

2 Se

p- 12

Ja n-

13 M

ay -1

3 Se

p- 13

Ja n-

14 M

ay -1

4 Se

p- 14

Ja n-

15 M

ay -1

5 Se

p- 15

Ja n-

16 M

ay -1

6 Se

p- 16

Ja n-

17 M

ay -1

7 Se

p- 17

Ja n-

18 M

ay -1

8 Se

p- 18

Ja n-

19 M

ay -1

9 Se

p- 19

In de

x (Ja

nu ar

y 20

07 =1

00 )

Matched Model (SquishVIN) Price Indexes

Tornqvist Laspeyres Paasche Fisher

GEKSmean13 GEKSmean24 CCDImean13 CCDImean24

13

exact match in terms of the specified features and continue to sell in the market. If an exact specification match does not exist, product cycle effects will bias the index.

Product matching is often viewed as incidental to price index methods, however, our results show that making price comparisons across broader time horizons is essential for accurately measuring long-run price change. In other words, accumulations of short-term, same version price change do not result in accurate price measures—even when multilateral and similarity linking methods are applied. Hedonic and product grouping and matching methods are needed.

It is important to consider that many of the issues related to chain drift may arise as secondary effects that result from the method of product matching grouping or hedonic estimation applied to the data rather than a feature of the data in terms of a matched model.

Pooled and Adjacent Period Time Dummy Hedonic Pooled regressions TDH were consistently higher than corresponding adjacent period indexes. Pooled regressions constrain coefficients to the same value over the entire period. The effect constrains the valuation of different features to remain the same over the entire period, which does not accommodate changes in consumer tastes. Pooled TDH also are also subject to revision as previous period values are reestimated with each additional month of data leading to revision.

60

65

70

75

80

85

90

95

100

105

Ja n-

07

Ju n-

07

N ov

-0 7

Ap r-

08

Se p-

08

Fe b-

09

Ju l-0

9

De c-

09

M ay

-1 0

O ct

-1 0

M ar

-1 1

Au g-

11

Ja n-

12

Ju n-

12

N ov

-1 2

Ap r-

13

Se p-

13

Fe b-

14

Ju l-1

4

De c-

14

M ay

-1 5

O ct

-1 5

M ar

-1 6

Au g-

16

Ja n-

17

Ju n-

17

N ov

-1 7

Ap r-

18

Se p-

18

Product Grouped Product Multilaterals

OhtaGrilGrouped OhtaGrilGroupedCCDI13 OhtaGrilGroupedCCDI24

CourtGrouped CourtGroupCCDI13 CourtGroupCCDI24

14

Bilateral Hedonic Imputation and Time-Product Dummy Comparing these same adjacent period TDH indexes with their bilateral hedonic imputation counterparts shows little difference between the methods with an exception of period of divergence in the Court models. Both the Ohta and Griliches and interacted models were within a few percentage points of each other. The Court specification with hedonic imputation showed volatile behavior in 2015 that caused a divergence from its adjacent period counterpart. Shortly after the Court hedonic imputation index appears to stabilize and run close to parallel with the adjacent period index.

While the adjacent period and hedonic imputations appear plausible, there are still concerns that they may reflect product cycle bias.

Our results confirm the expectation that TPD and a geometric matched model index would perform similarly as the resulting indexes are extremely close.

85

90

95

100

105

110

115

120

125 Ja

n- 07

Ju n-

07

N ov

-0 7

Ap r-

08

Se p-

08

Fe b-

09

Ju l-0

9

De c-

09

M ay

-1 0

O ct

-1 0

M ar

-1 1

Au g-

11

Ja n-

12

Ju n-

12

N ov

-1 2

Ap r-

13

Se p-

13

Fe b-

14

Ju l-1

4

De c-

14

M ay

-1 5

O ct

-1 5

M ar

-1 6

Au g-

16

Ja n-

17

Ju n-

17

N ov

-1 7

Ap r-

18

Se p-

18

Pooled vs Adjacent Period TDH Indexes

CourtPooled CourtAdj

OktaGrilPooled OktaGrilsAdj

InteractPool InteractAdj

15

Hedonic Imputation with Multilateral Methods The 13-month extension window had a downward effect compared to a bilateral hedonic imputation index (constructed on single period index imputation). The shorter window would reduce the occurrence of longer-run relatives compared to indexes with longer extensions, but it is unclear why it would lower an index below the bilateral hedonic imputation index.

Longer window multilaterals decline less than those with shorter windows. In the matched model indexes above this relationship was inverted with the 24-month window multilateral falling more than the 13-month. This suggests that the positive effects of extending the window are not related to addressing weight fluctuations that lead to drift, but, rather, increasing the representation of weight placed on longer-term, hedonically imputed price change. A fixed base, hedonic imputation index should not be sensitive to drift or product cycle effects, but the index will lose representivity over time as the base period set of products becomes less relevant. We construct a fixed base, Törnqvist index hedonic imputation which, over a 12-year span, is about 1.5% higher than the hedonic imputation CCDI (Caves- Christensen-Diewert-Inklaar index, a GEKS-type multilateral index based on Törnqvist bilateral comparisons) with a 36-month window.

To avoid product cycle effects in cars, an index must reflect price change across different iterations of goods (model years). A fully transitive index is not dependent on intervening periods, so within model year price change would not alter the long-run measurement of the index. However, full period multilaterals are difficult to calculate because of product turnover and computational demand.

55

60

65

70

75

80

85

90

95

100

105

110

115

120 Ja

n- 07

Ju n-

07

N ov

-0 7

Ap r-

08

Se p-

08

Fe b-

09

Ju l-0

9

De c-

09

M ay

-1 0

O ct

-1 0

M ar

-1 1

Au g-

11

Ja n-

12

Ju n-

12

N ov

-1 2

Ap r-

13

Se p-

13

Fe b-

14

Ju l-1

4

De c-

14

M ay

-1 5

O ct

-1 5

M ar

-1 6

Au g-

16

Ja n-

17

Ju n-

17

N ov

-1 7

Ap r-

18

Se p-

18

Adjacent TDH, Hedonic Imputation, TPD

CourtAdj CourtHITorn

OhtaGrilsAdj OhtaGrilHItorn

InteractAdj InterHItorn

MMTornqvist TPDummyWLS

16

Moreover, they lead to revisions of prior months which are not acceptable for the publication of many official statistics. Extension methods can lead to indexes that are nearly transitive, but longer windows are preferred to better capture long-run price comparisons.

Similarity Linking with Hedonic Imputation The three methods of similarity linking combined with hedonic imputation all produced similar results. The Chow similarity and predicted share methods were highly correlated. The similarity link indexes without hedonic imputation (matched model indexes with similarity linking) showed large declines. The case mirrors the results of applying multilateral methods to matched model indexes: Without product matching or hedonic imputation to offset product cycle effects and capture price change with model updates, indexes will decline to implausible levels.

Our indexes using similarity linking showed index results comparable to a CCDI index with a 36-month extension window. However, the most similar month was typically the proceeding month with 105 of the 143 periods tested selecting the month prior as the most similar. Unlike GEKS-type multilateral, similarity linking does not necessarily force a comparison over a longer-time horizon. If changes are incremental or if the most “similar” link remains the previous month even after a pricing regime changes, similarity linking may not address aspects of the product cycle.

95

97

99

101

103

105

107

109

111

113

Ja n-

07

Ju n-

07

N ov

-0 7

Ap r-

08

Se p-

08

Fe b-

09

Ju l-0

9

De c-

09

M ay

-1 0

O ct

-1 0

M ar

-1 1

Au g-

11

Ja n-

12

Ju n-

12

N ov

-1 2

Ap r-

13

Se p-

13

Fe b-

14

Ju l-1

4

De c-

14

M ay

-1 5

O ct

-1 5

M ar

-1 6

Au g-

16

Ja n-

17

Ju n-

17

N ov

-1 7

Ap r-

18

Se p-

18

Ohta and Griliches HI with CCDI

OhtaGrilHItorn HIccdiOhtaGril13 HIccdiOhtaGril24

HIccdiOhtaGril36 HiFixOhtaGril

17

8. Conclusion Hedonic estimates have often been used to impute the prices for entering and exiting products. Hedonic estimates may also be used to estimate long-run price relatives, which allow better measurement of price change across product cycles. Product cycle effects have generally been neglected and the focus has been on “quality bias.” In matched model indexes, quality bias emerges as a secondary effect from the use of product matching as a means of addressing product cycle issues including price change with model updates and price discrimination.

Previous research has also found that multilateral indexes with hedonic imputation tend to fall less when a longer extension window is used. We find evidence that this is mostly due to the additional influence long-term, cross-product cycle relatives have in multilaterals with longer extended windows.

Estimates from hedonic imputation can be used with similarity linking methods. Like other multilateral methods, similarity linking without product replacement or hedonic imputation does not remedy product cycle effects. Using regression model similarity as a method for linking produces similar results to other multilateral methods but with greater simplicity and less computational demand.

90

95

100

105

110

115

Ja n-

07 Ju

n- 07

N ov

-0 7

Ap r-

08 Se

p- 08

Fe b-

09 Ju

l-0 9

De c-

09 M

ay -1

0 O

ct -1

0 M

ar -1

1 Au

g- 11

Ja n-

12 Ju

n- 12

N ov

-1 2

Ap r-

13 Se

p- 13

Fe b-

14 Ju

l-1 4

De c-

14 M

ay -1

5 O

ct -1

5 M

ar -1

6 Au

g- 16

Ja n-

17 Ju

n- 17

N ov

-1 7

Ap r-

18 Se

p- 18

In de

x (Ja

nu ar

y 20

07 =1

00 )

Similarity Linking

ChowSimilarityOktaImp PredShareImpOktaSimilar PLSpreadOktaImpSIm

18

Works Cited Aizcorbe, Ana M. 2014. A Practical Guide to Price Index and Hedonic Techniques. Oxford: Oxford

University Press.

Aizcorbe, Ana, Benjamin Bridgman, and Jeremy Nalewaik. 2010. "Heterogeneous car buyers: A stylized fact." Economic Letters 109 (1): 50-53. doi:10.1016/j.econlet.2010.08.003.

Berry, Steven, James Levinsohn, and Ariel Pakes. 1995. "Automobile Prices in Market Equilibrium." Econometrica 63 (4): 841-890. doi:10.2307/2171802.

Brown, Craig, and Anya Stockburger. 2006. "Item Replacement and Quality Change in Apparel Price Indexes." Monthly Labor Review 129 (12): 35-45. https://www.bls.gov/opub/mlr/2006/12/art3full.pdf.

Chow, Gregory C. 1960. "Tests of Equality Between Sets of Coefficients in Two Linear Regressions." Econometrica 28 (3): 591-605. doi:10.2307/1910133.

Colwell, Peter F., and Gene Dilmore. 1999. "Who Was First? An Examination of an Early Hedonic Study." Land Economics 75 (4): 620-626. doi:10.2307/3147070.

Court, A.T. 1939. "Hedonic Price Indexes with Automotive Examples." General Motors Corporation, Automobile Demand.

Cowling, Keith, and John Cubbin. 1972. "Hedonic Price Indexes for United Kingdom Cars." Economic Journal 82 (327): 963-978.

de Haan, Jan, and Jacco Daalmans. 2019. "Scanner Data in the CPI: the imputation CCDI Index revisited." 16th meeting of the Ottawa Group.

de Haan, Jan, Rens Hendriks, and Michael Scholz. 2016. "A Comparison of Weighted Time-Product Dummy and Time Dummy Hedonic Indexes." Graz Economics Papers.

Diewert, Erwin. 2018. "“Scanner Data, Elementary Price Indexes and the Chain Drift Problem”." (Vancouver School of Economics) Discussion Paper 20-07.

Diewert, W. Erwin. 2019. "Quality Adjustment and Hedonics: A Unified Approach." Vancouver School of Economics.

Feenstra, Robert C. 1995. "Exact Hedonic Price Indexes." The Review of Economics and Statistics 77 (4): 634-653.

Goodman, Allen C. 1998. "Andrew Court and the Invention of Hedonic Price Analysis." Journal of Urban Economics 44 (2): 291-298. doi:10.1006/juec.1997.2071.

Greenlees, J., and R. McClelland. 2010. "Superlative and Regression-Based Consumer Price Indexes for Apparel Using U.S. Scanner Data." St. Gallen, Switzerland: Conference of the International Association for Research in Income and Wealth.

Greenlees, John S., and Robert McClelland. 2011. "Does Quality Adjustment Matter for Technologically Stable Products? An Application to the CPI for Food." The American Economic Review 101 (3): 200-205. https://www.jstor.org/stable/29783739.

19

Griliches, Zvi. 1961. "Hedonic Price Indexes for Automobiles: An Econometric Analysis of Quality Change." In The Price Staistics of the Federal Government, 173-96. NBER.

Ivancic, Lorraine, W. Erwin Diewert, and Kevin J. Fox. 2011. "Scanner data, time aggregation and the construction of price indexes." Journal of Econometrics 161 (1): 24-35. doi:10.1016/j.jeconom.2010.09.003.

Johnson, David S, Stephen B. Reed, and Kenneth J Stewart. 2006. "Price measurement in the United States: a decade after the Boskin Report." Monthly Labor Review 129 (5): 10-19. https://www.bls.gov/opub/mlr/2006/05/art2full.pdf.

Konny, Crystal G., Brendan K. Williams, and David M. Friedman. 2019. "Big Data in the U.S. Consumer Price Index: Experiences and Plans." In Big Data for Twenty-First Century Economic Statistics, edited by Katharine G. Abraham, Ron S. Jarmin, Brian Moyer and Matthew D. Shapiro. University of Chicago Press. http://www.nber.org/chapters/c14280.

Krsinich, Frances. 2014. "The FEWS index: Fixed effects with a window splice." Meeting of the Group of Experts on Consumer Price Indices. Geneva, Switzerland. https://unece.org/sites/default/files/datastore/fileadmin/DAM/stats/documents/ece/ces/ge.22 /2014/New_Zealand_-_FEWS.pdf.

National Academies of Sciences, Engineering, and Medicine. 2022. Modernizing the Consumer Price Index for the 21st Century. Washington, DC: The National Academies Press.

Ohta, Makoto, and Zvi Griliches. 1976. "Automobile Prices Revistited: Extensions of the Hedonic Hypothesis." In Household Production and Consumption, edited by Nestor E. Teleckyj, 325-398. NBER.

Pakes, Ariel. 2003. "A Reconsideration of Hedonic Price Indexes with an Application to PC's." The American Economic Review 93 (5): 1578-1596. https://www.jstor.org/stable/3132143.

Reinsdorf, Marshall B., Paul Liegey, and Kenneth Stewart. 1996. "New Ways of Handling Quality Change in the U.S. Consumer Price Index." BLS Working Paper 276. https://www.bls.gov/osmr/research- papers/1996/pdf/ec960040.pdf.

Silver, Mick, and Saeed Heravi. 2007. "The Difference between Hedonic Imputation Indexes and Time Dummy Hedonic Indexes." Journal of Business & Economic Statistics 25 (2): 239-246. http://www.jstor.org/stable/27638928.

Triplett, Jack E. 1969. "Automobiles and Hedonic Quality Measurement." Journal of Political Economy 77 (3): 408-417. doi:10.1086/259524.

Triplett, Jack. 2006. Handbook on Hedonic Indexes and Quality Adjustments in Price Indexes. Paris: OECD Publishing.

Williams, Brendan. 2021. "Twenty-One Years of Adjustments for Quality Change in the US Consumer Price Index." Group of Experts on Consumer Price Indices. Online: UNECE. https://unece.org/sites/default/files/2021-05/Session_3_US-BLS_Paper_0.docx.

20

Williams, Brendan, and Erick Sager. 2019. "A New Vehicles Transaction Price Index: Offsetting the Effects of Price Discrimination and Product Cycle Bias with a Year-Over-Year Index." BLS Working Papers (Working Paper 514 ). https://www.bls.gov/osmr/pdf/ec190040.pdf.

21

Appendix

Table 4: Pooled Interacted Regression

Estimate Std. Error

t value Pr(>|t|)

(Intercept) 8.5590 0.0268 319.715 < 2e-16 *** BASE..ins.. 0.0058 0.0002 33.998 < 2e-16 *** Length..ins.. -0.0002 0.0001 -2.464 0.013759 * weight 0.0003 0.0000 45.374 < 2e-16 *** horsepower 0.0016 0.0001 28.577 < 2e-16 *** AWDdummy 0.0806 0.0013 62.736 < 2e-16 *** displacement -0.2948 0.0044 -66.646 < 2e-16 *** height -0.0155 0.0002 -67.134 < 2e-16 *** MPGCity 0.0216 0.0011 20.094 < 2e-16 *** MPGHwy -0.0168 0.0011 -14.834 < 2e-16 *** HybrDummy 0.4583 0.0179 25.607 < 2e-16 *** torque 0.0011 0.0000 121.504 < 2e-16 *** I(horsepower/weight) 18.0400 0.1710 105.457 < 2e-16 *** Make1 -0.2496 0.0027 -91.931 < 2e-16 *** Make2 -0.1644 0.0470 -3.497 0.000471 *** Make3 -0.1594 0.0038 -41.437 < 2e-16 *** Make4 -0.1441 0.0027 -52.95 < 2e-16 *** Make5 -0.1405 0.0021 -67.696 < 2e-16 *** Make6 -0.1396 0.0055 -25.335 < 2e-16 *** Make7 -0.1328 0.0022 -59.761 < 2e-16 *** Make8 -0.1301 0.0039 -33.104 < 2e-16 *** Make9 -0.1283 0.0025 -50.53 < 2e-16 *** Make10 -0.1239 0.0022 -55.747 < 2e-16 *** Make11 -0.1148 0.0066 -17.365 < 2e-16 *** Make12 -0.0870 0.0032 -26.788 < 2e-16 *** Make13 -0.0521 0.0038 -13.8 < 2e-16 *** Make14 -0.0333 0.0021 -15.912 < 2e-16 *** Make15 -0.0215 0.0025 -8.516 < 2e-16 *** Make16 -0.0211 0.0021 -9.951 < 2e-16 *** Make17 -0.0194 0.0024 -8.062 7.56E-16 *** Make18 -0.0066 0.0029 -2.305 0.021175 * Make19 0.0632 0.0022 28.216 < 2e-16 *** Make20 0.1381 0.0059 23.425 < 2e-16 *** Make21 0.1722 0.0030 57.081 < 2e-16 *** Make22 0.1824 0.0039 47.164 < 2e-16 ***

22

Make23 0.1907 0.0029 65.505 < 2e-16 *** Make24 0.2104 0.0124 16.971 < 2e-16 *** Make25 0.2244 0.0032 69.286 < 2e-16 *** Make26 0.2343 0.0112 20.872 < 2e-16 *** Make27 0.2479 0.0026 94.48 < 2e-16 *** Make28 0.2561 0.0043 60.248 < 2e-16 *** Make29 0.3073 0.0026 117.379 < 2e-16 *** Make30 0.3261 0.0030 108.382 < 2e-16 *** Make31 0.3711 0.0026 144.678 < 2e-16 *** Make32 0.4131 0.0033 125.356 < 2e-16 *** Make33 0.4226 0.0028 153.396 < 2e-16 *** Make34 0.5864 0.0278 21.129 < 2e-16 *** Make35 0.8271 0.0180 45.862 < 2e-16 *** Make36 0.8463 0.0030 279.909 < 2e-16 *** cylinders3 -0.0625 0.0084 -7.429 1.10E-13 *** cylinders4 -0.1437 0.0041 -35.445 < 2e-16 *** cylinders5 -0.2308 0.0042 -54.936 < 2e-16 *** cylinders6 -0.1201 0.0027 -44.113 < 2e-16 *** cylinders10 0.2364 0.0056 42.428 < 2e-16 *** cylinders12 0.2428 0.0080 30.451 < 2e-16 *** BODYSTYLEconvertible 0.1467 0.0017 85.684 < 2e-16 *** BODYSTYLEcoupe 0.0261 0.0014 19.003 < 2e-16 *** BODYSTYLEhatchback 0.0209 0.0015 14.213 < 2e-16 *** BODYSTYLEwagon 0.0823 0.0017 48.44 < 2e-16 *** MPGCity:HybrDummy 0.0018 0.0004 4.759 1.95E-06 *** MPGHwy:HybrDummy -0.0102 0.0005 -19.042 < 2e-16 *** HybrDummy:torque 0.0005 0.0000 15.657 < 2e-16 *** displacement:MPGHwy 0.0207 0.0003 65.344 < 2e-16 *** displacement:MPGCity -0.0160 0.0004 -40.57 < 2e-16 *** horsepower:MPGCity 0.0001 0.0000 14.697 < 2e-16 *** horsepower:MPGHwy -0.0002 0.0000 -68.501 < 2e-16 *** weight:MPGHwy 0.00001 0.0000 13.83 < 2e-16 *** weight:MPGCity 0.0000 0.0000 -0.219 0.827012

  • 2. Data
  • 3. Background on New Vehicle Hedonic Research
  • 4. Product Cycle
  • 5. Hedonic Methods
    • Model
    • Hedonic Imputation Indexes
    • Time Dummy Hedonic
    • Time-Product Dummy
    • Multilaterals with Hedonic Imputation
    • Similarity Linking with Hedonic Imputation
  • 6. Matched Model Indexes and Product Definition
  • 7. Results
    • Matched Model Index Results
    • Product Grouping and Multilateral Indexes
    • Pooled and Adjacent Period Time Dummy Hedonic
    • Bilateral Hedonic Imputation and Time-Product Dummy
    • Hedonic Imputation with Multilateral Methods
    • Similarity Linking with Hedonic Imputation
  • 8. Conclusion
  • Works Cited
  • Appendix

Hedonic price estimates for new vehicles: When do rotations lead to drift? United States

Languages and translations
English

1 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Hedonic price estimates for new vehicles:

When do rotations lead to drift?

Brendan Williams Senior Economist

Consumer Price Index Division

Bureau of Labor Statistics

Prepared for Group of Experts 6 June 2023

2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Summary  New vehicles and other items subject to

product cycle effects  Multilateral methods alone do not address

product cycle Price change must be measured across versions Hedonic methods allow price measurement

across product cycles

 Hedonic imputation can be used with similarity linking New link method based on the similarity of

regression estimates based on a Chow test

3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Data  J.D. Power: Transaction records of car sales SquishVIN as product ID Some information on features

 Wards: Specification information Data on vehicle performance and other features

– Horsepower, torque, fuel efficiency, vehicle size…

 Combine based on manufacturer, engine type, and string matching for model and trim

4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Product Cycle  Intertemporal price discrimination Evidence documented in Aizcorbe, et al. (2010)

and Williams and Sager (2019) Price decreases consistently for a product over a

given model year

 Price updates with model updates Sellers introduce new pricing regimes with

product updates Related to theory of price rigidity

5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

20

30

40

50

60

70

80

90

100 Ja

n- 07

M ay

-0 7

Se p-

07

Ja n-

08

M ay

-0 8

Se p-

08

Ja n-

09

M ay

-0 9

Se p-

09

Ja n-

10

M ay

-1 0

Se p-

10

Ja n-

11

M ay

-1 1

Se p-

11

Ja n-

12

M ay

-1 2

Se p-

12

Ja n-

13

M ay

-1 3

Se p-

13

Ja n-

14

M ay

-1 4

Se p-

14

Ja n-

15

M ay

-1 5

Se p-

15

Ja n-

16

M ay

-1 6

Se p-

16

Ja n-

17

M ay

-1 7

Se p-

17

Ja n-

18

M ay

-1 8

Se p-

18

Matched Model (SquishVIN) Price Indices

Tornqvist Laspeyres Paasche Fisher

GEKSmean13 GEKSmean24 CCDImean13 CCDImean24

6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

20

30

40

50

60

70

80

90

100 Ja

n- 07

M ay

-0 7

Se p-

07

Ja n-

08

M ay

-0 8

Se p-

08

Ja n-

09

M ay

-0 9

Se p-

09

Ja n-

10

M ay

-1 0

Se p-

10

Ja n-

11

M ay

-1 1

Se p-

11

Ja n-

12

M ay

-1 2

Se p-

12

Ja n-

13

M ay

-1 3

Se p-

13

Ja n-

14

M ay

-1 4

Se p-

14

Ja n-

15

M ay

-1 5

Se p-

15

Ja n-

16

M ay

-1 6

Se p-

16

Ja n-

17

M ay

-1 7

Se p-

17

Ja n-

18

M ay

-1 8

Se p-

18

Matched Model (SquishVIN) Price Indices

Tornqvist Laspeyres Paasche Fisher GEKSmean13

GEKSmean24 CCDImean13 CCDImean24 SimPredShare SimPaaLaspSpread

7 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

60

65

70

75

80

85

90

95

100

105 Ja

n- 07

Ap r-

07 Ju

l-0 7

Oc t-0

7 Ja

n- 08

Ap r-

08 Ju

l-0 8

Oc t-0

8 Ja

n- 09

Ap r-

09 Ju

l-0 9

Oc t-0

9 Ja

n- 10

Ap r-

10 Ju

l-1 0

Oc t-1

0 Ja

n- 11

Ap r-

11 Ju

l-1 1

Oc t-1

1 Ja

n- 12

Ap r-

12 Ju

l-1 2

Oc t-1

2 Ja

n- 13

Ap r-

13 Ju

l-1 3

Oc t-1

3 Ja

n- 14

Ap r-

14 Ju

l-1 4

Oc t-1

4 Ja

n- 15

Ap r-

15 Ju

l-1 5

Oc t-1

5 Ja

n- 16

Ap r-

16 Ju

l-1 6

Oc t-1

6 Ja

n- 17

Ap r-

17 Ju

l-1 7

Oc t-1

7 Ja

n- 18

Ap r-

18 Ju

l-1 8

Oc t-1

8

Product Grouping with CCDI

OhtaGrilGrouped OhtaGrilGroupedCCDI13 OhtaGrilGroupedCCDI24

CourtGrouped CourtGroupCCDI13 CourtGroupCCDI24

8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Historical Models Court (1939)

Griliches (1961)

Triplett (1969)

Triplett Truncated (1969)

Cowling & Cubbin (1972)

Ohta & Griliches (1976)

Weight x x x x x Wheelbase x Length/wheelbase Horsepower x X x x x Length Length/wheelbase x x x V8 X x x Hardtop x x x Transmission x X Comb.

Power brakes

x Comb. Comb. x

Power steering x Comb. Comb.

Compact x x x Over4Gears x Luxury x PassengerArea x

Efficiency x Make Indicator

variables

9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Interacted Model  Wheelbase, length, weight, horsepower,

displacement, height, MPGCity, MPGHwy, torque

 Indicators: # of Cylinders, Make, Bodystyle, Hybrid, AWD

 Interactions: Hybrid (MPGCity, MPGHwy, torque) MPGCity/MPGHwy (weight, horsepower,

displacement) horsepower/weight

10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Basic Hedonic Methods  A bilateral Time-Product Dummy, WLS

regression Nearly identical to matched model

 TPD is a “fully interacted” time-dummy hedonic, Krsinich (2016)

 Pooled TDH constrains feature values to a constant over pooled time period

 Hedonic imputation: Hedonic predicted price for each specification weighted with observed quantities

11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

85

90

95

100

105

110

115

120

125 Ja

n- 07

M ay

-0 7

Se p-

07

Ja n-

08

M ay

-0 8

Se p-

08

Ja n-

09

M ay

-0 9

Se p-

09

Ja n-

10

M ay

-1 0

Se p-

10

Ja n-

11

M ay

-1 1

Se p-

11

Ja n-

12

M ay

-1 2

Se p-

12

Ja n-

13

M ay

-1 3

Se p-

13

Ja n-

14

M ay

-1 4

Se p-

14

Ja n-

15

M ay

-1 5

Se p-

15

Ja n-

16

M ay

-1 6

Se p-

16

Ja n-

17

M ay

-1 7

Se p-

17

Ja n-

18

M ay

-1 8

Se p-

18

Pooled vs Adjacent Period Hedonic Indexes

CourtPooled CourtAdj

OhtaGrilPooled OhtaGrilsAdj

InteractPool InteractAdj

12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Pooled Regression Fits

Model Specification R-Squared Adjusted R-Squared

Court 0.6887 0.6884

Ohta & Griliches 0.8717 0.8715

Interacted 0.9253 0.9252

13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

55

65

75

85

95

105

115

Ja n-

07

M ay

-0 7

Se p-

07

Ja n-

08

M ay

-0 8

Se p-

08

Ja n-

09

M ay

-0 9

Se p-

09

Ja n-

10

M ay

-1 0

Se p-

10

Ja n-

11

M ay

-1 1

Se p-

11

Ja n-

12

M ay

-1 2

Se p-

12

Ja n-

13

M ay

-1 3

Se p-

13

Ja n-

14

M ay

-1 4

Se p-

14

Ja n-

15

M ay

-1 5

Se p-

15

Ja n-

16

M ay

-1 6

Se p-

16

Ja n-

17

M ay

-1 7

Se p-

17

Ja n-

18

M ay

-1 8

Se p-

18

Adjacent TDH vs Hedonic Imputation

CourtAdj CourtHITorn

OhtaGrilsAdj OhtaGrilHItorn

InteractAdj InterHItorn

MMTornqvist TPDummyWLS

14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Product Cycle and Measurement

 Long-term objective price change should fully reflect the difference between completely different regimes

 In the case of IPD, long-run relative may still be biased, but will not compound and cycle patterns may disperse over a longer time horizon

15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Index Methods and Product Cycle: Accuracy Issues

 Matched model, TPD Inaccurate: Price change omitted between regimes

 Product grouping or product matching Partially accurate: Dependent on matching method and

weighting

 Short-term hedonic imputation and adjacent period TDH Partially accurate: Dependent on weighting of transition

16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Index Methods and Product Cycle: Most Accurate Measures

 Long-term relatives  Intermediate price changes are transitive or

not included  Methods: Hedonic imputation with fixed base

– No chain drift, but dependent on base period and losses representivity

Hedonic imputation multilaterals Similarity linking

17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

95

97

99

101

103

105

107

109

111

113

115

Ja n-

07

M ay

-0 7

Se p-

07

Ja n-

08

M ay

-0 8

Se p-

08

Ja n-

09

M ay

-0 9

Se p-

09

Ja n-

10

M ay

-1 0

Se p-

10

Ja n-

11

M ay

-1 1

Se p-

11

Ja n-

12

M ay

-1 2

Se p-

12

Ja n-

13

M ay

-1 3

Se p-

13

Ja n-

14

M ay

-1 4

Se p-

14

Ja n-

15

M ay

-1 5

Se p-

15

Ja n-

16

M ay

-1 6

Se p-

16

Ja n-

17

M ay

-1 7

Se p-

17

Ja n-

18

M ay

-1 8

Se p-

18

Ohta and Griliches Model Imputes with Multilaterals

Adj TDH Hed Impute HI CCDI Mean13 HI CCDI Mean 24 HI CCDI Mean 36 HI Fixed Base

18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Similarity Linking Indexes  Pass multiperiod identity test and are fully

transitive  A new period index, &#x1d43c;&#x1d43c;&#x1d461;&#x1d461;, is found by creating a

bilateral index of the most similar previous period

 Different methods exist for quantifying similarity or dissimilarity

&#x1d43c;&#x1d43c;&#x1d461;&#x1d461; = &#x1d43c;&#x1d43c;&#x1d461;&#x1d461;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446; × &#x1d443;&#x1d443;(&#x1d45d;&#x1d45d;&#x1d461;&#x1d461;, &#x1d45e;&#x1d45e;&#x1d461;&#x1d461;, &#x1d45d;&#x1d45d;&#x1d461;&#x1d461;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;, &#x1d45e;&#x1d45e;&#x1d461;&#x1d461;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;)

19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Similarity Linking and Hedonics  Use hedonic imputed price with a similarity

linking method  New similarity linking method Estimate a single period hedonic regression and

find the previous period with the closest fit

 For all prior periods to t, t-a, find  &#x1d439;&#x1d439; = (&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d436;&#x1d436;&#x1d436;&#x1d436;&#x1d446;&#x1d446;&#x1d436;&#x1d436;&#x1d436;&#x1d436;−(&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461;−&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461;−&#x1d44e;&#x1d44e;))/&#x1d458;&#x1d458;

(&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461;−&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d446;&#x1d461;&#x1d461;−&#x1d44e;&#x1d44e;)/(&#x1d441;&#x1d441;&#x1d461;&#x1d461;+&#x1d441;&#x1d441;&#x1d461;&#x1d461;−&#x1d44e;&#x1d44e;−2&#x1d458;&#x1d458;)

The prior period with the lowest Chow test statistic is used as a link

20 — U.S. BUREAU OF LABOR STATISTICS • bls.gov20 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

95

97

99

101

103

105

107

109

111

Ja n-

07 Ap

r- 07

Ju l-0

7 Oc

t-0 7

Ja n-

08 Ap

r- 08

Ju l-0

8 Oc

t-0 8

Ja n-

09 Ap

r- 09

Ju l-0

9 Oc

t-0 9

Ja n-

10 Ap

r- 10

Ju l-1

0 Oc

t-1 0

Ja n-

11 Ap

r- 11

Ju l-1

1 Oc

t-1 1

Ja n-

12 Ap

r- 12

Ju l-1

2 Oc

t-1 2

Ja n-

13 Ap

r- 13

Ju l-1

3 Oc

t-1 3

Ja n-

14 Ap

r- 14

Ju l-1

4 Oc

t-1 4

Ja n-

15 Ap

r- 15

Ju l-1

5 Oc

t-1 5

Ja n-

16 Ap

r- 16

Ju l-1

6 Oc

t-1 6

Ja n-

17 Ap

r- 17

Ju l-1

7 Oc

t-1 7

Ja n-

18 Ap

r- 18

Ju l-1

8 Oc

t-1 8

Similarity Linking Indices for Ohta & Griliches Model

Min Chow Test Predicted Share Paasche Laspeyres Divergence

21 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

95

97

99

101

103

105

107

109

111

113

115

Ja n-

07

M ay

-0 7

Se p-

07

Ja n-

08

M ay

-0 8

Se p-

08

Ja n-

09

M ay

-0 9

Se p-

09

Ja n-

10

M ay

-1 0

Se p-

10

Ja n-

11

M ay

-1 1

Se p-

11

Ja n-

12

M ay

-1 2

Se p-

12

Ja n-

13

M ay

-1 3

Se p-

13

Ja n-

14

M ay

-1 4

Se p-

14

Ja n-

15

M ay

-1 5

Se p-

15

Ja n-

16

M ay

-1 6

Se p-

16

Ja n-

17

M ay

-1 7

Se p-

17

Ja n-

18

M ay

-1 8

Se p-

18

Ohta and Griliches Model Imputes with Multilaterals

Adj TDH Hed Impute HI CCDI Mean13 HI CCDI Mean 24

HI CCDI Mean 36 Sim Chow Test HI Fixed Base

22 — U.S. BUREAU OF LABOR STATISTICS • bls.gov22 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Conclusions  Product cycle effects dominate quality change

in certain markets.  Product cycle can drive drift and chain

drift/multilateral may be a secondary effect.  Hedonic imputation should be preferred over

product matching when possible.  Similarity linking with hedonic imputation

appears promising.

23 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Contact Information

Brendan Williams Senior Economist

Consumer Prices Division Office of Prices and Living Conditions

www.bls.gov/cpi [email protected]

  • Hedonic price estimates for new vehicles: �When do rotations lead to drift?
  • Summary
  • Data
  • Product Cycle
  • Matched Model Price Indices
  • Matched Model Price Indices 2
  • Product Grouping with CCDI
  • Historical Models
  • Interacted Model
  • Basic Hedonic Methods
  • Pooled vs Adjacent Period TDH
  • Pooled Regression Fits
  • Adjacent TDH vs Hedonic Imputation
  • Product Cycle and Measurement
  • Index Methods and Product Cycle: Accuracy Issues
  • Index Methods and Product Cycle: Most Accurate Measures
  • Ohta and Griliches Model
  • Similarity Linking Indexes
  • Similarity Linking and Hedonics
  • Similarity Linking Indices
  • Ohta and Griliches Model 2
  • Conclusions
  • Contact Information

Expanding the family of US Consumer Price Indexes

Languages and translations
English

1 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Expanding the family of US Consumer Price Indexes

Anya Stockburger, Bill Johnson, Joshua Klick, Paul Liegey, Robert Martin,

Bureau of Labor Statistics

Meeting of the Group of Experts on CPIs

June 8, 2023

2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

CPI Family of Indexes - Concepts

CPI-U

Chained CPI-U

CPI-W, R-CPI-E

R-CPI-Income

• Best for escalationHousehold Cost Indexes

Measure change in purchasing power of the average dollar of expenditure

Measure tied to outlays explicitly related to household purchasing

3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Outline

Motivation

Income-based indexes

Household Cost Indexes

Next steps

4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Motivation – Increased need for data granularity

 Committee on National Statistics recommendation  Federal Reserve Bank interest  Office of Management and Budget, Bureau of

Economic Analysis, and other government interest  General user interest (major media)  Publications: Initial working paper, Spotlight on

Statistics

5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

CPI by Income Methodology

$12,000

$118,000

0

20,000

40,000

60,000

80,000

100,000

120,000

140,000

Q1 Q5

Median Equivalized Income (Interview Survey - 2021)

Expenditure weights Group CE respondents into weighted ranking of equivalized income quintiles

Prices/rents All lower-level data the same (prices, outlets, rents)

Index aggregation Lowe, Tornqvist aggregation from lowest-level basic indexes

6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income

0% 5% 10% 15% 20% 25% 30%

Rent

Food at home

Motor fuel

Owner's equivalent rent

Vehicles and maintenance

Food away from home

Recreation

Q1 U Q5

7 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Annualized Inflation Gap Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 -

December 2022

2.6

2.5

2.5

2.4

2.3

2.4

2.1

2.2

2.3

2.4

2.5

2.6

2.7

Q1 Q2 Q3 Q4 Q5

Income Quintiles Urban

8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Inflation Gap Variation Lowest income quintile – Highest income quintile

Annual 12-month percent change December 2006 – December 2022

-1.0%

-0.5%

0.0%

0.5%

1.0%

1.5%

Equivalized income Unadjusted income

9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

2022 Year over year inflation gap (Q1-Q5) CPI contributions to All-Items (percentage)

-5 0 5 10 15

Rent primary residence Gasoline (all types)

Electricity Utility (piped) gas service

Cigarettes Motor vehicle insurance

Limited service meals/snacks Nonfrozen noncarbonated juices…

Cable & satellite tv/radio Chicken

Club membership Child care & nursery school

Owners' rent secondary residence Leased cars and trucks

Full service meals and snacks Commercial Health Insurance

Owners' rent primary residence Lodging away from home

Airline fare New vehicles

10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Limitations and Future Improvements

 Lower-level price heterogeneity Re-weighting housing prices shows little impact (Malloy,

Larson 2021) 2/3rd price change in grocery items missed (Jaravel 2019)

 BLS future research Further investigate housing adjustments Re-weighting alternative data (gasoline, new vehicles) Interested in a scanner data program (CNSTAT

recommendation), but funding…

11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Household Cost Index

 Inspired by Office for National Statistics and Statistics New Zealand

 Definition: Measure the change in cash outflows required, on average, for households to access the goods and services they consume

 Methodology:  Household (democratic) aggregation,  Payments-approach to owner-occupied housing  Urban population

12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Household (Democratic) Aggregation

 Create household-level expenditure shares using Consumer Expenditure Survey data  Eligible expenditures from the Diary survey imputed to the Interview sample

using a matching procedure based on Hobijn, et. al. (2009)

 Aggregation across households  Lowe formula with lagged expenditure weights

 Limitations  Infrequent purchases (particularly vehicles) pose a challenge  Limit to household with 4 quarters of expenditures (limits use of data to about

a third)

13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Payments Approach – Mortgage Interest Payment

 Weights Consumer Expenditure Survey

 Prices Mortgage interest payment index =

Debt index * Interest rate index Data sources: • Federal Housing Finance Agency’s All Transactions House Price

Index • Freddie Mac Primary Mortgage Market Survey

14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Payments Approach – Property Tax Payments

 Weights Consumer Expenditure Survey

 Prices Property Tax Payment Index =

Total property tax payments * Constant quality Total housing stock value home price index

Data source: CE

15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

HCI – Relative Importance Major Group CPI-U HCI-U

Food and Beverages 15.2 20.1 Housing 42.4 34.3 Apparel 2.7 3.1 Transportation 15.2 14.3 Medical 8.9 11.1 Recreation 5.8 6.6 Education and Comm. 6.8 6.7 Other 3.2 3.7 Reference Period 2017-18 2019 CE Sample Full 4-quarter

16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

HCI – Index Results

1

1.05

1.1

1.15

1.2

1.25

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

lowe-u (ew, req) lowe-u (ew, pay) cpi-u hci-u

Average 12-month % change CPI-U 1.86% HCI (Payments Approach + Household Aggregation)

1.52%

HCI-U (Payments Approach Only) 1.48%

17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Limitations - HCI

 Household aggregation Infrequent purchases (challenge especially with

Tornqvist) Include in HCI given small impact?

 Payments approach Investigate a microdata approach for mortgage

interest index Investigate including mortgage principal

18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

What’s next?

Improve methodology

Income-group specific lower- level indexes

Next step for HCI research?

Stakeholder outreach

Group of Experts BLS advisory committees Federal Committee on Statistical Methodology

Publish regular updates

R-CPI-Income C-CPI-Income

HCI?

Contact Information

19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Anya Stockburger Chief, Branch of Revision Methodology

Division of Consumer Price Indexes www.bls.gov/cpi

[email protected]

  • Expanding the family of US Consumer Price Indexes
  • CPI Family of Indexes - Concepts
  • Outline
  • Motivation – Increased need for data granularity
  • CPI by Income Methodology
  • Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income
  • Annualized Inflation Gap�Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 - December 2022
  • Inflation Gap Variation�Lowest income quintile – Highest income quintile�Annual 12-month percent change�December 2006 – December 2022
  • 2022 Year over year inflation gap (Q1-Q5) CPI contributions to All-Items (percentage)�
  • Limitations and Future Improvements
  • Household Cost Index
  • Household (Democratic) Aggregation
  • Payments Approach – Mortgage Interest Payment
  • Payments Approach – Property Tax Payments
  • HCI – Relative Importance
  • HCI – Index Results
  • Limitations - HCI
  • What’s next?
  • Contact Information

Expanding the family of US Consumer Price Indexes

Languages and translations
English

1 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Expanding the family of US Consumer Price Indexes

Anya Stockburger, Bill Johnson, Joshua Klick, Paul Liegey, Robert Martin,

Bureau of Labor Statistics

Meeting of the Group of Experts on CPIs

June 8, 2023

2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov2 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

CPI Family of Indexes - Concepts

CPI-U

Chained CPI-U

CPI-W, R-CPI-E

R-CPI-Income

• Best for escalationHousehold Cost Indexes

Measure change in purchasing power of the average dollar of expenditure

Measure tied to outlays explicitly related to household purchasing

3 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Outline

Motivation

Income-based indexes

Household Cost Indexes

Next steps

4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov4 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Motivation – Increased need for data granularity

 Committee on National Statistics recommendation  Federal Reserve Bank interest  Office of Management and Budget, Bureau of

Economic Analysis, and other government interest  General user interest (major media)  Publications: Initial working paper, Spotlight on

Statistics

5 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

CPI by Income Methodology

$12,000

$118,000

0

20,000

40,000

60,000

80,000

100,000

120,000

140,000

Q1 Q5

Median Equivalized Income (Interview Survey - 2021)

Expenditure weights Group CE respondents into weighted ranking of equivalized income quintiles

Prices/rents All lower-level data the same (prices, outlets, rents)

Index aggregation Lowe, Tornqvist aggregation from lowest-level basic indexes

6 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income

0% 5% 10% 15% 20% 25% 30%

Rent

Food at home

Motor fuel

Owner's equivalent rent

Vehicles and maintenance

Food away from home

Recreation

Q1 U Q5

7 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Annualized Inflation Gap Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 -

December 2022

2.6

2.5

2.5

2.4

2.3

2.4

2.1

2.2

2.3

2.4

2.5

2.6

2.7

Q1 Q2 Q3 Q4 Q5

Income Quintiles Urban

8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov8 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Inflation Gap Variation Lowest income quintile – Highest income quintile

Annual 12-month percent change December 2006 – December 2022

-1.0%

-0.5%

0.0%

0.5%

1.0%

1.5%

Equivalized income Unadjusted income

9 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

2022 Year over year inflation gap (Q1-Q5) CPI contributions to All-Items (percentage)

-5 0 5 10 15

Rent primary residence Gasoline (all types)

Electricity Utility (piped) gas service

Cigarettes Motor vehicle insurance

Limited service meals/snacks Nonfrozen noncarbonated juices…

Cable & satellite tv/radio Chicken

Club membership Child care & nursery school

Owners' rent secondary residence Leased cars and trucks

Full service meals and snacks Commercial Health Insurance

Owners' rent primary residence Lodging away from home

Airline fare New vehicles

10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov10 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Limitations and Future Improvements

 Lower-level price heterogeneity Re-weighting housing prices shows little impact (Malloy,

Larson 2021) 2/3rd price change in grocery items missed (Jaravel 2019)

 BLS future research Further investigate housing adjustments Re-weighting alternative data (gasoline, new vehicles) Interested in a scanner data program (CNSTAT

recommendation), but funding…

11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov11 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Household Cost Index

 Inspired by Office for National Statistics and Statistics New Zealand

 Definition: Measure the change in cash outflows required, on average, for households to access the goods and services they consume

 Methodology:  Household (democratic) aggregation,  Payments-approach to owner-occupied housing  Urban population

12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov12 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Household (Democratic) Aggregation

 Create household-level expenditure shares using Consumer Expenditure Survey data  Eligible expenditures from the Diary survey imputed to the Interview sample

using a matching procedure based on Hobijn, et. al. (2009)

 Aggregation across households  Lowe formula with lagged expenditure weights

 Limitations  Infrequent purchases (particularly vehicles) pose a challenge  Limit to household with 4 quarters of expenditures (limits use of data to about

a third)

13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov13 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Payments Approach – Mortgage Interest Payment

 Weights Consumer Expenditure Survey

 Prices Mortgage interest payment index =

Debt index * Interest rate index Data sources: • Federal Housing Finance Agency’s All Transactions House Price

Index • Freddie Mac Primary Mortgage Market Survey

14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov14 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Payments Approach – Property Tax Payments

 Weights Consumer Expenditure Survey

 Prices Property Tax Payment Index =

Total property tax payments * Constant quality Total housing stock value home price index

Data source: CE

15 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

HCI – Relative Importance Major Group CPI-U HCI-U

Food and Beverages 15.2 20.1 Housing 42.4 34.3 Apparel 2.7 3.1 Transportation 15.2 14.3 Medical 8.9 11.1 Recreation 5.8 6.6 Education and Comm. 6.8 6.7 Other 3.2 3.7 Reference Period 2017-18 2019 CE Sample Full 4-quarter

16 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

HCI – Index Results

1

1.05

1.1

1.15

1.2

1.25

2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021

lowe-u (ew, req) lowe-u (ew, pay) cpi-u hci-u

Average 12-month % change CPI-U 1.86% HCI (Payments Approach + Household Aggregation)

1.52%

HCI-U (Payments Approach Only) 1.48%

17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov17 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Limitations - HCI

 Household aggregation Infrequent purchases (challenge especially with

Tornqvist) Include in HCI given small impact?

 Payments approach Investigate a microdata approach for mortgage

interest index Investigate including mortgage principal

18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov18 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

What’s next?

Improve methodology

Income-group specific lower- level indexes

Next step for HCI research?

Stakeholder outreach

Group of Experts BLS advisory committees Federal Committee on Statistical Methodology

Publish regular updates

R-CPI-Income C-CPI-Income

HCI?

Contact Information

19 — U.S. BUREAU OF LABOR STATISTICS • bls.gov

Anya Stockburger Chief, Branch of Revision Methodology

Division of Consumer Price Indexes www.bls.gov/cpi

[email protected]

  • Expanding the family of US Consumer Price Indexes
  • CPI Family of Indexes - Concepts
  • Outline
  • Motivation – Increased need for data granularity
  • CPI by Income Methodology
  • Snapshot of spending weights by population, 2019-2020 biennial expenditure weight share, equivalized income
  • Annualized Inflation Gap�Annualized inflation rate, CPI by income quintile, Lowe Formula, December 2005 - December 2022
  • Inflation Gap Variation�Lowest income quintile – Highest income quintile�Annual 12-month percent change�December 2006 – December 2022
  • 2022 Year over year inflation gap (Q1-Q5) CPI contributions to All-Items (percentage)�
  • Limitations and Future Improvements
  • Household Cost Index
  • Household (Democratic) Aggregation
  • Payments Approach – Mortgage Interest Payment
  • Payments Approach – Property Tax Payments
  • HCI – Relative Importance
  • HCI – Index Results
  • Limitations - HCI
  • What’s next?
  • Contact Information