Skip to main content

France

(France, UK) Proposal for a new supplement to UN Regulation No. 155

Languages and translations
English

Submitted by the experts from France and the United Kingdom of Great Britain and Northern Ireland

Informal document GRVA-17-13 17th GRVA, 25-29 September 2023 Provisional agenda item 5(a)

ECE/TRANS/WP.29/1129

Proposal for a new supplement to UN Regulation No. 155

The text below was prepared by the experts from France and the United Kingdom of Great Britain and Northern Ireland. The modifications to the existing text of the Regulation are marked in bold for new or strikethrough for deleted characters.

I. Proposal

Paragraph 1.1., amend to read:

“1.1. This Regulation applies to vehicles, with regard to cyber security, of the Categories L, M and , N, O, R, S and T, if fitted with at least one electronic control unit.

This Regulation also applies to vehicles of Category O if fitted with at least one electronic control unit.”

Paragraph 1.2., shall be deleted:

1.2. This Regulation also applies to vehicles of the Categories L6 and L7 if equipped with automated driving functionalities from level 3 onwards, as defined in the reference document with definitions of Automated Driving under WP.29 and the General Principles for developing a UN Regulation on automated vehicles (ECE/TRANS/WP.29/1140).

Paragraphs 1.3. (former) and 1.4., renumber as paragraphs 1.2. and 1.3.

Paragraph 7.3.1., amend to read:

“7.3.1. The manufacturer shall have a valid Certificate of Compliance for the Cyber

Security Management System relevant to the vehicle type being approved. However, for type approvals of vehicles of Categories M, N and O first issued before 1 July 2024, and for type approvals of vehicles of Categories L, R, S and T first issued before 1 July 2027, and for each extension thereof, if the vehicle manufacturer can demonstrate that the vehicle type could not be developed in compliance with the CSMS, then the vehicle manufacturer shall demonstrate that cyber security was adequately considered during the development phase of the vehicle type concerned.”

Paragraph 7.3.4., amend to read:

“7.3.4. The vehicle manufacturer shall protect the vehicle type against risks identified in the vehicle manufacturer’s risk assessment. Proportionate mitigations shall be implemented to protect the vehicle type. The mitigations implemented shall include all mitigations referred to in Annex 5, Part B and C which are relevant for the risks identified. However, if a mitigation referred to in Annex 5, Part B or C, is not relevant or not sufficient for the risk identified, the vehicle manufacturer shall ensure that another appropriate mitigation is implemented. In particular, for type approvals of vehicles of Categories M, N and O first issued before 1 July 2024, and for type approvals of vehicles of Categories L, R, S and T first issued before 1 July 2027, and for each extension thereof, the vehicle manufacturer shall ensure that another appropriate mitigation is implemented if a mitigation measure referred to in Annex 5, Part B or C is technically not feasible. The respective assessment of the technical feasibility shall be provided by the manufacturer to the approval authority.”

II. Justification

1. At the 16th session of GRVA in May 2023, the subsidiary Working Party accepted the Chair’s proposal to finalise the discussion of the inclusion of all categories of vehicles in UN Regulation No. 155 at its 17th session in September.

2. The purpose of UN Regulation No. 155 is to offer an international framework for the homologation of road vehicles with regard to cyber security. Therefore, GRVA should strive to offer the broadest scope possible to its Contracting Parties, and to allow manufacturers of vehicles of any relevant category to apply for a type approval.

3. During the previous sessions of GRVA and of its informal working group on cyber security and software updates, no technical argument was put forward to justify the exclusion of vehicles of Categories L, R, S and T from the scope of the Regulation. Not including these categories thus forces Contracting Parties and regional organisations to use national or regional laws on cyber security for these categories of vehicles. This could lead to unique requirements and a level of divergence that could be onerous on the industry.

4. The scope of UN Regulation No. 156 already includes all categories of vehicles: this current discrepancy between the two Regulations is an implicit statement that some vehicles, while able to receive over-the-air software updates, should not be type approved with regard to cyber security. Aligning the scope of UN Regulation No. 155 with that of UN Regulation No. 156 is a logical step towards a comprehensive regulatory environment for connected vehicles.

5. Similarly to what was granted to Categories M and N in the original version of the Regulation (paragraphs 7.3.1. and 7.3.4.), an adequate lead time is necessary for manufacturers of vehicles of the categories introduced in this proposal to demonstrate adequate cybersecurity measures for the approval of vehicle types whose development phase started prior to the implementation of the manufacturer’s Cyber Security Management System. Category L vehicles that were already in scope of the Regulation have been included in this lead time to simplify the drafting and remove reference to SAE levels of automation. As the provisions still require demonstration that cyber security was adequately addressed and any alternative mitigations are appropriate, there should be no issues in allowing additional time in this case.

Submitted by the experts from France and the United Kingdom of Great Britain and Northern Ireland

Informal document GRVA-17-13 17th GRVA, 25-29 September 2023 Provisional agenda item 5(a)

1

Proposal for a new supplement to UN Regulation No. 155

The text below was prepared by the experts from France and the United Kingdom of Great Britain and Northern Ireland. The modifications to the existing text of the Regulation are marked in bold for new or strikethrough for deleted characters.

I. Proposal

Paragraph 1.1., amend to read:

“1.1. This Regulation applies to vehicles, with regard to cyber security, of the Categories L, M and, N, O, R, S and T, if fitted with at least one electronic control unit.

This Regulation also applies to vehicles of Category O if fitted with at least one electronic control unit.”

Paragraph 1.2., shall be deleted:

“1.2. This Regulation also applies to vehicles of the Categories L6 and L7 if equipped with automated driving functionalities from level 3 onwards, as defined in the reference document with definitions of Automated Driving under WP.29 and the General Principles for developing a UN Regulation on automated vehicles (ECE/TRANS/WP.29/1140).”

Paragraphs 1.3. (former) and 1.4., renumber as paragraphs 1.2. and 1.3.

Paragraph 7.3.1., amend to read:

“7.3.1. The manufacturer shall have a valid Certificate of Compliance for the Cyber

Security Management System relevant to the vehicle type being approved. However, for type approvals of vehicles of Categories M, N and O first issued before 1 July 2024, and for type approvals of vehicles of Categories L, R, S and T first issued before 1 July 2027, and for each extension thereof, if the vehicle manufacturer can demonstrate that the vehicle type could not be developed in compliance with the CSMS, then the vehicle manufacturer shall demonstrate that cyber security was adequately considered during the development phase of the vehicle type concerned.”

Paragraph 7.3.4., amend to read:

“7.3.4. The vehicle manufacturer shall protect the vehicle type against risks identified in the vehicle manufacturer’s risk assessment. Proportionate mitigations shall be implemented to protect the vehicle type. The mitigations implemented shall include all mitigations referred to in Annex 5, Part B and C which are relevant for the risks identified. However, if a mitigation referred to in Annex 5, Part B or C, is not relevant or not sufficient for the risk identified, the vehicle manufacturer shall ensure that another appropriate mitigation is implemented. In particular, for type approvals of vehicles of Categories M, N and O first issued before 1 July 2024, and for type approvals of vehicles of Categories L, R, S and T first issued before 1 July 2027, and for each extension thereof, the vehicle manufacturer shall ensure that another appropriate mitigation is implemented if a mitigation measure referred to in Annex 5, Part B or C is technically not feasible. The respective assessment of the technical feasibility shall be provided by the manufacturer to the approval authority.”

2

II. Justification

1. At the 16th session of GRVA in May 2023, the subsidiary Working Party accepted the Chair’s proposal to finalise the discussion of the inclusion of all categories of vehicles in UN Regulation No. 155 at its 17th session in September.

2. The purpose of UN Regulation No. 155 is to offer an international framework for the homologation of road vehicles with regard to cyber security. Therefore, GRVA should strive to offer the broadest scope possible to its Contracting Parties, and to allow manufacturers of vehicles of any relevant category to apply for a type approval.

3. During the previous sessions of GRVA and of its informal working group on cyber security and software updates, no technical argument was put forward to justify the exclusion of vehicles of Categories L, R, S and T from the scope of the Regulation. Not including these categories thus forces Contracting Parties and regional organisations to use national or regional laws on cyber security for these categories of vehicles. This could lead to unique requirements and a level of divergence that could be onerous on the industry.

4. The scope of UN Regulation No. 156 already includes all categories of vehicles: this current discrepancy between the two Regulations is an implicit statement that some vehicles, while able to receive over-the-air software updates, should not be type approved with regard to cyber security. Aligning the scope of UN Regulation No. 155 with that of UN Regulation No. 156 is a logical step towards a comprehensive regulatory environment for connected vehicles.

5. Similarly to what was granted to Categories M and N in the original version of the Regulation (paragraphs 7.3.1. and 7.3.4.), an adequate lead time is necessary for manufacturers of vehicles of the categories introduced in this proposal to demonstrate adequate cybersecurity measures for the approval of vehicle types whose development phase started prior to the implementation of the manufacturer’s Cyber Security Management System. Category L vehicles that were already in scope of the Regulation have been included in this lead time to simplify the drafting and remove reference to SAE levels of automation. As the provisions still require demonstration that cyber security was adequately addressed and any alternative mitigations are appropriate, there should be no issues in allowing additional time in this case.

Exploring methodologies to integrate new scanner data in the French CPI: Making use of multilateral methods

Languages and translations
English

MEETING OF THE GROUP OF EXPERTS ON CPI 7 JUNE 2023

Exploring methodologies to integrate new scanner data in the French CPI:

Making use of multilateral methods

MEETING OF THE GROUP OF EXPERTS ON CPI 2

1 CONTEXT AND GOALS 2 THEORY 3 RESULTS : BY VARIETY (COICOP 7 DIGITS) 4 RESULTS : BY COICOP 6 DIGITS(MAKE UP) 5 RESULTS : CONTRIBUTIONS

MEETING OF THE GROUP OF EXPERTS ON CPI 3

INTRODUCTION01

MEETING OF THE GROUP OF EXPERTS ON CPI 4

CONTEXT

– We are starting to receive data from 2 hard discounters. – We already have and use in production (since Jan 2020) scanner data from other retailers – Our current methodology with scanner data requires an external referential allowing us from GTIN/EAN to

have ● Additional characteristics (volume, unit, label, color ...) ● Nomenclature

– With classification rules and using the characteristics we classify at the variety level (level 7 of COICOP, French specificity).

● We are able to group EAN into equivalence classes to follow products better, avoid basket churn and catch the relaunches.

● We compute a Geometric Laspeyres, the methodology is similar than with the field collected data and the quality adjustment is slightly different since we can use the price history for the replacement product.

– Hard discount data has for now a low match rate with the referential (17 % of expenditure share according to 1 test file for one retailer and 39% for the other)

– We will experiment multilateral methods mainly to check what we could do without the referential and with the constraints of avoiding chain drift and basket churn.

MEETING OF THE GROUP OF EXPERTS ON CPI 5

TEST PROTOCOL/STRATEGY

– We will use our already possess scanner data (not enough history with hard discounters)

– Our product definition will vary between using GTIN/EAN or a article grouping methods (extended article number)

– We will compute micro indexes at the outlet level.

– We follow the average price of each product per month.

MEETING OF THE GROUP OF EXPERTS ON CPI 6

DATA OF THE EXPERIMENT

– Scanner data from January 2020 to December 2022, from 6 retailers (without hard discount because we don’t have background data).

– 3 varieties & their corresponding 6 digits COICOP level ● Whole milk & whole milk=> few replacements ● Foie gras & canned meat=> a high seasonality and 85% of

replacement during the year ● Lipstick & make up and care products => a lot of distinct

GTIN/EAN.

MEETING OF THE GROUP OF EXPERTS ON CPI 7

MULTILATERAL METHODS02

MEETING OF THE GROUP OF EXPERTS ON CPI 8

MULTILATERAL INDEXES TESTED

– We focus on GEKS-Törnqvist

where and

– The sample S can be a COICOP 6 digit level or a variety

– The product i can be the GTIN/EAN or an Extended article number ● Choice of the window size and splicing:

– Rolling window of size 13 and mean splice – Rolling window of size 25 and half splice

– Using R and IndexNumR package

IGEKS 0 , t =∏l=0

T ( I

0 , l

I t ,l ) 1 T +1=∏l=0

T ( I 0 ,l∗I l , t)

1 T +1

I T 0 ,t=∏i∈S

( pi t

p i 0 ) si 0+ si

t

2 si t=

pi tq i t

∑ j∈S p j t q j

t

MEETING OF THE GROUP OF EXPERTS ON CPI 9

RESULTS : VARIETY03

MEETING OF THE GROUP OF EXPERTS ON CPI 10

PRESENCE RATE PER VARIETY

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 0

0,2

0,4

0,6

0,8

1

1,2

Proportion of EAN x Outlet present in January 2020 and at the month m

foie gras presence rate

whole milk presence rate

lipstick/gloss presence rate

The presence rate is computed as

|N i∩N1| |N1|

Where are the products (EAN X Outlet in our case) sold in period i.

N i

– The presence rate is low for foie gras and lipstick

– There is a seasonality for foie gras

MEETING OF THE GROUP OF EXPERTS ON CPI 11

USING GTIN OR EXPENDED ARTICLE GROUP

90

95

100

105

110

115

120

-1 -0,7 -0,4 -0,1 0,2 0,5 0,8

Price indices for the variety whole milk between Jan 2020 and Dec 2022

(GEKS - GEKS by EAN) GEKS 25 months GEKS by ean 25 months

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 90

95

100

105

110

115

-1

-0,5

0

0,5

Price indices for the variety foie gras between January 2020 and December

2022

(GEKS - GEKS by EAN) GEKS 25 by extended article GEKS 25 by EAN

88 90 92 94 96 98 100 102

-3 -2,2 -1,4 -0,6 0,2

1 1,8 2,6

Price indices for the variety lipstick/gloss between Jan 2020 and Dec 2022

(GEKS - GEKS by EAN) GEKS by EAN 25 HASP GEKS 25 HASP with extend article number

Indexes using EAN or extended article number are very close for milk and foie gras.

There is more volatility for lipstick.

MEETING OF THE GROUP OF EXPERTS ON CPI 12

GEKS VS CURRENT CPI

jan vie

r-2 0

av ril-

20

jui lle

t-2 0

oc to

br e-

20

jan vie

r-2 1

av ril-

21

jui lle

t-2 1

oc to

br e-

21

jan vie

r-2 2

av ril-

22

jui lle

t-2 2

oc to

br e-

22 95

100 105 110 115 120 125

-2 -1,3 -0,6 0,1 0,8 1,5

Price indices for the variety whole milk between Jan 2020 and Dec 2022

CPI - GEKS by EAN CPI base 100 janv 20 GEKS by EAN half spliced 25 mois

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 90

95

100

105

110

115

120

-4 -3,1 -2,2 -1,3 -0,4 0,5 1,4 2,3 3,2

Price indices for the variety foie gras between January 2020 and December 2022

CPI - GEKS by ean CPI, base 100 = Jan 2020 GEKS by ean

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 80 85 90 95

100 105

-1 0,5 2 3,5 5

Price indices for the variety lipstick/gloss between Jan 2020 and Dec 2022

(CPI - GEKS ) CPI Base 100 janv 20 GEKS by EAN 25 HASP

There is more difference between GEKS and CPI than between two GEKS.

A highest volatility in some period (June 2020 for lipstick for instance)

MEETING OF THE GROUP OF EXPERTS ON CPI 13

RESULTS: COICOP 6 DIGITS (MAKE UP)04

MEETING OF THE GROUP OF EXPERTS ON CPI 14

MAKE UP AND CARE PRODUCT PRESENCE RATES

The match rate is computed as

|N i∩N j| |N i∪N j|

Where are the products (EAN X Outlet in our case) sold in period I.

There might be lockdowns effects in some periods.

Even for two consecutive periods, the match rate is quite low.

N i

MEETING OF THE GROUP OF EXPERTS ON CPI 15

UNCLASSIFIED DATA AT THE COICOP 6 DIGITS LEVEL

– In some COICOP 6 digits level we have a high proportion of unclassified data they can be

● Linked to field varieties (we do not have yet a corresponding scanner data variety) : nail make up for instance

● Do not correspond to the classification rules (a canned meat with honey flavour for instance)

They aren’t followed taken into account our current CPI.

– What would be the impact of keeping them ?

Expenditure share by variety for the poste make up and care product between Jan 2020 and Dec 2022

MEETING OF THE GROUP OF EXPERTS ON CPI 16

YEAR-ON-YEAR INFLATION PER VARIETY

13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36

-20

-15

-10

-5

0

5

10

15

20

Year-on-year inflation (GEKS 25 half spliced) for varieties of the poste make up and care products

between Jan 2021 and Dec 2022

face cleanser

mascara

Lipstick/gloss

face powder

women face cream

body care cream

unclassified

– The index for unclassified data is more volatile

● Beginning of 2021 ● Summer 2022

MEETING OF THE GROUP OF EXPERTS ON CPI 17

UNCLASSIFIED DATA AT THE POSTE LEVEL

1 4 7 10 13 16 19 22 25 28 31 34 70,00 %

75,00 %

80,00 %

85,00 %

90,00 %

95,00 %

100,00 %

105,00 %

110,00 %

115,00 %

120,00 %

Price indexes for the poste make up and care products

GEKS 25 HASP with unclassi- fied

GEKS 25 HASP without un- classified

GEKS 25 HASP by variety with CPI annual weights

CPI (Scanner data varieties only) aggregated

The trend is kept with unclassified data but there is a high volatility

MEETING OF THE GROUP OF EXPERTS ON CPI 18

RESULTS : CONTRIBUTIONS05

MEETING OF THE GROUP OF EXPERTS ON CPI 19

CONTRIBUTIONS BETWEEN TWO PERIODS

IGEKS−TQ t1 , t2 =∏i∈N

( pi t 2)wi

∗ , t 2

( pi t 1)w i

∗ , t 1∏t∈W ( pi

t) w i t , t 1−w i

t , t 2

cardW IGEKS−TQ t1 , t2 =∏i∈N

contributioni t1 ,t 2

ln (IGEKS−TQ t1 , t2 )=∑i∈N

ln (contributioni t1 ,t 2)

with unspliced indexes we can use the transitivity : IGEKS−TQ 12,13 =

IGEKS−TQ 1,13

IGEKS−TQ 1,12

– Goal : understand and explain the index variation from the observations

– We could also make sub groups (retailer, geo) to sum log(contrib)

MEETING OF THE GROUP OF EXPERTS ON CPI 20

CONTRIBUTIONS FOR LIPSTICK

17 18 19 20 21 22 23 24 25 26 27 28 29 92

93

94

95

96

97

98

99

100

101

102

GEKS-Tq ean X out- let 13 Mean splice

GEKS-Tq by EAN wi- thout splicing (period 17 as reference)

The decrease of the index between period 28 (June 2022) and 29 is carried by few products

Using R and GEKSDecomp

package

MEETING OF THE GROUP OF EXPERTS ON CPI 21

CONCLUSION

– Learnings ● At a really fine scale (GTIN/EAN), the GEKS indexes behave

quite closely to our current methodology ● At a more aggregate scale, there is more volatility and we have

to progress in our understanding and tools including classification issues.

– Future works ● Classification tools ● Theoretical understanding of the link with micro-economic

theory

7 JUNE 2023 MEETING OF THE GROUP OF EXPERTS ON CPI

insee.fr

Join us on

Adrien Montbroussous & Martin Monziols Methodologist & Head of the methodology unit Consumer Prices Division [email protected] [email protected]

  • Diapo-Titre
  • Diapo 2
  • Diapo 3
  • Diapo 4
  • Diapo 5
  • Diapo 6
  • Diapo 7
  • Diapo 8
  • Diapo 9
  • Diapo 10
  • Diapo 11
  • Diapo 12
  • Diapo 13
  • Diapo 14
  • Diapo 15
  • Diapo 16
  • Diapo 17
  • Diapo 18
  • Diapo 19
  • Diapo 20
  • Diapo 21
  • Diapo-Fin

Exploring methodologies to integrate new scanner data in the French CPI: Making use of multilateral methods

Languages and translations
English

MEETING OF THE GROUP OF EXPERTS ON CPI 7 JUNE 2023

Exploring methodologies to integrate new scanner data in the French CPI:

Making use of multilateral methods

MEETING OF THE GROUP OF EXPERTS ON CPI

1 CONTEXT AND GOALS 2 THEORY 3 RESULTS : BY VARIETY (COICOP 7 DIGITS) 4 RESULTS : BY COICOP 6 DIGITS(MAKE UP) 5 RESULTS : CONTRIBUTIONS

MEETING OF THE GROUP OF EXPERTS ON CPI

INTRODUCTION01

MEETING OF THE GROUP OF EXPERTS ON CPI

CONTEXT

– We are starting to receive data from 2 hard discounters. – We already have and use in production (since Jan 2020) scanner data from other retailers – Our current methodology with scanner data requires an external referential allowing us from GTIN/EAN to

have ● Additional characteristics (volume, unit, label, color ...) ● Nomenclature

– With classification rules and using the characteristics we classify at the variety level (level 7 of COICOP, French specificity).

● We are able to group EAN into equivalence classes to follow products better, avoid basket churn and catch the relaunches.

● We compute a Geometric Laspeyres, the methodology is similar than with the field collected data and the quality adjustment is slightly different since we can use the price history for the replacement product.

– Hard discount data has for now a low match rate with the referential (17 % of expenditure share according to 1 test file for one retailer and 39% for the other)

– We will experiment multilateral methods mainly to check what we could do without the referential and with the constraints of avoiding chain drift and basket churn.

MEETING OF THE GROUP OF EXPERTS ON CPI

TEST PROTOCOL/STRATEGY

– We will use our already possess scanner data (not enough history with hard discounters)

– Our product definition will vary between using GTIN/EAN or a article grouping methods (extended article number)

– We will compute micro indexes at the outlet level.

– We follow the average price of each product per month.

MEETING OF THE GROUP OF EXPERTS ON CPI

DATA OF THE EXPERIMENT

– Scanner data from January 2020 to December 2022, from 6 retailers (without hard discount because we don’t have background data).

– 3 varieties & their corresponding 6 digits COICOP level ● Whole milk & whole milk=> few replacements ● Foie gras & canned meat=> a high seasonality and 85% of

replacement during the year ● Lipstick & make up and care products => a lot of distinct

GTIN/EAN.

MEETING OF THE GROUP OF EXPERTS ON CPI

MULTILATERAL METHODS02

MEETING OF THE GROUP OF EXPERTS ON CPI

MULTILATERAL INDEXES TESTED

– We focus on GEKS-Törnqvist

where and

– The sample S can be a COICOP 6 digit level or a variety

– The product i can be the GTIN/EAN or an Extended article number ● Choice of the window size and splicing:

– Rolling window of size 13 and mean splice – Rolling window of size 25 and half splice

– Using R and IndexNumR package

IGEKS 0 , t =∏l=0

T ( I

0 , l

I t ,l ) 1 T +1=∏l=0

T ( I 0 ,l∗I l , t)

1 T +1

I T 0 ,t=∏i∈S

( pi t

p i 0 ) si 0+ si

t

2 si t=

pi tq i t

∑ j∈S p j t q j

t

MEETING OF THE GROUP OF EXPERTS ON CPI

RESULTS : VARIETY03

MEETING OF THE GROUP OF EXPERTS ON CPI

PRESENCE RATE PER VARIETY

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 0

0,2

0,4

0,6

0,8

1

1,2

Proportion of EAN x Outlet present in January 2020 and at the month m

foie gras presence rate

whole milk presence rate

lipstick/gloss presence rate

The presence rate is computed as

|N i∩N1| |N1|

Where are the products (EAN X Outlet in our case) sold in period i.

N i

– The presence rate is low for foie gras and lipstick

– There is a seasonality for foie gras

MEETING OF THE GROUP OF EXPERTS ON CPI

USING GTIN OR EXPENDED ARTICLE GROUP

90

95

100

105

110

115

120

-1 -0,7 -0,4 -0,1 0,2 0,5 0,8

Price indices for the variety whole milk between Jan 2020 and Dec 2022

(GEKS - GEKS by EAN) GEKS 25 months GEKS by ean 25 months

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 90

95

100

105

110

115

-1

-0,5

0

0,5

Price indices for the variety foie gras between January 2020 and December

2022

(GEKS - GEKS by EAN) GEKS 25 by extended article GEKS 25 by EAN

88 90 92 94 96 98 100 102

-3 -2,2 -1,4 -0,6 0,2

1 1,8 2,6

Price indices for the variety lipstick/gloss between Jan 2020 and Dec 2022

(GEKS - GEKS by EAN) GEKS by EAN 25 HASP GEKS 25 HASP with extend article number

Indexes using EAN or extended article number are very close for milk and foie gras.

There is more volatility for lipstick.

MEETING OF THE GROUP OF EXPERTS ON CPI

GEKS VS CURRENT CPI

jan vie

r-2 0

av ril-

20

jui lle

t-2 0

oc to

br e-

20

jan vie

r-2 1

av ril-

21

jui lle

t-2 1

oc to

br e-

21

jan vie

r-2 2

av ril-

22

jui lle

t-2 2

oc to

br e-

22 95

100 105 110 115 120 125

-2 -1,3 -0,6 0,1 0,8 1,5

Price indices for the variety whole milk between Jan 2020 and Dec 2022

CPI - GEKS by EAN CPI base 100 janv 20 GEKS by EAN half spliced 25 mois

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 90

95

100

105

110

115

120

-4 -3,1 -2,2 -1,3 -0,4 0,5 1,4 2,3 3,2

Price indices for the variety foie gras between January 2020 and December 2022

CPI - GEKS by ean CPI, base 100 = Jan 2020 GEKS by ean

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 80 85 90 95

100 105

-1 0,5 2 3,5 5

Price indices for the variety lipstick/gloss between Jan 2020 and Dec 2022

(CPI - GEKS ) CPI Base 100 janv 20 GEKS by EAN 25 HASP

There is more difference between GEKS and CPI than between two GEKS.

A highest volatility in some period (June 2020 for lipstick for instance)

MEETING OF THE GROUP OF EXPERTS ON CPI

RESULTS: COICOP 6 DIGITS (MAKE UP)04

MEETING OF THE GROUP OF EXPERTS ON CPI

MAKE UP AND CARE PRODUCT PRESENCE RATES

The match rate is computed as

|N i∩N j| |N i∪N j|

Where are the products (EAN X Outlet in our case) sold in period I.

There might be lockdowns effects in some periods.

Even for two consecutive periods, the match rate is quite low.

N i

MEETING OF THE GROUP OF EXPERTS ON CPI

UNCLASSIFIED DATA AT THE COICOP 6 DIGITS LEVEL

– In some COICOP 6 digits level we have a high proportion of unclassified data they can be

● Linked to field varieties (we do not have yet a corresponding scanner data variety) : nail make up for instance

● Do not correspond to the classification rules (a canned meat with honey flavour for instance)

They aren’t followed taken into account our current CPI.

– What would be the impact of keeping them ?

Expenditure share by variety for the poste make up and care product between Jan 2020 and Dec 2022

MEETING OF THE GROUP OF EXPERTS ON CPI

YEAR-ON-YEAR INFLATION PER VARIETY

13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36

-20

-15

-10

-5

0

5

10

15

20

Year-on-year inflation (GEKS 25 half spliced) for varieties of the poste make up and care products

between Jan 2021 and Dec 2022

face cleanser

mascara

Lipstick/gloss

face powder

women face cream

body care cream

unclassified

– The index for unclassified data is more volatile

● Beginning of 2021 ● Summer 2022

MEETING OF THE GROUP OF EXPERTS ON CPI

UNCLASSIFIED DATA AT THE POSTE LEVEL

1 4 7 10 13 16 19 22 25 28 31 34 70,00 %

75,00 %

80,00 %

85,00 %

90,00 %

95,00 %

100,00 %

105,00 %

110,00 %

115,00 %

120,00 %

Price indexes for the poste make up and care products

GEKS 25 HASP with unclassi- fied

GEKS 25 HASP without un- classified

GEKS 25 HASP by variety with CPI annual weights

CPI (Scanner data varieties only) aggregated

The trend is kept with unclassified data but there is a high volatility

MEETING OF THE GROUP OF EXPERTS ON CPI

RESULTS : CONTRIBUTIONS05

MEETING OF THE GROUP OF EXPERTS ON CPI

CONTRIBUTIONS BETWEEN TWO PERIODS

IGEKS−TQ t1 , t2 =∏i∈N

( pi t 2)wi

∗ , t 2

( pi t 1)w i

∗ , t 1∏t∈W ( pi

t) w i t , t 1−w i

t , t 2

cardW IGEKS−TQ t1 , t2 =∏i∈N

contributioni t1 ,t 2

ln (IGEKS−TQ t1 , t2 )=∑i∈N

ln (contributioni t1 ,t 2)

with unspliced indexes we can use the transitivity : IGEKS−TQ 12,13 =

IGEKS−TQ 1,13

IGEKS−TQ 1,12

– Goal : understand and explain the index variation from the observations

– We could also make sub groups (retailer, geo) to sum log(contrib)

MEETING OF THE GROUP OF EXPERTS ON CPI

CONTRIBUTIONS FOR LIPSTICK

17 18 19 20 21 22 23 24 25 26 27 28 29 92

93

94

95

96

97

98

99

100

101

102

GEKS-Tq ean X out- let 13 Mean splice

GEKS-Tq by EAN wi- thout splicing (period 17 as reference)

The decrease of the index between period 28 (June 2022) and 29 is carried by few products

Using R and GEKSDecomp

package

MEETING OF THE GROUP OF EXPERTS ON CPI

CONCLUSION

– Learnings ● At a really fine scale (GTIN/EAN), the GEKS indexes behave

quite closely to our current methodology ● At a more aggregate scale, there is more volatility and we have

to progress in our understanding and tools including classification issues.

– Future works ● Classification tools ● Theoretical understanding of the link with micro-economic

theory

7 JUNE 2023 MEETING OF THE GROUP OF EXPERTS ON CPI

insee.fr

Join us on

Adrien Montbroussous & Martin Monziols Methodologist & Head of the methodology unit Consumer Prices Division [email protected] [email protected]

  • Diapo-Titre
  • Diapo 2
  • Diapo 3
  • Diapo 4
  • Diapo 5
  • Diapo 6
  • Diapo 7
  • Diapo 8
  • Diapo 9
  • Diapo 10
  • Diapo 11
  • Diapo 12
  • Diapo 13
  • Diapo 14
  • Diapo 15
  • Diapo 16
  • Diapo 17
  • Diapo 18
  • Diapo 19
  • Diapo 20
  • Diapo 21
  • Diapo-Fin

Exploring methodologies to integrate new scanner data in the French CPI: making use of multilateral methods

Scanner data has been used in production to compute the HICP and the CPI for France since January 2020, for most of French retailers. The current methodology with this data uses a product referential bought from an external provider giving us detailed characteristics for each article. These characteristics allow us to match articles in our data with the COICOP and to create homogeneous groups of articles. Thanks to this information, we can compute a unit price value for each group of articles and each month.

Languages and translations
English

Exploring methodologies to integrate new scanner data in the French CPI: making use of multilateral

methods

Unece Conference – Geneva

June 2023

Authors: Adrien Montbroussous, Martin Monziols (Insee, France)

1/37

Abstract:

Scanner data has been used in production to compute the HICP and the CPI for France since January 2020, for most of French retailers. The current methodology with this data uses a product referential bought from an external provider giving us detailed characteristics for each article. These characteristics allow us to match articles in our data with the COICOP and to create homogeneous groups of articles. Thanks to this information, we can compute a unit price value for each group of articles and each month. The following steps in the methodology are very similar to the process with field-collected data: we select a sample of observations that will be used to compute price evolutions and aggregate them using a geometric Laspeyres formula at the lowest level. And as with field data, replacements are made for unavailable products. This choice is quite specific to France whereas the use of multilateral methods is more widespread in other countries. But recently, new retailers (hard discounters) have started to implement a data flux to provide Insee with their scanner data. The specificity of their data is that most of their articles aren’t covered by the product referential, which makes the current methodology hard to apply at first sight.

In this study, the goal is to be able to use these new scanner data in the following years. We will address two questions to do so. First, relying on the other INS experiences, we will test the generalization of multilateral methods on our already received and used scanner data to document on a large scale the behaviours of such methods in the French context. This experience is an opportunity to gain practical and theoretical skills with known data. Second, we will present a strategy to make use of these new scanner data with these methods. Given the raw data of these retailers, the process to be developed goes from classifying the products to integrating the computed indexes in our main process that produces our French CPI.

2/37

Contents I) Context of this study : scanner data in France now and future.........................................................4

1) Usual data....................................................................................................................................4 2) Scanner data : current methodology............................................................................................5 3) Information technology infrastructure.........................................................................................5 4) New data and data not yet used...................................................................................................6

a) Hard discounters.....................................................................................................................6 b) Overseas scanner data.............................................................................................................6 c) Other sectors not yet used.......................................................................................................6

II) Theory and strategy of our experimentation...................................................................................7 1) Multilateral methods and milestones of the process....................................................................7

a) Individual product specification ............................................................................................7 b) Multilateral Index...................................................................................................................8 c) Time windows & splicing.......................................................................................................9 d) Aggregation structure...........................................................................................................10

i. There is no way of having a decomposition of the multilateral indexes...........................10 ii. Choosing a level and aggregating these indexes.............................................................10

2) Test protocol..............................................................................................................................11 III) Results..........................................................................................................................................11

1 ) Presence of references across time...........................................................................................11 a) Milk.......................................................................................................................................13 b) Foie gras................................................................................................................................14 c) Lipstick.................................................................................................................................14 d) Canned meat.........................................................................................................................15 e) Make up and care products...................................................................................................15

2) Indexes at the « variety » level..................................................................................................16 a) Whole milk ...........................................................................................................................16 b) Foie gras................................................................................................................................17 c) Lipstick / gloss :....................................................................................................................19

3) Indexes at the « poste » level.....................................................................................................20 a) Whole Milk...........................................................................................................................20 b) Canned meat.........................................................................................................................23 c) Make-up and care products...................................................................................................27

4) Contributions behind GEKS-Tq variation.................................................................................31 a) Theory...................................................................................................................................31 b) Experiment............................................................................................................................32

IV) Next steps ? Our « research » agenda..........................................................................................34 1) Link between those multilateral indexes and microeconomic theory.......................................34 2) Explore the outlet dimension.....................................................................................................34 3) Going further with classification methods................................................................................34 4) Strategy to include those indexes inside our current methodology...........................................35

V) Conclusion.....................................................................................................................................35 References..........................................................................................................................................36 Appendix............................................................................................................................................37

3/37

I) Context of this study : scanner data in France now and future

After almost 10 years of experimentation, scanner data have been introduced in the French CPI and HICP in January 2020. During these experimentations, first, some retailers were collaborating, then a law has been issued to frame this operation and regulates the transmission of data. Almost all the field of retailers was transmitting data except hard discounters and some retailers in oversea departments.

Recently new providers (hard-discounters) started to send their data as expected by the law. As it will be detailed below, these data are challenging in order to use them in our CPI and HICP given our current methodology.

1) Usual data In France, scanner data is used in production to compute our CPI since January 2020. Data was until now provided by all the Super and Hypermarket, hard discounter excluded. We are getting data from retailers, thanks to an article of law originally published in 2017, and modified in 20211, making mandatory for retailers to provide us data for any day and shop, each day. These are used for prices’ statistics and turnover indicators. The data requested is the following:

• EAN (European Article Numbering)

• Outlet id

• Date of the sale

• At least two variables among the 3 following: number of article sold, the whole expenditure and the unit price of the article.

• A label, which can be relatively short and rarely exceeds 25 characters (space included)

• The intern nomenclature code given by the retailer

The law as it is written at the moment implicitly supposes that there is a « referential » that we can use to our purpose of describing and classifying products into a nomenclature. Because, as it is written, the only descriptive information is the label. These data are indeed used in our process with a referential (bought from an external society, CIRCANA previously known as IRI) allowing us to get more information on the data  : characteristics of the products and a « family number ». With these data, we are able to classify at a granular level our article (variety, which is even finer than COICOP on 6 positions – a specific level of France) using rules made for each variety to select observations. Lastly, according to the law, we keep 3 years of archives and the data for the current year.

The “SKU” (Stock Keeping Unit) is mentioned in the « guide on Multilateral Methods ». It’s a code associated with article that retailers keep to group EAN representing products of the same nature in order to have a better stock and supply process. Unfortunately, we do not possess that kind of information because not specified in the law.

1 https://www.legifrance.gouv.fr/loda/id/LEGITEXT000034540407

4/37

2) Scanner data : current methodology

To compute our CPI and HICP indexes with scanner data, we use sampling and a geometric Laspeyres. Thanks to the product referential, we are able to define an « expanded article number  » which allow us to keep tract of similar articles, even if they do not keep the same EAN (European Article Number) – it could be interpreted as a SKU code, but for statistical purposes. For instance, if a glue stick changes packaging (with a “Halloween theme” for example), with our extended article number we will be able to keep following the price whereas with only EAN it will be considered as a different product. We are then able to classify these expanded article number into ECOICOP at the variety level. For each group of articles, classification rules of the variety predicted are checked, and if verified the product is linked to the variety. Otherwise it is stashed in the corresponding “poste” (the French specific 6-positions level of COICOP) special category « unclassified ». Each year, operations are made in order to update the basket and the classification rules when it is needed.

There is some sampling because we only follow varieties representing at least 1% of their “poste” (which is an applicative constraint) so that if EANs represent a product not bought enough we do not follow it. In the end, nevertheless, we follow more than 80 millions of EANs. These varieties are updated and modified if necessary each year when we update the basket and the associated weights. These weights are computed with the previous year expenditures as the rest of the CPI basket. Micro-indexes are computed at the outlet x variety scale and then aggregated at the variety level, which is in turn included in the CPI “poste” calculation.

The quality adjustment for replacement is slightly different than the bridged overlap method  : since we are able to have an history for the replacement product, we do not need to impute prices of previous periods for the replacement product. The process is automatized for scanner data : we have classification rules that classifies article in homogeneous group and outlay candidates for replacement. Then, among the potential candidates for replacement, we proceed with a sampling with replacement (a product can replace more than one product). Products are replaced if absent for two months in a row. Also, we anticipate some replacements : if a product has both quantity and price declining for two consecutive months, it is replaced the next month.

In each 6 digits COICOP level (“poste”) we have varieties using scanner data and varieties using field collected data. We then aggregate the micro indexes to have an higher level index. For field data, we have micro-indexes computed by geographical areas, for scanner data index we compute the index at the outlet level then aggregate it at the whole France level.

The scope of scanner data used in production is hyper and supermarkets in Metropolitan France, for sales of processed food products, cleaning products and hygiene and beauty products and also, some durable goods. The scanner data expenditure share in the CPI weights is around 10 % of the whole basket. We do not use a larger consumption scope because we do not possess informations in our referential about these products. Hence, we cannot classify these products, we cannot control for their units, etc.

This current methodology was chosen because it was very close to what we do on the field so that our methodology is homogeneous.

3) Information technology infrastructure These huge datasets (approx. 9 Go per day of received data) is managed with a specific big data infrastructure, divided into:

5/37

• A Postgre database containing some information about metadata, referential, nomenclatures, indicators, composition of our scanner data baskets, etc.

• A NoSQL infrastructure for the detailed scanner data, accessible through HUE (Hadoop user experience) on which we can make HiveQL requests. For experimentation and in order to interfere at the least with production processes, we extract subset of the data to explore them with R and dedicated packages. This process is a bit long and laborious since the platform is not designed for such work.

We are on the verge of putting in place a new infrastructure, which will be more flexible and at the state of

art, allowing us to foster our experimentation work.

4) New data and data not yet used

a) Hard discounters

We are just in the process of getting data from 2 hard-discounters. Using these data will allow us to have a better coverage of hard-discount and also a better geographic coverage because they are more present in a specific area (north-east). The match rate of these data with the referential is relatively low: for one Saturday of sales, 16,5 % of EANs of one hard discounter are present in our referential, representing approximately 45 % of the data and 40 % of total expenditure. We were not able to compute such statistics with the other retailer since we only have a test file at this stage. Since the data is compliant with what is required by the law, we only have few descriptive information and item labels for each EAN. It has to be noted that the labels seem to be richer than for other retailers, with a lot of products with a 25 character label.

With this situation, we are facing two questions:

• how to classify the product in the COICOP

• how to calculate an index without detailed information about a product (volumes, weight, etc.)

Work will be done on these two subjects, this paper focuses mainly on the later question.

b) Overseas scanner data As for the hard discounters, we receive and are making new contacts with some retailers to have overseas’ scanner data. A large proportion of products are specifically sold in this region of the globe, hence not all the scope of these data is covered by the referential. Then, the same two questions need to be answered: are we able to classify these products with the sole label and is it possible to calculate a good quality price index.

So far we received data for some retailers at La Reunion and are making progress to receive some data from Guadeloupe. It has a relatively low impact on the whole French index but could be source of efficiency and

precision for the CPI of these territories2.

c) Other sectors not yet used Among the already received scanner data, we restrict our scope to specific products:

• Food

• Hygiene and makeup products

• House cleaning products

2 France does not publish geographic CPI except for the overseas’ departments.

6/37

• Some durable goods such as pregnancy test, highlighters…

But, for instance, we have data on clothes that we do not use so far. The reason is that this remaining data is not covered by the referential so that products are not classified.

We expect that our work on multilateral methods for the hard discounters will have some positive externalities on existing scanner data not used yet.

II) Theory and strategy of our experimentation We describe in this section our strategy of experimentation with some reminders regarding multilateral indexes. This work builds on previous experiments with these methods by previous colleagues. They used in their experiment homogeneous product groups using an « expanded article number » created thanks to the referential, on whole milk and foie gras, products that we kept in our study.

In parallel of this experiment, we will start a process to classify hard discounter data in COICOP nomenclature or even granularly (French nomenclature). As for multilateral methods, it will be continuing some previous work done on overseas products.

1) Multilateral methods and milestones of the process Since our data from hard discounters has only been provided since recently, we decided to start our experiment on the data of the other retailers we already possess. The idea is to calculate indexes based on two assumptions :

• we can classify products at a certain level (COICOP 6 positions for instance) • we do not possess any detailed information about the product except this classification

In our experiment, we always work at a fixed outlet dimension (at the outlet scale), we will discuss the results according several choices on the product dimension. Since we are dealing in our experiment only with goods, we assume that for the consumer the good has the same utility for each day of the month. We will follow the average price of products by month.

The benefit we expect to have from using a multilateral method is to bypass the classification issues, prevent a basket churn and avoid chain drifts problem caused by bilateral indexes. Instead of comparing only two periods, we will use all the available data within a window of time to compute an index.

Thanks to the multilateral guide produced by Eurostat, we highlighted 4 steps in which we had to analyse several methods/choices, presented below.

a) Individual product specification 

In our experiments, we test the following specifications for products and outlets :

• Products : as explained earlier, the only identifier that we have in the raw data is the EAN. So the goal here is to document how these multilateral methods behave when we consider the EAN as the identifier of a product or the « expanded article number » which gathers several products that are very similar. These expanded article numbers have been developed in order to capture commercial relaunches and are currently used in our current method. What we want to see is whether we can do without such an « expanded article number » or not. An option not yet explored, and that we will consider if this test is not conclusive, is to build such « expanded article number » based on the available information in the raw data through clustering methods.

7/37

• Outlets : this dimension has not been fully explored yet. As in the current methodology, we considered the outlets as outlets. We did not aggregate them in any manner, except in the subsection dedicated to contributions below.

At the end, in our data, the way we identify a price and a quantity is at the couple (product x outlet), where product is either EAN or the expanded article number.

b) Multilateral Index There are several family of multilateral methods:

• Geary Khamis (GK)   is a quality adjusted value index. It is an additive method, the index is obtained by solving the following system of equations:

I GK 0 ,t =

∑i∈N t

pi t q i

t/∑i∈N0

pi 0q i

0

∑i∈N t

vi q i t /∑i∈N0

v iq i 0

where vi inside the window W is vi=∑z∈W

qi z

∑s∈W q i

s

pi z

I GK 0 , z

• Weighted time-product dummy method consisting in an econometric model including dummies for each time period and characteristics. In our context, since we cannot revise our indexes, it is not the more appropriate.

• The last one consists on making transitive these bilateral indexes by averaging across all the possible paths between two dates, inside a time window. It is the GEKS ( Gini-Eltetö-Köves-Szulc) , which consists of a geometric mean of couple of bilateral indexes:

I GEKS 0 ,t =∏l=0

T ( I 0 ,l

It ,l ) 1

T +1=∏l=0

T (I 0 ,l∗I l ,t)

1 T +1 where T represents the size of the window.

◦ GEKS Törnqvist is also called CCD. 3

In the continuation of this paper, we will focus on GEKS indexes.

• GEKS method is based on bilateral Indexes that are reversible. In our experiments, we consider two of these bilateral indexes :

• Törnqvist : Index that is frequent in the literature. It is based on the micro-indexes of products and their relative shares in the expenditures at the two periods of time considered.

IT 0 , t=∏i∈S

( pi

t

pi 0 )

si 0+si

t

2

where the expenditure share of product i in the sample S is si t=

p i t qi

t

∑ j∈S p j

t q j t . S is the

intersection of the basket at time 0 and the basket at time t, i.e. all the products present at both periods.

3 Caves, D. W., L. R. Christensen, and W. E. Diewert. 1982. "Multilateral Comparisons of Output, Input, and Productivity Using Superlative Index Numbers." The Economic Journal 92, no. 365: 73-86.

8/37

• Fisher : a common index with good property. It is the geometric mean of a Laspeyres index and a Paasche index. One for the structure of consumption at the first period of time, the other for the one at the other period of time.

I F 0 , t=√∑i∈S

p i t qi

0

pi 0 qi

0 ∑ i∈S

pi t q i

t

pi 0 qi

t

c) Time windows & splicing The GEKS method has some parameters :

• The nature of the window for which we consider the mean of the bilateral indexes

◦ Rolling window (each month, the time window is shifted forward by 1 month) with the sub- question of the length of this window : 13 months and 25 months have proved to be useful. The latter has the drawback of needing 25 months before starting to publish indexes but can handle seasonality better.

◦ Expansive window (Each month, the time window is extended by 1 month). This method allows to start the production without any background data.

An index between period 0 and a period t can be computed within several windows and hence lead to several results. In order to avoid revising previous indexes, we apply splicing technique to link the index of the latest period with the previous ones. Two choices are possible: using as link the previously published indexes or the recalculated with the new window indexes.

Technically, the splicing is operated via (a) link month(s), that can be

◦ Mean splicing : all overlap periods between the two windows are used in order to link the indexes by computing a geometric average of the pairs of corresponding indexes.

▪ Linking with previously (with previous windows) calculated series version

I pub 0 , t =I pub

0 ,t−1∗∏k=t−T +1

t−1 ( I [t−T ,t−1 ]

t−1 , k ∗I[t−T +1, t ] k, t )

1 T−1

▪ Linking to published series version

I pub 0 , t =I pub

0 ,t−1∗∏k=t−T +1

t−1 ( I [ pub]

t−1 , k∗I[ t−T +1 ,t ] k ,t )

1 T−1

◦ Half splicing : period t – ((T+1)/2)+1

▪ Linking with previously (with previous windows) calculated series:

• I pub 0 , t =I pub

0 ,t−1∗I[ t−T , t−1]

t−1 ,t−(T + 1 2 ) ∗I [t−T +1 ,t ]

t−(T +1 2 )+1 ,t

▪ Linking to published series:

• I pub 0 , t =I pub

0 ,t−1∗I pub

t−1 ,t−(T + 1 2 ) ∗I [t−T +1 ,t ]

t−(T +1 2 )+1 ,t

In our study we will focus on mean and half splicing, as implemented in the R package IndexNumR4.

4 https://rdrr.io/cran/IndexNumR/

9/37

d) Aggregation structure.

i. There is no way of having a decomposition of the multilateral indexes

We demonstrate here that there is no way to decompose a multilateral index in a sum or product of multilateral indexes. Lets consider a set of product N, which can be decomposed in two subsets N1 and N2.

I Tornqvist , N 0 ,t =∏i∈N

( pi

t

p i 0 )

s̄i

where s̄i= 1 2 (

pi 0 qi

0

∑i ∈N p i

0q i 0 +

p i t qi

t

∑i∈N pi

t qi t )

Working on s̄i  :

s̄i=

1 2 ∑k ∈(1,2)

I {i∈Nk }( p i

0 qi 0×∑i∈Nk

p i 0 qi

0

∑i∈Nk pi

0 q i 0×∑i∈N

pi 0 q i

0 + p i

t qi t×∑i∈Nk

pi t q i

t

∑i∈Nk pi

t q i t×∑i∈N

pi t q i

t )

s̄i= 1 2 ∑k ∈(1,2)

I {i∈Nk }( p i

0 qi 0

∑i∈Nk pi

0 q i 0 pNk

0 + pi

t q i t

∑i∈Nk p i

t qi t pNk

t )

with pNk t the share of expenditures of the subset Nk of products in the total set N. As we can see,

as long as pNk t ≠pNk

0 we can’t have something like ∀ i , s̄ i=a I {i∈N 1}s̄i N 1+b I {i∈N 2 }s̄i

N 2 which

would have given this decomposition of the bilateral Törnqvist index :

ITornqvist , N 0 , t =∏i∈N

( pi

t

pi 0 )

s̄i

=∏i∈N 1 (

pi t

pi 0 )

s̄i

∏i∈N 2 (

p i t

p i 0 )

s̄i

I Tornqvist , N 0 ,t =∏i∈N 1

( pi

t

pi 0 )

a s̄i N 1

∏i∈N 2 (

p i t

pi 0 )

b s̄i N 2

=(∏i∈N 1 (

p i t

p i 0 )

s̄i N 1

) a

(∏i∈N 2 (

pi t

pi 0 )

s̄i N 2

) b

I Tornqvist , N 0 ,t =(I Tornqvist ,N 1

0 ,t )a(I Tornqvist , N 2 0 ,t )b

Hence, a GEKS index cannot be decomposed as the sum or product of GEKS indexes. We may have approximate decomposition (that could be useful for analysis) but we have to choose a level at which we would compute the index that we will publish.

ii. Choosing a level and aggregating these indexes

There are advantages and drawbacks of choosing a high or low level of aggregation. The lower we compute our micro-indexes with multilateral formula and dynamic weight, the harder it is to include new products or outlets during the year, we also apply dynamic weights only at a low level and may not catch well changes of expenditure. However, it introduce stability in the index which makes easier the interpretation and the consistency of it.

In practice, there will be not that much choices for the level at which computing the multilateral index. It will depend on the performance of our classification tool.

As recommended by the Eurostat guide on multilateral methods5, we would use fixed weights at the subclass level at least. In the French context, in which we publish indexes at a more dis-aggregated level, ECOICOP on 6 positions (postes).

5Guide on Multilateral Methods in the Harmonised Index of Consumer Prices, Chapter 6, 2022 edition, Eurostat

10/37

2) Test protocol Given these elements about our current methodology and the multilateral index (GEKS), we aim at testing some elements :

• Is the multilateral index far from the one we publish on a comparable field ?

• Do the results differ when considering the EAN or the « expanded article number » ?

• What does this new method give at the « poste » level ?

In order to proceed we have the following steps :

• Extracting data : given our data infrastructure, we have to construct and extract our data in order to use them with our usual statistical tools rather than coding the index in HiveQL. To limit the time spent in doing so, we choose 3 products. The level of aggregation should allow us to try several methods, we need to extract data at the EAN level. We keep the following information :

◦ The unit price (price per unity of volume)

◦ The sales (price per article X number of articles sold)

◦ The total volume (number of articles sold X volume of each article)

◦ The number of articles sold

◦ EAN

◦ Extended article number

• Choosing products :

◦ Milk, because it has a low replacement rate

◦ Foie gras, because it has a high replacement rate and high seasonality

◦ Lipstick, because it has a high number of EAN by « expanded article number »

◦ Then, we generalise at the whole poste to which they belong.

▪ Milk : 2 varieties + unclassified. ▪ Canned meat : 7 varieties + unclassified. ▪ Make-up and care : 6 varieties + unclassified.

III) Results

1 ) Presence of references across time Before computing indexes, we looked at the disappearance rate inside each group in order to get some sense of how data behave. This is some useful information to know to understand how indexes will behave on the one hand. On the other hand, this is the kind of side informations that will be useful to index producers in practice.

One measure has a bilateral approach, it is to follow the products sold in January 2020 and check if they are still available the following months.

11/37

Figure 1: Source: scanner data. Scope: Metropolitan France. Reading note : in January 2021, 13,5 % of the lipstick’s EAN sold in January 2020 are still sold in the same outlet

We can see that the product (at the EAN x outlet scale) available in January 2020 disappear rapidly from the market. It makes clear that chain drift is a risk with these methods. When product disappearing is followed with commercial relaunched, EAN change, even if the products are very closed with substantial price rise.

Interestingly, we can see that some products are appearing and disappearing with seasonal patterns. According the variety considered, we can identify a “stock” of products present on the market for many months : approximatively 80-85% for the milk, 10% for the lipstick and foie-gras.

We keep the dates in order to see the clear impact of the Covid-19 crisis, and the lockdown in France.

Figure 2: Source: scanner data. Scope: Metropolitan France. Reading note : in January 2021, 61,9 % of the canned meat EAN sold in an outlet January 2020 are still sold in the same outlet

12/37

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 0

0,2

0,4

0,6

0,8

1

1,2

Presence rate by variety

foie gras presence rate

whole milk presence rate

lipstick/gloss presence rate

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 0

0,2

0,4

0,6

0,8

1

1,2

Presence rate by poste

canned meat presence rate

make up and care products presence rate

whole milk presence rate

With the same analysis with one level of aggregation, we can see that the trend for canned meat is quite different than the one for foie gras. It is due to the seasonality of the sales that is specific to this variety.

To catch better the matching process that is used in multilateral indexes, we also produced heat maps by comparing the presence of EAN x outlet between each couples of periods within the windows.

a) Milk

Figure 3: Source: scanner data. Scope: Metropolitan France. Reading note : the quantity represented in the heat map is the EAN match rate computed as (number of EAN X outlet present in both period)/(number of EAN X outlet present in at least one period). Example, 51 % of the EAN X outlet sold in period 1 or 5 are sold in both periods .

The match rate of period 5 (May 2020) is the lowest comparing with other periods. It is more likely related to the lockdown in France following the Covid-19 pandemic.

13/37

b) Foie gras

Figure 4: Source: scanner data. Scope: Metropolitan France. Reading note : the quantity represented in the heat map is the EAN match rate computed as (number of EAN X outlet present in both period)/(number of EAN X outlet present in at least one period). Example, between 10 and 20 % of the EAN X outlet sold in period 1 or 5 are sold in both periods .

There is a specificity in the December months (periods 12, 24 and 36) : they have a lower match rate with other months of the year. Indeed, during the winter holidays new foie gras products are introduced into the markets.

c) Lipstick

Figure 5: Source: scanner data. Scope: Metropolitan France. Reading note : the quantity represented in the heat map is the EAN x outlet match rate computed as (number of EAN X outlet present in both period)/(number of EAN X outlet present in at least one period). Example: between 20 and 30 % of the EAN X outlet sold in period 1 or 2 are sold in both periods

14/37

Period 4 (April 2020) has the lowest match rate with other periods, it is most likely, as for the milk, related to the lockdown.

d) Canned meat

Figure 6: Source: scanner data. Scope: Metropolitan France. Unclassified data are included. Reading note : between 30 % and 40 % of the EAN sold in outlets in period 1 or 25 are present in both periods in the same outlet.

The match rate are higher at the canned meat poste level than for the variety foie gras. An explanation is that most of the varieties does not have the seasonality that foie gras has in the sales.

e) Make up and care products

Figure 7: Source: scanner data. Scope: Metropolitan France. Unclassified data are not included. Reading note : between 20 % and 30 % of the EAN sold in outlets in period 1 or 25 are present in both periods in the same outlet.

15/37

For the poste make up and care product, the match rate are computed without the unclassified data for reasons of performance and duration of computation.

It seems that there is heterogeneity among products regarding their presence over time. A larger study is needed to have an idea of the scope of possible values of presence over time.

2) Indexes at the « variety » level For the first comparisons, we only computed the GEKS Indexes within windows of 25 months. The idea is to firstly analyse the difference between a multilateral index and our current index and then the differences between using only the EAN and using groups of article (extended article number in our experiment). We used the half splicing method.

a) Whole milk 

Figure 8: Source : Scanner data Scope : Metropolitan France .Reading note : In December 2022, the price index computed using the GEKS method on article sold grouped by EAN in outlet is 117 it is 1.0 point less than the index computed with a use of « expanded article number ».

16/37

jan vie

r-2 0

m ar

s- 20

m ai-

20

jui lle

t-2 0

se pt

em br

e- 20

no ve

m br

e- 20

jan vie

r-2 1

m ar

s- 21

m ai-

21

jui lle

t-2 1

se pt

em br

e- 21

no ve

m br

e- 21

jan vie

r-2 2

m ar

s- 22

m ai-

22

jui lle

t-2 2

se pt

em br

e- 22

no ve

m br

e- 22

95

100

105

110

115

120

125

-2 -1,5 -1 -0,5 0 0,5 1 1,5 2

Price indices for the variety whole milk between Jan 2020 and Dec 2022

CPI - GEKS by EAN CPI base 100 janv 20 GEKS by EAN half spliced 25 mois

Figure 9: Source : Scanner data Scope : Metropolitan France  Reading note : In June 2021, the price index computed using the GEKS method on article sold grouped by EAN is 100.0, it is 0.3 point less than the index computed with a use of « expanded article number ».

Both graphs exhibit very similar price trajectories: between grouping articles by EAN or by «  expanded article number » and between a GEKS and the current CPI. This is working well because whole milk has stable products, few products disappear and the relative shares of the sub-products are relatively stable across time. The price trajectories are the same across all sub- products.

b) Foie gras During the year 2020, 85% of the products present in our CPI basket in December 2019 were replaced for the variety foie gras.

17/37

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 90

95

100

105

110

115

120

-1 -0,8 -0,6 -0,4 -0,2 0 0,2 0,4 0,6 0,8 1

Price indices for the variety whole milk between Jan 2020 and Dec 2022

(GEKS - GEKS by EAN) GEKS 25 months GEKS by ean 25 months

Figure 10: Source : Scanner data Scope : Metropolitan France  Reading note : In December 2021, the price index computed using the GEKS method on article sold grouped by « expanded article number » and outlet is 98.48 , it is 0.74 point less than the index computed with EAN .

There is a small difference between grouping the expenditure by EAN or by expanded article number in the case of foie gras. CPI and GEKS index using EAN are relatively comparable, the trend is the same but there is up to 3 points of difference, in a stronger inflation context.

These small differences are even smaller if considered with year-to-year inflation.

18/37

Figure 11: Source : Scanner data and French CPI. Scope : Metropolitan France  Reading note : In February 2022, the French CPI (rebased in January 2020) is 104.25 it is 2.78 points more than a GEKS index computed with a use of EAN.

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 90

95

100

105

110

115

120

-4 -3,1 -2,2 -1,3 -0,4 0,5 1,4 2,3 3,2

Price indices for the variety foie gras between January 2020 and December 2022

CPI - GEKS by ean CPI, base 100 = Jan 2020 GEKS by ean

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 90

95

100

105

110

115

-0,8

-0,6

-0,4

-0,2

0

0,2

0,4

0,6

Price indices for the variety foie gras between January 2020 and December 2022

(GEKS - GEKS by EAN) GEKS 25 by extended article

GEKS 25 by EAN

c) Lipstick / gloss :

For lipstick and gloss, each expanded article number gathers a high number of EAN: 775 « expanded article » representing 5564 EAN.

363534333231302928272625242322212019181716151413121110987654321 88

90

92

94

96

98

100

102

-3 -2,5

-2 -1,5

-1 -0,5

0 0,5

1 1,5

2 2,5

3

Price indices for the variety lipstick/gloss between Jan 2020 and Dec 2022

(GEKS - GEKS by EAN) GEKS by EAN 25 HASP

GEKS 25 HASP with extend article number

Figure 12: Source : Scanner data. Scope : Metropolitan France. Reading note : In June 2020, the price index computed using the GEKS method on article sold grouped by EAN with a window of size 25 and the half splicing method is 89.6, is it 2.3 point less than the index computed with a use of « expanded article number ».

This first comparison gives similar results with a bit more volatility with the index constructed at the EAN level.

We analysed at the expanded article level the price dynamic and expenditure share to understand better the dynamic, graphics are available in appendix, figures 29 and 30.

19/37

01 /0

1/ 20

20

01 /0

3/ 20

20

01 /0

5/ 20

20

01 /0

7/ 20

20

01 /0

9/ 20

20

01 /1

1/ 20

20

01 /0

1/ 20

21

01 /0

3/ 20

21

01 /0

5/ 20

21

01 /0

7/ 20

21

01 /0

9/ 20

21

01 /1

1/ 20

21

01 /0

1/ 20

22

01 /0

3/ 20

22

01 /0

5/ 20

22

01 /0

7/ 20

22

01 /0

9/ 20

22

01 /1

1/ 20

22 82

84

86

88

90

92

94

96

98

100

102

-1

0

1

2

3

4

5

6

Price indices for the variety lipstick/gloss between Jan 2020 and Dec 2022

(CPI - GEKS ) CPI Base 100 janv 20 GEKS by EAN 25 HASP

Figure 13: Source : Scanner data and French CPI. Scope : Metropolitan France. Reading note : In February 2022, the French CPI (rebased in January 2020) is 104.25 it is 2.78 points more than a GEKS index computed with a use of EAN x outlet.

The index here again are giving globally the same trends but larger differences than for the 2 other examples. The GEKS is more subject to volatility: each drop is a bit stronger.

The largest difference is July 2020, where some COVID-19 consequences are probably at stake.

3) Indexes at the « poste » level In our current methodology, in “poste level”, there are varieties using scanner data and varieties using field collected data. They are the aggregated together using an arithmetic Laspeyres.

a) Whole Milk This table presents the weight distribution among all the varieties regarding whole milk – from scanner data and field collected data, the one from scanner data are prefixed by “DC”.

YEAR Label WEIGHT 2020 WHOLE MILK PASTEURISED 9 2020 Whole milk UHT 17 2020 DC_Whole Milk 60 2020 DC_Fresh pasteurised whole milk 14

The scanner data weight 74 % in 2020 in our poste index. In our raw data, we have 2 varieties that are included in the index compilation and some unclassified data, not used. The data size of 3 years, aggregated by EAN X Outlet X Month represents approximatively 4,3*10⁶ lines.

20/37

Expenditure share of varieties inside the whole milk poste between Jan 2020 and Dec 2022.

The variety whole milk represent the large majority of the scanner data varieties in the poste whole milk. The unclassified products, are almost negligible.

The thing with these unclassified data is that we won’t be able with our classifying tool to have such non stable and excluded data. We will maybe have some unclassified observations because our tool won’t be able to classify them with enough confidence but with no guaranty that it will be same kind of products.

21/37

Figure 14 Source : Scanner data. Scope : Metropolitan France. The dotted lines represent annual expenditure shares and the continuous one monthly shares.

01 /0

1/ 20

20

01 /0

4/ 20

20

01 /0

7/ 20

20

01 /1

0/ 20

20

01 /0

1/ 20

21

01 /0

4/ 20

21

01 /0

7/ 20

21

01 /1

0/ 20

21

01 /0

1/ 20

22

01 /0

4/ 20

22

01 /0

7/ 20

22

01 /1

0/ 20

22 90

95

100

105

110

115

120

125

Indexes for the whole milk poste between Jan 2020 and Dec 2022

GEKS per variety weighted (annual weights)

aggregation of scanner data CPI

GEKS poste 25 HASP with unclassified

GEKS poste 25 HASP without unclassified

Figure 15: Scanner data Scope : Metropolitan France. Reading note : In July 2021, the GEKS index computed with a window of 25 months grouping by EAN x outlet, half splicing method and including the unclassified data is 101.5. The GEKS indexes are computed with a Törnqvist index formula, the splicing method is mean for the window size 13 and half for the window size 25.

1 4 7 10 13 16 19 22 25 28 31 34 90,00 %

95,00 %

100,00 %

105,00 %

110,00 %

115,00 %

120,00 %

GEKS indexes by varieties inside the whole milk poste between Jan 2020 and Dec 2020

Whole milk

Fresh pasteurised whole milk

Unclassified

Figure 16: Source: Scanner data. Scope : Metropolitan France. Reading note : In period 28 (April 2022), for the variety fresh pasteurised whole milk the GEKS index computed with a window of 25 months grouping by EAN x outlet and half splicing method is 103.75.

The unclassified products have a more erratic price variation, but they weight very lightly in this poste. It explains the fact that the several GEKS indexes lead to very close results at the whole milk poste level.

22/37

b) Canned meat Scanner data weights 65% in 2020 in our poste index, it represents 7 varieties that are included in our index and unclassified data, which weight more in this “poste” than for whole milk.

The data size of 3 years aggregated by EAN X Outlet X Month is approximatively 22,5*10⁶ lines

In order to understand what weights more in the indexes variation, we firstly looked at the monthly and annual expenditure shares of each varieties within the poste canned meat.

Year Label WEIGHT 2020 Canned charcuterie 35 2020 DC_Canned rillettes 4 2020 DC_Canned duck confit 20 2020 DC_Canned country style pâté 19 2020 DC_Canned liver pâté 4 2020 DC_Canned poultry pâté 3 2020 DC_Canned full foie gras 9 2020 DC_Canned bloc of foie gras 6

Expenditure share of varieties inside the canned meat poste between Jan 2020 and Dec 2022.

We can see the seasonality in the sale of some varieties : • Foie gras are more sold during the end of the years (December principally). • Country style pâté & unclassified are less sold in December.

Unclassified data has the most important weight in all periods (approx 35% annually), it is really different than for milk.

23/37

Figure 17 Source: Scanner data. The dotted lines represent annual expenditure shares and the continuous one monthly shares.

We wanted to investigate more these unclassified data, to do so we used the nomenclature we have from Circana which provide us with the referential of products.

Figure 18: Source: scanner data in 2020 and 2021. The dotted lines represent annual expenditure shares and the continuous one monthly shares. Reading note: in January 2020, the Circana family tinned foie gras represented 8.6% of the expenditure of unclassified data in the poste canned meat.

The EAN represented are part of 4 different “Ciracana families” (a specific nomenclature). Among these families, one could be linked to a field collected variety: “Corned beef and ham”. It weights less than 10% of the products in most periods, including this data in our computation could induce “double counts” with the field variety and lead to an overestimation of the weight of the variety canned charcuterie.

24/37

GEKS indexes by Circana family in 2020 for unclassified data at the canned meat poste level

Figure 19: Source: scanner data. Scope: Metropolitan France. Reading note: in December 2020, GEKS index for the Circana (previously IRI) family “canned pâtés and tinned rillettes” grouping by EAN x outlet with a window of 25 month and half splicing was 105.2.

Figure 20: Source: scanner data. Scope: Metropolitan France. Reading note: in December 2020, GEKS index for the unclassified data among the poste canned meat grouping by EAN x outlet with a window of 25 month and half splicing was 106.5.

There is an increase of the index for unclassified data in December 2020. Thanks to the Figure 18, we can see that it is most likely due to unclassified pate, rillettes and confits.

25/37

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 95,00 %

100,00 %

105,00 %

110,00 %

115,00 %

120,00 %

125,00 %

130,00 %

135,00 %

140,00 %

GEKS indexes by varieties inside the canned meat poste between Jan 2020 and Dec 2020

Canned duck confit

Canned country style pâté

Canned liver pâté

Canned Poultry pâté

Canned rillettes

Canned full foie gras

Canned Block of foie gras

Unclassified

The increase in the index including unclassified data in December 2020 is still present. All the indexes are relatively close and have the same trend, except for this month.

01 /0

1/ 20

20

15 /0

3/ 20

20

28 /0

5/ 20

20

10 /0

8/ 20

20

23 /1

0/ 20

20

05 /0

1/ 20

21

20 /0

3/ 20

21

02 /0

6/ 20

21

15 /0

8/ 20

21

28 /1

0/ 20

21

10 /0

1/ 20

22

25 /0

3/ 20

22

07 /0

6/ 20

22

20 /0

8/ 20

22

02 /1

1/ 20

22 95

100

105

110

115

120

GEKS price indices for the poste canned meat between Jan 2020 and Dec 2022

GEKS-Fisher half 25 with unclassified

GEKS-Törnqvist 25 half with unclassified

Figure 22 Source: scanner data. Scope: Metropolitan France. Reading note: in December 2020, GEKS index for the poste canned meat including the unclassified data grouping by EAN x outlet with the Fisher Index is 106.1. The GEKS indexes are computed with a window size of 25 and half splicing method.

26/37

Figure 21: Source: scanner data and French CPI. Scope: Metropolitan France. Reading note: in period 12 (December 2020), GEKS index for the poste canned meat including the unclassified data grouping by EAN x outlet with a window of 13 month and mean splicing was 106.2. The GEKS indexes are computed with a Törnqvist index formula, the splicing method is mean for the window size 13 and half for the window size 25.

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 90

95

100

105

110

115

120

125

Price indexes at the canned meat poste level :

GEKS 25 by variety aggregated (annual weights)

CPI scanner data aggregated

GEKS 25 with unclassified

GEKS 25 without unclassified

GEKS 13 with unclassified

GEKS 13 without unclassified

CPI poste

We also compared the results between a GEKS-Törnqvist and a GEKS-Fisher. It gives relatively similar results, Fisher index seems to lead to higher values.

c) Make-up and care products Year Label Weight 2020 Lipstick 2 2020 Face powder 5 2020 Nail polish 4 2020 Sun products 6 2020 Cleansing milk 10 2020 Care cream 16 2020 MASCARA 5 2020 Depilatory products 5 2020 Body moisturising milk 7 2020 DC_Face women care cream 17 2020 DC_Face cleanser 6 2020 DC_Body care cream/milk 5 2020 DC_Mascara 5 2020 DC_Face powder 4 2020 DC_Lipstick and gloss 3

Scanner data weight 40% in 2020 in our “Make-up and care products“ poste index, it is composed of 6 varieties contributing to the publish CPI and also some unclassified data. The data size of 3 years aggregated by EAN X Outlet X Month is approximatively 148,1*10⁶ lines.

Expenditure share by variety for the poste make up and care product between Jan 2020 and Dec 2022

27/37

Figure 23 Source: scanner data. Scope: Metropolitan France. Reading note: in December 2020, the expenditure share of unclassified data among the poste make-up and care products is 53.1%

Here also, unclassified data weight a lot, with a strong seasonality. Given that, we can anticipate that at the poste level, with this unclassified data, we could have something quite different from our published index: it weights a lot and has some seasonal pattern.

The unclassified has a high weight for several reasons. First, it is not always easy to make homogeneous class of products. Second, there is an applicative constraint which is that a variety has to be at least 1% of a “poste” so that homogeneous class of products have to gather enough expenditure shares. Third, with time available, the most promising unclassified are prioritise. Hence, some are not studied.

Figure 24: Figure 24 Source: scanner data. Scope: Metropolitan France. Reading note: in December 2020, the GEKS index using a window size of 25, half splicing and EAN x outlet level for unclassified data among the poste make-up and care products is 118.1.

Here we have the multilateral indexes at the “variety” level. The unclassified exhibits some weird behaviour. There are probably some micro-trajectories very steep that have some macro-impact. This case is of interest: we have to develop tools to elucidate that kind of observations: either to understand what it is going on or to cancel these observations if not reliable.

28/37

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 85,00 %

90,00 %

95,00 %

100,00 %

105,00 %

110,00 %

115,00 %

120,00 %

GEKS HALF SPLICE window 25 by variety for the poste make up and care products 

body care cream

face cleanser

face powder

mascara

Lipstick/gloss

women face cream

unclassified

Figure 25: Figure 25: Source: scanner data. Scope: Metropolitan France. Reading note : Between February 2020 and February 2021, the price level for unclassified canned meat products has

decreased of 10,0 %. Year-on-year inflation is computed as 100∗( I m, y

Im , y−1

−1)%

Figure 26 Source: scanner data and French CPI. Scope: Metropolitan France. Reading note: in period 12 (December 2020), GEKS index for the poste make up and care product including the unclassified data grouping by EAN x outlet with a window of 13 month and mean splicing was 105.9. The GEKS indexes are computed with a Törnqvist index formula, the splicing method is mean for the window size 13 and half for the window size 25. As we can see just above, the unclassified data have a strong impact. Also, we can see that the window length has no real impact on the multilateral indexes if unclassified are excluded. But, regardless of this

29/37

1 4 7 10 13 16 19 22 25 28 31 34 70,00 %

75,00 %

80,00 %

85,00 %

90,00 %

95,00 %

100,00 %

105,00 %

110,00 %

115,00 %

120,00 %

Price indexes for the poste make up and care products

GEKS MEAN 13 without unclassified

GEKS MEAN 13 with unclassified

GEKS 25 HASP with un- classified

GEKS 25 HASP without unclassified

CPI poste

13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36

-20

-15

-10

-5

0

5

10

15

20

Year-on-year inflation (GEKS 25 half spliced) for varieties of the poste make up and care products

face cleanser

mascara

Lipstick/gloss

face powder

women face cream

body care cream

unclassified

length, the multilateral indexes are quite far from the current one. This current index shows an increase of prices when the multilateral ones demonstrate more stability.

GEKS indexes by Circana family in 2020 for unclassified data

at the make up and care poste level

Figure 28: Source: scanner data in 2020. Scope : Metropolitan France. Reading note: in November 2020, the Circana (previously IRI) family “eyes make-up” represented 23.1% of the expenditure of unclassified make up and care products. Only families representing more than 1% of the expenditure are represented.

30/37

Figure 27: Source: scanner data in 2020. Scope : Metropolitan France. Reading note: in November 2020, GEKS index for the Circana (previously IRI) family “eyes make-up” grouping by EAN x outlet with a window of 25 month and half splicing was 97.5. Only the four family with the highest expenditure share are represented

Among unclassified data, some are linked to Circana Families representing field collected varieties, some others don’t respect the precise specifications of varieties : for instance eyes make-up isn’t in our scanner data varieties currently.

4) Contributions behind GEKS-Tq variation

a) Theory To understand better the variation of the indexes computed on the previous section of this paper, we wanted to take a look into the contributions of individual products to the index variation6.

A contribution is defined between two periods

To do so, we have to start by looking at the contribution in the bilateral index. If the product is present at both periods, its expenditure share in the corresponding bilateral index is :

w i t 1 ,t 2=0,5∗(

pi t 1q i

t 1

∑ j∈N t 1∩N t2

p j t 1q j

t 1 + pi

t 2 qi t 2

∑j∈N t1∩N t 2

p j t 2 q j

t 2 )

were pi t is the price of product i at period t and q i

t is the number of product i sold at period t. It is the

weight it has in the Törnqvist index.

We are then able to compute the average bilateral share of this product from a period t with all the other periods in the window in the multilateral index:

w i ∗ , t= 1

card W ∑r∈W w i

r ,t

To look at the contribution of a product in the index variation between period t1 and t2, we have to apply these weights to the product of price variation in each period: which give the formula:

I GEKS−TQ t 1 ,t2 =∏i∈N

(p i t 2)wi

∗ , t2

(p i t 1)wi

∗ , t1 ∏t∈W (p i

t) w i

t, t 1−w i t, t 2

cardW

and so we have a decomposition

I GEKS−TQ t 1 ,t2 =∏i∈N

contributioni t 1 ,t 2

With this formula, the index is represented as the product of the contribution of each product. In order to facilitate the interpretation by having a summability between contributions, we looked at the log of the index and the log of the contributions.

ln( I GEKS−TQ t 1, t 2 )=∑i∈N

ln (contributioni t 1 , t 2)

ln ( IGEKS−TQ t 1, t 2 )=∑i∈N

ln ( ( pi

t 2)w i ∗ , t 2

( pi t 1)w i

∗ , t 1 ∏t∈W ( pi

t) w i

t, t 1−w i t , t2

|W| )

Our product definition here, is still an EAN x Outlet.

6 Thanks to the paper, Decomposing Multilateral Price Indexes into the Contributions of Individual Commodities, and the guide on Multilateral Methods (Chapter 8) guide we looked into this direction.

31/37

Due to expensive computation costs, these contribution are computed only at a level EAN x Outlet. We had to explore ways to reduce the number of observation and time of computation.

We used this R package: https://github.com/MjStansfi/GEKSdecomp/. It seems to be compatible only with comparing the two last periods of the window. We would have to adapt this tool to have more flexible options to analyse contributions to evolution. Also, this is limited at analysis “inside” a given window. But for longer evolution, splicing has probably to be taken into account.

b) Experiment From our previous results, several periods for each varieties/poste where interesting to look at in order understand better the indexes variation (between period 4 and 5 for lipstick and 5 and 6 28,29 et 29,30 between 1 and 2 for unclassified make up & car products, between period 1 and 12 for unclassified canned meat, between period 3 and 4 and 4 and 5 for unclassified canned meat).

For practical reasons, we chose to calculate contributions for lipstick between period 28 and 29, with a window size of 13, without separating outlets. We studied the contributions for each EAN.

Figure 29: Source: scanner data. Reading note: Each bar represent the log contribution of an EAN in the price evolution of lipstick/gloss between period 28 (April 2022) and period 29 (May 2022), measured inside a window of 13 months. A log contribution superior than 0 means that the EAN contribute positively (price increase) and a negative one negatively.

By transitivity of the GEKS index, we can compare contributions between period 28 and 29 to the ratio of

I GEKS−TQ 17,29

I GEKS−TQ 17,28 . We cannot theoretically compare the ratio of spliced indexes.

32/37

GEKS using EAN and window size of 13 for lipstick:

Time Period

GEKS-Tq EAN 13

Mean splice

GEKS-T ean X outlet 13 Mean splice

GEKS by EAN without splicing (period 17 as reference)

17 102,70 98,90 100 18 104,01 99,05 100,88 19 100,56 97,05 96,88 20 104,60 98,74 99,94 21 104,06 97,30 98,89 22 104,25 98,00 99,55 23 103,39 98,90 100,8 24 103,31 98,33 99,66 25 94,12 96,73 95,42 26 98,87 98,22 96,77 27 105,15 96,91 97,31 28 105,90 98,90 99,38 29 104,57 95,55 96,97

I GEKS−TQ 17,29

I GEKS−TQ 17,28 =

96,97 99,38

= 0,9756 = 97,5%

With the contributions computed with the R package GEKSdecomp we have the following results:

e ∑i∈EANs

log(contribi) = 0.9691 = 96,9% We are also able to find the EAN with the contribution the furthest from 1. It has a log(contribution) of - 0.00353 and a contribution of 0.9965

Figure 30: Source: scanner data. Scope: Metropolitan France. Reading note: in period 20 (July 2021), the average price for the EAN studied is 13.21€. The average price is computed as an average weighted by the expenditure share.

33/37

IV) Next steps ? Our « research » agenda

This work is the beginning of a longer project about multilateral methods and their interest given our context. If this work shows some interesting leads, a lot has still to be done.

1) Link between those multilateral indexes and microeconomic theory First, while we kept some very close methodology for our scanner data, we were able to use the same explanation for our methodology. This is a fixed basket representing the mean consumption of French households optimising their utility. As some links exist between Laspeyres, Paasche and Fisher indexes on one hand and micro-economic theory one the other hand: there is some theoretical grounds to our current method.

At this stage, we need to better understand the economical approach on which are based multilateral methods and how to communicate and interpret results with these methods. This is of interest to make this index understandable by anyone in society.

2) Explore the outlet dimension In our current CPI methodology for field collected varieties, we use sampling and define targets

among the outlet according to their classification (supermarket, hypermarket, specialized shop). For scanner data varieties, they represent only two kind of outlet: supermarket and hypermarket. In this experiment, we are producing micro indexes at the outlet index, which means that we consider for the customer there is no substitution between buying in a shop or another. The latter point can be discussed, because for instance we could consider that outlets of the same size, from the same retailer and in a close geographic area could be considered equivalent. Following ean into a group of shop could improve the quality of the index because it could improve the match rate between periods.

3) Going further with classification methods As explained above, this work requires a classifying tool. Without this, we cannot classify data into the COICOP and consequently, we cannot compute relevant indexes. This task will be tackle in the following months by making progress with the existing tools we have.

We use a fasttext algorithm which is a neural network tool specialized in dealing with characters strings. By extracting labels of products and their corresponding expenditures we will optimize the classifying function. Our goal is to have a good performance at the “poste” level – going further seems to be unreasonable given the information we have.

4) Strategy to include those indexes inside our current methodology Before hoping to use these indexes in production, we have to deepen our look into the contributions, the interpretability/decomposition of an index evolution. We presented some first contribution computation but we will need to conceive more practical routines for understanding such index evolutions.

34/37

And when we will be able to classify product, to compute multilateral index with enough understanding of it (from both statistical and theoretical approaches), we will have to have a reflection on how it will be possible to use this kind of methods with the rest of the basket we follow and to see how to adapt this with our current methodology (whether to change everything or to have some cohabitation).

V) Conclusion

This first real experimentation of multilateral index with our scanner data gives us some first learnings : • at a really fine scale, this index behaves quite closely to our current methodology and consequently

seems possible to work at the EAN scale – it sill remain to be confirmed at a larger scale • working at the EAN scale seems to be acceptable but it has to be confirmed with a larger scale

experiment • at a more aggregate scale (our “poste level”), there is more volatility and we have to progress in our

understanding and tools for this

This first work emphasises two major elements that we need to work on : • Classification tool to classify the products in the COICOP nomenclature • Theoretical understanding of the links with micro-economic theory

35/37

References

• Guide on Multilateral Methods in the Harmonised Index of Consumer Prices, Eurostat, 2022.

• MARS: A method for defining products and linking barcodes of item relaunches, Antonio G. Chessa, Statistics Netherlands.

• “Chain drift” in the Chained Consumer Price Index: 1999–2017, Monthly Labor Review, BLS December 2021.

• Évaluation des méthodes multilatérales de calcul de l'indice, STATBEL, Ken Van Loon et Dorien Roels, 07/2019

• Eliminating Chain Drift in Price Indexes Based on Scanner Data, Jan de Haana and Heymerik van der Grient, Statistics Netherlands,2 April 2009

• FMI, CPI Manual, 2020.

• From GEKS to cycle method , 11/2017, Leon Willenborg

• A Closer Look at the Rolling Window GEKS Index with a Movement Splice, Jan De Haan, 16 October 2017

• Extension of multilateral index series over time: Analysis and comparison of methods, Antonio G. Chessa, 7 May 2021

• Transitivity of price indexes, Leon Willenborg , May 2018

• Comparing Price indexes of Clothing and Footwear for Scanner Data and Web Scraped Data

• Antonio G. Chessa* and Robert Griffioen**, Statistics Netherlands, Team CPI ,1 st

April 2019

• Leclair (2019), « Utiliser les données de caisses pour le calcul de l’indice des prix à la consommation », Le Courrier des statistiques, n°3

• Decomposing Multilateral Price Indexes into the Contributions of Individual Commodities, Michaël Webster and Rory C. Tarnow-Mordy , 2019

• Introducing multilateral index methods into consumer price statistics, Liam Greenhough , ONS, 28 November 2022

• The Economic Theory of Index Numbers and the Measurement of Input, Output, and Productivity, Caves, Christensen and Diewert, 1982

• The use of weighted GEKS for the calculation of consumer price indexes: an experimental application to Italian scanner data Alessandro Brunetti (Istat), Stefania Fatello (Istat), Tiziana Laureti (Università della Tuscia), Federico Polidoro (Istat) 17th Ottawa Group Meeting, Rome, 7 – 10 June 2022

36/37

Appendix Average price evolution of Circana families among unclassified make up and care products between January

2020 and December 2022

Figure 31: Source: Scanner data. Scope: Metropolitan France. Reading note : The evolution is computed as the ratio of average price at period m/ average price at period 0

Expenditure share of Circana families among unclassified make up and care products between January 2020 and December 2022

Figure 31: Source: Scanner data. Scope: Metropolitan France. The dotted lines represent annual expenditure shares and the continuous one monthly shares. Results are presented for extended group representing more than 1 % of the expenditure in 2020

37/37

  • I) Context of this study : scanner data in France now and future
    • 1) Usual data
    • 2) Scanner data : current methodology
    • 3) Information technology infrastructure
    • 4) New data and data not yet used
      • a) Hard discounters
      • b) Overseas scanner data
      • c) Other sectors not yet used
  • II) Theory and strategy of our experimentation
    • 1) Multilateral methods and milestones of the process
      • a) Individual product specification 
      • b) Multilateral Index
      • c) Time windows & splicing
      • d) Aggregation structure.
        • i. There is no way of having a decomposition of the multilateral indexes
        • ii. Choosing a level and aggregating these indexes
    • 2) Test protocol
  • III) Results
    • 1 ) Presence of references across time
      • a) Milk
      • b) Foie gras
      • c) Lipstick
      • d) Canned meat
      • e) Make up and care products
    • 2) Indexes at the « variety » level
      • a) Whole milk 
      • b) Foie gras
      • c) Lipstick / gloss :
    • 3) Indexes at the « poste » level
      • a) Whole Milk
      • b) Canned meat
      • c) Make-up and care products
    • 4) Contributions behind GEKS-Tq variation
      • a) Theory
      • b) Experiment
  • IV) Next steps ? Our « research » agenda
    • 1) Link between those multilateral indexes and microeconomic theory
    • 2) Explore the outlet dimension
    • 3) Going further with classification methods
    • 4) Strategy to include those indexes inside our current methodology
  • V) Conclusion
  • References
  • Appendix

Presentation

Languages and translations
English

Measurement of gender inequalities

in the French labour market

using efficiency measures

Poleth VEGA RUALES

Audrey DUMAS

CDEDYS - University of Perpignan

Meeting of the UNECE Group of Experts on Gender Statistics

Geneva, Switzerland, 10 May 2023

Content

• Motivation

• Literature

• Methodology

• Data

• Results

• Conclusion

Motivation

The existence of wage gaps between men and women could be

justified because women:

- work fewer hours;

- have less work experience;

- are engaged in low paying job (sectoral and occupational

segregation).

However, when isolating these factors, wage gaps does not disappear

(Carillo et al., 2014).

Labour market discrimination

Motivation Literature Methodology Data Results Conclusion

Two related but distinct concepts in the realm of gender workplace

inequality:

➢Glass ceiling: prevents women from advancing to senior leadership

positions, despite their qualifications and achievements.

→Systemic bias

➢Sticky floor: women stuck in low-paying, low-status jobs, with limited

opportunities for advancement.

→Lack of access to training and education, and social expectations

about gender roles

Labour market discrimination

Motivation

Motivation Literature Methodology Data Results Conclusion

Methods of measurement

Unadjusted GPG

Adjusted

various factors (occupation, education,

experience, etc.)

Blinder-Oaxaca (BO) decomposition

(1973)

Efficiency approaches:

DEA-based Malmquist Index

(Amado et al., 2018)

Motivation Methods of measuring the gender pay gap

Motivation Literature Methodology Data Results Conclusion

1. Better understand the differences between the traditional measures

of GPG and the Malmquist Index (MI).

2. Measure “glass ceilings” and “sticky floors” using a new

methodology based on MIs.

3. Contribute to the empirical work by measuring the GPG in the

French labour market with an efficiency approach.

4. Show “glass ceilings” and “sticky floors” by economic activity and

occupation to propose recommendations.

Motivation Literature Methodology Data Results Conclusion

Motivation Research objectives

Methods of measurement

Unadjusted GPG: 𝑦𝐻−𝑦𝐹

𝑦𝐻

Adjusted

various factors (occupation, education,

experience, etc.)

Blinder-Oaxaca (BO) decomposition

(1973)

Efficiency approaches:

DEA-based Malmquist Index

(Amado et al., 2018)

Motivation Literature Methodology Data Results Conclusion

Literature Methods of measuring the gender pay gap

8

a woman F

a man M y

(ex: wage)

Mean predicted y given x for male: 𝐸(𝑌𝑀/𝑋𝑀) estimated with linear regression

Mean predicted y given x for female: 𝐸(𝑌𝐹/𝑋𝐹) estimated with linear regression

Fx Mx

unadjusted GPG

Fy

My adjusted

GPG

Blinder-Oaxaca Decomposition

one input

one output

x (ex: tenure)

Motivation Literature Methodology Data Results Conclusion

9

x

y

Malmquist Index Male frontier: Highest y given x for male computed with DEA method

Female frontier: Highest y given x for female computed with DEA method

Average distance for male-to-male frontier

Average distance for female to male frontierAdjusted GPG

Distance to the frontier : efficiency score computed with DEA method

Motivation Literature Methodology Data Results Conclusion

Motivation Literature Methodology Data Results Conclusion

Blinder-Oaxaca Descomposition Malmquist Index

Definition of “average” point Average characteristics (x) Average efficiency score

Definition of the “reference”

situation Average predicted wage given x Max wage given x

Computation of the “line” or

“frontier” Assumption on the functional forms

No assumption on functional forms

 non-parametric method

Number of outputs “y” One Multi-output

11

Input

Output

Female frontier

Male frontier

x

y

• To measure a MI

- an output-oriented MI

- an input-oriented MI

Output-oriented distance: “Glass ceiling”

Input-oriented distance: “Sticky floor”

Methodology

Motivation Literature Methodology Data Results Conclusion

12

Input

Output

x

y 𝑀𝐼𝑂 = 𝐷𝑂 𝑀 𝑋𝑂

𝐹 , 𝑌𝑂 𝐹

𝐷𝑂 𝑀 𝑋𝑂

𝑀, 𝑌𝑂 𝑀 ∙

𝐷𝑂 𝐹 𝑋𝑂

𝐹 , 𝑌𝑂 𝐹

𝐷𝑂 𝐹 𝑋𝑂

𝑀, 𝑌𝑂 𝑀

Τ1 2

𝑀𝐼𝐼 = 𝐷𝐼 𝑀 𝑋𝐼

𝐹 , 𝑌𝐼 𝐹

𝐷𝐼 𝑀 𝑋𝐼

𝑀, 𝑌𝐼 𝑀 ∙

𝐷𝐼 𝐹 𝑋𝐼

𝐹 , 𝑌𝐼 𝐹

𝐷𝐼 𝐹 𝑋𝐼

𝑀, 𝑌𝐼 𝑀

Τ1 2

𝑝𝑠𝑒𝑢𝑑𝑜 𝐻𝑀𝐼 = 𝑀𝐼𝑂

𝑀𝐼𝐼

1/2

 adjusted GPG

Motivation Literature Methodology Data Results Conclusion

Methodology

• 2019 Labour Force Survey (LFS) from France (before Covid): Cross-sectional survey

of over 50,000 households

• Sample: working age individuals (15 years and older) employed in formal economy, only

one job.

• Input variables: Measures of human capital investment:

- Theoretical number of years of education of the highest diploma obtained

- Number of work seniority

• Output variables:

- Hourly earnings in euros

• Final sample: 40,978 workers; 19,294 men and 21,684 women

• Disaggregation: 18 economic activities and 9 occupations

Data

Motivation Literature Methodology Data Results Conclusion

Descriptive statistics

Motivation Literature Methodology Data Results Conclusion

Mean no.

of years of

education

Mean no.

of years of

work

seniority

Mean

hourly

earnings

(€)

Mean no.

of years of

education

Mean no.

of years of

work

seniority

Mean

hourly

earnings

(€)

Mean 12.8 11.9 14.3 13.2 12.6 13.0

Variation 0.2 0.9 0.4 0.2 0.9 0.4

Min 5 0 2.2 5 0 2.4

Max 20 48 58.8 20 47 65.6

Unadjusted gender pay gap : 13.4%

Male Female

Summary statistics of the inputs and outputs (LFS, 2019)

Results

Output-oriented MI Input-oriented MI

Glass ceilings Sticky-floors

Agriculture, forestry and fishing 0.9 3.1 0.8 1.9

Manufacturing 10.1 10.5 6.4 8.5

Electricity, gas, steam and air conditioning supply 14.1 12.3 2.7 7.4

Water supply; sewerage, waste management 8.0 14.0 16.5 15.3

Construction 2.0 9.9 13.0 11.4

Wholesale and retail trade; repair of motor vehicles 7.7 7.7 4.5 6.1

Transportation and storage 1.6 11.0 9.5 10.3

Accommodation and food service activities 4.6 6.1 6.6 6.4

Information and communication 10.3 12.2 4.2 8.1

Financial and insurance activities 11.6 11.9 2.6 7.2

Real state activities 4.8 5.0 5.5 5.3

Professional, scientific and technical activities 9.3 10.0 3.8 6.8

Administrative and support service activities 6.3 5.7 4.9 5.3

Public administration and defence; social security 6.1 9.5 6.7 8.1

Education 15.3 14.3 4.5 9.2

Human health and social work activities 8.5 9.0 1.1 5.0

Arts, entertainment and recreation 7.1 10.6 6.2 8.4

Other service activities 11.2 10.3 1.6 5.9

Pseudo HMIEconomic Activity

Values of the Malmquist index by economic activity (in percentage)

Unadjusted GPG

Motivation Literature Methodology Data Results Conclusion

Results

Output-oriented MI Input-oriented MI

Glass ceilings Sticky-floors

Managers 10.4 13.2 2.5 7.7

Professionals 7.8 13.0 5.1 9.0

Technicians and associate professionals 8.2 9.6 7.4 8.5

Clerical support workers 5.9 7.5 4.5 6.0

Service and sales workers 11.4 14.0 0.8 7.2

Skilled agricultural, forestry and fishery workers 14.5 11.0 8.6 9.8

Craft and related trades workers 11.3 10.6 2.0 6.2

Plant and machine operators, and assemblers 10.7 11.9 4.3 8.0

Elementary occupations 11.9 10.0 -1.8 3.9

Unadjusted GPG Pseudo HMIOccupation

Values of the Malmquist index by occupation (in percentage)

Motivation Literature Methodology Data Results Conclusion

Conclusion

• Original methodology based on efficiency approach that bring complementary

results of traditional measures of the GPG.

• Limitations:

The MI is computed on the “average” distance of male and female to the frontier.

The “average” is still not always representative of the sample.

• Future research:

New measure based on the Hicks-Moorsteen Index.

Motivation Literature Methodology Data Results Conclusion

Thank you for your attention

Poleth VEGA RUALES

Audrey DUMAS

CDEDYS - University of Perpignan

Meeting of the UNECE Group of Experts on Gender Statistics

Geneva, Switzerland, 10 May 2023

19

Input

Output

x

y

Motivation Literature Methodology Data Results Conclusion

(France) Review of WP.29 UN Regulations and GTRs on their fitness for ADS

Languages and translations
English

Review of UN Regulations and GTRs on their fitness for ADS

1

Transmitted by the representative of France on behalf of the chairs of the WP29 screening taskforces

Informal document WP.29-189-20 189th WP29, 7–9 March 2023 Agenda item 8.6.2.

At its 186th session in March 2022, WP.29 requested all its subsidiary working parties to perform a screening of the UN Regulations and Global Technical Regulations (GTR) of relevance regarding their fitness for Automated Driving Systems (ADS) until March 2023.

At its 14th session in September 2022, GRVA gave additional guidance as to what kind of automated vehicles to consider, etc. (informal document GRVA-14-54r1)

Pending request for deadline extension until June 2023 to give additional time to GRs with staggered session schedules

Mandate from WP.29

2

2

Common work method across GRs

The taskforces of the different GRs have agreed to work on the same deliverables:

High-level summaries for each Regulation

Comprehensive files for the detailed screening

A “whitebook” for handling automated driving when drafting new Regulations

3

The review does not include definitive solutions for changing Regulations, but taskforces may offer suggestions (amendments, new Regulations, new vehicle categories…)

3

This template can both serve as a preliminary analysis of a Regulation before its detailed review, and as a final report to give a summary of the detected issues and possible options

High-level assessment

4

4

Detailed screening (1/2)

Points of interest:

Use cases: full automation, dual-mode, no occupants, etc.

Possible approaches: amending Regulations, drafting specific Regulations for automated vehicles, creating new vehicle categories, etc.

Consider both explicit and implicit concepts

5

5

Online collaborative environment and example of a detailed review for a GRVA Regulation

Detailed screening (2/2)

6

6

As of early March 2023:

GRBP: Detailed screening ready to start

GRE: Detailed screening and deliverables done for R48

GRPE: First meeting to take place soon

GRSG: Detailed screening in progress (55% done)

GRSP: Detailed screening in progress (80% done)

GRVA: Detailed screening in progress (75% done)

7

Status of the screening taskforces

GRSG GRVA GRSP

GRBP

GRPE

GRE

Not all taskforces might be able to get validation from their GR before the June WP29 session

7

Detailed screening

Drafting deliverables

High level assessment

8

R89 Not yet screened
R30 Applicable (or mostly applicable)
R12 Not applicable (or not applicable except in dual mode)
R13 Work needed on the Regulation / new Regulation needed
R158 Not applicable or mostly not applicable AND amendments or new Regulation needed
R48 Applicable or mostly applicable AND amendments or new Regulation needed
GRBP R9 R28 R30 R41 R51 R54 R59 R63 R64 R75 R92 R106 R108 R109 R117 R124 R138 R141 R142 R164 R165 GTR16                                          
GRE R10 R37 R48 R53 R74 R86 R99 R128 R148 R149 R150                                                                
GRPE R24 R40 R47 R49 R68 R83 R84 R85 R96 R101 R103 R115 R120 R132 R133 R143 R154 GTR2 GTR4 GTR5 GTR10 GTR11 GTR15 GTR17 GTR18 GTR19 GTR21 GTR22 GTR23                            
GRSG R18 R26 R34 R35 R36 R39 R43 R46 R52 R55 R58 R60 R61 R62 R66 R67 R71 R73 R81 R93 R97 R102 R105 R107 R110 R116 R118 R121 R122 R125 R144 R147 R151 R158 R159 R160 R161 R162 R163 R166 R167 GTR6 GTR12
GRSP R11 R12 R14 R16 R17 R21 R22 R25 R29 R32 R33 R42 R44 R80 R94 R95 R100 R114 R127 R129 R134 R135 R136 R137 R145 R146 R153 GTR1 GTR7 GTR9 GTR13 GTR14 GTR20                    
GRVA R13 R13H R78 R79 R89 R90 R130 R131 R139 R140 R152 R155 R156 R157 GTR3 GTR8                                                      

Status of all WP29 Regulations and GTRs

Work in progress, to be validated after completion of screening for all Regulations and GTRs

Examples of high level issues

Vehicle categories: current categories do not reflect the diversity of use cases for automated driving:

Dual mode vs fully automated

Carrying occupants vs freight only

Supervision inside vehicle vs remote supervision

Telltales & warning signals: a standardised way to share information is necessary: what information is relevant to whom (passenger, occupant in driver seat, remote supervisor…)

Test mode: might be necessary for certain Regulations

9

GRBP (SIG AVRS): Jan Sybren BOERSMA (NL) [email protected]

GRE (TF AVSR): Karl MANZ (DE) [email protected]

GRPE (TBD) - Point of contect:Niels DEN OUDEN (NL) [email protected]

GRSG (TF AVRS): Hans LAMMERS (NL) [email protected]

GRSP (TF AVRS): Rudolf GERLACH (DE) [email protected]

GRVA (TF FADS): Romain PESSIA (FR) and Linlin ZHANG (CN)

[email protected] [email protected]

10

Contact information

The GRVA TF can be used as a point of contact for questions related to automated driving (definitions, use cases etc.)

10

Review of UN Regulations and GTRs on their fitness for ADS

1

Transmitted by the representative of France on behalf of the chairs of the WP29 screening taskforces

Informal document WP.29-189-20 189th WP29, 7–9 March 2023 Agenda item 8.6.2.

• At its 186th session in March 2022, WP.29 requested all its subsidiary working parties to perform a screening of the UN Regulations and Global Technical Regulations (GTR) of relevance regarding their fitness for Automated Driving Systems (ADS) until March 2023.

• At its 14th session in September 2022, GRVA gave additional guidance as to what kind of automated vehicles to consider, etc. (informal document GRVA-14-54r1)

Pending request for deadline extension until June 2023 to give additional time to GRs with staggered session schedules

Mandate from WP.29

2

Common work method across GRs

The taskforces of the different GRs have agreed to work on the same deliverables:

• High-level summaries for each Regulation

• Comprehensive files for the detailed screening

• A “whitebook” for handling automated driving when drafting new Regulations

3 The review does not include definitive solutions for changing Regulations, but taskforces may offer suggestions (amendments, new Regulations, new vehicle categories…)

This template can both serve as a preliminary analysis of a Regulation before its detailed review, and as a final report to give a summary of the detected issues and possible options

High-level assessment

4

Detailed screening (1/2)

• Points of interest: • Use cases: full automation, dual-mode, no occupants, etc. • Possible approaches: amending Regulations, drafting specific

Regulations for automated vehicles, creating new vehicle categories, etc. • Consider both explicit and implicit concepts

5

Online collaborative environment and example of a detailed review for a GRVA Regulation

Detailed screening (2/2)

6

As of early March 2023: GRBP: Detailed screening ready to start GRE: Detailed screening and deliverables done for R48 GRPE: First meeting to take place soon GRSG: Detailed screening in progress (55% done) GRSP: Detailed screening in progress (80% done) GRVA: Detailed screening in progress (75% done)

7

Status of the screening taskforces

High level assessment

Detailed screening

Drafting deliverables

GRSG GRVA GRSPGRBPGRPE GRE

Not all taskforces might be able to get validation from their GR before the June WP29 session

8

R89 Not yet screened

R30 Applicable (or mostly applicable)

R12 Not applicable (or not applicable except in dual mode)

R13 Work needed on the Regulation / new Regulation needed

R158 Not applicable or mostly not applicable AND amendments or new Regulation needed

R48 Applicable or mostly applicable AND amendments or new Regulation needed

GRBP

R 9

R 2 8

R 3 0

R 4 1

R 5 1

R 5 4

R 5 9

R 6 3

R 6 4

R 7 5

R 9 2

R 1 0 6

R 1 0 8

R 1 0 9

R 1 1 7

R 1 2 4

R 1 3 8

R 1 4 1

R 1 4 2

R 1 6 4

R 1 6 5

G T R 1 6

GRE R 1 0

R 3 7

R 4 8

R 5 3

R 7 4

R 8 6

R 9 9

R 1 2 8

R 1 4 8

R 1 4 9

R 1 5 0

GRPE

R 2 4

R 4 0

R 4 7

R 4 9

R 6 8

R 8 3

R 8 4

R 8 5

R 9 6

R 1 0 1

R 1 0 3

R 1 1 5

R 1 2 0

R 1 3 2

R 1 3 3

R 1 4 3

R 1 5 4

G T R 2

G T R 4

G T R 5

G T R 1 0

G T R 1 1

G T R 1 5

G T R 1 7

G T R 1 8

G T R 1 9

G T R 2 1

G T R 2 2

G T R 2 3

GRSG

R 1 8

R 2 6

R 3 4

R 3 5

R 3 6

R 3 9

R 4 3

R 4 6

R 5 2

R 5 5

R 5 8

R 6 0

R 6 1

R 6 2

R 6 6

R 6 7

R 7 1

R 7 3

R 8 1

R 9 3

R 9 7

R 1 0 2

R 1 0 5

R 1 0 7

R 1 1 0

R 1 1 6

R 1 1 8

R 1 2 1

R 1 2 2

R 1 2 5

R 1 4 4

R 1 4 7

R 1 5 1

R 1 5 8

R 1 5 9

R 1 6 0

R 1 6 1

R 1 6 2

R 1 6 3

R 1 6 6

R 1 6 7

G T R 6

G T R 1 2

GRSP

R 1 1

R 1 2

R 1 4

R 1 6

R 1 7

R 2 1

R 2 2

R 2 5

R 2 9

R 3 2

R 3 3

R 4 2

R 4 4

R 8 0

R 9 4

R 9 5

R 1 0 0

R 1 1 4

R 1 2 7

R 1 2 9

R 1 3 4

R 1 3 5

R 1 3 6

R 1 3 7

R 1 4 5

R 1 4 6

R 1 5 3

G T R 1

G T R 7

G T R 9

G T R 1 3

G T R 1 4

G T R 2 0

GRVA R 1 3

R 1 3 H

R 7 8

R 7 9

R 8 9

R 9 0

R 1 3 0

R 1 3 1

R 1 3 9

R 1 4 0

R 1 5 2

R 1 5 5

R 1 5 6

R 1 5 7

G T R 3

G T R 8

Status of all WP29 Regulations and GTRs

Work in progress, to be validated after completion of screening for all Regulations and GTRs

Examples of high level issues

• Vehicle categories: current categories do not reflect the diversity of use cases for automated driving:

• Dual mode vs fully automated • Carrying occupants vs freight only • Supervision inside vehicle vs remote supervision

• Telltales & warning signals: a standardised way to share information is necessary: what information is relevant to whom (passenger, occupant in driver seat, remote supervisor…)

• Test mode: might be necessary for certain Regulations

9

GRBP (SIG AVRS): Jan Sybren BOERSMA (NL) [email protected]

GRE (TF AVSR): Karl MANZ (DE) [email protected]

GRPE (TBD) - Point of contect:Niels DEN OUDEN (NL) [email protected]

GRSG (TF AVRS): Hans LAMMERS (NL) [email protected]

GRSP (TF AVRS): Rudolf GERLACH (DE) [email protected]

GRVA (TF FADS): Romain PESSIA (FR) and Linlin ZHANG (CN) [email protected] [email protected]

10

Contact information

  • Review of UN Regulations and GTRs on their fitness for ADS
  • Slide Number 2
  • Slide Number 3
  • Slide Number 4
  • Slide Number 5
  • Slide Number 6
  • Slide Number 7
  • Slide Number 8
  • Examples of high level issues
  • Slide Number 10

-- Presentation

Languages and translations
English

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

A New Definition and Measurement of Extreme Poverty Michaël Sicsic (Insee)

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Introduction

– Poverty definitions harmonized at the European level are based either

● on low income ● on the existence of several material or social deprivations

– But poverty is multidimensional, and there is no shared definition of extreme poverty, despite its importance in the social and policy debate.

⇒ How to measure extreme poverty ?

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Methodology for people living in ordinary housing

We define extreme poverty as the combination of:

– Low income : ● Household standard of living below 50% of the median ● Using administrative data in France (in SILC)

– Severe material and social deprivation in daily life: ● The individual suffers at least 7 basic deprivations from a

questionnaire of 13 deprivations relating to living conditions ● Using SILC data

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Methodology for people living in ordinary housing

Share of the French population concerned in 2018 according to the definition of poverty

Champ : people living in ordinary housing in France except Mayotte Source : Insee, enquête Statistiques sur les ressources et les conditions de vie (SRCV) 2018

Severe monetary poverty: standard of living below 50% of the median standard of living (930 euros per month)

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Methodology for people living in ordinary housing

Share of the French population concerned in 2018 according to the definition of poverty

Severe monetary poverty: standard of living below 50% of the median standard of living (930 euros per month)

7 material and social deprivations among 13

Champ : people living in ordinary housing in France except Mayotte Source : Insee, enquête Statistiques sur les ressources et les conditions de vie (SRCV) 2018

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Methodology for people living in ordinary housing

Share of the French population concerned in 2018 according to the definition of poverty

7 material and social deprivations among 13

Champ : people living in ordinary housing in France except Mayotte Source : Insee, enquête Statistiques sur les ressources et les conditions de vie (SRCV) 2018

Severe monetary poverty: standard of living below 50% of the median standard of living (930 euros per month)

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Better measure of extreme poverty than a standard of living threshold of 40% of the median

The threshold at 40% of the median does not make it possible to target a more intense, multidimensional and more persistent poverty than income poverty at the 60% threshold the extreme poverty measure does⇒

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Methodology for for people living in non ordinary housing

Different methods and sources used for people in non ordinary housing :

– People in mobile homes & homeless ● Using census and specific survey

– People living in community housing ● Elderly in retirement homes: CARE‑Institutions survey ● Young in a university residence or hostel: ENRJ survey ● Others (workers' hostels, housing for persons in need of medical

care, military establishments): gender and age poverty rates calculated in the main data on poverty are applied to the gender and age structure of each community (census)

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

About 2 million people in extreme poverty in 2018 in France

– 1,9 million people living in extreme poverty ● 1,8 million in ordianry housing ● 153 000 homeless or in mobile home

– 165 000 people in communities likely to be in extrem poverty

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Who are they ? (in ordinary housing)

● 35% are children ; 7% are over 65 years old

● Among adults living in ordinary housing and experiencing severe poverty, women are more numerous

● 25% of people in extreme poverty live in a single-parent family (compared to 10% in the general population)

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Very low resources

– Only 13% of people in extreme poverty are no longer in poverty 3 years later

– Half of the people living in extreme poverty have a standard of living below 800 euros

– 48% have no savings

– 79% do not own a home

– More than 40% of adults living in extreme poverty cannot ask for financial or material assistance from a relative

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Extreme poverty among people in ordinary housing has been relatively stable since 2010

The previous definition of material and social deprivations allows us to compute extreme poverty indicator before 2013

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Extreme poverty very low in Nordic countries

● High in Romania and Bulgaria (around 10%).

● France ranks in intermediate position

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Conclusion

● A new measure of extreme poverty, which better reflects extreme poverty than a threshold at 40%

● The innovation is threefold: ● it provides for the first time figures and characteristics of people in

extreme poverty, ● it expends the analysis, beyond an income approach ● it uses different statistical sources in order to encompass the whole

French population, not only people living in ordinary households.

● In our study, France ranks in intermediate position and extreme poverty is very low in Nordic countries. But this European comparison is computed only for people living in ordinary housing

⇒ It would be thus interesting to know more about extreme poverty in other countries.

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Reference

https://www.insee.fr/fr/statistiques/5371273?sommaire=5371304

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

THANK YOU !

APPENDIX

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Very frequent deprivation

Champ : Personnes vivant en logement ordinaire en France hors Mayotte Source : Insee, enquête Statistiques sur les ressources et les conditions de vie (SRCV) 2018

GROUP OF EXPERTS ON MEASURING POVERTY AND INEQUALITY – EXTREME POVERTY 8-9 DEC. 2022

Extreme poverty often persists over a long time in France

Only 13% of people in extreme poverty are no longer in poverty 3 years later

  • Diapo-Titre
  • Slide 2
  • Slide 3
  • Slide 4
  • Slide 5
  • Slide 6
  • Slide 7
  • Slide 8
  • Slide 9
  • Slide 10
  • Slide 11
  • Slide 12
  • Slide 13
  • Slide 14
  • Slide 15
  • Slide 16
  • Slide 17
  • Slide 18

(France) Automated and connected road transport - regulatory framework in France - Latest update

Languages and translations
English

Automated and connected road transport

Regulatory framework in France

Latest developments

Website : https://www.ecologie.gouv.fr/en/automated-vehicles

Transmitted by the expert from France Informal document GRVA-14-22 14th GRVA, 26–30 September 2022

Provisional agenda item 11

1

National regulatory framework for deployment

2019 Mobility Law + 2021 Ordinance and Decree + 3 orders (intervention operator and approved qualified bodies)

Definitions

automation = delegation to a driving system (partial if handover requests anytime)

automated road transport system (ARTS) = automated vehicles + remote capabilities

remote intervention = capabilities to order and acquit automated manoeuvers

Liability : automated system responsible if active (unless driver fails to takeover)

Autorisation process for ARTS (highly or totally automated vehicles integrated in a technical system, deployed on predefined route or zone)

Vehicle

System ↔ Zone ↔ Operation

Service

Type approval Safety demonstration + Third party advice Commissioning Monitoring

Entry into force : 1st September 2022

Préciser différence entre highly et totally

8.1. Véhicule partiellement automatisé : véhicule équipé d'un système de conduite automatisé exerçant le contrôle dynamique du véhicule dans un domaine de conception fonctionnelle particulier, devant effectuer une demande de reprise en main pour répondre à certains aléas de circulation ou certaines défaillances pendant une manœuvre effectuée dans son domaine de conception fonctionnelle ; 8.2. Véhicule hautement automatisé : véhicule équipé d'un système de conduite automatisé exerçant le contrôle dynamique d'un véhicule dans un domaine de conception fonctionnelle particulier, pouvant répondre à tout aléa de circulation ou défaillance, sans exercer de demande de reprise en main pendant une manœuvre effectuée dans son domaine de conception fonctionnelle. Ce véhicule peut être intégré dans un système technique de transport routier automatisé tel que défini au 1° de l'article R. 3151-1 du code des transports ; 8.3. Véhicule totalement automatisé : véhicule équipé d'un système de conduite automatisé exerçant le contrôle dynamique d'un véhicule pouvant répondre à tout aléa de circulation ou défaillance, sans exercer de demande de reprise en main pendant une manœuvre dans le domaine de conception technique du système technique de transport routier automatisé auquel ce véhicule est intégré, tels que définis aux 1° et 4° de l'article R. 3151-1 du code des transports

2

National policy for deployment

Order of 2 August 2022 implementing Article R. 3152-3 of the Transport Code on the authorisation of remote participants in road transport systems

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046151685

Order of 2 August 2022 in application of article R. 3152-30 of the transport code, relating to the approval procedure for qualified organisations

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046216833

Order of 5 August 2022 taken in application of article R. 3152-24 of the transport code relating to the content of the opinions of approved qualified organisations

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046174400

Decree No. 2022-1034 of July 21, 2022 publishing the amendment to the Vienna International Convention on Road Traffic of 8 November 1968, adopted in Geneva on 14 January 2022

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046081252

3

French safety demonstration architecture

Vehicle

Technical system

= vehicle + equipments + control center

Approved by a type approval authority

Approved qualified third party advice

Decision of the service organizer

System

= technical system + route

+ operating and maintenance rules

In-service operation

Monitoring + Audit

Commissioning

Fields

Validation approach

1.

2.

3.

4.

5.

4

4

FR and UN/EU regulations

Vehicle(s)

System

Service

including remote intervention, connectivity, …

+ external capabilities

Validation architecture

in a predefined route / zone

Vehicle type approval

UN or EU Regulation for vehicles equipped with automated driving systems (ADS) international categories (M and N)

French order on national categories (urban shuttles)

Technical system validation

If the system includes a remote operator, UN or EU ADS regulation to specifie capabilities, comparable to the French ARTS decree

French ARTS decree sets procedures and roles

ARTS validation

1.

2.

3.

Question sur bloc 1 « external capabilities » : capabilities of the vehicle to receive and emit signal to infrastructure (lights, supervision center, …) or other vehicles to extend capabilities

UN or EU regulation to specifie capabilities : explain

5

FR – UN/EU : use cases covered by regulation

French decree (system+service-oriented) UN or EU ADS Regulation (vehicle-oriented) WP29/GRVA FRAV work
Automated road transport system (ARTS) Highly or fully automated vehicles and their technical installations (technical system) Deployed in a predefined route or zone for a transport service, both public and private For the carriage of passengers NB : freight and logistics = forthcoming Vehicles type approval Highly or fully automated vehicles, including dual modes vehicles Fully automated vehicles designed and constructed for use on a predined area or on a predefined route with fixed start and end points of a journey/trip For the carriage of passengers or goods

6

FR – UN/EU : safety requirements and demonstration

UN VMAD or EU ADS regulation sets requirements on safety, demonstration methods (tests) and documentation for automated vehicles

FR regulation sets

drivers / remote operator roles and responsabilities

general safety requirements on automated systems

requirements on safety demonstration procedures

Safety demonstration guidance will need to be developped accordingly, e.g :

Scenario-based approach for validation

Reference safety target

Remote intervention functions characterisation

Route or zone characterisation

….

Safety standards for automated transport systems 1/3: published documents

Theme / title Document State Remarks (EN versions for recent methodological documents)
Horizontal regulation - use case approach Methodological document DGITM Published 2017 https://www.ecologie.gouv.fr/sites/default/files/DGITM_Automated-vehicle-horizontal-regulation-2017-EN.pdf
New safety validation methods Methodological document DGITM-PFA Published2020 https://www.ecologie.gouv.fr/sites/default/files/DGITM_Nouvelle-approche-validation-systemes-avril-2019-EN.pdf
Human/system articulation: manoeuvre approach Methodological document DGITM Published 2020 https://www.ecologie.gouv.fr/sites/default/files/DGITM_AD-articulation-roles-systemes-octobre-2020-EN.pdf
Characterisation of the routes: division into sections Research report UGE-STRMTG Published 2021 https://www.ecologie.gouv.fr/sites/default/files/DGITM_STPA-analyse-securite-parcours-mars-2021-EN.pdf
Application of the GAME approach Implementation Guide STRMTG Published 2021 https://www.ecologie.gouv.fr/sites/default/files/DGITM_Guide-application-STRA-principe-GAME-EN.pdf
Scenario-based validation approach Methodological document DGITM Published 2022 https://www.ecologie.gouv.fr/sites/default/files/DGITM_Approche-par-scenarios-fevrier-2022-EN.pdf

Safety standards for automated transport 2/3: methodological documents to come

Theme / title Document Deadline Remarks
Scenario validation: interactions with law enforcement and priority vehicles Methodological framework document DGITM Published 2022 https://www.ecologie.gouv.fr/sites/default/files/DGITM_Approche-par-scenarios-fevrier-2022-EN.pdf
Accidentology references for safety objectives Research report DGITM Published 2022 https://www.ecologie.gouv.fr/sites/default/files/DGITM-Rapport_accidentalite-juillet_2022-EN.pdf
ODD definition Methodological framework document DGITM Published 2022 https://www.ecologie.gouv.fr/sites/default/files/DGITM-ODD_dezscriptors-juin_2022-EN.pdf
Route characterisation Methodological framework document DGITM Q3 2022 Will incorporate elements from the ODD document and the "route cutting" study report. To be integrated into the STRMTG GAME safety demonstration guide (see below)
Characterisation of remote intervention functions Methodological framework document DGITM Q3 2022 To be integrated into the STRMTG GAME safety demonstration guide (see below)
Scenario generation Methodological framework document DGITM-SystemX Published 2022 https://www.ecologie.gouv.fr/sites/default/files/DGITM-L1-septembre_2022-EN.pdf
Scenario rating Q3 2022 To be included in the STRMTG GAME safety demonstration guide (see below)
Use of scenarios in the safety demonstration Q4 2022

Safety standards for automated transport systems 3/3: forthcoming guidance

Theme / title Document Deadline Remarks
GAME safety demonstration Definitions from the glossary Safety demonstration method, including: Pathway decomposition Functional decomposition Hazards and high level requirements Implementation Guide STRMTG Published https://www.ecologie.gouv.fr/sites/default/files/DGITM-GuideD%C3%A9monstration_GAME_STRA_v1_31aout2022-EN.pdf
Additions to be considered in the short term: Use of scenarios and safety analysis of routes STRMTG 2023
RETEX - Handling of events Implementation Guide STRMTG 2023
Cyber requirements Implementation Guide STRMTG Q4 2022 NB: The STRMTG is aiming for a 'martyr' version of this guide by Q3 2022
Missions of the accredited qualified bodies Implementation Guide STRMTG Q3 2022

Automated and connected road transport

Regulatory framework in France Latest developments

Website : https://www.ecologie.gouv.fr/en/automated-vehicles

Transmitted by the expert from France Informal document GRVA-14-22 14th GRVA, 26–30 September 2022 Provisional agenda item 11

National regulatory framework for deployment • 2019 Mobility Law + 2021 Ordinance and Decree + 3 orders (intervention

operator and approved qualified bodies) • Definitions

• automation = delegation to a driving system (partial if handover requests anytime) • automated road transport system (ARTS) = automated vehicles + remote

capabilities • remote intervention = capabilities to order and acquit automated manoeuvers

• Liability : automated system responsible if active (unless driver fails to takeover)

• Autorisation process for ARTS (highly or totally automated vehicles integrated in a technical system, deployed on predefined route or zone)

Vehicle System ↔ Zone ↔ Operation Service Type approval Safety demonstration + Third party advice Commissioning Monitoring

 Entry into force : 1st September 2022

National policy for deployment

• Order of 2 August 2022 implementing Article R. 3152-3 of the Transport Code on the authorisation of remote participants in road transport systems

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046151685

• Order of 2 August 2022 in application of article R. 3152-30 of the transport code, relating to the approval procedure for qualified organisations

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046216833

• Order of 5 August 2022 taken in application of article R. 3152-24 of the transport code relating to the content of the opinions of approved qualified organisations

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046174400

• Decree No. 2022-1034 of July 21, 2022 publishing the amendment to the Vienna International Convention on Road Traffic of 8 November 1968, adopted in Geneva on 14 January 2022

https://www.legifrance.gouv.fr/jorf/id/JORFTEXT000046081252

French safety demonstration architecture

Vehicle

Technical system = vehicle + equipments +

control center

Approved by a type approval authority

Approved qualified third party advice

Decision of the service organizer

System = technical system + route

+ operating and maintenance rules

In-service operation Monitoring + Audit

Commissioning

Fields Validation approach

1.

2.

3.

4.

5.

4

FR and UN/EU regulations

Vehicle(s)

System

Service

including remote intervention,

connectivity, …

+ external capabilities

Validation architecture

in a predefined route / zone

Vehicle type approval • UN or EU Regulation for vehicles

equipped with automated driving systems (ADS) international categories (M and N)

• French order on national categories (urban shuttles)

Technical system validation

• If the system includes a remote operator, UN or EU ADS regulation to specifie capabilities, comparable to the French ARTS decree

• French ARTS decree sets procedures and roles

ARTS validation

1.

2.

3.

FR – UN/EU : use cases covered by regulation

French decree (system+service- oriented)

UN or EU ADS Regulation (vehicle-oriented) WP29/GRVA FRAV work

Automated road transport system (ARTS) • Highly or fully automated vehicles and

their technical installations (technical system)

• Deployed in a predefined route or zone for a transport service, both public and private

• For the carriage of passengers • NB : freight and logistics =

forthcoming

Vehicles type approval • Highly or fully automated vehicles, including

dual modes vehicles

• Fully automated vehicles designed and constructed for use on a predined area or on a predefined route with fixed start and end points of a journey/trip

• For the carriage of passengers or goods

FR – UN/EU : safety requirements and demonstration

UN VMAD or EU ADS regulation sets requirements on safety, demonstration methods (tests) and documentation for automated vehicles

FR regulation sets • drivers / remote operator roles and responsabilities • general safety requirements on automated systems • requirements on safety demonstration procedures

Safety demonstration guidance will need to be developped accordingly, e.g : • Scenario-based approach for validation • Reference safety target • Remote intervention functions characterisation • Route or zone characterisation • ….

Safety standards for automated transport systems 1/3: published documents

Theme / title Document State Remarks (EN versions for recent methodological documents)

Horizontal regulation - use case approach

Methodological document DGITM

Published 2017

https://www.ecologie.gouv.fr/sites/default/files/DGITM_Automated-vehicle- horizontal-regulation-2017-EN.pdf

New safety validation methods

Methodological document DGITM-PFA

Published 2020

https://www.ecologie.gouv.fr/sites/default/files/DGITM_Nouvelle-approche- validation-systemes-avril-2019-EN.pdf

Human/system articulation: manoeuvre approach

Methodological document DGITM

Published 2020

https://www.ecologie.gouv.fr/sites/default/files/DGITM_AD-articulation-roles- systemes-octobre-2020-EN.pdf

Characterisation of the routes: division into sections

Research report UGE-STRMTG

Published 2021

https://www.ecologie.gouv.fr/sites/default/files/DGITM_STPA-analyse- securite-parcours-mars-2021-EN.pdf

Application of the GAME approach

Implementation Guide STRMTG

Published 2021

https://www.ecologie.gouv.fr/sites/default/files/DGITM_Guide-application- STRA-principe-GAME-EN.pdf

Scenario-based validation approach

Methodological document DGITM

Published 2022

https://www.ecologie.gouv.fr/sites/default/files/DGITM_Approche-par- scenarios-fevrier-2022-EN.pdf

Safety standards for automated transport 2/3: methodological documents to come

Theme / title Document Deadline Remarks Scenario validation: interactions with law enforcement and priority vehicles

Methodological framework document DGITM

Published 2022

https://www.ecologie.gouv.fr/sites/default/files/DGITM_A pproche-par-scenarios-fevrier-2022-EN.pdf

Accidentology references for safety objectives Research report DGITM Published

2022 https://www.ecologie.gouv.fr/sites/default/files/DGITM- Rapport_accidentalite-juillet_2022-EN.pdf

ODD definition Methodological framework document DGITM

Published 2022

https://www.ecologie.gouv.fr/sites/default/files/DGITM- ODD_dezscriptors-juin_2022-EN.pdf

Route characterisation Methodological framework document DGITM

Q3 2022 Will incorporate elements from the ODD document and the "route cutting" study report. To be integrated into the STRMTG GAME safety demonstration guide (see below)

Characterisation of remote intervention functions

Methodological framework document DGITM

Q3 2022 To be integrated into the STRMTG GAME safety demonstration guide (see below)

Scenario generation Methodological framework document DGITM-SystemX

Published 2022

https://www.ecologie.gouv.fr/sites/default/files/DGITM- L1-septembre_2022-EN.pdf

Scenario rating Q3 2022 To be included in the STRMTG GAME safety demonstration guide (see below)Use of scenarios in the safety Q4 2022

Safety standards for automated transport systems 3/3: forthcoming guidance

Theme / title Document Deadline Remarks

GAME safety demonstration • Definitions from the glossary • Safety demonstration method, including:

• Pathway decomposition • Functional decomposition • Hazards and high level requirements

Implementation Guide STRMTG Published

https://www.ecologie.gouv.fr/sites/default/f iles/DGITM- GuideD%C3%A9monstration_GAME_ST RA_v1_31aout2022-EN.pdf

Additions to be considered in the short term: • Use of scenarios and safety analysis of

routes

STRMTG 2023

RETEX - Handling of events Implementation Guide STRMTG 2023

Cyber requirements Implementation Guide STRMTG Q4 2022 NB: The STRMTG is aiming for a 'martyr'

version of this guide by Q3 2022

Missions of the accredited qualified bodies Implementation Guide STRMTG Q3 2022

  • Slide Number 1
  • Slide Number 2
  • Slide Number 3
  • Slide Number 4
  • Slide Number 5
  • Slide Number 6
  • Slide Number 7
  • Safety standards for automated transport systems�1/3: published documents
  • Safety standards for automated transport�2/3: methodological documents to come
  • Safety standards for automated transport systems�3/3: forthcoming guidance

(France) Proposal for amendments to ECE/TRANS/WP.29/GRVA/2022/22

Languages and translations
English

Submitted by the expert from France Informal document GRVA-14-21 14th GRVA, 26–30 September 2022

Provisional agenda item 7

Proposal for amendments to ECE/TRANS/WP.29/GRVA/2022/22 (Proposal for a supplement to the 02 series of amendments to UN Regulation No. 131)

This proposal aims to amend the French and English versions of ECE/TRANS/WP.29/GRVA/2022/22 submitted by the experts from the Informal Working Group on Advanced Emergency Braking Systems of Heavy-Duty Vehicles (AEBS-HDV).

The wording of the English proposal conduct to a confused translation in French.

The additions and deletions are shown in red bold text to facilitate identification of these proposed changes within the previous proposal.

I. Proposal

The English version of ECE/TRANS/WP.29/GRVA/2022/22 reads:

Paragraph 5.4.1.1., amended to read:

“5.4.1.1. The AEBS function shall be automatically reinstated at the initiation of each new ignition engine start/run cycle engine start or run cycle, as relevant. This requirement does not apply when a new engine start/run cycle engine start or run cycle, as relevant, is performed automatically, e.g. the operation of a stop/start system.

The French version ECE/TRANS/WP.29/GRVA/2022/22 reads:

Paragraphe 5.4.1.1 , lire :

« 5.4.1.1 La fonction AEBS doit être réactivée automatiquement chaque fois que le contacteur de mise en marche du véhicule est actionné, à chaque nouveau démarrage du moteur à chaque nouveau démarrage du moteur ou à chaque nouveau cycle de roulage selon le cas.

Cette prescription n’est pas applicable aux redémarrages automatiques du moteur liés, par exemple, au fonctionnement d’un système arrêt-démarrage automatique.

Cette prescription n’est pas applicable lorsqu’un démarrage du moteur ou un nouveau cycle de roulage, selon le cas, est effectué automatiquement, comme par exemple, par le fonctionnement d’un système arrêt-démarrage automatique. ».

II. Justification

The French and English versions of ECE/TRANS/WP29/GRVA/2022/22 proposed for consideration at this 14th session of GRVA do not reflect the conclusion of last GRVA session concerning the interpretation of chapter 5.4.1.1..

During the discussion in GRVA 13th session of informal document GRVA-13-33 - (OICA) AEBS Regulation - Common interpretation of “new engine start/run cycle”, it was recognized that the wording of “new engine start/run cycle” may conduct to confusion, and that interpretation would follow the first proposal of OICA. The report of GRVA 13th session (ECE/TRANS/WP.29/GRVA/13) referred to in paragraph 78 mentioned : “GRVA supported the interpretation 1 referred to in that document and requested that proper attention is given to the French translation of the corresponding text in the Regulation”..

However, French version did not reflect this interpretation.

That’s why France suggests to amend both versions in order to be sure that the requirement cannot be interpreted in another way than what was agreed during 13th session of GRVA.

If this solution is agreed by GRVA, France suggests to also correct UN Regulation No. 152 in the same way.

Submitted by the expert from France Informal document GRVA-14-21 14th GRVA, 26–30 September 2022

Provisional agenda item 7

Proposal for amendments to ECE/TRANS/WP.29/GRVA/2022/22 (Proposal for a supplement

to the 02 series of amendments to UN Regulation No. 131)

This proposal aims to amend the French and English versions of ECE/TRANS/WP.29/GRVA/2022/22 submitted by the experts from the Informal Working Group on Advanced Emergency Braking Systems of Heavy-Duty Vehicles (AEBS-HDV).

The wording of the English proposal conduct to a confused translation in French.

The additions and deletions are shown in red bold text to facilitate identification of these proposed changes within the previous proposal.

I. Proposal

The English version of ECE/TRANS/WP.29/GRVA/2022/22 reads :

Paragraph 5.4.1.1., amended to read:

“5.4.1.1. The AEBS function shall be automatically reinstated at the initiation of each new ignition engine start/run cycle engine start or run cycle, as relevant. This requirement does not apply when a new engine start/run cycle engine start or run cycle, as relevant, is performed automatically, e.g. the operation of a stop/start system.”

The French version ECE/TRANS/WP.29/GRVA/2022/22 reads :

Paragraphe 5.4.1.1, lire :

« 5.4.1.1 La fonction AEBS doit être réactivée automatiquement chaque fois que le contacteur de mise en marche du véhicule est actionné, à chaque nouveau démarrage du moteur à chaque nouveau démarrage du moteur ou à chaque nouveau cycle de roulage selon le cas.

Cette prescription n’est pas applicable aux redémarrages automatiques du moteur liés, par exemple, au fonctionnement d’un système arrêt-démarrage automatique.

Cette prescription n’est pas applicable lorsqu’un démarrage du moteur ou un nouveau cycle de roulage, selon le cas, est effectué automatiquement, comme par exemple, par le fonctionnement d’un système arrêt-démarrage automatique. ».

II. Justification

The French and English versions of ECE/TRANS/WP29/GRVA/2022/22 proposed for consideration at this 14th session of GRVA do not reflect the conclusion of last GRVA session concerning the interpretation of chapter 5.4.1.1.. During the discussion in GRVA 13th session of informal document GRVA-13-33 - (OICA) AEBS Regulation - Common interpretation of “new engine start/run cycle”, it was recognized that the wording of “new engine start/run cycle” may conduct to confusion, and that interpretation would follow the first proposal of OICA. The report of GRVA 13th session (ECE/TRANS/WP.29/GRVA/13) referred to in paragraph 78 mentioned : “GRVA supported the interpretation 1 referred to in that document and requested that proper attention is given to the French translation of the corresponding text in the Regulation”.. However, French version did not reflect this interpretation. That’s why France suggests to amend both versions in order to be sure that the requirement cannot be interpreted in another way than what was agreed during 13th session of GRVA. If this solution is agreed by GRVA, France suggests to also correct UN Regulation No. 152 in the same way.

- Presentation

Languages and translations
English

Census in France: impact of the 2021 census delay on outcomes Muriel Barlet

Head of demography department

en partenariat

avec votre commune

• Principles of the rolling census in France

• Why did we delay the 2021 data collection ?

• How did we manage to produce population estimates without data collection?

• What do we learn for the future?

Census in France: impact of the 2021 census delay on outcomes

en partenariat

avec votre commune

Principles of the rolling census in France

en partenariat

avec votre commune

⚫Principles of the rolling census in France

• A process based on 3 kinds of data :

− Data from the enumeration of a part of the population each year (Annual census survey ACS)

− Data from the register of residential buildings : the sampling frame for large municipalities

− Local tax data on dwellings and their occupants

− yield detailed results every year at all relevant geographical levels (from country to municipalities, even neighbourhood level)

en partenariat

avec votre commune

⚫Why are population estimates from the census very important ?

Population estimates produced from the census are “official” municipal populations :

⚫ It is a reference for about 350 measures mentioned in the

law or decrees

⚫ It determines the annual grant from state to municipalities,

size of municipal council, authorization to set up a

pharmacy... Insee had to produce population

estimates every year …. even without a census survey

en partenariat

avec votre commune

⚫How is the annual data collection carried out ?

⚫ In small municipalities (with less than 10,000 inhabitants)

⚫ Every year : 1/5 of municipalities, all the inhabitants

⚫ All the municipalities, all the inhabitants in a five year

cycle ⚫ In large municipalities (10,000 inhabitants and more)

⚫ Every year : 8 % of the inhabitants in each municipality

(sampling in the register of residential buildings)

⚫ 40 % of the inhabitants during a five year cycle

en partenariat

avec votre commune

⚫The organisation of an annual census survey

⚫ 8,000 municipalities are concerned each year (1,000 large, 7,000 small ones)

⚫ 5 billion dwellings and 8 billion inhabitants are enumerated

⚫ 33,000 people are hired by municipalities to carry out the data collection

⚫ 400 people from insee work during the collection period to supervise and control the data collecion

⚫ A very large operation each year

en partenariat

avec votre commune

⚫Reference period ⚫ 1 January of the median year of the five year cycle

⚫ For the quality of population estimates

⚫ For a fair treatment between municipalities

⚫ Populations are disseminated at the end of year N+2

⚫ For example, in December 2022 population at January 1 2020 will be released

en partenariat

avec votre commune

⚫Method for population estimates in small municipalities

⚫ Depends on the date of the last data collection :

⚫ Enumeration before the reference period : population

extrapolation, used growth rate of number of dwellings

in tax data and a decohabitation rate

⚫ Enumeration during reference period : results of

enumeration

⚫ Enumeration after reference period : linear

interpolation between last disseminated population and

enumeration

en partenariat

avec votre commune

⚫Method for population estimates in small municipalities

⚫ RG : rotation group

en partenariat

avec votre commune

⚫Method for population estimates in large municipalities

⚫ Population = multiplication of

⚫ the number of dwellings in the register of residential

buildings

⚫ the average number of persons by dwelling,

estimated from the 5 last annual census surveys

en partenariat

avec votre commune

Why did we delay de 2021 data collection ?

en partenariat

avec votre commune

⚫A worrying health situation

⚫ In autumn 2020 :

⚫ A second lockdown during survey preparation

(November 2020)

⚫ Vaccination not started yet

⚫ Uncertainty about the health situation in early 2021

⚫ Reluctance of some municipalities to carry out the

survey in January-February 2021, despite protocol

adaptation to reduce physical contact

en partenariat

avec votre commune

⚫Three options for the census ⚫ Maintaining the census in January-February 2021 :

dismissed because incompatible with a deteriorate health

situation

⚫ Delaying the census to the spring 2021 : dismissed

because of uncertainty of health situation (a third

lockdown...)

⚫ Postponing the census to 2022 :

⚫ Selected option

⚫ Despite the need to disseminate results each

year

en partenariat

avec votre commune

How did we manage to produce population estimates without data collection ?

en partenariat

avec votre commune

⚫Principles for adaptation of the methods

⚫ In small municipalities :

⚫ Increased use of administrative data

⚫ In large municipalities :

⚫ The register of residential buildings is always available and up

to date: the number of dwellings is known even if the survey is

postponed

⚫ Extension of past trends

en partenariat

avec votre commune

⚫ In small municipalities ⚫ The gap between two surveys is increased by one year : from 5 to 6

years.

⚫ For estimating the population the year the survey is postponed :

extrapolation with tax data.

en partenariat

avec votre commune

⚫ In small municipalities

⚫ Each year :

⚫ Adaptation of the method for only 20 % of municipalities

⚫ Quality of the produced data :

⚫ Simulation of the absence of an ACS in the past and

comparison between the adapted method and the disseminated

data

⚫ The estimated population is very close to disseminated one :

0.05% difference.

⚫ The deviation at the municipal level in absolute terms is less

than 2 % for 92 % of municipalities.

en partenariat

avec votre commune

⚫ In large municipalities ⚫ Two components to the estimate :

⚫ Number of dwellings : always available thanks to the register of

residential buildings

⚫ Average number of persons per dwelling :

⚫ Usually : the average of the last 5 ACS

⚫ Creation of a pseudo 2021 ACS by prolongation of past

trends : rate of main residences, number of persons per

main residences. Calibration of 2016 ACS on 2021 targets.

⚫ Quality :

⚫ For 92 % of municipalities, the population of the adapted

method lies within the 95 % confidence interval of the

disseminated population

en partenariat

avec votre commune

⚫Socio-demographic data ⚫ Despite the postponement of 2021 survey, socio-

demographic data are also available at a geographically

detailed level

⚫ In contrast to the population, socio-demographic structure

is frozen for one year

⚫ It is acceptable, because the aim of census is not to

highlight short term phenomena

en partenariat

avec votre commune

What do we learn for the future ?

en partenariat

avec votre commune

⚫Conclusion ⚫ French rolling census is resilient. ⚫ The availability of data from survey every year and other sources is

essential: register of residential buildings and administrative data

⚫ Producing population estimates one year earlier is possible ⚫ To produce population estimate with one missing ACS is closed to produce

advanced population estimate : the degradation of quality is acceptable

⚫ The postponement allowed more time to prepare the first

survey during the health crisis, in January-February 2022. ⚫ It was a success, with a contained non-response rate : 4,8% (against 4,1%

in 2020).

Russian

Перепись населения во Франции: влияние переносов сроков переписи 2021 года на результаты

Мюриэль Барле

Начальник отдела демографии

en partenariat

avec votre commune

• Принципы проведения скользящей переписи населения во Франции

• Почему мы отложили сбор данных в 2021 году?

• Как нам удалось оценить численность населения без сбора данных?

• Какие уроки мы вынесли на будущее?

Перепись населения во Франции: влияние переносов сроков переписи 2021 года на результаты

en partenariat

avec votre commune

Принципы проведения скользящей переписи населения во Франции

en partenariat

avec votre commune

⚫ Принципы проведения скользящей переписи населения во Франции

• Процесс, опирающийся на три вида данных:

− Данные переписи части населения поступают ежегодно (Ежегодное переписное обследование (ЕПО))

− Данные из Реестра жилого фонда: основа выборки в случае крупных муниципалитетов

− Данные из местных налоговых органов о налогооблагаемом жилом фонде и жильцах

− это то, что позволяет ежегодно получать подробные сведения по всем соответствующим административно-территориальным уровням (от уровня страны до уровня муниципалитетов и даже кварталов)

en partenariat

avec votre commune

⚫ Почему оценки численности населения, получаемые в результате переписи, столь важны?

Оценки численности населения, полученные в результате переписи, представляют собой «официальную» численность населения муниципалитетов:

⚫ На этот показатель ссылаются около 350 мер, предусмотренных в

нормативно-правовой базе.

⚫ Этот показатель определяет размер ежегодного финансирования,

выделяемого муниципалитету из госбюджета, число депутатов

муниципального совета, число выдаваемых в муниципалитете разрешений

на открытие аптек и т.п. Национальный институт статистики и экономических

исследований (НИСЭИ/INSEE) обязан публиковать данные о численности населения ежегодно … даже без проведения переписных обследований

en partenariat

avec votre commune

⚫ Как осуществляется ежегодный сбор данных?

⚫ В небольших муниципалитетах (с населением до 10000 жителей)

⚫ Ежегодно: 1/5 муниципалитетов, охват всех жителей

⚫ Все муниципалитеты, охват всех жителей за

пятилетний цикл ⚫ В крупных муниципалитетах (с населением от 10 000 и более)

⚫ Ежегодно: охват 8% жителей каждого муниципалитета

(выборка формируется из Реестра жилого фонда)

⚫ охват 40% жителей за пятилетний цикл

en partenariat

avec votre commune

⚫ Организация ежегодного переписного обследования

⚫ Ежегодно задействовано 8 000 муниципалитетов (1 000 крупных, 7 000 малых)

⚫ Переписывается 5 млн единиц жилого фонда и 8 млн жителей

⚫ Для сбора данных муниципалитетами нанимается 33 000 переписчиков

⚫ В период сбора данных 400 сотрудников Национального института статистики и экономических исследований (НИСЭИ/INSEE) обеспечивают надзор и контроль за проведением этих работ

⚫ Очень большой объем работ ежегодно

en partenariat

avec votre commune

⚫Базисный период

⚫ Исходной датой отсчета является 01 января медианного года 5-летнего переписного цикла

⚫ Для обеспечения качества демографических оценок

⚫ Для обеспечения справедливого отношения ко всем муниципалитетам

⚫ Демографические данные публикуются в конце второго года после проведения переписного обследования (год N+2)

⚫ Например, в декабре 2022 года демографические данные будут опубликованы по состоянию на 01 января 2020 года

en partenariat

avec votre commune

⚫ Метод оценки численности населения в малых муниципалитетах

⚫ Зависит от того, когда проводился последний сбор данных:

⚫ Перепись до базисного периода: экстраполяция тенденций

изменения численности населения на будущие периоды,

использование сведений налоговых органов о показателях темпов

роста жилого фонда, применение коэффициента «отказа от

совместного проживания» (т.н. «декохабитации»)

⚫ Перепись в течение базисного (учетного) периода: результаты

переписи

⚫ Перепись после базисного периода: линейная интерполяция данных

за период с момента последней публикации переписных данных и до

момента проведения переписи

en partenariat

avec votre commune

⚫Метод оценки численности населения в малых муниципалитетах

⚫ РГ — ротационная группа.

en partenariat

avec votre commune

⚫Метод оценки численности населения в крупных муниципалитетах

⚫ Численность населения = умножение

⚫ количества единиц жилого фонда, взятого из

Реестра жилого фонда, на

⚫ среднее число человек на одну единицу жилого

фонда, рассчитанное по результатам 5 последних

ежегодных переписных обследований

en partenariat

avec votre commune

Почему мы отложили сбор данных в 2021 году?

en partenariat

avec votre commune

⚫ Тревожная ситуация в области здравоохранения

⚫ Осенью 2020 года:

⚫ Второй карантин во время подготовки к проведению обследования

(ноябрь 2020 г.)

⚫ К вакцинации еще не приступили

⚫ Неопределенность в области здравоохранения в начале 2021 года

⚫ Нежелание некоторых муниципалитетов проводить обследование в

январе-феврале 2021 года, несмотря на адаптацию протокола

проведения переписного обследования, предусматривающего

сокращение числа физических контактов между переписчиками и

переписываемыми

en partenariat

avec votre commune

⚫Три варианта проведения переписи

⚫ Соблюдение графика проведения переписи в январе-феврале 2021

года: вариант отклонен из-за ухудшения ситуация в области

здравоохранения

⚫ Перенос переписи на весну 2021 года: вариант отклонен из-за

неопределенности ситуации в области здравоохранения (третий

карантин...)

⚫ Перенос переписи на 2022 год:

⚫ Выбранный вариант

⚫ Несмотря на необходимость ежегодной публикации

результатов переписи

en partenariat

avec votre commune

Как нам удалось оценить численность населения без сбора данных?

en partenariat

avec votre commune

⚫Принципы адаптации методов

⚫ В малых муниципалитетах:

⚫ Более широкое использование административных данных

⚫ В крупных муниципалитетах:

⚫ Реестр жилого фонда всегда доступен и актуализирован:

количество жилых помещений известно даже в случае

переноса обследования

⚫ Экстраполяция наблюдавшихся ранее тенденций на будущие

периоды

en partenariat

avec votre commune ⚫В малых муниципалитетах

⚫ Разрыв между двумя

обследованиями

увеличивается на один год:

с 5 до 6 лет.

⚫ Расчетная оценка

численности населения в

том году, когда

обследование отложено:

экстраполяция тенденций

изменения численности

населения, опираясь на

данные налоговых органов.

en partenariat

avec votre commune

⚫В малых муниципалитетах

⚫ Ежегодно:

⚫ Адаптация метода только в 20% муниципалитетов

⚫ Качество полученных данных:

⚫ Моделирование отсутствия ЕПО в прошлом и сравнение

адаптированного метода с опубликованными данными.

⚫ Расчетная численность населения очень близка к

опубликованному показателю: разница 0,05%.

⚫ Отклонение на уровне муниципалитетов в абсолютном

выражении составляет менее 2% у 92% муниципалитетов.

en partenariat

avec votre commune ⚫В крупных муниципалитетах

⚫ Два компонента оценочного расчета:

⚫ Количество единиц жилого фонда: эти данные всегда доступны благодаря

наличию Реестра жилого фонда

⚫ Среднее количество человек на единицу жилого фонда:

⚫ Обычно: среднее значение из 5 последних ЕПО.

⚫ Проведение псевдо-ЕПО 2021 года с помощью экстраполяции ранее

выявленных тенденций: показатели основных мест жительства и числа

людей, приходящихся на основное место жительства. Сопоставление

данных ежегодного переписного обследования (ЕПО) 2016 года с

данными 2021 года.

⚫ Качество:

⚫ У 92% муниципалитетов численность населения, рассчитанная по

адаптированному методу, находится в пределах 95% доверительного интервала

опубликованных данных о численности населения

en partenariat

avec votre commune

⚫Социально-демографические данные

⚫ Несмотря на перенос сроков проведения переписного обследования

2021 года, социально-демографические данные с административно-

территориальной детализацией также доступны.

⚫ В отличие от показателей населения, социально-демографическая

структура «замораживается» на один год.

⚫ И это приемлемо, потому что цель переписи не в том, чтобы отражать

краткосрочные явления.

en partenariat

avec votre commune

Какие уроки мы вынесли на будущее?

en partenariat

avec votre commune ⚫Заключение

⚫ Французская скользящая перепись носит устойчивый характер.

⚫ Наличие данных ежегодного обследования и данных из других источников

обязательно: Реестр жилого фонда (РЖФ) и административные данные

⚫ Возможно получение расчетной численности населения на год раньше

⚫ В условиях отсутствия данных одного ежегодного переписного обследования

(ЕПО) возможно получение детализированных данных о расчетной численности

населения: снижение качества данных носит приемлемый характер

⚫ Перенос сроков переписи дал больше времени на подготовку первого

переписного обследования во время кризиса в области здравоохранения

в январе-феврале 2022 года.

⚫ Переписное обследование прошло успешно, а доля респондентов, не

прошедших перепись, не сильно возросла: 4,8% (по сравнению с 4,1% в 2020

году).