1 / 27

Anders Jäder and Anders Norberg, Statistics Sweden

A selective editing method considering both suspicion and potential impact, developed and applied to the Swedish foreign trade statistics Topic (ii), WP 12. Anders Jäder and Anders Norberg, Statistics Sweden. The data. Main variables collected monthly: Commodity code (8-digit CN codes)

Télécharger la présentation

Anders Jäder and Anders Norberg, Statistics Sweden

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A selective editing method considering both suspicion and potential impact, developed and applied to the Swedish foreign trade statisticsTopic (ii), WP 12 Anders Jäder and Anders Norberg, Statistics Sweden

  2. The data Main variables collected monthly: Commodity code (8-digit CN codes) Country of dispatch/arrival Quantity (weight and supplementary unit) Invoiced Value 350 000 observations per month

  3. Score function Computed as a weighted geometric mean of measures of Suspicion and Potential impact

  4. Selective editing The 1,500 observations with the highest scores are flagged

  5. Suspicion The difference between Unit priceand the lower/upper quartile, divided by inter-quartiles distance. Logarithmic scale (Euro/Kg)

  6. Potential Impact The difference between Invoiced Value and the median of Unit price multiplied by Quantity(Euro)

  7. Hit rate = 30%

  8. Hit rate=46% Impact=65%

  9. Hit rate=30% Impact=80%

  10. Hit rate=34%Impact=81% Best!

  11. Potential impact The 8-digit commodity codes can be aggregated to 6, 4 and 2-digit commodity codes (CN6, CN4, CN2) and other classifications , e.g. the SITC classification.  Over 10,000 estimates to be computed

  12. Potential impact We have developed a formula with which the impact of an error on the statistics on all aggregation levels and sizes of estimates can be expressed in one single variable.

  13. Potential impact Excel demonstration

  14. Potential impact

  15. Strategy • SCB has saved raw and corrected data for all months since 2000. We analyzed them • New system with parameters • Produce monthly process data for a continuous search of best parameter values Will we be misled when we analyze data that has been flagged by the old method ???

  16. Study • We need many months of historical data – current data is not enough • Homogenous groups – modest demand on number of observations • Computation of median and quartiles weighted by Quantity • Suspicion versus probability of error – transformation of Suspicion

  17. Suspicion versus probability of error Suspicion

  18. Experiences from production Hit rate by variable:

  19. Experiences from production Impact by variable:

  20. Experiences from production - Impact on variable invoiced value:

  21. Thank You!

More Related