1 / 23

A Bayesian mixture model for detecting unusual time trends Modelling burglary counts in Cambridge

A Bayesian mixture model for detecting unusual time trends Modelling burglary counts in Cambridge. Guangquan (Philip) Li 4 th ESRC Research Methods Festival July 5-8, 2010 Joint work with Nicky Best, Sylvia Richardson and Robert Haining. Outline. Motivations

adsila
Télécharger la présentation

A Bayesian mixture model for detecting unusual time trends Modelling burglary counts in Cambridge

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Bayesian mixture model for detecting unusual time trendsModelling burglary counts in Cambridge Guangquan (Philip) Li 4th ESRC Research Methods Festival July 5-8, 2010 Joint work with Nicky Best, Sylvia Richardson and Robert Haining

  2. Outline • Motivations • A Bayesian mixture model for detecting unusual time trends • Preliminary results from analysis of burglary data in Cambridge

  3. Reasons for detecting unusual time trends Highlighting areas deserving of further scrutiny Emergence of local risk factors? Identifying possible risk factors Change of population composition? Modelling Impact of a new policing scheme? Assessing effect of policy Informing policy making

  4. Preventing residential burglary in CambridgePolice research Series paper 108 • The report describes the work of the Domestic Burglary Task Force (DBTF) in Cambridge, which was established to examine the nature of residential burglary in Cambridge and to design and implement initiatives to prevent it. • Analysis of the burglary counts (1993-1994), the DBFT identified the largest ‘hot spot’ in the north of the City, and the two wards which contained the ‘hot spot’, as the targeted area. • After a series of seminars, a number of burglary prevention strategies were identified and implemented. • Question: whether the strategies helped to reduce residential burglary rates?

  5. Trend comparisons Two targeted wards Cambridge city as a whole Need modelling

  6. A Bayesian detection model • We have proposed a detection method that, for each area, provides estimates independently from the common trend component and the area-specific trend component and selects estimates between the two to describe the observed data. • For each area, the posterior probability of selecting the common trend component is used to classify the area/trend as “unusual” or not.

  7. A schematic diagram of the detection model Common spatial pattern Common time trend Area-specific time trends Space-time separable Space-time inseparable Space-Time variations Common spatial pattern Common time trend Area-specific time trends

  8. Specific model components (1) • A conditional autoregressive (CAR) model is used to impose the spatial correlation. Spatial smoothing

  9. Specific model components (2) • A random walk of order 1 is used to define the temporal structure. Temporal smoothing S varies S fixed (>1) Time Non-informative priors are assigned to other parameters in the model t-1 t t+1

  10. Classification • For each area, the posterior mean of zi (denoted by pi) presents evidence for area ito follow the common trend pattern • a small value of pisuggests that the area is unlikely to follow the common trend • The area is unusual if the above probability is less than some threshold, i.e.,

  11. The idea of classification Unusual Usual • Choose cutoff to achieve pre-specified false detection rate (FDR) • Cutoff values cannot be obtained using conventional approaches such as Storey 2002 since null hypothesis is specific to each areas. • We have proposed a novel simulation approach to obtain area-specific cutoffs so that we can maximize the sensitivity while controlling for FDR. pi Prob (An area follows the common trend pattern) ⌘

  12. A simulation study

  13. Simulation results 15 (out of 354) areas were selected according to the population sizes and spatial risks and assigned the unusual trend. Scenario 1 Scenario 1 S-vary S=5 SaTScan Comparing the gain/loss of sensitivity amongst the following 4 models 1. S-vary 2. S=2 (the optimal setting, the reference) 3. S=5 4. SaTScan (space-time permutation test) Scenario 2 Scenario 3 Small departures Large departures

  14. Simulation results Reference: S = 2 s-vary S-vary S=5 SaTScan

  15. Summary: Key features of the model The comprehensive simulation study has shown some key features of our model: • Our model can detect various realistic departure patterns; • The performance is robust over different model settings; • Our model outperforms the popular SaTScan; • Our detection model works relatively well on sparse data.

  16. Burglary data in Cambridge • Geo-referenced offence records in Cambridgeshire (2001-2008) are made available by the Cambridgeshire Constabulary; • In this analysis, we focus on the burglary counts in Cambridge at the Lower Super Output Area (LSOA) level for each quarter from 2001 to 2002 (2584 reported burglary cases). • Numbers of houses were taken from the 2001 Census then aggregated to LSOA level (≈600 houses).

  17. Overall spatial/temporal pattern

  18. Detected LSOA (FDR=0.01)

  19. High risks and unusual LSOA

  20. Future work • We are currently working closely with the Cambridgeshire Police to assess effectiveness of possible policing schemes; • The framework can be extended to a prospective surveillance system by applying the detection model sequentially to observed data; • Incorporation of time-varying covariates (e.g., unemployment from surveys) can enrich the detection analysis.

  21. Summary • We have proposed a Bayesian mixture model for detecting unusual time trends; • The extensive simulation study has shown the superior performance of the model in detecting various “real” departures; • Applying the model to the offence data can assist/inform policy making (by identifying abrupt changes) and help to assess policy.

  22. Acknowledgement • Funded by ESRC • The BIAS project (PI Nicky Best), based at Imperial College London, is a node of the Economic and Social Research Council’s National Centre for Research Methods (NCRM) • The offences data are kindly provided by the Cambridgeshire Constabulary.

  23. References SaTScan • Kulldorff M, Heffernan R, Hartman J, Assunção RM, Mostashari F. A space-time permutation scan statistic for the early detection of disease outbreaks. PLoS Medicine, 2:216-224, 2005. False discovery rate (FDR) • Storey J. A direct approach to false discovery rates. JRSS(B), 64: 479-498, 2002. • Newton M, Noueiry A, Sarkar D, Ahlquist P. Detecting differential gene expression with a semiparametric hierarchical mixture method. Biostatistics, 5:155-176, 2004. Crime • Bennett T. and Durie L. Preventing residential burglary in Cambridge: From crime audits to targeted strategies. Police research series Paper 108, 1998.

More Related