DATA SCIENCE FOR CRIME SCIENTISTS (ADVANCED CRIME ANALYSIS) 2018/2019

Repo for the UCL 3rd year UG module Advanced Crime Analysis (Data Science)


DATA SCIENCE FOR CRIME SCIENTISTS (ADVANCED CRIME ANALYSIS) 2018/2019

This is the companion website for the 2018-2019 module for 3rd year undergraduate students of the BSc in Crime Science at UCL.

Resources

The module handbook provides you with all information around assessment, learning outcomes, timetables, and a general overview of the module. Use the module handbook as your go-to guide throughout the module.

Week 1: INTRODUCTION

Suggested reading:

  • Williams, M. L., Burnap, P., & Sloan, L. (2017). Crime Sensing With Big Data: The Affordances and Limitations of Using Open-source Communications to Estimate Crime Patterns. The British Journal of Criminology, 57(2), 320–340. https://doi.org/10.1093/bjc/azw031

Tutorial:


Week 2: WEB SCRAPING 1

Required reading/preparation:

Suggested reading:

  • Solymosi, R., Bowers, K. J., & Fujiyama, T. (2018). Crowdsourcing Subjective Perceptions of Neighbourhood Disorder: Interpreting Bias in Open Data. The British Journal of Criminology, 58(4), 944–967. https://doi.org/10.1093/bjc/azx048
  • Founta, A.-M., Djouvas, C., Chatzakou, D., Leontiadis, I., Blackburn, J., Stringhini, G., … Kourtellis, N. (2018). Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior. ArXiv:1802.00393 [Cs]. Retrieved from http://arxiv.org/abs/1802.00393

Week 3: WEB SCRAPING 2

  • Lecture 3: Webscraping with R (slides), pdf
  • Homework: -

Required reading:

Suggested reading:

  • ElSherief, M., Kulkarni, V., Nguyen, D., Wang, W. Y., & Belding, E. (2018). Hate Lingo: A Target-based Linguistic Analysis of Hate Speech in Social Media. ArXiv:1804.04257 [Cs]. Retrieved from http://arxiv.org/abs/1804.04257

Tutorial: Webscraping and APIs in R, SOLUTIONS

Week 4: TEXT DATA 1

Required reading:

Suggested tutorials/reading:

Tutorial: -

Week 5: TEXT DATA 2

Required reading:

  • Kleinberg, B., Mozes, M., & Van der Vegt, I. (2018). Identifying the sentiment styles of YouTube’s vloggers. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 3581–3590. Retrieved from http://aclweb.org/anthology/D18-1394
  • Pérez-Rosas, V., Kleinberg, B., Lefevre, A., & Mihalcea, R. (2018). Automatic Detection of Fake News. In Proceedings of the 27th International Conference on Computational Linguistics (pp. 3391–3401). Santa Fe, New Mexico, USA: Association for Computational Linguistics. Retrieved from http://aclweb.org/anthology/C18-1287

Week 6: MACHINE LEARNING 1

Required reading:

  • http://www.r2d3.us/visual-intro-to-machine-learning-part-1/
  • Kuhn, M., & Johnson, K. (2013). Applied Predictive Modeling. New York: Springer-Verlag. Retrieved from https://www.springer.com/de/book/9781461468486
    • Chapter: “Introduction”
    • Chapter: “A Short Tour of the Predictive Modeling Process”

Recommended reading:

  • Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition (2nd ed.). New York: Springer-Verlag. Retrieved from https://www.springer.com/de/book/9780387848570
    • Chapter: “Overview of Supervised Learning”
    • Chapter: “Linear Methods for Classification”

Week 7: MACHINE LEARNING 2

Required reading:

  • Gatto, L. (n.d.). An Introduction to Machine Learning with R. Retrieved from https://lgatto.github.io/IntroMachineLearningWithR/unsupervised-learning.html
    • Chapter 4: Unsupervised learning

Recommended:

Week 8: PROMISES AND PROBLEMS

  • Lecture 8: Advances, promises and problems of data science for crime science (slides), (pdf)
  • No tutorial
  • Homework: peer-feedback + your project + revision

Required reading

  • Coveney, P. V., Dougherty, E. R., & Highfield, R. R. (2016). Big data need big theory too. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2080), 20160153. https://doi.org/10.1098/rsta.2016.0153

Recommended reading

  • Quijano-Sánchez, L., Liberatore, F., Camacho-Collados, J., & Camacho-Collados, M. (2018). Applying automatic text-based detection of deceptive language to police reports: Extracting behavioral patterns from a multi-step classification model to understand how we lie to the police. Knowledge-Based Systems, 149, 155–168. https://doi.org/10.1016/j.knosys.2018.03.010
  • Kadar, C., & Pletikosa, I. (2018). Mining large-scale human mobility data for long-term crime prediction. EPJ Data Science, 7(1), 26. https://doi.org/10.1140/epjds/s13688-018-0150-z
  • Burnap, P., & Williams, M. L. (2016). Us and them: identifying cyber hate on Twitter across multiple protected characteristics. EPJ Data Science, 5(1), 11. https://doi.org/10.1140/epjds/s13688-016-0072-6

Week 9: RECAP, CASE STUDIES, PEER-FEEDBACK

  • Lecture 9: Module recap, case studies (slides), (pdf)
  • Tutorial: project work.

Module convenor and author: Bennett Kleinberg (bennett.kleinberg@ucl.ac.uk)

Department of Security and Crime Science, UCL