Advanced Quantitative Toolkit - SPARK: a centre for social research innovation

Journal article introducing Bayesian multivariate modelling

Trying to wrap your head around how to apply Bayesian reasoning to multivariate modelling? This article illustrates how these concepts intersect, with practical steps for applying it to your own research.

Method article for Bayesian multilevel modelling

This article offers a step-by-step guide on how to conduct a multilevel analysis, with an accompanying R script and dataset that make it possible to practice the concepts outlined.

Introduction to hierarchical models

As a social scientist, you may deal with context-dependent clustered data. This chapter will showcase one statistical model to account for the relationships between observations in this type of data.

Bayesian hierarchical models

Working with clustered data and unsure how to account for within-group variability? This lecture introduces Bayesian hierarchical models as a way to model the relationship between an outcome and predictors for clustered data.

Video lecture on the Bayesian approach to generalised linear models

Review the general framework for GLMs, then learn how Bayesian modelling can improve estimates about the relationships between variables in your data.

The linear regression model

This rigorous discussion of Bayesian modelling is grounded in an approachable example.

Blog post explaining Bayesian linear regression

Learn how to apply Bayesian reasoning to estimate linear model parameters.

Bayesian reasoning for qualitative social science

Sometimes frequentists statistics’ assumptions don’t apply to qualitative data- this video explains the advantages of Bayesian statistics in these cases.

Discriminant analysis

Unsure about how to solve classification problems? Learn more about the statistical models used in LDA and QDA.

Video explaining linear and quadratic discriminant analyses

Learn how to conceptualise LDA and QDA within the context of Gaussian Maximum Likelihood Classification methods.

Logistic regression and discriminant analysis

Step-by-step guides for discriminant analysis and logistic regression procedures

Video tutorial showcasing how to select a time series ARIMA model in R

Model selection can sometimes be more of an art than a precise science, and this video tutorial illustrates this by walking through an example in R.

Video tutorial describing common time series and ARIMA models

This model allows you to get an accurate estimate of how a certain variable changes over time, by factoring in past observations of the same variable.

Brief guide on time series models

This concise explanation of time series models is a quick guide or refresher on when to use which.

Generalized Additive models for location scale and shape (GAMLSS) in R

This article offers a primer on GAMLSS, as a robust model for data with a distribution which deviates from the normal because of its shape (and hence violates the assumption of other commonly used models like GLMs). The authors provide examples in R using 4 different examples.

GAMLSS: A distributional regression approach

Trying to go beyond the mean with your regression model? When neither GLM nor GAM provide an adequate picture, GAMLSS might be the place to start.

Video lecture introducing general additive models

You may know about GAMs, but do you know how to make one in R? Watch this code-snippet-infused lecture and learn how to make your own model.

Limited dependent variable models

Are you unsure which model to use for your truncated data? This lecture may give you a better sense of what each model does and when to use which.

GLM, GAM and more

Sometimes a linear model just doesn't cut it- in these cases, we can extend the model to deal with cases where data violates important assumptions such as linearity.

The simple linear regression model

If you’re unsure of when to use a simple linear regression model or how to interpret one, this tutorial explains the basics with a simple example.

Introduction to GLMs

urious about what makes up a linear regression model? This tutorial provides a simple introduction to generalized linear models and the structure of the most common types.

Article discussing multilevel models

Article providing an accessible and detailed introduction to multilevel models, with examples.

Secondary analysis of national survey datasets

Short article discussing methodological challenges with secondary data analysis.

A call for qualitative power analyses

Choosing an appropriate and well-powered sample is crucial for effective research. This paper explores best practices for qualitative research.

A typology of mixed methods sampling designs in social science research

How do you develop sampling designs in mixed methods research? This paper presents a framework for researchers.

Validity and reliability in Qualitative research (6 strategies to increase validity)

How do you make sure your study is valid and reliable? This video provides guidelines for researchers.

Questionnaire data preparation in R

R is a powerful and efficient tool for survey question analysis- this tutorial explains how to harness it for your own research.

Writing effective survey questions and questionnaires

How we frame research questions dictates the types of conclusions we can make from our data. This resource explains key considerations for survey question design.

Article on survey methodology

This article offers guidance on the survey process, with a section on designing questions that are best suited for rigorous analysis.

Difference-in-Difference Estimation

This brief overview of DiD explains the method’s assumption, strengths and limitations, and best practices.

Difference-in-Differences

This comprehensive guide to DiD is organised into the sequence of steps needed to get an estimate of an effect

Book Chapter on Difference-in-Differences

This resource starts from the basics, providing a simple introduction to DiD methods.

Video Lecture about Differences in Differences

This resource is an overview on Differences in Differences methods, including treatment effects estimation and region-based methods.

Advanced Difference-in-Differences

Starting with a review of the canonical model, this resource advances to the deviations from the model that researchers use currently.

Time-Series Cross-Section Methods

Learn about time-series cross-section methods in this approachable book chapter.

Time-Series Cross-Section

An introduction to TSCS models that explores its advantages. This chapter also describes methods for minimizing errors and generate more robust conclusions.

Issues Associated with Some Popular TSCS Modelling Practices

Learn about the complexities of TSCS data analysis.

Journal Article Exploring TSCS Data Analysis for Social Science

TSCS modelling is an iterative process- while there is no single way to analyse this type of data, this article provides strategies to model its dynamic and cross-sectional components.

Panel Data Models

Learn how to choose the best model for your panel data with this thorough introduction to panel data modelling.

Panel Data

This resource explains how to estimate causal effects in longitudinal data using OLS and fixed effects models.

Introduction to Classification and Regression Trees

Unsure how to apply the theory of tree partitioning to social science examples? This tutorial is an accessible tutorial.

Model-Based Recursive Partitioning in the Social Sciences

Machine learning methods have a lot to offer for social science research- this article introduces basic principles of tree based partitioning and explains key terminology.

Tree-Based Methods

Unsure how decision trees, root nodes and leaves can help you learn from your data? This resource explains decision trees and partitioning with plenty of visual aids.

Journal article on argument schemes and visualisation

How to we visualise the process of argument logic? This journal introduces AVIZE, a diagramming tool for argument schemes.

How to store EDFS Data Triple stores

How do we use RDF triples to carry out efficient SPARQL queries? This video breaks down four methods.

Telling stories about and with urban data and dashboards with Rob Kitchin

How do we make sense of massive amounts of population data? Data dashboards use large amounts of data to convey information at a glance.

Databases

How do you manage large amounts of information as a researcher? This textbook chapter introduces database management systems and how to use them.

A talk about white supremacy and how it has shaped statistical norms.

How do we use mathematics to study people, and what assumptions are baked into our methods? This talk challenges researchers to thoughtfully consider the statistical methods they follow.

Quantitative content analysis and the measurement of collective identity

An introduction to content analysis and its application to the largely implicit concept of identity.

Journal article with an overview of multilevel analysis

Article providing an in-depth overview of multilevel analysis, with examples

Secondary data analysis: a method of which the time has come

Short article presenting a systematic process for using secondary data to conduct research

Experimental design in the social sciences

What type of experiment can justify causal claims? This video presents guidelines on experimental design

Book Chapter on Multidimensional Scaling

This book chapter provides a conceptual introduction to MDS.

Video on Multidimensional Scaling

How to you conduct a multidimensional scaling analysis in R? The idea of multidimensional scaling is explained using intuitive examples.

Introduction to Multidimensional Scaling

How do you quantify the similarity among items in your data? This paper offers an overview of multidimensional scaling, a technique that allows you to visually represent similarity in your data.

Data Theory and Dimensional Analysis: Introduction

What scaling technique is best suited to your data? These book chapters provide a concise definition of data theory, followed by a discussion of scaling techniques.

Multiple Correspondence Analysis – Introduction

What can geometric data analysis offer social scientists? The book chapter provides a nontechnical introduction to this method as a way to link geometric representations to statistical interpretations.

The Essentials of Correspondence Analysis: A Simple Example

What are the conceptual underpinnings of correspondence analysis? This guide provides a thorough answer to this question by using an example on crime statistics.

Principal Component Analysis Tutorial in R

Unsure how to translate your conceptual knowledge about PCA to practical data analysis skills? This resource breaks down PCA in R in a beginner-friendly way.

Principal Components Analysis

This resource provides a comprehensive introduction to principal components analysis with relevant examples such as crime statistics and determinants of economic expenditures.

Factor Analysis

This video explains the power of factor analysis as a way to reveal the underlying basic factors from a more complicated set of variables.

Paper Comparing PCA and Factor Analysis

This article makes it clear the differences between descriptive methods like PCA and modelling methods like factor analysis; both should be used for the most robust research outcomes.

Multidimensional Item Response Theory Diagnostics and Evaluation

Unsure how to evaluate your MIRT model? This book chapter explains how to diagnose these models and how to choose the best one.

Summated Rating Scale Construction

A thorough guide on summated rating scales based on classical test theory.

Machine Learning for Social Science: An Agnostic Approach

What distinguishes the social science approach to machine learning (ML) from the rest of ML? This review encourages readers to apply conventional ML techniques to data collection and causal inferences.

Keynote: Machine Learning for Social Science

What can computer science offer social science researchers? This keynote from a computational social scientist describes how machine learning modelling is used to uncover the latent structure in count data.

An Introduction to Bag of Words (BoW)

How can machine learning move you from unstructured text to well-defined vectors? The bag of words technique allows you to convert text into the numerical data which machine learning models rely on.

Exploring Feature Extraction Techniques for Natural Language Processing

Feature extraction is a technique that encompasses several techniques. This blog post provides a summary and evaluation of the most popular ones.

Naive Bayes, Text Classification, and Sentiment

Unsure how sentiment analysis works, or how to apply it? This book chapter provides a conceptual and practical introduction to sentiment analysis.

Dictionary-Based Text Analysis

Learn how to tailor apply word-count techniques to a pre-defined group of words in R.

Text Analysis Basics

This video introduces quantitaive text analysis, including character encoding, GREP and cleaning text data in R.

An Introduction to Topic Modeling

This video explains topic modelling with an analysis of scientific journal abstracts.

Structural Topic Modeling for Social Scientists

What is topic modelling and how can it enhance your language data analysis toolkit? This article introduces topic modelling, exploring its applications for social science research.

Natural language processing for the social sciences and humanities

What do natural language processing (NLP) methods have to offer? This talk explores the intersection between NLP and two questions in social science.

Regression discontinuity designs

How do you minimise selection bias when estimating treatment effects? Regression discontinuity designs allow you to leverage quasi-randomisation by looking at data points around a threshold.

Regression discontinuity

How do you estimate treatment effects of a continuous variable, without selection bias? Regression discontinuity has emerged as a powerful experimental design for this purpose.

Conjoint survey experiments

This book chapter is an introduction to conjoint analyses, with theory grounded in political science examples.

Video talk explaining conjoint experiments

How do people choose between options that vary in multiple ways? Conjoint experiments and analysis are well suited to ask these questions, where people rate multidimensional options.

Parametric and nonparametric: demystifying the terms

Understanding the nature of your data, and whether it is parametric or nonparametric, will dictate the types of inferential statistics you can do with your data. This guide provides a simple explanation.

Estimating a multilevel model with complex survey data

An article providing an in-depth overview of multilevel analysis, with applied examples using the the TIMSS dataset.

Secondary analysis of national survey datasets

A short article discussing biases that arise with secondary analysis of large national survey datasets and related considerations.

Secondary data analysis: a method of which the time has come

A detailed article presenting a simple process for using secondary data to answer research questions.

Resource Listing