Resource Listing
Journal article introducing Bayesian multivariate modelling
Trying to wrap your head around how to apply Bayesian reasoning to multivariate modelling? This article illustrates how these concepts intersect, with practical steps for applying it to your own research.
Method article for Bayesian multilevel modelling
This article offers a step-by-step guide on how to conduct a multilevel analysis, with an accompanying R script and dataset that make it possible to practice the concepts outlined.
Introduction to hierarchical models
As a social scientist, you may deal with context-dependent clustered data. This chapter will showcase one statistical model to account for the relationships between observations in this type of data.
Bayesian hierarchical models
Working with clustered data and unsure how to account for within-group variability? This lecture introduces Bayesian hierarchical models as a way to model the relationship between an outcome and predictors for clustered data.
Video lecture on the Bayesian approach to generalised linear models
Review the general framework for GLMs, then learn how Bayesian modelling can improve estimates about the relationships between variables in your data.
The linear regression model
This rigorous discussion of Bayesian modelling is grounded in an approachable example.
Blog post explaining Bayesian linear regression
Learn how to apply Bayesian reasoning to estimate linear model parameters.
Bayesian reasoning for qualitative social science
Sometimes frequentists statistics’ assumptions don’t apply to qualitative data- this video explains the advantages of Bayesian statistics in these cases.
Discriminant analysis
Unsure about how to solve classification problems? Learn more about the statistical models used in LDA and QDA.
Video explaining linear and quadratic discriminant analyses
Learn how to conceptualise LDA and QDA within the context of Gaussian Maximum Likelihood Classification methods.
Logistic regression and discriminant analysis
Step-by-step guides for discriminant analysis and logistic regression procedures
Video tutorial showcasing how to select a time series ARIMA model in R
Model selection can sometimes be more of an art than a precise science, and this video tutorial illustrates this by walking through an example in R.
Video tutorial describing common time series and ARIMA models
This model allows you to get an accurate estimate of how a certain variable changes over time, by factoring in past observations of the same variable.
Brief guide on time series models
This concise explanation of time series models is a quick guide or refresher on when to use which.
Generalized Additive models for location scale and shape (GAMLSS) in R
This article offers a primer on GAMLSS, as a robust model for data with a distribution which deviates from the normal because of its shape (and hence violates the assumption of other commonly used models like GLMs). The authors provide examples in R using 4 different examples.
GAMLSS: A distributional regression approach
Trying to go beyond the mean with your regression model? When neither GLM nor GAM provide an adequate picture, GAMLSS might be the place to start.
Video lecture introducing general additive models
You may know about GAMs, but do you know how to make one in R? Watch this code-snippet-infused lecture and learn how to make your own model.
Limited dependent variable models
Are you unsure which model to use for your truncated data? This lecture may give you a better sense of what each model does and when to use which.
GLM, GAM and more
Sometimes a linear model just doesn't cut it- in these cases, we can extend the model to deal with cases where data violates important assumptions such as linearity.
The simple linear regression model
If you’re unsure of when to use a simple linear regression model or how to interpret one, this tutorial explains the basics with a simple example.
Introduction to GLMs
urious about what makes up a linear regression model? This tutorial provides a simple introduction to generalized linear models and the structure of the most common types.
Article discussing multilevel models
Article providing an accessible and detailed introduction to multilevel models, with examples.
Secondary analysis of national survey datasets
Short article discussing methodological challenges with secondary data analysis.
A call for qualitative power analyses
Choosing an appropriate and well-powered sample is crucial for effective research. This paper explores best practices for qualitative research.
A typology of mixed methods sampling designs in social science research
How do you develop sampling designs in mixed methods research? This paper presents a framework for researchers.
Validity and reliability in Qualitative research (6 strategies to increase validity)
How do you make sure your study is valid and reliable? This video provides guidelines for researchers.
Questionnaire data preparation in R
R is a powerful and efficient tool for survey question analysis- this tutorial explains how to harness it for your own research.
Writing effective survey questions and questionnaires
How we frame research questions dictates the types of conclusions we can make from our data. This resource explains key considerations for survey question design.
Article on survey methodology
This article offers guidance on the survey process, with a section on designing questions that are best suited for rigorous analysis.
Difference-in-Difference Estimation
This brief overview of DiD explains the method’s assumption, strengths and limitations, and best practices.
Difference-in-Differences
This comprehensive guide to DiD is organised into the sequence of steps needed to get an estimate of an effect
Book Chapter on Difference-in-Differences
This resource starts from the basics, providing a simple introduction to DiD methods.
Video Lecture about Differences in Differences
This resource is an overview on Differences in Differences methods, including treatment effects estimation and region-based methods.
Advanced Difference-in-Differences
Starting with a review of the canonical model, this resource advances to the deviations from the model that researchers use currently.
Time-Series Cross-Section Methods
Learn about time-series cross-section methods in this approachable book chapter.
Time-Series Cross-Section
An introduction to TSCS models that explores its advantages. This chapter also describes methods for minimizing errors and generate more robust conclusions.
Issues Associated with Some Popular TSCS Modelling Practices
Learn about the complexities of TSCS data analysis.
Journal Article Exploring TSCS Data Analysis for Social Science
TSCS modelling is an iterative process- while there is no single way to analyse this type of data, this article provides strategies to model its dynamic and cross-sectional components.
Panel Data Models
Learn how to choose the best model for your panel data with this thorough introduction to panel data modelling.
Panel Data
This resource explains how to estimate causal effects in longitudinal data using OLS and fixed effects models.
Introduction to Classification and Regression Trees
Unsure how to apply the theory of tree partitioning to social science examples? This tutorial is an accessible tutorial.
Model-Based Recursive Partitioning in the Social Sciences
Machine learning methods have a lot to offer for social science research- this article introduces basic principles of tree based partitioning and explains key terminology.
Tree-Based Methods
Unsure how decision trees, root nodes and leaves can help you learn from your data? This resource explains decision trees and partitioning with plenty of visual aids.
Journal article on argument schemes and visualisation
How to we visualise the process of argument logic? This journal introduces AVIZE, a diagramming tool for argument schemes.
How to store EDFS Data Triple stores
How do we use RDF triples to carry out efficient SPARQL queries? This video breaks down four methods.
Telling stories about and with urban data and dashboards with Rob Kitchin
How do we make sense of massive amounts of population data? Data dashboards use large amounts of data to convey information at a glance.
Databases
How do you manage large amounts of information as a researcher? This textbook chapter introduces database management systems and how to use them.
A talk about white supremacy and how it has shaped statistical norms.
How do we use mathematics to study people, and what assumptions are baked into our methods? This talk challenges researchers to thoughtfully consider the statistical methods they follow.
Quantitative content analysis and the measurement of collective identity
An introduction to content analysis and its application to the largely implicit concept of identity.
Journal article with an overview of multilevel analysis
Article providing an in-depth overview of multilevel analysis, with examples
Secondary data analysis: a method of which the time has come
Short article presenting a systematic process for using secondary data to conduct research
Experimental design in the social sciences
What type of experiment can justify causal claims? This video presents guidelines on experimental design
Book Chapter on Multidimensional Scaling
This book chapter provides a conceptual introduction to MDS.
Video on Multidimensional Scaling
How to you conduct a multidimensional scaling analysis in R? The idea of multidimensional scaling is explained using intuitive examples.
Introduction to Multidimensional Scaling
How do you quantify the similarity among items in your data? This paper offers an overview of multidimensional scaling, a technique that allows you to visually represent similarity in your data.
Data Theory and Dimensional Analysis: Introduction
What scaling technique is best suited to your data? These book chapters provide a concise definition of data theory, followed by a discussion of scaling techniques.
Multiple Correspondence Analysis – Introduction
What can geometric data analysis offer social scientists? The book chapter provides a nontechnical introduction to this method as a way to link geometric representations to statistical interpretations.
The Essentials of Correspondence Analysis: A Simple Example
What are the conceptual underpinnings of correspondence analysis? This guide provides a thorough answer to this question by using an example on crime statistics.
Principal Component Analysis Tutorial in R
Unsure how to translate your conceptual knowledge about PCA to practical data analysis skills? This resource breaks down PCA in R in a beginner-friendly way.
Principal Components Analysis
This resource provides a comprehensive introduction to principal components analysis with relevant examples such as crime statistics and determinants of economic expenditures.
Factor Analysis
This video explains the power of factor analysis as a way to reveal the underlying basic factors from a more complicated set of variables.
Paper Comparing PCA and Factor Analysis
This article makes it clear the differences between descriptive methods like PCA and modelling methods like factor analysis; both should be used for the most robust research outcomes.
Multidimensional Item Response Theory Diagnostics and Evaluation
Unsure how to evaluate your MIRT model? This book chapter explains how to diagnose these models and how to choose the best one.
Summated Rating Scale Construction
A thorough guide on summated rating scales based on classical test theory.
Machine Learning for Social Science: An Agnostic Approach
What distinguishes the social science approach to machine learning (ML) from the rest of ML? This review encourages readers to apply conventional ML techniques to data collection and causal inferences.
Keynote: Machine Learning for Social Science
What can computer science offer social science researchers? This keynote from a computational social scientist describes how machine learning modelling is used to uncover the latent structure in count data.
An Introduction to Bag of Words (BoW)
How can machine learning move you from unstructured text to well-defined vectors? The bag of words technique allows you to convert text into the numerical data which machine learning models rely on.
Exploring Feature Extraction Techniques for Natural Language Processing
Feature extraction is a technique that encompasses several techniques. This blog post provides a summary and evaluation of the most popular ones.
Naive Bayes, Text Classification, and Sentiment
Unsure how sentiment analysis works, or how to apply it? This book chapter provides a conceptual and practical introduction to sentiment analysis.
Dictionary-Based Text Analysis
Learn how to tailor apply word-count techniques to a pre-defined group of words in R.
Text Analysis Basics
This video introduces quantitaive text analysis, including character encoding, GREP and cleaning text data in R.
An Introduction to Topic Modeling
This video explains topic modelling with an analysis of scientific journal abstracts.
Structural Topic Modeling for Social Scientists
What is topic modelling and how can it enhance your language data analysis toolkit? This article introduces topic modelling, exploring its applications for social science research.
Natural language processing for the social sciences and humanities
What do natural language processing (NLP) methods have to offer? This talk explores the intersection between NLP and two questions in social science.
Regression discontinuity designs
How do you minimise selection bias when estimating treatment effects? Regression discontinuity designs allow you to leverage quasi-randomisation by looking at data points around a threshold.
Regression discontinuity
How do you estimate treatment effects of a continuous variable, without selection bias? Regression discontinuity has emerged as a powerful experimental design for this purpose.
Conjoint survey experiments
This book chapter is an introduction to conjoint analyses, with theory grounded in political science examples.
Video talk explaining conjoint experiments
How do people choose between options that vary in multiple ways? Conjoint experiments and analysis are well suited to ask these questions, where people rate multidimensional options.
Parametric and nonparametric: demystifying the terms
Understanding the nature of your data, and whether it is parametric or nonparametric, will dictate the types of inferential statistics you can do with your data. This guide provides a simple explanation.
Estimating a multilevel model with complex survey data
An article providing an in-depth overview of multilevel analysis, with applied examples using the the TIMSS dataset.
Secondary analysis of national survey datasets
A short article discussing biases that arise with secondary analysis of large national survey datasets and related considerations.
Secondary data analysis: a method of which the time has come
A detailed article presenting a simple process for using secondary data to answer research questions.