- Overview
- Instructors
- Schedule
Course Description
This 10-day course provides a comprehensive introduction to Generalised Linear Models (GLMs) using the R programming language. GLMs are a powerful extension of linear models that allow for response variables with error distributions other than the normal distribution, such as binary, count, or proportion data. Through a mix of lectures and practical exercises, this course introduces GLM theory and walks participants through data preparation, model fitting, diagnostics, interpretation, and visualisation. Each session focuses on a key GLM type, such as logistic regression or Poisson regression, using ecological datasets. No prior experience with GLMs is required, but basic familiarity with R and linear models is recommended. The course is suited to researchers, analysts, and students who want to build a strong foundation in applied statistical modelling.
What You’ll Learn
- The theoretical foundations of Generalised Linear Models (GLMs),
- How to choose appropriate error structures and link functions,
- How to fit logistic, Poisson, and other GLMs in R,
- Techniques for model selection and checking model assumptions,
- How to interpret model coefficients and assess goodness of fit,
- Visualising and communicating model results effectively,
- Common pitfalls and how to avoid them when using GLMs,
- How to apply GLMs to real-world problems.
Course Format
Interactive Learning Format
Each day features a well-balanced combination of lectures and hands-on practical exercises, with dedicated time for discussing participants’ own data, time permitting.
Global Accessibility
All live sessions are recorded and made available on the same day, ensuring accessibility for participants across different time zones.
Collaborative Discussions
Open discussion sessions provide an opportunity for participants to explore specific research questions and engage with instructors and peers.
Comprehensive Course Materials
All code, datasets, and presentation slides used during the course will be shared with participants by the instructor.
Personalized Data Engagement
Participants are encouraged to bring their own data for discussion and practical application during the course.
Post-Course Support
Participants will receive continued support via email for 30 days following the course, along with on-demand access to session recordings for the same period.
Who Should Attend / Intended Audiences
This course is intended for ecologists, data analysts, postgraduate students, and early-career researchers who have a basic background in using R and RStudio, such as importing data and running simple functions. Participants are expected to have a foundational understanding of statistics, including concepts like mean, variance, correlation, and linear regression. While prior experience with linear models is helpful, it is not essential, as key concepts will be reviewed during the course. Additional experience with data wrangling using packages like dplyr or tidyr, basic plotting with ggplot2, and interpreting model output is not required but would be beneficial.
Equipment and Software requirements
A laptop or desktop computer with a functioning installation of R and RStudio is required. Both R and RStudio are free, open-source programs compatible with Windows, macOS, and Linux systems.
A working webcam is recommended to support interactive elements of the course. We encourage participants to keep their cameras on during live Zoom sessions to foster a more engaging and collaborative environment.
While not essential, using a large monitor—or ideally a dual-monitor setup—can significantly enhance your learning experience by allowing you to view course materials and work in R simultaneously.
All necessary R packages will be introduced and installed during the workshop. A comprehensive list of required packages will also be shared with participants ahead of the course to allow for optional pre-installation.
Dr. Niamh Mimnagh
Niamh is a statistician working at the interface of ecology, epidemiology, and data science. Her research focuses on applying and developing statistical and machine learning methods to address real-world challenges such as estimating species population sizes from count and trace data and predicting livestock disease re-emergence using sparse or imbalanced datasets. She works with a wide array of statistical approaches, including Bayesian hierarchical models, N-mixture models, anomaly detection algorithms, and spatial analysis techniques.
Niamh earned her PhD in Statistics, with a focus on multispecies abundance modelling, and holds a first-class MSc in Data Science. Alongside her research, she is actively engaged in science communication and education, running a popular blog on applied statistics for non-specialists, and regularly delivering workshops and guest lectures on topics such as GLMs and machine learning with imbalanced data.
Education & Career
- PhD in Statistics (Multispecies Abundance Modelling)
- MSc in Data Science (First Class Honours)
- Instructor, consultant, and science communicator in statistical ecology and epidemiology
Research Focus
Niamh’s work centres on extracting meaningful insights from complex ecological and epidemiological data. She is particularly interested in population estimation techniques and predictive modelling for conservation and disease management, using advanced statistical tools and reproducible workflows.
Current Projects
- Development of Bayesian and ML approaches for estimating species abundance from imperfect data
- Modelling livestock disease risk using spatial and temporal predictors
- Creating accessible educational materials for teaching applied statistics in R
Professional Consultancy
Niamh provides expert statistical support to academic and applied research projects, with a focus on ecological monitoring, conservation planning, and disease modelling. She also advises on study design and data workflows for interdisciplinary teams.
Teaching & Skills
- Teaches topics including GLMs, Bayesian statistics, machine learning for imbalanced data, and spatial statistics in R
- Advocates for reproducibility, open science, and accessible statistical training
- Experienced in communicating complex methods to broad audiences
Links
Session 1- 01:15:00 – Introduction to Linear Models
Overview of the normal linear model as the foundation for GLMs; assumptions, model structure, categorical predictors, and limitations.
Session 2- 01:15:00 – From Linear Models to GLMs
Motivation for GLMs; overview of the exponential family, link functions, and the structure of GLMs.
Session 3- 01:15:00 – Introduction to GLMs in R
Using glm() in R; simulating and exploring simple GLM-compatible data; inspecting model objects and understanding output structure.
Session 4 – 01:15:00 – Binary Logistic Regression I
Theoretical background of logistic regression; fitting and interpreting binary outcome models.
Session 5 – 01:15:00 – Binary Logistic Regression II
Predicted probabilities, odds ratios, nested model comparison using likelihood ratio tests, and visualising predictions.
Session 6 – 01:15:00 – Assessing Model Fit
Diagnostics for binary logistic models: residuals, influence, and assessing linearity in the logit.
Session 7- 01:15:00 – Binomial Logistic Regression
Modelling grouped binary/proportion data; understanding weights, logit vs probit vs cloglog links.
Session 8- 01:15:00 – Multinomial Logistic Regression
Polychotomous outcomes and the multinomial logit model, interpreting output and visualising effects.
Session 9- 01:15:00 – Ordinal Logistic Regression
Ordered categorical outcomes, cumulative logit models, assumptions and interpretation.
Session 10 – 01:15:00 – Poisson Regression
Theoretical and practical introduction to Poisson regression; rate-based modelling and interpreting log link coefficients.
Session 11 – 01:15:00 – Poisson Regression in Practice
Offset terms, exposure variables, and visualising count model predictions.
Session 12 – 01:15:00 – Case Study: Poisson Modelling in Ecology
Worked example applying Poisson regression to real-world count data, including plotting and interpretation.
Session 13- 01:15:00 – Overdispersion and Quasi-Likelihood
Identifying and diagnosing overdispersion in Poisson and binomial models; fitting quasi-Poisson and quasi-binomial models.
Session 14 – 01:15:00 – Overdispersion Models
Fitting overdispersion models, when and why to use each approach.
Session 15 – 01:15:00 – Interpreting and Comparing Overdispersion Models
Model selection with AIC, BIC, and cross-validation; visual comparisons and residual diagnostics.
Session 16 – 01:15:00 – Zero-Inflated Models I
Understanding zero-inflation and the structure of ZIP and ZINB models.
Session 17 – 01:15:00 – Zero-Inflated Models II
Interpreting zero-inflated models, model diagnostics, and visualising predictions.
Session 18 – 01:15:00 – Hurdle Models and Alternatives
Distinguishing between hurdle and zero-inflated models; when to use each; fitting in R.
Session 19 – 01:15:00 – Data Wrangling and Preparation for GLMs
Long vs wide formats, dealing with missing data, preparing factors and numeric covariates, centering and scaling data.
Session 20 – 01:15:00 – Model Extensions: Interactions and Nonlinear Terms
Fitting interaction terms, polynomial and spline expansions, interpretation and plotting.
Session 21 – 01:15:00 – Visualising GLMs
Plotting fitted GLM effects, and confidence intervals.
Session 22 – 01:15:00 – Introduction to Bayesian GLMs
Overview of Bayesian inference, the role of prior distributions, and how posterior inference differs from frequentist approaches. Emphasis on the probabilistic interpretation of coefficients and model uncertainty in the context of GLMs.
Session 23 – 01:15:00 – Bayesian GLMs in Practice
Hands-on fitting of Bayesian logistic and Poisson regression models using R (e.g., with brms or rstanarm). Comparison of Bayesian and frequentist estimates, intervals, and interpretations. Introduction to posterior predictive checks and visualising uncertainty in model predictions.
Session 24 – 01:15:00 – Model Evaluation and Interpretation in Bayesian GLMs
Deeper exploration of how to evaluate and interpret Bayesian GLMs. Includes assessing model fit (e.g., using posterior predictive distributions, WAIC), interpreting posterior intervals and probabilities, and presenting Bayesian model results clearly. How Bayesian approaches inform decision-making and communication of uncertainty in applied contexts (e.g., ecology, epidemiology).
Session 25 – 01:15:00 – GLMs for Presence-Absence and Occupancy Data
Applying logistic regression to presence-absence and detection/non-detection data; interpreting coefficients in ecological terms; modelling site-level covariates and detection bias.
Session 26 – 01:15:00 – Count Models in Ecology
Poisson and negative binomial models for abundance and count data; modelling species richness, trap counts, or observation events.
Session 27 – 01:15:00 – Ecological Case Study: Species Distribution and Abundance Modelling
A guided walkthrough of real ecological data: fitting GLMs for species occurrence or abundance, selecting predictors (e.g. habitat, climate), and communicating model results with ecological interpretation and plots.
Session 28 – 01:15:00 – Mixed Effects GLMs (Intro to GLMMs)
Motivation for using random effects in GLMs; fitting random intercept models with lme4::glmer(), interpreting output, and diagnosing convergence.
Session 29 – 01:15:00 – GLMMs Continued: Nested and Crossed Designs
Random slopes and more complex structures; fixed vs random effects; challenges with non-convergence and overfitting.
Session 30 – 01:15:00 – Model Comparison, Best Practices, Wrap-Up and Further Directions
AIC, BIC, and likelihood-based model selection in GLMs and GLMMs; model simplification vs overfitting; common errors and how to avoid them. Course review, recommendations for further learning (e.g. GAMs, zero-inflated Bayesian models, Bayesian GLMMs); curated list of packages, vignettes, and readings.
Frequently asked questions
Everything you need to know about the product and billing.
When will I receive instructions on how to join?
You’ll receive an email on the Friday before the course begins, with full instructions on how to join via Zoom. Please ensure you have Zoom installed in advance.
Do I need administrator rights on my computer?
I’m attending the course live — will I also get access to the session recordings?
I can’t attend every live session — can I join some sessions live and catch up on others later?
I’m in a different time zone and plan to follow the course via recordings. When will these be available?
I can’t attend live — how can I ask questions?
Will I receive a certificate?
When will I receive instructions on how to join?
You’ll receive an email on the Friday before the course begins, with full instructions on how to join via Zoom. Please ensure you have Zoom installed in advance.
Do I need administrator rights on my computer?
I’m attending the course live — will I also get access to the session recordings?
I can’t attend every live session — can I join some sessions live and catch up on others later?
I’m in a different time zone and plan to follow the course via recordings. When will these be available?
I can’t attend live — how can I ask questions?
Will I receive a certificate?
Still have questions?
Can’t find the answer you’re looking for? Please chat to our friendly team.





5.0
