Module 6: Q-Matrix

What is the Q-Matrix?

A table
Where rows are items
And columns are skills
Tatsuoka (1983) B. Barnes (2005)
Also called a KC [knowledge component] Model
Or a skill-item mapping

What is the Q-Matrix? Tatsuoka (1983) B. Barnes (2005)

Example

How do we get a skill-item mapping?

Automatic model discovery
Hand-development and refinement
Hybrid approaches

How do we get a skill-item mapping?

Automatic model discovery
Hand-development and refinement
Hybrid approaches

Automated Model Discovery

Learn the mapping between items and skills solely from data

Initial algorithm

Hill-climbing based method T. Barnes, Bitzer, and Vouk (2005)

More common approach lately

Non-negative matrix factorization
Desmarais (2012)
Can be combined!
Picones et al. (2022)

First question

How many skills should we use?

This is determined empirically
1. Try 1 skill
2. Try 1 more skill than previous model (e.g. 2,3,4,5…)
3. Does the new model do better than the previous model?
  If so, go to step 2.
  If not, quit and use the previous model.

How do we know if one Q-matrix is better than another

Several definitions

Barnes et al.’s definition

Better models have the property that if a student knows skill X
And item 1 and item 2 both have skill X
Then a student who gets item 1 right will be more likely to get item 2 right
- And item 1 wrong → item 2 wrong
- And item 2 right → item 1 right
- And item 2 wrong → item 1 wrong

Barnes et al.’s definition

Given a skill-item mapping, you can predict, for each combination of skills whether a student should get each item correct or not
A model’s degree of error is based on how many item-student pairs the prediction gets wrong

And forward from there

You can compare models with different numbers of skills using BIC or AIC or cross-validation Effenberger, Pelánek, and Čechák (2020)

Subtlety

Is skill conjunctive? (as in Barnes)
- You need all relevant skills to get an item right
Or is it compensatory? Beck and Heffernan (2008)
- Any relevant skill leads to getting an item right

Assumption

Barnes’s approach and NNMF (and most approaches to q-matrix discovery) assume no learning

Alternate Test of Model Goodness

Look at student improvement over time
Fit a model like PFA or BKT from Week 4, and see how well it fits data, given the skill-item mapping
- No point to doing this with DKT-family, since they either skip or fit their own q-matrix!
Liu and Koedinger (2017), Effenberger, Pelánek, and Čechák (2020), Picones et al. (2022) give examples

How do we get a Q-Matrix?

Automatic model discovery
Hand-development and refinement
Hybrid approaches

Tool for doing this

Pittsburgh Science of Learning Center DataShop

https://pslcdatashop.web.cmu.edu/

Learning Curve In Brief

Shows relationship between amount of practice and performance

Spikes in learning curves…

Often imply two (or more) skills are being treated as a single skill

Spikes in learning curves…

Often imply two (or more) skills are being treated as a single skill

Spikes in learning curves…

Often imply two (or more) skills are being treated as a single skill

Example

Possible to look at learning curves for different skill models
(we will discuss this more in a future lecture)

You can inspect curves for individual skills

Also look for problems with unexpected error rates

DataShop can apply model for you!

Applies a mathematical model called LFA (similar to PFA) to data
Can give AIC and BIC goodness measures for different skill-item mappings

References

Barnes, Barry. 2005. “Practice as Collective Action.” In The Practice Turn in Contemporary Theory, 25–36. Routledge.

Barnes, Tiffany, Donald Bitzer, and Mladen Vouk. 2005. “Experimental Analysis of the q-Matrix Method in Knowledge Discovery.” In International Symposium on Methodologies for Intelligent Systems, 603–11. Springer.

Beck, Zachary A Pardos–Joseph E, and Carolina Ruiz–Neil T Heffernan. 2008. “The Composition Effect: Conjunctive or Compensatory? An Analysis of Multi-Skill Math Questions in ITS.” Educational Data Mining 2008, 147.

Desmarais, Michel C. 2012. “Mapping Question Items to Skills with Non-Negative Matrix Factorization.” ACM Sigkdd Explorations Newsletter 13 (2): 30–36.

Effenberger, Tomáš, Radek Pelánek, and Jaroslav Čechák. 2020. “Exploration of the Robustness and Generalizability of the Additive Factors Model.” In Proceedings of the Tenth International Conference on Learning Analytics & Knowledge, 472–79.

Liu, Ran, and Kenneth R Koedinger. 2017. “Going Beyond Better Data Prediction to Create Explanatory Models of Educational Data.” The Handbook of Learning Analytics 1: 69–76.

Picones, Gio, Benjamin PaaBen, Irena Koprinska, and Kalina Yacef. 2022. “Combining Domain Modelling and Student Modelling Techniques in a Single Automated Pipeline.” International Educational Data Mining Society.

Tatsuoka, Kikumi K. 1983. “Rule Space: An Approach for Dealing with Misconceptions Based on Item Response Theory.” Journal of Educational Measurement, 345–54.