UnderstandIQ

Stop measuring what students get right.
Start measuring whether they truly understand.

The Core Insight

Every quiz platform asks: "Did you get it right?"

UnderstandIQ asks: "Do you know whether you got it right?"

That second question reveals something the first one never can — the Illusion of Understanding: the gap between how confident a learner feels and how well they actually know something. Cognitive science shows this gap predicts learning failure more reliably than accuracy alone.

See it happen in 15 seconds:

A learner answers with high confidence — then sees the calibration gap. That moment is what traditional assessment never captures.

The Problem with Traditional Assessment

Every quiz, exam, and AI tutoring tool answers one question:

"Did you get it right?"

But that question is broken. A student can guess correctly. A student can recall a definition without understanding it. A student can score 80% on a test and fail when the same concept appears in a different form.

UnderstandIQ asks a different question:

"Do you know whether you got it right?"

That second question — the metacognitive one — reveals something the first one never can: the Illusion of Understanding. The gap between how confident you feel and how well you actually know something is the strongest predictor of future learning failure.

What UnderstandIQ Does

Upload any learning material — a research paper, lecture notes, a textbook chapter, an article. UnderstandIQ generates four types of questions at surface, conceptual, and applied depth levels.

For each question, you answer, capture your reasoning, and rate your confidence — all before seeing the result.

The system then reveals:

Accuracy Score — What percentage you got right (with partial credit for open-ended answers)
Calibration Score — How well your confidence matched your actual performance
UnderstandIQ Score — The composite metric: you need both knowledge and self-awareness to score high
Cognitive Archetype — A psychologically grounded profile of how you think and learn

The result isn't a grade. It's a cognitive fingerprint — a precise map of where your understanding is solid, where it's brittle, and where confidence is masking a gap.

The Science Behind It

UnderstandIQ operationalizes three validated cognitive science constructs:

Confidence Calibration — Brier scores and calibration curves have long been used in forecasting and clinical psychology. UnderstandIQ adapts them to measure learner self-assessment accuracy.

Illusion of Understanding — Documented extensively in Dunning-Kruger research and Bjork's work on desirable difficulties. High confidence + wrong answer = the most dangerous cognitive state in learning.

Cognitive Stability — From the HCMS framework: consistency of reasoning across repeated and varied exposures to the same concept.

Research Foundation: Built on the Human Cognition Measurement System (HCMS)
Preprint: DOI: 10.5281/zenodo.18269740
Muhammad Rayan Shahid — Independent AI Researcher, ByteBrilliance AI

Features

Feature	Description
📄 Document Upload	PDF, DOCX, or raw text paste
🧠 Four Question Types	MCQ, Short Answer, Application, and Explain-It — each probing a different cognitive layer
✍️ Reasoning Capture	Students explain their thinking per question, enabling pattern analysis beyond scores
📊 Confidence Calibration	Per-question confidence rating before results are shown
🎯 Calibration Gap Chart	Visualises where confidence diverges from actual performance
🔬 AI Cognitive Analysis	LLM-powered archetype detection, misconception identification, and deep insight generation
🏷️ Cognitive Archetypes	Named learning profiles: Calibrated Thinker, Knowledge Illusion Risk, Reflective Analyst, and more
📋 PDF Report	Full downloadable cognitive assessment report, generated in-memory
⬇ Zero Setup	Deployed and live — no installation needed

Four Question Types

Most assessment tools only ask MCQs. UnderstandIQ uses four types because each reveals something different about how a person thinks:

Type	What It Tests	Why It Matters
MCQ	Recall speed and recognition	Fast signal on factual knowledge
Short Answer	Articulation of understanding	Can you say it in your own words?
Application	Transfer thinking	Does knowledge survive a new context?
Explain-It	Depth of understanding	True understanding enables simplification

Open-ended answers receive partial credit based on conceptual overlap with the model answer, surfacing degrees of understanding rather than binary pass/fail.

Cognitive Archetypes

Archetype	Pattern
Calibrated Thinker	High accuracy, well-calibrated confidence
Confident Executor	Strong performance, confidence slightly ahead of knowledge
Reflective Analyst	Knows what they don't know — underconfident despite solid answers
Surface Memorizer	Strong recall, weaker conceptual depth
Knowledge Illusion Risk	High confidence despite significant gaps
Intuitive Guesser	Performs better than their reasoning suggests

UnderstandIQ Score Levels

Score	Level	What It Means
85–100	Calibrated Mastery	High accuracy + well-calibrated confidence
70–84	Solid Understanding	Good accuracy, minor calibration gaps
55–69	Surface Knowledge	Moderate accuracy but overconfidence detected
40–54	Knowledge Illusion	Significant gap between confidence and performance
0–39	Foundational Gap	Low accuracy with overconfidence — highest-risk state

Quick Start (Local)

git clone https://github.com/RayanAIX/understandiq
cd understandiq
pip install -r requirements.txt
cp .env.example .env
# Add your Groq API key to .env
streamlit run app.py

Get a free Groq API key at console.groq.com — generous free tier, extremely fast inference.

The Scoring Math

# Convert confidence (1-5 scale) to percentage
conf_pct = ((confidence - 1) / 4) * 100

# Performance percentage (1.0 for correct, partial credit for open-ended)
perf_pct = credit * 100

# Calibration gap per question
gap = abs(conf_pct - perf_pct)

# Calibration score
calibration = 100 - mean(all gaps)

# UnderstandIQ composite
understandiq = (accuracy * 0.5) + (calibration * 0.5)

Overconfidence flagged when: confidence ≥ 4 AND performance < 40%
Underconfidence flagged when: confidence ≤ 2 AND performance > 60%

Technical Stack

Frontend: Streamlit (Python)
AI: Groq API (LLaMA 3.3 70B) — question generation and cognitive analysis
Document Parsing: pdfplumber, python-docx
Visualization: Plotly (dark theme)
PDF Export: fpdf2 (in-memory, no disk write)
Deployment: Streamlit Community Cloud

Author

Muhammad Rayan Shahid
Independent AI Researcher

Website · GitHub · LinkedIn

"Correctness is easy to fake. Understanding isn't."

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UnderstandIQ

The Core Insight

The Problem with Traditional Assessment

What UnderstandIQ Does

The Science Behind It

Features

Four Question Types

Cognitive Archetypes

UnderstandIQ Score Levels

Quick Start (Local)

The Scoring Math

Technical Stack

Author

About

Releases

Packages

Contributors

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
core		core
ui		ui
utils		utils
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
understandiq.gif		understandiq.gif

Folders and files

Latest commit

History

Repository files navigation

UnderstandIQ

The Core Insight

The Problem with Traditional Assessment

What UnderstandIQ Does

The Science Behind It

Features

Four Question Types

Cognitive Archetypes

UnderstandIQ Score Levels

Quick Start (Local)

The Scoring Math

Technical Stack

Author

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages

Contributors

Languages