Research & Evidence

The Foundation

Cognitive Load Theory.

The bedrock of the Crucibl framework is John Sweller's Cognitive Load Theory, originally formalized in 1988 and developed across four decades of replication. The core finding: human working memory has hard capacity limits. Learning depends on managing the load — distinguishing intrinsic load (the irreducible difficulty of the material), extraneous load (the load imposed by poor instructional design), and germane load (the load that builds long-term schemas).

Unstructured AI assistance reliably increases extraneous load. Lepine et al. (2026) demonstrated that AI introduces approximately three times more extraneous cognitive load than the underlying task complexity itself when deployed without scaffolding. The Crucibl framework's job is to prevent that.

Foundational

Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257–285.

The original formulation of Cognitive Load Theory, establishing the working-memory framework that grounds all subsequent instructional-design research.

Foundational

Sweller, J., van Merriënboer, J. J. G., & Paas, F. G. W. C. (1998). Cognitive architecture and instructional design. Educational Psychology Review, 10(3), 251–296.

The integrative framework that connects cognitive architecture to instructional design choices — including the design principles Crucibl operationalizes.

The Empirical Signal

What recent randomized trials show.

Three recent trials supply the precise quantitative case for constraint-driven AI design. They are why Crucibl exists.

17%

Performance drop on independent assessment

In a randomized field experiment with approximately 1,000 high-school students, those given access to unconstrained GPT-4 during practice scored 17% worse than the control group on independent assessment once AI was removed. The harm was concentrated in skill transfer, not problem-solving accuracy with the tool present.

Bastani, H., et al. (2024). RCT, N≈1,000 students.

2.85×

Differential harm to novice learners

Lepine et al. (2026) demonstrated that unstructured AI interaction harms novice learners 2.85× more than advanced learners. The harm gradient runs in the opposite direction of where institutional pressure typically focuses adoption — early-stage students bear the greatest cost of poorly designed AI.

Lepine et al. (2026).

0.73–1.3 SD

Beats in-class active learning — not 1980s lecture

Kestin et al. (2025) ran the first published RCT testing scaffolded AI tutoring against today's pedagogical gold standard — in-class active learning, not lecture-and-textbook. Students using a purpose-built AI tutor with explicit guardrails outperformed the active-learning condition at an effect size of 0.73–1.3 SD, while spending less time on task (median 49 minutes with the AI vs. 60 in the active-learning class). The paper's authors note explicitly that generic chatbots allow students to bypass critical thinking; their AI succeeded only because it was “meticulously engineered with pedagogical best practices” — the constraint-design discipline Crucibl operationalizes at platform scale.

Kestin et al. (2025). Scientific Reports, 15:17458. Harvard PS2, N≈180.

The Active-Engagement Evidence

Doing has six times the effect of reading.

The complementary case for active engagement over passive content delivery comes from Carnegie Mellon's Open Learning Initiative. Koedinger, McLaughlin, Kim, Jia, and Bier (2015), analyzing learning data from more than 12,500 students across multiple courses, established the “doer effect”: practice activities embedded in the learning experience are associated with approximately six times the learning outcome of reading or watching alone. Subsequent work by the same research group (2016–2018) established the relationship as causal, not merely correlational, replicating across multiple courseware contexts.

The Crucibl scaffold-fade architecture — FULL → SHARED → LIGHT → NONE — is engineered to produce that doing-density while preserving the working-memory discipline Cognitive Load Theory requires. The Karpicke-grounded adaptive spaced retrieval-practice mechanism on the Riley v3 roadmap (Fall 2026) operationalizes the doer effect at the assessment-architecture layer: low-stakes, effortful, cross-contextually varied practice scheduled adaptively against each student's prior-session corpus engagement.

Foundational

Koedinger, K. R., McLaughlin, E. A., Kim, J., Jia, J. Z., & Bier, N. L. (2015). Learning is not a spectator sport: Doing is better than watching for learning from a MOOC. Proceedings of the Second ACM Conference on Learning @ Scale, 111–120.

Established the doer effect across more than 12,500 students: practice has approximately six times the learning outcome of reading or watching alone.

Causal replication

Koedinger, K. R., Kim, J., Jia, J. Z., McLaughlin, E. A., & Bier, N. L. (2018). Learning is not a spectator sport: Doer effect causally validated through learning engineering studies. Carnegie Mellon Open Learning Initiative.

Established the causal nature of the doer effect through learning-engineering replication studies — not merely correlational evidence.

The Crucibl Pilot

From framework to operational evidence.

The Crucibl methodology was first deployed at scale in the FIN 485 finance capstone at Ensign College, taught by Chris Wasden. The deployment implemented seven of the ten principles in their full form, with two more partially implemented and one held back for the Phase II platform release. Pre/post outcome differentials, capstone project quality scores, and instructor evaluation data are documented in the case study currently in peer review.

The FIN 485 deployment also surfaced the operational mechanics that the Crucibl platform now codifies: the FULL → SHARED → LIGHT → NONE scaffold curve, the Credit Analyst and PE Operating Partner persona configurations, the Investor Pitchday peer-market assessment, and the Model Comprehension Quiz AI-as-examiner protocol.

On May 5, 2026, a second course — FIN 345 (Financial Institutions) — launched as Crucibl's second production deployment at Ensign College. FIN 345 is a modular financial-institutions curriculum and tests the methodology's transferability across course structures: where FIN 485 is a project-based capstone, FIN 345 is a sequenced multi-module course. Outcome data from the first cohort will publish at semester's end and join the FIN 485 evidence base.

Crucibl case study

Wasden, C. (forthcoming 2026). AI-augmented finance capstone: A 10-principle framework deployment at Ensign College. Manuscript currently in peer review.

The case study that documents the Crucibl methodology in operation at Ensign College's FIN 485 finance capstone. Anchors the Phase I NSF SBIR research proposal as the empirical foundation. Expected publication late 2026.

Intellectual Property

Thirteen provisional patent applications pending.

Thirteen provisional patent applications filed with the USPTO covering the orchestration logic that distinguishes Crucibl from generic LLM-wrapper products and from research prototypes that lack the institutional-deployment infrastructure. The initial twelve provisionals were filed May 1, 2026; a thirteenth (Canonical Persona Library with Tunable Knob Layer and Runtime Composition) was added May 27, 2026 following live A/B verification of the per-course tuning architecture on Crucibl's production FIN 345 deployment.

The portfolio includes the multi-agent pedagogy architecture, the constraint-set enforcement mechanism, the audit-trail integrity protocol, diagnostic remediation, fading-scaffold audit, investor pitchday, regression-validity capstone scoring, psychographic synthesis, authority-gradient architecture, behavioral discipline library, iterative cohort feedback loop, chat debate advisor, three-way attributed feedback, pre-class teaching preparation generator, and approval-gated AI-to-AI modification mechanisms.

The patent strategy is defensive and licensable. Crucibl's commercialization model contemplates direct institutional licensing, foundation-funded pilots, and corporate enterprise contracts — and, contingent on award, an open-source release of the self-hosted deployment package developed under the planned NSF SBIR Phase I scope.

Planned Research

Phase I: offline-first deployment for underserved learners.

The current Crucibl platform requires continuous internet connectivity to commercial frontier-model APIs. That requirement excludes two of the highest-need higher-education populations: incarcerated learners in US correctional facilities (approximately 30,000+ Pell-eligible postsecondary students, growing 20–40% annually following 2023 Pell Grant restoration) and international learners in low-resource education contexts.

Crucibl's planned NSF SBIR Phase I proposal — currently in preparation, not yet submitted — would adapt the methodology and architecture for self-hosted deployment using open-weight large language models such as Llama 3.3 70B, Qwen 3 72B, and Mixtral 8x22B running on facility-scale hardware. The proposed pilot site is the Ensign College Prison Education Program, with the FIN 485 finance capstone as the deployed course. Discussions with Ensign about the partnership are active and the proposal is being developed jointly. None of this is funded yet.

Proposed Phase I deliverables include an open-source-released self-hosted deployment package, a pedagogical-fidelity characterization study comparing four open-weight models against the cloud-hosted reference, empirical pilot results from the proposed Ensign deployment, and an analysis of the technical adaptations required for international low-resource deployment. Crucibl is also pursuing a Utah Technology Innovation Funding (UTIF) microgrant to support Phase I proposal development; that application is pending.

"The strongest examples of AI use won't necessarily be the campuses running the flashiest pilots, but those that can demonstrate responsible and broad, consistent adoption aligned with real outcomes."

— Ryan Lufkin, VP Global Academic Strategy, Instructure (the company that makes Canvas) · Tech Outlook 2026

"Canvas delivered its agentic AI tool, though it is too early to evaluate impact due to cautious campus adoption."

— Phil Hill, On EdTech · April 2026 LMS-Market coverage

See target markets Pilot or partner with us

Grounded in three decades of Cognitive Load Theory.

Cognitive Load Theory.

Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257–285.

Sweller, J., van Merriënboer, J. J. G., & Paas, F. G. W. C. (1998). Cognitive architecture and instructional design. Educational Psychology Review, 10(3), 251–296.

What recent randomized trials show.

Performance drop on independent assessment

Differential harm to novice learners

Beats in-class active learning — not 1980s lecture

Doing has six times the effect of reading.

Koedinger, K. R., McLaughlin, E. A., Kim, J., Jia, J. Z., & Bier, N. L. (2015). Learning is not a spectator sport: Doing is better than watching for learning from a MOOC. Proceedings of the Second ACM Conference on Learning @ Scale, 111–120.

Koedinger, K. R., Kim, J., Jia, J. Z., McLaughlin, E. A., & Bier, N. L. (2018). Learning is not a spectator sport: Doer effect causally validated through learning engineering studies. Carnegie Mellon Open Learning Initiative.

From framework to operational evidence.

Wasden, C. (forthcoming 2026). AI-augmented finance capstone: A 10-principle framework deployment at Ensign College. Manuscript currently in peer review.

Thirteen provisional patent applications pending.

Phase I: offline-first deployment for underserved learners.