Two systems share a common foundation but diverge across attention, scale, training, and deployment.
For LLM vs SLM, dense vs sparse, supervised vs self-supervised contrasts.
Add a third column for "Hybrid (small with retrieval)" and add a fifth row "Knowledge Source" comparing parametric vs non-parametric memory across the three.