A Tutorial on Small Language Models in the Era of Large Language Models: Architecture, Capabilities, and Trustworthiness (KDD'25)
Fali Wang1, Minhua Lin1, Yao Ma2, Hui Liu3, Qi He, Xianfeng Tang3, Jiliang Tang4, Jian Pei5, Suhang Wang1
1 The Pennsylvania State University
2 Rensselaer Polytechnic Institute
3 Amazon
4 Michigan State University
5 Duke University
Time: Aug 3, 2025 8:00 AM - 11:00 AM (EDT)
Location: [to-fill-later], Toronto, Canada
Abstract
Large language models (LLMs) based on the Transformer architecture are powerful but face challenges in deployment, inference latency, and costly fine-tuning. These limitations highlight the emerging potential of small language models (SLMs), which can either replace LLMs through innovative architectures and technologies or assist them as efficient proxy or reward models. Emerging architectures such as Mamba and xLSTM replace the Transformer's quadratic scaling of inference cost with context length with linear scaling. To maximize SLM performance, test-time compute scaling strategies narrow the gap with LLMs by allocating additional compute at inference. Beyond standalone use, SLMs can also assist LLMs via weak-to-strong learning, proxy tuning, and guard models, fostering secure and efficient LLM deployment. Finally, the trustworthiness of SLMs remains a critical yet underexplored research area. However, tutorials on cutting-edge SLM technologies are still scarce, which motivates this one.
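As a concrete illustration of one way SLMs can assist LLMs, the sketch below shows the logit arithmetic behind proxy tuning: a large base model's next-token logits are shifted by the difference between a small tuned expert and its untuned counterpart. This is a minimal sketch assuming the three checkpoints share a vocabulary; the model names, the greedy decoding, and the full-recompute loop are illustrative choices, not the tutorial's reference implementation.

```python
# Minimal proxy-tuning sketch (assumptions: shared vocabulary across checkpoints,
# illustrative model names, greedy decoding, no KV caching for brevity).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

LARGE_NAME = "meta-llama/Llama-2-13b-hf"        # assumed large base model
EXPERT_NAME = "meta-llama/Llama-2-7b-chat-hf"   # assumed small tuned expert
ANTI_NAME = "meta-llama/Llama-2-7b-hf"          # assumed small untuned anti-expert

tok = AutoTokenizer.from_pretrained(LARGE_NAME)
large = AutoModelForCausalLM.from_pretrained(LARGE_NAME).eval()
expert = AutoModelForCausalLM.from_pretrained(EXPERT_NAME).eval()
anti = AutoModelForCausalLM.from_pretrained(ANTI_NAME).eval()

@torch.no_grad()
def proxy_tuned_generate(prompt: str, max_new_tokens: int = 64) -> str:
    ids = tok(prompt, return_tensors="pt").input_ids
    for _ in range(max_new_tokens):
        # Next-token logits from each model on the (assumed shared) vocabulary.
        l_large = large(ids).logits[:, -1, :]
        l_expert = expert(ids).logits[:, -1, :]
        l_anti = anti(ids).logits[:, -1, :]
        # Proxy tuning: shift the large model's logits by the small models' tuning delta.
        steered = l_large + (l_expert - l_anti)
        next_id = steered.argmax(dim=-1, keepdim=True)  # greedy decoding for simplicity
        ids = torch.cat([ids, next_id], dim=-1)
        if next_id.item() == tok.eos_token_id:
            break
    return tok.decode(ids[0], skip_special_tokens=True)
```

The small pair thus acts as an efficient proxy for fine-tuning: only the 7B models need to be tuned, while the 13B model is used as-is at inference time.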
This tutorial covers recent progress in Small Language Models (SLMs) in the era of Large Language Models (LLMs), focusing on architecture, capabilities, and trustworthiness.
- LLM Foundations: Overview of recent LLM developments that support and inspire SLM design.
- SLM Architecture: Efficient architectures tailored for small-scale models, including Transformer variants and state-space models.
- From Weak to Strong: Techniques to boost SLM performance, including distillation, test-time scaling, retrieval augmentation, and agent collaboration (a minimal test-time scaling sketch follows this list).
- Trustworthiness: SLM robustness in adversarial settings, jailbreak resistance, fairness, and privacy.
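To give a flavor of the test-time scaling techniques covered in the weak-to-strong part, here is a minimal best-of-N sketch: the SLM spends extra inference compute by sampling several candidate answers, and an external scorer (for example, a reward model or a self-consistency vote) picks one. `generate_fn` and `score_fn` are hypothetical placeholders for illustration, not APIs from the tutorial.

```python
# Minimal best-of-N test-time scaling sketch; generate_fn and score_fn are
# assumed callables (e.g., an SLM sampler and a reward-model scorer).
from typing import Callable, List

def best_of_n(prompt: str,
              generate_fn: Callable[[str], str],
              score_fn: Callable[[str, str], float],
              n: int = 8) -> str:
    # Spend extra inference-time compute by drawing several candidates...
    candidates: List[str] = [generate_fn(prompt) for _ in range(n)]
    # ...and keep the answer the scorer prefers.
    return max(candidates, key=lambda ans: score_fn(prompt, ans))
```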
Schedule
[to-fill-later]
Slides
- Introduction [to-fill-the-link-later]
- Part I: SLM Architecture
- Part II: Weak to Strong Methods
- Part III: Trustworthiness
- Conclusion
Authors
Fali Wang, Ph.D. student, Informatics, PSU. His research focuses on the intersection of graphs and LLMs, and on small language models.
Website: https://FairyFali.github.io/
BibTeX
@article{wang2024comprehensive,
  title={A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness},
  author={Wang, Fali and Zhang, Zhiwei and Zhang, Xianren and Wu, Zongyu and Mo, Tzuhao and Lu, Qiuhao and Wang, Wanjing and Li, Rui and Xu, Junjie and Tang, Xianfeng and others},
  journal={arXiv preprint arXiv:2411.03350},
  year={2024}
}