Biography


Roger Waleffe
I'm an Applied Deep Learning Research Scientist at NVIDIA, where I work on efficient large language model architectures, training, and inference.

I received my Ph.D. in computer science from the University of Wisconsin-Madison under the supervision of Prof. Theodoros (Theo) Rekatsinas (now at Apple). I also worked closely with Prof. Shivaram Venkataraman. My research focused on the intersection of systems and algorithmic challenges in I/O-aware and resource-efficient training of large-scale ML models. In particular, my dissertation studied Algorithms and Systems for Scalable Machine Learning over Graphs.

Before that, I was an undergraduate at UW-Madison where I studied Applied Mathematics, Engineering, and Physics (AMEP). I worked with Prof. Cary B. Forest at the Wisconsin Plasma Physics Laboratory.


Selected Publications


Language Models
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.
NVIDIA
2025.
An Empirical Study of Mamba-based Language Models.
Waleffe, R., Byeon, W., Riach, D., Norick, B., Korthikanti, V., Dao, T., Gu, A., Hatamizadeh, A., Singh, S., Narayanan, D., Kulshreshtha, G., Singh, V., Casper, J., Kautz, J., Shoeybi, M., Catanzaro, B.
2024.
Graphs
Armada: Memory-Efficient Distributed Training of Large-Scale Graph Neural Networks.
Waleffe, R., Sarda, D., Mohoney, J., Vlatakis-Gkaragkounis, E., Rekatsinas, T., Venkataraman, S.
2025.
MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks.
Waleffe, R., Mohoney, J., Rekatsinas, T., Venkataraman, S.
18th European Conference on Computer Systems (EuroSys ’23). 2023.
Demo of Marius: A System for Large-scale Graph Embeddings.
Xie, A., Carlsson, A., Mohoney, J., Waleffe, R., Peters, S., Rekatsinas, T., Venkataraman, S.
Proceedings of the VLDB Endowment, 14(12). 2021.
Marius: Learning Massive Graph Embeddings on a Single Machine.
Mohoney, J., Waleffe, R., Xu, Y., Rekatsinas, T., Venkataraman, S.
15th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’21). 2021.
Other CS
Chameleon: A Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models.
Jiang, W., Zeller, M., Waleffe, R., Hoefler, T., Alonso, G.
Proceedings of the VLDB Endowment, 18(1). 2024.
Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning.
Okanovic, P.*, Waleffe, R.*, Mageirakos, V., Nikolakakis, K. E., Karbasi, A., Kalogerias, D., Gürel, N. M., Rekatsinas, T.
The Twelfth International Conference on Learning Representations (ICLR). 2024.
*Equal contribution.
Principal Component Networks: Utilizing Low-Rank Activation Structure to Reduce Parameters Early in Training.
Waleffe, R., Rekatsinas, T.
ACM/IMS Journal of Data Science. 2023.

Education


Ph.D. - Computer Science
University of Wisconsin-Madison
Sep. 2019 - Dec. 2024
Overall GPA: 4.00/4.00
M.S. - Computer Science
University of Wisconsin-Madison
Sep. 2019 - May 2022
Overall GPA: 4.00/4.00
B.S. - Applied Mathematics, Engineering, and Physics (AMEP)
B.S. - Computer Science
University of Wisconsin-Madison
Sep. 2015 - May 2019
Overall GPA: 4.00/4.00

Selected Experience & Awards


UW-Madison CS Department Graduate Research Fellowship, 2019
Goldwater Scholarship, 2018
Software Development Engineer - Intern
Amazon
Summer 2018
Undergraduate Researcher and Engineer
Wisconsin Plasma Physics Lab at UW-Madison
Jan. 2016 - May 2018