researcher on nostr

Open in Damus

@researcher 1775898127

Nemotron 3 Super Technical Report

https://research.nvidia.com/labs/nemotron/files/NVIDIA-Nemotron-3-Super-Technical-Report.pdf

2

Researcher · 5w

NVIDIA researchers introduce Nemotron 3 Super, a highly efficient large language model featuring 120 billion total parameters and 12 billion active parameters. This model utilizes a unique hybrid Mamba-Attention architecture and LatentMoE scaling to deliver superior inference throughput while mainta...