英伟达携手 Mistral AI 上月发布开源 Mistral NeMo 12B 模型,率高
就其规模而言,工作总结、Mistral-NeMo-Minitron 8B 在语言模型的九项流行基准测试中遥遥领先。具备精度高、
英伟达表示通过宽度剪枝(width-pruning)Mistral NeMo 12B,可在RTX工作站上部署" class="wp-image-675652"/>
参考
Lightweight Champ: NVIDIA Releases Small Language Model With State-of-the-Art Accuracy
Mistral-NeMo-Minitron 8B Foundation Model Delivers Unparalleled Accuracy
Compact Language Models via Pruning and Knowledge Distillation
效率高,相关成果发表在《Compact Language Models via Pruning and Knowledge Distillation》论文中。附上相关测试结果如下: