Triton Archive - 每时AI

Meta's "Lightweight" KernelLLM Upends GPU Kernel Generation: 8B Parameters Beat GPT-4o

May 27, 2025, 16:00, by 新智元

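Meta has released KernelLLM, an 8B-parameter model fine-tuned from Llama 3.1 that generates efficient Triton GPU kernels from PyTorch code. In single-pass inference it outperforms GPT-4o and DeepSeek V3.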

May 24, 2025, 14:00, by GiantPandaCV

778199261291694
Editor | GiantPandaLLM
0x00 Preface
Going forward, I will gradually be posting some

May 15, 2025, 19:00, by GiantPandaCV

optim-algorithm-in-cuda/blob/master/large-language

February 22, 2025, 16:00, by 机器之心

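multiple articles, covering top laboratories at major universities and companies worldwide, which has effectively promoted academic exchange and dissemination. If you have outstanding work you would like to share,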

February 4, 2025, 19:00, by GiantPandaCV

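0x0. Preface
Happy New Year, everyone! I wish you happiness every day, success in your studies, and smooth sailing at work.
It was during the 2025 Lunar New Year that I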

January 24, 2025, 22:00, by GiantPandaCV

0x0. Preface
yifuwang, at https://github.com/yifuwang/sym

January 1, 2025, 14:00, by GiantPandaCV

Blog source: https://pytorch.org/blog/triton-kernel-compil