With the closure of the HuggingFace LLM leaderboard, and no access to powerful GPUs, I stopped running experiments. But with the flood of new Open Source models (Qwen, MiniMax, GLM, and more), and finally having just enough compute at home, I have started working on the current batch of LLMs. The heatmaps keep coming back with the same general story, but every architecture has its own neuroanatomy. The brains are different. The principle is the same. And some models are looking really interesting (Qwen3.5 27B in particular). I will release the code along with uploading new RYS models and a blog post once my Hopper-system finishes grinding on MiniMax M2.5.
17:47, 2 марта 2026Путешествия
。关于这个话题,wps提供了深入分析
酱酒春节成绩的相对亮眼引来行业的广泛关注,但相较于成绩本身,上述企业是如何取得这些成绩的更加值得关注。
Популярная российская блогерша пожаловалась на тяжелый развод и расплакалась20:49
В России были побиты два десятка метеорекордов за деньСиноптик Тишковец: На Русской равнине было установлено 20 тепловых рекордов