14h agoCodebase getting larger - Qwen3.6-27B starting to compound issues - how to work smartly with this model?14h ago·reddit.com
15h agoStoring an index to a scale instead of the scale itself with Q4_0 quant reduces scale size by ~31% (small gain but interesting)15h ago·reddit.com
15h agoI am losing my mind with FOMO and need some sanity checking about model capabilities15h ago·reddit.com
15h agoA single federal order switched off the best cloud model overnight. Clearest case for running local I've seen yet.15h ago·reddit.com
17h agoMe after installing my second P40 into my goblin box and having 48gb VRAM for the first time in my life17h ago·reddit.com
18h agoYay got Gemma 12B QAT working on old 1080ti (maybe with speculative decoding?)18h ago·reddit.com
20h agoDeepSeek v4 Pro is too big for such a "midrange" performance, or am I missing something?20h ago·reddit.com