34 min agoMLX/OMLX/DMR with OpenCode/Hermes/Open WebUI with no manual configuration in one command - Harbor v0.5.034 min ago·reddit.com
2h agoIntroducing the Heretic Grimoire: The takedown-resilient, local-first backup system that keeps uncensored models available forever2h ago·reddit.com
2h agoBuilt a local AI assistant because I always knew this day would come, yesterday just made it feel very real2h ago·reddit.com
3h agoit sounds like Meta is abandoning in-house LLM development and reassigning those employees3h ago·reddit.com
4h agoXiaomi is now serving MiMo V2.5 at 1000-3000tps using DFlash & Persistent kernel. DFLash model is out, open-source release promised coming soon4h ago·reddit.com
7h agoDual DGX Sparks- 40tk/s single 1M ; 350 tk/s agg. - Deepseek V4 Flash (vs RTX Pro 6000 vs Mac M2 Ultra 192)7h ago·reddit.com
15h agoCodebase getting larger - Qwen3.6-27B starting to compound issues - how to work smartly with this model?15h ago·reddit.com
16h agoStoring an index to a scale instead of the scale itself with Q4_0 quant reduces scale size by ~31% (small gain but interesting)16h ago·reddit.com
16h agoI am losing my mind with FOMO and need some sanity checking about model capabilities16h ago·reddit.com
16h agoA single federal order switched off the best cloud model overnight. Clearest case for running local I've seen yet.16h ago·reddit.com