IBM Granite 4 Tiny Preview served from a local GGUF server

Granite 4 Tiny is an open-source LLM that supports a 128K-token context window. This demo is limited to a 2K-token context.
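
Because the demo's window is much smaller than the model's, the prompt and the generated reply have to share roughly 2K tokens. The sketch below shows one way to work out how many tokens are left for the reply. It assumes "2K" means 2048 tokens, and the function name `completion_budget` and its parameters are hypothetical, not part of the demo or of any server API.

```python
# Assumption: the page only says "2K"; 2048 tokens is a guess at the exact limit.
DEMO_CONTEXT_TOKENS = 2048


def completion_budget(prompt_tokens: int, requested_tokens: int,
                      context_tokens: int = DEMO_CONTEXT_TOKENS) -> int:
    """Return how many tokens can be generated without overflowing the window.

    Hypothetical helper: the prompt and the reply share one context window,
    so the reply is capped at whatever the prompt leaves free.
    """
    if prompt_tokens < 0 or requested_tokens < 0:
        raise ValueError("token counts must be non-negative")
    remaining = context_tokens - prompt_tokens
    # Never return a negative budget, even if the prompt alone is too long.
    return max(0, min(requested_tokens, remaining))


if __name__ == "__main__":
    # A 1500-token prompt leaves 548 tokens, so a 1000-token request is capped.
    print(completion_budget(1500, 1000))
```

A prompt that already exceeds the window yields a budget of 0, which a caller could treat as a signal to shorten the prompt before sending it.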
