Nemati AI | ggml-org/llama.cpp — LLM inference in C/C++ | Nemati AI