rkargul 17 Apr 2025 GenAI: Optimizing Local Large Language Models Performance gen-ai llm generative-ai local-llm ollama quantization model-optimization neural-networks model-inference open-source