Search: llm-inference - iMakething Open Source Project Library

Search for "llm-inference" found 3 results

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

3/5 1264

fine-tuning gpt llama

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, and multimodal agents

3/5 260

edge-ai llm-inference multimodal

Self-hosted personalized AI in a mirror.

3/5 116

ai llm-inference local-llm