Search projects

Found 12 results for "inference"

jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA J

⭐⭐⭐☆☆ (3/5) 12396
caffe computer-vision deep-learning
airllm

AirLLM 70B inference with single 4GB GPU

⭐⭐⭐☆☆ (3/5) 8008
chinese-llm chinese-nlp finetune
Paddle-Lite

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge

⭐⭐⭐☆☆ (3/5) 6508
arm baidu deep-learning
LiteRT-LM

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language M

⭐⭐⭐☆☆ (3/5) 2116
edge-ai on-device-ai on-device-llm
shimmy

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single

⭐⭐⭐☆☆ (3/5) 1980
api-server command-line-tool developer-tools
lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

⭐⭐⭐☆☆ (3/5) 1264
fine-tuning gpt llama
microsoft/nn-Meter

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

⭐⭐⭐⭐☆ (4/5) 268
deep-learning deep-neural-networks edge-ai
dusty-nv/NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agent

⭐⭐⭐☆☆ (3/5) 260
edge-ai llm-inference multimodal
orangekame3/mirrormate

Self-hosted personalized AI in a mirror.

⭐⭐⭐☆☆ (3/5) 116
ai llm-inference local-llm
awesome-tinyml

TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT

⭐☆☆☆☆ (1/5) 44
esp32 arduino cortex-m
xiaoclaw

Local AI Agent firmware running on ESP32-S3, integrating offline voice wake-up with cloud TTS, supporting local LLM infe

⭐⭐☆☆☆ (2/5) 30
esp32 assistant chatbot
Arduino-and-Machine-Learning

Wireless multimodal sensing: inferring data from sensor hardware (e.g., temperature, humidity, and light).

⭐⭐☆☆☆ (2/5) 0