搜索: inference - iMakething 开源项目库

搜索项目

搜索 "inference" 找到 12 个结果

jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA J

⭐⭐⭐☆☆ (3/5) 12396

caffe computer-vision deep-learning

airllm

AirLLM 70B inference with single 4GB GPU

⭐⭐⭐☆☆ (3/5) 8008

chinese-llm chinese-nlp finetune

Paddle-Lite

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎）

⭐⭐⭐☆☆ (3/5) 6508

arm baidu deep-learning

LiteRT-LM

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language M

⭐⭐⭐☆☆ (3/5) 2116

edge-ai on-device-ai on-device-llm

shimmy

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single

⭐⭐⭐☆☆ (3/5) 1980

api-server command-line-tool developer-tools

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

⭐⭐⭐☆☆ (3/5) 1264

fine-tuning gpt llama

microsoft/nn-Meter

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

⭐⭐⭐⭐☆ (4/5) 268

deep-learning deep-neural-networks edge-ai

dusty-nv/NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agent

⭐⭐⭐☆☆ (3/5) 260

edge-ai llm-inference multimodal

orangekame3/mirrormate

Self-hosted personalized AI in a mirror.

⭐⭐⭐☆☆ (3/5) 116

ai llm-inference local-llm

awesome-tinyml

TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT

⭐☆☆☆☆ (1/5) 44

esp32 arduino cortex-m

xiaoclaw

Local AI Agent firmware running on ESP32-S3, integrating offline voice wake-up with cloud TTS, supporting local LLM infe

⭐⭐☆☆☆ (2/5) 30

esp32 assistant chatbot

Arduino-and-Machine-Learning

无线多模态传感：推断传感硬件数据（如温湿度、光照）。

⭐⭐☆☆☆ (2/5) 0