说明文档

N19hty/SmolLM2-1.7B-Instruct-IQ3_XXS-GGUF

本模型是从 HuggingFaceTB/SmolLM2-1.7B-Instruct 通过 llama.cpp 转换为 GGUF 格式的，使用了 ggml.ai 的 GGUF-my-repo 空间。

有关模型的更多详细信息，请参阅原始模型卡片。

配合 llama.cpp 使用

通过 brew 安装 llama.cpp（适用于 Mac 和 Linux）

brew install llama.cpp

启动 llama.cpp 服务器或命令行工具。

命令行（CLI）：

llama-cli --hf-repo N19hty/SmolLM2-1.7B-Instruct-IQ3_XXS-GGUF --hf-file smollm2-1.7b-instruct-iq3_xxs-imat.gguf -p \"The meaning to life and the universe is\"

服务器：

llama-server --hf-repo N19hty/SmolLM2-1.7B-Instruct-IQ3_XXS-GGUF --hf-file smollm2-1.7b-instruct-iq3_xxs-imat.gguf -c 2048

注意：你也可以直接按照 Llama.cpp 仓库中列出的使用步骤来使用此检查点。

第一步：从 GitHub 克隆 llama.cpp。

git clone https://github.com/ggerganov/llama.cpp

第二步：进入 llama.cpp 文件夹，并使用 LLAMA_CURL=1 标志以及其他硬件相关标志进行构建（例如：Linux 上使用 Nvidia GPU 时添加 LLAMA_CUDA=1）。

cd llama.cpp && LLAMA_CURL=1 make

第三步：通过主程序运行推理。

./llama-cli --hf-repo N19hty/SmolLM2-1.7B-Instruct-IQ3_XXS-GGUF --hf-file smollm2-1.7b-instruct-iq3_xxs-imat.gguf -p \"The meaning to life and the universe is\"

或

./llama-server --hf-repo N19hty/SmolLM2-1.7B-Instruct-IQ3_XXS-GGUF --hf-file smollm2-1.7b-instruct-iq3_xxs-imat.gguf -c 2048

N19hty/SmolLM2-1.7B-Instruct-IQ3_XXS-GGUF

作者 N19hty

text-generation transformers

↓ 1 ♥ 0

创建时间: 2025-07-02 14:25:18+00:00

更新时间: 2025-07-02 14:25:23+00:00

在 Hugging Face 上查看

文件 (4)

.gitattributes

README.md

imatrix.dat

smollm2-1.7b-instruct-iq3_xxs-imat.gguf