mnn llm Download - Download the mnn llm source code

mnn llm

C/C++

llama3-8b-instruct-mnn

Download

Build locally

# clone
git clone --recurse-submodules https://github.com/wangzhaode/mnn-llm.git
cd mnn-llm

# linux
./script/build.sh

# macos
./script/build.sh

# windows msvc
./script/build.ps1

# python wheel
./script/py_build.sh

# android
./script/android_build.sh

# android apk
./script/android_app_build.sh

# ios
./script/ios_build.sh
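
The host-side choices above (Linux and macOS share `build.sh`, Windows uses `build.ps1`) can be sketched as a small lookup. This is only an illustration: `pick_build_script` is a hypothetical helper, not part of the repository, and treating MINGW/MSYS/Cygwin `uname` values as "Windows" is an assumption. The script only prints the name of the matching build script and does not run it.

```shell
#!/bin/sh
# Hypothetical helper (not part of mnn-llm): map a `uname -s` value
# to the host build script listed above.
pick_build_script() {
    case "$1" in
        Linux|Darwin)          echo "./script/build.sh" ;;
        # Assumption: these uname values indicate a Windows host.
        MINGW*|MSYS*|CYGWIN*)  echo "./script/build.ps1" ;;
        *)                     return 1 ;;
    esac
}

# Print the script for the current host without running it.
if script=$(pick_build_script "$(uname -s)"); then
    echo "build script for this host: $script"
else
    echo "no build script known for $(uname -s)" >&2
fi
```

The Android, iOS and Python wheel scripts are cross-build targets rather than host choices, so they are left out of the lookup.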

Some compile-time macros:

BUILD_FOR_ANDROID: build for Android devices;
LLM_SUPPORT_VISION: whether to enable visual processing support;
DUMP_PROFILE_INFO: print performance data to the command line after each session;

The CPU backend is used by default. To use other backends or features, add the corresponding MNN compile macro when building MNN.

cuda: -DMNN_CUDA=ON
opencl: -DMNN_OPENCL=ON
metal: -DMNN_METAL=ON
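
As a rough sketch of how one of these flags might be chosen, the snippet below maps a backend name to the flag listed above and prints the resulting configure command as a dry run. `backend_flag` and the `MNN_BACKEND` variable are hypothetical names introduced for illustration, and the `cmake ..` invocation assumes a conventional out-of-tree build directory. Check the actual build scripts before relying on it.

```shell
#!/bin/sh
# Hypothetical helper: map a backend name to the MNN CMake flag listed above.
backend_flag() {
    case "$1" in
        cpu)    echo "" ;;                   # default backend, no extra flag
        cuda)   echo "-DMNN_CUDA=ON" ;;
        opencl) echo "-DMNN_OPENCL=ON" ;;
        metal)  echo "-DMNN_METAL=ON" ;;
        *)      echo "unknown backend: $1" >&2; return 1 ;;
    esac
}

# MNN_BACKEND is an assumed variable name, not something the project defines.
backend=${MNN_BACKEND:-cpu}
if flag=$(backend_flag "$backend"); then
    # Dry run: print the configure command instead of executing it.
    echo "cmake .. $flag"
fi
```

For example, `MNN_BACKEND=opencl sh backend.sh` (with the snippet saved under that assumed file name) prints the configure line with the OpenCL flag appended.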

4. Running

# linux/macos
./cli_demo ./Qwen2-1.5B-Instruct-MNN/config.json # cli demo
./web_demo ./Qwen2-1.5B-Instruct-MNN/config.json ../web # web ui demo

# windows
.\Debug\cli_demo.exe ./Qwen2-1.5B-Instruct-MNN/config.json
.\Debug\web_demo.exe ./Qwen2-1.5B-Instruct-MNN/config.json ../web

# android
adb push android_build/MNN/OFF/arm64-v8a/libMNN.so /data/local/tmp
adb push android_build/MNN/express/OFF/arm64-v8a/libMNN_Express.so /data/local/tmp
adb push android_build/libllm.so android_build/cli_demo /data/local/tmp
adb push Qwen2-1.5B-Instruct-MNN /data/local/tmp
adb shell "cd /data/local/tmp && export LD_LIBRARY_PATH=. && ./cli_demo ./Qwen2-1.5B-Instruct-MNN/config.json"
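
Every run command above takes the path to a model's `config.json`, so a quick pre-flight check can catch a missing or misplaced model directory before the demo starts. The following is a minimal sketch: `check_model_dir` and the `MODEL_DIR` variable are hypothetical and not part of mnn-llm, and the check only confirms that the file exists, not that the model is valid.

```shell
#!/bin/sh
# Hypothetical pre-flight check: confirm that a model directory exists
# and contains config.json, then print the path to pass to the demo.
check_model_dir() {
    dir=$1
    if [ ! -d "$dir" ]; then
        echo "missing model directory: $dir" >&2
        return 1
    fi
    if [ ! -f "$dir/config.json" ]; then
        echo "missing $dir/config.json" >&2
        return 1
    fi
    echo "$dir/config.json"
}

# MODEL_DIR is an assumed variable; the default matches the examples above.
model_dir=${MODEL_DIR:-./Qwen2-1.5B-Instruct-MNN}
if config=$(check_model_dir "$model_dir"); then
    # Dry run: show the command that would be executed.
    echo "ready: ./cli_demo $config"
fi
```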