JetStream下載JetStream源代碼下載

JetStream

其他源碼

v0.2.2

下載

Jetstream是XLA設備上LLM推斷的吞吐量和內存優化引擎。

關於

JetStream是XLA設備上LLM推斷的吞吐量和內存優化引擎，從TPU開始（將來GPU和GPU-歡迎PRS）。

Jetstream Engine實施

當前，有兩個可用的參考引擎實現 - 一種用於JAX模型，另一種用於Pytorch型號。

JAX

git：https：//github.com/google/maxtext
README：https：//github.com/google/jetstream/blob/main/main/docs/online-inline-inline-with-maxtext-engine.md

Pytorch

git：https：//github.com/google/jetstream-pytorch
readme：https：//github.com/google/jetstream-pytorch/blob/main/main/readme.md

文件

在V5E Cloud TPU VM上使用Maxtext在線推斷[README]
在V5E Cloud TPU VM上與Pytorch在線推斷[readme]
使用tpus在gke上使用jetstream使用tpu
基準測試服務器
Jetstream服務器中的可觀察性
在Jetstream服務器中進行分析
Jetstream獨立本地設置

Jetstream獨立本地設置

入門

設定

make install-deps

運行本地服務器和測試

使用以下命令在本地運行服務器：

# Start a server
python -m jetstream.core.implementations.mock.server

# Test local mock server
python -m jetstream.tools.requester

# Load test local mock server
python -m jetstream.tools.load_tester

測試核心模塊

# Test JetStream core orchestrator
python -m unittest -v jetstream.tests.core.test_orchestrator

# Test JetStream core server library
python -m unittest -v jetstream.tests.core.test_server

# Test mock JetStream engine implementation
python -m unittest -v jetstream.tests.engine.test_mock_engine

# Test mock JetStream token utils
python -m unittest -v jetstream.tests.engine.test_token_utils
python -m unittest -v jetstream.tests.engine.test_utils

展開

附加信息

版本 v0.2.2
類型其他源碼
更新時間 2025-02-19
大小 2.57MB
來自於 Github

相關應用

waymo open dataset

2024-11-18
Sunamu

2024-12-14
MySchedule.py

2024-12-15
chat.petals.dev

2024-11-30
SmartTube

2024-12-14
viptools for eslam

2024-12-15

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
waymo open dataset

其他源碼

December 2023 Update
Sunamu

其他源碼

Release 2.2.0
MySchedule.py

其他源碼

Updates to the fetching of week codes
waymo open dataset

其他源碼

December 2023 Update
termwind

其他類別

v2.3.0
wp functions

其他類別

1.0.0

相關資訊全部