sarathi serve ดาวน์โหลด - sarathi serve ดาวน์โหลดซอร์สโค้ด

sarathi serve

ซอร์สโค้ดอื่น ๆ

1.0.0

ดาวน์โหลด

สารธี-เสิร์ฟ

Sarathi-Serve เป็นเฟรมเวิร์กการให้บริการ LLM ที่มีปริมาณงานสูงและมีความหน่วงต่ำ โปรดดูเอกสาร OSDI'24 ของเราสำหรับรายละเอียดเพิ่มเติม

ตั้งค่า

ตั้งค่า CUDA

Sarathi-Serve ได้รับการทดสอบกับ CUDA 12.3 บน GPU H100 และ A100

พื้นที่เก็บข้อมูลโคลน

git clone [email protected]:microsoft/sarathi-serve.git

สร้างสภาพแวดล้อม mamba

ตั้งค่า mamba หากคุณยังไม่มี

wget https://github.com/conda-forge/miniforge/releases/latest/download/Mambaforge-Linux-x86_64.sh
bash Mambaforge-Linux-x86_64.sh # follow the instructions from there

สร้างสภาพแวดล้อม Python 3.10

mamba create -p ./env python=3.10

ติดตั้ง สารธี-เสิร์ฟ

pip install -e . --extra-index-url https://flashinfer.ai/whl/cu121/torch2.3/

การสร้างผลลัพธ์ขึ้นมาใหม่

อ้างถึง readmes ในแต่ละโฟลเดอร์ที่สอดคล้องกับแต่ละรูปใน osdi-experiments

การอ้างอิง

หากคุณใช้งานของเรา โปรดพิจารณาอ้างอิงบทความของเรา:

 @article{agrawal2024taming,
  title={Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve},
  author={Agrawal, Amey and Kedia, Nitin and Panwar, Ashish and Mohan, Jayashree and Kwatra, Nipun and Gulavani, Bhargav S and Tumanov, Alexey and Ramjee, Ramachandran},
  journal={Proceedings of 18th USENIX Symposium on Operating Systems Design and Implementation, 2024, Santa Clara},
  year={2024}
}

รับทราบ

เดิมทีที่เก็บนี้เริ่มต้นจากทางแยกของโปรเจ็กต์ vLLM Sarathi-Serve เป็นต้นแบบการวิจัยและไม่มีฟีเจอร์ที่เทียบเท่ากับ vLLM แบบโอเพ่นซอร์สอย่างสมบูรณ์ เรายังคงรักษาคุณลักษณะที่สำคัญที่สุดไว้เท่านั้นและนำโค้ดเบสมาใช้เพื่อการทำซ้ำการวิจัยที่รวดเร็วยิ่งขึ้น

ขยาย

ข้อมูลเพิ่มเติม

เวอร์ชัน 1.0.0
ประเภท ซอร์สโค้ดอื่น ๆ
เวลาอัปเดต 2025-01-09
ขนาด 253.84KB
มาจาก Github

แอปที่เกี่ยวข้อง

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
GitHub the via/releases

2024-11-01

แนะนำสำหรับคุณ

chat.petals.dev

ซอร์สโค้ดอื่น ๆ

1.0.0
GPT Prompt Templates

ซอร์สโค้ดอื่น ๆ

1.0.0
GPTyped

ซอร์สโค้ดอื่น ๆ

GPTyped 1.0.5
waymo open dataset

ซอร์สโค้ดอื่น ๆ

December 2023 Update
SmartTube

ซอร์สโค้ดอื่น ๆ

24.71 Stable
Sunamu

ซอร์สโค้ดอื่น ๆ

Release 2.2.0
waymo open dataset

ซอร์สโค้ดอื่น ๆ

December 2023 Update
wp functions

หมวดหมู่อื่นๆ

1.0.0
termwind

หมวดหมู่อื่นๆ

v2.3.0

ข้อมูลที่เกี่ยวข้อง ทั้งหมด