kani (カニ)

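kani (カニ) is a lightweight and highly hackable framework for chat-based language models with tool usage/function calling.

Compared to other LM frameworks, kani is less opinionated and offers more fine-grained customizability over the parts of the control flow that matter, making it a strong choice for NLP researchers, hobbyists, and developers alike.

kani supports the following models out of the box, and its model-agnostic framework makes it possible to add support for many more: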

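Hosted Models

OpenAI Models (GPT-3.5-turbo, GPT-4, GPT-4-turbo, GPT-4o)
Anthropic Models (Claude, Claude Instant)

Open Source Models

kani supports every chat model available on Hugging Face through transformers or llama.cpp!

In particular, we have reference implementations for the following base models and their fine-tunes:

Llama 3 (all sizes)
Mistral-7B, Mixtral-8x7B, and Mixtral-8x22B
Command R and Command R+
Gemma (all sizes)
LLaMA v2 (all sizes)
Vicuna v1.3

Check out the Model Zoo to learn how to use each of these models in your application!

Interested in contributing? Check out our guide.

Read the docs on ReadTheDocs!

Read our paper on arXiv!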

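Features

Lightweight and high-level - kani implements common boilerplate to interface with language models without forcing you to use opinionated prompt frameworks or complex library-specific tooling.
Model agnostic - kani provides a simple interface to implement: token counting and completion generation. kani lets developers switch which language model runs on the backend without major code refactors.
Automatic chat memory management - Allow chat sessions to flow without worrying about managing the number of tokens in the history - kani takes care of it.
Function calling with model feedback and retry - Give models access to functions in just one line of code. kani elegantly provides feedback about hallucinated parameters and errors and allows the model to retry calls.
You control the prompts - There are no hidden prompt hacks. We will never decide for you how to format your own data, unlike other popular language model libraries.
Fast to iterate and intuitive to learn - With kani, you only write Python - we handle the rest.
Asynchronous design from the start - kani can scale to run multiple chat sessions in parallel easily, without having to manage multiple processes or programs.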
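
The two operations named under "Model agnostic" (token counting and completion generation) and the history trimming described under "Automatic chat memory management" can be illustrated with a toy sketch. Everything here is an assumption made for illustration: `ToyEngine`, `EchoEngine`, `trim_history`, and the whitespace-based token count are hypothetical and are not kani's actual classes or trimming algorithm.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class Message:
    role: str
    text: str


class ToyEngine(ABC):
    """Hypothetical engine interface: a token budget plus two operations."""

    max_context_size: int

    @abstractmethod
    def message_len(self, message: Message) -> int:
        """Return how many tokens the message occupies."""

    @abstractmethod
    def predict(self, messages: list[Message]) -> Message:
        """Return the model's next message given the history."""


class EchoEngine(ToyEngine):
    """Stand-in backend: counts whitespace-separated words, echoes the last message."""

    max_context_size = 8

    def message_len(self, message: Message) -> int:
        return len(message.text.split())

    def predict(self, messages: list[Message]) -> Message:
        return Message("assistant", f"You said: {messages[-1].text}")


def trim_history(engine: ToyEngine, history: list[Message]) -> list[Message]:
    """Keep the most recent messages whose total length fits the engine's budget."""
    kept: list[Message] = []
    used = 0
    for message in reversed(history):
        cost = engine.message_len(message)
        if used + cost > engine.max_context_size:
            break
        kept.append(message)
        used += cost
    kept.reverse()
    return kept


engine = EchoEngine()
history = [
    Message("user", "one two three four five"),
    Message("assistant", "six seven eight"),
    Message("user", "nine ten eleven"),
]
context = trim_history(engine, history)
reply = engine.predict(context)
print([m.text for m in context])
print(reply.text)
```

Swapping the backend in this sketch means writing another `ToyEngine` subclass; the trimming code only depends on the two methods and the budget.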

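Installation

kani requires Python 3.10 or above. To install model-specific dependencies, kani uses various extras (the brackets after the library name in pip install). To determine which extra(s) to install, see the model table, or use the [all] extra to install everything.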

# for OpenAI models
$ pip install "kani[openai]"
# for Hugging Face models
$ pip install "kani[huggingface]" torch
# or install everything:
$ pip install "kani[all]"

For the most up-to-date changes and new models, you can also install the development version from Git's main branch:

$ pip install "kani[all] @ git+https://github.com/zhudotexe/kani.git@main"
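
Because extras only pull in optional dependencies, it can be handy to check which backends are importable before choosing an engine. The following is a minimal sketch using only the standard library. The mapping from extra name to module name (`openai`, `transformers`) is an assumption, and the helper names are hypothetical.

```python
import importlib.util
import sys

# Assumed mapping from extra name to the module it is expected to provide.
EXTRA_MODULES = {"openai": "openai", "huggingface": "transformers"}


def python_is_supported(version_info=sys.version_info) -> bool:
    """Check the documented requirement of Python 3.10 or above."""
    return tuple(version_info[:2]) >= (3, 10)


def module_available(module_name: str) -> bool:
    """Return True if the top-level module can be found without importing it."""
    return importlib.util.find_spec(module_name) is not None


def missing_extras() -> list[str]:
    """List the extras whose assumed backend module is not installed."""
    return [extra for extra, module in EXTRA_MODULES.items() if not module_available(module)]


if not python_is_supported():
    print("Python 3.10 or above is required")
for extra in missing_extras():
    print(f'missing backend; try: pip install "kani[{extra}]"')
```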

Quickstart

kani requires Python 3.10 or above.

First, install the library. In this quickstart, we'll use the OpenAI engine, though kani is model-agnostic.

$ pip install "kani[openai]"

Then, let's use kani to create a simple chatbot using ChatGPT as a backend.

# import the library
import asyncio
from kani import Kani, chat_in_terminal
from kani.engines.openai import OpenAIEngine

# Replace this with your OpenAI API key: https://platform.openai.com/account/api-keys
api_key = "sk-..."

# kani uses an Engine to interact with the language model. You can specify other model
# parameters here, like temperature=0.7.
engine = OpenAIEngine(api_key, model="gpt-4o-mini")

# The kani manages the chat state, prompting, and function calling. Here, we only give
# it the engine to call ChatGPT, but you can specify other parameters like
# system_prompt="You are..." here.
ai = Kani(engine)

# kani comes with a utility to interact with a kani through your terminal...
chat_in_terminal(ai)


# or you can use kani programmatically in an async function!
async def main():
    resp = await ai.chat_round("What is the airspeed velocity of an unladen swallow?")
    print(resp.text)


asyncio.run(main())

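kani keeps the time needed to set up a working chat model short, while offering the programmer deep customizability over every prompt, function call, and even the underlying language model.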
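
Since the programmatic interface is a coroutine, several independent chat sessions can be awaited concurrently. The sketch below shows the pattern with `asyncio.gather`. To stay runnable without an API key it uses `FakeSession`, a hypothetical stand-in that is not part of kani; the assumption is that the same pattern carries over to real `Kani` instances, each holding its own chat state.

```python
import asyncio


class FakeSession:
    """Hypothetical stand-in for a chat session; not part of kani."""

    def __init__(self, name: str):
        self.name = name
        self.history: list[str] = []

    async def chat_round(self, query: str) -> str:
        await asyncio.sleep(0.01)  # simulates waiting on a model backend
        self.history.append(query)
        return f"[{self.name}] reply to: {query}"


async def run_sessions(queries: dict[str, str]) -> list[str]:
    sessions = [FakeSession(name) for name in queries]
    # gather awaits all coroutines concurrently and preserves argument order
    return await asyncio.gather(
        *(s.chat_round(q) for s, q in zip(sessions, queries.values()))
    )


replies = asyncio.run(run_sessions({"a": "Hello", "b": "What is kani?"}))
print(replies)
```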

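Function Calling

Function calling gives language models the ability to choose when to call a function you provide, based on its documentation.

With kani, you can write functions in Python and expose them to the model with just one line of code: the @ai_function decorator.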

# import the library
import asyncio
from typing import Annotated
from kani import AIParam, Kani, ai_function, chat_in_terminal, ChatRole
from kani.engines.openai import OpenAIEngine

# set up the engine as above
api_key = "sk-..."
engine = OpenAIEngine(api_key, model="gpt-4o-mini")


# subclass Kani to add AI functions
class MyKani(Kani):
    # Adding the annotation to a method exposes it to the AI
    @ai_function()
    def get_weather(
        self,
        # and you can provide extra documentation about specific parameters
        location: Annotated[str, AIParam(desc="The city and state, e.g. San Francisco, CA")],
    ):
        """Get the current weather in a given location."""
        # In this example, we mock the return, but you could call a real weather API
        return f"Weather in {location}: Sunny, 72 degrees fahrenheit."


ai = MyKani(engine)

# the terminal utility allows you to test function calls...
chat_in_terminal(ai)


# and you can track multiple rounds programmatically.
async def main():
    async for msg in ai.full_round("What's the weather in Tokyo?"):
        print(msg.role, msg.text)


asyncio.run(main())

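kani guarantees that function calls are valid by the time they reach your methods, so you can focus on writing code. For more information, check out the function calling docs.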
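
To make the idea of validating calls and feeding errors back to the model concrete, here is a toy sketch of how a framework could derive parameter information from a Python signature and turn bad arguments into feedback text. It is not kani's implementation: `describe` and `call_with_feedback` are hypothetical helpers, and a plain string inside `Annotated` stands in for `AIParam`.

```python
import inspect
from typing import Annotated, get_args, get_origin, get_type_hints


def get_weather(
    location: Annotated[str, "The city and state, e.g. San Francisco, CA"],
    days: int = 1,
):
    """Get the weather forecast for a given location."""
    return f"Weather in {location} for {days} day(s): Sunny."


def describe(func) -> dict:
    """Build a simple description of a function from its signature and docstring."""
    hints = get_type_hints(func, include_extras=True)
    params = {}
    for name, param in inspect.signature(func).parameters.items():
        hint = hints.get(name, str)
        desc = None
        if get_origin(hint) is Annotated:
            # Annotated[T, meta, ...] unpacks to (T, meta, ...)
            hint, desc, *_ = get_args(hint)
        params[name] = {
            "type": hint,
            "desc": desc,
            "required": param.default is inspect.Parameter.empty,
        }
    return {"name": func.__name__, "doc": inspect.getdoc(func), "params": params}


def call_with_feedback(func, arguments: dict):
    """Validate model-supplied arguments; return (ok, result or feedback text)."""
    params = describe(func)["params"]
    unknown = sorted(set(arguments) - set(params))
    if unknown:
        return False, f"Unknown parameter(s): {', '.join(unknown)}"
    missing = [n for n, p in params.items() if p["required"] and n not in arguments]
    if missing:
        return False, f"Missing required parameter(s): {', '.join(missing)}"
    for name, value in arguments.items():
        expected = params[name]["type"]
        if not isinstance(value, expected):
            return False, f"Parameter {name!r} should be {expected.__name__}"
    return True, func(**arguments)


print(call_with_feedback(get_weather, {"location": "Tokyo"}))
print(call_with_feedback(get_weather, {"city": "Tokyo"}))
```

In a retry loop, the feedback string from a failed call would be appended to the chat history so the model can correct its arguments on the next attempt.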

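Streaming

kani supports streaming responses from the underlying language model token by token, even in the presence of function calls. Streaming is designed to be a drop-in superset of the chat_round and full_round methods, allowing you to gradually refactor your code without ever leaving it in a broken state.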

async def stream_chat():
    stream = ai.chat_round_stream("What does kani mean?")
    async for token in stream:
        print(token, end="")
    print()
    msg = await stream.message()  # or `await stream`


async def stream_with_function_calling():
    async for stream in ai.full_round_stream("What's the weather in Tokyo?"):
        async for token in stream:
            print(token, end="")
        print()
        msg = await stream.message()
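
The "drop-in superset" idea rests on an object that can be iterated for tokens or awaited for the complete message. The sketch below shows one way such an object could be built in plain Python. `ToyStream` is hypothetical and returns a plain string rather than a message object; kani's actual stream class may be implemented differently.

```python
import asyncio
from typing import AsyncIterator


class ToyStream:
    """Hypothetical stream: iterate for tokens, or await for the full message."""

    def __init__(self, tokens: list[str]):
        self._tokens = tokens
        self._parts: list[str] = []
        self._done = False

    async def __aiter__(self) -> AsyncIterator[str]:
        self._parts.clear()  # each iteration replays the stream from the start
        for token in self._tokens:
            await asyncio.sleep(0)  # yield control, as a real backend would while waiting
            self._parts.append(token)
            yield token
        self._done = True

    async def message(self) -> str:
        # If the caller never finished iterating, consume the stream here.
        if not self._done:
            async for _ in self:
                pass
        return "".join(self._parts)

    def __await__(self):
        # Makes `await stream` equivalent to `await stream.message()`.
        return self.message().__await__()


async def demo() -> tuple[str, str]:
    stream = ToyStream(["Hello", ",", " world"])
    async for token in stream:
        print(token, end="")
    print()
    streamed = await stream.message()
    awaited = await ToyStream(["no", " loop"])  # awaiting directly skips the loop
    return streamed, awaited


result = asyncio.run(demo())
```

Because awaiting the object yields the same final value as iterating it first, code written against a non-streaming call can adopt the streaming variant one call site at a time.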

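Why kani?

Existing frameworks for language models like LangChain and simpleaichat are opinionated and/or heavyweight - they edit developers' prompts under the hood, are challenging to learn, and are difficult to customize without adding a lot of high-maintenance bloat to your codebase.

We built kani as a more flexible, simple, and robust alternative. A good analogy between frameworks would be to say that kani is to LangChain as Flask (or FastAPI) is to Django.

kani is appropriate for everyone from academic researchers to industry professionals to hobbyists, with no need to worry about under-the-hood hacks.

Docs

To learn more about how to customize kani with your own prompt wrappers, function calling, and more, read the docs!

Or take a look at the hands-on examples in this repo.

Demo

Want to see kani in action? Using 4-bit quantization to shrink the model, we run LLaMA v2 as part of our test suite right on GitHub Actions: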

https://github.com/zhudotexe/kani/actions/workflows/pytest.yml?query=branch%3amain+isas%3Asuccess

Simply click on the latest build to see LLaMA's output!

Who we are

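The core development team is made up of three PhD students in the Department of Computer and Information Science at the University of Pennsylvania. We're all members of Prof. Chris Callison-Burch's lab, working towards advancing the future of NLP.

Andrew Zhu started in Fall 2022. His research interests include natural language processing, programming languages, distributed systems, and more. He's also a full-stack software engineer, proficient in all manner of backend, devops, database, and frontend engineering. Andrew strives to write idiomatic, clean, performant, and low-maintenance code - philosophies that are often rare in academia. His research is supported by the NSF Graduate Research Fellowship.
Liam Dugan started in Fall 2021. His research focuses primarily on large language models and how humans interact with them. In particular, he is interested in human detection of generated text and whether we can apply those insights to automatic detection systems. He is also interested in the practical application of large language models to education.
Alyssa Hwang started in Fall 2020 and is advised by Chris Callison-Burch and Andrew Head. Her research focuses on AI assistants that effectively communicate complex information, like voice assistants guiding users through instructions or audiobooks allowing users to seamlessly navigate through spoken text. Beyond research, Alyssa chairs the Penn CIS Doctoral Association, founded the CIS PhD Mentorship Program, and is supported by the NSF Graduate Research Fellowship Program.

We use kani actively in our research, and aim to keep it up to date with modern NLP practices.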

Citation

If you use kani, please cite us as:

 @inproceedings{zhu-etal-2023-kani,
    title = "Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications",
    author = "Zhu, Andrew  and
      Dugan, Liam  and
      Hwang, Alyssa  and
      Callison-Burch, Chris",
    editor = "Tan, Liling  and
      Milajevs, Dmitrijs  and
      Chauhan, Geeticka  and
      Gwinnup, Jeremy  and
      Rippeth, Elijah",
    booktitle = "Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023)",
    month = dec,
    year = "2023",
    address = "Singapore",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.nlposs-1.8",
    doi = "10.18653/v1/2023.nlposs-1.8",
    pages = "65--77",
}

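Acknowledgements

We would like to thank the members of the lab of Chris Callison-Burch for their detailed feedback on the contents of both our paper and the kani repository. In addition, we'd like to thank Henry Zhu (no relation to the first author) for his early and enthusiastic support of the project.

This research is based upon work supported in part by the Air Force Research Laboratory (contract FA8750-23-C-0507), the IARPA HIATUS Program (contract 2022-220722005), and the NSF (Award 1928631). Approved for Public Release, Distribution Unlimited. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of IARPA, NSF, or the U.S. Government.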

Additional information

Version: v1.2.3
Type: Other source code
Updated: 2025-03-03
Size: 12.76 MB
Source: GitHub