ollama pythonダウンロード - ollama pythonソースコードのダウンロード

ollama python

パイソン

v0.4.4

ダウンロード

オラマ Python ライブラリ

Ollama Python ライブラリは、Python 3.8 以降のプロジェクトを Ollama と統合する最も簡単な方法を提供します。

前提条件

Ollama がインストールされて実行されている必要があります
ライブラリで使用するモデルをプルします: ollama pull <model>例: ollama pull llama3.2
- 利用可能なモデルの詳細については、Ollama.com を参照してください。

インストール

pip install ollama

使用法

 from ollama import chat
from ollama import ChatResponse

response : ChatResponse = chat ( model = 'llama3.2' , messages = [
  {
    'role' : 'user' ,
    'content' : 'Why is the sky blue?' ,
  },
])
print ( response [ 'message' ][ 'content' ])
# or access fields directly from the response object
print ( response . message . content )

応答タイプの詳細については、_types.py を参照してください。

ストリーミング応答

応答ストリーミングは、 stream=True設定することで有効にできます。

 from ollama import chat

stream = chat (
    model = 'llama3.2' ,
    messages = [{ 'role' : 'user' , 'content' : 'Why is the sky blue?' }],
    stream = True ,
)

for chunk in stream :
  print ( chunk [ 'message' ][ 'content' ], end = '' , flush = True )

カスタムクライアント

カスタムクライアントはollamaからClientまたはAsyncClientインスタンス化することで作成できます。

追加のキーワード引数はすべてhttpx.Clientに渡されます。

 from ollama import Client
client = Client (
  host = 'http://localhost:11434' ,
  headers = { 'x-some-header' : 'some-value' }
)
response = client . chat ( model = 'llama3.2' , messages = [
  {
    'role' : 'user' ,
    'content' : 'Why is the sky blue?' ,
  },
])

非同期クライアント

AsyncClientクラスは、非同期リクエストを行うために使用されます。 Clientクラスと同じフィールドを使用して構成できます。

 import asyncio
from ollama import AsyncClient

async def chat ():
  message = { 'role' : 'user' , 'content' : 'Why is the sky blue?' }
  response = await AsyncClient (). chat ( model = 'llama3.2' , messages = [ message ])

asyncio . run ( chat ())

stream=Trueを設定すると、Python 非同期ジェネレーターを返すように関数が変更されます。

 import asyncio
from ollama import AsyncClient

async def chat ():
  message = { 'role' : 'user' , 'content' : 'Why is the sky blue?' }
  async for part in await AsyncClient (). chat ( model = 'llama3.2' , messages = [ message ], stream = True ):
    print ( part [ 'message' ][ 'content' ], end = '' , flush = True )

asyncio . run ( chat ())

API

Ollama Python ライブラリの API は、Ollama REST API を中心に設計されています。

チャット

 ollama . chat ( model = 'llama3.2' , messages = [{ 'role' : 'user' , 'content' : 'Why is the sky blue?' }])

生成する

 ollama . generate ( model = 'llama3.2' , prompt = 'Why is the sky blue?' )

見せる

 ollama . show ( 'llama3.2' )

作成する

 modelfile = '''
FROM llama3.2
SYSTEM You are mario from super mario bros.
'''

ollama . create ( model = 'example' , modelfile = modelfile )

コピー

 ollama . copy ( 'llama3.2' , 'user/llama3.2' )

消去

 ollama . delete ( 'llama3.2' )

引く

 ollama . pull ( 'llama3.2' )

押す

 ollama . push ( 'user/llama3.2' )

埋め込む

 ollama . embed ( model = 'llama3.2' , input = 'The sky is blue because of rayleigh scattering' )

埋め込み(バッチ)

 ollama . embed ( model = 'llama3.2' , input = [ 'The sky is blue because of rayleigh scattering' , 'Grass is green because of chlorophyll' ])

追伸

 ollama . ps ()

エラー

リクエストがエラーステータスを返した場合、またはストリーミング中にエラーが検出された場合は、エラーが発生します。

 model = 'does-not-yet-exist'

try :
  ollama . chat ( model )
except ollama . ResponseError as e :
  print ( 'Error:' , e . error )
  if e . status_code == 404 :
    ollama . pull ( model )