archgw

Release 0.1.5

Build fast, observable, and personalized AI agents.

Arch is an intelligent Layer 7 distributed proxy designed to protect, observe, and personalize AI agents with your APIs.

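Engineered with purpose-built LLMs, Arch handles the critical but undifferentiated tasks related to the handling and processing of prompts, including detecting and rejecting jailbreak attempts and intelligently calling "backend" APIs to fulfill the user's request.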

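Arch is built on Envoy Proxy with the belief that:

Prompts are nuanced and opaque user requests, which require the same capabilities as traditional HTTP requests, including secure handling, intelligent routing, robust observability, and integration with backend (API) systems for personalization.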

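Core Features:

Built on Envoy: Arch runs alongside application servers and builds on Envoy's proven HTTP management and scalability features to handle ingress and egress traffic related to prompts and LLMs.
Function Calling for fast agentic and RAG apps: Engineered with purpose-built LLMs to handle fast, cost-effective, and accurate prompt-based tasks such as function/API calling and parameter extraction from prompts.
Prompt Guard: Arch centralizes prompt guardrails to prevent jailbreak attempts and ensure safe user interactions without writing a single line of code.
Traffic Management: Arch manages LLM calls, offering smart retries, automatic cutover, and resilient upstream connections for continuous availability.
Standards-based Observability: Arch uses the W3C Trace Context standard to enable complete request tracing across applications, ensuring compatibility with observability tools, and provides metrics to monitor latency, token usage, and error rates, helping optimize AI application performance.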
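To make the W3C Trace Context point concrete, the following is a minimal sketch of the `traceparent` header format that the standard defines (`version-traceid-parentid-flags`). It only illustrates the header layout; how Arch itself creates or propagates this header is not described here, and the function names (`new_traceparent`, `parse_traceparent`, `child_traceparent`) are hypothetical helpers introduced for illustration.

```python
import re
import secrets

# W3C Trace Context "traceparent" layout: version-traceid-parentid-flags,
# all lowercase hex. Version "ff" is reserved and treated as invalid here.
TRACEPARENT_RE = re.compile(
    r"^(?P<version>[0-9a-f]{2})-(?P<trace_id>[0-9a-f]{32})-"
    r"(?P<parent_id>[0-9a-f]{16})-(?P<flags>[0-9a-f]{2})$"
)


def new_traceparent(sampled: bool = True) -> str:
    """Create a traceparent value that starts a new trace (version 00)."""
    trace_id = secrets.token_hex(16)  # 16 random bytes -> 32 hex characters
    parent_id = secrets.token_hex(8)  # 8 random bytes -> 16 hex characters
    flags = "01" if sampled else "00"
    return f"00-{trace_id}-{parent_id}-{flags}"


def parse_traceparent(value: str) -> dict:
    """Split a traceparent value into its fields, rejecting malformed input."""
    match = TRACEPARENT_RE.match(value)
    if match is None:
        raise ValueError(f"malformed traceparent: {value!r}")
    fields = match.groupdict()
    if fields["version"] == "ff":
        raise ValueError("version ff is not allowed")
    # The standard forbids all-zero trace and parent identifiers.
    if set(fields["trace_id"]) == {"0"} or set(fields["parent_id"]) == {"0"}:
        raise ValueError("trace_id and parent_id must not be all zeros")
    return fields


def child_traceparent(incoming: str) -> str:
    """Keep the incoming trace_id and flags, but mint a new parent_id.

    This is what a service does before calling the next hop, so that all
    spans of one request share a single trace_id.
    """
    fields = parse_traceparent(incoming)
    return f"00-{fields['trace_id']}-{secrets.token_hex(8)}-{fields['flags']}"
```

An application that sits behind the gateway could forward `child_traceparent(request_header)` on its own outbound calls so that tracing tools can stitch the hops into one trace.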

Jump to the docs to learn how you can use Arch to improve the speed, security, and personalization of your GenAI apps.

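Important

Today, the function calling LLM (Arch-Function) designed for agentic and RAG scenarios is hosted free of charge in the US-central region. To offer consistent latency and throughput, and to manage our expenses, we will soon enable access to the hosted version via developer keys and give you the option to run that LLM locally. For more details, see issue #258.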

Contact

To get in touch with us, please join our Discord server. We monitor it actively and offer support there.

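Demos

Weather Forecast - Walk through the core function calling capabilities of the Arch gateway using a weather forecasting service
Insurance Agent - Build a full insurance agent with Arch
Network Agent - Build a networking co-pilot/agent with Arch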

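Quickstart

Follow this guide to learn how to quickly set up Arch and integrate it into your generative AI applications.

Prerequisites

Before you begin, ensure you have the following:

Docker and Python installed on your system
API keys for LLM providers (if using external LLMs)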

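Step 1: Install Arch

Arch's CLI allows you to manage and interact with the Arch gateway efficiently. To install the CLI, run the commands below.

Tip: We recommend creating a new Python virtual environment before installing Arch, so that archgw and its dependencies do not interfere with other packages on your system.

Before you proceed, make sure the following utilities are installed:

Docker System (v24)
Docker Compose (v2.29)
Python (v3.12)
Poetry (v1.8.3; only needed for local development)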

$ python -m venv venv
$ source venv/bin/activate   # On Windows, use: venv\Scripts\activate
$ pip install archgw
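Before running the install, it can help to confirm that the basics are in place. The following is a minimal sketch, not part of the archgw CLI: `check_prerequisites` is a hypothetical helper that only checks that a `docker` executable is on the PATH, that the running interpreter meets the Python version listed above, and that a virtual environment is active. It does not verify the Docker or Docker Compose versions.

```python
import shutil
import sys


def check_prerequisites(min_python: tuple = (3, 12)) -> dict:
    """Report whether the basic tools for the quickstart look available."""
    return {
        # True when a `docker` executable can be found on the PATH.
        "docker": shutil.which("docker") is not None,
        # True when the interpreter running this script is new enough.
        "python": sys.version_info[:2] >= min_python,
        # Inside a venv, sys.prefix points at the venv and differs from
        # sys.base_prefix, which points at the base installation.
        "in_virtualenv": sys.prefix != sys.base_prefix,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

Running the script inside the activated `venv` should report `in_virtualenv: ok`; if it does not, the `pip install archgw` step would install into the system interpreter instead.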

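Step 2: Configure Arch with your application

Arch operates based on a configuration file in which you define LLM providers, prompt targets, guardrails, and so on. Below is an example configuration to get you started: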

version: v0.1
listener:
  address: 127.0.0.1
  port: 8080 # If you configure port 443, you'll need to update the listener with tls_certificates
  message_format: huggingface

# Centralized way to manage LLMs: keys, retry logic, failover, and limits
llm_providers:
  - name: OpenAI
    provider: openai
    access_key: $OPENAI_API_KEY
    model: gpt-3.5-turbo
    default: true

# default system prompt used by all prompt targets
system_prompt: |
  You are a network assistant that helps operators with a better understanding of network traffic flow and perform actions on networking operations. No advice on manufacturers or purchasing decisions.

prompt_targets:
    - name: device_summary
      description: Retrieve network statistics for specific devices within a time range
      endpoint:
        name: app_server
        path: /agent/device_summary
      parameters:
        - name: device_ids
          type: list
          description: A list of device identifiers (IDs) to retrieve statistics for.
          required: true  # device_ids are required to get device statistics
        - name: days
          type: int
          description: The number of days for which to gather device statistics.
          default: "7"
    - name: reboot_devices
      description: Reboot a list of devices
      endpoint:
        name: app_server
        path: /agent/device_reboot
      parameters:
        - name: device_ids
          type: list
          description: A list of device identifiers (IDs).
          required: true
        - name: days
          type: int
          description: A list of device identifiers (IDs)
          default: "7"

# Arch creates a round-robin load balancing between different endpoints, managed via the cluster subsystem.
endpoints:
  app_server:
    # value could be ip address or a hostname with port
    # this could also be a list of endpoints for load balancing
    # for example endpoint: [ ip1:port, ip2:port ]
    endpoint: host.docker.internal:18083
    # max time to wait for a connection to be established
    connect_timeout: 0.005s
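In the configuration above, each prompt target refers to a backend by name (`endpoint.name`), and that name has to match a key under `endpoints`. The following is a minimal sketch of a pre-flight check for that cross-reference. It is not part of Arch: `find_config_problems` is a hypothetical helper, the checks are this sketch's own choices rather than rules Arch is known to enforce, and it assumes the YAML has already been loaded into a dict (for example with PyYAML's `yaml.safe_load`, which is a third-party package).

```python
def find_config_problems(config: dict) -> list[str]:
    """Return human-readable problems found in an Arch-style config dict."""
    problems = []
    endpoints = config.get("endpoints") or {}
    seen_names = set()

    for target in config.get("prompt_targets") or []:
        name = target.get("name", "<unnamed>")

        # Two targets with the same name would be ambiguous to route to.
        if name in seen_names:
            problems.append(f"duplicate prompt target name {name!r}")
        seen_names.add(name)

        # Every target must point at an endpoint that is actually defined.
        endpoint_name = (target.get("endpoint") or {}).get("name")
        if endpoint_name not in endpoints:
            problems.append(
                f"prompt target {name!r} references unknown endpoint {endpoint_name!r}"
            )
    return problems


# A trimmed-down version of the example configuration, already parsed.
example_config = {
    "prompt_targets": [
        {"name": "device_summary",
         "endpoint": {"name": "app_server", "path": "/agent/device_summary"}},
        {"name": "reboot_devices",
         "endpoint": {"name": "app_server", "path": "/agent/device_reboot"}},
    ],
    "endpoints": {
        "app_server": {"endpoint": "host.docker.internal:18083"},
    },
}

if __name__ == "__main__":
    for problem in find_config_problems(example_config):
        print(problem)
```

For the example configuration the function returns an empty list; renaming `app_server` under `endpoints` without updating the prompt targets would produce one problem per target.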

Step 3: Use the OpenAI client with Arch as an egress gateway

Make outbound calls via Arch

from openai import OpenAI

# Use the OpenAI client as usual
client = OpenAI(
    # No need to set a specific openai.api_key since it's configured in Arch's gateway
    api_key="--",
    # Set the OpenAI API base URL to the Arch gateway endpoint
    base_url="http://127.0.0.1:12000/v1",
)

response = client.chat.completions.create(
    # The model is selected from the arch_config file
    model="--",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)

print("OpenAI Response:", response.choices[0].message.content)
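For readers who want to see what travels over the wire, the following is a minimal sketch of the same call built with only the Python standard library. It assumes the gateway exposes the OpenAI-compatible `/chat/completions` route under the base URL used above and accepts the same placeholder key and model; `build_chat_request` and `send` are hypothetical helper names, not part of Arch or the OpenAI SDK.

```python
import json
import urllib.request


def build_chat_request(
    prompt: str, base_url: str = "http://127.0.0.1:12000/v1"
) -> urllib.request.Request:
    """Build the HTTP request that a chat completion call boils down to."""
    body = {
        "model": "--",  # placeholder, as in the client example above
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # The OpenAI client sends its api_key as a bearer token.
            "Authorization": "Bearer --",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the assistant's reply text.

    Raises urllib.error.URLError if the gateway is not reachable.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["choices"][0]["message"]["content"]
```

With the gateway running, `print(send(build_chat_request("What is the capital of France?")))` should behave like the client example; building the request separately from sending it makes it easy to inspect the URL and body without any network access.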