nlpcloud pythonダウンロード - nlpcloud pythonソースコードのダウンロード

NLP クラウド用の Python クライアント

これは、NLP Cloud API の Python クライアントです。詳細については、ドキュメントを参照してください。

NLP クラウドは、NER、感情分析、分類、要約、対話要約、言い換え、意図分類、製品説明と広告生成、チャットボット、文法とスペルの修正、キーワードとキーフレーズの抽出、テキスト生成のための高性能の事前トレーニング済みモデルまたはカスタムモデルを提供します。、画像生成、ソースコード生成、質問応答、自動音声認識、機械翻訳、言語検出、意味検索、意味類似性、トークン化、POS タグ付け、埋め込み、依存関係解析中。これは実稼働の準備ができており、REST API を通じて提供されます。

NLP Cloud の事前トレーニング済みモデルを使用することも、独自のモデルを微調整することも、独自のモデルをデプロイすることもできます。

問題に直面した場合は、ためらわずに Github の問題として報告してください。ありがとう！

インストール

pip経由でインストールします。

pip install nlpcloud

例

以下は、偽のトークンを使用して Facebook の Bart Large CNN モデルを使用してテキストを要約した完全な例です。

 import nlpcloud

client = nlpcloud . Client ( "bart-large-cnn" , "4eC39HqLyjWDarjtT1zdp7dc" )
client . summarization ( """One month after the United States began what has become a 
  troubled rollout of a national COVID vaccination campaign, the effort is finally 
  gathering real steam. Close to a million doses -- over 951,000, to be more exact -- 
  made their way into the arms of Americans in the past 24 hours, the U.S. Centers 
  for Disease Control and Prevention reported Wednesday. That s the largest number 
  of shots given in one day since the rollout began and a big jump from the 
  previous day, when just under 340,000 doses were given, CBS News reported. 
  That number is likely to jump quickly after the federal government on Tuesday 
  gave states the OK to vaccinate anyone over 65 and said it would release all 
  the doses of vaccine it has available for distribution. Meanwhile, a number 
  of states have now opened mass vaccination sites in an effort to get larger 
  numbers of people inoculated, CBS News reported.""" )

以下は、GPU 上で同じことを行う完全な例です。

 import nlpcloud

client = nlpcloud . Client ( "bart-large-cnn" , "4eC39HqLyjWDarjtT1zdp7dc" , True )
client . summarization ( """One month after the United States began what has become a 
  troubled rollout of a national COVID vaccination campaign, the effort is finally 
  gathering real steam. Close to a million doses -- over 951,000, to be more exact -- 
  made their way into the arms of Americans in the past 24 hours, the U.S. Centers 
  for Disease Control and Prevention reported Wednesday. That s the largest number 
  of shots given in one day since the rollout began and a big jump from the 
  previous day, when just under 340,000 doses were given, CBS News reported. 
  That number is likely to jump quickly after the federal government on Tuesday 
  gave states the OK to vaccinate anyone over 65 and said it would release all 
  the doses of vaccine it has available for distribution. Meanwhile, a number 
  of states have now opened mass vaccination sites in an effort to get larger 
  numbers of people inoculated, CBS News reported.""" )

以下は同じことを行う完全な例ですが、フランス語のテキストに対して行われます。

 import nlpcloud

client = nlpcloud . Client ( "bart-large-cnn" , "4eC39HqLyjWDarjtT1zdp7dc" , True , "fra_Latn" )
client . summarization ( """Sur des images aériennes, prises la veille par un vol de surveillance 
  de la Nouvelle-Zélande, la côte d’une île est bordée d’arbres passés du vert 
  au gris sous l’effet des retombées volcaniques. On y voit aussi des immeubles
  endommagés côtoyer des bâtiments intacts. « D’après le peu d’informations
  dont nous disposons, l’échelle de la dévastation pourrait être immense, 
  spécialement pour les îles les plus isolées », avait déclaré plus tôt 
  Katie Greenwood, de la Fédération internationale des sociétés de la Croix-Rouge.
  Selon l’Organisation mondiale de la santé (OMS), une centaine de maisons ont
  été endommagées, dont cinquante ont été détruites sur l’île principale de
  Tonga, Tongatapu. La police locale, citée par les autorités néo-zélandaises,
  a également fait état de deux morts, dont une Britannique âgée de 50 ans,
  Angela Glover, emportée par le tsunami après avoir essayé de sauver les chiens
  de son refuge, selon sa famille.""" )

json オブジェクトが返されます。

{
  "summary_text" : " Over 951,000 doses were given in the past 24 hours. That's the largest number of shots given in one day since the  rollout began. That number is likely to jump quickly after the federal government gave states the OK to vaccinate anyone over 65. A number of states have now opened mass vaccination sites. "
}

使用法

クライアントの初期化

初期化中に、使用するモデルと NLP クラウドトークンをクライアントに渡します。

モデルは、 en_core_web_lg 、 bart-large-mnli ... などの事前トレーニング済みモデルにすることもできますが、 custom_model/<model id> (例、 custom_model/2568 ) を使用してカスタムモデルの 1 つにすることもできます。利用可能なすべてのモデルの包括的なリストについては、ドキュメントを参照してください。

トークンは、NLP Cloud ダッシュボードから取得できます。

 import nlpcloud

client = nlpcloud . Client ( "<model>" , "<your token>" )

GPU を使用する場合は、 gpu=Trueを渡します。

 import nlpcloud

client = nlpcloud . Client ( "<model>" , "<your token>" , gpu = True )

英語以外のテキストを処理するために多言語アドオンを使用する場合は、 lang="<your language code>"を渡します。たとえば、フランス語のテキストを処理する場合は、 lang="fra_Latn"を設定する必要があります。

 import nlpcloud

client = nlpcloud . Client ( "<model>" , "<your token>" , lang = "<your language code>" )

非同期リクエストを行う場合は、 asynchronous=Trueを渡します。

 import nlpcloud

client = nlpcloud . Client ( "<model>" , "<your token>" , asynchronous = True )

非同期リクエストを行っている場合は、URL を含むクイック応答を常に受け取ります。次に、結果が利用可能かどうかを確認するために、この URL をasync_result()で定期的に (たとえば 10 秒ごとに) ポーリングする必要があります。以下に例を示します。

 client . async_result ( "https://api.nlpcloud.io/v1/get-async-result/21718218-42e8-4be9-a67f-b7e18e03b436" )

上記のコマンドは、応答の準備ができたら JSON オブジェクトを返します。それ以外の場合はNoneを返します。

自動音声認識 (Speech to Text) エンドポイント

asr()メソッドを呼び出して、次の引数を渡します。

(オプション: これまたはエンコードされたファイルを設定する必要があります) url : オーディオまたはビデオファイルがホストされている URL
(オプション: これまたは URL を設定する必要があります) encoded_file : ファイルの Base 64 でエンコードされたバージョン
(オプション) input_language : ISO コードとしてのファイルの言語

 client . asr ( "Your url" )

上記のコマンドは JSON オブジェクトを返します。

チャットボットエンドポイント

chatbot()メソッドを呼び出して入力を渡します。オプションとして、辞書のリストであるコンテキストと会話履歴を渡すこともできます。各辞書は、チャットボットからのinputとresponseで構成されます。

 client . chatbot ( "Your input" , "You context" , [{ "input" : "input 1" , "response" : "response 1" }, { "input" : "input 2" , "response" : "response 2" }, ...])

上記のコマンドは JSON オブジェクトを返します。

分類エンドポイント

classification()メソッドを呼び出して、次の引数を渡します。

分類したいテキストを文字列として
文字列のリストとしてのテキストの候補ラベル
(オプション) multi_class : 分類をマルチクラスにするかどうか (ブール値)。デフォルトは true です。

 client . classification ( "<Your block of text>" , [ "label 1" , "label 2" , "..." ])

上記のコマンドは JSON オブジェクトを返します。

コード生成エンドポイント

code_generation()メソッドを呼び出して、生成するプログラムの命令を渡します。

 client . code_generation ( "<Your instruction>" )

上記のコマンドは JSON オブジェクトを返します。

依存関係エンドポイント

dependencies()メソッドを呼び出し、品詞タグ付け (POS) + アークを実行するテキストを渡します。

 client . dependencies ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

埋め込みエンドポイント

embeddings()メソッドを呼び出し、埋め込みを抽出するテキストブロックのリストを渡します。

 client . embeddings ([ "<Text 1>" , "<Text 2>" , "<Text 3>" , ...])

上記のコマンドは JSON オブジェクトを返します。

エンティティエンドポイント

entities()メソッドを呼び出し、名前付きエンティティ認識 (NER) を実行するテキストを渡します。

 client . entities ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

生成エンドポイント

generation()メソッドを呼び出して、次の引数を渡します。

生成されたテキストを開始するテキストのブロック。 CPU 上の GPT-J の場合は最大 256 トークン、GPU 上の GPT-J および GPT-NeoX 20B の場合は最大 1024 トークン、GPU 上の Fast GPT-J および Finetuned GPT-NeoX 20B の場合は最大 2048 トークン。
(オプション) max_length : オプション。生成されたテキストに含める必要があるトークンの最大数。 CPU 上の GPT-J の場合は最大 256 トークン、GPU 上の GPT-J および GPT-NeoX 20B の場合は最大 1024 トークン、GPU 上の Fast GPT-J および Finetuned GPT-NeoX 20B の場合は最大 2048 トークン。 length_no_inputが false の場合、生成されるテキストのサイズは、 max_lengthと入力テキストの長さの差になります。 length_no_inputが true の場合、生成されるテキストのサイズは単純にmax_lengthになります。デフォルトは 50 です。
(オプション) length_no_input : min_lengthとmax_length入力テキストの長さをブール値として含めるべきかどうか。 false の場合、 min_lengthとmax_lengthには入力テキストの長さが含まれます。 true の場合、min_length とmax_lengthには入力テキストの長さは含まれません。デフォルトは false です。
(オプション) end_sequence : 生成されたシーケンスの終わりとなる特定のトークン (文字列として)。たとえば、である可能性があります.またはnまたは###または 10 文字未満のその他の文字列。
（オプション） remove_input : 結果から入力テキストを削除するかどうかをブール値として指定します。デフォルトは false です。
(オプション) num_beams : ビーム検索のビーム数。 1 はビームサーチを行わないことを意味します。これは整数です。デフォルトは 1 です。
(オプション) num_return_sequences : バッチ内の各要素に対して個別に計算されて返されたシーケンスの数 (整数)。デフォルトは 1 です。
(オプション) top_k : top-k フィルタリング用に保持する最も確率の高い語彙トークンの数 (整数)。最大 1000 トークン。デフォルトは 0 です。
(オプション) top_p : float < 1 に設定すると、合計が top_p 以上になる確率を持つ最も可能性の高いトークンのみが生成のために保持されます。これはフロートです。 0 から 1 の間である必要があります。デフォルトは 0.7 です。
(オプション) temperature : 次のトークンの確率をモジュール化するために使用される値 (float として)。 0 から 1 の間である必要があります。デフォルトは 1 です。
(オプション) repetition_penalty : 反復ペナルティのパラメータ (float として)。 1.0 はペナルティがないことを意味します。デフォルトは 1.0 です。
(オプション) bad_words : 文字列のリストとして、生成が許可されていないトークンのリスト。デフォルトは null です。
(オプション) remove_end_sequence : オプション。結果からend_sequence文字列を削除するかどうか。デフォルトは false です。

 client . generation ( "<Your input text>" )

上記のコマンドは JSON オブジェクトを返します。

文法とスペル修正のエンドポイント

gs_correction()メソッドを呼び出して、修正したいテキストを渡します。

 client . gs_correction ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

画像生成エンドポイント

image_generation()メソッドを呼び出して、生成する新しいイメージのテキスト命令を渡します。

 client . image_generation ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

意図分類エンドポイント

intent_classification()メソッドを呼び出して、インテントを抽出するテキストを渡します。

 client . intent_classification ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

キーワードとキーフレーズの抽出エンドポイント

kw_kp_extraction()メソッドを呼び出して、キーワードとキーフレーズを抽出するテキストを渡します。

 client . kw_kp_extraction ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

言語検出エンドポイント

langdetection()メソッドを呼び出し、言語を検出するために分析するテキストを渡します。

 client . langdetection ( "<The text you want to analyze>" )

上記のコマンドは JSON オブジェクトを返します。

エンドポイントの言い換え

paraphrasing()メソッドを呼び出し、言い換えるテキストを渡します。

 client . paraphrasing ( "<Your text to paraphrase>" )

上記のコマンドは JSON オブジェクトを返します。

質問応答エンドポイント

question()メソッドを呼び出して、以下を渡します。

あなたの質問
(オプション) モデルが質問に答えるために使用するコンテキスト

 client . question ( "<Your question>" , "<Your context>" )

上記のコマンドは JSON オブジェクトを返します。

セマンティック検索エンドポイント

semantic_search()メソッドを呼び出して、検索クエリを渡します。

 client . semantic_search ( "Your search query" )

上記のコマンドは JSON オブジェクトを返します。

意味的類似性エンドポイント

semantic_similarity()メソッドを呼び出し、比較する 2 つのテキストブロックで構成されるリストを渡します。

 client . semantic_similarity ([ "<Block of text 1>" , "<Block of text 2>" ])

上記のコマンドは JSON オブジェクトを返します。

文の依存関係エンドポイント

sentence_dependencies()メソッドを呼び出して、POS + アークを実行する複数の文で構成されるテキストのブロックを渡します。

 client . sentence_dependencies ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

感情分析エンドポイント

sentiment()メソッドを呼び出して、以下を渡します。

分析して感情を把握したいテキスト
(オプション) センチメントを適用するターゲット要素

 client . sentiment ( "<Your block of text>" , "<Your target element>" )

上記のコマンドは JSON オブジェクトを返します。

音声合成エンドポイント

speech_synthesis()メソッドを呼び出して、オーディオに変換するテキストを渡します。

 client . speech_synthesis ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

要約エンドポイント

summarization()メソッドを呼び出して、要約するテキストを渡します。

 client . summarization ( "<Your text to summarize>" )

上記のコマンドは JSON オブジェクトを返します。

トークン化エンドポイント

tokens()メソッドを呼び出して、トークン化するテキストを渡します。

 client . tokens ( "<Your block of text>" )

上記のコマンドは JSON オブジェクトを返します。

翻訳エンドポイント

translation()メソッドを呼び出して、翻訳するテキストを渡します。

 client . translation ( "<Your text to translate>" )

上記のコマンドは JSON オブジェクトを返します。

拡大する