structured generation benchmark Download - structured generation benchmark Source code download

English

中文(简体) 中文(繁体) 한국어 日本語 English Português Español Русский العربية Indonesia Deutsch Français ภาษาไทย

Home>Programming related>Other source code

structured generation benchmark

Other source code

1.0.0

Download

structured-generation-benchmark

To use Large Language Models (LLMs) effectively and reliably, it's essential to include structured generation techniques. Being able to get outputs like regular expressions, JSON, or a Pydantic data model is key for making useful software.

But what's the real effect of using libraries like Outlines or Instructor to achieve that goal?

This repository has put together evaluations to answer this question.

Function Calling

The ability of the LLM to call functions.

Datasets

Berkeley Function Calling Leaderboard [April 16, 2024 update]

Evaluation

We deployed a modal function to run open-source models using Transformers + Outlines.
We created different model handlers to run the Gorilla BFCL scripts [April 6, 2024 version] for the AST simple evaluation category.
We evaluated and reported the results comparing them with the Leaderboard Website [April 26, 2024 version].

Reports

Outlines Function Calling Evaluation
Instructor Function Calling Evaluation

Synthetic Data Generation

Using an LLM to create artificial data.

Reports

Outlines Synthetic Data Generation

Expand

Additional Information

Version 1.0.0
Type Other source code
Update Time 2024-12-01
size 12.85MB
From Github

Related Applications

Parameter Efficient Transfer Learning Benchmark

2024-11-06
GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
Generation Zero Challenges CODEX

2022-11-02
Generation Zero – Alpine Unrest

2022-08-20

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
waymo open dataset

Other source code

December 2023 Update
SmartTube

Other source code

24.71 Stable
Sunamu

Other source code

Release 2.2.0
waymo open dataset

Other source code

December 2023 Update
wp functions

Other categories

1.0.0
termwind

Other categories

v2.3.0

Related Information All