Microsoft launches PromptBench, a library for evaluating large language models
Microsoft recently released PromptBench, a library for evaluating large language models. It supports a variety of models and tasks, provides standard, dynamic, and semantic evaluation methods, and integrates multiple prompt engin
2025-01-11