Evaluation, benchmark, and scorecard tool targeting performance (throughput and latency), accuracy on popular evaluation harnesses, safety, and hallucination
GenAIEval is a comprehensive evaluation, benchmark, and scorecard tool designed to assess the performance, accuracy, safety, and hallucination tendencies of AI models, particularly large language models (LLMs). It supports popular evaluation harnesses such as lm-evaluation-harness and bigcode-evaluation-harness, making it well suited for developers and researchers working on AI model optimization and benchmarking.
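To make the scorecard idea concrete, here is a purely illustrative sketch of how the dimensions named above (throughput, latency, accuracy, safety, hallucination) might be collected into a single report. The `Scorecard` class and its fields are hypothetical and do not reflect GenAIEval's actual API:

```python
from dataclasses import dataclass, asdict

# Hypothetical illustration only: GenAIEval's real data structures may differ.
@dataclass
class Scorecard:
    throughput_tok_s: float    # performance: tokens generated per second
    latency_ms: float          # performance: time to first token, in milliseconds
    accuracy: float            # harness score in [0, 1], e.g. from lm-evaluation-harness
    safety: float              # fraction of prompts answered safely, in [0, 1]
    hallucination_rate: float  # fraction of unsupported answers, in [0, 1]

    def summary(self) -> dict:
        """Collect all evaluation dimensions into one report dict."""
        return asdict(self)

card = Scorecard(throughput_tok_s=850.0, latency_ms=42.0,
                 accuracy=0.78, safety=0.97, hallucination_rate=0.05)
print(card.summary()["accuracy"])  # → 0.78
```

In practice the accuracy field would be filled from a harness run, while throughput and latency would come from a serving benchmark.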