opea-project/genaieval

Evaluation, benchmarking, and scorecard tooling that targets performance (throughput and latency), accuracy on popular evaluation harnesses, safety, and hallucination.

Jupyter Notebook
40
58
Apache License 2.0

GenAIEval is an evaluation, benchmarking, and scorecard tool for assessing the performance, accuracy, safety, and hallucination behavior of AI models, particularly large language models. It supports popular evaluation harnesses such as lm-evaluation-harness and bigcode-evaluation-harness, which makes it useful for developers and researchers working on AI model optimization and benchmarking.

Top contributors

lkk12014402: 22 contributions
chensuyue: 19 contributions
changwangss: 16 contributions
lvliang-intel: 15 contributions
joshuayao: 11 contributions
VincyZhang: 10 contributions
gavinlichn: 9 contributions
ZePan110: 9 contributions
adkakne: 9 contributions
NeoZhangJianyu: 8 contributions
