2026-03-02 22:32:27 +08:00

ACEBench Example

This is an example of agent-oriented evaluation in AgentScope.

We use ACEBench as the example benchmark and run a ReAct agent with a Ray-based evaluator, which supports distributed and parallel evaluation.
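To make the idea of parallel evaluation concrete, here is a minimal sketch of the pattern: independent benchmark tasks are handed to a pool of workers, and the per-task results are aggregated afterwards. It is not the AgentScope or Ray API. It swaps in the standard library's `ThreadPoolExecutor` for Ray, and the task format, `solve`, `evaluate`, and `run_parallel` are all hypothetical names invented for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy tasks; real ACEBench tasks look different.
# The last task carries a deliberately wrong expected answer.
TASKS = [
    {"id": "task-1", "question": "2 + 3", "expected": "5"},
    {"id": "task-2", "question": "10 + 7", "expected": "17"},
    {"id": "task-3", "question": "1 + 1", "expected": "3"},
]


def solve(task):
    """Stand-in for an agent run: adds the two integers in the question."""
    left, right = task["question"].split("+")
    return str(int(left) + int(right))


def evaluate(task):
    """Run the stand-in solver on one task and score its answer."""
    answer = solve(task)
    return {"id": task["id"], "correct": answer == task["expected"]}


def run_parallel(tasks, n_workers=2):
    """Evaluate tasks concurrently; pool.map keeps the input order."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        return list(pool.map(evaluate, tasks))


results = run_parallel(TASKS)
accuracy = sum(r["correct"] for r in results) / len(results)
print(f"{len(results)} tasks evaluated, accuracy = {accuracy:.2f}")
```

Because each task is scored independently, the same structure carries over to a distributed backend: only the worker pool changes, not the per-task logic.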

To run the example, first install AgentScope, then start the evaluation with the following command, replacing {data_dir} and {result_dir} with your own paths:

python main.py --data_dir {data_dir} --result_dir {result_dir}
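The two flags in the command above could be parsed along the following lines. This is only a sketch of a typical `argparse` setup, not the actual contents of `main.py`: the flag names come from the command, while `parse_args`, the `Path` types, the help texts, and marking both flags as required are assumptions.

```python
import argparse
from pathlib import Path


def parse_args(argv=None):
    """Parse the two flags from the command above (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        description="Run the ACEBench evaluation example.",
    )
    # Assumption: directory where the benchmark data is stored.
    parser.add_argument("--data_dir", type=Path, required=True,
                        help="Directory holding the benchmark data.")
    # Assumption: directory where evaluation results are written.
    parser.add_argument("--result_dir", type=Path, required=True,
                        help="Directory for the evaluation results.")
    return parser.parse_args(argv)


# Passing an explicit argument list mirrors the command line shown above.
args = parse_args(["--data_dir", "data", "--result_dir", "results"])
print(f"data: {args.data_dir}, results: {args.result_dir}")
```

Accepting an optional `argv` list keeps the parser easy to exercise from tests or a notebook without touching `sys.argv`.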

Further Reading