APIEval-20 evaluates API testing agents by giving each agent only a JSON schema and one sample payload, then running the test suite it generates against live reference APIs that contain planted bugs. Scoring covers three things: bug detection accuracy, coverage, and efficiency. It is aimed at developers and QA teams who want to test and improve their API testing strategies rigorously.
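To make the workflow concrete, here is a minimal sketch of the loop described above: tests are derived from a schema and one sample payload, then run against an API with a planted bug. Everything in it is an assumption made for illustration and none of it is taken from APIEval-20 itself: the schema, the sample payload, the function names (`generate_tests`, `buggy_api`, `run_suite`), and the in-process stand-in that replaces a live reference API.

```python
from copy import deepcopy

# Hypothetical schema and sample payload (illustrative, not from APIEval-20).
SCHEMA = {
    "type": "object",
    "required": ["name", "quantity"],
    "properties": {
        "name": {"type": "string"},
        "quantity": {"type": "integer", "minimum": 1},
    },
}
SAMPLE = {"name": "widget", "quantity": 2}


def generate_tests(schema, sample):
    """Derive (payload, expected_status) pairs from the schema and one sample."""
    tests = [(deepcopy(sample), 200)]  # the sample itself should be accepted
    for field in schema.get("required", []):
        payload = deepcopy(sample)
        del payload[field]
        tests.append((payload, 400))  # missing required field
    for field, spec in schema["properties"].items():
        if "minimum" in spec:
            payload = deepcopy(sample)
            payload[field] = spec["minimum"] - 1
            tests.append((payload, 400))  # value below the declared minimum
    return tests


def buggy_api(payload):
    """Stand-in for a live reference API, returning an HTTP-like status code.

    Planted bug (assumed for this sketch): the minimum on `quantity`
    is never enforced.
    """
    if not isinstance(payload.get("name"), str):
        return 400
    if not isinstance(payload.get("quantity"), int):
        return 400
    return 200  # bug: quantity < 1 should have been rejected


def run_suite(tests, api):
    """Return (payload, expected, observed) for every mismatching test."""
    failures = []
    for payload, expected in tests:
        observed = api(payload)
        if observed != expected:
            failures.append((payload, expected, observed))
    return failures


tests = generate_tests(SCHEMA, SAMPLE)
failures = run_suite(tests, buggy_api)
print(f"{len(failures)} of {len(tests)} tests exposed a mismatch")
for payload, expected, observed in failures:
    print(payload, "expected", expected, "got", observed)
```

In this toy setup, a mismatch between the expected and observed status is what counts as detecting the planted bug, and the number of tests needed to find it gives a rough sense of efficiency. How APIEval-20 actually computes its scores is not specified here, so the sketch stops at reporting the mismatches.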