ProImage-Bench: Rubric-Based Evaluation
for Professional Image Generation

Minheng Ni*, Zhengyuan Yang*, Yaowen Zhang*, Linjie Li, Chung-Ching Lin, Kevin Lin, Zhendong Wang, Xiaofei Wang, Shujie Liu, Lei Zhang, Wangmeng Zuo, Lijuan Wang
ProImage-Bench Overview
We propose ProImage-Bench, a rubric-based evaluation benchmark for professional image generation. For 654 figures collected from real textbooks and technical reports, we construct detailed image instructions and a hierarchy of rubrics that decompose correctness into 6,076 criteria and 44,131 binary checks.

Dataset Domains

The biology domain contains illustrations sourced from biology textbooks, covering a wide range of scientific content, including cellular and organelle structures, physiological and metabolic processes, and ecological or evolutionary diagrams.
* Click on any image to view details.

Case Studies

Examples of model outputs on the benchmark

Case 1 of 30
Detailed Description

Case Studies of Evaluation Results

Select Model Output:
Model Output
GPT-4o
Detailed Description
Evaluation Rubric

Overall Results on ProImage-Bench

Model Biology Engineering General Overall
Acc Score Acc Score Acc Score Acc Score
Nano Banana Pro 0.8490.625 0.7080.434 0.8160.601 0.7910.553
Wan2.5 0.7140.433 0.6060.309 0.7550.519 0.6920.420
GPT-4o 0.7040.425 0.5560.258 0.7180.463 0.6600.382
Nano Banana 0.6970.400 0.5790.276 0.7160.468 0.6640.381
Seedream 0.6800.393 0.5600.260 0.6880.442 0.6420.365
Imagen-3 0.6000.288 0.4920.195 0.6380.377 0.5770.287
FLUX-dev 0.5920.286 0.4440.167 0.6160.359 0.5510.270
Comparison of different models on ProImage-Bench across three domains.

Citation

If you find ProImage-Bench useful in your research, please consider citing our paper:

BibTeX
@article{ni2025proimage,
  title={ProImage-Bench: Rubric-Based Evaluation for Professional Image Generation},
  author={Ni, Minheng and Yang, Zhengyuan and Zhang, Yaowen and Li, Linjie and Lin, Chung-Ching and Lin, Kevin and Wang, Zhendong and Wang, Xiaofei and Liu, Shujie and Zhang, Lei and others},
  journal={arXiv preprint arXiv:2512.12220},
  year={2025}
}